2010年12月6日月曜日

What I'm thinking of... December 2010

Hi, thanks for reading Search Engine Trends. The weather outside is getting chilly, but my heart ain't! lol.

But seriously, there are lots of plans I am thinking of doing in 2011.

The search engine I'm making is almost finished, just need the most important part: ranking.

I thought of several ranking algorithms, but won't share it with you in detail here.

I am thinking of making a TF-IDF with semantic analysis.

Keep checking back here for latest news on Mohawk.

Also, I am thinking of renaming my search engine to something more POP. Any suggestions will be welcome, just drop me a line at: tsubasa_kato@hotmail.com or comment here.

If you think of some good name, I'll think about using it, and give the naming credit to you in a dedicated page.

2010年11月12日金曜日

Yahoo + Google = ? 動画。

Yahoo + Google = ? Yahooジャパンとグーグルが提携しますが、何が変わるかの動画を見つけました。。

Technologies to be used in next generation search engine.

First of all, I would like to introduce several technologies that are to be implemented in my next generation search engine.

I am going to use :

1. Most importantly, I will be using Hadoop to analyze logs and rank web results. I will be making a small cluster of computers, and use virtualization technology.

2. Semantic Web search. Metadata and ontology is an important part of semantic web search. I will be focusing on this from this year and start implementing by next year.

3. mod_pagespeed : a new mod by Google for using in Apache web server, which will at maximum make web page load time 50% shorter. The technology is new, and I will have to try it on my server to see if it will really make some change. I am excited for this.

2010年11月6日土曜日

Looking for partners in improving Mohawk.

I am currently looking for partners in improving Mohawk Search Engine. If you are interested, just DM me at @stingraze on twitter.

Looking for skills in: C, C++, Java, Cent OS, Perl, (preferably expert in all these) and skills in MySQL tuning.

Japanese skill will be helpful.

I will be looking for partners globally.

Tsubasa Kato

2010年11月4日木曜日

今日の気になったキーワード

今日の気になったキーワードを紹介します。

-Google クラウドの核心から-

「サーバーラック1つでも、すぐに現在のデータセンター並のハードウェアスレッドを持てるようになるだろう」

NAS装置

オーバーサブスクリプション

共有メモリシステム


「どこまでローエンドにできるか」- まさにこれですね!

メモリパリティ検知

誤り訂正符号付きDRAM

上記のキーワードを使って、僕のMohawkを改良、公開したいと思っています。

2010年10月25日月曜日

I tried using Polaris



I tried using Polaris to datamine, and used my resume to see how it works. It's quite interesting, but I have to install Chasen to use the Japanese Word Splitting.