Friday, November 25, 2005

P2P-based web search engine

Web search engines are currently the domain of large corporations like Yahoo, Microsoft, and Google. However, one can imagine an open-source, distributed, P2P based search engine. Traditional P2P systems like Gnutella use distributed searching to locate files, and the web is nothing more than a collection of HTML files. The P2P-based web search engine envisioned here would take the functions that would typically be running on Yahoo, Google, etc. servers and distribute them to clients running on end-user systems. You might think of this as Gnutella for web content.

An ideal client for a distributed web search engine might be one of the existing browsers like Firefox. In that case, a new ranking system could be used based on the number of times a specific page is actually visited (e.g. as opposed to the number of times a given paged is linked to, as used by Google). This information would, of course, need to be anonymized.

No comments: