Bug#452422: RFP: yacy -- distributed web crawler and search engine
* Package name : yacy
Version : 0.55
Upstream Author : Michael Christen <email@example.com>
* URL : http://yacy.net
* License : GPL
Programming Lang: Java
Description : distributed web crawler and search engine
YaCy is a scalable personal web crawler and web search engine. One YaCy
installation can organize more than 10 million documents, but YaCy can
operate search clusters of unlimited size.
YaCy has a peer-to-peer web index exchange interface and it does not need a
central server. Web crawls can be done collaborative with all other YaCy
peers. Resulting indexes are organized in a distributed hash table, and
search requests are pointed efficiently to specific, index-hosting peers.
YaCy can not only index texts from various file formats but also from
different media contents. A search result shows interesting text, image,
audio and video content with direct links to OGG, MP3, and video files.
Because YaCy is fully distributed, search results cannot be completely
censored, only filtered by single peer owners. However, in a privatly
operated search network the software provides a strong functionality to
control the content of the search cluster. In a public search network, a user
is anonymous because there is no central point where all search requests can
YaCy has a large number of users running their own peer to create a
independent and open search engine. The standard YaCy release is configured
in such a way that the software joins this public network. The software has a
number of community function like a co-operative bookmark system, a news,
blog and built-in wiki system.