
Bug#452422: RFP: yacy -- distributed web crawler and search engine

Package: wnpp
Severity: wishlist
X-Debbugs-CC: debian-devel@lists.debian.org

* Package name    : yacy
  Version         : 0.55
  Upstream Author : Michael Christen <mc@example.com>
* URL             : http://yacy.net
* License         : GPL
  Programming Lang: Java
  Description     : distributed web crawler and search engine

YaCy is a scalable personal web crawler and web search engine. One YaCy 
installation can organize more than 10 million documents, but YaCy can 
operate search clusters of unlimited size.

YaCy has a peer-to-peer web index exchange interface and it does not need a 
central server. Web crawls can be done collaboratively with all other YaCy 
peers. Resulting indexes are organized in a distributed hash table, and 
search requests are pointed efficiently to specific, index-hosting peers.
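
As a rough illustration of how a distributed hash table can route a search 
term to the peer that hosts its index entry, here is a minimal sketch. It 
uses plain consistent hashing with SHA-1 on a 2**32 ring; this is an 
assumption for illustration and not YaCy's actual partitioning scheme. The 
class name ToyIndexRing, the function ring_position and the peer names are 
all hypothetical.

```python
import hashlib
from bisect import bisect_left


def ring_position(key: str) -> int:
    """Map a string to a position on a 2**32 hash ring (illustrative choice)."""
    digest = hashlib.sha1(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


class ToyIndexRing:
    """Toy hash ring: each word is owned by the next peer clockwise on the ring."""

    def __init__(self, peer_names):
        # Place every peer on the ring, sorted by its hashed position.
        self._ring = sorted((ring_position(name), name) for name in peer_names)
        self._positions = [pos for pos, _ in self._ring]

    def peer_for(self, word: str) -> str:
        # Find the first peer at or after the word's position; wrap around
        # to the first peer when the word hashes past the last one.
        i = bisect_left(self._positions, ring_position(word.lower()))
        return self._ring[i % len(self._ring)][1]


# Hypothetical peer names, for demonstration only.
ring = ToyIndexRing(["peer-a", "peer-b", "peer-c"])
for word in ["debian", "search", "crawler"]:
    print(word, "->", ring.peer_for(word))
```

In this sketch a query for a word only needs to contact the single peer 
returned by peer_for, which is the sense in which requests go to specific 
index-hosting peers instead of to a central server.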

YaCy can index text not only from various file formats but also from 
different kinds of media content. A search result shows interesting text, 
image, audio and video content with direct links to OGG, MP3, and video files.

Because YaCy is fully distributed, search results cannot be completely 
censored, only filtered by single peer owners. However, in a privately 
operated search network the software provides strong functionality for 
controlling the content of the search cluster. In a public search network, a user 
is anonymous because there is no central point where all search requests can 
be stored.

YaCy has a large number of users running their own peers to create an 
independent and open search engine. The standard YaCy release is configured 
in such a way that the software joins this public network. The software has a 
number of community functions, such as a co-operative bookmark system, news, 
a blog and a built-in wiki system.

Luca Brivio

