
Bug#92959: RFP: Larbin -- Larbin is a web crawler, NOT an indexer. GPLed



Package: wnpp
Version: N/A; reported 2001-04-05
Severity: wishlist



-- System Information
Debian Release: testing/unstable
Architecture: i386
Kernel: Linux king 2.2.17 #2 SMP Sun Nov 26 08:43:47 HKT 2000 i686

From its web site:
   Larbin is a web crawler (also called (web) robot, spider, scooter...). It
   is intended to fetch a large number of web pages to fill the database of a
   search engine. With a fast enough network, Larbin should be able to fetch
   more than 100 million pages on a standard PC.
     
   Larbin is (just) a web crawler, NOT an indexer.
     
   Larbin was initially developed for the XYLEME project in the VERSO team
   at INRIA. The goal of Larbin was to fetch XML pages on the web to fill
   the database of an XML-oriented search engine. Thanks to its origins,
   Larbin is very general-purpose (and easy to customize).

Web: http://pauillac.inria.fr/~ailleret/prog/larbin/index.html
version: V1.2.2 (2001-04-04)




