[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Search Engine



On Thu, Sep 23, 1999 at 01:51:50PM -0400, James A. Treacy wrote:
> I've added debian-www to the reply since this is where most
> discussions of this nature take place.
> 
> On Thu, Sep 23, 1999 at 09:56:34AM -0700, Jake Sheridan wrote:
> > Hi guys,
> > 
> > I've been waiting to use the search engine at debian.org for a long
> > long time.....is it possible to speed up the process somehow.....thanx
> > 
> > Jake
> > 
> > P.S. I think you have a great site though....just missing the crucial
> > element of a search engine....!!!
> > 
> The problem is that none of the search programs we've checked to date have
> all the features we are looking for:
> 
> - Free (as in DFSG)
> - Able to handle large data sets (> 1 GB)
> - Able to keep separate indexes and merge them (so we don't have to reindex
>   previous months mail archives, but can simply merge those with the current
>   month). Merging indexes should be fairly efficient (in most cases where
>   merging is implemented, it is not).
> - Able to search on specific parts of the data. For example, searching on
>   subject or sender in mail. I don't care how this is implemented (through
>   separate indices or through use of regex, e.g. /^Subject: .*How to get rich/)
>   as long as it is possible.
> - Able to index files locally, i.e.  without going through a web server.
> - Able to search using regex (optional). Next down would be searching for
>   simple phrases. At a minimum, the ability to match arbitrary word endings
>   (the equivalent of /^keyword\w*/).
> 
> Here is what I've looked at so far:
> htdig - can't index locally, too slow

Hello!

I might be wrong, but I had used htdig a few months ago and i can remember it
_had_ some options for digging locally (i.e. no web server).
And as for speed, it is not on your list :->

Marcin

-- 

--------------------------------
Marcin Owsiany
porridge@pandora.info.bielsko.pl
--------------------------------


Reply to: