Re: Search Engine
On Thu, Sep 23, 1999 at 01:51:50PM -0400, James A. Treacy wrote:
> I've added debian-www to the reply since this is where most
> discussions of this nature take place.
>
> On Thu, Sep 23, 1999 at 09:56:34AM -0700, Jake Sheridan wrote:
> > Hi guys,
> >
> > I've been waiting to use the search engine at debian.org for a long
> > long time.....is it possible to speed up the process somehow.....thanx
> >
> > Jake
> >
> > P.S. I think you have a great site though....just missing the crucial
> > element of a search engine....!!!
> >
> The problem is that none of the search programs we've checked to date have
> all the features we are looking for:
>
> - Free (as in DFSG)
> - Able to handle large data sets (> 1 GB)
> - Able to keep separate indexes and merge them (so we don't have to reindex
> previous months mail archives, but can simply merge those with the current
> month). Merging indexes should be fairly efficient (in most cases where
> merging is implemented, it is not).
> - Able to search on specific parts of the data. For example, searching on
> subject or sender in mail. I don't care how this is implemented (through
> separate indices or through use of regex, e.g. /^Subject: .*How to get rich/)
> as long as it is possible.
> - Able to index files locally, i.e. without going through a web server.
> - Able to search using regex (optional). Next down would be searching for
> simple phrases. At a minimum, the ability to match arbitrary word endings
> (the equivalent of /^keyword\w*/).
>
> Here is what I've looked at so far:
> htdig - can't index locally, too slow
Hello!
I might be wrong, but I had used htdig a few months ago and i can remember it
_had_ some options for digging locally (i.e. no web server).
And as for speed, it is not on your list :->
Marcin
--
--------------------------------
Marcin Owsiany
porridge@pandora.info.bielsko.pl
--------------------------------
Reply to: