[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Bug#311795: ITP: rast -- A full text search system



On Sun, 2005-06-05 at 18:21 +0900, Junichi Uekawa wrote:
> Hi,
> 
> 
> > >   * N-gram based indexing; No dictionaries are needed
> > >   * Support many types of documents; e.g. HTML, MS Word
> > >   * Includes library for some programming languages
> > >   * Add text incrementally
> > 
> > Could you please explain what is N-gram and what is this package useful
> > for?
> 
> N-gram is when you use n-characters as a 'word'.
> 
> In some languages including Japanese, it is impossible to 
> determine a 'word', and N-gram is a method that defines a 
> 'word' as n-characters.

Do you mean "N number of characters", or is there some special
meaning to an n-character?

-- 
-----------------------------------------------------------------
Ron Johnson, Jr.
Jefferson, LA USA
PGP Key ID 8834C06B I prefer encrypted mail.

"In order to become the master, the politician poses as the
servant."
Charles de Gaulle

Attachment: signature.asc
Description: This is a digitally signed message part


Reply to: