Re: search.debian.org is online
Hi,
From: Tomohiro KUBOTA <debian@tmail.plala.or.jp>
Subject: Re: search.debian.org is online
Date: Mon, 30 Dec 2002 19:53:31 +0900 (JST)
> 1. handling of two-byte characters
> 2. extraction of words from sentences without whitespaces
I think I found the cause of problem 1. Although mnogosearch
supports multibyte languages, it doesn't support them by default.
To enable them, mnogosearch has to be recompiled.
mnogosearch-3.2.7$ ./configure --help
.....
--with-extra-charsets=CHARSET[,CHARSET,...]
Use additional non-default charsets:
none, all or a list from this set:
big5 gb2312 gbk japanese euc-kr gujarati tscii
.....
Note that "japanese" means Shift_JIS, which is not the encoding used
for Debian Japanese web pages. Debian Japanese web pages are written
in ISO-2022-JP, which does not seem to be supported by mnogosearch.
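
To make the difference between the two encodings concrete, here is a
small Python sketch. It is only an illustration: the sample string and
the helper name encode_both are my own assumptions, and it uses Python's
standard codecs, not anything from mnogosearch. It shows that the same
text gives different bytes in ISO-2022-JP and in Shift_JIS, so an
indexer that only knows Shift_JIS cannot read ISO-2022-JP pages as they
are.

```python
def encode_both(text: str) -> tuple[bytes, bytes]:
    """Return (ISO-2022-JP bytes, Shift_JIS bytes) for the same text.

    Hypothetical helper for illustration only.
    """
    return text.encode("iso2022_jp"), text.encode("shift_jis")


if __name__ == "__main__":
    sample = "日本語"  # assumed sample text, not taken from the Debian pages
    iso2022jp, shift_jis = encode_both(sample)

    # ISO-2022-JP is a 7-bit encoding: it switches to JIS X 0208 with the
    # escape sequence ESC $ B and never sets the high bit.
    print("ISO-2022-JP:", iso2022jp)
    print("7-bit only: ", all(b < 0x80 for b in iso2022jp))

    # Shift_JIS has no escape sequences and uses bytes with the high bit
    # set for the same characters.
    print("Shift_JIS:  ", shift_jis)
    print("has 8-bit:  ", any(b >= 0x80 for b in shift_jis))
```

Both byte strings decode back to the same text when the right codec is
used, so the problem is one of labelling and support, not of content.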
---
Tomohiro KUBOTA <kubota@debian.org>
http://www.debian.or.jp/~kubota/