[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: enable searching East Asian words at search.debian.org



On Mon, May 12, 2003 at 06:44:20PM +0900, Tomohiro KUBOTA wrote:
> Hi,
> 
> From: barbier@linuxfr.org (Denis Barbier)
> Subject: Re: enable searching East Asian words at search.debian.org
> Date: Mon, 12 May 2003 09:54:58 +0200
> 
> > My understanding of Josip mail is that when investigating your
> > instructions about mnogosearch, he wondered how input text has
> > to be encoded when filling search form.  This is a good question,
> > search page should tell which encoding to use when searching for
> > non-English words.
> 
> Yes, I know.  The solution is to write the search page in UTF-8,
> which has been available since last December when Craig and I
> discussed about this problem.
>
> For example, I can search an Russian word "Novosti" (of course in
> Cyrillic)

The point is: how are Cyrillic words passed by the web browser to the
search engine?
Are they encoded in ISO-8859-5, KOI8-R or UTF-8 charsets?

[...]
> Also, I can input Japanese words.  However, there will be no results
> for Japanese words because of problems I wrote.

Yes, I am pretty sure that Josip was investigating these problems when
he sent his mail and will implement your solution.

Denis



Reply to: