
Re: enable searching East Asian words at search.debian.org



On Tue, May 13, 2003 at 10:02:15AM +0200, Peter Karlsson wrote:
> Denis Barbier:
> 
> > which means that e-acute has been converted twice, and no pages are
> > found.  Am I doing something wrong?
> 
> Your browser is most probably buggy, try upgrading it to the latest
> version and try again.
> 
> A form on a page which does not declare an accept-charset attribute on
> the form tag is supposed to use the encoding from the page the form was
> on for submitting it.

I am on sid and first tested with Lynx.  I have now run further tests;
the table below shows how e-acute (0xe9 in latin1 encoding) is escaped
in the q= part of the results URL in each browser and environment.

 -------------------------------------------
 browser\ env.| iso-8859-15 | utf-8        |
 -------------------------------------------
  lynx        | %C3%A9      | %C3%83%C2%A9 |
  w3m-en      | %E9         | %C3%A9       |
  mozilla     | %C3%A9      | %C3%A9       |
  konqueror   | %C3%A9      | %C3%A9       |
 -------------------------------------------

So my problem seems related to text browsers.  In a UTF-8 environment,
the xterm window displays UTF-8 encoded files correctly; I cut and paste
the French word, and it appears fine in the browser.
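
The three distinct escapes in the table can be reproduced with a short
Python sketch.  It assumes the double conversion happens by reading the
UTF-8 bytes of e-acute as latin1 and encoding them to UTF-8 a second
time; that is one plausible explanation for the Lynx result, not a
confirmed diagnosis.  The names E_ACUTE and escape are illustrative only.

```python
from urllib.parse import quote

E_ACUTE = "\u00e9"  # e-acute, 0xe9 in latin1


def escape(text, encoding):
    """Percent-escape text after encoding it with the given charset."""
    return quote(text.encode(encoding), safe="")


# Single conversion: what a browser should send for each page encoding.
latin_form = escape(E_ACUTE, "iso-8859-15")
utf8_form = escape(E_ACUTE, "utf-8")

# Assumed double conversion: the UTF-8 bytes C3 A9 are misread as two
# latin1 characters, which are then encoded to UTF-8 once more.
misread = E_ACUTE.encode("utf-8").decode("latin-1")
double_form = escape(misread, "utf-8")

print(latin_form)   # %E9
print(utf8_form)    # %C3%A9
print(double_form)  # %C3%83%C2%A9
```

Under this assumption, %C3%83%C2%A9 is what a search engine expecting
UTF-8 would decode as two characters rather than e-acute, which would
explain why no pages are found.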

> Or did you send it from the command line with unencoded non-ASCII
> characters?

No.

Denis


