Re: enable searching East Asian words at search.debian.org
On Tue, May 13, 2003 at 10:02:15AM +0200, Peter Karlsson wrote:
> Denis Barbier:
>
> > which means that e-acute has been converted twice, and no pages are
> > found. Am I doing something wrong?
>
> Your browser is most probably buggy, try upgrading it to the latest
> version and try again.
>
> A form on a page which does not declare an accept-charset attribute on
> the form tag is supposed to use the encoding from the page the form was
> on for submitting it.
I am on sid and first tested with Lynx. Now I performed other tests,
here are my results: in this table I represent how e-acute (0xe9 in
latin1 encoding) is escaped in the q= part of the results URL.
-------------------------------------------
browser\ env.| iso-8859-15 | utf-8 |
-------------------------------------------
lynx | %C3%A9 | %C3%83%C2%A9 |
w3m-en | %E9 | %C3%A9 |
mozilla | %C3%A9 | %C3%A9 |
konqueror | %C3%A9 | %C3%A9 |
-------------------------------------------
So my problems seems related to text browsers. In UTF-8 environment,
the xterm window nicely displays UTF-8 encoded files, I cut'n'paste
the French word, and it appears fine in the browser.
> Or did you send it from the command line with unencoded non-ASCII
> characters?
No.
Denis
Reply to: