
Searching and non-latin1 charsets



G'day,
  This is a message for all translators whose language doesn't use the
latin1 charset.

Could you guys at the very least create the webwml/<lang>/searchtmpl 
directory and copy search.wml and Makefile from the 
webwml/english/searchtmpl directory over?
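
For anyone who would rather script the copy, here is a minimal Python
sketch of that step. The helper name copy_search_template and the idea
of running it from a checkout of the webwml tree are my own assumptions;
only the directory and file names come from the request above.

```python
import shutil
from pathlib import Path


def copy_search_template(webwml_root, lang):
    """Copy search.wml and Makefile from english/searchtmpl to <lang>/searchtmpl.

    Hypothetical helper: webwml_root is the path to the webwml tree and
    lang is the name of the language directory inside it.
    """
    root = Path(webwml_root)
    src = root / "english" / "searchtmpl"
    dst = root / lang / "searchtmpl"

    # Create webwml/<lang>/searchtmpl if it does not exist yet.
    dst.mkdir(parents=True, exist_ok=True)

    # Copy the two files named in the request, unchanged.
    for name in ("search.wml", "Makefile"):
        shutil.copy2(src / name, dst / name)
    return dst
```

For example, copy_search_template("webwml", "russian") would fill in
webwml/russian/searchtmpl, assuming "russian" is the name of your
language directory.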

There is a slight quirk with searching that was only pointed out to me
by the Russian-speaking udmsearch developers.  It's not the search
engine's fault, rather a side-effect of charsets and web browsers.

OK, the Russian guy comes up, types a Russian word in and gets his
results, but all his high-bit characters are garbled! The reason is
that the search engine is returning the correct byte sequence, but the
browser renders it with the wrong charset.

For example, the search engine spits out byte 213, which in koi8-r is
the Cyrillic small letter "u" but in latin1 is an "O with squiggle"
(O with tilde).  Even though the search engine spits out the right
byte, if the Russian template is not there, the page defaults to the
English (latin1) template and the browser "incorrectly" renders the
character.
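
To see the effect concretely, here is a small Python sketch (an
illustration only, not part of the search setup) that decodes the same
byte under both charsets. It assumes nothing beyond Python's standard
koi8_r and latin_1 codecs.

```python
# One byte, exactly as the search engine would send it.
raw = bytes([213])

# The same byte interpreted under each charset.
as_koi8r = raw.decode("koi8_r")    # what the Russian reader expects to see
as_latin1 = raw.decode("latin_1")  # what a latin1 template makes the browser show

# The byte never changes; only the charset used to read it does.
print(hex(raw[0]), repr(as_koi8r), repr(as_latin1))
```

The two decoded characters are different even though the underlying
byte is identical, which is why the fix belongs in the template that
declares the charset and not in the search engine.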

If you save the results page to disk, load it again and change the
charset the browser uses for it, it looks fine.

So it is important, even if you have not translated the slices, that you
have that template there, if only to tell the browser the right charset.
I'm thinking this may be important to note on the search page, but I'm
not sure whether it would confuse people.

  - Craig

-- 
Craig Small VK2XLZ  GnuPG:1C1B D893 1418 2AF4 45EE  95CB C76C E5AC 12CA DFA5
Eye-Net Consulting http://www.eye-net.com.au/        <csmall@eye-net.com.au>
MIEEE <csmall@ieee.org>                 Debian developer <csmall@debian.org>


