[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: language_names

Thanks for this useful information.

Since most people won't have fonts installed to handle characters in
other charsets, this won't buy us much for now. We should still
make the change because it will prepare us for the future and
because it is the Right Thing.

On Fri, Jan 07, 2000 at 12:06:17AM +0900, Yoshizumi Endo wrote:
> As you know, some language_names at the bottom of each pages are
> broken for conflict of character set.  I think that to use "character
> entity references" is an adequate solution to avoid this problem.
>   "Character entity references in HTML 4.0"
>   http://www.w3.org/TR/1998/REC-html40-19980424/sgml/entities.html
> For example:
>   Español  -> Español
>   Româná   -> Românã
>   français -> français
All the language names in english/template/debian/language_names.wml
should be converted to this form. BTW, someones editor keeps changing
certain characters. For example ¼ is converted to 1/4. Using
character entity references should stop this problem.

> Also, "character entity references" can handle multibyte-characters
> with ISO10646 code set.
>   Japanese (Chinese characters) -> 日本語
>   Chinese  (Chinese characters) -> 中文
> I've committed new Japanese language_name with the "character entity
> references".  Then, if you have modern web browser (ex. mozilla M12 or
> IE5) and appropriate font-sets, you can read Japanese language_name at
> the bottom of the English top page.
One thing that confuses me (which may simply be a symptom of a
greater misunderstanding): will this work when the translations are
encoded using their native charset, e.g. iso-8859-1? My impression was
that the pages would need to be encoded entirely in something like
ISO10646 for this to work.

James (Jay) Treacy

Reply to: