Re: debconf w/ charset encoding support
Hi,
At Wed, 18 Sep 2002 10:21:25 +0900,
Junichi Uekawa wrote:
> There is a list posted in the bug entry, to assume the original unspecified
> encoding, by Tomohiro Kubota.
> Please use it.
Sorry, that list had too many entries than needed, while some
needed entries are lacking. The needed pairs of languages and
encodings for Debconf are avaiable at
http://www.debian.org/international/l10n/templates/
as I wrote yesterday. Since I found some bugs in the yesterday's
list and I also did some additional checks, I submit a renewed
list here.
-------------------------------------------------------------
LangCode Encoding Alternative for questionables Notes
-------------------------------------------------------------
bg CP1251 (1)
ca ISO-8859-15
cs ISO-8859-2
da ISO-8859-1 ISO-8859-15? (don't use Euro) (2)
da_DK ISO-8859-1 ISO-8859-15? (don't use Euro) (2)
de ISO-8859-15
es ISO-8859-15
fi ISO-8859-15
fr ISO-8859-15
gl ISO-8859-15
gl_ES ISO-8859-15
hu ISO-8859-2
hu_HU ISO-8859-2
it ISO-8859-15
ja EUC-JP
ko EUC-KR
lt ISO-8859-13 (3)
nb ISO-8859-1 ISO-8859-15? (not EU country) (2)
nl ISO-8859-15
no ISO-8859-1 ISO-8859-15? (not EU country) (2)
pl ISO-8859-2
pt ISO-8859-15
pt_BR ISO-8859-15
ro ISO-8859-16 ISO-8859-2? (4)
ru KOI8-R
sv ISO-8859-1 ISO-8859-15? (don't use Euro) (5)
tr_TR ISO-8859-9
zh_CN GB2312
zh_TW Big5
-------------------------------------------------------------
(1) The only one Bulgarian template (hylafax) seems to be written
in CP1251. (Only when the template is interpreted as CP1251,
it looks like a human language. ISO-8859-5, KOI8-R, and KOI8-U
cannot be human languages, though I don't know Bulgarian.)
(2) Note for non-Euro ISO-8859-1 countries:
I checked all templates which is listed in the l10n web site
and there were no templates which use bytes which mean different
characters in ISO-8859-1 and ISO-8859-15.
(3) The translator of the only one Lithuanian template
(Garabik) says that it is written in ISO-8859-13.
(4) Only one translation is for adduser package. However,
I don't know which encoding (ISO-8859-2 or ISO-8859-16)
is used....
(5) I checked all translations and found that main-menu package
has UTF-8-encoded (i.e., broken) template. All other templates
don't contain octets which are different in ISO-8859-1 and
ISO-8859-15.
To complete this list, following informations are needed:
1. Romanian encoding --- which encoding, ISO-8859-16 or ISO-8859-2,
is used for "adduser" package?
2. Non-Euro (previously-)ISO-8859-1 countries --- which encoding,
ISO-8859-1 or ISO-8859-15, is preferable encoding for Danish,
Norwegian, and Swedish?
---
Tomohiro KUBOTA <kubota@debian.org>
http://www.debian.or.jp/~kubota/
"Introduction to I18N" http://www.debian.org/doc/manuals/intro-i18n/
Reply to: