[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Release announcement simplified Chinese translation update



Matt Kraai <kraai@ftbfs.org> writes:

> On Mon, Feb 16, 2009 at 01:08:42AM +0800, Deng Xiyue wrote:
>> I don't know that translation of release announcement are put on web
>> using local locale instead of UTF-8.  I have checked the simplified
>> Chinese page[1], which is encoded in GB2312.  Maybe that is required by
>> policy.  I don't know which is the preferred way to store this in SVN
>> repository, so now I'm attaching both the UTF-8 version as
>> well as the GB2312 version for reference.  These 2 versions can be
>> converted to each other by "iconv" utility.
>
> You seem to have sent two copies of the UTF-8-encoded version.
>
> According to the Chinese .wmlrc, the character set used for the
> Chinese translation is Big5.  I tried to convert the file directly,
> which failed:
>
>  $ iconv -f UTF-8 -t BIG5 lenny-announcement.zh_CN.wml.utf8
>  <define-tag pagetitle>Debian GNU/Linux 5.0 iconv: illegal input sequence at position 43
>
> I was able to convert it by going through GB2312, however:
>
>  $ iconv -f UTF-8 -t GB2312 lenny-announcement.zh_CN.wml.utf8 | iconv -f GB2312 -t BIG5
>  ...
>
> I committed this version.  Would you please inspect the results and
> let me know if there are problems?

If all Chinese translation are by default converted to BIG5 then it is
problematic.  Simplified Chinese, as used in zh_CN locales, are supposed
to be converted to GB* encodings (GB2312, GBK, and GB18030, where
GB18030 is the current standard encoding); whereas Traditional Chinese,
as used in zh_TW, zh_HK, zh_SG or other traditional chinese locales, are
supposed to be converted to BIG5 encodings.  Maybe the .wmlrc needs
update.

Moreover, the garbled characters around "vendors" link still remains in
zh_CN, however doesn't show up in zh_TW.  CCing simplified Chinese list
for further insights.

Sincerely,
Deng Xiyue


Reply to: