Re: Release announcement simplified Chinese translation update
- To: Anthony Wong <ypwong@gmail.com>
- Cc: Arne Goetje <arne@linux.org.tw>, Anthony Fok <foka@debian.org>, ygh@debian.org, debian-www@lists.debian.org, debian-chinese-gb@lists.debian.org, "Debian-user in Chinese [ Big5 ]" <debian-chinese-big5@lists.debian.org>
- Subject: Re: Release announcement simplified Chinese translation update
- From: Vern Sun <s5unty@gmail.com>
- Date: Wed, 18 Feb 2009 23:10:03 +0800
- Message-id: <20090218151003.GA3946@debian>
- Mail-followup-to: Anthony Wong <ypwong@gmail.com>, Arne Goetje <arne@linux.org.tw>, Anthony Fok <foka@debian.org>, ygh@debian.org, debian-www@lists.debian.org, debian-chinese-gb@lists.debian.org, "Debian-user in Chinese [ Big5 ]" <debian-chinese-big5@lists.debian.org>
- In-reply-to: <46688e190902171043p3738e31sf2ffb71547e18cc8@mail.gmail.com>
- References: <499844F5.6030802@debian.org> <87zlgn39wl.fsf@users.alioth.debian.org> <20090216010302.GF1723@ftbfs.org> <87vdraddvi.fsf@gmail.com> <49996DDA.2090100@linux.org.tw> <20090216142828.GA20558@ftbfs.org> <49998D1C.2010304@linux.org.tw> <20090216163311.GA4739@ftbfs.org> <499A3741.3030602@linux.org.tw> <46688e190902171043p3738e31sf2ffb71547e18cc8@mail.gmail.com>
on 三, 2009-02-18 at 02:43 +0800, Anthony Wong wrote:
> I suggest 1. to convert all existing Chinese WML files for the Debian website
> from Big5 to UTF-8
>
> Any comments?
>
如果全部转换成 UTF-8 格式可能会存在问题,假设有两个用户(一个简体,一个繁体)都
贡献了一个翻译:
% cat foo.tc
中國
% cat foo.sc
中国
% enca foo.sc foo.tc
foo.sc: Universal transformation format 8 bits; UTF-8
foo.tc: Universal transformation format 8 bits; UTF-8
把简体用户贡献的翻译从 UTF-8 转到 GB2312 是正常的
~% iconv -f utf8 -t gb2312 foo.sc > foo.sc.gb
但是把繁体用户贡献的翻译从 UTF-8 转到 GB2312 是错误的
~% iconv -f utf8 -t gb2312 foo.tc > foo.tc.gb
iconv: illegal input sequence at position 3
同理,把简体用户贡献的翻译从 UTF-8 转到 BIG5 也是错误的
~% iconv -f utf8 -t big5 foo.sc > foo.sc.big
iconv: illegal input sequence at position 3
~% iconv -f utf8 -t big5 foo.tc > foo.tc.big
~% enca foo.*
foo.sc: Universal transformation format 8 bits; UTF-8
foo.sc.big: Traditional Chinese Industrial Standard; Big5
foo.sc.gb: Simplified Chinese National Standard; GB2312
foo.tc: Universal transformation format 8 bits; UTF-8
foo.tc.big: Traditional Chinese Industrial Standard; Big5
foo.tc.gb: Simplified Chinese National Standard; GB2312
--
Vern
2009-02-18
Attachment:
signature.asc
Description: Digital signature
Reply to: