Re: CJK workers, throw off you chains! (fwd)

To: debian-chinese-big5@lists.debian.org
Subject: Re: CJK workers, throw off you chains! (fwd)
From: Jonathan Chang <changcs@santos.ee.ntu.edu.tw>
Date: Fri, 23 Feb 2001 20:40:37 +0800
Message-id: <20010223204037.A23420@santos.ee.ntu.edu.tw>
In-reply-to: <20010223020225.A1458@yahoo.com>; from rigel863@yahoo.com on Fri, Feb 23, 2001 at 02:02:26AM -0700
References: <20010222202617.A4204@santos.ee.ntu.edu.tw> <Pine.LNX.4.10.10102221102450.18694-100000@atlas.datexx.com> <20010223020225.A1458@yahoo.com>

On Fri, Feb 23, 2001 at 02:02:26AM -0700, rigel wrote:
> On Thu, Feb 22, 2001 at 11:14:41AM -0500, Thomas Chan wrote:
> > I don't know who else is interested in CCCII mappings, but there isn't
> > much time--3.1 will be released at the end of March.  CCCII mappings are
> > also problematic, because source separation has been abandoned.  (You can
> 
> Do you know why it was abandoned? Is it believed that CCCII has been covered
> by the combination of other charsets?
> 
> I personally am very interested to see how the 70195 Han characters in
> Unicode 3.1 compare with the 75684 [1] in CCCII. Given that CCCII contains
> a lot of variants, there's a good possibility that Unicode already has more
> hanzi than CCCII. It'll be interesting to see which CCCII codes are not
> covered yet.
> 
> Although not exactly a fan of CCCII, I admire its well-thought-out design.
> It would be useful to have a mapping between CCCII and Unicode. A CCCII
> to CNS mapping would help somewhat in this regard. Does anyone know whether
> such a mapping exists?

Unihan-3.txt is a mapping of sorts. Unfortunately, it is far from complete.

What I really care about is a good approach to constructing "indices" for Chinese
documents (especially books). The CCDB (Chinese Characters DataBase) of CCCII
provides some promising ways to order Chinese characters. That's why I would
like to see a complete Unihan <-> CCCII mapping.
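
To make the "far from complete" point concrete, here is a minimal Python
sketch of how one might pull the CCCII entries out of a Unihan-style file and
list the code points that still have no CCCII value. It assumes the
tab-separated "U+XXXX<TAB>field<TAB>value" layout and the kCCCII field name
used by the Unihan database; the function names and the sample rows are only
illustrative, not taken from any real tool or verified against the file.

```python
import io


def load_cccii_mapping(lines):
    """Collect Unicode -> CCCII pairs from Unihan-style tab-separated lines."""
    mapping = {}
    for line in lines:
        line = line.strip()
        # Skip blank lines and '#' comment lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 3:
            continue
        codepoint, tag, value = fields
        if tag != "kCCCII":
            continue
        # "U+4E00" -> 0x4E00; a field may hold several space-separated codes.
        mapping[int(codepoint[2:], 16)] = value.split()
    return mapping


def unmapped(mapping, codepoints):
    """Return the code points that have no CCCII entry in the mapping."""
    return [cp for cp in codepoints if cp not in mapping]


# Illustrative sample rows only (assumed layout, not checked against Unihan).
SAMPLE = (
    "# sample data\n"
    "U+4E00\tkCCCII\t213021\n"
    "U+4E00\tkTotalStrokes\t1\n"
    "U+4E01\tkTotalStrokes\t2\n"
)

sample_mapping = load_cccii_mapping(io.StringIO(SAMPLE))
missing = unmapped(sample_mapping, [0x4E00, 0x4E01])
```

Run against the real file (for example, open("Unihan-3.txt", encoding="utf-8")
in place of the sample), the length of the "missing" list over the Han ranges
would give a rough measure of how much of the mapping is still absent.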

> [1] There is some ambiguity about how many characters are encoded in CCCII.
>     According to Ken Lunde's "CJKV Information Processing", the formal release
>     version has 53940 hanzi, while the draft version contains 75684, which is
>     the number I quoted. The book was published in 1999.
>

Could you tell me where to find the draft version containing 75684
characters? And who maintains the CCCII standard now? I would really
like to know the answers.

	Best regards,

-- 
Chia-Sheng Chang (Jonathan Chang)
Institute of Communications Engineering
College of Electrical Engineering and Computer Science
National Taiwan University
Taipei, Taiwan 10617, R.O.C.
E-Mail: changcs@santos.ee.ntu.edu.tw
