Re: UTF-8 locales

To: David Starner <dstarner98@aasaa.ofe.org>
Cc: debian-i18n@lists.debian.org, debian-devel@lists.debian.org
Subject: Re: UTF-8 locales
From: Bernd Eckenfels <lists@lina.inka.de>
Date: Sun, 19 Nov 2000 22:50:54 +0100
Message-id: <20001119225054.A14582@lina.inka.de>
In-reply-to: <20001118200111.A12372@x8b4e516e.dhcp.okstate.edu>; from dvdeug@x8b4e516e.dhcp.okstate.edu on Sat, Nov 18, 2000 at 08:01:11PM -0600
References: <87r94gqd2e.wl@surfchem0.riken.go.jp> <200011131854.DAA16802@smtp5.dti.ne.jp> <87u29928rd.wl@surfchem0.riken.go.jp> <20001116004510.A3138@debian.org> <20001116094026.A12204@daisy.vocalis.com> <20001118225558.A1180@debian.org> <20001118200111.A12372@x8b4e516e.dhcp.okstate.edu>

On Sat, Nov 18, 2000 at 08:01:11PM -0600, David Starner wrote:
> Which includes the Chinese and Japanese, who need the characters found
> in the Supplementary Ideographic Planes, which means 4 byte characters.

AFAIK UTF-8 is not able to encode 32-bit Unicode? I thought this was because
the "living" languages are all restricted to 16 bits? Hmm... I might be wrong.
Does that mean Java does not support Asian languages with its 16-bit Unicode?
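
A quick way to test the first assumption is to encode a code point from
the Supplementary Ideographic Plane by hand. This is only a minimal sketch:
`utf8_encode` is a name made up for illustration, and it assumes the usual
UTF-8 bit layout restricted to code points up to U+10FFFF (the range that
UTF-16 can also reach).

```python
def utf8_encode(cp: int) -> bytes:
    """Encode a single code point using the UTF-8 bit layout.

    Assumption: only code points up to U+10FFFF are accepted, and the
    surrogate range U+D800..U+DFFF is rejected.
    """
    if cp < 0 or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not an encodable code point: %#x" % cp)
    if cp < 0x80:          # 7 bits  -> 1 byte:  0xxxxxxx
        return bytes([cp])
    if cp < 0x800:         # 11 bits -> 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:       # 16 bits -> 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # 21 bits -> 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


if __name__ == "__main__":
    # U+20000 is the first code point of plane 2.
    print(utf8_encode(0x20000).hex())  # f0a08080
```

If this sketch is right, a code point beyond 16 bits does fit into UTF-8,
at the cost of four bytes, which would match the "4 byte characters"
mentioned above.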

<blockquote cite="http://www.unicode.org/unicode/standard/principles.html">

  While 65,000 characters are sufficient for encoding most of the many
  thousands of characters used in major languages of the world, the Unicode
  standard and ISO 10646 provide an extension mechanism called UTF-16 that
  allows for encoding as many as a million more characters, without use of
  complex modes or escape codes.  This is sufficient for all known character
  encoding requirements, including full coverage of all historic scripts of
  the world.

</blockquote>
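
The "extension mechanism" in the quote can be sketched roughly as follows.
This is an illustration of the surrogate-pair arithmetic as I understand
it, not code taken from the standard; the function names `to_surrogates`
and `from_surrogates` are made up for this example.

```python
def to_surrogates(cp: int) -> tuple:
    """Split a code point above U+FFFF into two 16-bit UTF-16 units."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point does not need a surrogate pair")
    v = cp - 0x10000                # 20 bits remain after the offset
    high = 0xD800 + (v >> 10)       # upper 10 bits -> U+D800..U+DBFF
    low = 0xDC00 + (v & 0x3FF)      # lower 10 bits -> U+DC00..U+DFFF
    return high, low


def from_surrogates(high: int, low: int) -> int:
    """Recombine a high/low surrogate pair into one code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid surrogate pair")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


if __name__ == "__main__":
    high, low = to_surrogates(0x20000)
    print(hex(high), hex(low))      # 0xd840 0xdc00
    # 1024 high values times 1024 low values:
    print(0x400 * 0x400)            # 1048576
```

The 1024 x 1024 combinations give 1,048,576 extra code points, which is
where the "as many as a million more characters" figure comes from. Each
such character occupies two 16-bit units rather than one.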

As I understand it, all living languages are contained in the "not-extended"
16-bit set. No?

Greetings
Bernd
-- 
  (OO)      -- Bernd_Eckenfels@Wendelinusstrasse39.76646Bruchsal.de --
 ( .. )  ecki@{inka.de,linux.de,debian.org} http://home.pages.de/~eckes/
  o--o     *plush*  2048/93600EFD  eckes@irc  +497257930613  BE5-RIPE
(O____O)  When cryptography is outlawed, bayl bhgynjf jvyy unir cevinpl!
