Re: Asian Problems with Unicode

To: dstarner98@aasaa.ofe.org
Cc: debian-devel@lists.debian.org
Subject: Re: Asian Problems with Unicode
From: Robert Coie <rac@mata.intrigue.com>
Date: 11 Sep 1999 09:11:34 -0700
Message-id: <87hfl1pgw9.fsf@mata.intrigue.com>
In-reply-to: David Starner's message of "Sat, 11 Sep 1999 04:44:13 -0500"
References: <19990910002219E.1000@eccosys.com> <199909110009.RAA05787@mata.intrigue.com> <19990911044413.A5118@x8b4e53cd.dhcp.okstate.edu>

David Starner <dvdeug@x8b4e53cd.dhcp.okstate.edu> writes:

> First place, are these standards mutually exclusive? Is it a problem in
> practice to work with both?

They are encoding methods, so they are mutually exclusive in the same
sense that base64 and uuencode are.  In practice, one major benefit of
using Unicode to work with Japanese is that information about which
encoding system is being used no longer needs to be stored separately
from the data.  This benefit is lost if you don't know whether you are
seeing UCS-2 or UTF-8.
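
To illustrate the point, here is a minimal sketch in Python (my choice
for illustration, not something used in this thread).  The sample
string and the try_decode helper are hypothetical, and since Python has
no codec named "ucs-2", the sketch assumes "utf-16-be", which produces
the same bytes as big-endian UCS-2 for characters in the Basic
Multilingual Plane.

```python
# Hypothetical sample text; any Japanese string would do.
text = "日本語"

utf8_bytes = text.encode("utf-8")
# Assumption: "utf-16-be" stands in for big-endian UCS-2 (same bytes for BMP characters).
ucs2_bytes = text.encode("utf-16-be")


def try_decode(data, encoding):
    """Return the decoded string, or None if the bytes are invalid in that encoding."""
    try:
        return data.decode(encoding)
    except UnicodeDecodeError:
        return None


# Reading the bytes with the right encoding recovers the text.
right_utf8 = try_decode(utf8_bytes, "utf-8")
right_ucs2 = try_decode(ucs2_bytes, "utf-16-be")

# Reading them with the wrong encoding gives either garbage or a decode error,
# and nothing in the bytes themselves says which reading was intended.
utf8_read_as_ucs2 = try_decode(utf8_bytes, "utf-16-be")
ucs2_read_as_utf8 = try_decode(ucs2_bytes, "utf-8")
```

So the choice between the two still has to be recorded or agreed on
somewhere outside the data.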

> Second, this isn't a big deal. I don't believe most people have huge 
> amounts of uncompressed text laying about, at least not enough to 
> make a doubling of the space make a real difference. As for compressed
> text, almost any compressor should get the text down to about the
> same space usage. (Feel free to prove me wrong here with real numbers.)

No, it's not a big deal, more just an inconvenience.  About the only
time it really caused me problems was when working with Java-SQL
connectivity.  There are UTF-8 extraction primitives, but no support
for working with UCS-2.  Since Japanese text generally takes three
bytes per character in UTF-8 against two in UCS-2, I had to either
explain to the client why the database was going to have 50% bloat, or
roll my own unsupported access primitives.
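
For anyone who wants to check the 50% figure, a rough sketch in Python
follows.  The sample sentence is made up for illustration, and
"utf-16-be" is again assumed as a stand-in for UCS-2, which is valid
here because every character in the sample lies in the Basic
Multilingual Plane.

```python
# Hypothetical sample: 12 characters of kana, kanji and CJK punctuation.
sample = "これは日本語の文章です。"

# Assumption: "utf-16-be" gives the same bytes as big-endian UCS-2 for BMP text.
ucs2_size = len(sample.encode("utf-16-be"))  # 2 bytes per character
utf8_size = len(sample.encode("utf-8"))      # 3 bytes per character for this sample

# Relative growth when the same text is stored as UTF-8 instead of UCS-2.
bloat = (utf8_size - ucs2_size) / ucs2_size

print(f"UCS-2: {ucs2_size} bytes, UTF-8: {utf8_size} bytes, bloat: {bloat:.0%}")
# prints "UCS-2: 24 bytes, UTF-8: 36 bytes, bloat: 50%"
```

Text that mixes in a lot of ASCII would come out differently, since
ASCII takes one byte in UTF-8 and two in UCS-2.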

-- 
Robert Coie
Implementor, Apropos Ltd.
