[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: default character encoding for everything in debian

Norbert Preining dijo [Mon, Aug 10, 2009 at 08:55:27PM +0200]:
> On Mo, 10 Aug 2009, Roger Leigh wrote:
> > Of course there's a penalty for certain operations.  But UTF-8 is about
> > as compact as an extended encoding is going to get.
> Rubbish. You know why in Japan and other Asian countries UTF8 is not
> so common? Because many of their glyphs need 4 (four!) bytes, while
> for example jis-2022 (AFAIR) is much more compact.
> We are not living in an ASCII world anymore.

It's not that much about the size as it is about backwards
compatibility. We users of Latin-based alphabets migrate easily to
UTF8, with occassional problems where we use diacritics. Eastern Asian
encodings are _completely_ incompatible with UTF8, so it is just not
possible to tolerate broken text every now and then. Everything just
breaks completely.

Gunnar Wolf • gwolf@gwolf.org • (+52-55)5623-0154 / 1451-2244

Reply to: