Re: default character encoding for everything in debian

To: debian-devel@lists.debian.org
Subject: Re: default character encoding for everything in debian
From: "Giacomo A. Catenazzi" <cate@debian.org>
Date: Wed, 12 Aug 2009 07:54:33 +0200
Message-id: <4A825919.8090907@debian.org>
In-reply-to: <20090811183800.GE5487@const.famille.thibault.fr>
References: <200908101309.22076.thomas@koch.ro> <20090810114540.GA13301@puntila.winnegan.fake> <20090811013358.65b60b98@sbs173> <20090811182808.GE19541@cajita.gateway.2wire.net> <20090811183800.GE5487@const.famille.thibault.fr>

Samuel Thibault wrote:
> Gunnar Wolf wrote on Tue 11 Aug 2009 13:28:08 -0500:
>> while length(str) in any language up to the 1990s was a mere
>> subtraction, now we must go through the string checking each byte to
>> see if it is a Unicode marker and subtract the appropriate number of
>> bytes.
> 
> Not necessarily.  Any sane implementation should just use wchar_t, and
> then it is a mere subtraction again.

An implementation that uses wchar_t is usually not sane, and it is
usually buggy as well. It is very difficult (AFAIK not impossible,
but I'm not so sure) to write programs with wchar_t that are portable
in the POSIX sense, i.e. that keep working when the locale changes.
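
To illustrate the locale problem, here is a minimal sketch in C (not
from the original thread). The helper name char_count_in_locale is
hypothetical. It relies on the POSIX behaviour of mbstowcs() with a NULL
destination, which returns the number of wide characters the conversion
would produce. The same bytes can give a different count, or a conversion
error, depending on which locale is active.

```c
#include <assert.h>
#include <locale.h>
#include <stdlib.h>

/* Hypothetical helper: count the characters of a multibyte string as
 * seen under the named locale.
 * Returns (size_t)-1 if the locale cannot be selected or if the bytes
 * are not valid in that locale's encoding.
 * Note: setlocale() changes state for the whole process, which is part
 * of what makes "changing locales" hard to get right. */
size_t char_count_in_locale(const char *locale_name, const char *s)
{
    assert(locale_name != NULL && s != NULL);

    if (setlocale(LC_CTYPE, locale_name) == NULL)
        return (size_t)-1;

    /* POSIX: with a NULL destination, mbstowcs() only counts. */
    return mbstowcs(NULL, s, 0);
}
```

As an example, the two bytes 0xC3 0xA9 would normally count as one
character under a UTF-8 locale and as two under an ISO-8859-1 locale,
provided such locales are installed; locale names vary between systems.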

The only way I know to use wchar_t sanely is to stay within what the
plain C standard requires: a single runtime environment and a single locale.
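
For contrast, the byte scan described in the quoted message needs neither
wchar_t nor a locale when the string is known to be UTF-8. A minimal
sketch, assuming valid UTF-8 input; the name utf8_length is hypothetical,
and it counts code points, not user-perceived characters (a base letter
followed by a combining accent counts as two).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: count code points in a NUL-terminated UTF-8
 * string. Continuation bytes have the bit pattern 10xxxxxx, so every
 * byte that does not match that pattern starts a new code point.
 * Assumes the input is valid UTF-8; no validation is done. */
size_t utf8_length(const char *s)
{
    assert(s != NULL);

    size_t n = 0;
    for (; *s != '\0'; s++) {
        if (((unsigned char)*s & 0xC0) != 0x80)
            n++;
    }
    return n;
}
```

This is a linear scan, not a subtraction, but its result does not depend
on the current locale or on how the compiler represents wchar_t.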

PS: note that the binary encoding of wchar_t depends on the compiler
environment (and that information is not exported).

ciao
	cate

Follow-Ups:
- Re: default character encoding for everything in debian
  - From: Samuel Thibault <sthibault@debian.org>
- Re: default character encoding for everything in debian
  - From: Roger Leigh <rleigh@codelibre.net>

References:
- default character encoding for everything in debian
  - From: Thomas Koch <thomas@koch.ro>
- Re: default character encoding for everything in debian
  - From: Siggy Brentrup <debian@psycho.i21k.de>
- Re: default character encoding for everything in debian
  - From: Harald Braumann <harry@unheit.net>
- Re: default character encoding for everything in debian
  - From: Gunnar Wolf <gwolf@gwolf.org>
- Re: default character encoding for everything in debian
  - From: Samuel Thibault <sthibault@debian.org>
