Re: default character encoding for everything in debian

To: debian-devel@lists.debian.org
Subject: Re: default character encoding for everything in debian
From: Samuel Thibault <sthibault@debian.org>
Date: Tue, 11 Aug 2009 22:18:55 +0200
Message-id: <20090811201855.GU5487@const.famille.thibault.fr>
Mail-followup-to: debian-devel@lists.debian.org
In-reply-to: <200908111940.n7BJeZQO067901@neskaya.eckenfels.net>
References: <20090811183800.GE5487@const.famille.thibault.fr> <200908111940.n7BJeZQO067901@neskaya.eckenfels.net>

Bernd Eckenfels wrote on Tue 11 Aug 2009 21:40:35 +0200:
> In article <20090811183800.GE5487@const.famille.thibault.fr> you wrote:
> > Not necessarily.  Any sane implementation should just use wchar_t
> 
> Which could be UTF-16 and therefore still has complicated length semantics.

??

wchar_t may be 32 or 16 bits wide (in the 16-bit case it can't express
Unicode beyond U+FFFF), but it's still meant to have simple length
semantics: one wchar_t per character.
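
To illustrate, here is a minimal sketch, not code from any package
discussed in this thread. The helper names wchar_covers_unicode() and
wide_length() are hypothetical; WCHAR_MAX and wcslen() are standard C.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <wchar.h>

/* Hypothetical helper: non-zero if a single wchar_t can hold any code
 * point up to U+10FFFF (zero on platforms with a 16-bit wchar_t). */
int wchar_covers_unicode(void)
{
    return WCHAR_MAX >= 0x10FFFF;
}

/* Hypothetical helper: length in wchar_t elements.  No decoding step is
 * needed, which is the "simple length semantics" referred to above. */
size_t wide_length(const wchar_t *s)
{
    assert(s != NULL);
    return wcslen(s);
}
```

On a platform where wchar_covers_unicode() returns zero, characters
beyond U+FFFF simply can't be stored in one wchar_t.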

> And even with UTF32 there are combining characters.

Which each count as one character. Then there is the problem of
rendering width, of course, but as I said, that problem is there anyway
as soon as you have a font with varying letter widths; string
manipulation doesn't pose any problem either way.

> But the length could be defined in code units - it's just a question
> of how useful it is.

Of course.  It's rarely useful to take into account character width
yourself, unless you are rendering on a tty, but then speed usually
doesn't matter and you can afford calling wcswidth() on your string
as late as possible.
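
A minimal sketch of that separation, assuming a POSIX system: the
string is handled by element count throughout, and the column width is
only computed at the point of drawing on the tty. The helper names
element_count() and column_count() are hypothetical; wcslen() and
wcswidth() are the standard calls.

```c
#ifndef _XOPEN_SOURCE
#define _XOPEN_SOURCE 700   /* asks <wchar.h> to declare wcswidth() */
#endif
#include <assert.h>
#include <stddef.h>
#include <wchar.h>

/* Hypothetical helper: number of wchar_t elements.  A combining
 * character counts as one element of its own. */
size_t element_count(const wchar_t *s)
{
    assert(s != NULL);
    return wcslen(s);
}

/* Hypothetical helper: number of terminal columns the string occupies,
 * or -1 if it contains a character that is non-printable in the
 * current locale.  Meant to be called only when actually rendering. */
int column_count(const wchar_t *s)
{
    assert(s != NULL);
    return wcswidth(s, wcslen(s));
}
```

For non-ASCII text, the program would normally need to call
setlocale(LC_ALL, "") first; in the default "C" locale wcswidth() may
report such characters as non-printable.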

Samuel
