Bug#292330: use UTF-8 by default

To: Pierre Habouzit <madcoder@debian.org>, 292330@bugs.debian.org
Cc: martin f krafft <madduck@debian.org>, Thorsten Glaser <tg@mirbsd.de>
Subject: Bug#292330: use UTF-8 by default
From: Mike Hommey <mh@glandium.org>
Date: Mon, 18 Jun 2007 13:26:08 +0200
Message-id: <[🔎] 20070618112607.GA8033@glandium.org>
Reply-to: Mike Hommey <mh@glandium.org>, 292330@bugs.debian.org
In-reply-to: <[🔎] 20070618094804.GA30062@artemis.internal.dc7.debconf.org>
References: <[🔎] Pine.BSM.4.64L.0706152312280.5177@odem.66h.42h.de> <[🔎] 20070616094439.GA6113@lapse.madduck.net> <[🔎] Pine.BSM.4.64L.0706161222280.8577@herc.mirbsd.org> <[🔎] 20070616132626.GA16635@lapse.madduck.net> <[🔎] Pine.BSM.4.64L.0706161427380.14260@odem.66h.42h.de> <[🔎] 20070616164800.GA24328@lapse.madduck.net> <[🔎] 20070618094804.GA30062@artemis.internal.dc7.debconf.org>

On Mon, Jun 18, 2007 at 10:48:04AM +0100, Pierre Habouzit <madcoder@debian.org> wrote:
> On Sat, Jun 16, 2007 at 05:48:00PM +0100, martin f krafft wrote:
> > also sprach Thorsten Glaser <tg@mirbsd.de> [2007.06.16.1528 +0100]:
> > > That's what I did, but the idea is not to have to do that. (Besides,
> > > "C" is installed by default, so we need some kind of "C.UTF-8", whose
> > > role is – for LC_CTYPE – usually fulfilled by en_US.UTF-8.)
> > 
> > Please stop CCing debian-project.
> > 
> > Does a C.UTF-8 exist? If yes, then this is a sound proposal,
> > I think.
> 
>   it's not. We could create a neutral.utf-8 locale for sure, but a
> C.utf-8 is really bad, because some programs check the locale for 'C'
> and when they foind that use hand optimized functions to replace the
> localized libc ones. And thanks to POSIX, even if it looks gross, it's
> totally OK to do that.
> 
>   C charset is and should be ascii, that's an assumption you should not
> break. In fact, using an 8bit locale would often not harm, but a
> multi-byte one would be really really bad (as you would end up with e.g.
> strings split in the middle of a point code, *brrr* you definitely don't
> want that).

Note that you won't get strings split in the middle of a point code with
UTF-8.

Anyways, maybe the general problem is that there should be a way to
generate locales at the user level (and store everything in ~/.locale,
for example)

Mike

Reply to:

Follow-Ups:
- Bug#292330: use UTF-8 by default
  - From: Thorsten Glaser <tg@mirbsd.de>
- Re: Bug#292330: use UTF-8 by default
  - From: Michelle Konzack <linux4michelle@freenet.de>

References:
- Bug#292330: use UTF-8 by default
  - From: Thorsten Glaser <tg@mirbsd.de>
- Bug#292330: use UTF-8 by default
  - From: martin f krafft <madduck@debian.org>
- Bug#292330: use UTF-8 by default
  - From: Thorsten Glaser <tg@mirbsd.de>
- Bug#292330: use UTF-8 by default
  - From: martin f krafft <madduck@debian.org>
- Bug#292330: use UTF-8 by default
  - From: Thorsten Glaser <tg@mirbsd.de>
- Bug#292330: use UTF-8 by default
  - From: martin f krafft <madduck@debian.org>
- Bug#292330: use UTF-8 by default
  - From: Pierre Habouzit <madcoder@debian.org>

Prev by Date: Bug#292330: use UTF-8 by default
Next by Date: Bug#292330: use UTF-8 by default
Previous by thread: Bug#292330: use UTF-8 by default
Next by thread: Bug#292330: use UTF-8 by default
Index(es):
- Date
- Thread