Bug#292330: use UTF-8 by default

To: debian-project@lists.debian.org
Cc: 292330@bugs.debian.org
Subject: Bug#292330: use UTF-8 by default
From: Thorsten Glaser <tg@mirbsd.de>
Date: Mon, 18 Jun 2007 13:09:17 +0000 (UTC)
Message-id: <Pine.BSM.4.64L.0706181306170.19131@odem.66h.42h.de>
Reply-to: Thorsten Glaser <tg@mirbsd.de>, 292330@bugs.debian.org
In-reply-to: <20070618112607.GA8033@glandium.org>
References: <Pine.BSM.4.64L.0706152312280.5177@odem.66h.42h.de> <20070616094439.GA6113@lapse.madduck.net> <Pine.BSM.4.64L.0706161222280.8577@herc.mirbsd.org> <20070616132626.GA16635@lapse.madduck.net> <Pine.BSM.4.64L.0706161427380.14260@odem.66h.42h.de> <20070616164800.GA24328@lapse.madduck.net> <20070618094804.GA30062@artemis.internal.dc7.debconf.org> <20070618112607.GA8033@glandium.org>

Mike Hommey dixit:

>>   it's not. We could create a neutral.utf-8 locale for sure

Sounds like a plan. Maybe something short and uppercase, akin to
"C" and "POSIX", how about "STD.UTF-8"?

>> but a
>> C.utf-8 is really bad, because some programs check the locale for 'C'
>> and when they find that, use hand-optimized functions to replace the
>> localized libc ones.

Ugh. Really? Nah, please spare me the details.

>Note that you won't get strings split in the middle of a code point with
>UTF-8.

It is possible with UTF-8: a split at an arbitrary byte offset can land
inside a multi-byte sequence.

Try this one: ł (U+0142) = C5 82
You can split between the C5 and the 82.
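
To make the point concrete, here is a minimal sketch in Python (not from
the thread; split_at and is_valid_utf8 are hypothetical helper names made
up for illustration). It cuts the two-byte encoding of ł at byte offset 1
and checks whether each half is valid UTF-8 on its own:

```python
# Illustration only: both helpers are hypothetical, not part of any library.

def split_at(text: str, offset: int) -> tuple[bytes, bytes]:
    """Encode text as UTF-8 and cut the byte string at a raw byte offset."""
    data = text.encode("utf-8")
    return data[:offset], data[offset:]


def is_valid_utf8(chunk: bytes) -> bool:
    """Return True if chunk decodes as UTF-8 on its own."""
    try:
        chunk.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# U+0142 encodes as the two bytes C5 82; cut between them.
head, tail = split_at("\u0142", 1)
print(head.hex(), tail.hex())                     # c5 82
print(is_valid_utf8(head), is_valid_utf8(tail))   # False False
print(is_valid_utf8(head + tail))                 # True
```

Neither half decodes by itself, so anything that cuts at byte offsets
(fixed-size buffers, truncation to N bytes) has to avoid landing inside a
sequence or be prepared to glue the pieces back together.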

>Anyways, maybe the general problem is that there should be a way to
>generate locales at the user level (and store everything in ~/.locale,
>for example)

That'd be a nice additional idea, but it creates additional problems too:
quotas, for example, or when these get updated, or the duplication, which
is always bad. That would probably be a glibc issue, then.

//mirabile
-- 
I believe no one can invent an algorithm. One just happens to hit upon it
when God enlightens him. Or only God invents algorithms, we merely copy them.
If you don't believe in God, just consider God as Nature if you won't deny
existence.		-- Coywolf Qi Hunt
