[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#292330: use UTF-8 by default



martin f krafft <madduck@debian.org> writes:

> also sprach Thorsten Glaser <tg@mirbsd.de> [2007.06.16.1528 +0100]:
>> That's what I did, but the idea is not to have to do that. (Besides,
>> "C" is installed by default, so we need some kind of "C.UTF-8", whose
>> role is – for LC_CTYPE – usually fulfilled by en_US.UTF-8.)
>
> Please stop CCing debian-project.
>
> Does a C.UTF-8 exist? If yes, then this is a sound proposal,
> I think.

I believe that the "C" locale is supposed to be US_ASCII only.

Rationale:

ISO C99 §7.4 (ctype "functions that have locale-specific aspects only
when not in the "C" locale...)  Many of the constraints this section
for the various is* functions would restrict the "C" locale to
US_ASCII.  UTF-8 would break the constraints over e.g. whitespace
characters, digits, lowercase characters etc..

I'm not sure who would want to use C.UTF-8.  If you want a basic
non-Unicode locale with known behaviour on all platforms, the "C"
locale is useful, particularly when doing serialisation where
localisation would break file formats.  If you want Unicode, why would
you not just use the appropriate .UTF-8 locale for your
language/country?


Regards,
Roger

-- 
  .''`.  Roger Leigh
 : :' :  Debian GNU/Linux             http://people.debian.org/~rleigh/
 `. `'   Printing on GNU/Linux?       http://gutenprint.sourceforge.net/
   `-    GPG Public Key: 0x25BFB848   Please GPG sign your mail.

Attachment: pgpcuVITWm5jp.pgp
Description: PGP signature


Reply to: