
Re: Bug#292330: use UTF-8 by default



On Mon, Jun 18, 2007 at 01:09:17PM +0000, Thorsten Glaser <tg@mirbsd.de> wrote:
> Mike Hommey dixit:
> 
> >>   it's not. We could create a neutral.utf-8 locale for sure
> 
> Sounds like a plan. Maybe something short and uppercase, akin to
> "C" and "POSIX", how about "STD.UTF-8"?
> 
> >> but a
> >> C.utf-8 is really bad, because some programs check the locale for 'C'
> >> and, when they find it, use hand-optimized functions to replace the
> >> localized libc ones.
> 
> Ugh. Really? Nah, please spare me the details.
> 
> >Note that you won't get strings split in the middle of a code point with
> >UTF-8.
> 
> This is possible with UTF-8.
> 
> Try this one: ł (U+0142) = C5 82
> You can split between the C5 and the 82.

The vast majority of programs won't split a string in the middle of a
word.
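
To make both points concrete, here is a minimal Python sketch. It is an
illustration only, not code from anyone in this thread. It reuses the
ł (U+0142) = C5 82 example quoted above; the helper name `decodes` and
the sample sentence are assumptions made up for the example.

```python
# Illustration only: the character and its bytes come from the quoted message.
text = "ł"                       # U+0142
encoded = text.encode("utf-8")   # two bytes: C5 82

# Split at a raw byte offset, between the C5 and the 82.
head, tail = encoded[:1], encoded[1:]


def decodes(data: bytes) -> bool:
    """Return True if data is valid UTF-8 on its own (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Split a (made-up) sample sentence on the ASCII space instead.
sentence = "zażółć gęślą jaźń".encode("utf-8")
words = sentence.split(b" ")

print(encoded.hex())                    # c582
print(decodes(head), decodes(tail))     # False False
print(all(decodes(w) for w in words))   # True
```

Cutting at an arbitrary byte offset leaves two fragments that are each
invalid UTF-8. Cutting at an ASCII space is safe, because every byte of a
multi-byte UTF-8 sequence is 0x80 or above and so can never be mistaken
for a space.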

Mike


