[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: default character encoding for everything in debian



Roger Leigh, le Wed 12 Aug 2009 11:30:50 +0100, a écrit :
> > > The default is UTF-32 or UTF-16, whichever corresponds to the width of
> > > wchar_t.
> > 
> > This documentation is bogus BTW.  It should read "UCS-4 or UCS-2".
> 
> It's "strictly" correct according to the standard.
> http://en.wikipedia.org/wiki/UTF-32/UCS-4 for an overview.

« except that the UTF-32 standard has additional Unicode
semantics. »

In UTF-32 mode, gcc introduces a BOM, and in UTF-16 it allows without
warnings characters after U+FFFF.

Samuel


Reply to: