
Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly



> Yesterday I noticed that the UTF-8 encoding doesn't seem to be
> correctly supported by the current locales package.  I have problems
> using the lower and upper case conversion.
> 
> Here are two different ways to reproduce this behaviour.  In both cases
> I used an
> "xterm -u8 -fn -misc-fixed-medium-r-normal--14-130-75-75-c-70-iso10646-1"
> with LC_ALL=de_DE.UTF-8 to test the programs.  In this email I display
> the umlaut characters in latin1; I will append the typescript with the
> real UTF-8 encoding of the characters.
> 
> - Programs like tr (textutils 2.0-12):
> 
> $ tr [:lower:] [:upper:]
> oauöäü                           # the input
> OAUöäü                           # the output
> 
> The ASCII alphabetic characters are correctly transformed, the
> UTF-8-encoded umlauts are not.
> 
> - The bash (2.05a-9):
> $ for i in a A ä Ä; do case $i in [[:lower:]]) echo "$i is lc"; esac; done
> a is lc                          # the output
> 
> The ä umlaut should also be output.

This should be fixed in glibc 2.3.1 in sid; please check.
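
One possible way to check is a small script along the lines of the bash
test in the report.  This is only a sketch: the function name "classify"
is made up for illustration, and it assumes the script is saved as UTF-8
and run in a UTF-8 terminal with the locale from the report installed,
e.g. "LC_ALL=de_DE.UTF-8 bash check.sh" (the file name is arbitrary).

```shell
#!/bin/bash
# Sketch: check whether [[:lower:]] matches non-ASCII letters in the
# locale the script is run under (set LC_ALL before calling it).

# Print "<char> is lc" if the argument matches [[:lower:]];
# print nothing otherwise.
classify() {
    case $1 in
        [[:lower:]]) echo "$1 is lc" ;;
    esac
}

# Same characters as in the report: ASCII and umlaut, lower and upper case.
for i in a A ä Ä; do
    classify "$i"
done
```

If the locale data handles UTF-8 correctly, a line for the lower-case
umlaut should appear in addition to the one for "a".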

Regards,
-- gotom


