Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly
> Yesterday I noticed that the UTF-8 encoding doesn't seem to be
> correctly supported by the current locales package. I have problems
> using the lower and upper case conversion.
>
> Here are two different ways to reproduce this behaviour. In both cases
> I used an
> "xterm -u8 -fn -misc-fixed-medium-r-normal--14-130-75-75-c-70-iso10646-1"
> with LC_ALL=de_DE.UTF-8 to test the programs. In this email I display
> the umlaut characters in latin1; I will append the typescript with the
> real UTF-8 encoding of the characters.
>
> - Programs like tr (textutils 2.0-12):
>
> $ tr [:lower:] [:upper:]
> oauöäü # the input
> OAUöäü # the output
>
> The ASCII alphabetic characters are correctly transformed, the
> UTF-8 encoded umlauts are not.
>
> - The bash (2.05a-9):
> $ for i in a A ä Ä; do case $i in [[:lower:]]) echo "$i is lc"; esac; done
> a is lc # the output
>
> The ä umlaut should also be output.
This should be fixed in glibc 2.3.1 in sid; please check.
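
One rough way to check is to rerun the bash test from the report as a small script. This is only a sketch: the function name classify is made up for illustration, and it assumes the script file is saved as UTF-8 and that the de_DE.UTF-8 locale has been generated on the test machine.

```shell
#!/bin/sh
# classify (hypothetical helper): print "<arg> is lc" for every argument
# that the current locale's [[:lower:]] class matches as one character.
classify() {
    for i in "$@"; do
        case $i in
            [[:lower:]]) printf '%s is lc\n' "$i" ;;
        esac
    done
}

# Same inputs as in the report. Which of the umlauts are matched depends
# on the locale the shell was started with and on the installed glibc.
classify a A ä Ä
```

Started as "LC_ALL=de_DE.UTF-8 bash script.sh", a fixed system should report both a and ä; in the C locale only a is reported, because ä is then seen as two bytes rather than one character.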
Regards,
-- gotom