Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly

To: Torsten Hilbrich <debian-user-german@myrkr.in-berlin.de>, 139861@bugs.debian.org
Subject: Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly
From: GOTO Masanori <gotom@debian.or.jp>
Date: Wed, 26 Feb 2003 20:29:55 +0900
Message-id: <[🔎] 80d6lfxoss.wl@oris.opensource.jp>
Reply-to: GOTO Masanori <gotom@debian.or.jp>, 139861@bugs.debian.org

> Yesterday I noticed, that the UTF-8 encoding doesn't seem to be
> correctly supported by the current locales package.  I have problems
> using the lower and upper case conversion.
> 
> Here are two different ways to exploit this behaviour.  In both cases
> I used an=20
> "xterm -u8 -fn -misc-fixed-medium-r-normal--14-130-75-75-c-70-iso10646-1"=
> =20
> with LC_ALL=3Dde_DE.UTF-8 to test the programs.  In this email I display
> the umlaut characters in latin1, I will append the typescript with the
> real utf-8 encoding of the characters.
> 
> - Programs like tr (textutils 2.0-12):
> 
> $ tr [:lower:] [:upper:]
> oau=F6=E4=FC                           # the input
> OAU=F6=E4=FC                           # the output
> 
> The ASCII alphabetic characters are correctly transformed, the utf-8
> encoding umlauts are not.
> 
> - The bash (2.05a-9):
> $ for i in a A =E4 =C4; do case $i in [[:lower:]]) echo "$i is l"; esac; do=
> ne
> a is lc                          # the output
> 
> The =E4 umlaut should also be output.

It should be fixed in sid glibc 2.3.1, please check.

Regards,
-- gotom

Reply to:

Follow-Ups:
- Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly
  - From: Torsten Hilbrich <debbug@myrkr.in-berlin.de>

Prev by Date: Bug#119528: locales support for Romanian
Next by Date: Bug#132573: marked as done (locales: localedef segfault)
Previous by thread: Bug#119528: locales support for Romanian
Next by thread: Bug#139861: LC_CTYPE with UTF-8 doesn't work correctly
Index(es):
- Date
- Thread