Bug#471021: locales: EastAsianAmbiguous character width is always 1 in UTF-8
Package: locales
Version: 2.7-9
Severity: normal
First, this has been reported to upstream,
but there is no progress for a while.
Please allow me to report same one.
http://sourceware.org/bugzilla/show_bug.cgi?id=4335
------------------------------------------------------------------------
According to /usr/share/i18n/charmaps/UTF-8.gz,
Character width is 1 by default. W(Wide) and F(Full Width) are 2.
% Character width according to Unicode 3.2.
% - Default width is 1.
% - Double-width characters have width 2; generated from
% "grep '^[^;]*;[WF]' EastAsianWidth.txt"
% and "grep '^[^;]*;[^WF]' EastAsianWidth.txt"
% - Non-spacing characters have width 0; generated from PropList.txt or
% "grep '^[^;]*;[^;]*;[^;]*;[^;]*;NSM;' UnicodeData.txt"
% - Format control characters have width 0; generated from
% "grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt"
% - Zero width characters have width 0; generated from
% "grep '^[^;]*;ZERO WIDTH ' UnicodeData.txt"
A(Ambiguous) is expected that it is context-sensitive,
but its width is always 1 irrelevant to context.
According to http://www.unicode.org/reports/tr11/#Recommendations
> When mapping Unicode to East Asian legacy character encodings
>
> * Wide Unicode characters always map to fullwidth characters.
> * Narrow (and neutral) Unicode characters always map to halfwidth
> characters.
> * Halfwidth Unicode characters always map to halfwidth characters.
> * Ambiguous Unicode characters always map to fullwidth characters.
I think EastAsianAmbiguous character width should be 2 in CJK UTF-8
locale.
------------------------------------------------------------------------
-- System Information:
Debian Release: lenny/sid
APT prefers unstable
APT policy: (500, 'unstable'), (500, 'stable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.24 (SMP w/2 CPU cores; PREEMPT)
Locale: LANG=ja_JP.UTF-8, LC_CTYPE=ja_JP.UTF-8 (charmap=UTF-8) (ignored: LC_ALL set to ja_JP.UTF-8)
Shell: /bin/sh linked to /bin/bash
Versions of packages locales depends on:
ii debconf [debconf-2.0] 1.5.20 Debian configuration management sy
ii libc6 [glibc-2.7-1] 2.7-9 GNU C Library: Shared libraries
locales recommends no packages.
-- debconf-show failed
Reply to: