UTF-8 bugs (was: Deadline for jessie init system choice)
On 2013-12-14 14:46:03 -0700, Bob Proulx wrote:
> Pavel Volkov wrote:
> > What's wrong with UTF-8 currently?
>
> fmt: incorrect formatting of UTF-8 text
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=650381
>
> tr: fails to replace umlauts
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=388689
> tr fails with UTF-8
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=431231
> _CTYPE with UTF-8 doesn't work correctly
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=139861
> tr cannot handle unicode
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=613155
>
> uniq: merges obscure Cyrillic characters
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=649729
>
> I am sure there is more.
Here are a few other ones:
* scp output alignment bug with UTF-8/multibyte sequences
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=407088
I've just reported it upstream.
* xmessage ignores locale encoding
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=505893
(and in particular it is wrong with UTF-8 locales)
* xpp does not support UTF-8
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=630717
* xprop assumes that WM_ICON_NAME and WM_NAME are encoded in ISO-8859-1
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=699746
--
Vincent Lefèvre <vincent@vinc17.net> - Web: <http://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <http://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
Reply to: