Hello all,

while testing some odd locale configurations with the package I maintain (apt-listbugs), I stumbled upon an awkward crash. I am seeking the help of some knowledgeable Ruby expert, to understand what's going on.

The crash may be reproduced with apt-listbugs version 0.1.14 or later (which uses the ruby-unicode library to correctly compute the width of UTF-8 strings).

  $ locale
  LANG=en_US.UTF-8
  LANGUAGE=
  LC_CTYPE="en_US.UTF-8"
  LC_NUMERIC="en_US.UTF-8"
  LC_TIME="en_US.UTF-8"
  LC_COLLATE="en_US.UTF-8"
  LC_MONETARY="en_US.UTF-8"
  LC_MESSAGES="en_US.UTF-8"
  LC_PAPER="en_US.UTF-8"
  LC_NAME="en_US.UTF-8"
  LC_ADDRESS="en_US.UTF-8"
  LC_TELEPHONE="en_US.UTF-8"
  LC_MEASUREMENT="en_US.UTF-8"
  LC_IDENTIFICATION="en_US.UTF-8"
  LC_ALL=
  $ apt-listbugs -v
  0.1.17

So far so good, everything works as expected.

But, if I try the following odd locale configuration (Italian language with 'C' locale settings), I get a crash:

  $ LANGUAGE='it_IT:it' LC_ALL='C' locale
  LANG=en_US.UTF-8
  LANGUAGE=it_IT:it
  LC_CTYPE="C"
  LC_NUMERIC="C"
  LC_TIME="C"
  LC_COLLATE="C"
  LC_MONETARY="C"
  LC_MESSAGES="C"
  LC_PAPER="C"
  LC_NAME="C"
  LC_ADDRESS="C"
  LC_TELEPHONE="C"
  LC_MEASUREMENT="C"
  LC_IDENTIFICATION="C"
  LC_ALL=C
  $ LANGUAGE='it_IT:it' LC_ALL='C' apt-listbugs -v
  /usr/lib/ruby/vendor_ruby/aptlistbugs/logic.rb:390:in `width': "\xC3" from ASCII-8BIT to UTF-8 (Encoding::UndefinedConversionError)
          from /usr/lib/ruby/vendor_ruby/aptlistbugs/logic.rb:390:in `<class:SimpleViewer>'
          from /usr/lib/ruby/vendor_ruby/aptlistbugs/logic.rb:387:in `<class:Viewer>'
          from /usr/lib/ruby/vendor_ruby/aptlistbugs/logic.rb:381:in `<top (required)>'
          from /usr/lib/ruby/2.1.0/rubygems/core_ext/kernel_require.rb:55:in `require'
          from /usr/lib/ruby/2.1.0/rubygems/core_ext/kernel_require.rb:55:in `require'
          from /usr/bin/apt-listbugs:349:in `<main>'

The failing statements are:

  DeprecatedWarning = _("********** on_hold IS DEPRECATED. USE p INSTEAD to use pin **********")
  DeprecatedWarningHeader = "*" * Unicode.width(DeprecatedWarning)

If I understand correctly, what seems to happen is that:

 • the DeprecatedWarning string is assigned the Italian translation of the English message

 • the English message is ASCII, while the Italian translation includes some non-ASCII UTF-8 characters

 • ruby-gettext returns a non-UTF-8 string, since LC_CTYPE is 'C' (?)

 • ruby-unicode fails to compute the width of this string, since it unsuccessfully attempts to convert it to UTF-8 (?)

Now, my question is: which package should be blamed for the crash?
Should I change the way I use ruby-unicode in apt-listbugs?
Or should I report a bug against package ruby-unicode?
Or against package ruby-gettext or ruby-locale?

Could some Ruby expert shed some light on this issue, please?

Thanks for your time!

P.S.: I am not subscribed to debian-ruby, hence please Cc me on replies.

-- 
 http://www.inventati.org/frx/
 There's not a second to spare! To the laboratory!
..................................................... Francesco Poli .
 GnuPG key fpr == CA01 1147 9CD2 EFDF FB82 3925 3E1C 27E1 1F69 BFFE
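
Addendum (a minimal sketch, not taken from apt-listbugs, ruby-gettext or ruby-unicode): the snippet below triggers the same kind of exception in plain Ruby, assuming the translated string carries UTF-8 bytes but is tagged ASCII-8BIT. The sample string and the variable names are hypothetical. Whether ruby-gettext really returns such a string under LC_CTYPE=C, and whether Unicode.width performs this conversion internally, are assumptions that have not been verified here.

```ruby
# Hypothetical stand-in for the translated message: the UTF-8 bytes of
# "già" (the "à" is 0xC3 0xA0), but tagged as ASCII-8BIT (binary).
translated = "gi\xC3\xA0".b   # String#b returns a copy tagged ASCII-8BIT

# Transcoding binary data to UTF-8 fails on the first non-ASCII byte,
# because ASCII-8BIT has no defined mapping for bytes above 0x7F.
begin
  translated.encode("UTF-8")
  outcome = :converted
rescue Encoding::UndefinedConversionError => e
  outcome = :failed
  message = e.message
end
puts "encode: #{outcome} (#{message})"

# Possible workaround (an assumption, not a confirmed fix): re-tag the
# bytes as UTF-8 without transcoding them. This is only safe if the
# bytes really are valid UTF-8, which valid_encoding? can check.
fixed = translated.dup.force_encoding("UTF-8")
puts "re-tagged: #{fixed.encoding}, valid=#{fixed.valid_encoding?}, chars=#{fixed.length}"
```

If the assumption holds, re-tagging the string with force_encoding before calling Unicode.width might avoid the crash, but it would not settle which package is at fault.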