[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#864211: [locales] Tibetan script collation broken (Dzongkha and Tibetan)



Package: locales
Version: 2.24-11
Severity: normal

--- Please enter the report below this line. ---

Hello,

Tibetan or Dzongkha sorting do not work properly with the current locale data.

With the following test file:

$ cat tibt_order_test.txt
ལྔ
ང
ཅ
རྔ
སྔ
བརྔ
བསྔ

I get the following wrong result:

$ LC_COLLATE="dz_BT.utf8" sort tibt_order_test.txt
ང
བརྔ
བསྔ
རྔ
ལྔ
སྔ
ཅ

The correct result would be

ང
རྔ
ལྔ
སྔ
བརྔ
བསྔ
ཅ

Dz and bo have the same collation data in CLDR.

See https://github.com/eroux/tibetan-collation for more on tibetan collation.

Result of locale -a:

bo_CN
bo_CN.utf8
bo_IN
bo_IN.utf8
C
C.UTF-8
dz_BT
dz_BT.utf8
en_GB.utf8
en_US.utf8
fr_FR.utf8
POSIX

Thank you,
--
Elie

--- System information. ---
Architecture: Kernel:       Linux 4.9.0-3-amd64

Debian Release: 9.0
500 unstable ftp.fr.debian.org 500 stable repo.skype.com 500 stable dl.google.com 500 apt/stable download.sublimetext.com
--- Package information. ---
Depends          (Version) | Installed
==========================-+-===========
libc-bin         (>> 2.24) | 2.24-11
libc-l10n        (>> 2.24) | 2.24-11
debconf          (>= 0.5)  | 1.5.61
 OR debconf-2.0            |

Package's Recommends field is empty.

Package's Suggests field is empty.


Reply to: