[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#600310: wrong collation(?) order for et_EE.UTF-8 causes regexps to fail matching randomly



Package: locales
Version: 2.11.2-6
Severity: critical
Tags: l10n

There's a bug in et_EE.UTF-8 locale definition causing some latin
chars to be treated as non-letters.  These are at least in range
t..y inclusive, i.e. [t-y].  Like this:

 $ echo $LANG
 et_EE.UTF-8
 $ echo s | grep '[a-z]'
 s
 $ echo t | grep '[a-z]'
 $ _

I.e., the latin letter "t" does not match [a-z] regexp.

This is a critical issue unfortunately, because it makes various regex
failing to match, breaking random components.  In the actual problem case
the issue were that many cron jobs were not running on the system for a
mysterious reason, and the problem was because cron uses a regexp to
filter invalid cronjob names, which is /^[a-zA-Z0-9_-]+$/.

Thanks.

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (990, 'testing'), (50, 'unstable'), (1, 'experimental')
Architecture: i386 (x86_64)

Kernel: Linux 2.6.35-amd64 (SMP w/2 CPU cores)
Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages locales depends on:
ii  debconf [debconf-2.0]         1.5.35     Debian configuration management sy
ii  libc6 [glibc-2.11-1]          2.11.2-6   Embedded GNU C Library: Shared lib

locales recommends no packages.

locales suggests no packages.

-- debconf information:
* locales/default_environment_locale: ru_RU.UTF-8
* locales/locales_to_be_generated: en_US.UTF-8 UTF-8, ru_RU.KOI8-R KOI8-R, ru_RU.UTF-8 UTF-8



Reply to: