[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Forking GNU libc locales



Hi folks,

I am working on forking GNU libc locales for more than 5 months,
and have now something usable.
Two new packages hit unstable recently:
  * belocs-locales-data: contains locale data files.  It is based
    upon the locales package, and Replaces/Provides/Conflicts with it.
  * belocs-locales-bin: contains modified versions of locale and
    localedef programs, which are needed to compile locale data files
    shipped in belocs-locales-data.

The belocs-locales-data package fixes most bugs reported against
upstream and Debian locales package.  It is currently based upon GNU
libc 2.3.4, but I intend to make large changes soon, and will then fork
it.  For this purpose, the belocs project has just been created on
Alioth.  If you submitted locale related changes to Bugzilla and/or BTS
which have not been integrated into belocs-locales-data, or are willing
to do so, please discuss these issues on the belocs-locales-data mailing
list on Alioth.  If your issues are specific to Debian, debian-i18n is
obviously a better place.

Here are some improvements provided by these packages (of course, much
more to come soon ;))
  * locale-gen compiles locales only if needed.  (I.e. locale does not
    exist or one of its input files has been modified)
  * Without argument, locale prints the LANGUAGE environment variable
    if it is set.
  * localedef reads iso-{639,3166,4217}.def files at run-time, to allow
    user changes in locale files without having to recompile localedef
    but keep checks for invalid values.
  * Several bugs in localedef have been fixed; e.g. we can now sort
    according to our braindead French collation rules, as specified in
    ISO 14651 ;)
  * Modified locale files (these changes are reported against GNU libc
    2.3.4, but not against the locales package)
    + az_AZ [BZ662]
      Updated version by Pablo Saratxaga <pablo at mandrakesoft com>.
    + bn_BD [BZ657]
      Updated version by Jamil Ahmed <jamil at bengalinux org>.
    + bs_BA [BZ663]
      Fix LC_COLLATE section, by Pablo Saratxaga <pablo at mandrakesoft com>.
    + cs_CZ
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + csb_PL [BZ546]
      New locale, by Andrzej Krzysztofowicz <ankry at mif pg gda pl>
    + csb_PL
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + da_DK [BZ672]
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + da_DK
      Define <ETH>, this symbol is used in collation rules.
      Patch by Denis Barbier.
    + de_CH, de_DE [BTS235759]
      In de_DE (resp. de_CH), transliterate <U201E> and <U201C>
      quotes to <U00BB> and <U00AB> (resp. <U00AB> and <U00BB>).
      Patch by Denis Barbier and the <debian-l10n-german at lists debian org>
      mailing list.
    + dz_BT
      New locale, by Pema Geyleg <pema_geyleg at druknet bt>, Pablo
      Saratxaga <pablo at mandrakesoft com> and Denis Barbier.
    + en_CA [BZ672]
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + eo_XX [BTS233308]
      New locale, by Edmund GRIMLEY EVANS <edmundo at rano org>.
    + es_AR [BTS108373]
      Add grouping and separator to number formatting. by Nicolas
      Lichtmaier <nick at debian org>
    + es_ES [BZ664]
      Load iso14651_t1 in LC_COLLATE.
      Patch by Pablo Saratxaga <pablo at mandrakesoft com>.
    + es_US [BZ672]
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + et_ET
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + eu_FR [BTS257840]
      New locale, by Christian Perrier <bubulle at debian org>.
    + fi_FI [BZ297, BZ672]
      Fix date_fmt. Patch by Lauri Nurmi <lanurmi at iki fi>.
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + fr_BE fr_CA fr_CH fr_FR fr_LU
      Fix grouping and thousand separator, by getting data from
      http://www.unicode.org/cldr/
      Patch by Denis Barbier. 
    + hr_HR
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + hsb_DE [BZ566]
      New locale, by Andrzej Krzysztofowicz <ankry at mif pg gda pl>
    + hsb_DE
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + hu_HU [BZ195]
      Add transliteration rules for hu_HU. Patch by Egmont Koblinger
      <egmont at uhulinux hu>
    + hy_AM [BZ68]
      Several updates by Ani <petrosyan at gmail com>.
    + is_IS
      Remove undefined symbols from collation rules.  Patch by Denis Barbier.
    + lt_LT
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + lv_LV
      Remove undefined symbols from collation rules.  Patch by Denis Barbier.
    + mg_MG [BTS271549]
      New locale, by Christian Perrier <bubulle at debian org> and
      Jaonary Rabarisoa <jaonary at ifrance com>.
    + nb_NO [BZ672]
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + nb_NO
      Define <ETH>, this symbol is used in collation rules.
      Patch by Denis Barbier.
    + nso_ZA [BZ486]
      New locale, by Dwayne Bailey <dwayne at translate org za>.
    + om_ET
      This locale does not compile with localedef from belocs-locales-bin,
      it needs some tweaking.
    + om_KE
      Some changes are also needed in this file to compile om_ET.
    + pl_PL
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + ru_RU [BZ352]
      Set thousand_sep and mon_thousand_set to non-breaking space based
      on input from Paul Pogonyshev <pogonyshev at gmx net> and Sergei Ivanov
      <svivanov at pdmi ras ru>.
    + ru_RU [BTS225516]
      Generate ru_RU.cp1251, requested by Michael Bravo
      <mbravo at wintermute spb ru>.
    + rw_RW [BZ451]
      New locale, by Steve Murphy <murf at e-tools com>.
    + si_LK [BZ367]
      New locale, by Anuradha Ratnaweera <anuradha at gnu org>.
    + sid_ET
      Fix compilation by localedef.
    + sl_SI
      Replace current collation rules by iso14651_t1 tailoring.
      Patch by Denis Barbier.
    + sq_AL [BZ665]
      Add proper LC_COLLATE section, by Pablo Saratxaga
      <pablo at mandrakesoft com>.
    + sr_CS, sr_CS@Latn [BZ38]
      New Serbian locales, by Danilo Segan <dsegan at gmx net>.
    + sr_CS@jekavian [BZ39]
      New sr_CS@jekavian locale (Serbian Jekavian locale for Serbia and
      Montenegro), provided by Danilo Segan <dsegan at gmx net>.
    + sr_YU
      Keep all old sr_YU locales for compatibility reasons.
    + ss_ZA [BZ533]
      New locale, by Dwayne Bailey <dwayne at translate org za>.
    + st_ZA [BZ485]
      Update by Dwayne Bailey <dwayne at translate org za>.
    + te_IN [BZ428]
      Fix several problems in te_IN. Patch by Sunil Mohan Adapa
      <sunil at atc tcs co in>
      The postal_fmt definition change had to be reverted because
      localedef does not recognize it.
    + tg_TJ [BZ666]
      Fix yesexpr, by Pablo Saratxaga <pablo at mandrakesoft com>.
    + tl_PH [BTS295810]
      Set yesexpr/noexpr/yesstr/nostr fields.  Patch by eric pareja
      <xenos at upm edu ph>
    + tn_ZA [BZ529]
      New locale, by Dwayne Bailey <dwayne at translate org za>.
    + tr_TR
      Load iso14651_t1 in LC_COLLATE.  Patch by Denis Barbier. 
    + ts_ZA [BZ532]
      New locale, by Dwayne Bailey <dwayne at translate org za>.
    + ve_ZA [BZ491]
      New locale, by Dwayne Bailey <dwayne at translate org za>.
    + vi_VN [BZ448]
      Updated version by Pablo Saratxaga <pablo at mandrakesoft com>.
    + wa_BE [BZ667]
      Fix weekday names to match new orthographic rules.
      Patch provided by Pablo Saratxaga <pablo at mandrakesoft com>.
    + wo_SN [BTS279697]
      New wo_SN locale, by Christian Perrier <bubulle at debian org>
      and Samba Ndao Diop <samba ndao diop at tiscali fr>.
    + yi_US [BZ668]
      Fix several spelling errors.
      Patch provided by Pablo Saratxaga <pablo at mandrakesoft com>.
    + xh_ZA [BZ493]
      Update by Dwayne Bailey <dwayne at translate org za>.
    + zu_ZA [BZ488]
      Update by Dwayne Bailey <dwayne at translate org za>.
  * Add UTF-8 variants for all locales to SUPPORTED, as well as some
    ISO-8859-15 variants (these changes are reported against GNU libc
    2.3.4, but not against the locales package)
    + aa_DJ.UTF-8/UTF-8
    + af_ZA.UTF-8/UTF-8
    + an_ES.UTF-8/UTF-8
    + br_FR.UTF-8/UTF-8
    + bs_BA.UTF-8/UTF-8
    + da_DK.ISO-8859-15/ISO-8859-15
    + en_GB.ISO-8859-15/ISO-8859-15
    + en_US.ISO-8859-15/ISO-8859-15
    + gd_GB.UTF-8/UTF-8
    + hy_AM/ARMSCII-8
    + hy_AM.UTF-8/UTF-8 
    + ka_GE.UTF-8/UTF-8
    + kk_KZ.UTF-8/UTF-8
    + lg_UG.UTF-8/UTF-8
    + mi_NZ.UTF-8/UTF-8
    + oc_FR.UTF-8/UTF-8
    + om_KE.UTF-8/UTF-8
    + so_DJ.UTF-8/UTF-8
    + so_KE.UTF-8/UTF-8
    + so_SO.UTF-8/UTF-8
    + sv_SE.ISO-8859-15/ISO-8859-15
    + tg_TJ.UTF-8/UTF-8
    + tl_PH.UTF-8/UTF-8
    + uz_UZ.UTF-8/UTF-8
    + yi_US.UTF-8/UTF-8
    + zh_SG.UTF-8/UTF-8

Denis



Reply to: