[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: utf8-migration-tool: [PATCH] remove the codeset and modifier of the current locale

su, 2007-01-07 kello 19:39 +0200, Martin-Éric Racine kirjoitti:
> su, 2007-01-07 kello 18:27 +0100, Nicolas François kirjoitti:
> > This patch removes the codeset and modifier of the current locale before
> > appending .UTF8.
> > 
> > i.e. any of ll_CC, ll_CC.CODESET, ll_CC@modifier and
> > ll_CC.CODESET@modifier will be changed to ll_CC
> > 
> > I tested the regex in a python interpreter, but I did not tested to run
> > utf8migrationtool with the locales mentioned in 405967 and 405968.
> Thanks for the patch! 
> Could everyone concerned by this please apply the patch and test whether
> it fixes the following issues:
> Bug#405878: utf8-migration-tool: crashes on start with message "TypeError: category LC_ALL is not supported"
> Bug#405967: utf8-migration-tool: cannot handle migration from ru_RU.KOI8-R
> Bug#405968: utf8-migration-tool: cannot handle migration from @euro locales

Looking at the patch, this will parse any locale up to the first dot or
arobas it finds and use the part before that to deduct the locale, which
should successfully convert any locale to UTF-8.

The only exception I can think of is cases where the arobas points to a
useful variant. One example is Serbian: the default encoding is Cyrillic
but there is an @latin variant also. There, utf8migrationtool would make
the wrong assumption.

Can anybody think of any other exception case that must be processed?

Martin-Éric Racine

Reply to: