
Bug#227273: www.debian.org: Japanese DDTP files are provided with EUC-JP encoding.



On Mon, Jan 26, 2004 at 03:58:22PM +0900, Tomohiro KUBOTA wrote:
> I checked http://packages.debian.org/unstable/misc/language-env.ja.html
> and found that closing escape sequences are missing.
>
[...explanation of iso-2022-jp]
> 
> I said closing escape sequences are missing.  This means the "here starts
> ASCII" part is missing.  Thus, all of the following ASCII characters
> (including HTML tags) are regarded as Japanese and cause mojibake.

> I don't know what algorithm is used for generating the page, so I have
> no idea why this page is broken.

I used the Perl module Text::Iconv, which itself uses iconv(3).
Either this module is broken or I am too dumb to use it. If I convert the
raw Japanese Packages file with iconv(1) (which probably uses iconv(3),
too), all escape sequences seem to be generated correctly; if I use
Text::Iconv->convert, only the very first one is.
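
To make the symptom concrete, here is a minimal sketch of a check for the
missing "here starts ASCII" escape sequence. It is written in Python rather
than Perl purely for illustration; the helper name ends_in_ascii and the
sample string are assumptions, not part of the page generator. ISO-2022-JP
switches to ASCII with the three bytes ESC ( B, so a byte string whose last
escape sequence is anything else still ends inside a Japanese run.

```python
# Sketch only: the helper name and sample text are illustrative assumptions.
ESC = b"\x1b"
TO_ASCII = ESC + b"(B"  # ISO-2022-JP escape sequence that selects ASCII


def ends_in_ascii(data: bytes) -> bool:
    """Return True if the last escape sequence in data selects ASCII.

    ESC (0x1B) never occurs inside a two-byte JIS X 0208 character, so the
    last ESC byte always marks the most recent character-set switch.
    """
    last = data.rfind(ESC)
    if last == -1:
        return True  # no escape sequence at all: the text never left ASCII
    return data[last:last + 3] == TO_ASCII


# Python's own codec closes the Japanese run when a string is encoded whole.
closed = "日本語".encode("iso2022_jp")

# Simulate the broken page by cutting off the closing escape sequence.
unclosed = closed[:-len(TO_ASCII)]
```

Running ends_in_ascii over each converted description would show which ones
leave the following HTML in two-byte mode. One possible cause, which this
thread does not confirm, is that iconv(3) keeps the shift state in its
conversion descriptor and only writes the return-to-ASCII sequence when it
meets an ASCII character or when the descriptor is flushed.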

I will either try another Perl module or just preprocess the Packages
file...
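
The preprocessing idea could look roughly like the sketch below, again in
Python for illustration only. The function name euc_jp_to_iso2022_jp and the
sample line are assumptions; the real Packages file and the real generator
are not shown in this thread. Each line is converted on its own, so every
line that contains Japanese text is closed with ESC ( B before the newline.

```python
# Sketch only: function name and sample data are illustrative assumptions.
def euc_jp_to_iso2022_jp(raw: bytes) -> bytes:
    """Convert EUC-JP bytes to ISO-2022-JP, one line at a time."""
    # Splitting the raw bytes on 0x0A is safe: every byte of a multi-byte
    # EUC-JP character has its high bit set, so 0x0A is always a newline.
    lines = raw.split(b"\n")
    # str.encode() finalises the encoder on every call, so each line is
    # encoded independently and ends back in ASCII.
    return b"\n".join(
        line.decode("euc_jp").encode("iso2022_jp") for line in lines
    )


sample = "Description-ja: 日本語環境の設定\n".encode("euc_jp")
converted = euc_jp_to_iso2022_jp(sample)
```

Converting line by line costs a few extra escape bytes, but it keeps every
line self-contained, so a line can be placed between HTML tags without
carrying shift state into the markup.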

Gruesse,
-- 
Frank Lichtenheld <djpig@debian.org>
www: http://www.djpig.de/


