* Tomohiro KUBOTA <debian@tmail.plala.or.jp> [2004-05-18 17:59]: > Alfie is kind enough to run Debian web page validation pages. > http://people.debian.org/~alfie/validate/ Yes, thanks again to Denis for the script so far. Much kudos! > However, a validation page for Japanese > http://people.debian.org/~alfie/validate/ja Oh, thanks for the notice. I wonder why this only happens for the Japanese file, though. Fixed that reference now, too. > seems to have a problem for a few weeks, like: Not that long -- I just run the script since a week or such. :) > Pages for other languages don't have this problem. Alfie or > someone, could you please fix this situation? Fixed. There is a different thing with the japanese files though: /org/www.debian.org/www/devel/debian-med/News/2002/index.ja.html /org/www.debian.org/www/international/l10n/templates/errors-by-maint.ja.html /org/www.debian.org/www/international/l10n/templates/errors-by-pkg.ja.html /org/www.debian.org/www/mirror/sponsors.ja.html These four files failed to get checked, because iconv can't convert them to utf8. This is mainly because of including different encodings in one file. I have tried to fix the debian-med page; although it wasn't compiled new again. Once it gets regenerated that page should go away. $> echo -n $file; iconv -f iso-2022-jp -t utf-8 $file # pseudo commandline errors-by-maint.ja.html iconv: illegal input sequence at position 9974 errors-by-pkg.ja.html iconv: illegal input sequence at position 8604 sponsors.ja.html iconv: illegal input sequence at position 13580 Will try to track those down as good as I am able, I wonder though why this doesn't fail for any other language.... I can just guess that all the other languages are iso-8859-something and the wrong byes are in that encoding, too. About the sponsors page: Fixed that, but haven't commited the fix yet: Please find attached a patch for Mirrors.masterlist that changes the remaining 8bit-characters to entities. There are already other entities in that very file so that shouldn't be any problem, it is now: english/mirror/Mirrors.masterlist: ASCII English text I will commit that change tomorrow if noone objects, according to the selfhtml reference I have chosen the correct entities. So, basically there are only errors-by-maint and errors-by-pkg left to take a look at, and those have the problem of the not-defined encoding for the Maintainer: control field.... There is a "should" regarding utf8 in the policy flying around, but that goes a) for the changelog and b) is /only/ a should. So I guess we are out of luck there. So long, Alfie P.S.: While writing this I've started validate by hand, the ja file is now down to 1k. :) -- "It's today!" said Piglet. "My favorite day," said Pooh.
Index: english/mirror/Mirrors.masterlist =================================================================== RCS file: /cvs/webwml/webwml/english/mirror/Mirrors.masterlist,v retrieving revision 1.548 diff -r1.548 Mirrors.masterlist 855c855 < Maintainer: Aurélien Beaujean <aure@debian.org>, ftpmaint@proxad.net --- > Maintainer: Aurélien Beaujean <aure@debian.org>, ftpmaint@proxad.net 1977c1977 < Location: São Paulo --- > Location: São Paulo 4242c4242 < Maintainer: Francesc Aulí <francesc.auli@uab.es> --- > Maintainer: Francesc Aulí <francesc.auli@uab.es> 4276c4276 < Maintainer: Rasmus Brøgger <Rasmus.Brogger@uni-c.dk> --- > Maintainer: Rasmus Brøgger <Rasmus.Brogger@uni-c.dk> 4291c4291 < Maintainer: Francesc Genové Muñoz <francesc@genove.info> --- > Maintainer: Francesc Genové Muñoz <francesc@genove.info> 4385c4385 < Location: Maceió, Alagoas --- > Location: Maceió, Alagoas 4473c4473 < Maintainer: Benjamin Indermühle <mirrormaster@solnet.ch> --- > Maintainer: Benjamin Indermühle <mirrormaster@solnet.ch> 4490c4490 < Maintainer: Benjamin Indermühle <mirrormaster@solnet.ch> --- > Maintainer: Benjamin Indermühle <mirrormaster@solnet.ch> 4566,4567c4566,4567 < Location: Université de Sherbrooke, Québec < Sponsor: Université de Sherbrooke http://www.usherbrooke.ca/ --- > Location: Université de Sherbrooke, Québec > Sponsor: Université de Sherbrooke http://www.usherbrooke.ca/ 4641c4641 < Maintainer: Juan Andrés Mussini <juan@nrc.pucpr.br> --- > Maintainer: Juan Andrés Mussini <juan@nrc.pucpr.br> 4683c4683 < Maintainer: Tom Rüger <Tom.Rueger@uni-bayreuth.de> --- > Maintainer: Tom Rüger <Tom.Rueger@uni-bayreuth.de> 4697c4697 < Location: Jönköping --- > Location: Jönköping 4699c4699 < Sponsor: Jönköping university computer society http://www.hj.se/ --- > Sponsor: Jönköping university computer society http://www.hj.se/ 4862c4862 < Sponsor: Institut für Informatik, Humboldt University Berlin http://www.informatik.hu-berlin.de/ --- > Sponsor: Institut für Informatik, Humboldt University Berlin http://www.informatik.hu-berlin.de/ 5065c5065 < Maintainer: Eberhard Mönkeberg <emoenke@gwdg.de> --- > Maintainer: Eberhard Mönkeberg <emoenke@gwdg.de> 5067c5067 < Location: Göttingen --- > Location: Göttingen
Attachment:
signature.asc
Description: Digital signature