On Wed, 18 Jun 2008, Andreas Tille wrote:
you get all the problematic descriptions (140kByte) which seem to be all non-ASCII character containing texts. Even in untranslated descriptions does this problem happen - just grep for "(lang='en')" in the output above. The strange thing is that genshi.Markup tries unicode(string) which fails - but it should not, because all the descriptions are properly formated UTF-8 strings.
unicode(repr(string))
solves the problem (many thanks to Enrico Zini).
The output has some other strange formatting issues, but I'll look after this...
Kind regards
Andreas.
--
http://fam-tille.de