
Bug#344304: qa.debian.org: no charset specified when browsing news



tags 344304 help
thanks

On Wed, Dec 21, 2005 at 08:01:25PM +0300, Anton Martchukov wrote:
> When browsing news in Package Tracking System, e.g.
> 
> http://packages.qa.debian.org/r/rus-ispell/news/1.html

The URL has since changed to:

http://packages.qa.debian.org/r/rus-ispell/news/20060107T121720Z.html

> there is no charset specified neither in HTTP headers, nor in META tag
> within the web page. So, non-ASCII characters are not displayed
> correctly.

While you are right that there is no charset in the HTTP header or in
<head> (well, now there is: the HTTP header states utf8), there are
actually no fewer than two more issues.

First, the mail in question didn't specify a charset in its Content-Type
header, despite being utf8. It should have done so. This is not in the
scope of QA to fix, though.
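
To illustrate the first point, here is a minimal sketch of how one might
check whether a mail declares a charset, using Python's standard email
module. The helper name declared_charset and both sample messages are
made up for illustration; they are not the mail from this bug.

```python
from email import message_from_bytes


def declared_charset(raw: bytes):
    """Return the charset named in the Content-Type header, or None."""
    msg = message_from_bytes(raw)
    # get_content_charset() returns None when no charset parameter is given.
    return msg.get_content_charset()


# Hypothetical sample: UTF-8 body, but no charset in the header.
raw_without = (
    b"Subject: test\r\n"
    b"Content-Type: text/plain\r\n"
    b"\r\n"
    b"\xc3\x89t\xc3\xa9\r\n"
)

# Hypothetical sample: the same body with the charset declared.
raw_with = (
    b"Subject: test\r\n"
    b"Content-Type: text/plain; charset=utf-8\r\n"
    b"\r\n"
    b"\xc3\x89t\xc3\xa9\r\n"
)

print(declared_charset(raw_without))
print(declared_charset(raw_with))
```

A consumer that sees None here has to guess the encoding, which is where
things tend to go wrong.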

Secondly, mhonarc totally ruins the mail by transforming É into &xC3;.
Despite trying such things as copying the lists.debian.org mhonarc
config, and/or including /usr/share/doc/mhonarc/examples/utf-8-encode.mrc,
I couldn't convince Mhonarc to simply output the source as-is.
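
For what it's worth, the C3 in that output matches the first byte of É
when encoded as utf8. The sketch below only shows the byte-level
arithmetic; that mhonarc is treating each byte as a separate character is
an assumption on my part, not something confirmed here.

```python
# "É" (U+00C9) written as an escape so the example does not depend on
# the encoding of this file.
e_acute = "\u00c9"

# In UTF-8 the character occupies two bytes: 0xC3 0x89.
utf8_bytes = e_acute.encode("utf-8")
print(utf8_bytes.hex())

# Reading those bytes as single-byte Latin-1 characters (the assumed
# failure mode) yields two characters, the first being U+00C3.
misread = utf8_bytes.decode("latin-1")
print([hex(ord(ch)) for ch in misread])
```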

If someone could contribute a Mhonarc config that does the right thing,
that'd be greatly appreciated. The current config contains nothing
interesting beyond a few headers to ignore. The new mhonarc config should
do charset conversion if possible, or just output the text as-is when
the charset of the mail is utf8 or unknown.
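
To make the wanted behaviour concrete, here is a rough sketch of the
decision logic in Python. It is not a Mhonarc config and says nothing
about how Mhonarc would implement this; the function name to_utf8 is
hypothetical, and falling back to pass-through when conversion fails is
my own assumption about what "if possible" should mean.

```python
from typing import Optional


def to_utf8(payload: bytes, charset: Optional[str]) -> bytes:
    """Return the body as UTF-8 bytes, converting only when we safely can."""
    # Unknown charset: output the text as-is.
    if charset is None:
        return payload

    name = charset.strip().lower()

    # Already utf8: output the text as-is.
    if name in ("utf-8", "utf8"):
        return payload

    # Known charset: convert if possible.
    try:
        return payload.decode(name).encode("utf-8")
    except (LookupError, UnicodeDecodeError):
        # Unrecognised charset name or undecodable bytes: pass through
        # (assumed fallback, see above).
        return payload


# Hypothetical usage: a Latin-1 "É" becomes the two-byte UTF-8 form.
print(to_utf8(b"\xc9", "iso-8859-1"))
```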

--Jeroen

-- 
Jeroen van Wolffelaar
jeroen@wolffelaar.nl
http://jeroen.A-Eskwadraat.nl


