Dear all,

Here is another type of bug in DPNhtml2mail.pl that appears when generating
the Japanese version of DPN. It is caused by ad-hoc handling of UTF-8 flagged
data. I have attached a fixed DPNhtml2mail.pl to this mail. Please check it.

The following is the complete output of a run that terminated abnormally.

$ perl ./DPNhtml2mail.pl -i 2012/09 -l ja
------------------------------------------------------------------------
The Debian Project  http://www.debian.org/
Debian プロジェクトニュース  debian-publicity@lists.debian.org
2012 年 4 月 30 日  http://www.debian.org/News/weekly/2012/09/
------------------------------------------------------------------------

今年 9 号目の DPN、Debian コミュニティの会報、にようこそ。この号で取り上げられている話題は:

  * 自分の "クラウド" を Debian "Wheezy" 上に展開
  * ニカラグアの Debian ツアー
  * ESRF における Debian
  * DPL からの一言
  * インタビュー
  * その他のニュース
  * 今後のイベント
  * 新しい Debian 貢献者
  * 次期リリースに関するリリースクリティカルバグの統計
  * 重要な Debian セキュリティ勧告
  * 新規の注目パッケージ
  * 作業が必要なパッケージ
  * これからも DPN を読みたいですか?

Cannot decode string with wide characters at /usr/local/lib/perl/5.10.1/Encode.pm line 176.

Besides fixing this bug, I also made the small changes listed below.

1. Pass UTF-8 flagged data when creating the HTML::TokeParser::Simple
   object.
2. Generate link addresses from URI objects.
3. Replace the length function with a columns function, which calculates
   the visual width of a string that mixes full-width and half-width
   characters, in units of half-width characters.
4. Replace the Text::Wrap::wrap function with Text::LineFold::fold, which
   folds lines according to the Unicode Line Breaking Algorithm and,
   unlike Text::Wrap::wrap, also works for Japanese.

Regards,
AYANOKOUZI, Ryuunosuke

-- 
AYANOKOUZI, Ryuunosuke <i38w7i3@yahoo.co.jp>
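
P.S. For readers who have not run into this error before, here is a minimal
sketch of the failure mode. It is not taken from DPNhtml2mail.pl. It assumes
the error comes from calling Encode::decode on a string that has already been
decoded, which is the usual cause of this message. The helper to_characters
is a hypothetical name used only for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode is_utf8);

# Hypothetical helper (not from DPNhtml2mail.pl): decode only when the
# scalar still holds raw bytes, so already-decoded text passes through.
sub to_characters {
    my ($text) = @_;
    return is_utf8($text) ? $text : decode('UTF-8', $text);
}

# Raw UTF-8 bytes for the four-character word "ニュース".
my $bytes = "\xe3\x83\x8b\xe3\x83\xa5\xe3\x83\xbc\xe3\x82\xb9";

# First decode: bytes become characters and the UTF-8 flag is set.
my $chars = decode('UTF-8', $bytes);

# Second decode of the same data: Encode refuses, because the input
# contains characters above 0xFF and so cannot be a byte string.
my $ok = eval { decode('UTF-8', $chars); 1 };
print "second decode failed: $@" unless $ok;

# The guarded helper accepts either form and returns characters.
print length(to_characters($bytes)), "\n";   # 4
print length(to_characters($chars)), "\n";   # 4
```

Checking the flag with is_utf8 is only a pragmatic guard. Decoding exactly
once where the data is read, and encoding once where it is written, avoids
the need for such a check.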
Attachment:
DPNhtml2mail.pl
Description: Binary data