[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: New version of DPNhtml2mail.pl [was: Possible bug when generating mail version]



Dear all,

Here is an another type of bug on DPNhtml2mail.pl when generating a
Japanese version of DPN. This comes out because of ad-hoc handling
of UTF-8 flagged data. I have attached a debugged DPNhtml2mail.pl to
this mail. please check it.

The following is a complete output of abnormally terminated case.

$ perl ./DPNhtml2mail.pl -i 2012/09 -l ja
------------------------------------------------------------------------
The Debian Project                                http://www.debian.org/
Debian プロジェクトニュース  debian-publicity@lists.debian.org
2012 年 4 月 30 日               http://www.debian.org/News/weekly/2012/09/
------------------------------------------------------------------------

今年 9 号目の DPN、Debian コミュニティの会報、にようこそ。この号で取り上げられている話題は:

  * 自分の "クラウド" を Debian "Wheezy" 上に展開
  * ニカラグアの Debian ツアー
  * ESRF における Debian
  * DPL からの一言
  * インタビュー
  * その他のニュース
  * 今後のイベント
  * 新しい Debian 貢献者
  * 次期リリースに関するリリースクリティカルバグの統計
  * 重要な Debian セキュリティ勧告
  * 新規の注目パッケージ
  * 作業が必要なパッケージ
  * これからも DPN を読みたいですか?


Cannot decode string with wide characters at /usr/local/lib/perl/5.10.1/Encode.pm line 176.

Adding a debugging of the bug, I also made small changes as listed below.

1. passing UTF-8 flagged data when create HTML::TokeParser::Simple
   object.

2. link address generation based on URI object.

3. alternatives to length function, columns function calculate visual
   width of full and/or half-size character mixed string in the unit
   of half-size character.

4. alternatives to Text::Wrap::wrap function, folding lines based on
   Unicode Line Breaking Algorithm (Text::LineFold::fold) which is
   also applicable to Japanese unlike Text::Wrap::wrap.

Regards,
AYANOKOUZI, Ryuunosuke
-- 
AYANOKOUZI, Ryuunosuke <i38w7i3@yahoo.co.jp>

Attachment: DPNhtml2mail.pl
Description: Binary data

Attachment: pgpCPHexpUCek.pgp
Description: PGP signature


Reply to: