[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#174982: [PROPOSAL]: Debian changelogs should be UTF-8 encoded



> | Right now, people are putting whatever random characters they feel like
> | in Debian changelogs; they might be encoded in ISO-8859-1, BIG5,
> | ISO-8859-2, ISO-2022-JP, or who knows what.  This does come up in the
> | real world; I use apt-listchanges, and I fairly often see broken
> | characters in changelogs.  The solution is to define the charset of
> | changelogs as UTF-8.  That way, I can read all the changelogs at once
> | (currently using gnome-terminal) and it will work.
> 
> I think we shouldn't use must just yet, since this will cause a lot of
> packages (you know how many?) to be instantly buggy.  If you change
> the «must» to «should», I'll second the proposal.

Erm.. no, only those packages which comply with the latest 
version of policy, and it is pretty easy to add a  rule to 
change the encoding to utf-8 on the debian/rules install target.

Adding an example would be handy:

  To install changelog from upstream that is coded in ISO-8859-15, 
  the following rules in your debian/rules may help:


clean:
	.
	rm -f debian/ChangeLog-upstream
	.
	.

binary:
	.
	.
	.
  iconv -f iso-8859-15 -t utf-8 < ChangeLog > debian/ChangeLog-upstream
  dh_installchanges debian/ChangeLog-upstream
	.
	.




It may help if debhelper dh_installchangelog had a 
charset conversion option, maybe.



regards,
	junichi



Reply to: