--- Begin Message ---
- To: Debian Bug Tracking System <submit@bugs.debian.org>
- Subject: copyright-format: encoding issues
- From: Jakub Wilk <jwilk@debian.org>
- Date: Wed, 13 Jul 2011 22:29:07 +0200
- Message-id: <20110713202907.GA7478@jwilk.net>
Package: debian-policy
Version: 3.9.2.0
Severity: minor
The section sign (§) is incorrectly encoded, both in HTML and text
version of copyright-format:
$ zgrep 'The Debian Policy (' copyright-format.{html,txt.gz}
copyright-format.html: <p>The Debian Policy (§<a href=
copyright-format.txt.gz: The Debian Policy (AS:12.5) demands that each package is accompanied by a
--
Jakub Wilk
--- End Message ---
--- Begin Message ---
- To: 633800-done@bugs.debian.org
- Subject: Re: Bug#633800: copyright-format: encoding issues
- From: Charles Plessy <plessy@debian.org>
- Date: Mon, 21 Nov 2011 09:27:14 +0900
- Message-id: <20111121002714.GC17961@merveille.plessy.net>
- In-reply-to: <20110713235743.GB15314@merveille.plessy.net>
- References: <20110713202907.GA7478@jwilk.net> <20110713235743.GB15314@merveille.plessy.net>
> Le Wed, Jul 13, 2011 at 10:29:07PM +0200, Jakub Wilk a écrit :
> >
> > The section sign (§) is incorrectly encoded, both in HTML and text
> > version of copyright-format:
> >
> > $ zgrep 'The Debian Policy (' copyright-format.{html,txt.gz}
> > copyright-format.html: <p>The Debian Policy (§<a href=
> > copyright-format.txt.gz: The Debian Policy (AS:12.5) demands that each package is accompanied by a
Le Thu, Jul 14, 2011 at 08:57:43AM +0900, Charles Plessy a écrit :
> I think that this is a bug in the conversion to HTML, as copyright-format.xml
> is correctly encoded in UTF-8, and copyright-format.txt is derived from
> copyright-format.html.
…
> I chose jade for the conversion of DocBook XML because it was already used for
> the other documents in this package, but I would not mind switching to XSL
> stylesheets and xsltproc. Results are similar; the most visible difference is
> the absence of horizontal rulers in the XHTML pages produced by the XSL
> stylesheets.
Dear all,
The problematic section sign (§) was removed as a side effect of #640735. I
therefore close this bug.
There remains the problem that the conversion to HTML and text is not
Unicode-safe, but this is a common problem for the documents of the
debian-policy converted using jade: copyright-format and debconf_spec.
If there is interset, I can make a proposition based on xsltproc, in a new
bug entry, out of the scope of DEP 5.
Have a nice day,
--
Charles Plessy
Debian Med packaging team,
http://www.debian.org/devel/debian-med
Tsurumi, Kanagawa, Japan
--- End Message ---