Bug#241333: Mandate UTF-8 for changelog files
- To: 241333@bugs.debian.org
- Subject: Bug#241333: Mandate UTF-8 for changelog files
- From: Russ Allbery <rra@debian.org>
- Date: Sat, 05 Jul 2008 15:18:31 -0700
- Message-id: <[🔎] 874p74vutk.fsf_-_@windlord.stanford.edu>
- Reply-to: Russ Allbery <rra@debian.org>, 241333@bugs.debian.org
- In-reply-to: <87odisskwe.fsf@windlord.stanford.edu> (Russ Allbery's message of "Wed\, 04 Jul 2007 00\:11\:45 -0700")
- References: <Pine.LNX.4.58.0403311557390.13832@gradall.private.brainfood.com> <87wu50ypow.fsf@glaurung.internal.golden-gryphon.com> <20070526071440.GA28390@zulo.hadrons.org> <87odisskwe.fsf@windlord.stanford.edu>
Changing all changelog files in the archive to UTF-8 is already complete,
so this change will catch Policy up with current reality. Seconds?
diff --git a/policy.sgml b/policy.sgml
index 24c9072..9e7e1fb 100644
--- a/policy.sgml
+++ b/policy.sgml
@@ -1473,10 +1473,6 @@
</p>
<p>
-
- </p>
-
- <p>
The format of the <file>debian/changelog</file> allows the
package building tools to discover which version of the package
is being built and find out other release-specific information.
@@ -1582,6 +1578,10 @@
</p>
<p>
+ The entire changelog must be encoded in UTF-8.
+ </p>
+
+ <p>
For more information on placement of the changelog files
within binary packages, please see <ref id="changelogs">.
</p>
@@ -9822,36 +9822,6 @@ install-info --quiet --remove /usr/share/info/foobar.info
See <ref id="dpkgchangelog">.
</p>
- <p>
- It is recommended that the entire changelog be encoded in the
- <url id="http://www.cis.ohio-state.edu/cgi-bin/rfc/rfc2279.html" name="UTF-8">
- encoding of
- <url id="http://www.unicode.org/"
- name="Unicode">.<footnote>
- <p>
- I think it is fairly obvious that we need to
- eventually transition to UTF-8 for our package
- infrastructure; it is really the only sane char-set in
- an international environment. Now, we can't switch to
- using UTF-8 for package control fields and the like
- until dpkg has better support, but one thing we can
- start doing today is requesting that Debian changelogs
- are UTF-8 encoded. At some point in time, we can start
- requiring them to do so.
- </p>
- <p>
- Checking for non-UTF8 characters in a changelog is
- trivial. Dump the file through
- <example>iconv -f utf-8 -t ucs-4</example>
- discard the output, and check the return
- value. If there are any characters in the stream
- which are invalid UTF-8 sequences, iconv will exit
- with an error code; and this will be the case for the
- vast majority of other character sets.
- </p>
- </footnote>
- </p>
-
<sect2><heading>Defining alternative changelog formats
</heading>
--
Russ Allbery (rra@debian.org) <http://www.eyrie.org/~eagle/>
Reply to: