[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#175064: Debian policy documents should be UTF-8 encoded



On Tue, Jan 07, 2003 at 03:38:43PM -0800, Osamu Aoki wrote:
> > > I think it may be best if this encoding changes are done at the same
> > > time when this documentation is moved to docbook-xml format.
> > 
> > I'm entirely not convinced that we should go beyond ASCII because a program
> > can't be taught to convert © into (C). It's just that, nothing else.
> 
> I think we may be discussing slightly different things.  I do not
> understand what you ment by the above message.  They are taught to do
> so, I think.
> 
> Current debiasndoc-sgml creates copyright section starting with
> 
>   HTML:       Latain-1 code for ©  (1 character)
>   PDF:        Latain-1 code for ©  (1 character)
>   Plain text: ASCII 3 characters (C)
> 
> That was today's build of my documment results.  The source are like:
> 
>   <copyrightsummary>
>   Copyright &copy; 2001&ndash;2002 by Osamu Aoki &lt;&osamu;&gt;.
>   </copyrightsummary>
> 
> The plain text does not go beyond ASCII.  Neither UTF-8 nor ISO-8859-1.

% zgrep Š /usr/share/doc/debian-policy/policy.txt.gz
     Copyright Š 1996,1997,1998 Ian Jackson and Christian Schwarz.

How do you explain that? Honest question :)

-- 
     2. That which causes joy or happiness.



Reply to: