Bug#175064: Patch for UTF-8 build
tags 175064 patch
thanks
Hi,
Current build use LANG=C which should work since LaTeX is forced to use
C locale.
I think older debiandoc2* used to use character conversion to high
bit latin-1 character for © using latin-1 if locale is not
specified as -l option.
Under non latin-1, it shows up in
funny character. HEX 1A or Decimal 169 is
© = (C) under latin-1,7,8,9,13,14,15 and
Š = S with v on top under latin-2,4 .
So Josip's reply makes sense.
Also (C) encoded values are
UTF-8: 0xC2 0xA9
So it can A9 only does not work under UTF-8
So attached patch should work to build proper UTF-8 (Instead of ASCII
only) pages.
I am not pushing this hard for squeeze since we are deep freeze but if
someone wants it, please test it and use it.
Osamu
diff --git a/Makefile b/Makefile
index 9ab6801..8767276 100644
--- a/Makefile
+++ b/Makefile
@@ -18,10 +18,10 @@ perl-policy.sgml: version.ent
nsgmls -wall -gues $<
%.html/index.html: %.sgml
- LANG=C debiandoc2html $<
+ debiandoc2html -l en.UTF-8 $<
%-1.html: %.sgml
- LANG=C debiandoc2html -1 -b $*-1d $< && \
+ debiandoc2html -l en.UTF-8 -1 -b $*-1d $< && \
mv $*-1d.html/index.html $*-1.html && \
rmdir $*-1d.html
@@ -29,19 +29,19 @@ perl-policy.sgml: version.ent
tar -czf $(<:/index.html=.tar.gz) $(<:/index.html=)
%.txt: %.sgml
- LANG=C debiandoc2text $<
+ debiandoc2text -l en.UTF-8 $<
%.txt.gz: %.txt
gzip -cf9 $< > $@
%.ps: %.sgml
- LANG=C debiandoc2latexps $<
+ debiandoc2latexps -l en.UTF-8 $<
%.ps.gz: %.ps
gzip -cf9 $< > $@
%.pdf: %.sgml
- LANG=C debiandoc2latexpdf $<
+ debiandoc2latexpdf -l en.UTF-8 $<
%.pdf.gz: %.pdf
gzip -cf9 $< > $@
Reply to: