[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#175064: Patch for UTF-8 build



tags 175064 patch
thanks

Hi,

Current build use LANG=C which should work since LaTeX is forced to use
C locale.

I think older debiandoc2* used to use character conversion to high
bit latin-1 character for © using latin-1 if locale is not
specified as -l option.  

Under non latin-1, it shows up in
funny character.  HEX 1A or Decimal 169 is  
 © = (C) under latin-1,7,8,9,13,14,15 and 
 Š = S with v on top under latin-2,4 .
So Josip's reply makes sense.

Also (C) encoded values are
 UTF-8: 0xC2 0xA9
So it can A9 only does not work under UTF-8

So attached patch should work to build proper UTF-8 (Instead of ASCII
only) pages.

I am not pushing this hard for squeeze since we are deep freeze but if
someone wants it, please test it and use it.

Osamu

diff --git a/Makefile b/Makefile
index 9ab6801..8767276 100644
--- a/Makefile
+++ b/Makefile
@@ -18,10 +18,10 @@ perl-policy.sgml: version.ent
 	nsgmls -wall -gues $<
 
 %.html/index.html: %.sgml
-	LANG=C debiandoc2html $<
+	debiandoc2html -l en.UTF-8 $<
 
 %-1.html: %.sgml
-	LANG=C debiandoc2html -1 -b $*-1d $< && \
+	debiandoc2html -l en.UTF-8 -1 -b $*-1d $< && \
         mv $*-1d.html/index.html $*-1.html && \
         rmdir $*-1d.html
 
@@ -29,19 +29,19 @@ perl-policy.sgml: version.ent
 	tar -czf $(<:/index.html=.tar.gz) $(<:/index.html=)
 
 %.txt: %.sgml
-	LANG=C debiandoc2text $<
+	debiandoc2text -l en.UTF-8 $<
 
 %.txt.gz: %.txt
 	gzip -cf9 $< > $@
 
 %.ps: %.sgml
-	LANG=C debiandoc2latexps $<
+	debiandoc2latexps -l en.UTF-8 $<
 
 %.ps.gz: %.ps
 	gzip -cf9 $< > $@
 
 %.pdf: %.sgml
-	LANG=C debiandoc2latexpdf $<
+	debiandoc2latexpdf -l en.UTF-8 $<
 
 %.pdf.gz: %.pdf
 	gzip -cf9 $< > $@

Reply to: