Bug#295109: tex4ht: openoffice export error with manuscript style
Package: tex4ht
Version: 20030426-3
Severity: normal
Incorrect encoding of \oe{} in htlatex openoffice export when tex package
"manuscipt" is used. The following example generates an incorrect encoding
for \oe{} which prevents OO.o from loading the resulting "content.xml" file.
manuscript.sty posted on comp.text.tex 23 march 1998, unable to find a more
recent version (the one on CTAN failed to generate the proper fixed-font,
doublespaced output on regular tex to dvi processing)
>>> manuscript.sty header
%%% test version for a style producing manuscript-like output.
%%% Stefan Ulrich <ulrich@cis.uni-muenchen.de>, with
%%% suggestions by Rowland <rebecca@astrid.dog.u-net.com, no animal>
%%% and modifications by Steve Kilbane <steve@whitecrow.demon.co.uk>
>>> end manuscript.sty header
Simple test case tex document:
>>> testcase.tex
\documentclass{book}
\usepackage[submit]{manuscript}
\title{Test case for oe ligature}
\usepackage[T1]{fontenc}
\usepackage[latin1]{inputenc}
\usepackage[frenchb]{babel}
\FrenchLayout
\begin{document}
Rodrigue, as-tu du c\oe{}ur ?\\
Tout autre que mon père l'éprouverait sur l'heure
\end{document}
>>> end testcase.tex
Command issued:
htlatex testcase.tex "xhtml,ooffice" "ooffice/! -cmozhtf" "-coo"
With manuscript package used as above, the \oe{} is incorrectly encoded
as a single char (line 24):
>>> resulting content.xml subdocument, line numbers added
01:<?xml version="1.0" encoding="UTF-8"?>
02:<!DOCTYPE office:document-content PUBLIC "-//OpenOffice.org//DTD OfficeDocument 1.0//EN" "office.dtd">
03:<office:document-content
04:xmlns:chart="http://openoffice.org/2000/chart"
05:xmlns:dr3d="http://openoffice.org/2000/dr3d"
06:xmlns:draw="http://openoffice.org/2000/drawing"
07:xmlns:form="http://openoffice.org/2000/form"
08:xmlns:number="http://openoffice.org/2000/datastyle"
09:xmlns:office="http://openoffice.org/2000/office"
10:xmlns:script="http://openoffice.org/2000/script"
11:xmlns:style="http://openoffice.org/2000/style"
12:xmlns:table="http://openoffice.org/2000/table"
13:xmlns:text="http://openoffice.org/2000/text"
14:xmlns:fo="http://www.w3.org/1999/XSL/Format"
15:xmlns:math="http://www.w3.org/1998/Math/MathML"
16:xmlns:svg="http://www.w3.org/2000/svg"
17:xmlns:xlink="http://www.w3.org/1999/xlink"
18:office:class="text" office:version="1.0" >
19:<office:automatic-styles> <style:style style:name="tabular" style:family="table">
20:<style:properties style:rel-width="76%" fo:margin-left="1cm" fo:margin-right="1cm" table:align="margins"/>
21:</style:style>
22:
23:</office:automatic-styles> <office:body>
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du c÷ur?
25:<text:line-break/>Tout autre que mon père l'éprouverait sur l'heure
26: </text:p>
27:</office:body>
28:</office:document-content>
>>> end content.xml subdocument
Document fails to load in OO.o 1.1.4 with the following error:
"read error - format error in subdocument content.xml, position 24,69"
The encoding is correct when manuscript package is not used:
>>> resulting content.xml subdocument, excerpt
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du cœur ?
>>> end content.xml subdocument excerpt
-- System Information:
Debian Release: testing/unstable
APT prefers unstable
APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.6
Locale: LANG=fr_FR@euro, LC_CTYPE=fr_FR@euro (charmap=ISO-8859-15) (ignored: LC_ALL set to fr_FR@euro)
Versions of packages tex4ht depends on:
ii libc6 2.3.2.ds1-20 GNU C Library: Shared libraries an
ii libkpathsea3 2.0.2-26 path search library for teTeX (run
ii tetex-bin 2.0.2-26 The teTeX binary files
-- no debconf information
Reply to: