[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#295109: tex4ht: openoffice export error with manuscript style



Package: tex4ht
Version: 20030426-3
Severity: normal


Incorrect encoding of \oe{} in htlatex openoffice export when tex package
"manuscipt" is used. The following example generates an incorrect encoding
for \oe{} which prevents OO.o from loading the resulting "content.xml" file.

manuscript.sty posted on comp.text.tex 23 march 1998, unable to find a more 
recent version (the one on CTAN failed to generate the proper fixed-font, 
doublespaced output on regular tex to dvi processing)

>>> manuscript.sty header
%%% test version for a style producing manuscript-like output.
%%% Stefan Ulrich <ulrich@cis.uni-muenchen.de>, with
%%% suggestions by Rowland <rebecca@astrid.dog.u-net.com, no animal>
%%% and modifications by Steve Kilbane <steve@whitecrow.demon.co.uk>
>>> end manuscript.sty header


Simple test case tex document:
>>> testcase.tex
\documentclass{book}
\usepackage[submit]{manuscript}

\title{Test case for oe ligature}

\usepackage[T1]{fontenc}
\usepackage[latin1]{inputenc}
\usepackage[frenchb]{babel}
\FrenchLayout

\begin{document}

Rodrigue, as-tu du c\oe{}ur ?\\
Tout autre que mon père l'éprouverait sur l'heure

\end{document}
>>> end testcase.tex


Command issued:
htlatex testcase.tex "xhtml,ooffice" "ooffice/! -cmozhtf" "-coo"


With manuscript package used as above, the \oe{} is incorrectly encoded
as a single char (line 24):

>>> resulting content.xml subdocument, line numbers added
01:<?xml version="1.0" encoding="UTF-8"?> 
02:<!DOCTYPE office:document-content PUBLIC "-//OpenOffice.org//DTD OfficeDocument 1.0//EN" "office.dtd"> 
03:<office:document-content  
04:xmlns:chart="http://openoffice.org/2000/chart";  
05:xmlns:dr3d="http://openoffice.org/2000/dr3d";  
06:xmlns:draw="http://openoffice.org/2000/drawing";  
07:xmlns:form="http://openoffice.org/2000/form";  
08:xmlns:number="http://openoffice.org/2000/datastyle";  
09:xmlns:office="http://openoffice.org/2000/office";  
10:xmlns:script="http://openoffice.org/2000/script";  
11:xmlns:style="http://openoffice.org/2000/style";  
12:xmlns:table="http://openoffice.org/2000/table";  
13:xmlns:text="http://openoffice.org/2000/text";  
14:xmlns:fo="http://www.w3.org/1999/XSL/Format";  
15:xmlns:math="http://www.w3.org/1998/Math/MathML";  
16:xmlns:svg="http://www.w3.org/2000/svg";  
17:xmlns:xlink="http://www.w3.org/1999/xlink";  
18:office:class="text" office:version="1.0" > 
19:<office:automatic-styles> <style:style style:name="tabular" style:family="table"> 
20:<style:properties style:rel-width="76%" fo:margin-left="1cm" fo:margin-right="1cm" table:align="margins"/> 
21:</style:style> 
22: 
23:</office:automatic-styles>     <office:body>
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du c÷ur?
25:<text:line-break/>Tout autre que mon p&#x00E8;re l'&#x00E9;prouverait sur l'heure
26:   </text:p> 
27:</office:body> 
28:</office:document-content>
>>> end content.xml subdocument

Document fails to load in OO.o 1.1.4 with the following error:
"read error - format error in subdocument content.xml, position 24,69"


The encoding is correct when manuscript package is not used:

>>> resulting content.xml subdocument, excerpt
24:<!--l. 13--><text:p text:style-name="p-noindent">Rodrigue, as-tu du c&#x0153;ur ?
>>> end content.xml subdocument excerpt



-- System Information:
Debian Release: testing/unstable
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)
Kernel: Linux 2.6.6
Locale: LANG=fr_FR@euro, LC_CTYPE=fr_FR@euro (charmap=ISO-8859-15) (ignored: LC_ALL set to fr_FR@euro)

Versions of packages tex4ht depends on:
ii  libc6                       2.3.2.ds1-20 GNU C Library: Shared libraries an
ii  libkpathsea3                2.0.2-26     path search library for teTeX (run
ii  tetex-bin                   2.0.2-26     The teTeX binary files

-- no debconf information



Reply to: