Bug#434056: Inputenc and XeTeX don't work together

To: Frank Küster <frank@debian.org>
Cc: 434056@bugs.debian.org, 434056-forwarded@bugs.debian.org, Atsuhito Kohda <kohda@pm.tokushima-u.ac.jp>, Juliusz Chroboczek <Juliusz.Chroboczek@pps.jussieu.fr>
Subject: Bug#434056: Inputenc and XeTeX don't work together
From: Jonathan Kew <jonathan_kew@sil.org>
Date: Mon, 3 Sep 2007 14:47:15 +0100
Message-id: <[🔎] D0AF3FA5-CF55-4621-B1D2-04A9C1FB2C71@sil.org>
Reply-to: Jonathan Kew <jonathan_kew@sil.org>, 434056@bugs.debian.org
In-reply-to: <87odgnpokn.fsf@riesling.zuerich.kuesterei.ch>
References: <7i1wf2p4qp.fsf@lanthane.pps.jussieu.fr> <20070721.223233.74729896.kohda@pm.tokushima-u.ac.jp> <7i1wezc5bd.fsf@lanthane.pps.jussieu.fr> <20070726.173438.193690988.kohda@pm.tokushima-u.ac.jp> <7i7iomx0yz.fsf@lanthane.pps.jussieu.fr> <87odgnpokn.fsf@riesling.zuerich.kuesterei.ch>

On 31 Aug 2007, at 7:05 pm, Frank Küster wrote:

Hi Jonathan,

here's a bug report we got in the Debian BTS about using inputenc with
XeTeX.  The full conversation is at http://bugs.debian.org/434056, but

the first paragraph cited describes the wish quite well. Theproblem is

described in more detail in the initial messages, but maybe you're
familiar with it.

What's your view on that? TIA for your answer,


Hi Frank,

Yes, I'm familiar with the issue. I normally tell XeTeX users thatthey should not be using [utf8]{inputenc} at all, as the engine readsUTF-8 natively. I've sometimes thought that it would be good for thepackage to recognize when it is loaded under XeTeX, and automaticallydisable itself (perhaps with a warning), as this is a fairly commonmistake for new users.

I seem to recall discussing this with one of the LaTeX team at aconference some time ago (maybe Chris? Morten?), but have notfollowed up on it recently.

A further step would be to also support other input encodings via theinputenc package. This would require changing the \XeTeXinputencodingsetting to map the text to Unicode correctly. Then a legacy-encodedfile that says

  \usepackage[cp1250]{inputenc}
or
  \usepackage[applemac]{inputenc}

(or whatever) could work correctly with Unicode fonts in XeTeX. Butthe utf8 case is the common one, so it would be nice if at least thatone worked transparently.

The correct place to address this issue is in the base LaTeX release;it's not a Debian (or other distro) bug. But in the absence of anupstream fix, you might want to try and come up with a patch -- Ithink it would be helpful to users.

JK

Frank

Juliusz Chroboczek <Juliusz.Chroboczek@pps.jussieu.fr> wrote:

TeX, pdfTeX, Omega or XeTeX, he should be able to say

  \usepackage[utf8]{inputenc}

and the right thing for the current implementation of TeX should
magically happen.

I suspect you can think so because you use a language
in which there is little difference between utf8 and
normal encoding, for CJK (, Arabic, Hindi, ?) I'm afraid
things are not going so magically ;-)


Ehm... no.  XeTeX uses UTF-8 for input, and so does TeX (or e-TeX, or

pdfTeX) with utf8.def. Legacy encodings are completely irrelevantfor

this discussion.

The point is that the four languages I use regularly are all covered
by the small subset of Unicode that works correctly when you say

  \usepackage[utf8]{inputenc}
  \usepackage[T1]{fontenc}

I realise that people for whom that is the case are a minority(there's

probably not much more than 1.2 billion of us in the world). However,
just because it doesn't work for most people doesn't mean it should
stop working for us.

\usepackage{ifxetex}
\ifxetex\else
\usepackage[utf8]{inputenc}
\fi


Yes, i'm currently doing roughly that (but with TeX primitives rather

than the ifxetex package). However I believe that this should behandled

automatically by inputenc.

                                        Juliusz


--
Frank Küster
Debian Developer (teTeX/TeXLive)

Reply to:

Prev by Date: Bug#436729: texlive-fonts-recommended: short description of urw fonts unhelpful in package dsc
Next by Date: Bug#440796: texi2dvi: Suggests to install transitional tetex-bin
Previous by thread: luatex_0.11.0-1_i386.changes is NEW
Next by thread: Bug#440796: texi2dvi: Suggests to install transitional tetex-bin
Index(es):
- Date
- Thread