[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#995392: ghostscript: ps2pdf trashes some characters



Control: retitle -1 ghostscript: pdfwrite incorrectly deals with embedded ToUnicode CMap
Control: found -1 9.27~dfsg-2+deb10u4

This Debian bug actually covers several similar Ghostscript bugs.
I consider the most general bug given by the testcase below,
which is still not fixed upstream.

On 2021-11-03 05:04:36 +0100, Vincent Lefevre wrote:
> \documentclass{article}
> \usepackage[T1]{fontenc}
> \usepackage{lmodern}
> \pdfglyphtounicode{Scaron}{0160}
> \pdfgentounicode=1
> \begin{document}
> \thispagestyle{empty}
> 'ê
> \end{document}

This testcase shows that this bug is not new and could have always
been present in Ghostscript. The fact is that I had never used
\pdfglyphtounicode{Scaron}{0160} before (I don't need it), and
I noticed this bug only due to TeX Live 2021, which now uses this
mapping (among others), unless all mappings are disabled by the
user with an explicit \pdfgentounicode=0.

This remaining bug (the other ones being recently fixed upstream)
might be specific to this mapping. A partial cause may be that
the ' character is transformed to a /quoteright, which leads to
a /Differences that confuses Ghostscript when generating the
ToUnicode CMap.

-- 
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)


Reply to: