Bug#998458: ghostscript: pdfwrite emits incorrect ToUnicode CMap entries
On 2021-11-04 16:18:17 +0100, Vincent Lefevre wrote:
> Ghostscript, e.g. via the ps2pdf wrapper, emits incorrect
> ToUnicode CMap entries, making text non-searchable, partly
> unreadable via pdftotext, and affecting copy-paste too.
>
> Testcase chartest1.pdf attached. It was generated with TeX Live 2021
> on the following LaTeX source:
>
> \documentclass[12pt]{article}
> \usepackage[T1]{fontenc}
> \begin{document}
> \thispagestyle{empty}
> Test: float.
> \end{document}
>
> This chartest1.pdf file contains the text "Test: float.". But when
> converted with ps2pdf (or gs directly), one gets: "Test: ŕoat.".
The following ghostscript versions all behave correctly:
ghostscript 9.27~dfsg-2+deb10u4 (Debian 10.11 (buster))
ghostscript 9.53.3~dfsg-7+deb11u1 (stable-security)
ghostscript 9.53.3~dfsg-8
--
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
Reply to: