[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#998458: ghostscript: pdfwrite emits incorrect ToUnicode CMap entries



On 2021-11-04 16:18:17 +0100, Vincent Lefevre wrote:
> Ghostscript, e.g. via the ps2pdf wrapper, emits incorrect
> ToUnicode CMap entries, making text non-searchable, partly
> unreadable via pdftotext, and affecting copy-paste too.
> 
> Testcase chartest1.pdf attached. It was generated with TeX Live 2021
> on the following LaTeX source:
> 
> \documentclass[12pt]{article}
> \usepackage[T1]{fontenc}
> \begin{document}
> \thispagestyle{empty}
> Test: float.
> \end{document}
> 
> This chartest1.pdf file contains the text "Test: float.". But when
> converted with ps2pdf (or gs directly), one gets: "Test: ŕoat.".

The following ghostscript versions all behave correctly:
  ghostscript 9.27~dfsg-2+deb10u4 (Debian 10.11 (buster))
  ghostscript 9.53.3~dfsg-7+deb11u1 (stable-security)
  ghostscript 9.53.3~dfsg-8

-- 
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)


Reply to: