
Bug#998458: ghostscript: pdfwrite emits incorrect ToUnicode CMap entries



Package: ghostscript
Version: 9.55.0~~rc1~dfsg-1
Severity: normal
Tags: upstream fixed-upstream
Forwarded: https://bugs.ghostscript.com/show_bug.cgi?id=704478

The pdfwrite device of Ghostscript (e.g. via the ps2pdf wrapper)
emits incorrect ToUnicode CMap entries. As a result, text becomes
non-searchable, pdftotext output is partly unreadable, and
copy-paste is affected too.
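
For context, a ToUnicode CMap maps character codes of a font to
Unicode values, typically in "beginbfchar ... endbfchar" sections,
and text extractors rely on it. The Python sketch below parses such
a section. The SAMPLE data and the name parse_bfchar are made up for
illustration; the code values are hypothetical and were not taken
from the attached file or from Ghostscript output.

```python
import re

# Matches the body of each "beginbfchar ... endbfchar" section.
BFCHAR_SECTION = re.compile(r"beginbfchar(.*?)endbfchar", re.S)
# Matches one "<source code> <UTF-16BE destination>" pair.
BFCHAR_PAIR = re.compile(r"<([0-9A-Fa-f]+)>\s*<([0-9A-Fa-f]+)>")

# Hypothetical CMap fragment (illustration only, not real gs output).
SAMPLE = """
2 beginbfchar
<1D> <FB02>
<54> <0054>
endbfchar
"""


def parse_bfchar(cmap: str) -> dict:
    """Return {character code: Unicode string} for all bfchar pairs."""
    mapping = {}
    for section in BFCHAR_SECTION.findall(cmap):
        for src, dst in BFCHAR_PAIR.findall(section):
            # Destination values are UTF-16BE encoded hex strings.
            mapping[int(src, 16)] = bytes.fromhex(dst).decode("utf-16-be")
    return mapping
```

If an entry of this kind carries the wrong destination value, the
page still renders correctly, but search, pdftotext and copy-paste
return the wrong character.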

Testcase chartest1.pdf attached. It was generated with TeX Live 2021
from the following LaTeX source:

\documentclass[12pt]{article}
\usepackage[T1]{fontenc}
\begin{document}
\thispagestyle{empty}
Test: float.
\end{document}

This chartest1.pdf file contains the text "Test: float.". But after
conversion with ps2pdf (or gs directly), the extracted text
becomes: "Test: ŕoat.".
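
The check can be scripted. Below is a minimal Python sketch, assuming
that ps2pdf and pdftotext (from poppler-utils) are in PATH. The names
normalize, roundtrip_text and check are made up for this illustration.
It also assumes that a correct CMap may map the ligature glyph either
to U+FB02 or to the two letters "fl", so both are accepted.

```python
import subprocess
import tempfile
from pathlib import Path

EXPECTED = "Test: float."  # text of chartest1.pdf, per this report


def normalize(text: str) -> str:
    """Expand the fl ligature (U+FB02) and collapse all whitespace,
    including the form feed that pdftotext emits after each page."""
    return " ".join(text.replace("\ufb02", "fl").split())


def roundtrip_text(src: Path) -> str:
    """Convert src with ps2pdf, then extract the text with pdftotext."""
    with tempfile.TemporaryDirectory() as tmp:
        out = Path(tmp) / "out.pdf"
        subprocess.run(["ps2pdf", str(src), str(out)], check=True)
        result = subprocess.run(
            ["pdftotext", str(out), "-"],  # "-" sends the text to stdout
            check=True, capture_output=True, encoding="utf-8",
        )
        return result.stdout


def check(src: Path) -> bool:
    """Return True if the text survives the ps2pdf round trip."""
    return normalize(roundtrip_text(src)) == EXPECTED
```

With an affected version, check(Path("chartest1.pdf")) is expected
to return False, since the extracted text is "Test: ŕoat.".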

This bug was introduced upstream by
  commit 4d91c6ad3e76e19f36d23a50dce253fbbc7d0560 (2020-12-11)
and fixed by
  commit b4e8434defb8e05ea05bb130b92217290efd2fba (2021-10-25)

Note: the fix commit does not resolve all pdfwrite bugs concerning
the ToUnicode CMap. For additional issues, see

  https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=995392

-- System Information:
Debian Release: bookworm/sid
  APT prefers unstable-debug
  APT policy: (500, 'unstable-debug'), (500, 'stable-updates'), (500, 'stable-security'), (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 5.14.0-3-amd64 (SMP w/8 CPU threads)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=POSIX, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages ghostscript depends on:
ii  libc6   2.32-4
ii  libgs9  9.55.0~~rc1~dfsg-1

ghostscript recommends no packages.

Versions of packages ghostscript suggests:
ii  ghostscript-x  9.55.0~~rc1~dfsg-1

-- no debconf information

-- 
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)

Attachment: chartest1.pdf
Description: Adobe PDF document

