--- Begin Message ---
- To: Debian Bug Tracking System <submit@bugs.debian.org>
- Subject: ghostscript: ps2pdf modifies ASCII text of a PDF file, breaking conversion to text and searching for text (regression)
- From: Vincent Lefevre <vincent@vinc17.net>
- Date: Thu, 5 Oct 2023 17:04:38 +0200
- Message-id: <20231005150438.GA1908781@cventin.lip.ens-lyon.fr>
Package: ghostscript
Version: 10.02.0~dfsg-2
Severity: important
Tags: upstream
Forwarded: https://bugs.ghostscript.com/show_bug.cgi?id=707237
(Note that this is some form of data corruption.)
If I run ps2pdf on the PDF file from
https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3163.pdf
the text
arithmetic. Also, C99’s informative annex G offered [...]
is changed to
arithmetic. !lso, C99’s informative annex G offered [...]
i.e. the letter "A" is changed to the exclamation point "!".
Note that this is a regression: there is no such issue with the
ghostscript 10.0.0~dfsg-11+deb12u1 package under Debian 12 (bookworm).
The above PDF file was generated with Microsoft Word, and an
uncompressed stream shows unusual content. This is where the
bug occurs.
I've reported the bug upstream and put more technical details
there.
-- System Information:
Debian Release: trixie/sid
APT prefers unstable-debug
APT policy: (500, 'unstable-debug'), (500, 'stable-updates'), (500, 'stable-security'), (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 'experimental')
merged-usr: no
Architecture: amd64 (x86_64)
Foreign Architectures: i386
Kernel: Linux 6.5.0-1-amd64 (SMP w/12 CPU threads; PREEMPT)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=POSIX, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled
Versions of packages ghostscript depends on:
ii libc6 2.37-12
ii libgs10 10.02.0~dfsg-2
ghostscript recommends no packages.
ghostscript suggests no packages.
-- no debconf information
--
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
--- End Message ---
--- Begin Message ---
- To: 1053521-done@bugs.debian.org
- Subject: Re: ghostscript: ps2pdf modifies ASCII text of a PDF file, breaking conversion to text and searching for text (regression)
- From: Vincent Lefevre <vincent@vinc17.net>
- Date: Mon, 20 Nov 2023 16:01:27 +0100
- Message-id: <20231120150127.GA312623@cventin.lip.ens-lyon.fr>
- In-reply-to: <20231005150438.GA1908781@cventin.lip.ens-lyon.fr>
- References: <20231005150438.GA1908781@cventin.lip.ens-lyon.fr>
Version: 10.02.1~dfsg-1
On 2023-10-05 17:04:38 +0200, Vincent Lefevre wrote:
> If I run ps2pdf on the PDF file from
>
> https://www.open-std.org/jtc1/sc22/wg14/www/docs/n3163.pdf
>
> the text
>
> arithmetic. Also, C99’s informative annex G offered [...]
>
> is changed to
>
> arithmetic. !lso, C99’s informative annex G offered [...]
>
> i.e. the letter "A" is changed to the exclamation point "!".
Fixed upstream, and I could check that this is also fixed
in 10.02.1~dfsg-1.
--
Vincent Lefèvre <vincent@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
--- End Message ---