Bug#928738: printer-driver-cups-pdf Still Produces PDF Files that Lack Searchable Text and are Unusable with pdftotext
Brian Potkin wrote:
> Neil Ormos wrote:
>> Package: printer-driver-cups-pdf
>> Version: 3.0.1-5
>> Severity: important
>> Dear Maintainer,
>> In prior bug reports, users complained that
>> CUPS-PDF or printer-driver-cups-pdf produced
>> PDF files in which text was represented in
>> image format, or was not searchable. [...]
>> I have installed
>> printer-driver-cups-pdf_3.0.1-5 (the current
>> version distributed in Buster), and it appears
>> that *whatever* is stored in the PDF files
>> produced by printer-driver-cups-pdf, it's not
>> searchable text. Also, when these files are
>> processed by pdftotext, the results do not
>> contain recognizable text. [...]
> Thank you for your report, Neil.
Hi Brian:
> Please post the output of 'lpoptions -p PDF'
############################################################
copies=1 device-uri=cups-pdf:/ finishings=3 job-cancel-after=10800 job-hold-until=no-hold job-priority=50 job-sheets=none,none marker-change-time=0 number-up=1 printer-commands=AutoConfigure,Clean,PrintSelfTestPage printer-info=PDF printer-is-accepting-jobs=true printer-is-shared=false printer-location printer-make-and-model='Generic CUPS-PDF Printer' printer-state=3 printer-state-change-time=1557509283 printer-state-reasons=none printer-type=10678348 printer-uri-supported=ipp://localhost/printers/PDF
############################################################
> and, for a printed HTML
> page, what 'pdfinfo <PDF_file> gives.
Results of pdfinfo on the file produced on a system running Stretch:
############################################################
Title: (file:///home/uuu/zzz-scratch-0510-2/foo1.html)
Author: (uuu)
Creator: GPL Ghostscript 926 (ps2write)
Producer: GPL Ghostscript 9.26
CreationDate: Fri May 10 12:28:03 2019 CDT
ModDate: Fri May 10 12:28:03 2019 CDT
Tagged: no
UserProperties: no
Suspects: no
Form: none
JavaScript: no
Pages: 1
Encrypted: no
Page size: 612 x 792 pts (letter)
Page rot: 0
File size: 12485 bytes
Optimized: no
PDF version: 1.4
############################################################
Results of pdfinfo on the file produced on a system running Squeeze:
############################################################
Title: file:///home/uuu/zzz-scratch-0510-4/foo1.html
Producer: pdftopdf
CreationDate: Fri May 10 12:31:43 2019 CDT
ModDate: Fri May 10 12:31:43 2019 CDT
Tagged: no
UserProperties: no
Suspects: no
Form: none
JavaScript: no
Pages: 1
Encrypted: no
Page size: 612 x 792 pts (letter)
Page rot: 0
File size: 8922 bytes
Optimized: no
PDF version: 1.3
############################################################
Best regards,
--Neil Ormos
Reply to: