[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#894068: ocrmypdf: New dependency on PyMuPDF for v6.0.0



Hello,

On Mon, Mar 26 2018, James R Barlow wrote:

> Thanks for the information. That's a worryingly high wall to climb and
> I'm concerned about implications for other platforms as well.
>
> I would appreciate if you can see about getting an exception, but I
> think I will change PyMuPDF to an optional but recommended dependency
> fairly soon.

That would be great in the meantime.

> I haven't made a major investment in it as yet with new code, but it
> does provide some powerful features that would be a major engineering
> effort to replicate and are likely not going to materialize in another
> open source library anytime soon. (Specifically: incremental updates,
> safe editing of PDF/A, PDF object garbage collection, fast
> rasterizing, robust text extraction.) The most commonly used Python
> PDF library, PyPDF2, is essentially unmaintained and in poor shape.

Having thought some more, I think our best bet will be to try to get
pymupdf to support linking against the static version of mupdf.  We have
techniques in Debian to deal with security updates in that case (called
binNMUs if you want to look them up).

-- 
Sean Whitton

Attachment: signature.asc
Description: PGP signature


Reply to: