
Re: How to extract text from PDF?

Andrius wrote:
Hi lads,

technical question: is it possible to extract text from PDF? From PDF to txt.

You can install pdfedit with apt-get (apt-get install pdfedit).

Mileage will vary; only some PDFs play nice.

There is a new facility in pdfedit, PDF to XML, which might help in some difficult cases, but don't expect it to be pain-free.

I seem to recall Ghostscript can do text extraction too. (PDF is only a PS wrapper.) (Goes and hides.)
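
For what it's worth, here is a rough sketch of the Ghostscript route, driven from Python. It assumes a reasonably recent Ghostscript that ships the txtwrite output device and is installed as "gs" on your PATH; the function names and file names are made up for illustration. As above, mileage will vary: scanned or oddly encoded PDFs may give you little or nothing.

```python
import shutil
import subprocess
from pathlib import Path


def build_gs_command(pdf_path, txt_path, gs_binary="gs"):
    """Build the Ghostscript argument list for a text-extraction run.

    Assumption: the installed Ghostscript provides the 'txtwrite' device.
    """
    return [
        gs_binary,
        "-q",                        # suppress the startup banner
        "-dNOPAUSE",                 # do not pause between pages
        "-dBATCH",                   # exit when the input is processed
        "-sDEVICE=txtwrite",         # write extracted text, not an image
        f"-sOutputFile={txt_path}",  # where the text ends up
        str(pdf_path),
    ]


def pdf_to_text(pdf_path, txt_path, gs_binary="gs"):
    """Run Ghostscript on pdf_path and return the extracted text."""
    if shutil.which(gs_binary) is None:
        raise FileNotFoundError(f"{gs_binary!r} not found on PATH")
    # check=True raises CalledProcessError if Ghostscript exits non-zero.
    subprocess.run(build_gs_command(pdf_path, txt_path, gs_binary), check=True)
    # Undecodable bytes are replaced rather than raising an error.
    return Path(txt_path).read_text(errors="replace")
```

Calling pdf_to_text("input.pdf", "output.txt") should leave the text in output.txt and hand it back as a string. Building the command is kept separate from running it, so you can print the exact gs invocation and try it by hand first.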
