Hi lads,technical question: is it possible to extract text from PDF? From PDF to txt.
You can apt-get pdfedit Mileage will vary, only some PDF play nice.There is a new facility in pdfedit which might help in some difficult cases, pdf to xml but don't expect pain free.
I seem to recall ghostscript can do text extract too. (pdf is only a ps wrapper) (goes and hides)