Re: search through postscript documents?
On Thu, Mar 03, 2005 at 12:15:40PM +0100, Joerg Reckers wrote:
> Is there a way (or a program) to search for expressions in a PostScript document,
> and to copy and paste words out of a Ghostview-style viewer as text?
> I am using KGhostView now and am missing these features, so I am asking
> on this list. :-)
> thanks, joerg
Take a look at pstotext:
Maintainer: J.H.M. Dassen (Ray) <email@example.com>
Depends: gs | gs-aladdin (>= 3.51), libc6 (>= 2.3.2.ds1-4)
Description: Extract text from PostScript and PDF files
pstotext extracts text (in the ISO 8859-1 character set) from a PostScript
or PDF (Portable Document Format) file. Thus, pstotext is similar to the
ps2ascii program that comes with ghostscript. The output of pstotext is
however better than that of ps2ascii, because pstotext deals better with
punctuation and ligatures.
So you can pipe the output of pstotext into a shell script that uses sed or
gawk to search or process the text.
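
A minimal sketch of that pipeline, assuming pstotext is installed and writes the extracted text to standard output when no output file is given. The function name ps_search and the PS_EXTRACT variable are made up for this example; PS_EXTRACT is only there so another extractor (ps2ascii, say) can be swapped in.

```shell
#!/bin/sh
# Hypothetical helper: search one PostScript or PDF file for a pattern.
# PS_EXTRACT is an assumed override for the extractor command;
# when it is unset, pstotext is used.
ps_search() {
    pattern=$1
    file=$2
    # Extract the text, then print the matching lines, case-insensitively,
    # each prefixed with its line number in the extracted text.
    "${PS_EXTRACT:-pstotext}" "$file" | grep -i -n -e "$pattern"
}
```

Something like `ps_search 'ligature' paper.ps` would then list the matching lines. The line numbers refer to the extracted text, not to pages of the document, and the same pipe works with sed or gawk in place of grep.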