proofing searchable pdf files

To: Debian User <debian-user@lists.debian.org>
Subject: proofing searchable pdf files
From: Gary Roach <gary719_list1@verizon.net>
Date: Thu, 30 Oct 2014 17:47:41 -0700
Message-id: <[🔎] 5452DC2D.2040502@verizon.net>

Hi all,

Problem:

I am working on an archiving project and wish to archive documents tosearchable pdf files but can't seem to figure out how to proof read andcorrect the text overlay. Any suggestions.


System:
	Debian Wheezy
	Intel i5-750 processor
	HP Officejet Pro 8600 wireless all in one printer/fax/scanner
	gscan2pdf software with Tesseract ocr
	300 to 600 dpi scans.

Tesseract seems to do a really great job but I have no good way ofproving this or correcting any mistakes. Some of the documents are 100years old and may not be in such great shape. I can always retypeeverything but would like to avoid this, as much as possible, forobvious reasons.


Gary R.

Reply to:

Follow-Ups:
- Re: proofing searchable pdf files
  - From: Doug <dmcgarrett@optonline.net>
- Re: proofing searchable pdf files
  - From: Gary Dale <garydale@torfree.net>
- Re: proofing searchable pdf files
  - From: Gary Roach <gary719_list1@verizon.net>

Prev by Date: Re: Preventing the computer from shutting down.
Next by Date: Re: proofing searchable pdf files
Previous by thread: Re: terminal spreadsheet - sc fork
Next by thread: Re: proofing searchable pdf files
Index(es):
- Date
- Thread