[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: document archiving w/ scanner



On Sat, 2004-07-10 at 01:14 +0200, martin f krafft wrote:
> also sprach William Ballard <40618.nospam@comcast.net> [2004.07.10.0041 +0200]:
> > Search the archives for my and other's discussions about project 
> > gutenbergs tests with gocr and other open source OCR programs.
> 
> great pointer. I guess the conclusion here is that gocr and clara
> pretty much suck and for any serious work, I have to go with
> OmniPage or other commercial products. Damn.

At my last employer, I used Ascent Capture (on windows) to scan images
and index them against a postgresql+debian server and used a wxPython
application I wrote to search and view them. We used indexing info
(date, names, etc.) instead of the text of the documents, but Ascent
Capture can do that too. Obviously there are non-free parts to that
solution, but that was the best I was able to come up with. If you'd
like some more info on that setup feel free to drop me a line off-list.

-Mark



Reply to: