Re: OCR questions
Rodolfo Medina wrote:
> I tried gocr and the result was quite miserable. Then I tried with MS Windows
> and it was almost perfect. Somewhere in the web I read that OCR software under
> Linux is very poor at the moment and that it's better to use MS Windows for
> that: unfortunately my test seems to confirm that. What do you Debian listers
> think?
bob@proulx.com (Bob Proulx) writes:
> I think you should check out these articles.
>
> http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html
>
> http://code.google.com/p/tesseract-ocr/
>
> http://www.linux.com/articles/57222
>
> http://sourceforge.net/projects/tesseract-ocr
Thanks.
I installed tesseract with configure, make, make install, then tried to run it
but got the following error message:
Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset
In the README file there is:
Non-Windows:
You have to tell Tesseract through a standard unix mechanism where to find
its data directory. You must either:
./configure
make
make install
to move the data files to the standard place, or:
export TESSDATA_PREFIX="directory in which your tessdata resides/"
(or equivalent) in your .profile or whatever or setenv to set the environment
variable. Note that the directory must end in a /
HAVING tesseract and tessdata IN THE SAME DIRECTORY DOES NOT WORK ANY MORE.
I tried with `export TESSDATA_PREFIX="/usr/local/share/tessdata/"', but
nothing changed. Now I'm stuck. Any suggestions, please?
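One thing that may be worth checking is which path tesseract actually ends up
looking in. The sketch below rests on an assumption, not on anything confirmed
here: that tesseract appends "tessdata/" to TESSDATA_PREFIX, so the variable
would name the directory that contains tessdata rather than tessdata itself
(one way to read "directory in which your tessdata resides/" in the README).
check_tessdata is a hypothetical helper, not part of tesseract.

```shell
# Hypothetical helper: report whether eng.unicharset is readable under a
# given TESSDATA_PREFIX. ASSUMPTION: tesseract appends "tessdata/" to the
# prefix, and the prefix ends in a slash as the README requires.
check_tessdata() {
    prefix="$1"
    file="${prefix}tessdata/eng.unicharset"
    if [ -r "$file" ]; then
        echo "found: $file"
        return 0
    else
        echo "missing: $file"
        return 1
    fi
}
```

Running `check_tessdata /usr/local/share/` and
`check_tessdata /usr/local/share/tessdata/` and comparing the two results
shows which value, if either, points at the file. If neither reports it, the
data files are probably not in /usr/local/share/tessdata at all.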
Rodolfo