
Re: OCR questions



Rodolfo Medina wrote:

> I tried gocr and the result was quite miserable.  Then I tried with MS Windows
> and it was almost perfect.  Somewhere on the web I read that OCR software under
> Linux is very poor at the moment and that it's better to use MS Windows for
> that: unfortunately my test seems to confirm that.  What do you Debian listers
> think?



bob@proulx.com (Bob Proulx) writes:

> I think you should check out these articles.
>
>   http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html
>
>   http://code.google.com/p/tesseract-ocr/
>
>   http://www.linux.com/articles/57222
>
>   http://sourceforge.net/projects/tesseract-ocr


Thanks.
I installed tesseract with configure, make, make install, then tried to run it
but got the following error message:

 Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset

The README file says:

Non-Windows:
You have to tell Tesseract through a standard unix mechanism where to find
its data directory. You must either:
./configure
make
make install
to move the data files to the standard place, or:
export TESSDATA_PREFIX="directory in which your tessdata resides/"
(or equivalent) in your .profile or whatever or setenv to set the environment
variable. Note that the directory must end in a /
HAVING tesseract and tessdata IN THE SAME DIRECTORY DOES NOT WORK ANY MORE.


I tried `export TESSDATA_PREFIX="/usr/local/share/tessdata/"', but it made
no difference.  Now I'm stuck.  Any suggestions, please?

Rodolfo
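
A first diagnostic step for the error above is to check whether the file named
in the message exists at all.  The following is only a minimal sketch: the
function name check_tessdata is hypothetical, the path and the language code
"eng" are taken from the error message, and it assumes a POSIX shell.  If the
file turns out to be missing, one possibility (an assumption, not confirmed in
this thread) is that the language data was never installed into that directory.

```shell
#!/bin/sh
# Hypothetical helper: report whether the unicharset file Tesseract
# asked for is present.
#   $1 = data directory (with or without a trailing slash)
#   $2 = language code (defaults to "eng")
check_tessdata() {
    dir="${1%/}"                      # strip one trailing slash, if any
    lang="${2:-eng}"
    file="$dir/$lang.unicharset"
    if [ -f "$file" ]; then
        echo "found: $file"
        return 0
    else
        echo "missing: $file"
        return 1
    fi
}

# Path taken from the error message; "|| true" keeps the script going
# when the file is absent.
check_tessdata /usr/local/share/tessdata eng || true

# Show what the current shell will pass on to tesseract.
echo "TESSDATA_PREFIX=${TESSDATA_PREFIX:-<unset>}"
```

Running the script in the same shell session in which tesseract is started
shows both whether the data file is in place and whether the exported variable
is actually set there.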


