Re: OCR questions

To: debian-user@lists.debian.org
Subject: Re: OCR questions
From: Rodolfo Medina <rodolfo.medina@gmail.com>
Date: Sat, 21 Jul 2007 22:25:43 +0200
Message-id: <[🔎] 87zm1pv708.fsf@gmail.com>
References: <877iqe4s98.fsf@gmail.com> <20070608085703.cbbb955a.celejar@gmail.com> <20070609045125.GH4974@localhost.localdomain> <[🔎] 87644dhd4y.fsf_-_@gmail.com> <[🔎] 20070721181027.GA2633@dementia.proulx.com>

Rodolfo Medina wrote:

> I tried gocr and the result was quite miserable.  Then I tried with MS Windows
> and it was almost perfect.  Somewhere in the web I read that OCR software
> under
> Linux is very poor at the moment and that it's better to use MS Windows for
> that: unfortunately my test seems to confirm that.  What do you Debian listers
> think?

bob@proulx.com (Bob Proulx) writes:

> I think you should check out these articles.
>
>   http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html
>
>   http://code.google.com/p/tesseract-ocr/
>
>   http://www.linux.com/articles/57222
>
>   http://sourceforge.net/projects/tesseract-ocr

Thanks.
I installed tesseract with configure, make, make install, then tried to run it
but got the following error message:

 Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset

.  In the README file there is:

Non-Windows:
You have to tell Tesseract through a standard unix mechanism where to find
its data directory. You must either:
./configure
make
make install
to move the data files to the standard place, or:
export TESSDATA_PREFIX="directory in which your tessdata resides/"
(or equivalent) in your .profile or whatever or setenv to set the environment
variable. Note that the directory must end in a /
HAVING tesseract and tessdata IN THE SAME DIRECTORY DOES NOT WORK ANY MORE.

.  I tried with `export TESSDATA_PREFIX="/usr/local/share/tessdata/"', but
nothing.  Now I'm stuck.  Any suggestion please?

Rodolfo

Reply to:

Follow-Ups:
- Re: OCR questions
  - From: Florian Kulzer <florian.kulzer+debian@icfo.es>

References:
- OCR questions (was: How to acquire text so to edit it?)
  - From: Rodolfo Medina <rodolfo.medina@gmail.com>
- Re: OCR questions (was: How to acquire text so to edit it?)
  - From: bob@proulx.com (Bob Proulx)

Prev by Date: Re: can't get the login window to appear after leaving machine...
Next by Date: Re: Debian not auto mount my CDs / DVDs
Previous by thread: Re: OCR questions (was: How to acquire text so to edit it?)
Next by thread: Re: OCR questions
Index(es):
- Date
- Thread