Book scanning frontend application?
Hi.
It appears that ocropus and tesseract look pretty promising. I am
wondering, does anyone know of an active project to develop a typical
free software book scanning frontend application like OpenBook or
similar products from the commercial world? I know that Emacspeak has
some
code to interface ocropus, but that is a little bit too much tied into
Emacspeak for my current tastes.
Any hints? If no, we should probably develop such a thing, do you have
wishes for a feature list? To me, a frontend needs to:
* Keep track of page numbers, allowing me to delete pages and renumber
them.
* Provide speech output and scanning in background. I.e., while speech
is reading the text, scanning new pages should not interrupt speech.
This is very comfortable when reading a book, you can turn pages
during listening to the text with a minimum of interaction, i.e.,
just a single key press per page.
* Allow to edit the text so that scanning errors can be corrected.
Ideally, with a feedback mechanism that populates a dictionary for the
OCR engine.
* A pronouncation dictionary, ideally with a submission system so that
we can collect good improvements from users and eventually incorporate
them into the engines we were using (like espeak).
Since I am not a US citizen, I am not terrible interested in Bookshare
integration, but I guess that such features would be desireable as well.
Any thoughts?
--
CYa,
⡍⠁⠗⠊⠕
Reply to: