[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Book scanning frontend application?



Hi.

It appears that ocropus and tesseract look pretty promising.  I am
wondering, does anyone know of an active project to develop a typical
free software book scanning frontend application like OpenBook or
similar products from the commercial world?  I know that Emacspeak has
some
code to interface ocropus, but that is a little bit too much tied into
Emacspeak for my current tastes.

Any hints?  If no, we should probably develop such a thing, do you have
wishes for a feature list?  To me, a frontend needs to:
 * Keep track of page numbers, allowing me to delete pages and renumber
   them.
 * Provide speech output and scanning in background.  I.e., while speech
   is reading the text, scanning new pages should not interrupt speech.
   This is very comfortable when reading a book, you can turn pages
   during listening to the text with a minimum of interaction, i.e.,
   just a single key press per page.
 * Allow to edit the text so that scanning errors can be corrected.
   Ideally, with a feedback mechanism that populates a dictionary for the
   OCR engine.
 * A pronouncation dictionary, ideally with a submission system so that
   we can collect good improvements from users and eventually incorporate
   them into the engines we were using (like espeak).

Since I am not a US citizen, I am not terrible interested in Bookshare
integration, but I guess that such features would be desireable as well.

Any thoughts?
-- 
CYa,
  ⡍⠁⠗⠊⠕


Reply to: