[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1004115: ITP: epub2txt2 -- extract text from EPUB documents



Package: wnpp
Severity: wishlist
Owner: Thiago Pezzo <pezzo@protonmail.com>
X-Debbugs-Cc: debian-devel@lists.debian.org, pezzo@protonmail.com

* Package name    : epub2txt2
  Version         : 2.01
  Upstream Author : Kevin Boone <kevin@railwayterrace.com>
* URL             : https://github.com/kevinboone/epub2txt2
* License         : GPL-3, GPL-3+
  Programming Lang: C
  Description     : extract text from EPUB documents

epub2txt is a simple command-line utility for extracting text from EPUB
documents and, optionally, re-flowing it to fit a text display of a particular
number of columns. It is intended for reading EPUB e-books on embedded systems
that can't host a graphical EPUB viewer, or converting such e-books to read on
those systems. However, it should be robust enough for other purposes, such as
batch indexing of EPUB document collections.
.
epub2txt favours speed and low memory usage over accuracy of rendering. Most
of the formatting of the source document will be lost but, with a text-only
display, this is likely to be of little consequence. It will output
UTF8-encoded text by default, but can be told to output ASCII, in which case
it will try to convert non-ASCII characters into something displayable if
possible.


Reply to: