Bug#511307: ITP: tmextractors -- A pure java library for extracting text from Word documents
You may want to have a look at
http://poi.apache.org/
as tmextractors seems to duplicate its functionality.
/Johan
Florian Richter wrote:
> Package: wnpp
> Severity: wishlist
> Owner: Florian Richter <Florian_Richter@gmx.de>
>
> * Package name    : tmextractors
>   Version         : 1.0
>   Upstream Author : Benryan Software Inc.
> * URL             : http://www.textmining.org/
> * License         : LGPL
>   Programming Lang: Java
>   Description     : A pure java library for extracting text from Word documents
>
> This is a pure java library for extracting text from Word documents.
>
> -- System Information:
> Debian Release: 5.0
> APT prefers testing
> APT policy: (500, 'testing')
> Architecture: i386 (i686)
>
--
------------------------------------------------
Johan Henriksson
MSc Engineering
PhD student, Karolinska Institutet
http://mahogny.areta.org http://www.endrov.net