[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#499606: ITP: tika -- a Java library for extracting textual information from various documents



Package: wnpp
Severity: wishlist
Owner: "Jan-Pascal van Best" <janpascal@vanbest.org>

* Package name    : tika
  Version         : 0.2-SNAPSHOT
  Upstream Author : Jukka Zitting <jukka@apache.org> and others
* URL             : http://incubator.apache.org/tika/
* License         : Apache 2.0
  Programming Lang: Java
  Description     : a Java library for extracting textual information from various documents

Apache Tika is a toolkit for detecting and extracting metadata and structured
text content from various documents using existing parser libraries.

-- System Information:
Debian Release: lenny/sid
  APT prefers testing
  APT policy: (990, 'testing'), (400, 'unstable')
Architecture: amd64 (x86_64)



Reply to: