[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#580275: ITP: liblingua-en-tagger-perl -- Part-of-speech tagger for English natural language processing.



Package: wnpp
Owner: Chris Butler <chrisb@debian.org>
Severity: wishlist
X-Debbugs-CC: debian-devel@lists.debian.org,debian-perl@lists.debian.org

* Package name    : liblingua-en-tagger-perl
  Version         : 0.15
  Upstream Author : Aaron Coburn <aaron@coburncuadrado.com>
* URL             : http://search.cpan.org/dist/Lingua-EN-Tagger/
* License         : GPL-3+
  Programming Lang: Perl
  Description     : Part-of-speech tagger for English natural language processing.

Lingua::EN::Tagger is a perl module which implements a probability based,
corpus-trained tagger that assigns POS tags to English text based on a lookup
dictionary and a set of probability values. The tagger assigns appropriate
tags based on conditional probabilities - it examines the preceding tag to
determine the appropriate tag for the current word. Unknown words are
classified according to word morphology or can be set to be treated as nouns
or other parts of speech.

The tagger also extracts as many nouns and noun phrases as it can, using a
set of regular expressions.



Reply to: