
Bug#605905: ITP: frog -- tagger and parser for Dutch language

Package: wnpp
Severity: wishlist
Owner: Joost van Baal <joostvb-debian-bugs@mdcc.cx>

* Package name    : frog
  Version         : 0.1
  Upstream Author : ILK Research Group, Tilburg University, http://ilk.uvt.nl
* URL             : http://ilk.uvt.nl/tadpole
* License         : GPL-3
  Programming Lang: C++, Python
  Description     : tagger and parser for Dutch language

 Memory-Based Learning (MBL) is a machine-learning method applicable to a wide
 range of tasks in Natural Language Processing (NLP).
 Frog is a modular system integrating a morphosyntactic tagger, lemmatizer,
 morphological analyzer, and dependency parser for the Dutch language.  It is
 based upon its predecessor TADPOLE (TAgger, Dependency Parser, and
 mOrphoLogical analyzEr).  Using Memory-Based Learning techniques, Tadpole
 tokenizes, tags, lemmatizes, and morphologically segments word tokens in
 incoming Dutch UTF-8 text files, and assigns a dependency graph to each
 sentence.  Tadpole is particularly targeted at the increasing need for fast,
 automatic NLP systems applicable to very large (multi-million to billion word)
 document collections that are becoming available due to the progressive
 digitization of both new and old textual data.
 Frog is a product of the ILK (Induction of Linguistic Knowledge) research
 group of Tilburg University and the CNTS research group of the University
 of Antwerp.
 If you do scientific research in NLP, Frog will likely be of use to you.


Frog depends upon the TiMBL, TimblServer, and Mbt software packages.  These are
available from the following APT repository:

 deb http://apt.ticc.uvt.nl lenny main
 deb-src http://apt.ticc.uvt.nl lenny main

ITPs for these packages are planned.  Frog has not yet been publicly
released (Tadpole has; see
http://ilk.uvt.nl/downloads/pub/software/tadpole-0.6.tar.gz).


