[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Bug#205689: ITP: juman -- a Japanese Morphological Analysis System



On Saturday 16 August 2003 12:25, TSUCHIYA Masatoshi wrote:
> Package: wnpp
> Severity: wishlist
>
> * Package name    : juman
>   Version         : 4.0
>   Upstream Author : Sadao Kurohashi <nl-resource@kc.t.u-tokyo.ac.jp>
> * URL or Web page : http://www.kc.t.u-tokyo.ac.jp/nl-resource/juman.html
> * License         : BSD
>   Description     : a Japanese Morphological Analysis System
>
> JUMAN is a morphological analysys system. It can segment and tokenize
> Japanese text string, and can output with many additional informations
> (pronunciation, semantic information, and others).  It will print the
> result of such an operation to the standard output, so that it can
> either written to a file or further processed.

You should add a note about encoding issues. (After a quick try, it seems that 
it only accepts EUC-JP input (and outputs the same))

Mike



Reply to: