[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: PinYin Standard




erm, do you mean that sometimes the same char has different pronounciations? Or what? you can of course use the xcin input table as a database, because every chinese char has the pinyin writing in this table. if you want to differ between certain situations when you have to use which pronounciation, then of course you should look on the following chars and have a little dictionary included that decides which pronounciation to use.
I think a simple perl script should fit your needs in this case.

$>wc pinyin.cin -->  20976   41951  217690 pinyin.cin

Here is the wc on pinyin.cin I thinkI can first base on this to build a prototype. 20976 line include multiple chars input... so total singal char to pinyin don't really map all the gbk. Also the standard pronouncation of the word should include
the tone 1-6 for cantonese 1-4 for mandarin...

ftp://ftp.unicode.org/Public/UNIDATA/Unihan.txt
in this file you get more detail how it should look like....
in pinyin.cin... may not be enought for my use... of course better then
nothing :)
Thanks alot of your input

Alex








Reply to: