[Corpora-List] chinese pos tagger/lemmatizer

Marco Baroni baroni at sslmit.unibo.it
Thu Jan 19 13:28:23 UTC 2006


Dear all,

Does anybody know of a tokenizer/POS tagger  for the Chinese language, 
ideally with these characteristics:

- documented in English
- free or cheap
- runs on the Unix command line, more or less out-of-the-box

Moreover, we are also looking for a tool/electronic resource that, given a 
tokenized word, would provide a pinyin transcription of the word. Does such 
a tool exist?

Thanks in advance for the advice.

Regards,

Marco


-- 
Marco Baroni
SSLMIT, University of Bologna
http://sslmit.unibo.it/~baroni



More information about the Corpora mailing list