[Corpora-List] chinese pos tagger/lemmatizer
Marco Baroni
baroni at sslmit.unibo.it
Thu Jan 19 13:28:23 UTC 2006
Dear all,
Does anybody know of a tokenizer/POS tagger for the Chinese language,
ideally with these characteristics:
- documented in English
- free or cheap
- runs on the Unix command line, more or less out-of-the-box
Moreover, we are also looking for a tool/electronic resource that, given a
tokenized word, would provide a pinyin transcription of the word. Does such
a tool exist?
Thanks in advance for the advice.
Regards,
Marco
--
Marco Baroni
SSLMIT, University of Bologna
http://sslmit.unibo.it/~baroni
More information about the Corpora
mailing list