[Corpora-List] Chinese Tokenization

Alla Shashkina alkasiko at yahoo.de
Wed Aug 15 11:45:15 UTC 2012


Hi Jiajin and all,

is there any good tokenizer also for traditional Chinese?

Thanks,
Alla

On Aug 14, 2012, at 2:37 PM, Xu Jiajin <ustcxujj at gmail.com> wrote:

> Hi Ajay,
> 
> You can get a copy of ICTCLAS tokeniser developed by Dr. Kevin Zhang at http://www.ictclas.org/ictclas_download.aspx.
> 
> ICTCLAS is one of the best Chinese tokenisers.
> 
> Jiajin XU
> Ph.D., associate professor
> National Research Centre for Foreign Language Education
> Beijing Foreign Studies University
> 
> On Tue, Aug 14, 2012 at 8:17 PM, Ajay <ajay0221 at gmail.com> wrote:
> Dear Corpora list members,
> 
> I am looking for Chinese Tokenization and Chinese Lemmatizer tool to tokenize Chinese Wikipedia text.
> Please suggest a open-source, and freely available tool.
> 
> Regards,
> Ajay Dubey 
> M.S. by Research
> SIEL, IIIT, Hyderabad
> 
> 
> 
> 
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
> 
> 
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120815/8a6820ac/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list