[Corpora-List] Chinese Tokenization
Alla Shashkina
alkasiko at yahoo.de
Wed Aug 15 11:45:15 UTC 2012
Hi Jiajin and all,
is there any good tokenizer also for traditional Chinese?
Thanks,
Alla
On Aug 14, 2012, at 2:37 PM, Xu Jiajin <ustcxujj at gmail.com> wrote:
> Hi Ajay,
>
> You can get a copy of ICTCLAS tokeniser developed by Dr. Kevin Zhang at http://www.ictclas.org/ictclas_download.aspx.
>
> ICTCLAS is one of the best Chinese tokenisers.
>
> Jiajin XU
> Ph.D., associate professor
> National Research Centre for Foreign Language Education
> Beijing Foreign Studies University
>
> On Tue, Aug 14, 2012 at 8:17 PM, Ajay <ajay0221 at gmail.com> wrote:
> Dear Corpora list members,
>
> I am looking for Chinese Tokenization and Chinese Lemmatizer tool to tokenize Chinese Wikipedia text.
> Please suggest a open-source, and freely available tool.
>
> Regards,
> Ajay Dubey
> M.S. by Research
> SIEL, IIIT, Hyderabad
>
>
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120815/8a6820ac/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list