[Corpora-List] Mandarin corpus with Pinyin and sentence contexts?

Stephen Politzer-Ahles spa268 at nyu.edu
Mon Sep 1 10:21:01 UTC 2014


Hello all,

I am looking for a corpus that meets the following criteria:

1) includes the actual raw sentences (not just frequency counts)
2) has Pinyin as well as characters
3) can be downloaded in full (not just queried via a web interface)

So far I'm only aware of the Lancaster corpus. Some other corpora, like the
Academia Sinica corpus and the HKUST telephone corpus, might also meet my
needs, but they're not free, so I don't know what they're like.

Any suggestions would be greatly appreciated!

Best,
Steve


Stephen Politzer-Ahles
New York University, Abu Dhabi
Neuroscience of Language Lab
http://www.nyu.edu/projects/politzer-ahles/