Corpora: non-English corpora
jre at comp.leeds.ac.uk
Fri Jun 1 10:30:22 UTC 2001
Greetings all
I am holding out my begging bowl again! I am trying to find non-English
PoS-TAGGED corpora, which can be as little as a few thousand words. Ideally, I am looking for languages such as Arabic, Hindi, Russian, Basque, Spanish, Vietnamese, Latin and even Sanskrit. Any of these or similar would be most welcome.
Ever hopeful...
John
********************************************************
John Elliott
Centre for Computer Analysis of Language and Speech
University of Leeds
email: jre at scs.leeds.ac.uk
phone: 0113 233 6827
Web-site http://www.scs.leeds.ac.uk/jre
********************************************************