[Corpora-List] Australian newspaper corpora
Monika Bednarek
Monika.Bednarek at phil.uni-augsburg.de
Sat May 12 03:57:48 UTC 2007
Dear all,
a while ago I asked about information on Australian newspaper
corpora. Thank you very much to Eric Atwell, Khurshid Ahmad, Steven
Bird, and Martin Wynne for very helpful suggestions.
Here's a summary of the responses:
- use the web as corpus (with web-as-corpus collection tool such as
WWW-Bootcat http://corpora.fi.muni.cz/bootcat/
or WeBoCa http://code.google.com/p/weboca/)
- ICAMe (files from The Age)
- The NLTK corpus distribution:
http://nltk.sourceforge.net/wiki/index.php/Corpora
- the Bank of English at the University of Birmingham
Best regards,
Monika
More information about the Corpora
mailing list