[Corpora-List] Australian newspaper corpora

Monika Bednarek Monika.Bednarek at phil.uni-augsburg.de
Sat May 12 03:57:48 UTC 2007


Dear all,

A while ago I asked for information on Australian newspaper 
corpora. Thank you very much to Eric Atwell, Khurshid Ahmad, Steven 
Bird, and Martin Wynne for their very helpful suggestions.

Here's a summary of the responses:

- use the web as a corpus (with a web-as-corpus collection tool such as 
WWW-Bootcat http://corpora.fi.muni.cz/bootcat/
or WeBoCa http://code.google.com/p/weboca/)
- ICAME (files from The Age)
- The NLTK corpus distribution: 
http://nltk.sourceforge.net/wiki/index.php/Corpora
- the Bank of English at the University of Birmingham


Best regards,

Monika


