[Corpora-List] More text added to my USENET corpus. Also: new info on its availability.

Cyrus Shaoul cyrus.shaoul at ualberta.ca
Fri Mar 16 22:17:06 UTC 2007


Fellow list members,

I have just uploaded the USENET corpus data from the first three months 
of 2007 to our server. This will add approximately 1 billion more
words of text to the archive.

As always, please go this the following URL to download all or part of 
the corpus:

    
http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html

Also, after many people requested it, I have been able to set up an 
secondary server for those who are connected to non-academic networks.
The only limitation is that this new server can only serve up 1Gb of 
data per day (aggregated across all users).
Please report any problems with this new server to me.
(For those one non-academic networks, the new server is automatically 
chosen for you when you use the URL above.)

Thanks,

Cyrus


-- 
=[=]={=}=[=]={=}=[=]={=}=[=]={=}=[=]={=}
Cyrus Shaoul
http://www.psych.ualberta.ca/~westburylab/
University of Alberta
=[=]={=}=[=]={=}=[=]={=}=[=]={=}=[=]={=}


-------------- next part --------------
A non-text attachment was scrubbed...
Name: cyrus.shaoul.vcf
Type: text/x-vcard
Size: 293 bytes
Desc: not available
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20070316/f5d0891f/attachment.vcf>


More information about the Corpora mailing list