<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.6000.16414" name=GENERATOR></HEAD>
<BODY>
<DIV><SPAN class=643120714-06042007>Dear All, </SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>Here is a quick summary of the messages I
got in response to my recent query on email corpora. I'd like to thank the
following list members for helpful pointers: </SPAN></DIV>
<DIV><SPAN class=643120714-06042007>Stefan Bordag</SPAN></DIV>
<DIV><SPAN class=643120714-06042007>Chris Jordan</SPAN></DIV>
<DIV><SPAN class=643120714-06042007>Sabine Bartsch</SPAN></DIV>
<DIV><SPAN class=643120714-06042007>Ramesh Krishnamurthy</SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>Stefan Bordag mentioned the (huge) USENET
corpus, which contains not emails but texts of a similar type (from an
internet discussion forum): <A
href="http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html"><U><FONT
color=#0000ff>http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html</FONT></U></A></SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>Chris Jordan suggested the SpamAssassin
Corpus (<A
href="http://spamassassin.apache.org/">http://spamassassin.apache.org/</A>).</SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>Sabine Bartsch and Ramesh Krishnamurthy sent
me a link to the Wolverhampton junk email corpus (<A
href="http://clg.wlv.ac.uk/projects/junk-email/">http://clg.wlv.ac.uk/projects/junk-email/</A>);
Sabine also mentioned the email messages corpus from W3C lists (<A
href="http://tides.umiacs.umd.edu/webtrec/trecent/parsed_w3c_corpus.html">http://tides.umiacs.umd.edu/webtrec/trecent/parsed_w3c_corpus.html</A>).
</SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>I have now got plenty of corpus material to
keep my 'Analysing Texts' students busy... Thanks! </SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV><SPAN class=643120714-06042007>Very best wishes... Ute</SPAN></DIV>
<DIV><SPAN class=643120714-06042007></SPAN> </DIV>
<DIV> </DIV>
<DIV align=left>
<DIV align=left>
<DIV
align=left>************************************************************</DIV>
<DIV> </DIV>
<DIV>Dr. Ute Römer<BR>English Department<BR>Leibniz University of
Hanover<BR>Königsworther Platz 1<BR>30167 Hannover<BR>Germany</DIV>
<DIV> </DIV>
<DIV>Phone: +49 (0)511 762 2997<BR>Fax: +49 (0)511 762 2996<BR>Please note NEW
e-mail address: <A title=mailto:ute.roemer@engsem.uni-hannover.de
href="mailto:ute.roemer@engsem.uni-hannover.de">ute.roemer@engsem.uni-hannover.de</A><BR><A
title=http://www.uteroemer.com/
href="http://www.uteroemer.com/">http://www.uteroemer.com</A><BR><A
href="http://www.engsem.uni-hannover.de/angli/">http://www.engsem.uni-hannover.de/angli/</A><BR></DIV></DIV>
<DIV> </DIV></DIV>
<DIV> </DIV></BODY></HTML>