[Corpora-List] Nigerian scam corpus
    radev at umich.edu 
    radev at umich.edu
       
    Wed Sep 19 02:14:16 UTC 2007
    
    
  
I have a corpus with 4,000 Nigerian spam messages.
> 
> --===============1910948852==
> Content-Type: multipart/alternative;
> 	boundary="_7e5f2a67-a470-44e8-9ca5-9460a4d22e54_"
> 
> --_7e5f2a67-a470-44e8-9ca5-9460a4d22e54_
> Content-Type: text/plain; charset="iso-8859-1"
> Content-Transfer-Encoding: quoted-printable
> 
> 
> Hi Dimitri,
> 
> I have a Nigerian Scam corpus of a couple hundred letters (emails) I think.=
>   But these aren't tagged.   Let me know.
> 
> Jerry
> 
> > Date: Tue, 18 Sep 2007 18:16:58 -0400
> > From: dimitridf at yahoo.com
> > To: CORPORA at uib.no
> > Subject: [Corpora-List] Nigerian scam corpus
> >=20
> > Hello to everyone,
> >=20
> > Does anyone here knows of a recent Nigerian scam (scam 419 or advance fee=
>  fraud)
> > corpus?=20
> > See: http://en.wikipedia.org/wiki/Advance_fee_fraud
> > Possibly in both French and English...=20
> >=20
> > Thank you very much in advance for your help.=20
> >=20
> > Dimitri della Faille
> >=20
> >=20
> >=20
> >       Essayez le Tout-nouveau Yahoo! Courriel, pour voir un aper=E7u du p=
> anneau de lecture de courriels !
> > http://us.rd.yahoo.com/evt=3D40705/*http://mrd.mail.yahoo.com/try_beta?.i=
> ntl=3Dcf
> >=20
> > _______________________________________________
> > Corpora mailing list
> > Corpora at uib.no
> > http://mailman.uib.no/listinfo/corpora
> 
> _________________________________________________________________
> Explore the seven wonders of the world
> http://search.msn.com/results.aspx?q=3D7+wonders+world&mkt=3Den-US&form=3DQ=
> BRE=
> 
> --_7e5f2a67-a470-44e8-9ca5-9460a4d22e54_
> Content-Type: text/html; charset="iso-8859-1"
> Content-Transfer-Encoding: quoted-printable
> 
> <html>
> <head>
> <style>
> .hmmessage P
> {
> margin:0px;
> padding:0px
> }
> body.hmmessage
> {
> FONT-SIZE: 10pt;
> FONT-FAMILY:Tahoma
> }
> </style>
> </head>
> <body class=3D'hmmessage'>
> Hi Dimitri,<br><br>I have a Nigerian Scam corpus of a couple hundred letter=
> s (emails) I think.  But these aren't tagged.   Let me know.=
> <br><br>Jerry<br><br>> Date: Tue, 18 Sep 2007 18:16:58 -0400<br>> Fro=
> m: dimitridf at yahoo.com<br>> To: CORPORA at uib.no<br>> Subject: [Corpora=
> -List] Nigerian scam corpus<br>> <br>> Hello to everyone,<br>> <br=
> >> Does anyone here knows of a recent Nigerian scam (scam 419 or advance=
>  fee fraud)<br>> corpus? <br>> See: http://en.wikipedia.org/wiki/Adva=
> nce_fee_fraud<br>> Possibly in both French and English... <br>> <br>&=
> gt; Thank you very much in advance for your help. <br>> <br>> Dimitri=
>  della Faille<br>> <br>> <br>> <br>>       Essayez le Tout-nouv=
> eau Yahoo! Courriel, pour voir un aper=E7u du panneau de lecture de courrie=
> ls !<br>> http://us.rd.yahoo.com/evt=3D40705/*http://mrd.mail.yahoo.com/=
> try_beta?.intl=3Dcf<br>> <br>> ______________________________________=
> _________<br>> Corpora mailing list<br>> Corpora at uib.no<br>> http:=
> //mailman.uib.no/listinfo/corpora<br><br /><hr />Explore the seven wonders =
> of the world <a href=3D'http://search.msn.com/results.aspx?q=3D7+wonders+wo=
> rld&mkt=3Den-US&form=3DQBRE' target=3D'_new'>Learn more!</a></body>
> </html>=
> 
> --_7e5f2a67-a470-44e8-9ca5-9460a4d22e54_--
> 
> 
> --===============1910948852==
> Content-Type: text/plain; charset="iso-8859-1"
> MIME-Version: 1.0
> Content-Transfer-Encoding: quoted-printable
> Content-Disposition: inline
> 
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
> 
> --===============1910948852==--
> 
> 
> 
-- 
Dragomir R. Radev                    Associate Professor
SI, CSE, Ling                     U. Michigan, Ann Arbor 
http://www.eecs.umich.edu/~radev         radev at umich.edu              
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
    
    
More information about the Corpora
mailing list