[Corpora-List] Phishing email corpus

Vlado Keselj vlado at cs.dal.ca
Mon May 1 12:20:45 UTC 2006


There is a collection at the site of Anti-Phishing Working Group: 
http://www.antiphishing.org/phishing_archive.html

I have my own collection and am interested in sharing it in order to 
create a larger public-domain collection.

Best regards,
--Vlado

On Sun, 30 Apr 2006 radev at umich.edu wrote:

> I don't know him. I have been collecting these myself after I read the
> paper in Scientific American:
> 
> @article{bennett&al.03,
>   author =      {Bennett, Charles H. and Li, Ming and Ma, Bin},
>   title =       {{Chain Letters and Evolutionary Histories}},
>   journal =     {{Scientific American}},
>   month =       {June},
>   year =        {2003},
>   pages =       {76--81},
>   url =   {http://www.sciam.com/article.cfm?colID=1&articleID=0003D476-1852-1EB7-BDC0809EC588EEDF},
> }
> 
> 
> j_kurjian at hotmail.com wrote:
> > 
> > Well, Lawrence Kestenbaum is in Michigan somewhere so you might have more 
> > luck than I did.  He has a quite a few on his site, but he claims to have 15 
> > or 20 thousand on his hard drive!
> > J
> > 
> > 
> > 
> > >
> > >I have a larger collection of "Nigerian" Letters, more than 2,500 of
> > >them, collected since 1998. If anyone is interested, drop me a note.
> > >
> > >D.
> > >
> > >j_kurjian at hotmail.com wrote:
> > > >
> > > > Nicklas -
> > > > I don't know if this is what you're looking for but I have a collection 
> > >o=
> > > > f=20
> > > > "Nigerian Letters," about 100 of them.  They are not tagged.  If you 
> > >are=20
> > > > handy with a spider or offline browser, you might be able to get some 
> > >fro=
> > > > m:
> > > > http://potifos.com/fraud/
> > > > Last year I contacted it's owner, Lawrence Kestenbaum, and he almost 
> > >agre=
> > > > ed=20
> > > > to make his massive collection available to me for a corpus.  Then it 
> > >fel=
> > > > l=20
> > > > through.  His site has lotto letters and probably other types too.  
> > >Hope=20
> > > > that helps.  Let me know if you want what I have.
> > > > Jerry Kurjian
> > > >
> > > >
> > > > >
> > > > >Hi
> > > > >
> > > > >My name is Nicklas Karlsson, and I'm a student at V=E4xj=F6 University 
> > >i=
> > > > n=20
> > > > >Sweden.
> > > > >I'm working on my bachelor's degree with a phishing detection and 
> > >warnin=
> > > > g=20
> > > > >project.
> > > > >The part I'm working on is a classification module, which will 
> > >classify=20
> > > > >emails to find the phishing emails and mark them for further 
> > >investigati=
> > > > on.
> > > > >To develop this and test it I need a collection of phishing emails. 
> > >I've=
> > > > =20
> > > > >collected a few but I still need more.
> > > > >
> > > > >So I wonder if anyone has or knows where I can find a corpus with 
> > >phishi=
> > > > ng=20
> > > > >emails?
> > > > >
> > > > >I'll post a list of all replys sent directly to me.
> > > > >
> > > > >Thanks
> > > > >
> > > > >Nicklas Karlsson
> > > > >
> > > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > >
> > >
> > >--
> > >Dragomir R. Radev                                         radev at umich.edu
> > >Associate Professor of Information, Electrical Engineering and
> > >Computer Science, and Linguistics, the University of Michigan, Ann Arbor
> > >Phone: 734-615-5225   Fax: 734-764-2475    http://www.si.umich.edu/~radev
> > 
> > 
> > 
> > 
> > 
> 
> 
> 



More information about the Corpora mailing list