[Corpora-List] Request People Name Corpus (English)

Scott Crossley sacrossley at gmail.com
Mon Jun 14 18:24:54 UTC 2010


The NLTK data package includes an English name corpus categorized by
gender that is free to download. I am not sure how up-to-date or
representative it is, but it is pretty big.

http://www.nltk.org/
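
For anyone who wants to try it, here is a rough sketch of how the names
might be pulled out by gender. It assumes the corpus is exposed as
nltk.corpus.names with the file ids "male.txt" and "female.txt"; the
helper names (names_by_gender, ambiguous_names, demo) are made up for
illustration and are not part of NLTK.

```python
from collections import defaultdict


def names_by_gender(reader, fileids=("male.txt", "female.txt")):
    """Map each gender label to a sorted list of unique names.

    `reader` is any object with a `words(fileid)` method, such as
    `nltk.corpus.names`. The label is the file id without ".txt".
    The default file ids are an assumption about the corpus layout.
    """
    result = {}
    for fileid in fileids:
        label = fileid.rsplit(".", 1)[0]
        result[label] = sorted(set(reader.words(fileid)))
    return result


def ambiguous_names(by_gender):
    """Return the names listed under more than one gender label."""
    seen = defaultdict(set)
    for label, names in by_gender.items():
        for name in names:
            seen[name].add(label)
    return sorted(name for name, labels in seen.items() if len(labels) > 1)


def demo():
    """Download the corpus (network access needed) and print counts."""
    import nltk  # requires: pip install nltk
    from nltk.corpus import names

    nltk.download("names")  # one-time download of the corpus data
    by_gender = names_by_gender(names)
    for label, items in by_gender.items():
        print(label, len(items))
    print("listed under both:", len(ambiguous_names(by_gender)))
```

Calling demo() should print one count per gender plus the number of
names that appear in both lists; those overlapping names are worth
checking before treating the lists as a clean gender lookup.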


> -----Original Message-----
> From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On  
> Behalf Of Waleed Oransa
> Sent: Monday, June 14, 2010 12:54 PM
> To: corpora at uib.no
> Subject: [Corpora-List] Request People Name Corpus (English)
>
> Hello all,
>
> I am looking for a corpus of people's names in English, categorized by
> gender. Do you know whether one exists? Some web sites have such
> data (e.g. baby names), so I thought I would check with you first,
> since extracting the names from the web takes some effort and may
> raise copyright issues. I appreciate your help.
>
> Of course, a similar parallel corpus would be fine too, especially an
> English-Arabic one.
>
> Thank you!
> Waleed
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

Scott Crossley, Ph.D.
Linguistics/TESOL

Department of English
Mississippi State University
http://www.msstate.edu/dept/english/faculty/crossley.htm
(662) 325-2355

Institute for Intelligent Systems
University of Memphis
http://www.iismemphis.org/
