[Corpora-List] free tagged corpus

Carlos Rodriguez crodriguezp at gmail.com
Thu Nov 17 00:33:06 UTC 2005


The Natural Language Toolkit distributes a datapack with the basic 
python modules that contains parts of the brown corpus, with POS tags 
and a paralel treebank. You can also add other modules for 
Spanis/Catalan corpora similarly tagged.

http://nltk.sf.net

-- 
Carlos Rodríguez
------------------------
Center for Genomic Sciences, UNAM
Computational Genomics Program
http://www.ccg.unam.mx/Computational_Genomics



radev at umich.edu wrote:
>You can use the Penn Treebank - if you are members of the LDC.
>
>Bayan Shawar wrote:
>  
>>Hello,
>>  The British National Corpus is available (BNC
>>Corpus), and it has a spoken part, all of the BNC is
>>tagged using PoS tags.
>>
>>/Thanks,
>>Delip Rao
>>-----------
>>Hopefully this is useful,
>>Bayan
>>--- Delip Rao <deliprao at yahoo.com> wrote:
>>
>>    
>>>Hello All,
>>>
>>>Is there any freely available part-of-speech tagged
>>>corpus for research/non-commercial use?
>>>
>>>
>>>
>>>AIDB LAB,
>>>IIT MADRAS
>>>
>>>
>>>	
>>>	
>>>		
>>>__________________________________ 
>>>Do you Yahoo!? 
>>>New and Improved Yahoo! Mail - 1GB free storage! 
>>>http://sg.whatsnew.mail.yahoo.com
>>>
>>>
>>>      
>>
>>		
>>___________________________________________________________ 
>>How much free photo storage do you get? Store your holiday 
>>snaps for FREE with Yahoo! Photos http://uk.photos.yahoo.com
>>
>>
>>
>>    
>
>
>  



More information about the Corpora mailing list