[Corpora-List] free tagged corpus
Carlos Rodriguez
crodriguezp at gmail.com
Thu Nov 17 00:33:06 UTC 2005
The Natural Language Toolkit distributes a data pack alongside its basic
Python modules. It contains parts of the Brown Corpus with POS tags
and a parallel treebank. You can also add other modules for
similarly tagged Spanish/Catalan corpora.
http://nltk.sf.net
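
As a rough illustration of what POS-tagged text of this kind looks like, here is a minimal sketch in plain Python that needs no toolkit installed. It assumes the word/tag layout used by the Brown Corpus files, where each token is written as word/tag and tokens are separated by whitespace. The SAMPLE line is a shortened illustrative sentence, and parse_tagged_line is a hypothetical helper name, not part of NLTK.

```python
from collections import Counter

# Shortened illustrative sentence in the Brown-style word/tag layout.
SAMPLE = (
    "The/at Fulton/np-tl County/nn-tl Grand/jj-tl Jury/nn-tl "
    "said/vbd Friday/nr an/at investigation/nn ./."
)


def parse_tagged_line(line):
    """Split a line of word/tag tokens into (word, tag) pairs."""
    pairs = []
    for token in line.split():
        # Split on the LAST slash so that tokens such as "./." or words
        # that themselves contain a slash keep the word part intact.
        word, sep, tag = token.rpartition("/")
        if not sep:
            continue  # token carries no tag; skip it
        pairs.append((word, tag))
    return pairs


pairs = parse_tagged_line(SAMPLE)

# Count how often each tag occurs in the sample.
tag_counts = Counter(tag for _, tag in pairs)

print(pairs[:3])
print(tag_counts.most_common(3))
```

If a recent NLTK release is installed and its Brown data has been downloaded, nltk.corpus.brown.tagged_words() should give the same kind of (word, tag) pairs directly; the interface of the 2005 release may differ.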
--
Carlos Rodríguez
------------------------
Center for Genomic Sciences, UNAM
Computational Genomics Program
http://www.ccg.unam.mx/Computational_Genomics
radev at umich.edu wrote:
>You can use the Penn Treebank if you are a member of the LDC.
>
>Bayan Shawar wrote:
>
>>Hello,
>>The British National Corpus (BNC) is available, and it
>>has a spoken part. All of the BNC is tagged with PoS
>>tags.
>>
>>Hopefully this is useful,
>>Bayan
>>
>>--- Delip Rao <deliprao at yahoo.com> wrote:
>>
>>>Hello All,
>>>
>>>Is there any freely available part-of-speech tagged
>>>corpus for research/non-commercial use?
>>>
>>>Thanks,
>>>Delip Rao
>>>-----------
>>>AIDB LAB,
>>>IIT MADRAS