[Corpora-List] Pre-annotated Corpus

Kevin B. Cohen kevin.cohen at gmail.com
Fri Sep 24 17:40:12 UTC 2010


Check out OntoNotes, which is now available for free through the LDC.  I
don't know about the RDF triples, but it does have semantic
annotation based on an ontology, which is unusual in the grand scheme of
things.

Kevin

On Fri, Sep 24, 2010 at 10:35 AM, sunny <sbrahayu at gmail.com> wrote:
> Thank you to those who replied. Below is a description of the required
> annotation.
> The research concerns ranking documents in a corpus based on their semantic
> annotation. The corpus needed for the research must be semantically
> annotated based on an ontology. The annotation needs to depict
> relations between concepts in the form of RDF triples.
> Any suggestions are highly appreciated.
>
> On Fri, Sep 24, 2010 at 11:03 PM, Kevin B. Cohen <kevin.cohen at gmail.com>
> wrote:
>>
>> What is your research about?  We need to know what kinds of annotation
>> you need...
>>
>>
>> Kevin
>>
>> On Fri, Sep 24, 2010 at 8:17 AM, sunny <sbrahayu at gmail.com> wrote:
>> > Dear All,
>> > I am looking for a pre-annotated corpus to use as input data for my research
>> > work. Currently, I'm using the free version of OCAS, but it contains too few
>> > articles. The OCAS corpus and ontology can be found here:
>> > "http://idocument.opendfki.de/wiki/Evaluation/Corpus/OlympicGames2004".
>> > Can anybody suggest a pre-annotated corpus, together with the ontology used
>> > for the annotation, please?
>> > Thank you,
>> > Regards,
>> > Syarifah
>> > PhD student
>> > Faculty of Information Science & Technology
>> > Universiti Kebangsaan Malaysia



-- 
Kevin Bretonnel Cohen, PhD
Biomedical Text Mining Group Lead, Center for Computational
Pharmacology, U. Colorado School of Medicine
and
Lead Artificial Intelligence Engineer, The MITRE Corporation, Human
Language Technology Division
303-916-2417 (cell) 303-377-9194 (home)
http://compbio.ucdenver.edu/Hunter_lab/Cohen

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


