[Corpora-List] Tree-Structured Named Entities corpora ?

Damien Nouvel damien at nouvels.net
Mon Dec 9 14:58:31 UTC 2013


Hi all,
Bumping this thread: I'd be very interested to know whether such a corpus
exists for English with MUC/CoNLL entity types (pers, loc, org, etc.).

Best,
Damien


2013/12/9 Yoann Dupont <yoa.dupont at gmail.com>

> Thank you very much M. Galibert.
>
> Best regards,
>
>
> 2013/12/9 Galibert Olivier <Olivier.Galibert at lne.fr>
>
>>
>>   Hi,
>>
>> The Quaero named entities annotation guide follows that kind of
>> structure.  Two corpora are already available through ELRA/ELDA:
>> - ELRA-S0349 Quaero Broadcast News Extended Named Entity corpus
>> - ELRA-W0073 Quaero Old Press Extended Named Entity corpus
>>
>> A third one, linked to the ETAPE evaluation, should be made available
>> sometime next year.
>>
>> The annotation guide is available at
>> http://www.quaero.org/media/files/bibliographie/quaero-guide-annotation-2011.pdf
>>
>> Best,
>>
>>   OG.
>>
>>
>> -----Original Message-----
>> From: corpora-bounces at uib.no on behalf of Yoann Dupont
>> Sent: Mon 12/9/2013 11:29 AM
>> To: corpora at uib.no
>> Subject: [Corpora-List] Tree-Structured Named Entities corpora ?
>>
>> Greetings Corpora-List,
>>
>> I am currently looking for corpora with tree-structured named entities.
>>
>> A simple example of tree structure would be a person who has a first
>> and last name: "Barack Obama" is a person whose first name is "Barack"
>> and last name is "Obama". A parse would then be:
>> (PER (NAME.FIRST Barack) (NAME.LAST Obama))
>> Another example would be geographical addresses.
>>
>> I know some corpora that could fit this definition : the SemEval'2007 task
>> 9 corpora (tree-structured NE in Spanish and Catalan) and the GENIA corpus
>> (tree-structured NE for biomedical entities in English).
>>
>> Do any of you know of other tree-structured NE corpora?
>>
>> Thank you kindly in advance,
>>
>> --
>> Yoann DUPONT
>>
>>
>
>
> --
> Yoann DUPONT
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
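
For readers unfamiliar with the bracketed notation used in the quoted
message, here is a minimal Python sketch that reads a string such as
(PER (NAME.FIRST Barack) (NAME.LAST Obama)) into a nested structure. The
function name parse_entity_tree and the (label, children) tuple layout are
illustrative assumptions, not part of any corpus format or tool mentioned
in this thread.

```python
import re


def parse_entity_tree(text):
    """Parse '(LABEL child ...)' into a (label, children) tuple.

    Children are either nested (label, children) tuples or plain token
    strings. This is a hypothetical helper for illustration only.
    """
    # Split into parentheses and runs of non-space, non-parenthesis characters.
    tokens = re.findall(r"\(|\)|[^\s()]+", text)
    pos = 0

    def peek():
        if pos >= len(tokens):
            raise ValueError("unexpected end of input")
        return tokens[pos]

    def parse_node():
        nonlocal pos
        if peek() != "(":
            raise ValueError("expected '('")
        pos += 1
        label = peek()
        if label in ("(", ")"):
            raise ValueError("missing entity label")
        pos += 1
        children = []
        while peek() != ")":
            if peek() == "(":
                children.append(parse_node())  # nested entity component
            else:
                children.append(peek())  # plain token
                pos += 1
        pos += 1  # consume the closing ')'
        return (label, children)

    tree = parse_node()
    if pos != len(tokens):
        raise ValueError("trailing input after the closing ')'")
    return tree


tree = parse_entity_tree("(PER (NAME.FIRST Barack) (NAME.LAST Obama))")
print(tree)
# ('PER', [('NAME.FIRST', ['Barack']), ('NAME.LAST', ['Obama'])])
```

The same nesting would carry over to the address case: an outer entity
label wrapping labelled components, each of which covers one or more tokens.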


-- 
damien at nouvels.net
GSM: +33 (0) 6 63 56 27 17