[Corpora-List] Tree-Structured Named Entities corpora ?

Khalid CHOUKRI choukri at elda.org
Mon Dec 9 17:17:48 UTC 2013


Hi Yoann

I am cc this email to Valerie,  she will check if we do have any corresponding English data (and Olivier mentioned the French ones)

Best regards
Khalid

Galibert Olivier wrote, On 09/12/2013 11:49:
>    Hi,
>
> The Quaero named entities annotation guide follows that kind of structure.  Two corpora are already available through ELRA/ELDA:
> - ELRA-S0349 Quaero Broadcast News Extended Named Entity corpus
> - ELRA-W0073 Quaero Old Press Extended Named Entity corpus
>
> A third one, linked to the ETAPE evaluation, should be made available sometimes next year.
>
> The annotation guide is available at http://www.quaero.org/media/files/bibliographie/quaero-guide-annotation-2011.pdf
>
> Best,
>
>    OG.
>
>
> -----Original Message-----
> From: corpora-bounces at uib.no on behalf of Yoann Dupont
> Sent: Mon 12/9/2013 11:29 AM
> To: corpora at uib.no
> Subject: [Corpora-List] Tree-Structured Named Entities corpora ?
>   
> Greetings Corpora-List,
>
> I am currently looking for corpora with tree-structured named entities.
>
> A simple example of tree structuration would be a person which has a first
> and last name : "Barack Obama" is a person whose first name is "Barack" and
> last name is "Obama". A parsing would then be : *(PER (NAME.FIRST* Barack*)
> (NAME.LAST* Obama*))*
> Another example would be geographical addresses.
>
> I know some corpora that could fit this definition : the SemEval'2007 task
> 9 corpora (tree-structured NE in Spanish and Catalan) and the GENIA corpus
> (tree-structured NE for biomedical entities in English).
>
> Does any of you know other tree-structured NE corpora ?
>
> Thank you kindly in advance,
>

-- 

*Khalid Choukri *
ELRA General secretary & ELDA CEO
email: choukri at elda.org;
Web: www.elra.info www.elda.org
Tel. +33 1 43 13 33 33 - Fax. +33 1 43 13 33 30

****************************************************
** Info on LREC: www.lrec-conf.org
****************************************************




*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20131209/872fb54c/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list