[Corpora-List] Tree-Structured Named Entities corpora ?

Yoann Dupont yoa.dupont at gmail.com
Wed Dec 11 13:35:46 UTC 2013


Hi all,

Khalid, thanks a lot for forwarding this resquest.

Best regards,


2013/12/9 Khalid CHOUKRI <choukri at elda.org>

>      Hi Yoann
>
> I am cc this email to Valerie,  she will check if we do have any corresponding English data (and Olivier mentioned the French ones)
>
> Best regards
> Khalid
>
>
> Galibert Olivier wrote, On 09/12/2013 11:49:
>
>   Hi,
>
> The Quaero named entities annotation guide follows that kind of structure.  Two corpora are already available through ELRA/ELDA:
> - ELRA-S0349 Quaero Broadcast News Extended Named Entity corpus
> - ELRA-W0073 Quaero Old Press Extended Named Entity corpus
>
> A third one, linked to the ETAPE evaluation, should be made available sometimes next year.
>
> The annotation guide is available at http://www.quaero.org/media/files/bibliographie/quaero-guide-annotation-2011.pdf
>
> Best,
>
>   OG.
>
>
> -----Original Message-----
> From: corpora-bounces at uib.no on behalf of Yoann Dupont
> Sent: Mon 12/9/2013 11:29 AM
> To: corpora at uib.no
> Subject: [Corpora-List] Tree-Structured Named Entities corpora ?
>
> Greetings Corpora-List,
>
> I am currently looking for corpora with tree-structured named entities.
>
> A simple example of tree structuration would be a person which has a first
> and last name : "Barack Obama" is a person whose first name is "Barack" and
> last name is "Obama". A parsing would then be : *(PER (NAME.FIRST* Barack*)
> (NAME.LAST* Obama*))*
> Another example would be geographical addresses.
>
> I know some corpora that could fit this definition : the SemEval'2007 task
> 9 corpora (tree-structured NE in Spanish and Catalan) and the GENIA corpus
> (tree-structured NE for biomedical entities in English).
>
> Does any of you know other tree-structured NE corpora ?
>
> Thank you kindly in advance,
>
>
>
> --
>
> * Khalid Choukri *
> ELRA General secretary & ELDA CEO
> email: choukri at elda.org;
> Web: www.elra.info www.elda.org
> Tel. +33 1 43 13 33 33 - Fax. +33 1 43 13 33 30
>
>
>
>
>
>
>
>
> * *************************************************** ** Info on LREC:
> www.lrec-conf.org <http://www.lrec-conf.org>
> **************************************************** *
>



-- 
Yoann DUPONT
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20131211/395fcd1a/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list