[Corpora-List] Call for Participation: SemEval10 Task 1
Montse Nofre
montsenofre at ub.edu
Wed Feb 10 10:15:49 UTC 2010
To whom it may concern.
Our apologies if you receive multiple postings of this CFP.
*SemEval-2010*
*Task 1: Coreference Resolution in multiple languages*
(http://stel.ub.edu/semeval2010-coref/)
The purpose of this e-mail is to encourage participation in the task *“Coreference Resolution in multiple languages”* at the 5th International Workshop on Semantic Evaluations, SemEval-2010 (http://semeval2.fbk.eu/semeval2.php).
*GENERAL TASK DESCRIPTION*
Using coreference information has been shown to be beneficial in a
number of NLP applications including Information Extraction, Text
Summarization, Question Answering and Machine Translation. This task is
concerned with automatic coreference resolution for six different
languages: Catalan, Dutch, English, German, Italian and Spanish. Two
tasks are proposed for each of the languages:
* *Full task*. Detection of full coreference chains, composed of
named entities, pronouns, and full noun phrases.
* *Subtask*. Pronominal resolution, i.e., finding the antecedents of
the pronouns in the text.
In particular, we *aim*:
*(i)* To study the *portability* of coreference resolution systems
*across languages* (Catalan, Dutch, English, German, Italian, Spanish)
* To what extent is it possible to implement a general system that
is portable to all six languages?
* How much language-specific tuning is necessary?
* Are there significant differences between Germanic and Romance
languages? And between languages of the same family?
*(ii)* To study how helpful *morphology, syntax and semantics* are in
resolving coreference relations.
* How much preprocessing is needed?
* How much does the quality of the preprocessing modules (perfect
linguistic input vs. noisy automatic input) affect the performance
of state-of-the-art coreference resolution systems?
* Is morphology more helpful than syntax? Or semantics? Or is syntax
more helpful than semantics?
*(iii)* To compare four different *evaluation metrics* (MUC, B-CUBED,
CEAF and BLANC) for coreference resolution.
* Do all evaluation metrics provide the same ranking? Is there one
that provides a more accurate picture of a system's performance?
* Is there a strong correlation between them?
* Can statistical systems be optimized under all four metrics at the
same time?
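For readers less familiar with these metrics, here is a minimal Python sketch of two of them, MUC and B-CUBED, on a toy example. It is not the official scorer for the task. The function names (`muc`, `b_cubed`) and the mention labels are hypothetical, and the sketch assumes that key and response contain exactly the same mentions, that chains are disjoint, and that singletons are given as one-element sets.

```python
from typing import List, Set, Tuple

Chains = List[Set[str]]  # each chain is a set of mention identifiers


def _pieces(chain: Set[str], others: Chains) -> int:
    """Count the parts `chain` is split into by the chains in `others`."""
    covered: Set[str] = set()
    parts = 0
    for other in others:
        overlap = chain & other
        if overlap:
            parts += 1
            covered |= overlap
    # Mentions that appear in no chain of `others` count as singleton parts.
    return parts + len(chain - covered)


def _muc_recall(key: Chains, response: Chains) -> float:
    """Link-based recall: links of the key preserved by the response."""
    found = sum(len(k) - _pieces(k, response) for k in key)
    total = sum(len(k) - 1 for k in key)
    return found / total if total else 0.0


def muc(key: Chains, response: Chains) -> Tuple[float, float]:
    """Return (precision, recall); precision swaps key and response."""
    return _muc_recall(response, key), _muc_recall(key, response)


def b_cubed(key: Chains, response: Chains) -> Tuple[float, float]:
    """Return (precision, recall) averaged over individual mentions."""
    key_of = {m: k for k in key for m in k}
    resp_of = {m: r for r in response for m in r}
    mentions = list(key_of)
    precision = sum(
        len(key_of[m] & resp_of[m]) / len(resp_of[m]) for m in mentions
    ) / len(mentions)
    recall = sum(
        len(key_of[m] & resp_of[m]) / len(key_of[m]) for m in mentions
    ) / len(mentions)
    return precision, recall


# Toy example: the response wrongly merges two gold entities into one chain.
key = [{"a", "b", "c"}, {"d", "e"}]
response = [{"a", "b", "c", "d", "e"}]

print("MUC     (P, R):", muc(key, response))
print("B-CUBED (P, R):", tuple(round(x, 2) for x in b_cubed(key, response)))
```

In this example both metrics give a recall of 1.0, but MUC precision is 0.75 while B-CUBED precision is about 0.52: the same over-merged response is penalized differently, which is why the metrics need not rank systems identically.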
Although we are targeting general systems that address the full multilingual
task, participants may take part in any full task or subtask for any language.
*ORGANIZERS*
* Véronique Hoste <http://webs.hogent.be/%7Evhos368/> (Hogeschool Gent)
* Lluís Màrquez <http://www.lsi.upc.edu/%7Elluism/> (TALP,
Universitat Politècnica de Catalunya)
* M. Antònia Martí <http://clic.ub.edu/en/angles-toni> (CLiC,
University of Barcelona)
* Massimo Poesio <http://cswww.essex.ac.uk/staff/poesio/>
(University of Essex / Università di Trento)
* Marta Recasens <http://clic.ub.edu/en/marta+recasens> (CLiC,
University of Barcelona)
* Emili Sapena <http://www.lsi.upc.edu/%7Eesapena/> (TALP,
Universitat Politècnica de Catalunya)
* Mariona Taulé <http://clic.ub.edu/en/node/101> (CLiC, University
of Barcelona)
* Yannick Versley <http://www.versley.de/> (Universität Tübingen)
*IMPORTANT DATES*
Training data release: February 10, 2010 (from http://semeval2.fbk.eu/semeval2.php)
Deadline for submission of systems: April 2, 2010
*VENUE*
The workshop will be held in conjunction with ACL 2010, July 11-16, in Uppsala
(http://acl2010.org/).
For more information, please visit: http://stel.ub.edu/semeval2010-coref/