[Corpora-List] ResPubliQA 2009: Prelilminary Call for Participation
Pamela Forner
forner at celct.it
Fri Dec 19 14:47:22 UTC 2008
Apologies for cross-posting
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ResPubliQA exercise at the
MULTILINGUAL QUESTION ANSWERING TRACK AT CLEF 2009
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
------------------------------------------------------------------
Preliminary Call for Participation
------------------------------------------------------------------
We are glad to announce that a NEW EXERCISE will be proposed this year within the Question Answering track at CLEF. For more information and instructions visit the new ResPubliQA website at:
http://celct.isti.cnr.it/ResPubliQA/index.php
We invite participation, both from academic institutions and industrial organizations, on this new task.
Guidelines describing the task will be distributed among the participants and will be downloadable from the ResPubliQA website. Participants will also be provided with training data in order to have the opportunity to test the systems with procedures to be used in the formal evaluation campaign. The results of the evaluation will be disseminated at the final workshop which will be organized in Corfu in conjunction with ECDL 2009.
ResPubliQA 2009: TASK OVERVIEW
TASK DESCRIPTION: Systems receive natural language questions as input, and must return one paragraph containing the answer from the document collection. No exact answer is required neither multiple responses.
DOCUMENT COLLECTION: The subset of JRC-Acquis documents that have parallel aligned translations into all the languages involved will be used, namely Bulgarian, Dutch, English, French, German, Italian, Portuguese, Romanian and Spanish. The sub-collection is available at the ResPubliQA website http://celct.isti.cnr.it/ResPubliQA/Downloads
QUESTIONS: a pool of 500 independent questions (factoid, definition, reason, purpose and procedure) is provided
o NO LIST questions
o NO topic related questions (questions linked to the same topic)
o NO NIL questions
ANSWERS: one of the following two responses must be returned
a) one single paragraph containing the candidate answer. Multi-paragraph answers are not considered in this task
b) the string NOA to indicate that the system prefers not to answer the question.
Systems that give no answers (NOA) instead of wrong answers will be rewarded by the evaluation measure. Answer Validation techniques (including Machine Learning) are expected to be used for taking this final decision.
LANGUAGES INVOLVED: Basque (EU), Bulgarian (BG), Dutch (NL), English (EN), French (FR), German (DE), Italian (IT), Portuguese (PT), Romanian (RO) and Spanish (ES).
A monolingual English (EN) task will also be activated this year, as both the exercize and the collection are different from TREC.
Basque has been included exclusively as a source language, as there is no Basque collection available.
SCHEDULE
Corpora Release: December, 19 2008
Preliminary Track Guidelines: December, 19 2008
Training data: December 19, 2008
Registration Open: February 1, 2009
Final Track Guidelines: February 1, 2009
Test Sets Release: May 25, 2009
Submissions of Runs by Participants: June 5, 2009
Release of Individual Results: from July 15, 2009
Submission of Papers for Working Notes: August 14, 2009
CLEF Workshop (in Corfu, Greece): 30 September - 02 October 2009
The participants will have 5 DAYS to upload their submissions, starting from the moment when the questions are downloaded, and not later than June 5, 2009.
TRACK COORDINATORS AND ORGANIZERS
- UNED (coordinator)
Spanish Distance Learning University, Spain
Anselmo Peñas
- CELCT (coordinator)
Center for the Evaluation of Language and Communication Technology, Italy
Pamela Forner and Danilo Giampiccolo
- ELDA/ELRA
Evaluations and Language Resources Distribution Agency, France
Nicolas Moreau
- University of Limerick, Ireland
Richard Sutcliffe
- BTB
Bulgarian Academy of Science, Bulgaria
Petya Osenova
- UAIC and RACAI, Romania
Alexandru Ioan Cuza University and Romanian Academy Research Institute for Artificial Intelligence, Romania
Corina Forascu
- UBC
University of Basque Country, Spain
Iñaki Alegria
================================
Pamela Forner
CELCT (web: www.celct.it<http://www.celct.it/>)
Center for the Evaluation of Language and Communication Technologies
Via alla Cascata 56/c
38100 Povo - TRENTO -Italy
email: forner at celct.it<mailto:forner at celct.it>
tel.: +39 0461 314 804
fax: +39 0461 314 846
Secretary Phone: +39 0461 314 870
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20081219/b880cb35/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list