33.2400, Calls: English; Portuguese; Spanish; Computational Linguistics/United Arab Emirates

The LINGUIST List linguist at listserv.linguistlist.org
Wed Aug 3 06:27:58 UTC 2022


LINGUIST List: Vol-33-2400. Wed Aug 03 2022. ISSN: 1069 - 4875.

Subject: 33.2400, Calls: English; Portuguese; Spanish; Computational Linguistics/United Arab Emirates

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
        Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Hosted by Indiana University

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Wed, 03 Aug 2022 06:16:43
From: Horacio Saggion [horacio.saggion at upf.edu]
Subject: Shared Task on Lexical Simplification for English, Portuguese and Spanish

 
Full Title: Shared Task on Lexical Simplification for English, Portuguese and Spanish
Short Title: TSAR2022-ST 

Date: 08-Dec-2022 - 08-Dec-2022
Location: Abu Dhabi, United Arab Emirates 
Contact Person: Horacio Saggion
Meeting Email: horacio.saggion at upf.edu
Web Site: https://taln.upf.edu/pages/tsar2022-st/#home 

Linguistic Field(s): Computational Linguistics 

Subject Language(s): English (eng)
                     Portuguese (por)
                     Spanish (spa)

Call Deadline: 07-Sep-2022 

Meeting Description:

Shared Task on Lexical Simplification for English, Portuguese and Spanish

In conjunction with the TSAR-2022 Workshop @EMNLP2022

*** CALL FOR PARTICIPATION ***
*** CHECK WEB SITE FOR DETAILED INFORMATION ***

Lexical Simplification (LS) is the process of reducing the lexical complexity
of a text by replacing difficult words with expressions that are easier to
read (or understand) while preserving the original information and meaning.
LS aims to facilitate reading comprehension for different target readerships,
such as foreign language learners, native speakers with low literacy levels,
second language learners, or people with different kinds of reading
impairments. This new Lexical Simplification Shared Task features three
similar datasets in three different languages: English, Brazilian Portuguese,
and Spanish.

Definition of the task 

Given a sentence containing a complex word, systems should return an ordered
list of “simpler” valid substitutes for the complex word in its original
context. The list of simpler words (up to a maximum of 10) returned by the
system should be ordered by the confidence the system has in its prediction
(best predictions first). The ordered list must not contain ties. 

An instance of the task for the English language is:

1. “That prompted the military to deploy its largest warship, the BRP Gregorio
del Pilar, which was recently acquired from the United States.”

Complex word: deploy

For this instance a system may suggest the following ranked substitutes: send,
move, position, redeploy, employ, situate…

Systems should only produce simplifications that are good contextual fits
(semantically and syntactically).
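
The output constraints in the task definition (a ranked list, best first, at
most 10 items, no ties) can be sketched as follows. This is only an
illustration: the function name, the score dictionary, and the alphabetical
tie-break are assumptions made for the example and are not part of the shared
task guidelines. How a real system scores candidates is outside the sketch.

```python
from typing import Dict, List

MAX_SUBSTITUTES = 10  # upper bound stated in the task definition


def rank_substitutes(scores: Dict[str, float]) -> List[str]:
    """Return up to MAX_SUBSTITUTES candidates, best first, with no ties.

    `scores` is a hypothetical mapping from candidate word to a confidence
    value produced by some system.
    """
    # Merge case variants of the same word, keeping the highest score.
    best: Dict[str, float] = {}
    for word, score in scores.items():
        key = word.strip().lower()
        if not key:
            continue
        best[key] = max(score, best.get(key, float("-inf")))

    # Sort by descending confidence. Equal scores are broken alphabetically
    # (an assumption of this sketch) so the result is a strict ordering.
    ordered = sorted(best.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ordered[:MAX_SUBSTITUTES]]


# Made-up confidence values for the "deploy" instance above.
example = {"send": 0.9, "move": 0.7, "position": 0.7, "Send": 0.4}
print(rank_substitutes(example))  # ['send', 'move', 'position']
```

Any deterministic tie-break would serve; the point is that the submitted list
is a strict ranking with no duplicates.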

Participating teams can register (details below) for three different tracks,
one per language: English monolingual (EN), Portuguese (Brazilian)
monolingual (PT-BR), and Spanish monolingual (ES).

Participating teams will be allowed to submit up to 3 runs per track. 

Data: The three datasets (trial data with gold annotations and test data
without gold annotations) and the evaluation script will be available through
a GitHub repository.

Evaluation Metrics: The evaluation metrics to be applied in the TSAR-2022
Shared Task are the following: MAP@K, Potential@K, and Accuracy@K@top1 (please
consult our Web site for details).
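
As a rough illustration of two of these metrics, the sketch below assumes
that Potential at K counts an instance as correct when at least one of the
top K predictions appears in the gold list, and that Accuracy at K at top1
counts it as correct when the top K predictions contain the most frequently
suggested gold substitute. These readings are assumptions. The official
definitions and the evaluation script on the shared task website are
authoritative. MAP at K is left out because its exact formulation is not
given in this call. Function names and the sample data are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def potential_at_k(preds: Sequence[List[str]],
                   gold: Sequence[List[str]], k: int) -> float:
    """Share of instances with at least one top-k prediction in the gold list."""
    hits = sum(1 for p, g in zip(preds, gold) if set(p[:k]) & set(g))
    return hits / len(preds)


def accuracy_at_k_top1(preds: Sequence[List[str]],
                       gold: Sequence[List[str]], k: int) -> float:
    """Share of instances whose top-k predictions contain the most frequent
    gold substitute. Gold lists keep repeated annotator suggestions."""
    hits = 0
    for p, g in zip(preds, gold):
        if not g:
            continue  # no gold suggestions: counted as a miss
        top1 = Counter(g).most_common(1)[0][0]
        if top1 in p[:k]:
            hits += 1
    return hits / len(preds)


# Made-up predictions and gold annotations for two instances.
preds = [["send", "move", "position"], ["big", "huge"]]
gold = [["send", "send", "move", "station"], ["large", "large", "vast"]]

print(potential_at_k(preds, gold, k=1))      # 0.5
print(accuracy_at_k_top1(preds, gold, k=1))  # 0.5
```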


Call for Participation:

Publication: Participating teams will be invited to submit system description
papers to be published in the TSAR-2022 Workshop proceedings.

Important dates

* Registration opens: July 19th, 2022
* Release of sample/trial instances with gold annotations: July 20th, 2022
* Release of evaluation metrics and code: July 22nd, 2022
* Registration deadline: September 7, 2022
* Test set release (without gold annotations): September 8, 2022
* Submissions of systems' output due: September 15, 2022
* Official results announced: September 30, 2022
* Test set release (with gold annotations): September 30, 2022
* Shared Task papers submission deadline: October 15, 2022
* Shared Task Papers Reviews due: November 1, 2022
* Camera-ready deadline for Shared-task papers: November 10, 2022
* TSAR Workshop and Shared Task: December 8, 2022

Registering your team: 

Please access this form to register for the TSAR-2022 Shared Task on Lexical
Simplification.
https://forms.gle/6iNm5cTRueA78ri17

Website and Shared Task Guidelines

Please visit the TSAR-2022 Shared Task website to obtain further information
about the Guidelines, Datasets, and team registration. 
https://taln.upf.edu/pages/tsar2022-st





