29.256, FYI: Duolingo Shared Task on Second Language Acquisition Modeling

The LINGUIST List linguist at listserv.linguistlist.org
Tue Jan 16 21:46:30 UTC 2018


LINGUIST List: Vol-29-256. Tue Jan 16 2018. ISSN: 1069 - 4875.

Moderators: linguist at linguistlist.org (Damir Cavar, Malgorzata E. Cavar)
Reviews: reviews at linguistlist.org (Helen Aristar-Dry, Robert Coté,
                                   Michael Czerniakowski)
Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           http://funddrive.linguistlist.org/donate/

Editor for this issue: Kenneth Steimel <ken at linguistlist.org>
================================================================


Date: Tue, 16 Jan 2018 16:46:19
From: Bozena Pajak [bozena at duolingo.com]
Subject: Duolingo Shared Task on Second Language Acquisition Modeling

 
2018 Duolingo Shared Task on Second Language Acquisition Modeling (SLAM)

Duolingo invites research teams to participate in the first SLA Modeling
(SLAM) Shared Task, in conjunction with the 13th BEA Workshop and the
NAACL-HLT 2018 conference. You can access the detailed task description at:
http://sharedtask.duolingo.com .

The goal of this task is to predict future mistakes that learners of English,
Spanish, and French will make, based on their history of past mistakes.
The data set contains more than 2 million tokens (words) from
exercises submitted by 6,000+ students over the course of their first 30 days
using Duolingo (https://www.duolingo.com).

New and interesting research opportunities in this task:

- There are three tracks for learners of (1) English, (2) Spanish, and (3)
French. Teams are encouraged to explore features which generalize across all
three languages.
- Anonymized learner IDs and time data will be provided. This allows teams to
explore various personalized, adaptive SLA modeling approaches.
- The sequential nature of the data also allows teams to model language
learning (and forgetting!) over time.

Training and development data, baseline code, and evaluation scripts are now
ready and available for the task. Test data will be released in February 2018,
with final evaluations taking place in March. For more details, please consult
the task website.

Shared Task Website:

http://sharedtask.duolingo.com

Shared Task Discussion Group:

https://groups.google.com/forum/#!forum/sla-modeling

Important Dates:

Jan 10, 2018 - Data release (phase 1): TRAIN and DEV sets
Feb 19, 2018 - Data release (phase 2): blind TEST set
Mar 19, 2018 - Final predictions deadline
Mar 21, 2018 - Final results announcement
Mar 28, 2018 - Draft system papers due
Apr 16, 2018 - Camera-ready system papers due
Jun 05, 2018 - Workshop at NAACL-HLT in New Orleans!

Task Organizers:

Burr Settles (Duolingo), Chris Brust (Duolingo), Erin Gustafson (Duolingo),
Masato Hagiwara (Duolingo), Bozena Pajak (Duolingo), Joseph Rollinson
(Duolingo), Hideki Shima (Duolingo), Nitin Madnani (ETS)

Best regards,
SLAM Shared Task Organizers
 



Linguistic Field(s): Computational Linguistics





 


