31.380, Calls: English; Computational Linguistics/France
The LINGUIST List
linguist at listserv.linguistlist.org
Mon Jan 27 21:33:42 UTC 2020
LINGUIST List: Vol-31-380. Mon Jan 27 2020. ISSN: 1069 - 4875.
Subject: 31.380, Calls: English; Computational Linguistics/France
Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Peace Han, Nils Hjortnaes, Yiwen Zhang, Julian Dietrich
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org
Homepage: http://linguistlist.org
Please support the LL editors and operation with a donation at:
https://funddrive.linguistlist.org/donate/
Editor for this issue: Lauren Perkins <lauren at linguistlist.org>
================================================================
Date: Mon, 27 Jan 2020 16:32:46
From: Reinhard Rapp [reinhardrapp at gmx.de]
Subject: 13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Full Title: 13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Short Title: BUCC 2020
Date: 11-May-2020 - 11-May-2020
Location: Marseille, France
Contact Person: Reinhard Rapp
Meeting Email: reinhardrapp at gmx.de
Web Site: https://comparable.limsi.fr/bucc2020/
Linguistic Field(s): Computational Linguistics
Subject Language(s): English (eng)
Call Deadline: 25-Feb-2020
Meeting Description:
13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Co-located with LREC 2020, Pharo Palace, Marseille, France
Monday, May 11, 2020
SHARED TASK: Bilingual dictionary induction from comparable corpora
Website workshop: https://comparable.limsi.fr/bucc2020/
Website shared task: https://comparable.limsi.fr/bucc2020/bucc2020-task.html
Invited speaker: Holger Schwenk, Facebook AI Research
Call for Papers:
Motivation:
In the language engineering and the linguistics communities, research in
comparable corpora has been motivated by two main reasons. In language
engineering, on the one hand, it is chiefly motivated by the need to use
comparable corpora as training data for statistical NLP applications such as
statistical and neural machine translation or cross-lingual retrieval. In
linguistics, on the other hand, comparable corpora are of interest in
themselves by making possible cross-language discoveries and comparisons. It
is generally accepted in both communities that comparable corpora are
documents in one or several languages that are comparable in content and form
in various degrees and dimensions. We believe that the linguistic definitions
and observations related to comparable corpora can improve methods to mine
such corpora for applications of statistical NLP. As such, it is of great
interest to bring together builders and users of such corpora.
Topics:
We solicit contributions on all topics related to comparable corpora,
including but not limited to the following:
Building Comparable Corpora:
• Human translations
• Automatic and semi-automatic methods
• Methods to mine parallel and non-parallel corpora from the web
• Tools and criteria to evaluate the comparability of corpora
• Parallel vs non-parallel corpora, monolingual corpora
• Rare and minority languages, across language families
• Multi-media/multi-modal comparable corpora
Applications of comparable corpora:
• Human translations
• Language learning
• Cross-language information retrieval & document categorization
• Bilingual projections
• Machine translation
• Writing assistance
• Machine learning techniques using comparable corpora
Mining from Comparable Corpora:
• Induction of morphological, grammatical, and translation rules from
comparable corpora
• Extraction of parallel segments or paraphrases from comparable corpora
• Extraction of bilingual and multilingual translations of single words and
multi-word expressions, proper names, and named entities from comparable
corpora
• Induction of multilingual word classes from comparable corpora
• Cross-language distributional semantics
Submission Information: Please see the BUCC 2018 website at
http://comparable.limsi.fr/bucc2020/
Important Dates:
25 February 2020: Paper submission deadline
12 March 2020: Notification of acceptance
mid March 2020 (tentative): Early bird registration (reduced rates)
2 April, 2020: Camera ready final papers
May 11, 2020: Workshop date
Shared Task: Bilingual dictionary induction from comparable corpora. For
information on the shared task see
https://comparable.limsi.fr/bucc2020/bucc2020-task.html
Workshop and Shared Task Organizers:
Reinhard Rapp (Magdeburg-Stendal University of Applied Sciences and University
of Mainz, Germany), Chair and contact person: reinhardrapp (at] gmx (dot) de
Pierre Zweigenbaum (Université Paris-Saclay,CNRS, Orsay, France)
Serge Sharoff (University of Leeds, United Kingdom)
------------------------------------------------------------------------------
*************************** LINGUIST List Support ***************************
The 2019 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
to find out how to donate and check how your university, country or discipline
ranks in the fund drive challenges. Or go directly to the donation site:
https://iufoundation.fundly.com/the-linguist-list-2019
Let's make this a short fund drive!
Please feel free to share the link to our campaign:
https://funddrive.linguistlist.org/donate/
----------------------------------------------------------
LINGUIST List: Vol-31-380
----------------------------------------------------------
More information about the LINGUIST
mailing list