33.580, Calls: Comp Ling, Gen Ling, Semantics, Text/Corpus Ling, Translation/France

The LINGUIST List linguist at listserv.linguistlist.org
Tue Feb 15 07:32:40 UTC 2022


LINGUIST List: Vol-33-580. Tue Feb 15 2022. ISSN: 1069 - 4875.

Subject: 33.580, Calls: Comp Ling, Gen Ling, Semantics, Text/Corpus Ling, Translation/France

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
      Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Tue, 15 Feb 2022 02:31:50
From: Reinhard Rapp [reinhardrapp at gmx.de]
Subject: 15th Workshop on Comparable Corpora with Shared Task on Multilingual Term Extraction

 
Full Title: 15th Workshop on Comparable Corpora with Shared Task on Multilingual Term Extraction 
Short Title: BUCC 2022 

Date: 25-Jun-2022 - 25-Jun-2022
Location: Marseille, France 
Contact Person: Reinhard Rapp
Meeting Email: reinhardrapp at gmx.de
Web Site: https://comparable.limsi.fr/bucc2022/ 

Linguistic Field(s): Computational Linguistics; General Linguistics; Semantics; Text/Corpus Linguistics; Translation 

Call Deadline: 10-Apr-2022 

Meeting Description:

The workshop is devoted to all topics related to comparable (and parallel)
corpora.


Call for Papers:

**************************************************************

15th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA (BUCC)
WITH SHARED TASK ON MULTILINGUAL TERMINOLOGY EXTRACTION FROM COMPARABLE
CORPORA

Co-located with LREC 2022 (Marseille)

Saturday, June 25, 2022

Paper submission deadline: April 10, 2022

Workshop website: https://comparable.limsi.fr/bucc2022/

Shared task website: https://comparable.limsi.fr/bucc2022/bucc2022-task.html

LREC website: https://lrec2022.lrec-conf.org/en/

**************************************************************

MOTIVATION

In the language engineering and the linguistics communities, research in
comparable corpora has been motivated by two main reasons. In language
engineering, on the one hand, it is primarily motivated by the need to use
comparable corpora as training data for statistical NLP applications such as
statistical and neural machine translation or cross-lingual information
retrieval. In linguistics, on the other hand, comparable corpora are of
interest because they enable cross-language discoveries and comparisons. It is
generally accepted in both communities that comparable corpora consist of
documents that are comparable in content and form in various degrees and
dimensions across several languages, dialects, or varieties. Parallel corpora
are on the one end of this spectrum, unrelated corpora on the other.

TOPICS

We solicit contributions on all topics related to comparable (and
parallel) corpora, including but not limited to the following:

Building Comparable Corpora:

* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora

Applications of comparable corpora:

* Human translation
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual and multilingual projections
* (Unsupervised) machine translation
* Writing assistance
* Machine learning techniques using comparable corpora

Mining from Comparable Corpora:

* Cross-language distributional semantics and pre-trained multilingual
transformer models
* Creation of bilingual and multilingual embeddings from comparable corpora 
* Methods to derive parallel from non-parallel corpora (e.g. to provide for
low-resource languages in neural machine translation)
* Extraction of bilingual and multilingual translations of single words,
multi-word expressions, proper names, named entities, sentences, and
paraphrases from comparable corpora, etc.
* Induction of morphological, grammatical, and translation rules from
comparable corpora
* Induction of multilingual word classes from comparable corpora

Comparable Corpora in the Humanities:

* Comparing linguistic phenomena across languages in contrastive linguistics
* Analyzing properties of translated language in translation studies
* Studying language change over time in diachronic linguistics
* Assigning texts to authors via authors' corpora in forensic linguistics
* Comparing rhetorical features in discourse analysis
* Studying cultural differences in sociolinguistics
* Analyzing language universals in typological research

IMPORTANT DATES

April 10, 2022: Paper submission deadline
May 3, 2022: Notification of acceptance
May 23, 2022: Camera ready final papers
June 25, 2022: Workshop date

For updates see the workshop website at
https://comparable.limsi.fr/bucc2022/

PRACTICAL INFORMATION

Registration for the workshop will be via the main conference website at
https://lrec2022.lrec-conf.org/en/

SUBMISSION GUIDELINES

Please follow the style sheet and templates provided for the main 
conference at https://lrec2022.lrec-conf.org/en/submission2022/authors-kit/
Papers should be submitted as a PDF file using the START conference
manager at https://www.softconf.com/lrec2022/BUCC/




------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-33-580	
----------------------------------------------------------






More information about the LINGUIST mailing list