[ACLA-CAAL] Third CfP MUMTTT 2022 - The 5th Workshop on Multi-word Units in Machine Translation and Translation Technology (Malaga)

Gennaro Nolano nolanogenn at gmail.com
Mon Jun 27 11:57:02 UTC 2022

*apologies for cross-posting*


The 5th Workshop on Multi-word Units in Machine Translation and Translation
Technology(MUMTTT 2022)

Malaga, 30th September 2022

In conjunction with EUROPHRAS 2022 – International Conference
“‘Computational and Corpus-based Phraseology’”

Third Call for Papers

Following the success of the previous editions of the workshops on
Multi-word Units in Machine Translation and TranslationTechnology–the 2013
edition at the MT Summit in Nice, France, and the editions at the European
Society of Phraseology Conference in Malaga, Spain (2015 and 2019), in
London, UK (2017)  we are announcing the fifth edition to be held in
conjunction with the International Conference “Computational and
Corpus-based Phraseology which is jointly organised by the European
Association for Phraseology (EUROPHRAS), the University of Malaga (Research
Group in Lexicography and Translation <http://lexytrad.es/>, IUITLM
<http://iuitlm.uma.es/>), the University of Wolverhampton (Research Group
in Computational Linguistics <http://rgcl.wlv.ac.uk/>, RIILP
<http://www.wlv.ac.uk/riilp>) and the Bulgarian Association for
Computational Linguistics <http://acl-bg.org/>.

The MUMTTT workshop will be held on the last day of the conference, 30th
September 2022.  It will provide a forum for researchers and practitioners
in the fields of (Computational) Linguistics, (Computational) Phraseology,
Translation Studies and Translation Technology to discuss recent advances
in the area of multi-word unit processing and to coordinate research
efforts across disciplines in order to improve the integration of
multi-word units in machine translation and translation technology tools.

Multi-word units are word combinations which range from compounds such as
‘credit card’ to idiomatic expressions such as “’it is raining cats and
dogs’ and are acknowledged as one of the major challenges in natural
language processing (NLP), because of their lexical, syntactic, semantic,
pragmatic and/or statistical idiosyncrasies.

In spite of the relative progress achieved in translation technology with
the adoption of neural approaches and in the processing of particular types
of units such as verb-particle constructions, the identification,
interpretation and translation of multi-word units in general still
represent open challenges, both from a theoretical and a practical point of
view. The idiosyncratic morpho-syntactic, semantic and translational
properties of multi-word units pose many obstacles even to human
translators, mainly because of intrinsic ambiguities, structural and
lexical asymmetries between languages, and, finally, cultural differences.

In recent years, growing attention has been paid to integrating multi-word
units (MWUs) in machine translation and translation technology tools, as it
has been acknowledged that it is not possible to create large scale
language solutions without properly handling MWUs of all types. As a matter
of fact, researchers are now addressing the problems posed by MWU
processing and translation using different formalisms and techniques, such
as automatic recognition of MWUs in a monolingual or bilingual setting;
alignment and paraphrasing techniques; development and use of (handcrafted)
monolingual and bilingual language resources; creation of annotated
monolingual and parallel corpora, development of strategies for handling
syntactically flexible units in language analysis and translation modules,
development of evaluation projects. The aim of the workshop is to bring
together researchers and practitioners working on MWU processing from
various perspectives, in order to enable cross-fertilisation and foster the
creation of innovative solutions that can only arise from interdisciplinary
collaborations. In particular, the workshop welcomes interactions between
NLP researchers working on the computational treatment of multi-word units,
experts in (computational) phraseology working on challenging topics of
their discipline, as well as translation practitioners, to the benefit of
applying their latest results to advance the state of the art in MWU
Topics of Interest

The MUMTTT 2022 workshop invites the submission of papers reporting on
original and unpublished research on topics related to MWU processing in
machine translation and translation technology, including


   Lexical, syntactic, semantic and translational aspects in MWU

   Theoretical approaches to MWUs (e.g., collostructional analysis of MWU,
   cognitive approaches to processing MWUs, etc.)

   Development of multilingual MWU resources

   Identification and acquisition of MWUs and variants

   Learning semantic information about MWUs from monolingual, parallel or
   comparable corpora

   Development and use of MWU resources in machine translation and
   translation technology

   Development of corpora for the extraction and translation of MWUs

   Compilation of resources for the extraction and translation of multiword

   Creation of MWU-annotated corpora with a focus on translation aspects

   Paraphrasing of MWUs applied to the improvement of machine translation

   MWUs and word alignment techniques

   MWUs in machine translation

   MWUs in translation memory systems

   MWUs in term extraction

   MWU-centred machine translation evaluation

   Evaluation of MWU translation

   MWUs in CAT tools

   Multilingualism and MWU processing

   Psycholinguistic studies of MWU processing in a bilingual setting.

Submission Guidelines

Submissions must consist of full-text papers and should not exceed 7 pages
excluding references, they should be a minimum of 5 pages long. The
accepted papers will be published as workshop e-proceedings together with
Europhras short papers and posters with ISBN, will be assigned a DOI and
will be also available at the time of the conference. The papers should be
in English.

Authors of accepted papers will receive guidelines regarding how to produce
camera-ready versions of their papers for inclusion in the proceedings.

Each submission will be reviewed by at least three programme committee
members. Accepted papers will be presented orally or as posters, as
determined by the programme committee. There will be no distinction in the
workshop proceedings between papers presented orally or as posters.



Access this link <https://www.softconf.com/m/europhras2022/> to make a
submission via the START conference management system (Please, be sure to
select MUMTTT as Topic). The tutorial on how to create a START account can
be found here

Submissions should be compliant with the above templates and should be
uploaded as pdf files in START. (START is configured to accept pdf files

Should you require any assistance with the submission, please do not
hesitate to contact us at mumttt2022 at gmail.com.
Important Dates

Deadline for paper submission:  10th June 2022  15th July 2022 – EXTENDED

Acceptance notification: 8th July 2022 31st July 2022

Final camera-ready version:  2nd September 2022

MUMTTT 2022 workshop: 30th  September  2022
Workshop Chairs

Gloria Corpas Pastor, Universidad de Málaga, Spain

Ruslan Mitkov, University of Wolverhampton, United Kingdom

Johanna Monti, Università degli Studi di Napoli “L’Orientale” Italy

Maria Pia di Buono, Università degli Studi di Napoli “L’Orientale” Italy
Organising Committee

Gennaro Nolano, Università degli Studi di Napoli “L’Orientale” Italy

Giulia Speranza, Università degli Studi di Napoli “L’Orientale” Italy

Khadija Ait ElFqih, Università degli Studi di Napoli “L’Orientale” Italy

*Gennaro Nolano | **Publications
<https://scholar.google.com/citations?hl=en&user=iYyekwYAAAAJ> | *
PhD Student in NLP & Digital Humanities
Unior NLP Research Group @ University of Napoli "L'Orientale"
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/acla-caal/attachments/20220627/8c66b041/attachment-0001.htm>

More information about the ACLA-CAAL mailing list