Conf: MWE 2010, July 31 2010, Beijing, China

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Fri Jul 16 19:12:49 UTC 2010

Date: Wed, 14 Jul 2010 22:20:12 +0200
From: Eric Laporte <eric.laporte at>
Message-ID: <4C3E1BFC.9070307 at>

                MWE'2010: Call for Participation

       *** Early bird registration deadline extended!*** 
                        July 31, 2010 

                          COLING 2010
               Workshop on Multiword Expressions:
             from Theory to Applications (MWE 2010)


  endorsed by the Special Interest Group on the Lexicon of the
       Association for Computational Linguistics (SIGLEX)

                          Beijing, China


Multiword Expressions (MWEs) are a ubiquitous component of natural
languages and appear steadily on a daily basis, both in specialized
and in general-purpose communication. While easily mastered by native
speakers, their interpretation poses a major challenge for automated
analysis due to their flexible and heterogeneous nature.  Therefore,
the automated processing of MWEs is desirable for any natural language
application that involves some degree of semantic interpretation,
e.g., Machine Translation, Information Extraction, and Question

In spite of the recent advances in the field, there is a wide range of
open problems that prevent MWE treatment techniques from full
integration in current NLP systems. In MWE'2010, we will be interested
in major challenges in the overall process of MWE treatment, asking
for original research related but not limited to the following topics:

* MWE resources: Although underused in most current state-of-the-art
  approaches, resources are key for developing real-world applications
  capable of interpreting MWEs. We call for papers describing the
  process of building MWE resources, constructed both manually and
  automatically from text corpora; we are also interested in assessing
  the usability of such resources in various MWE tasks.

* Hybrid approaches: We invite research on integrating heterogeneous
  MWE treatment techniques and resources in NLP applications. Such
  hybrid approaches can aim, for example, at the combination of
  results from symbolic and statistical approaches, at the fusion of
  manually built and automatically extracted resources, or at the
  design of language learning techniques.

* Domain adaptation: Real-world NLP applications need to be robust to
  deal with texts coming from different domains. We thus call for
  papers assessing the performance of MWE methods across domains or
  describing domain adaptation techniques for MWEs.

* Multilingualism: Parallel and comparable corpora are gaining
  popularity as a resource for automatic MWE discovery and
  treatment. We are also interested in the integration of MWE
  processing in multilingual applications such as machine translation
  and multilingual information retrieval, as well as in porting
  existing monolingual MWE approaches to new languages.


Jul 31, 2010              Early bird registration deadline
Aug 28, 2010              Workshop


    Aravind K. Joshi - University of Pennsylvania, USA 
    Kyo Kageura - University of Tokyo, Japan

* Automatic Extraction of Arabic Multiword Expressions
  Mohammed Attia, Antonio Toral, Lamia Tounsi, Pavel Pecina and Josef
  van Genabith 

* Automatic Extraction of Complex Predicates in Bengali
  Dipankar Das, Santanu Pal, Tapabrata Mondal, Tanmoy Chakroborty and
  Sivaji Bandyopadhyay 

* Computational Lexicography of Multi-Word Units. How Efficient Can It Be?
  Filip Graliński, Agata Savary, Monika Czerepowicka and Filip

* Standardizing Complex Functional Expressions in Japanese Predicates:
  Applying Theoretically-Based Paraphrasing Rules
  Tomoko Izumi, Kenji Imamura, Genichiro Kikui and Satoshi Sato 

* Handling Named Entities and Compound Verbs in Phrase-Based
  Statistical Machine Translation
  Santanu Pal, Sudip Kumar Naskar, Pavel Pecina and Sivaji

* Construction of Chinese Idiom Knowledge-base and Its Applications
  Lei Wang and Shiwen Yu 

* Sentence Analysis and Collocation Identification
  Eric Wehrli, Violeta Seretan and Luka Nerima 

* Application of the Tightness Continuum Measure to Chinese
  Information Retrieval
  Ying Xu, Randy Goebel, Christoph Ringlstetter and Grzegorz Kondrak 

   * Contrastive filtering of domain specific multi-word terms from
     different types of corpora
     Francesca Bonin, Felice Dell'Orletta, Simonetta Montemagni and
     Giulia Venturi 

   * Identification of Reduplication in Bengali Corpus and their
     Semantics Analysis: A Rule Based Approach
     Tanmoy Chakraborty and Sivaji Bandyopadhyay 

   * A Hybrid Approach for Functional Expression Identification in a
     Japanese Reading Assistant
     Gregory Hazelbeck and Hiroaki Saito 
   * An efficient, generic approach to extracting multi-word
     expressions from dependency trees
    Scott Martens and Vincent Vandeghinste 


Inaki Alegria          (University of the Basque Country, Spain)
Dimitra Anastasiou     (Limerick University, Ireland)
Timothy Baldwin        (University of Melbourne, Australia)
Colin Bannard          (University of Texas at Austin, USA)
Francis Bond           (Nanyang Technological University , Singapore)
Paul Cook              (University of Toronto, Canada)
Beatrice Daille        (Nantes University, France)
Gael Dias              (Beira Interior University, Portugal)
Stefan Evert           (University of Osnabrueck, Germany)
Roxana Girju           (University of Illinois at Urbana-Champaign, USA)
Nicole Gregoire        (University of Utrecht, The Netherlands)
Chikara Hashimoto      (National Institute of Information and Communications Technology, Japan)
Marti Hearst           (University of California at Berkeley, USA)
Kyo Kageura            (University of Tokyo, Japan)
Min-Yen Kan            (National University of Singapore, Singapore)
Adam Kilgarriff        (Lexical Computing Ltd, UK)
Su Nam Kim             (University of Melbourne, Australia)
Anna Korhonen          (University of Cambridge, UK)
Zornitsa Kozareva      (University of Southern California, USA)
Brigitte Krenn         (Austrian Research Institute for Artificial Intelligence, Austria)
Cvetana Krstev         (University of Belgrade, Serbia)
Rosamund Moon          (University of Birmingham, UK)
Jan Odijk              (University of Utrecht, The Netherlands)
Stephan Oepen          (University of Oslo, Norway)
Darren Pearce          (London Knowledge Lab, UK)
Pavel Pecina           (Charles University, Czech Republic)
Scott Piao             (Lancaster University, UK)
Thierry Poibeau        (CNRS and École Normale Supérieure, France)
Elisabete Ranchhod     (University of Lisbon, Portugal)
Barbara Rosario        (Intel Labs, USA)
Diarmuid Ó Séaghdha    (University of Cambridge, UK)
Violeta Seretan        (University of Geneva, Switzerland)
Stan Szpakowicz        (University of Ottawa, Canada)
Beata Trawinski        (University of Tuebingen, Germany)
Vivian Tsang           (Bloorview Research Institute, Canada)
Kyioko Uchiyama        (Keio University, Japan)
Ruben Urizar           (University of the Basque Country, Spain)
Tony Veale             (University College Dublin, Ireland)


Eric Laporte           (Universite Paris-Est, France)
Preslav Nakov          (National University of Singapore, Singapore)
Carlos Ramisch         (University of Grenoble, France)
Aline Villavicencio    (Federal University of Rio Grande do Sul, Brazil)

For any inquiries regarding the workshop please send an e-mail to
mwe2010workshop at

Message diffuse par la liste Langage Naturel <LN at>
Informations, abonnement :
English version       : 
Archives                 :

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  :

More information about the Ln mailing list