Appel: 3rd Workshop on Building and Using Comparable Corpora, LREC 2010

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Tue Feb 16 19:53:14 UTC 2010

Date: Tue, 16 Feb 2010 16:19:05 +0100
From: "Reinhard Rapp" <reinhardrapp at>
Message-ID: <EF2D553CD0BA41739E6C0617B9600A33 at HPNOTEBOOK>

Apologies for multiple postings
Please distribute to colleagues


     Last Call for Papers


     Applications of Parallel and Comparable Corpora in
     Natural Language Engineering and the Humanities

     LREC 2010 post-conference workshop, May 22, 2010

     Mediterranean Conference Centre, Valletta, Malta

     Deadline for submissions: February 19, 2010

     Invited speaker: Adam Kilgarriff (Lexical Computing Ltd)


We are inviting submissions for the third edition of the Workshop on
Building and Using Comparable Corpora. Comparable corpora are
collections of documents that are comparable in content and form in
various degrees and dimensions. This definition includes many types of
parallel and non-parallel multilingual corpora, but also sets of
monolingual corpora that are used for comparative purposes.

By bringing together researchers from several disciplines, including
natural language engineering and the humanities, the workshop aims at
showing the full breadth of research on comparable corpora.

We solicit contributions on a range of topics, including but not
limited to:

* Building Comparable Corpora

 - Automatic and semi-automatic methods
 - Methods to mine parallel and non-parallel corpora from the Web
 - Tools and criteria to evaluate the comparability of corpora
 - Parallel vs non-parallel corpora, monolingual corpora
 - Rare and minority languages
 - Comparisons across language families
 - Multi-media/multi-modal comparable corpora

* Applications of Comparable Corpora

 - Human translations
 - Language learning
 - Cross-language information retrieval & document categorization
 - Bilingual projections
 - (Statistical) machine translation
 - Writing assistance

* Mining from Comparable Corpora

 - Extraction of parallel segments or paraphrases from
   comparable corpora
 - Extraction of bilingual and multilingual translations of
   single words and multi-word expressions, proper names,
   named entities, etc.


February 19, 2010  Deadline for paper submissions
March 12, 2010  Notification of acceptance
March 22, 2010  Camera-ready copies due
May 22, 2010  Workshop


Adam Kilgarriff (Lexical Computing Ltd)


Reinhard Rapp (University of Tarragona, Spain)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)
Serge Sharoff (University of Leeds, UK)


Srinivas Bangalore (AT&T Labs, USA)
Caroline Barrière (National Research Council Canada)
Chris Biemann (Microsoft / Powerset, San Francisco, USA)
Lynne Bowker  (University of Ottawa, Canada)
Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
Kurt Eberle (Lingenio, Heidelberg, Germany)
Andreas Eisele (DFKI Saarbrücken, Germany)
Éric Gaussier (Université Joseph Fourier, Grenoble, France)
Gregory Grefenstette  (Exalead, Paris, France)
Pascale Fung (Hong Kong University of Science & Technology, China)
Silvia Hansen-Schirra (University of Mainz, Germany)
Hitoshi Isahara (NICT, Tokyo, Japan)
Kyo Kageura (University of Tokyo, Japan)
Min-Yen Kan (National University of Singapore)
Natalie Kübler (Université Paris Diderot, France)
Adam Kilgarriff (Lexical Computing Ltd, UK)
Philippe Langlais (Université de Montréal, Canada)
Emmanuel Morin (Université de Nantes, France)
Tony McEnery (Lancaster University, UK)
Dragos Stefan Munteanu (Language Weaver Inc., USA)
Carol Peters (ISTI-CNR, Pisa, Italy)
Reinhard Rapp (University of Tarragona, Spain)
Sujith Ravi (ISI, University of Southern California, USA)
Serge Sharoff  (University of Leeds, UK)
Michel Simard (National Research Council Canada)
Richard Sproat (OGI School of Science and Technology, USA)
Michael Zock (LIF, CNRS Marseille, France)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)


Papers of no more than 10 pages and following the LREC main conference
formatting details as available at should
be submitted as a PDF file via the START conference manager at Papers will be blind
reviewed by at least two members of the Programme
Committee. Therefore, authors' names and affiliations should not
appear in the paper.  Accepted papers will be published in the
workshop proceedings.

This year, when submitting a paper from the START page, authors will
be asked to participate in a survey on the use of language resources
by providing essential information about the resources (in a broad
sense, i.e. also technologies, standards, evaluation kits, etc.) that
have been used for the work described in the paper or are a new result
of the research.  For further information on this new iniative, please
refer to


If you have questions, please consult the workshop website at or contact Reinhard Rapp
(e-mail: reinhardrapp AT gmx DOT de )

Message diffuse par la liste Langage Naturel <LN at>
Informations, abonnement :
English version       : 
Archives                 :

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  :

More information about the Ln mailing list