[Corpora-List] Post-doc position at the Xerox Research Centre Europe

Nicola Cancedda nicola.cancedda at xrce.xerox.com
Fri Jun 23 16:34:59 UTC 2006


*Background*

The Learning and Content Analysis (LCA) group of the Xerox Research
Centre Europe (XRCE) is developing data mining technologies. Some of
these, including text clustering and categorization in both
monolingual and multilingual settings, have been delivered to and are
now used by several Xerox business groups. LCA currently focuses on
developing new solutions for:

   1. Multilingual applications: multilingual lexicon extraction,
      cross-language information retrieval, induction of machine
      translation systems from multilingual corpora
   2. Multimedia mining: categorization, clustering and retrieval of
      multimedia data (this project is conducted in collaboration with
      the Image Processing group of XRCE)
   3. Device mining, where devices are considered either in isolation or
      connected via a network. This activity aims, among other things,
      at building systems for diagnosing devices, for preventive
      maintenance, for monitoring the evolution of devices over time,
      and for issuing early warnings of failure.

*Description*

XRCE is seeking a postdoc researcher to contribute to the
multilingual application activities in the context of the EU-funded
project "Statistical Multilingual Analysis for Retrieval and
Translation" (SMART). This project is focused on advancing the state
of the art in Statistical Machine Translation and Cross-Language
Textual Information Access technologies by means of modern
Statistical Learning. XRCE is involved in the whole spectrum of
SMART activities:

    * Advanced models for Statistical Machine Translation
    * Advanced Language Models
    * Translation and Language Model Adaptation and Combination
    * Cross-Language Textual Information Access

Contributions to some or all of these activities are expected, in close
collaboration with other members of the group.

XRCE maintains a high level of publications (in both journals and
conferences) and patents. We expect the successful candidate to be
part of this effort.

The duration of the contract is 18 months, starting October 1st,
2006.

*Keywords:* Statistical Machine Translation, Machine Learning, Optimization

*Technical requirements:* PhD in computer science, statistics,
mathematics or optimization with excellent knowledge of machine
learning (especially statistical learning). Experience in the
implementation of scalable optimization algorithms and/or in
statistical machine translation is a definite plus.

Good programming skills in C, C++, Python or Matlab are required.

A good command of English is required, as well as open-mindedness
and the will to collaborate with a team.

*Contact*

Applications should be sent in electronic form to:

xrce-candidates at xrce.xerox.com

Informal inquiries can be addressed to:

Dr. Nicola Cancedda,
Nicola.Cancedda at xrce.xerox.com
XRCE - 6, Chemin de Maupertuis 38240 Meylan - France
Tel: 33 4 76 61 51 59
Fax: 33 4 76 61 50 99


