Habilitation: Alexandre Allauzen, Modeles statistiques pour la traduction automatique
Thierry Hamon
hamon at LIMSI.FR
Sat Jan 18 08:00:56 UTC 2014
Date: Thu, 16 Jan 2014 13:21:33 +0100
From: Alexandre Allauzen <allauzen at limsi.fr>
Message-ID: <52D7CECD.4040602 at limsi.fr>
X-url: http://www.limsi.fr/Pratique/acces/
J'ai le plaisir de vous inviter à ma soutenance d'Habilitation à Diriger
des Recherches intitulée "Modèles statistiques pour la traduction
automatique", ainsi qu'au pot qui suivra. La soutenance aura lieu le
jeudi 30 janvier 2014 à 14h30 dans la salle de conférence du LIMSI-CNRS.
Le jury est composé de :
Laurent Besacier, Professeur des Universités, Université J. Fourier,
Grenoble 1
Alain Denise, Professeur des Universités, Université Paris-Sud
George Foster, Senior Research Officer, Conseil national de recherches
Canada
Hermann Ney, Professeur, Université d’Aix-la-Chapelle
Isabelle Tellier, Professeur des Universités, Université Paris 3-
Sorbonne Nouvelle
François Yvon, Professeur des Universités, Université Paris-Sud
Accès: pour venir au LIMSI, vous pouvez consulter la page web
http://www.limsi.fr/Pratique/acces/
--
Résumé:
The advent of Internet has deeply modified our communication habits and
especially our relationship with multilingualism: everyone needs to
access and broadcast information that are not in its native language,
and to electronically communicate with other Internet users who speak or
write in different languages. These different factors create a new
demand for translation services, that can only be realistically
fulfilled by machine translation tools.
With the exponential increase of electronic corpora along with the
development of statistical methods, tremendous progress has been made
over the last two decades in automatic translation of natural languages
(i.e Machine Translation). Nowadays, statistical machine translation
(SMT) can be considered as the most promising approach. However, given
the complexity of the task and its peculiarities, modelling and
estimation issues remain to be addressed and motivate a large part of my
work. My habilitation report is focused on 4 lines of my research
activities, each described in a chapter :
- The phrase-based SMT approach and more precisely its n-gram variant
are described, along with an overview of my recurrent work on the
development of state of the art SMT systems for international
evaluation campaigns like WMT and IWSLT.
- The associations between a source and a target phrases and their
statistics, which are at the heart of phrase-based SMT systems, rely
on noisy information provided by word alignment models. This chapter
investigates a model-based replacement of this important preliminary
step, in which the posterior probabilities of the word alignment links
are introduced in the construction of the translation models. The
posterior distributions are estimated using a maximum-entropy
classifier.
- Given the inventory of translation units, most SMT architectures
follow a two-steps procedure in which several probabilistic models are
first estimated independently, and then combined in a global linear
model, the weights of which are chosen so as to maximize an overall
translation quality criterion. In this line of research, an end-to-end
discriminative model is introduced, in which all the parameters are
learned discriminatively in a unified manner. The derivations of the
translation process that defines reordering and segmentation
operations are introduced as latent variables in an exponential model.
- Most statistical models for SMT consider words and phrases as events
of discrete random variables. The resulting representations are very
sparse and ignore relationships that may exist among these units. This
lack of structure hinders the generalization and adaptation power of
the resulting models. To remedy these issues, a continuous space
translation model is introduced, in which words or phrases are
projected in a continuous space, and represented by dense vectors of
small dimensions. The translation probabilities are then expressed as
a smooth function of these representations. The projection and the
probability estimation are jointly learned using a multi layer neural
network.
The conclusion of the report summarizes my contributions and proposes
directions for future work. These directions are focused on word
reordering in SMT, continuous vector space models and weakly supervised
learning.
Alexandre Allauzen
Univ. Paris-Sud/LIMSI-CNRS
Tel : 01.69.85.80.41 (80.88)
Bur : 29 LIMSI Bat. S
allauzen at limsi.fr
-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version :
Archives : http://listserv.linguistlist.org/archives/ln.html
http://liste.cines.fr/info/ln
La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion : http://www.atala.org/
ATALA décline toute responsabilité concernant le contenu des
messages diffusés sur la liste LN
-------------------------------------------------------------------------
More information about the Ln
mailing list