These: Jean-Philippe Prost, Modelisation de la gradience syntaxique par analyse relachee a base de contraintes

Tue Dec 2 20:01:20 UTC 2008

Date: Sun, 30 Nov 2008 13:38:07 +0000
From: "Jean-Philippe Prost" <jpprost at gmail.com>
Message-ID: <b5e0c390811300538v42af43b9tb82c149c8c55b1ed at mail.gmail.com>
X-url: http://maps.google.com/maps/ms?ie=UTF8&hl=en&msa=0&msid=104974011772425543414.00045c6f34d65786a1256&ll=43.305053,5.378687&spn=0.01054,0.01929&z=16
X-url: http://www.ics.mq.edu.au/~jpprost/
(english version below)

Bonjour,

j'ai le plaisir de vous inviter à la soutenance de ma thèse, qui se
tiendra:

  le mercredi 10 décembre 2008 à 13h,
  à la faculté de Marseille St Charles, Université de Provence

(plan http://maps.google.com/maps/ms?ie=UTF8&hl=en&msa=0&msid=104974011772425543414.00045c6f34d65786a1256&ll=43.305053,5.378687&spn=0.01054,0.01929&z=16),
amphi de Sciences Naturelles,
ainsi qu'au pot qui suivra.

La thèse s'intitule "Modélisation de la gradience syntaxique par
analyse relâchée à base de contraintes".

Ce travail a été réalisé en cotutelle entre le Laboratoire Parole et
Langage (LPL) de l'université de Provence, Aix-en-Provence, et le
Centre for Language Technology (CLT) de Macquarie University, Sydney,
Australie.

Le jury sera composé de:
Alexis Nasr, U. de la Méditerranée, président
Denys Duchier, U. d'Orléans, rapporteur
Gerald Penn, U. of Toronto, rapporteur
Eric de la Clergerie, INRIA, examinateur
Philippe Blache, CNRS-U. de Provence, co-directeur
Diego Molla Aliod, Macquarie U., co-directeur

Résumé ci-joint.

Sincèrement,

Jean-Philippe Prost
http://www.ics.mq.edu.au/~jpprost/

-- Résumé

La grammaticalité d'une phrase est habituellement conçue comme une
notion binaire: une phrase est soit grammaticale, soit
agrammaticale. Cependant, bon nombre de travaux se penchent de plus en
plus sur l'étude de degrés d'acceptabilité intermédiaires, auxquels le
terme de 'gradience' fait parfois référence. A ce jour, la majorité de
ces travaux s'est concentrée sur l'étude de l'évaluation humaine de la
gradience syntaxique. La présente étude explore la possibilité de
construire un modèle robuste qui s'accorde avec ces jugements humains.

Nous suggérons d'élargir au langage mal formé les concepts de
'Gradience Intersective' (GI) et de 'Gradience Subsective' (GS),
proposés par Aarts pour la modélisation de jugements graduels. Selon
ce nouveau modèle, le problème que soulève la gradience concerne la
classification d'un énoncé dans une catégorie particulière, selon des
critères basés sur les caractéristiques syntaxiques de l'énoncé. Nous
nous attachons à étendre la notion de GI afin que le choix de la
meilleure solution parmi un ensemble de candidats soit modélisé par un
critère d'optimalité, tandis que la GS est étendue afin que le degré
de typicité de cette structure au sein de sa catégorie soit modélisé
par le calcul d'un degré d'acceptabilité grammaticale. Quant aux
caractéristiques syntaxiques requises, notre étude de différents
cadres de représentation pour la syntaxe montre qu'elles peuvent
aisément être représentées dans un cadre de syntaxe modèle-théorique
(Model-Theoretic Syntax). Nous optons pour l'utilisation des
Grammaires de Propriétés (GP), qui offrent, précisément, de modéliser
la caractérisation d'un énoncé. Nous présentons ici une solution
entièrement automatisée pour la modélisation de la gradience
syntaxique, qui procède de la caractérisation d'une phrase bien ou mal
formée, de la génération d'un arbre syntaxique optimal, et du calcul
d'un degré d'acceptabilité grammaticale pour l'énoncé.

A travers le développement de ce nouveau modèle, la contribution de ce
travail comporte trois volets.  Premièrement, nous spécifions un
système logique pour les GP qui permet de réviser sa formalisation
sous l'angle de la théorie des modèles.  Deuxièmement, nous présentons
notre implantation du processus d'analyse relâchée à base de
contraintes ('Loose Satisfaction Chart Parsing', ou LSCP), dont nous
prouvons qu'elle génère une analyse syntaxique complète et optimale.
Et troisièmement, nous présentons un modèle numérique pour
l'estimation du degré d'acceptabilité grammaticale d'un énoncé, pour
lequel nous mesurons une bonne corrélation avec des jugements humains.

Mots-clés : Gradience, acceptabilité, grammaticalité, optimalité,
configuration, syntaxe modèle-théorique (Model-Theoretic Syntax),
Grammaires de Propriétés, analyse syntaxique tabulaire par
contraintes, robustesse, satisfaction relâchée de contraintes.

--------------- English version ----------------------
Dear all,

it is my pleasure to invite you to my Ph.D. defense, which will take
place:

  on Wednesday 10 December 2008 at 1pm,
  at faculté de Marseille St Charles, Université de Provence

(map
http://maps.google.com/maps/ms?ie=UTF8&hl=en&msa=0&msid=104974011772425543414.00045c6f34d65786a1256&ll=43.305053,5.378687&spn=0.01054,0.01929&z=16),

amphi de Sciences Naturelles (Natural Sciences theatre),
and to the function to follow.

The thesis' title is "Modelling Syntactic Gradience with Loose
Constraint-based Parsing".

This work was carried out under a joint institutional "cotutelle"
agreement, between the Laboratoire Parole et Langage (LPL) at
Université de Provence, Aix-en-Provence, France, and the Centre for
Language Technology (CLT) at Macquarie University, Sydney, Australia.

The jury comprises:
Alexis Nasr, U. de la Méditerranée, President of the jury,
Denys Duchier, U. d'Orléans, reporting examiner,
Gerald Penn, U. of Toronto, reporting examiner,
Eric de la Clergerie, INRIA, examiner,
Philippe Blache, CNRS-U. de Provence, co-supervisor
Diego Molla Aliod, Macquarie U., co-supervisor

Abstract follows.

Sincerely yours,

Jean-Philippe Prost
http://www.ics.mq.edu.au/~jpprost/

-- Abstract

The grammaticality of a sentence has conventionally been treated in a
binary way: either a sentence is grammatical or not. A growing body of
work, however, focuses on studying intermediate levels of
acceptability, sometimes referred to as 'gradience'. To date, the bulk
of this work has concerned itself with the exploration of human
assessments of syntactic gradience.  This dissertation explores the
possibility to build a robust computational model that accords with
these human judgements.

We suggest that the concepts of Intersective Gradience (IG) and
Subsective Gradience (SG) introduced by Aarts for modelling graded
judgements be extended to cover deviant language. Under such a new
model, the problem then raised by gradience is to classify an
utterance as a member of a specic category according to its syntactic
characteristics. More specifically, we extend IG so that the problem
of choosing the most suitable syntactic structure for an utterance
among a set of candidates is addressed with a criterion of optimality,
while SG is extended so that calculating the degree of prototypicality
of an utterance is addressed in rating this utterance according to its
grammatical acceptability. As for the required syntactic
characteristics, our investigation of different syntactic frameworks
shows that they can easily be represented in Model-Theoretic Syntax;
we choose to use Property Grammars (PG), which offers to model the
characterisation of an utterance. We present here a fully automated
solution for modelling syntactic gradience, which characterises any
well formed or ill formed input sentence, generates an optimal parse
for it, then rates the utterance according to its grammatical
acceptability.

Through the development of such a new model of gradience, the main
contribution of this work is three-fold.  First, we specify a
model-theoretic logical framework for PG, which bridges different gaps
observed in the existing formalisation.  Second, we present our
implementation of Loose Satisfaction Chart Parsing (LSCP), a dynamic
programming approach based on mechanisms of loose constraint
satisfaction, which is proven to always find the full parse of optimal
merit.  And third, after postulating that human judgements of
acceptability can be predicted by factors derivable from LSCP, we
present a numeric model for rating an utterance according to its
syntactic gradience. The model shows a good correlation with human
judgements.

Keywords: Gradience, acceptability, grammaticality, optimality,
configuration, Model-Theoretic Syntax, Property Grammars,
characterisation, constraint-based chart parsing, robustness, loose
constraint satisfaction.

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/
-------------------------------------------------------------------------