30.2597, Support: English; French; Computational Linguistics: PhD, CNRS
The LINGUIST List
linguist at listserv.linguistlist.org
Mon Jul 1 19:27:17 UTC 2019
LINGUIST List: Vol-30-2597. Mon Jul 01 2019. ISSN: 1069 - 4875.
Subject: 30.2597, Support: English; French; Computational Linguistics: PhD, CNRS
Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Peace Han, Nils Hjortnaes, Yiwen Zhang, Julian Dietrich
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org
Homepage: http://linguistlist.org
Please support the LL editors and operation with a donation at:
https://funddrive.linguistlist.org/donate/
Editor for this issue: Becca Morris <becca at linguistlist.org>
================================================================
Date: Mon, 01 Jul 2019 15:26:52
From: Nabil Hathout [Nabil.Hathout at univ-tlse2.fr]
Subject: English; French; Computational Linguistics: PhD, CNRS, France
Institution/Organization: CNRS
Department: CLLE
Web Address: https://clle.univ-tlse2.fr/
Level: PhD
Duties: Research
Specialty Areas: Computational Linguistics
Required Language(s): English (eng)
French (fra)
Description:
PhD position: Acquisition of linguistic phenomena by means of neural
embeddings
GENERAL INFORMATION
Reference : UMR5263-ANNCAM-012
Workplace : TOULOUSE
Date of publication : Tuesday, June 18, 2019
Scientific Responsible name : Nabil HATHOUT and Tim VAN DE CRUYS
Type of Contract : PhD Student contract / Thesis offer
Contract Period : 36 months
Start date of the thesis : 1 October 2019
Proportion of work : Full time
Remuneration : 2 135,00 € gross monthly
Description of the Thesis Topic:
Neural embeddings, used to construct semantic representations for words as
well as sentences, have become ubiquitous in the field of natural language
processing. Although their use leads to very good performance in a wide range
of tasks, they present difficulties related to their opacity. In particular,
much uncertainty remains about their ability to capture linguistic knowledge.
The goal of this PhD studentship will be to investigate the ability of neural
embeddings to acquire specific linguistic phenomena.
The first of these phenomena concerns the selection preferences of verbal
predicates. Most verbs have a preference for arguments that belong to
particular semantic classes. Theoretically, sentence embeddings implicitly
model these preferences, as well as the semantic features of the predicate and
its arguments. On the other hand, it remains unclear whether they are able to
correctly model more irregular cases, where the verb selects for a
semantically atypical argument. The thesis will include a thorough study of
the interaction of the verb with its arguments during the construction of
sentence embeddings: how does the representation of the verb contextualize
itself in the presence of its arguments, and vice versa?
Related to the first phenomenon, we will study the extent to which phrase
embeddings are able to capture subcategorization frames. In a given context, a
verb selects a subcategorization framework that depends on its syntactic and
semantic properties, as well as those of the context. Again, sentence
embeddings theoretically capture this information, but are they also able to
identify irregular arguments? The study will examine the behavior of sentence
embeddings in contexts where the arguments either do or do not conform to the
verb subcategorization frame.
Work Context:
The candidate will jointly carry out their research at the CLLE laboratory
(CARTEL team) and the IRIT laboratory (Melodi team) at the University of
Toulouse. CLLE develops NLP methods and tools for linguistics, while IRIT is a
computer laboratory internationally recognized for its research on artificial
intelligence and natural language processing; IRIT is also one of the founding
laboratories of ANITI (3IA), the artificial intelligence institute of
Toulouse.
Funding:
The thesis will be funded for a period of 3 years. The monthly remuneration is
2135 euros gross (1715 euros net).
Profile:
- Master's degree in NLP or computer science;
- Strong knowledge of NLP and machine learning;
- Strong programming skills (Python);
- Good command of English and French;
- Good writing and oral presentation skills.
Application:
Applications must be filed on the application link provided below.
Inquiries:
Feel free to contact Nabil Hathout (nabil.hathout at univ-tlse2.fr) or Tim Van de
Cruys (tim.vandecruys at irit.fr) to have more information.
Additional Information:
Keywords: neural embeddings; compositionality; contextual models;
subcategorization frames.
The PHD will be co-directed by Nabil Hathout (CNRS, CLLE) and Tim Van de Cruys
(CNRS, IRIT).
Application Deadline: 27-Nov-2019
Mailing Address for Applications:
Attn: Nabil Hathout
CLLE/ERSS. Maison de la recherche
Université de Toulouse Jean-Jaurès
Toulouse 31058
France
Web Address for Applications: https://emploi.cnrs.fr/Offres/Doctorant/UMR5263-ANNCAM-012/Default.aspx?lang=EN
Contact Information:
Nabil Hathout
Nabil.Hathout at univ-tlse2.fr
------------------------------------------------------------------------------
*************************** LINGUIST List Support ***************************
The 2019 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
to find out how to donate and check how your university, country or discipline
ranks in the fund drive challenges. Or go directly to the donation site:
https://iufoundation.fundly.com/the-linguist-list-2019
Let's make this a short fund drive!
Please feel free to share the link to our campaign:
https://funddrive.linguistlist.org/donate/
----------------------------------------------------------
LINGUIST List: Vol-30-2597
----------------------------------------------------------
More information about the LINGUIST
mailing list