Job: Post-doctorat - GREYC CNRS - Caen

Wed Jun 11 09:28:41 UTC 2008

Date: Mon, 09 Jun 2008 16:08:58 +0200
From: Thierry Charnois <thierry.charnois at info.unicaen.fr>
Message-ID: <484D397A.8010500 at info.unicaen.fr>
X-url: http://bingo2.greyc.fr/openPositions/postDoc08GREYC_Fr
X-url: http://bingo2.greyc.fr/
X-url: http://bingo2.greyc.fr/openPositions/postDoc08GREYC_En

English version: see below 

L'appel à candidature est aussi disponible à :
http://bingo2.greyc.fr/openPositions/postDoc08GREYC_Fr

------------------------------------------------------------------------

Le laboratoire GREYC CNRS UMR 6072 de l'université de Caen
Basse-Normandie propose un post-doc dans le cadre du projet Bingo2
(Knowledge discovery for and by inductive queries in post-genomic
applications) soutenu par l'ANR : http://bingo2.greyc.fr/

Sujet : découverte automatique de règles pour l'annotation de corpus
=====

L'objectif général de cette recherche concerne la découverte de
connaissances à partir de textes bio-médicaux. Face à l'explosion
quantitative de telles données, l'accès à l'information pertinente est
devenue un enjeu majeur.  Il existe un savoir-faire au GREYC sur les
méthodes de fouille de données et du traitement automatique des
langues (TAL), notamment en extraction d'information et nous
proposons, dans cette recherche, d'exploiter les complémentarités de
ces domaines. Au niveau du TAL, l'acquisition automatique de
ressources linguistiques (lexiques, règles d'extraction) par des
techniques de fouille de texte (FT) est une voie prometteuse pour
prendre en compte la diversité des corpus et leur évolution, et
permettre une analyse automatique des textes et les annoter. Au niveau
de la FT, il s'agit de concevoir de nouvelles méthodes de fouille
intégrant les spécificités du texte.

Le post-doc travaillera sur cette problématique qui inclut les points
suivants :

* utilisation et adaptation des méthodes de FT pour l'apprentissage de
  règles d'extraction ;

* conception d'une méthode pour l'annotation sémantique de corpus ;

* développement d'une interface permettant la recherche d'information
  au sein des corpus annotés.

* Profil

Nous recherchons un informaticien ayant des connaissances dans au
moins l'un des domaines du traitement automatique des langues, de la
fouille de données ou de l'apprentissage automatique.

* Lieu, durée et rémunération

Contrat de 12 mois (possibilité de prolongation), à pourvoir en
septembre 2008
Rémunération : 1900 euros net par mois
Le poste est à pourvoir au GREYC.

* Comment candidater

Les candidatures sont à envoyer dès que possible et seront étudiées
jusqu'à ce que le poste soit pourvu.

Envoyer un CV et une lettre de motivation, ainsi que deux noms de
personnes susceptibles de recommander le candidat, par courrier
électronique à
Thierry.Charnois at info.unicaen.fr
Bruno.Cremilleux at info.unicaen.fr

Thierry Charnois +33 2 31 56 73 77 Thierry.Charnois at info.unicaen.fr
Bruno Crémilleux +33 2 31 56 74 35 Bruno.Cremilleux at info.unicaen.fr
GREYC - CNRS UMR 6072, Université de Caen, Campus Côte de Nacre F-14032 Caen
Cedex - France 

************************************************************************
English version: http://bingo2.greyc.fr/openPositions/postDoc08GREYC_En
************************************************************************

A post-doctoral position in Computer Science, Natural Language
Processing and Data Mining is opened at the GREYC laboratory (CNRS UMR
6072) at University of Caen.

Working context: the position is funded by the French ANR agency,
project Bingo2 (Knowledge discovery for and by inductive queries in
post-genomic applications): http://bingo2.greyc.fr/

Location: Caen, Normandy, France

Topic: Discovering Extraction Rules for annotating corpus

Knowledge discovery in bio-medical texts is a big challenge due to the
large amount and variety of texts. The GREYC laboratory has performed
a variety of research on data mining, natural language processing and
extraction information (EI). This work proposes a promising
cross-fertilization of NLP and machine learning and data mining
techniques on biomedical texts. About NLP, automatically acquiring and
discovering extraction rules is a key point for parsing and annotating
text collections which are indefinitely updated and continually
grow. About data mining, the goal is to design new text mining methods
for taking account textual specificities.

Main features of the work include:
* using and adapting data mining methods for learning extraction
  rules;
* designing a method for semantic annotation of corpus;
* developing an interface for information retrieval from texts.

* Profile

The applicant should have a Ph.D degree in Computer Science with
background in Natural Language Processing or data mining or machine
learning.

* Place, duration and remuneration

12 months contract (with a possible extension), starting from
September 2008.
Net salary:  1900 euros per month
The position requires working at GREYC (see address below).

* How to apply

Please apply as soon as possible. Screening of applications will begin
immediately and continue until the position is filled.

Send CV and cover letter, with names of two references, to the
following emails:

Thierry Charnois +33 2 31 56 73 77 Thierry.Charnois at info.unicaen.fr
Bruno Crémilleux +33 2 31 56 74 35 Bruno.Cremilleux at info.unicaen.fr
GREYC - CNRS UMR 6072, Université de Caen, Campus Côte de Nacre F-14032 Caen
Cedex - France 

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/
-------------------------------------------------------------------------