[Corpora-List] Deadline extension: Workshop TALaRE 2013 - NLP for French and European Regional Languages

Delphine Bernhard dbernhard at unistra.fr
Tue Mar 26 13:38:54 UTC 2013


DEADLINE EXTENDED TO APRIL 5, 2013

Call for papers

Workshop TALaRE 2013: Natural Language Processing for French and 
European Regional Languages

Held in conjunction with TALN 2013 (20e conférence sur le Traitement 
Automatique des Langues Naturelles, Sables d'Olonne, France, 
June 17-21, 2013)

Research in natural language processing for under-resourced languages is 
currently an active area, within a broader effort toward cultural 
heritage preservation. Regional languages generally fall into this category, as 
electronic resources for these languages are rare and sometimes 
non-existent. Providing electronic resources for these languages 
(including written corpora, lexicons and dictionaries) is a major asset 
for supporting their dissemination, teaching, preservation or 
standardization. Among other things, it is necessary to develop written 
corpora, which are the most representative of language use, by 
collecting written works of various genres (literature, theater, poetry, 
storytelling, press, ...) and, for some languages, by taking dialectal, 
phonological or graphical variation into account. The logical second 
step is to enrich the corpora with annotations. The 
development of annotated corpora for regional languages raises many 
methodological issues. It is not always possible to directly transpose 
existing models for resource-rich languages, partly because of dialectal 
and phonological variation and the lack of writing standards. The 
corpora are also a basis for the development of dictionaries, lexicons 
and glossaries and are necessary for the description of the actual use 
of a language. On the other hand, dictionaries and lexicons are needed 
to support the development of the corpora (optical character 
recognition, lemmatization and morpho-syntactic analysis). When these 
resources already exist for a language (dictionaries, lexicons, 
bilingual glossaries coupling a regional and a national language), the 
question arises as to how information contained in these resources can 
be shared and possibly be enriched with additional annotations 
(phonetic, morpho-syntactic, syntactic, ...). Finally, corpora and 
lexicons are necessary for the development of natural language 
processing tools (morpho-syntactic or syntactic analyzers, ...).

Beyond the technical and methodological challenges, the more pragmatic 
difficulties caused by the lack of financial and human resources for 
creating these resources should not be neglected. This 
workshop aims to bring together researchers involved in the creation of 
language resources and "basic" NLP tools for French and European 
regional languages, in order to share their views, methodologies and 
techniques.

We invite submissions of papers on the creation of resources and 
tools for regional or minority languages of France and Europe (including 
languages from overseas departments and territories of France).

Topics of interest include, but are not limited to:
- Resources: Written corpus building; Development of lexicons, 
dictionaries, glossaries
- Tools: Scanning, OCR and text encoding; Linguistic annotations 
(manual and automatic, for morpho-syntactic or syntactic analysis, ...); 
Corpus management and query
- Relationship between theory and practice when dealing with variation

IMPORTANT DATES
- Paper submission deadline: April 5, 2013
- Notification of paper acceptance: April 26, 2013
- Deadline for camera-ready versions: May 10, 2013

PAPER SUBMISSION

Papers must be written in French by French-speaking authors and in 
English by non-French-speaking authors. They should be 12 to 14 pages 
long, in the TALN 2013 format. A LaTeX style file and an MS Word 
template are available on the conference website 
(http://www.taln2013.org/soumettre/). Accepted papers will be 
allocated 30 minutes for oral presentation (including discussion).

Authors should submit their papers as PDF files through the submission page at
https://www.easychair.org/conferences/?conf=talare2013

SELECTION CRITERIA
The selection criteria will be the same as those that apply to TALN 
2013 research articles.

ORGANIZING COMMITTEE
- Marianne Vergez-Couret, CLLE-ERSS, Université de Toulouse 2
- Delphine Bernhard, LILPA, Université de Strasbourg
- Jean-Michel Eloy, LESCLAP, Université de Picardie
- Christophe Rey, LESCLAP, Université de Picardie

PROGRAM COMMITTEE (in progress)
Martine Adda-Decker     LPP, Université Paris 3
Vincent Berment     CLIPS-GÉTA, Université Joseph Fourier, Grenoble
Myriam Bras     CLLE-ERSS, Université de Toulouse 2
Alain Dawson     LESCLAP, Université de Picardie
Nuria Gala     LIF, Aix-Marseille Université
Nabil Hathout     CLLE-ERSS, Université de Toulouse 2
Anne-Laure Ligozat     LIMSI, CNRS
Jean-Marie Pierrel     ATILF, Université de Lorraine
Yves Scherrer     ALPAGE, Université Paris 7
Pascal Vaillant     Université Paris 13

Contact: Marianne Vergez-Couret (vergez at univ-tlse2.fr)
