[Corpora-List] Deadline extension: Workshop TALaRE 2013 - NLP for French and European Regional Languages
Delphine Bernhard
dbernhard at unistra.fr
Tue Mar 26 13:38:54 UTC 2013
DEADLINE EXTENDED TO APRIL 5, 2013
Call for papers
Workshop TALaRE 2013: Natural Language Processing for French and
European Regional Languages
Held in conjunction with TALN 2013 (20e conférence sur le Traitement
Automatique des Langues Naturelles, Sables d'Olonne, France, June,
17th-21st 2013)
Research in natural language processing for under-resourced languages is
currently an active area, in a global perspective of cultural heritage
preservation. Regional languages generally fall into this category, as
electronic resources for these languages are rare and sometimes
non-existent. Providing electronic resources for these languages
(including written corpora, lexicons and dictionaries) is a major asset
for supporting their dissemination, teaching, preservation or
standardization. It is, among others, necessary to develop written
corpora, which are the most representative of language use, by
collecting written works of various genres (literature, theater, poetry,
storytelling, press ...) and, for some languages, by taking variation
into account (dialectal, phonological or graphical variations). The
second step is logically to enrich the corpora with annotations. The
development of annotated corpora for regional languages raises many
methodological issues. It is not always possible to directly transpose
existing models for resource-rich languages, partly because of dialectal
and phonological variation and the lack of writing standards. The
corpora are also a basis for the development of dictionaries, lexicons
and glossaries and are necessary for the description of the actual use
of a language. On the other hand, dictionaries and lexicons are needed
to support the development of the corpora (optical character
recognition, lemmatization and morpho-syntactic analysis). When these
resources already exist for a language (dictionaries, lexicons,
bilingual glossaries coupling a regional and a national language), the
question arises as to how information contained in these resources can
be shared and possibly be enriched with additional annotations
(phonetic, morphosyntactic, syntactic, ...). Finally, corpora and
lexicons are necessary for the development of natural language
processing tools (morpho-syntactic analysis or syntactic analyzers ...).
Beyond the technical and methodological challenges, the more pragmatic
difficulties related to the lack of financial and human resources to
carry out the creation of resources should not be neglected. This
workshop aims to bring together researchers involved in the creation of
language resources and "basic" NLP tools for French and European
regional languages, in order to share their views, methodologies and
techniques.
We invite submission of papers on the constitution of resources and
tools for regional or minority languages of France and Europe (including
languages from overseas departments and territories of France).
Topics of interest include, but are not limited to:
- Resources: Written corpus building ; Development of lexicons,
dictionaries, glossaries
- Tools : Scanning, OCR and text encoding ; Linguistic annotations
(manual and automatic for morpho-syntactic or syntactic analysis,...) ;
Corpus management and query
- Articulation between theory and practice when dealing with variation
IMPORTANT DATES
- Paper submission deadline: April 5, 2013
- Notification of paper acceptance: April 26, 2013
- Deadline for camera-ready versions: May 10, 2013
PAPER SUBMISSION
Papers will be written in French for French-speaking authors or English
for non-French-speaking authors. They should have from 12 to 14 pages in
the TALN 2013 format. A LaTeX style file and a MS Word template are
available on the conference website
(http://www.taln2013.org/soumettre/). Selected articles will be
allocated 30 minutes for the oral presentation (including discussion).
Authors should submit the papers in PDF through the submission page at
https://www.easychair.org/conferences/?conf=talare2013
SELECTION CRITERIA
The selection criteria will be the same as those that apply for TALN
2013 research articles.
ORGANIZING COMMITTEE
- Marianne Vergez-Couret, CLLE-ERSS, Université de Toulouse 2
- Delphine Bernhard, LILPA, Université de Strasbourg
- Jean-Michel Eloy, LESCLAP, Université de Picardie
- Christophe Rey, LESCLAP, Université de Picardie
PROGRAM COMMITTEE (in progress)
Martine Adda-Decker LPP, Université Paris 3
Vincent Berment CLIPS-GÉTA, Université Joseph Fourier, Grenoble
Myriam Bras CLLE-ERSS, Université de Toulouse 2
Alain Dawson LESCLAP, Université de Picardie
Nuria Gala LIF, Aix-Marseille Université
Nabil Hathout CLLE-ERSS, Université de Toulouse 2
Anne-Laure Ligozat LIMSI, CNRS
Jean-Marie Pierrel ATILF, Université de Lorraine
Yves Scherrer ALPAGE, Université Paris 7
Pascal Vaillant Université Paris 13
Contact : Marianne Vergez-Couret (vergez at univ-tlse2.fr)
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list