24.1418, Calls: Computational Linguistics, Text/Corpus Linguistics/France
linguist at linguistlist.org
linguist at linguistlist.org
Tue Mar 26 19:26:58 UTC 2013
LINGUIST List: Vol-24-1418. Tue Mar 26 2013. ISSN: 1069 - 4875.
Subject: 24.1418, Calls: Computational Linguistics, Text/Corpus Linguistics/France
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Veronika Drake, U of Wisconsin Madison
Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org
Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!
USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21
For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.
Editor for this issue: Alison Zaharee <alison at linguistlist.org>
================================================================
Date: Tue, 26 Mar 2013 15:25:57
From: Marianne Vergez-Couret [vergez at univ-tlse2.fr]
Subject: Workshop TALaRE 2013: Natural Language Processing for French and European Regional Languages
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=24-1418.html&submissionid=10169881&topicid=3&msgnumber=1
Full Title: Workshop TALaRE 2013: Natural Language Processing for French and European Regional Languages
Short Title: TALaRE
Date: 21-Jun-2013 - 21-Jun-2013
Location: Sables d'Olonne, France
Contact Person: Marianne Vergez-Couret
Meeting Email: vergez at univ-tlse2.fr
Web Site: http://www.taln2013.org/ateliers/atelier-talare-2013/
Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics
Call Deadline: 05-Apr-2013
Meeting Description:
Research in natural language processing for under-resources languages is currently an active area, in a global perspective of cultural heritage preservation. Regional languages generally fall into this category, as electronic resources for these languages are rare and sometimes non-existent. Providing electronic resources for these languages (including written corpora, lexicons and dictionaries) is a major asset for supporting their dissemination, teaching, preservation or standardization. It is, among others, necessary to develop written corpora, which are the most representative of language use, by collecting written works of various genres (literature, theater, poetry, storytelling, press, etc.) and, for some languages, by taking variation into account (dialectal, phonological or graphical variations). The second step is logically to enrich the corpora with annotations. The development of annotated corpora for regional languages raises many methodological issues. It is not always possible to directly transpose existing models for resource-rich languages, partly because of dialectal and phonological variation and the lack of writing standards. The corpora are also a basis for the development of dictionaries, lexicons and glossaries and are necessary for the description of the actual use of a language. On the other hand, dictionaries and lexicons are needed to support the development of the corpora (optical character recognition, lemmatization and morpho-syntactic analysis). When these resources already exist for a language (dictionaries, lexicons, bilingual glossaries coupling a regional and a national language), the question arises as to how information contained in these resources can be shared and possibly be enriched with additional annotations (phonetic, morphosyntactic, syntactic, etc.). Finally, corpora and lexicons are necessary for the development of natural language processing tools (morpho-syntactic analysis or syntactic analyzers, etc.).
Beyond the technical and methodological challenges, the more pragmatic difficulties related to the lack of financial and human resources to carry out the creation of resources should not be neglected. This workshop aims to bring together researchers involved in the creation of language resources and ‘basic’ NLP tools for French and European regional languages, in order to share their views, methodologies and techniques.
2nd Call for Papers:
We invite submission of papers on the constitution of resources and tools for regional or minority languages of France and Europe (including languages from overseas departments and territories of France).
Topics of interest include, but are not limited to:
Resources:
- Written corpus building
- Development of lexicons, dictionaries, glossaries
Tools:
- Scanning, OCR and text encoding
- Linguistic annotations (manual and automatic for morphosyntactic or syntactic analysis, etc.)
- Corpus management and query
- Articulation between theory and practice when dealing with variation
Paper Submission:
Papers will be written in French for French-speaking authors or English for non-French-speaking authors. They should have from 12 to 14 pages in the TALN 2013 format. A LaTeX style file and a MS Word template are available on the conference website (http://www.taln2013.org/soumettre/). Selected articles will be allocated 30 minutes for the oral presentation (including discussion).
Authors should submit the papers in PDF through the submission page at https://www.easychair.org/conferences/?conf=talare2013.
Selection Criteria:
The selection criteria will be the same as those that apply for TALN 2013 research articles.
Organizing Committee:
Marianne Vergez-Couret, CLLE-ERSS, Université de Toulouse 2
Delphine Bernhard, LILPA, Université de Strasbourg
Jean-Michel Eloy, LESCLAP, Université de Picardie
Christophe Rey, LESCLAP, Université de Picardie
Program Committee:
Martine Adda-Decker, LPP, Université Paris 3
Vincent Berment, CLIPS-GÉTA, Université Joseph Fourier, Grenoble
Myriam Bras, CLLE-ERSS, Université de Toulouse 2
Alain Dawson, LESCLAP, Université de Picardie
Nuria Gala, LIF, Aix-Marseille Université
Nabil Hathout, CLLE-ERSS, Université de Toulouse 2
Anne-Laure Ligozat, LIMSI, CNRS
Jean-Marie Pierrel, ATILF, Université de Lorraine
Yves Scherrer, ALPAGE, Université Paris 7
Pascal Vaillant, Université Paris 13
----------------------------------------------------------
LINGUIST List: Vol-24-1418
----------------------------------------------------------
More information about the LINGUIST
mailing list