[Corpora-List] 2nd CFP: NEW! MT Track! LREC 2010 Workshop: Multilingual resources and tools for Central and (South-)Eastern European languages

Milena Slavcheva milena at lml.bas.bg
Wed Feb 3 10:39:54 UTC 2010


SECOND CALL FOR PAPERS

 

Workshop on "EXPLOITATION OF MULTILINGUAL RESOURCES AND TOOLS FOR CENTRAL AND (SOUTH-) EASTERN EUROPEAN LANGUAGES"

23rd May 2010, La Valleta , Malta

 

organised in conjunction with LREC 2010 conference (17-23 May 2010, La Valleta, Malta)


http://www.c-phil.uni-hamburg.de/view/Main/LrecWorkshop2010

 

The reconciliation of differences in the availability of language resources and tools for more intensively and less intensively spoken languages has been the main concern of  several European initiatives. Central and (South-) Eastern European languages can be the subject of a case study in that respect: integration of diverse languages into a broad language community. 

 

The main result of these initiatives was the increased production of language resources and especially language technology tools for Central, Eastern and Southern European languages in recent years. While monolingual systems have achieved performances comparable to those for intensively studied languages, still a lot of work has to be invested in multilingual tools for applications such as machine translation or cross-lingual information retrieval.  At least three major issues have critical influence on the performance of such systems:

  

- the availability of the appropriate quantities of data for training and evaluation; 

- the analysis of structural linguistic differences among languages so as to be able to improve statistical methods with targeted linguistic knowledge;

- the availability of knowledge bases for incorporation into language processing systems.

 

The identification of key aspects of linguistic modelling and resource supply for multilingual technologies involving Central, Eastern and Southern European languages can have impact not only on the local improvement of such systems but also on the overall development of multilingual technologies. The same holds for well established or emerging linguistic knowledge representation frameworks, which can only benefit from embedding components for Central, Eastern and Southern European languages.

 

We are looking for submission of original, unpublished work related to Central and (South)-Eastern European languages in the following areas:



- Automatic identification of comparable or parallel corpora

- Extraction of linguistic knowledge from comparable or parallel corpora

- Improvement of statistical methods with knowledge extracted from comparable or parallel corpora

- Domain adaptation of statistical methods in multilingual context

- Multilingual systems involving Central, Eastern and Southern European languages

- Production, management and interfacing of knowledge bases including Central, Eastern and Southern European languages

- Machine translation for Central, Eastern and Southern European languages

 

NEW! SPECIAL TRACK ON MACHINE TRANSLATION

 

A special track will be dedicated to machine translation for Central and (South-)Eastern European languages. The main goal of this track is to sum up linguistic particularities of these languages, which are challenges for an MT-System. In this respect we are encouraging submission of papers analysing the translations of the test data at

https://www.c-phil.uni-hamburg.de/view/Main/LREC10MTTrack

in any of the languages referred to as Central or (South-) Eastern European.

 

Authors can use any on-line, open-source or commercially available system.  The source language is English. Papers submitted under this track should discuss evaluation criteria specified at 


https://www.c-phil.uni-hamburg.de/view/Main/LREC10MTTrack 


Accepted papers (both main session and special track) will be published in workshop proceedings. We intend to publish revised versions of the papers in a volume in LINCOM Series. Demonstrations of existing systems are also welcome.

 

Submission Details

Submissions have to be made through the START system of the main LREC conference at https://www.softconf.com/lrec2010/MultiLR2010/. 

Contributors should submit a PDF file no longer than 10 DIN-A4 pages. The formatting details should follow the LREC conference formatting details at http://www.lrec-conf.org/lrec2010/?Author-s-Kit-and-Templates. Papers will be blind reviewed by minimum 3 members of the Programme Committee.

 

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e., also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. 

For further information on this new initiative, please refer to:  

http://www.lrec-conf.org/lrec2010/?LREC2010-Map-of-Language-Resources

 

NEW Important Dates

Deadline for paper submission: 22 February  2010

Notification of acceptance / rejection: 15 March 2010

Submission of final papers: 25 March 2010

Workshop:  23 May 2010

 

Organising Committee

Stelios Piperidis, ILSP, Natural Language and Knowledge Extraction Department, Athens Greece Milena Slavcheva, Bulgarian Academy of Science, Institute for Parallel Processing, Linguistic Modeling Department, Sofia, Bulgaria 

Cristina Vertan, University of Hamburg, Arbeitsstelle "Computerphilologie", Hamburg, Germany 

  

Programme Committee

Galja Angelova (IPP, Bulgarian Academy of Sciences)
Bogdan Babych (Centre for Translation Studies, Leeds, UK)
Damir Cavar (University of Zadar, Croatia)
Tomaz Erjavec (Jozef Stefan Institute, Slovenia)
Maria Gavrilidou (ILSP, Athens, Greece)
Walther von Hahn (University of Hamburg, Germany)
Cvetana Krstev (University of Belgrade, Serbia)
Vladislav Kubon (Charles University, Prague, Czech Republic)
Petya Osenova (University of Sofia, Bulgaria)
Stelios Piperidis (ILSP, Athens, Greece)
Gabor Proszeky (Morphologic, Hungary)
Adam Przepiorkowski (IPAN, Polish Academy of Sciences)
Milena Slavcheva (IPP, Bulgarian Academy of Sciences)
Kiril Simov (IPP, Bulgarian Academy of Sciences)
Dan Tufis (Romanian Academy of Sciences)
Cristina Vertan (University of Hamburg, Germany)
Dusko Vitas (University of Belgrade, Serbia)

 

Contact


For further questions please see the web page of the workshop at: http://www.c-phil.uni-hamburg.de/view/Main/LrecWorkshop2010

or contact Dr. Cristina Vertan at:

cristina DOT vertan AT uni-hamburg DOT de

tel: +4940428384767



Milena Slavcheva

Linguistic Modeling Department
Institute for Parallel Processing
Bulgarian Academy of Sciences
25A, Acad. Georgi Bonchev St.
1113 Sofia, Bulgaria

phone: (+359) 2 979 2812
e-mail: milena at lml.bas.bg
URL:  http://lml.bas.bg/~milena

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20100203/79b7ca95/attachment.htm>
-------------- next part --------------
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list