[Corpora-List] [CfP] REMINDER: Workshop on multiling. resources, technolog.&eval for Balkan&Slavic Lang.
vertan at informatik.uni-hamburg.de
vertan at informatik.uni-hamburg.de
Mon Jun 15 13:01:56 UTC 2009
*** Appologies for multiple postings ***
Last Call for Papers
DEADLINE 20th June 2009 (0:00 GMT+2)
Workshop on: Multilingual resources, technologies and evaluation for
Balkan and Slavic
languages
- associated with the RANLP 2009 (http://lml.bas.bg/ranlp2009/)-
17 September 2009, Borovets, Bulgaria
Workshop page: http://www.c-phil.uni-hamburg.de/view/Main/RanlpBalkan09
The economical and political changes during the last years in Europe
have led to a major shift of activities in the language technology
area. On one side the access for all citizens to global information
has become a real demand, on the other side the specific European
context with a big number of national and minority languages and the
exponential number of documents appearing every day, makes this
desiderata a real challenge.
Language Processing is now seen as the main technology being able to
give people access to information (no matter where it was produced) in
their own languages. Despite important developments, the
infrastructure in terms of languages resources (data and tools) for
less spoken languages , (especially Balkan and Slavic languages) are
still far behind the achieved standard for major west European ones.
As most part of the current LT-Applications rely on data-driven
methods, one major drawback in the development of language resources
for these languages is the lack of training and evaluation data , as
well as reference systems for comparing results.
Well-known corpora like JRC-ACQUIS and OPUS, although a significant
step forward,:
- still do not cover all languages in Balkan area
- are collection of documents in specialised language and therefore
decrease the
performance of systems trained on this data when testing on other
domains and registers.
In order to shorten this bottleneck, it is necessary to develop,
promote and make available data which can be used for training and
evaluation. In addition, it is important to know which systems have
been developed for which applications, on which data were tested and
with which evaluation results.
The aim of the current workshop is to make a first step in this
direction. We encourage submission of original, unpublished work
related to Balkan and Slavic languages in the following areas:
- data-set profiling for Balkan and Slavic languages
- development of multilingual resources (parallel and comparable
corpora, lexica, etc)
- multilingual and cross-lingual applications (IR, IE)
- machine translation
- evaluation of machine translation and cross-lingual IR/IE systems
- evaluation protocols and measures
- usage of language independent models for the considered languages
Submissions will be made through the START System of the main conference at
https://www.softconf.com/ranlp09/RanlpBalkan/cgi-bin/scmd.cgi?scmd=basicSubmit
no later that June 20th (0:00 GMT+2),
and should follow the formatting instructions specified on
http://lml.bas.bg/ranlp2009/ under "Submission Info". Submissions
should be ANONYMOUS and should not be longer as 8 A4-pages (full
article).
For further information pleae contact Cristina Vertan at:
vertan AT informatik DOT informatik DOT uni-hamburg DOT de
Important Dates
==================
Paper Submission 20 June 2009
Notification of Acceptance 20 July 2009
Submission of final papers 13 August 2009
Organisers
============
Elena Paskaleva (Bulgarian Academy of Sciences)
Stelios Piperidis (ILSP, Greece)
Milena Slavcheva (Bulgarian Academy of Sciences)
Cristina Vertan (University of Hamburg)
Programme committee (already confirmed)
======================
Tomaz Erjavec (Jozef Stefan Institute, Slovenia)
Maria Gavrilidou (ILSP, Greece)
Walther v. Hahn (University of Hamburg)
Cvetana Krstev (University of Belgrad)
Vladislav Kubon (Chales university Prague)
Petya Osenova (University of Sofia, Bulgaria)
Elena Paskaleva (Bulgarian Academy of Sciences)
Stelios Piperidis (ILSP, Greece)
Gabor Proszeky (Morphologic, Hungary)
Adam Przepiùrkowski (IPAN, Polish Academy of Sciences)
Milena Slavcheva (Bulgarian Academy of Sciences)
Marco Tadic (University of Zagreb, Croatia)
Dan Tufis (Romanian Academy of Sciences)
Cristina Vertan (University of Hamburg)
Dusko Vitas (University of Belgrade, Serbia)
Dr. Cristina Vertan
University of Hamburg / Department of Computer Science
Natural Language Systems Division
Vogt-Kölln Strasse 30
22527 Hamburg
Germany
phone: +4940428832519 / + 4940428384767
fax: +4940428832515
http://nats-www.informatik.uni-hamburg.de/~cri
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list