[Corpora-List] Deadline extension: Second Workshop on Annotation and Exploitation of Parallel Corpora
Kiril Simov
kivs at bultreebank.org
Tue Jun 28 07:48:47 UTC 2011
Second Workshop on Annotation and Exploitation of Parallel Corpora
(AEPC 2)
http://www.bultreebank.org/AEPC2/
RANLP 2011 Workshop
15th or 16th September, Hissar, Bulgaria
Deadline extension: 2nd July 2011
This workshop is a follow-up to the First Workshop on
Annotation and Exploitation of Parallel Corpora
http://math.ut.ee/tlt9/aepc/index.html
The creation of parallel corpora has been a very active area,
especially since the 1990s. Globalization, the enlargement of the
EU with new countries, and the availability of open information
sources such as Wikipedia, DBpedia, etc. have required a
multilingual approach to interpersonal and official
communication. This situation has produced a lot of
parallel data: especially administrative and political
documents in several languages (EuroParl), but also news
(SETIMES) and texts on various topics (Wikipedia, bi- and
multilingual web sites). However, the fast compilation of large
amounts of data has very often resulted in lower quality of
the parallel texts. The challenge is to discover the
inconsistencies in these huge quantities of parallel data, to
process them in adequate ways, and to exploit them for various
applications: QA, Information Retrieval, Machine Translation,
etc. Parallel corpora now go beyond word-to-word alignments;
they also rely on dependency, constituent or semantic pairings.
Guidelines and tools for aligning linguistic structures
have appeared, which raises the issue of the
transferability of alignment schemes from one language to
another, and of the compatibility among various
resources.
Topics
The topics of interest include, but are not limited to:
- Strategies for the creation of annotated parallel corpora
- Annotation guidelines for alignment
- Transfer of annotation alignment across languages
- Tools for manual and automatic processing and exploitation of parallel
corpora
- Problems in manual and automatic alignment
- Syntax-based and semantic-based approaches to using parallel corpora in MT
- Parallel Grammars
- Parallel Statistical Parsing
- Usability of the existing parallel resources for various applications
Organizers
Jörg Tiedemann (Uppsala University)
Kiril Simov (IICT, Bulgarian Academy of Sciences)
Petya Osenova (Sofia University “St. Kl. Ohridski”
and IICT, Bulgarian Academy of Sciences)
Radovan Garabik (JÚĽŠ, Slovak Academy of Sciences)
Important Dates
- Deadline for paper submission: 2nd July 2011
- Notification of acceptance: 29th July 2011
- Final version of the papers: 29th August 2011
Submission details
http://lml.bas.bg/ranlp2011/submissions.php
We ask for papers of 6 to 10 pages conforming to the RANLP
guidelines (http://lml.bas.bg/ranlp2011/submissions.php).
We welcome work-in-progress reports if they
contain at least preliminary results. Papers should be
submitted in PDF. Submissions should be made via the START
System: https://www.softconf.com/ranlp11/aepc2011/
Program Committee
Antonio Branco (University of Lisbon)
Nicoletta Calzolari (Institute of Computational Linguistics of the National
Research Council)
Koenraad De Smedt (University of Bergen)
Dan Flickinger (Stanford University)
Dale Gerdemann (University of Tübingen)
Voula Giouli (Institute for Language and Speech Processing)
Silvia Hansen (University of Mainz)
Erhard Hinrichs (University of Tübingen)
Valia Kordoni (University of Saarland)
Vladislav Kubon (Charles University)
Lothar Lemnitzer (Berlin-Brandenburg Academy of Sciences and Humanities)
Preslav Nakov (National University of Singapore)
Cristina Vertan (University of Hamburg)
Eline Westerhout (University of Utrecht)