[Corpora-List] Second Call for Papers: ACL Workshop on Balto-Slavonic NLP and IE
Ralf Steinberger
ralf.steinberger at jrc.it
Fri Mar 2 08:01:56 UTC 2007
SECOND CALL FOR PAPERS
------------------------
Balto-Slavonic Natural Language Processing 2007 (BSNLP 2007)
with Special Theme: Information Extraction and Enabling Technologies
June 29, 2007
Prague, Czech Republic
http://langtech.jrc.it/BSNLP2007
BSNLP will be held in conjunction with the ACL 2007 conference
(http://ufal.mff.cuni.cz/acl2007/)
and is co-organised by the European Commission's Joint Research Centre.
TOPIC AND MOTIVATION:
The recent political and economic changes in Central and Eastern Europe
and the related on-going enlargement of the European Union brings into focus
new
cultures and languages. Among them, the languages from the Balto-Slavonic
group have an outstanding role because of their rich cultural heritage and
the
widespread use - over 400 million speakers.
The topic of the workshop addresses Natural Language Processing (NLP) for
the
Balto-Slavonic languages, with the focus on Information Extraction (IE) and
enabling technologies for this language family. The task of IE is to
identify a
predefined set of concepts from natural language text. The spectrum of IE
tasks ranges from named-entity recognition, through relation extraction and
co-reference resolution to the identification of complex events and
cross-document entity profile extraction. Although a considerable
amount of IE-related work exists, most of the studies are concentrated on a
few major languages. Research on this topic, as well as on general-purpose
NLP
tools in the context of Balto-Slavonic languages, is still in its early
stage and is
progressing relatively slowly. Due to some specific phenomena like the
highly inflectional
character and relatively free word order, a construction of IE systems and
other
language processing tools (question-answering, text summarization, machine
translation)
for these languages is an intriguing and challenging task.
This workshop can be seen as the follow-up to the successful workshop on
Information Extraction for Slavonic and Other Central and Eastern European
Languages (http://lml.bas.bg/IESL2003) held in conjunction with the RANLP
2003
conference. It is also related to the EACL 2003 workshop on Morphological
Processing of Slavic Languages (http://nl.ijs.si/mpsl03). In particular, we
would
strongly encourage submissions describing systems, resources or solutions
that
are made available to the wider public, as these would help to promote
computational linguistics applications for these languages.
AREAS OF INTEREST include, but are not limited to:
A. Specific challenges for Balto-Slavonic NLP, in particular in the context
of IE
and underlying technologies
- text segmentation
- morphological analysis
- morphology models
- morpho-syntactic disambiguation
- named-entity recognition
- named-entity disambiguation (e.g., geo-referencing)
- named-entity lemmatisation
- term and keyword extraction
- name variant recognition and merging
- syntactic parsing and chunking
- co-reference resolution
- word sense disambiguation
- corpus-based knowledge acquisition
B. Multilingual IE frameworks and techniques applied to these languages
- tools and resources (freely available for research purposes will be
preferred)
- experience with, and evaluation of, linguistic data and processing
resources
- comparative evaluation between languages
C. IE solutions for these languages:
- scenario template filling / event extraction
- relation extraction
- automatic pattern learning
- corpus studies and statistical techniques for IE
- IE from Web sources
- IE-based ontology population
- IE evaluation
- IE techniques for Question/Answering and Answer Extraction
- Utilisation of IE-based techniques in other NLP applications
INTENDED AUDIENCE
The goal of this workshop is to bring together researchers and practitioners
working on NLP for Balto-Slavonic languages, in particular on IE and core
technologies supporting IE for these languages. The workshop will
give an opportunity to exchange ideas and experience, to discuss
hard-to-tackle problems in this field of research, and to make available
resources more widely known.
SUBMISSION
Papers should describe original work and should indicate the state of
completion of the reported results. In particular, an overlap with
previously
published work should be clearly mentioned. Submissions will be
judged on correctness, novelty, technical strength, clarity of presentation,
usability, and significance/relevance to the workshop.
Submissions should follow the two-column format of the ACL 2007
main-conference proceedings and should not exceed eight (8) pages,
including references. We recommend to use either the LaTeX style file
or the Microsoft-Word style file, which can be found at
http://ufal.mff.cuni.cz/acl2007/styles.
The reviewing will be blind. Therefore, the paper should not include the
authors'
names and affiliations. Furthermore, self-citations and other references
that could
reveal the author's identity should be avoided.
Submission will be electronic. The only accepted format for submitted papers
is Adobe PDF. Papers must be submitted no later than April 1, 2006
using the submission webpage http://langtech.jrc.it/BSNLP2007/submission.
Submissions will be reviewed by 3 members of the Program Committee.
Authors of accepted papers will receive guidelines regarding how to produce
camera-ready versions of their papers for inclusion in the ACL workshop
proceedings.
IMPORTANT DATES
Workshop Paper Submission deadline: April 1
Notification of Acceptance: April 25
Camera-ready Version: May 9
Workshop: June 29, 2007.
LOCATION
Prague, the capital of the Czech Republic, is located in the centre of
Europe. It
is one of the most valuable historical city reserves in Europe. The
historical core
of the city is listed in the UNESCO World Cultural and Natural Heritage
Register.
The workshop itself will take place in the TOP HOTEL Praha, located in the
quiet
neighbourhood of the Prague 4 district, only 15-20 minutes from the historic
centre
of Prague.
Prague is easily reachable by car, bus or train from Central Europe (only
3-hour
drive from Vienna or Budapest or 4 hours from Berlin or Munich), by cheap
flights
from the rest of Europe, and by several direct flights from overseas.
FURTHER INFORMATION
For further information please write to bsnlp2007 at jrc.it
or check the workshop web page http://langtech.jrc.it/BSNLP2007.
PROGRAM COMMITTEE
Tania Avgustinova (University of Saarland / DFKI, Germany)
Kalina Bontcheva (University of Sheffield, UK)
Tomaz Erjavec (Jozef Stefan Institute, Slovenia)
Vaclav Kubon (Charles University Prague, Czech Republic)
Anna Kupsc (Loria, France)
Ruta Marcinkeviciene (Vytautas Magnus University, Kaunas, Lithuania)
Agnieszka Mykowiecka (Polish Academy of Sciences, Poland)
Jakub Piskorski (Joint Research Centre, Italy)
Bruno Pouliquen (Joint Research Centre, Italy)
Hristo Tanev (Joint Research Centre, Italy)
Marko Tadic (University of Zagreb, Croatia)
Agata Savary (University of Tours, France)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Wojciech Skut (Google Inc., USA)
Ralf Steinberger (Joint Research Centre, Italy)
Dusko Vitas (University of Beograd, Serbia)
Roman Yangarber (University of Helsinki, Finland)
PROGRAM COMMITTEE CHAIR
Jakub Piskorski (Joint Research Centre, Italy)
Hristo Tanev (Joint Research Centre, Italy)
ORGANIZING COMMITTEE
Jakub Piskorski (Joint Research Centre, Italy)
Bruno Pouliquen (Joint Research Centre, Italy)
Hristo Tanev (Joint Research Centre, Italy)
Ralf Steinberger (Joint Research Centre, Italy)
More information about the Corpora
mailing list