[Corpora-List] Deadline Extension: LANGUAGE TECHNOLOGY FOR CLOSELY-RELATED LANGUAGES AND LANGUAGE VARIANTS
Cristina Vertan
cristina.vertan at uni-hamburg.de
Thu Jul 31 06:56:34 UTC 2014
Appologies for cross-posting
================================================================
DEADLINE EXTENSION 2 AUGUST 2014
LANGUAGE TECHNOLOGY FOR CLOSELY-RELATED LANGUAGES AND
LANGUAGE VARIANTS
Workshop associated with EMNL 2014
29 October 2014, Doha, QATAR
http://emnlp2014.org/workshops/LT4CloseLang/call.html
================================================================
MOTIVATION
Recent initiatives in language technology have lead to the development
of at least minimal language processing toolkits for all EU-official
languages, as well as for languages with a large number of speakers
worldwide such as Chinese and Arabic. This is a big step towards
the automatic processing and/or extraction of information, especially
from official documents and newspapers, where the standard, literary
language is used.
Apart from those official languages, a large number of dialects or
closely-related language variants are in daily use, not only as spoken
colloquial languages but also in written media
and social networks.
Building language resources and tools from scratch is expensive, but
the efforts can often be reduced by making use of pre-existing
resources and tools for related, resource-richer languages. Examples
of language variants include the different variants of Spanish in Latin
America, the Arabic dialects in North Africa and the Middle East,
German in Germany, Austria and Switzerland, French in France and in
Belgium, Dutch in the Netherlands and Flemish in Belgium, etc.
Examples of pairs of related languages include Swedish-Norwegian,
Bulgarian-Macedonian, Serbian-Bosnian, Spanish-Catalan,
Russian-Ukrainian, Irish-Gaelic Scottish, Malay-Indonesian,
Turkish?Azerbaijani, Mandarin-Cantonese, Hindi?Urdu, and
many other.
This workshop intends to bring together specialists working on
LT-Applications dealing with various related language pairs, discuss
novel approaches in exploring language closeness, and raise attention
on this particular topic. A previous version of this workshop was
organised at
RANLP 2013 and showed a great interest from communities worldwide as
well as the necessity for further activities.
SUBMISSION
We are looking for original unpublished work related (but not limited
to) following topics
- Adaptation of monolingual tools for closely-related languages and
language variants
- Case studies of using language resources and tools for standard
languages on documents
in language variants
- Machine translation among closely related languages
- Evaluation of language resources and tools for language variants and
close languages.
Linguistic issues in adaptation of language resources and tools (e.g.,
semantic
discrepancies, lexical gaps, false friends)
Submission should be done using START:
https://www.softconf.com/emnlp2014/LT4CloseLang14
Papers should be up to 9 pages long and should follow the formatting
instructions for EMNLP'2014 under:
http://emnlp2014.org/submissions.html
IMPORTANT DEADLINES
Submission deadline (EXTENDED): AUGUST 2, 2014, 11:59 p.m. PST
Acceptance/rejection notification: August 26, 2014
Camera-ready deadline: September 12, 2014, 11:59 p.m. PST
Workshop: October 25 or 29, 2014
ORGANISING COMMITTEE
Preslav Nakov, Qatar Computing Research Institute
Petya Osenova, Bulgarian Academy of Sciences
Cristina Vertan, University of Hamburg
PROGRAMME COMMITTEE
Laura Alonso y Alemany (Univeristy of Cordoba, Argentina)
César Antonio Aguilar (Pontificia Universidad Católica de Chile,
Santiago de Chile, Chile)
José Castaño (University of Buenos Aires, Argentina)
David Chiang (University of Southern California, USA)
Marta Costa-Jussà (Institute for Infocomm Research, Singapore)
Walter Daelemans (University of Antwerp, Belgium)
Kareem Darwish (Qatar Computing Research Institute, Qatar)
Tomaz Erjavec (Jozef Stefan Institute, Slovenia)
Maria Gavrilidou (ILSP, Greece)
Francisco Guzman (Qatar Computing Research Institute, Qatar)
Barry Haddow (University of Edinburgh, UK)
Nizar Habash (Columbia University, USA)
Walther v. Hahn (University of Hamburg,Germany)
Francisco Guzman Herrera (Qatar Computing Research Institute, Qatar)
Cvetana Krstev (University of Belgrade, Serbia)
Vladislav Kubon (Charles University Prague, Czech Republic)
Thang Luong Minh (Stanford university, USA)
John Nerbonne (University of Groningen, Netherlands)
Graham Neubig (Nara Institute of Science and Technology, Japan)
Kemal Oflazer (Carnegie-Mellon University, Qatar)
Maciej Ogrodniczuk (IPAN, Polish Academy of Sciences, Poland)
Slav Petrov (Google, New York, USA)
Stefan Riezler (University of Heidelberg, Germany)
Laurent Romary (INRIA, France)
Hassan Sajjad (Qatar Computing Research Institute, Qatar)
Kiril Simov (Bulgarian Academy of Sciences)
Milena Slavcheva (Bulgarian Academy of Sciences)
Marco Tadic (University of Zagreb, Croatia)
Jörg Tiedemann (Uppsala University, Sweden)
Dusko Vitas (University of Belgrade, Serbia)
Stephan Vogel (Qatar Computing Research Institute, Qatar)
Pidong Wang (National University of Singapore, Singapore)
Taro Watanabe (NICT, Japan)
--
============================================================
Dr. Cristina Vertan
Arbeitsstelle "Computerphilologie"
und
AB. Natürlichsprachliche Systeme (NATS)
vogt-Kölln Strasse 30
22527 Hamburg
Raum/room F-534b
tel: +49 40 42883 2319
fax: +49 40 42883 2385
http://nats-www.informatik.uni-hamburg.de/CristinaVertan
============================================================
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list