[Corpora-List] CfP Extension LREC 2012 Workshop Processing Cultural Heritage Objects

Milena Slavcheva milena at lml.bas.bg
Fri Feb 24 13:15:48 UTC 2012


  FINAL CALL for PAPERS - EXTENDED DEADLINE

ADAPTATION OF LANGUAGE RESOURCES AND TOOLS FOR PROCESSING CULTURAL 
HERITAGE OBJECTS

26 May 2012, Istanbul, Turkey
(morning session)

Workshop associated with LREC 2012 (21-27 May, 2012)

http://www.c-phil.uni-hamburg.de/view/Main/LrecWorkshop2012

Recently, the collaboration between the NLP community and the 
specialists in various areas of the Humanities has become more efficient 
and fruitful due to the common aim of exploring and preserving cultural 
heritage data. It is worth mentioning the efforts made during the 
digitization campaigns in the last years and within a series of 
initiatives in the Digital Humanities, especially in making Old 
Manuscripts available through Digital Libraries.

Most parts of these libraries are made available not only to researchers 
in a certain Humanities domain (such as, classical philologists, 
historians, historical linguists), but also to common users. This fact 
has posited new requirements to the functionalities offered by the 
Digital Libraries, and thus imposed the usage of methods from Language 
Technology for content analysis and content presentation in a form 
understandable to the end user.

There are several challenges related to the above mentioned issues:

  * Lack of adequate training material for real-size applications:
    although the Digital Libraries usually cover a large number of
    documents, it is difficult to collect a statistically significant
    corpus for a period of time in which the language remained unchanged.
  * In most cases the language historical variants lack firmly
    established syntactic or morphological structures and that makes the
    definition of a robust set of rules extremely difficult. Historical
    texts often constitute a mixture of several languages including
    Latin, Ancient Greek, Slavonic, etc.
  * Historical texts contain a great number of abbreviations, which
    follow different models.
  * The conception of the world is somewhat different from ours (that
    is, different thinking about the Earth, different views in medicine,
    astronomy, etc.), which makes it more difficult to build the
    necessary knowledge bases.

Having in mind the number of contemporary languages and their historical 
variants, it is practically impossible to develop brand new language 
resources and tools for processing older texts. Whenever possible a 
solution, but at the same time a real challenge, is to investigate how 
to adapt existing language resources and tools, as well as to provide 
(where necessary) training material in the form of corpora or lexicons 
for a certain period of time in history.


We are looking for submission of original, unpublished work, related to 
the following topics:


  * Language tools and resources for analysis of old textual material or
    language variants
  * Adaptation of LT-tools, developed for modern languages, to the
    historical variants of the same languages
  * Transcription and transliteration problems and solutions
  * Named Entity recognition for historical texts
  * Development of dedicated historical corpora and lexica
  * (Semi-) automatic extraction of content related metadata
  * Semantic linkage of heterogeneous data within digital libraries
  * Word sense disambiguation in old texts
  * Multilingual issues in historical documents
  * Evaluation of tools for processing of historical texts


Submission details

Submissions have to be made through the START system of the main LREC 
2012 conference at:
https://www.softconf.com/lrec2012/CulturalHeritage2012/


Papers describing completed work should be no longer than eight pages. 
Papers describing work in progress should be between four and six 
pages.  The demonstration of prototype systems is particularly 
encouraged. The authors of papers referring to an existing prototype 
will be offered the possibility to demonstrate the system in a special 
session.

Papers should follow the LREC formatting guidelines.  Papers will be 
reviewed by minimum 3 members of the Programme Committee.

When submitting a paper from the START page, authors will be asked to 
provide essential information about resources (in a broad sense, i.e. 
also technologies, standards, evaluation kits, etc.) that have been used 
for the work described in the paper or are a new result of their research.

For further information on this new initiative, please refer to:
http://www.lrec-conf.org/lrec2012/?LRE-Map-2012


Important dates

Deadline for paper submission (EXTENDED):    3rd March 2012
Notification of acceptance / rejection:                   14th March 2012
Submission of final papers:                                    28th 
March 2012
Workshop:                                                                 26 
May 2012

Programme Committee

David Baumann (School of Computer Science, Carnegie Mellon, USA)
Walter Daelemans (University of Antwerp, Belgium)
Günther Görz (University Erlangen, Germany)
Walther v. Hahn (University of Hamburg, Germany)
Piroska Lendvai (Hungarian Academy of Sciences, Hungary)
Anke Lüdeling (Humboldt University , Berlin, Germany)
Gábor Prószéky (MorphoLogic, Hungary)
Laurent Romary (LORIA-INRIA, Nancy, France)
Éric Laporte (Université Paris-Est Marne-la-Vallée, France)
Kiril Simov (IICT, Bulgarian Academy of Sciences, Bulgaria)
Manfred Thaler (Cologne University, Germany, Germany)
Tamás Váradi (Hungarian Academy of Sciences, Hungary)
Martin Wynne (University of Oxford)
Kalliopi Zervanou (University of Tilburg, the Netherlands)

Organisation Committee

Petya Osenova(Bulgarian Academy of Sciences)
Stelios Piperidis (ILSP, Greece)
Milena Slavcheva (Bulgarian Academy of Sciences)
Cristina Vertan (University of Hamburg)

  For any additional details concerning the workshop, please, contact 
Cristina Vertan (cristinaDOTvertanATuni-hamburgDOTde)

-- 
Dr. Milena Slavcheva

Linguistic Modeling Department
Institute of Information and Communication Technologies
Bulgarian Academy of Sciences
2, Acad. G. Bonchev St.
1113 Sofia, Bulgaria

Phone:    +359 2 979 3269
E-mail:   milena at lml.bas.bg
Web page: http://lml.bas.bg/~milena

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120224/f0219db9/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list