25.336, Calls: Computational Ling, Historical Ling, Text/Corpus Ling, Ling & Lit/Iceland

Mon Jan 20 18:33:48 UTC 2014

LINGUIST List: Vol-25-336. Mon Jan 20 2014. ISSN: 1069 - 4875.

Subject: 25.336, Calls: Computational Ling, Historical Ling, Text/Corpus Ling, Ling & Lit/Iceland

Moderator: Damir Cavar, Eastern Michigan U <damir at linguistlist.org>

Reviews: 
Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Mateja Schuck, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Anna White <awhite at linguistlist.org>
================================================================  

Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/

Date: Mon, 20 Jan 2014 13:33:17
From: Cristina Vertan [cristina.vertan at uni-hamburg.de]
Subject: Language Resources and Technologies for Processing and Linking Historical Documents

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=25-336.html&submissionid=26395781&topicid=3&msgnumber=1

Full Title: Language Resources and Technologies for Processing and Linking Historical Documents 
Short Title: LRT4HDA 

Date: 26-May-2014 - 26-May-2014
Location: Reykjavik, Iceland 
Contact Person: Cristina Vertan
Meeting Email: cristina.vertan at uni-hamburg.de
Web Site: http://www.c-phil.uni-hamburg.de/view/Main/LTforHisLangArhives2014 

Linguistic Field(s): Computational Linguistics; Historical Linguistics; Ling & Literature; Text/Corpus Linguistics 

Call Deadline: 09-Feb-2014 

Meeting Description:

Recently, collaboration between the NLP community and specialists in various areas of the Humanities has become more efficient and fruitful due to the common aim of exploring and preserving cultural heritage data. It is worth mentioning the efforts made during the digitisation campaigns in the last years and within a series of initiatives in the Digital Humanities, especially in making old manuscripts available through Digital Libraries.

Given the number of contemporary languages and their historical variants, it is practically impossible to develop brand new language resources and tools for processing older texts. Therefore, the real challenge is to adapt existing language resources and tools, as well as to provide (where necessary) training material in the form of corpora or lexicons for a certain period of time in history. 

Another issue regarding historical documents is their usage after they are stored in digital libraries. Historical documents are not only browsed but together with adequate tools they may serve as basis for re-interpretation of historical facts, discovery of new connections, causal relations between events etc. In order to be able to make such analysis, historical documents should be linked among themselves, on the one hand, and with modern knowledge bases, on the other. Activities in the area of Linked Open Data (LOD) play a major role in this respect.

A particular type of historical documents are the newspaper collections and archives. Newspapers reflect what is going on in society, and constitute a rich data collection for many types of humanities research, ranging from history, political and social sciences to linguistics, both synchronic and diachronic, and both national and cross-national. They represent an important resource for analysis of changes at all levels which emerged in Europe with begin of the industrialization period. 

The aim of this workshop is to bring together researchers working in the interdisciplinary domain of cultural heritage, specialists in natural language and speech processing working with less-resourced languages as well as key players among Linked Open Data initiatives. They are expected to analyse problems and brainstorm solutions in the automatic analysis of historical documents, uni- or multimedia, their deep annotation and interlinking.

The workshop is organised in collaboration with CLARIN (http://www.clarin.eu).

2nd Call for Papers

Topics of Interest:

We are looking for contributions on original, unpublished work in the topic areas of the workshop, including (but not limited to) the following:

- Language  tools and resources for the analysis of older textual material
- Adaptation of language technology tools developed for modern languages to their historical variants transcription and transliteration problems and solutions
- Named Entity Recognition for historical texts
- Development of dedicated historical corpora and lexica as Linked Open Data
- (Semi-) automatic extraction of content related metadata
- Semantic linkage of heterogeneous data within digital libraries
 - Linkage of historical documents with available Linked Open Data
- Word sense disambiguation in old texts
- Multilingual issues in historical texts
- Applications concerning less resourced cultural heritage languages  such as Old Norse,  early Arabic, Ottoman Turkish, Old Church Slavonic, older forms of Balkan languages

A special track will be dedicated to the acquisition and analysis of historical newspaper archives. Submissions for this special track should address topics related to following aspects:

- Determining if, for a given language and period, digital (or digitised) newspapers exist at all
- Access rights to digital newspaper collection for research purposes, and for publishing the results
- OCR solutions and limitations 
- Extraction of articles from digitised archives
- Metadata annotation
- Showcases of successful humanities research projects based on digital or digitised newspapers 
- Publishing, sharing and storing results and by-products

Submission Details:

Submissions should be made through the system of the main LREC conference. Papers describing completed work should be no longer than eight pages. Papers describing work in progress should be between four and six pages. We encourage in particular the demonstration of prototype systems, and papers including reference to an existing prototype will be offered the possibility to demonstrate their system in a particular session.

Papers should respect the LREC formatting guidelines. Papers will be reviewed by minimum 3 members of the Programme Committee.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.), to enable their reuse, replicability of experiments, including evaluation ones.

Submissions for the workshop can be done using the following link: http://www.softconf.com/lrec2014/LRT4HDA/.

Papers dealing with processing of newspaper archives should be submitted to the ''Newspaper'' track; all others to ''main track''.

Important Dates:

Submission deadline:  9 February 2014
Notification of acceptance: 9 March 2014
Final papers due: 23 March 2014

Organizing Committee:

Kristín Bjarnadóttir (The Arni Magnusson Institute for Icelandic Studies
Iceland), 
Matthew Driscoll (Arnamagnean Institute, Copenhagen, Denmark), 
Steven Krauwer (CLARIN ERIC, Netherlands) 
Stelios Piperidis (ILSP, Athens, Greece), 
Cristina Vertan (University of Hamburg, Germany)
Martin Wynne (Oxford, UK)

Programme Committee: Please see the conference web site.

----------------------------------------------------------
LINGUIST List: Vol-25-336	
----------------------------------------------------------
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/