22.4436, Confs: Text/Corpus Ling, Ling & Literature, Comp Ling/Germany

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Mon Nov 7 15:35:35 UTC 2011


LINGUIST List: Vol-22-4436. Mon Nov 07 2011. ISSN: 1069 - 4875.

Subject: 22.4436, Confs: Text/Corpus Ling, Ling & Literature, Comp Ling/Germany

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin-Madison
Monica Macaulay, U of Wisconsin-Madison
Rajiv Rao, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.

Editor for this issue: Amy Brunett <brunett at linguistlist.org>
================================================================  

LINGUIST is pleased to announce the launch of an exciting new feature:  
Easy Abstracts! Easy Abs is a free abstract submission and review facility 
designed to help conference organizers and reviewers accept and process 
abstracts online.  Just go to: http://www.linguistlist.org/confcustom, and 
begin your conference customization process today! With Easy Abstracts, 
submission and review will be as easy as 1-2-3!

===========================Directory==============================  

1)
Date: 07-Nov-2011
From: Marco Passarotti [marco.passarotti at unicatt.it]
Subject: Annotation of Corpora for Research in the Humanities


-------------------------Message 1 ---------------------------------- 
Date: Mon, 07 Nov 2011 10:35:14
From: Marco Passarotti [marco.passarotti at unicatt.it]
Subject: Annotation of Corpora for Research in the Humanities

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=22-4436.html&submissionid=4535475&topicid=4&msgnumber=1
 
Annotation of Corpora for Research in the Humanities 
Short Title: ACRH 

Date: 05-Jan-2012 - 05-Jan-2012 
Location: Heidelberg, Germany 
Contact: Marco Passarotti 
Contact Email: marco.passarotti at unicatt.it 
Meeting URL: http://www.coli.uni-saarland.de/conf/ACRH10/ 

Linguistic Field(s): Computational Linguistics; Ling & Literature; Text/Corpus Linguistics 

Meeting Description: 

The workshop on 'Annotation of Corpora for Research in the Humanities' will be held on January 5, 2012 at the University of Heidelberg (Germany) (http://www.coli.uni-saarland.de/conf/ACRH10/). 

The workshop aims at building a tighter collaboration between people working in various areas of the Humanities (such as literature, philology, history etc.) and the research community involved in developing, using and making accessible annotated corpora.

Addressing topics related to annotated corpora for research in the Humanities is an interdisciplinary task, which involves corpus and computational linguists (mostly those working in literary computing), philologists, scholars in the Humanities and computer scientists. However, this interdisciplinarity is not fully realised yet. Indeed, philologists and scholars are not used to exploit NLP tools and language resources such as annotated corpora; in turn, computational linguists are more prone to develop language resources for NLP purposes only. For instance, although many corpora that play a relevant role for research in Humanities are today available in digital format (theatrical plays, contemporary novels, critical literature, literary reviews etc.), only a few of them are linguistically tagged, while most still lack linguistic tagging at all. Historical corpora are also a case of special interest, since their creation demands a strong interplay between computational linguistics and more traditional scholarship. Over the past few years a number of historical annotated corpora have been started, among which are treebanks for Middle, Early Modern and Old English, Early New High German, Medieval Portuguese, Ugaritic, Latin, Ancient Greek and several translations of the New Testament into Indo-European languages. The experience of these ever-growing groups of projects can provide many suggestions on the methodology as well as on the practice of interaction between literary studies, philology and corpus linguistics. Moreover, we believe that a tighter collaboration between people working in the Humanities and the research community involved in developing annotated corpora is needed since, while annotating a corpus from scratch still remains a labor-intensive and time-consuming task, today this is simplified by intensively exploiting prior experience in the field.

The workshop will be co-located with the Tenth International Workshop on Treebanks and Linguistic Theories (TLT10), which will be held on January 6-7, 2012 (http://tlt10.cl.uni-heidelberg.de).

Invited Speaker:

Gregory Crane (Tufts University, Boston, USA) 

Workshop Program:

9:15- 9:30 
Opening

9:30-10:30 
Invited lecture: Greg Crane 

10:30-11:00 
Coffee break

11:00-11:30 
Stefanie Dipper:
Morphological and Part-of-Speech Tagging of Historical Language Data: A Comparison

11:30-12:00 
Iris Hendrickx and Rita Marquilhas: From Old Texts to Modern Spellings: An Experiment in Automatic Normalisation

12:00-12:30 
Eiríkur Rögnvaldsson, Anton Karl Ingason, Einar Freyr Sigurðsson and Joel C. Wallenberg: Creating a Dual-purpose Treebank

12:30-14:00 
Lunch

14:00-14:30 
Kristin Bech and Kristine Eide: The Annotation of Morphology, Syntax and Information Structure in a Multilayered Diachronic Corpus

14:30-15:00
Asif Ekbal, Francesca Bonin, Sriparna Saha, Egon Stemle, Eduard Barbu, Fabio Cavulli, Christian Girardi, Filippo Nardelli and Massimo Poesio: Rapid Adaptation of NE Resolvers for Humanities Domains using Active Annotation

15:00-15:30 
Maria Sukhareva, Zahurul Islam, Armin Hoenen and Alexander Mehler: A Three-step Model of Language Detection in Multilingual Ancient Texts

15:30-16:00 
Coffee break

16:00-16:30 
Cerstin Mahlow and Britta Juska-Bacher: Exploring New High German Texts for Evidence of Phrasemes

16:30-17:00
Michael Piotrowski and Stefan Höfler: Building Corpora for the Philological Study of Swiss Legal Texts

17:00-17:45 
Poster session
Massimo Manca, Linda Spinazzé, Luigi Tessarolo, Paolo Mastandrea and Federico Boschetti: Musisque Deoque: Text Retrieval on Critical Editions
Timo Korkiakangas and Marco Passarotti: Challenges in Annotating Medieval Charters in Latin
Dain Kaplan, Ryu Iida, Kikuko Nishina and Takenobu Tokunaga: Slate -- A Tool for Creating and Maintaining Annotated Corpora
Voula Giouli: Annotating Corpora from Various Sources in the Humanities Domain: Shortcomings and Issues

17:45-18:00 
Closing






-----------------------------------------------------------
LINGUIST List: Vol-22-4436	
----------------------------------------------------------



More information about the LINGUIST mailing list