25.4307, Media: Treebank of Early New High German

The LINGUIST List via LINGUIST linguist at listserv.linguistlist.org
Thu Oct 30 00:53:39 UTC 2014



LINGUIST List: Vol-25-4307. Wed Oct 29 2014. ISSN: 1069 - 4875.

Subject: 25.4307, Media:  Treebank of Early New High German

Moderators: Damir Cavar, Indiana U <damir at linguistlist.org>
            Malgorzata E. Cavar, Indiana U <gosia at linguistlist.org>

Reviews: reviews at linguistlist.org
Anthony Aristar <aristar at linguistlist.org>
Helen Aristar-Dry <hdry at linguistlist.org>
Sara Couture, Indiana U <sara at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Malgorzata Cavar <gosia at linguistlist.org>
================================================================  

Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					

Date: Wed, 29 Oct 2014 20:53:04
From: Ulrike Demske [udemske at uni-potsdam.de]
Subject: Treebank of Early New High German

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=25-4307.html&submissionid=35962177&topicid=21&msgnumber=1
 
We recently released the Mercurius Treebank of Early New High German
(1350-1650) to the public. It is being hosted by the INESS treebanking
infrastructure (http://iness.uib.no) and is subject to the CC-BY license.

The Mercurius Treebank is a syntactically annotated corpus of the early
newspapers 'Mercurius' and 'Annus Christi', published in 1667 and 1597,
respectively. It comprises a total of 170.000 tokens and 8.400 syntactically
annotated sentences. It was annotated using a hybrid annotation scheme based
on TIGER, which combines both dependency and constituency information. Each
text segment was independently annotated by two annotators using the @nnotate
tool (Brants and Plaehn, 2000).

The Mercurius Treebank can be found on the INESS website (http://iness.uib.no)
by navigating to 'Treebank selection' -> German -> deu-mercurius-con, and then
selecting 'Sentence Overview' after accepting the license agreement. It may be
necessary to create an OpenIdP account first.

The treebank is searchable with an extended TIGERSearch syntax (Meurer, 2012).
For example, the search query [word=''Kirche''] will return all sentences
which contain the word ''Kirche'', while the search query [cat=''PP''] will
return all sentences which contain a prepositional phrase node.

The present corpus has been compiled in 2003 to 2005 at Saarland University as
a pilot to a much larger project aiming at the establishment of a
syntactically annotated reference corpus for the period of Early New High
German (1350-1650) as a whole 
(http://www.uni-potsdam.de/guvdds/projekte/aktproj.html).
 


Linguistic Field(s): Historical Linguistics

Subject Language(s): German (deu)






----------------------------------------------------------
LINGUIST List: Vol-25-4307	
----------------------------------------------------------
Visit LL's Multitree project for over 1000 trees dynamically generated
from scholarly hypotheses about language relationships:
          http://multitree.linguistlist.org/
					
					



    


More information about the LINGUIST mailing list