[Corpora-List] Treebank of Old Indo-European Languages

Dag Trygve Truslew Haug d.t.t.haug at ifikk.uio.no
Wed May 7 07:24:08 UTC 2008


At the University of Oslo we are currently developing a parallel 
treebank of the old Indo-European versions of the New Testament (Greek, 
Latin, Gothic, Armenian, Old Church Slavonic). As of today we have 
annotated 10037 sentences, but only 1122 of these have been reviewed by 
a second person.

We are now opening our site to external users at http://logos.uio.no:3000

Once you register, you will be able to browse our texts and search for 
word forms and lemmata. You can also view the syntactic and 
morphological annotation this way, but for the moment there is no way 
to search the morphology or syntax directly. XML dumps of our data are 
also available for download. Only reviewed data is made available to 
external users.

The project for which the treebank is being developed is described at 
http://www.hf.uio.no/ifikk/proiel/ where you will also find the 
guidelines for syntactic annotation (also available on the corpus site) 
and other material.

We will start work on the other versions (Gothic, Armenian, Old Church 
Slavonic) very soon, and we would be grateful for any pointers to 
existing resources that could be useful to us.

---------------------------------------------
Dag Haug
Associate Professor of Latin
Department of Philosophy, Classics, History of Arts and Ideas
PO Box 1020 Blindern
N-0315 Oslo
Norway

daghaug at ifikk.uio.no

www.hf.uio.no/ifikk/proiel


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


