[Corpora-List] Treebank of Old Indo-European Languages
Dag Trygve Truslew Haug
d.t.t.haug at ifikk.uio.no
Wed May 7 07:24:08 UTC 2008
At the University of Oslo we are currently developing a parallel
treebank of the old Indo-European versions of the New Testament (Greek,
Latin, Gothic, Armenian, Old Church Slavonic). As of today we have
annotated 10037 sentences, but only 1122 of these have been reviewed by
a second person.
We are now opening our site for external users at http://logos.uio.no:3000
When you register you will be able to browse our texts and to search for
word forms and lemmata. You can also access the syntactic and
morphological annotation this way, but for the moment there is no way
to search the morphology or syntax directly. You can also download
XML dumps of our data. Only reviewed data is made available to external
users.
The project for which the treebank is being developed is described at
http://www.hf.uio.no/ifikk/proiel/ where you will also find the
guidelines for syntactic annotation (also available on the corpus site)
and other material.
We will start work on the other versions (Gothic, Armenian, Old Church
Slavonic) very soon, and we are grateful for any hints about extant
resources which could be useful for us.
---------------------------------------------
Dag Haug
Associate Professor of Latin
Department of Philosophy, Classics, History of Arts and Ideas
PO Box 1020 Blindern
N-0315 Oslo
Norway
daghaug at ifikk.uio.no
www.hf.uio.no/ifikk/proiel
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora