21.1406, FYI: New Testament Corpus Available

linguist at LINGUISTLIST.ORG
Tue Mar 23 16:07:48 UTC 2010


LINGUIST List: Vol-21-1406. Tue Mar 23 2010. ISSN: 1068 - 4875.

Subject: 21.1406, FYI: New Testament Corpus Available

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Monica Macaulay, U of Wisconsin-Madison
         Eric Raimy, U of Wisconsin-Madison
         Joseph Salmons, U of Wisconsin-Madison
         Anja Wanner, U of Wisconsin-Madison
         <reviews at linguistlist.org>

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Elyssa Winzeler <elyssa at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.

===========================Directory==============================  

1)
Date: 22-Mar-2010
From: Dag Haug < daghaug at ifikk.uio.no >
Subject: New Testament Corpus Available
 

	
-------------------------Message 1 ---------------------------------- 
Date: Tue, 23 Mar 2010 12:06:39
From: Dag Haug [daghaug at ifikk.uio.no]
Subject: New Testament Corpus Available



The PROIEL project has now completed a morphologically tagged and
syntactically parsed corpus of the Ancient Greek text of the Gospels, based
on Ulrik Sandborg-Petersen's electronic version of Tischendorf's edition.

The corpus is available for browsing after (free) registration at

http://foni.uio.no:3000 

At the moment there is no query interface for morphology and syntax, but it
is possible to download the raw data in various XML-based formats, including
TIGER-XML for querying in TIGERSearch.
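
For readers who would rather inspect a TIGER-XML export in a script than in
TIGERSearch, the following is a minimal sketch in Python using only the
standard library. It assumes the general TIGER-XML layout (sentences as <s>
elements, tokens as <t>, phrase nodes as <nt> with <edge> children). The
sample data, the feature names "word" and "pos", the edge labels, and the
function name read_sentences are illustrative assumptions, not taken from
the PROIEL export itself.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the general TIGER-XML layout. Feature names,
# tag values and edge labels are illustrative only; a real export may
# use different attributes.
SAMPLE = """<corpus id="demo">
  <body>
    <s id="s1">
      <graph root="s1_500">
        <terminals>
          <t id="s1_1" word="en" pos="PREP"/>
          <t id="s1_2" word="arkhe" pos="NOUN"/>
        </terminals>
        <nonterminals>
          <nt id="s1_500" cat="PP">
            <edge label="HD" idref="s1_1"/>
            <edge label="OBL" idref="s1_2"/>
          </nt>
        </nonterminals>
      </graph>
    </s>
  </body>
</corpus>"""


def read_sentences(xml_text):
    """Return one dict per <s> element with its tokens and labelled edges."""
    root = ET.fromstring(xml_text)
    sentences = []
    for s in root.iter("s"):
        # Each terminal carries its annotation as XML attributes.
        tokens = [dict(t.attrib) for t in s.iter("t")]
        # Each edge links a nonterminal (parent) to a child node by id.
        edges = [
            (nt.get("id"), e.get("label"), e.get("idref"))
            for nt in s.iter("nt")
            for e in nt.findall("edge")
        ]
        sentences.append({"id": s.get("id"), "tokens": tokens, "edges": edges})
    return sentences


if __name__ == "__main__":
    for sent in read_sentences(SAMPLE):
        print(sent["id"], [t.get("word") for t in sent["tokens"]])
        for parent, label, child in sent["edges"]:
            print("  ", parent, "-" + str(label) + "->", child)
```

For a downloaded file, ET.parse(path).getroot() could replace ET.fromstring;
for very large exports, ET.iterparse would avoid loading the whole tree at
once.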

The Greek Gospel text is part of the PROIEL corpus, which contains several
old Indo-European Bible translations (Latin, Gothic, Old Church Slavic and
Armenian) and some non-biblical texts. All the texts in the corpus can be
browsed, but external users can see only those morphological and syntactic
analyses that have been checked after annotation. Currently that
includes:

1. The complete Greek Gospels and scattered sentences from the rest of the NT.
2. The whole Gospel of Mark + Luke 1-22 in Jerome's Latin translation, as
well as scattered sentences from the rest of the NT.
3. The whole of Mark and Luke + Matthew 5-19 from the Codex Marianus (Old
Church Slavic).
4. The extant parts of Mark, Luke and Matthew in Gothic.

The publicly available corpus currently contains 18,688 sentences.

We are constantly cross-checking and reviewing new sentences, and new
exports are generated daily. We expect the Gospel texts to be
available in all four languages before the summer, but we need help to
complete the Armenian.

The corpus is made available under the Creative Commons
Attribution-Noncommercial-Share Alike 3.0 license.

More information about the PROIEL project is available at:
www.hf.uio.no/ifikk/proiel

Dag Haug 



Linguistic Field(s): Historical Linguistics
                     Text/Corpus Linguistics





 



