21.1335, FYI: Corpus Release: Corpus OVI dell'Italiano antico

linguist at LINGUISTLIST.ORG
Thu Mar 18 18:53:21 UTC 2010


LINGUIST List: Vol-21-1335. Thu Mar 18 2010. ISSN: 1068 - 4875.

Subject: 21.1335, FYI: Corpus Release: Corpus OVI dell'Italiano antico

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Monica Macaulay, U of Wisconsin-Madison  
Eric Raimy, U of Wisconsin-Madison  
Joseph Salmons, U of Wisconsin-Madison  
Anja Wanner, U of Wisconsin-Madison  
       <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, 
and donations from subscribers and publishers.

Editor for this issue: Elyssa Winzeler <elyssa at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.

===========================Directory==============================  

1)
Date: 18-Mar-2010
From: Giulio Vaccaro < piovanoarlotto at gmail.com >
Subject: Corpus Release: Corpus OVI dell'Italiano antico
 

	
-------------------------Message 1 ---------------------------------- 
Date: Thu, 18 Mar 2010 14:52:35
From: Giulio Vaccaro [piovanoarlotto at gmail.com]
Subject: Corpus Release: Corpus OVI dell'Italiano antico

  


A new version of the Corpus OVI dell'italiano antico is now available online!
After this update, the corpus comprises 1,978 texts with 21,817,929
words, 443,810 different word forms, 116,224 lemmas, and 3,615,478
lemmatized occurrences.

Corpus TLIO aggiuntivo
Texts that are not yet lemmatized and are awaiting inclusion in the Corpus OVI
have been gathered in an additional corpus, the Corpus TLIO aggiuntivo, which at
present contains 306 texts with 1,189,808 words and 71,900 different word
forms.

Archivio Datini
In collaboration with the Archivio di Stato of the Tuscan town of Prato,
OVI has developed a lemmatized database containing all published letters
in the archive of the great Tuscan merchant Francesco di Marco Datini
(1335-1410): 3,000 texts with 1,100,987 words, 50,139 different word forms,
7,591 lemmas, and 146,741 lemmatized occurrences.

Corpus ARTESIA
Corpus ARTESIA, created by the University of Catania, is hosted on the OVI
server. It consists of 239 early Sicilian texts, currently totaling 1,025,367
words.

Further information
www.vocabolario.org

Consiglio Nazionale delle Ricerche 
Institute Opera del Vocabolario Italiano
Firenze, via di Castello 46 
I-50141
tel. +39 055 452841 
fax +39 055 452843
e-mail ovi at ovi.cnr.it 



Linguistic Field(s): Text/Corpus Linguistics





 



