17.2698, Software: ELRA Language Resources Catalogue Update 09/06

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Thu Sep 21 14:23:07 UTC 2006


LINGUIST List: Vol-17-2698. Thu Sep 21 2006. ISSN: 1068 - 4875.

Subject: 17.2698, Software: ELRA Language Resources Catalogue Update 09/06

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
 
Reviews: Laura Welcher, Rosetta Project / Long Now Foundation  
         <reviews at linguistlist.org> 

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Svetlana Aksenova <svetlana at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.


===========================Directory==============================  

1)
Date: 21-Sep-2006
From: Helene Mazo < mazo at elda.org >
Subject: ELRA Language Resources Catalogue Update 09/06 

	
-------------------------Message 1 ---------------------------------- 
Date: Thu, 21 Sep 2006 10:20:51
From: Helene Mazo < mazo at elda.org >
Subject: ELRA Language Resources Catalogue Update 09/06 
 

Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks.


We are happy to announce that new Written Language Resources are now
available in our catalogue.

*** ELRA-L0072 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon ***
PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been
elaborated over three different projects. The PAROLE-SIMPLE-CLIPS Pisa
Italian Lexicon comprises a total of 387,267 phonetic units, 53,044
morphological units (53,044 lemmas), 37,406 syntactic units (28,111 lemmas)
and 28,346 semantic units (19,216 lemmas). The PAROLE-SIMPLE-CLIPS Pisa
Italian Lexicon was encoded at the semantic level, in full accordance with
the international standards set out in the PAROLE-SIMPLE model and based on
EAGLES. Syntactic and semantic encoding were performed jointly with Thamus
(Consortium for Multilingual Documentary Engineering), which is responsible
for 25,000 extra entries (to be released soon).
This lexicon is subdivided into five different subsets:
L0072-01 Full lexicon
L0072-02 Phonetic layer
L0072-03 Morphological layer
L0072-04 Syntactic layer
L0072-05 Semantic layer
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=881&language=en

*** ELRA-W0043 PAROLE Italian Corpus ***
The PAROLE Italian Corpus comprises 3,135,651 words collected from four
different domains: newspapers (2,179,800 words), periodicals (143,810
words), books (564,964 words), miscellaneous (247,077 words). Data are
morphosyntactically annotated and lemmatized.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=886&language=en

*** ELRA-W0044 Italian Syntactic-Semantic Treebank (ISST) ***
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=887&language=en


For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org 


Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics

Subject Language(s): Italian (ita)





-----------------------------------------------------------
LINGUIST List: Vol-17-2698	

	



More information about the LINGUIST mailing list