[Corpora-List] ERRATUM: ELRA - Language Resources Catalogue - Update

ELDA info at elda.org
Tue Jun 20 12:54:57 UTC 2006


ERRATUM: This announcement was posted earlier today with a faulty 
layout. The current posting uses a more readable layout. Please 
discard the previous posting. We apologise for any inconvenience this 
may have caused you.


Our apologies if you have received multiple copies of this announcement.

*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************
We are happy to announce that new Text and Speech Language Resources are 
now available in our catalogue.
To view all the Language Resources available, you can visit our on-line 
catalogue: http://catalog.elda.org/index.php?language=en

*** L0067 English lexicon with morphological information ***
This English lexicon is made up of 174,000 inflected forms corresponding 
to 68,000 simple word lemmas (including 31,900 nouns, 11,800 verbs, 
19,900 adjectives, 4,100 adverbs, 300 pronouns, articles, 
prepositions/postpositions and conjunctions). Each line in the resource 
file shows an inflected form, its part of speech, its related lemma and 
its morphological information.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de 


*** L0068 French lexicon with morphological information ***
This French lexicon is made up of 424,000 inflected forms corresponding 
to 55,000 simple word lemmas (including 34,400 nouns, 7,300 verbs, 
11,700 adjectives, 1,400 adverbs, 200 pronouns, articles, 
prepositions/postpositions and conjunctions). Each line in the resource 
file shows an inflected form, its part of speech, its related lemma and 
its morphological information.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de 


*** L0069 Italian lexicon with morphological information ***
This Italian lexicon is made up of 862,500 inflected forms corresponding 
to 112,000 simple word lemmas (including 66,340 nouns, 12,030 verbs, 
28,080 adjectives, 4,890 adverbs, 660 pronouns, articles, 
prepositions/postpositions and conjunctions). Each line in the resource 
file shows an inflected form, its part of speech, its related lemma and 
its morphological information.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de 


*** L0070 Italian lexicon with morphological information and clitic 
verbs ***
This Italian lexicon is the same as the one described in ELRA-L0069, but 
with the addition of clitic verbs, which increases the number of 
inflected forms to 1,800,000 (still corresponding to 112,000 simple 
word lemmas). It contains 66,340 nouns, 12,030 verbs, 28,080 
adjectives, 4,890 adverbs, 660 pronouns, articles, 
prepositions/postpositions and conjunctions. Each line in the resource 
file shows an inflected form, its part of speech, its related lemma and 
its morphological information.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de 


*** L0071 Spanish lexicon with morphological information ***
This Spanish lexicon is made up of 816,000 inflected forms corresponding 
to 104,000 simple word lemmas (including 52,000 nouns, 9,800 verbs, 
21,200 adjectives, 20,500 adverbs, 500 pronouns, articles, 
prepositions/postpositions and conjunctions). Each line in the resource 
file shows an inflected form, its part of speech, its related lemma and 
its morphological information.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de 
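
The five lexica above share one line layout: an inflected form, its part 
of speech, its related lemma and its morphological information. As a 
rough sketch only, the Python below reads such lines into records. The 
tab separator, the field order, the tag values and the sample lines are 
all assumptions made for illustration; the actual file format is 
documented with each resource and may differ.

```python
from typing import Iterable, Iterator, NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon line: the four fields named in the announcement."""
    form: str   # inflected form
    pos: str    # part of speech
    lemma: str  # related lemma
    morph: str  # morphological information


def parse_lexicon_lines(lines: Iterable[str], sep: str = "\t") -> Iterator[LexiconEntry]:
    """Yield one LexiconEntry per non-empty line.

    ASSUMPTION: fields are separated by `sep` (tab by default) and appear
    in the order form, part of speech, lemma, morphology. Neither detail
    is stated in the announcement.
    """
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        fields = line.split(sep, 3)
        if len(fields) != 4:
            raise ValueError(f"line {number}: expected 4 fields, got {len(fields)}")
        yield LexiconEntry(*(field.strip() for field in fields))


# Invented sample lines, not taken from any ELRA resource.
sample = [
    "walks\tverb\twalk\tpresent 3rd singular\n",
    "\n",
    "mice\tnoun\tmouse\tplural\n",
]
entries = list(parse_lexicon_lines(sample))
lemmas = sorted({entry.lemma for entry in entries})
```

With records of this shape, counting distinct lemmas or grouping 
inflected forms by lemma takes only a few more lines.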


*** S0217 BITS Logatome Synthesis Corpus - BITS-LG ***
This corpus contains 11,036 recordings of logatomes spoken by 4 
professional German speakers, covering all German diphone combinations as 
well as the most prominent German-French-English combinations. Each 
logatome was recorded in three channels: close microphone, large 
membrane microphone and laryngographic signal. All diphones are 
segmented and labelled into phonemic units.
For more information, see 
http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de 


For more information on the catalogue, please contact Valérie Mapelli 
mailto:mapelli at elda.org