[Corpora-List] ERRATUM: ELRA - Language Resources Catalogue - Update
ELDA
info at elda.org
Tue Jun 20 12:54:57 UTC 2006
ERRATUM: This announcement was posted earlier today with a faulty
layout. This posting corrects the layout. Please discard the previous
posting. We apologise for any inconvenience this may have caused you.
Our apologies if you have received multiple copies of this announcement.
*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************
We are happy to announce that new Text and Speech Language Resources are
now available in our catalogue.
To view all the Language Resources available, you can visit our on-line
catalogue: http://catalog.elda.org/index.php?language=en
*** L0067 English lexicon with morphological information ***
This English lexicon is made up of 174,000 inflected forms corresponding
to 68,000 simple word lemmas (including 31,900 nouns, 11,800 verbs,
19,900 adjectives, 4,100 adverbs, 300 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de
*** L0068 French lexicon with morphological information ***
This French lexicon is made up of 424,000 inflected forms corresponding
to 55,000 simple word lemmas (including 34,400 nouns, 7,300 verbs,
11,700 adjectives, 1,400 adverbs, 200 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de
*** L0069 Italian lexicon with morphological information ***
This Italian lexicon is made up of 862,500 inflected forms corresponding
to 112,000 simple word lemmas (including 66,340 nouns, 12,030 verbs,
28,080 adjectives, 4,890 adverbs, 660 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de
*** L0070 Italian lexicon with morphological information and clitic
verbs ***
This Italian lexicon is the same as the one described in ELRA-L0069, but
with the addition of clitic verbs, which increases the number of
inflected forms to 1,800,000 (still corresponding to 112,000 simple
word lemmas). It contains 66,340 nouns, 12,030 verbs, 28,080
adjectives, 4,890 adverbs, 660 pronouns, articles,
prepositions/postpositions and conjunctions. Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de
*** L0071 Spanish lexicon with morphological information ***
This Spanish lexicon is made up of 816,000 inflected forms corresponding
to 104,000 simple word lemmas (including 52,000 nouns, 9,800 verbs,
21,200 adjectives, 20,500 adverbs, 500 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de
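The lexicons listed above share one record layout: each line gives an
inflected form, its part of speech, its related lemma and its
morphological information. As a rough illustration only, the Python
sketch below reads such lines into four fields and groups the inflected
forms by lemma. The tab separator, the field order, the sample lines and
the names LexiconEntry, parse_line and group_by_lemma are all
assumptions made for this example; this announcement does not specify
the delimiter or encoding, so please check the documentation supplied
with each resource.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon line (hypothetical field names and order)."""
    form: str    # inflected form
    pos: str     # part of speech
    lemma: str   # related lemma
    morph: str   # morphological information


def parse_line(line: str, sep: str = "\t") -> LexiconEntry:
    """Split one line into its four fields.

    The separator is an assumption; the real files may use another one.
    """
    fields = line.rstrip("\n").split(sep)
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {line!r}")
    return LexiconEntry(*fields)


def group_by_lemma(lines: Iterable[str], sep: str = "\t") -> Dict[str, List[str]]:
    """Map each lemma to the list of inflected forms found for it."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        entry = parse_line(line, sep)
        groups[entry.lemma].append(entry.form)
    return dict(groups)


# Made-up sample lines, not taken from the actual resources.
sample = [
    "walked\tverb\twalk\tpast\n",
    "walks\tverb\twalk\tpresent 3sg\n",
    "houses\tnoun\thouse\tplural\n",
]
forms_by_lemma = group_by_lemma(sample)
```

With real data, the same function could be applied to an open file
object instead of the sample list, once the actual separator is known.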
*** S0217 BITS Logatome Synthesis Corpus BITS-LG ***
This corpus contains 11,036 recordings of logatomes spoken by 4
professional German speakers covering all German diphone combinations as
well as the most prominent combination German - French - English. Each
logatome was recorded in three channels: close microphone, large
membrane microphone and laryngographic signal. All diphones are
segmented and labelled into phonemic units.
For more information, see
http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de
For more information on the catalogue, please contact Valérie Mapelli
(mapelli at elda.org).