[Corpora-List] ELRA - Language Resources Catalogue - Update
ELDA
info at elda.org
Wed Feb 21 17:52:43 UTC 2007
Our apologies if you have received multiple copies of this announcement.
*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************
ELRA is happy to announce that 4 new Speech Resources are now available
in its catalogue.
*ELRA-S0157 NetDC Arabic BNSC (Broadcast News Speech Corpus)
*The NetDC Arabic BNSC (Broadcast News Speech Corpus) is a corpus
developed by ELDA in the framework of the European-funded project
Network of Data Centres (NetDC). The project was done in collaboration
with the LDC (Linguistic Data Consortium), which has produced a similar
corpus from the news broadcasted by Voice of America Arabic in the
United States. The database contains ca. 22.5 hours of broadcast news
speech recorded from Radio Orient (France) during a 3-month period.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=13&language=en
<http://catalog.elra.info/product_info.php?products_id=13&language=en>
*ELRA-S0232 Swiss-German Speecon database
*The Swiss-German Speecon database comprises the recordings of 550 adult
Swiss-German speakers and 50 child Swiss-German speakers who uttered
respectively over 290 items and 210 items (read and spontaneous).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=982&language=en
<http://catalog.elra.info/product_info.php?products_id=982&language=en>
*ELRA-S0233 US English Speecon database
*The US English Speecon database comprises the recordings of 550 adult
Swiss-German speakers and 50 child Swiss-German speakers who uttered
respectively over 290 items and 210 items (read and spontaneous).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=983&language=en
<http://catalog.elra.info/product_info.php?products_id=983&language=en>
*ELRA-S0234 SALA Spanish Chilean Database
*The SALA Spanish Chilean Database comprises 1,024 Chilean speakers (477
males, 547 females) recorded over the Chilean fixed telephone network.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=981&language=en
<http://catalog.elra.info/product_info.php?products_id=981&language=en>
Moreover, the contents of the following two LC-STAR phonetic lexica was
updated:
*ELRA-S0207 LC-STAR Catalan phonetic lexicon
*The LC-STAR Catalan phonetic lexicon comprises more than 100,000 words,
including a set of more than 45,000 common words and a set of more than
45,000 proper names (including person names, family names, cities,
streets, companies and brand names) with phonetic transcriptions in
SAMPA. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=832&language=en
<http://catalog.elra.info/product_info.php?products_id=832&language=en>
*ELRA-S0208 LC-STAR Spanish phonetic lexicon
*The LC-STAR Spanish phonetic lexicon comprises more than 100,000 words,
including a set of more than 45,000 common words and a set of more than
45,000 proper names (including person names, family names, cities,
streets, companies and brand names) with phonetic transcriptions in
SAMPA. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=833&language=en
<http://catalog.elra.info/product_info.php?products_id=833&language=en>
For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org
Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20070221/5d91890b/attachment.htm>
More information about the Corpora
mailing list