[Corpora-List] ELRA - Language Resources Catalogue - Update

ELDA info at elda.org
Wed Feb 21 17:52:43 UTC 2007


Our apologies if you have received multiple copies of this announcement.

*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************

ELRA is happy to announce that 4 new Speech Resources are now available 
in its catalogue.

*ELRA-S0157 NetDC Arabic BNSC (Broadcast News Speech Corpus)
*The NetDC Arabic BNSC (Broadcast News Speech Corpus) is a corpus 
developed by ELDA in the framework of the European-funded project 
Network of Data Centres (NetDC). The project was done in collaboration 
with the LDC (Linguistic Data Consortium), which has produced a similar 
corpus from the news broadcasted by Voice of America Arabic in the 
United States. The database contains ca. 22.5 hours of broadcast news 
speech recorded from Radio Orient (France) during a 3-month period.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=13&language=en 
<http://catalog.elra.info/product_info.php?products_id=13&language=en>

*ELRA-S0232 Swiss-German Speecon database
*The Swiss-German Speecon database comprises the recordings of 550 adult 
Swiss-German speakers and 50 child Swiss-German speakers who uttered 
respectively over 290 items and 210 items (read and spontaneous).
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=982&language=en 
<http://catalog.elra.info/product_info.php?products_id=982&language=en>

*ELRA-S0233 US English Speecon database
*The US English Speecon database comprises the recordings of 550 adult 
Swiss-German speakers and 50 child Swiss-German speakers who uttered 
respectively over 290 items and 210 items (read and spontaneous).
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=983&language=en 
<http://catalog.elra.info/product_info.php?products_id=983&language=en>

*ELRA-S0234 SALA Spanish Chilean Database
*The SALA Spanish Chilean Database comprises 1,024 Chilean speakers (477 
males, 547 females) recorded over the Chilean fixed telephone network.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=981&language=en 
<http://catalog.elra.info/product_info.php?products_id=981&language=en>


Moreover, the contents of the following two LC-STAR phonetic lexica was 
updated:

*ELRA-S0207 LC-STAR Catalan phonetic lexicon
*The LC-STAR Catalan phonetic lexicon comprises more than 100,000 words, 
including a set of more than 45,000 common words and a set of more than 
45,000 proper names (including person names, family names, cities, 
streets, companies and brand names) with phonetic transcriptions in 
SAMPA. The lexicon is provided in XML format.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=832&language=en 
<http://catalog.elra.info/product_info.php?products_id=832&language=en>

*ELRA-S0208 LC-STAR Spanish phonetic lexicon
*The LC-STAR Spanish phonetic lexicon comprises more than 100,000 words, 
including a set of more than 45,000 common words and a set of more than 
45,000 proper names (including person names, family names, cities, 
streets, companies and brand names) with phonetic transcriptions in 
SAMPA. The lexicon is provided in XML format.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=833&language=en 
<http://catalog.elra.info/product_info.php?products_id=833&language=en> 


For more information on the catalogue, please contact Valérie Mapelli 
mailto:mapelli at elda.org

Our on-line catalogue has moved to the following address: 
http://catalog.elra.info. Please update your bookmarks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20070221/5d91890b/attachment.htm>


More information about the Corpora mailing list