Ressources: ELRA - Language Resources Catalogue - Update

Thierry Hamon hamon at LIMSI.FR
Wed Oct 1 20:26:18 UTC 2014


Date: Wed, 01 Oct 2014 15:40:59 +0200
From: ELRA ELDA Information <info at elda.org>
Message-ID: <542C046B.9030108 at elda.org>
X-url: http://catalog.elra.info/product_info.php?products_id=1224&language=en
X-url: http://catalog.elra.info/product_info.php?products_id=1226&language=en
X-url: http://catalog.elra.info/product_info.php?products_id=1227&language=en
X-url: http://catalog.elra.info/product_info.php?products_id=1228&language=en


Our apologies if you have received multiple copies of this announcement.

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

We are happy to announce that 1 new Speech Resource and 3 new Written 
Corpora are now available in our catalogue.

*ELRA-S0371 PortMedia French and Italian corpus*
This corpus contains 700 transcribed dialogues from about 140 French
speakers and 604 transcribed dialogues from about 150 Italian speakers
(several dialogues per speaker). The method chosen for the corpus
construction process is that of a 'Wizard of Oz' (WoZ) system. This
consists of simulating a natural language man-machine dialogue. The
scenario was built in the domain of touristic information and
reservation. A manual transcription and semantic annotation of the
corpus are provided with corresponding wave files.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1224&language=en

*ELRA-W0078 NE3L named entities Arabic corpus*
The Arabic corpus contains 103,363 words coming from articles extracted
from "Le Monde Diplomatique" newspaper, and published in 2004. 2 named
entity categories were taken into account: Time and Amount.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1226&language=en

*ELRA-W0079 NE3L named entities Chinese corpus*
The Chinese corpus contains 79,302 words coming from articles extracted
from "Le Monde Diplomatique" newspaper, and published in 2001. 3 named
entity categories were taken into account: Person, Place and
Organisation.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1227&language=en 

*ELRA-W0080 NE3L named entities Russian corpus*
The Russian corpus contains 75,784 words coming from articles extracted
from "Izvestia" newspaper, and published in 1995. 2 named entity
categories were taken into account: Time and Amount.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1228&language=en 

For more information on the catalogue, please contact Valérie Mapelli 
(mapelli at elda.org)

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: 
http://www.elra.info/LRs-Announcements.html

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/

ATALA décline toute responsabilité concernant le contenu des
messages diffusés sur la liste LN
-------------------------------------------------------------------------



More information about the Ln mailing list