ELRA - Language Resources Catalogue - Update

ELRA ELDA Information info at ELDA.ORG
Wed Oct 1 13:40:59 UTC 2014


Our apologies if you have received multiple copies of this announcement.

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

We are happy to announce that 1 new Speech Resource and 3 new Written 
Corpora are now available in our catalogue.

*ELRA-S0371 PortMedia French and Italian corpus*
This corpus contains 700 transcribed dialogues from about 140 French 
speakers and 604 transcribed dialogues from about 150 Italian speakers 
(several dialogues per speaker). The method chosen for the corpus 
construction process is that of a 'Wizard of Oz' (WoZ) system. This 
consists of simulating a natural language man-machine dialogue. The 
scenario was built in the domain of touristic information and 
reservation. A manual transcription and semantic annotation of the 
corpus are provided with corresponding wave files.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1224&language=en

*ELRA-W0078 NE3L named entities Arabic corpus*
The Arabic corpus contains 103,363 words coming from articles extracted 
from "Le Monde Diplomatique" newspaper, and published in 2004. 2 named 
entity categories were taken into account: Time and Amount.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1226 
<http://catalog.elra.info/product_info.php?products_id=1226&language=en>&language=en 
<http://catalog.elra.info/product_info.php?products_id=1226&language=en>

*ELRA-W0079 NE3L named entities Chinese corpus*
The Chinese corpus contains 79,302 words coming from articles extracted 
from "Le Monde Diplomatique" newspaper, and published in 2001. 3 named 
entity categories were taken into account: Person, Place and Organisation.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1227 
<http://catalog.elra.info/product_info.php?products_id=1227&language=en>&language=en 
<http://catalog.elra.info/product_info.php?products_id=1227&language=en>

*ELRA-W0080 NE3L named entities Russian corpus*
The Russian corpus contains 75,784 words coming from articles extracted 
from "Izvestia" newspaper, and published in 1995. 2 named entity 
categories were taken into account: Time and Amount.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1228 
<http://catalog.elra.info/product_info.php?products_id=1228&language=en>&language=en 
<http://catalog.elra.info/product_info.php?products_id=1228&language=en>


For more information on the catalogue, please contact Valérie Mapelli 
mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: 
http://www.elra.info/LRs-Announcements.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/hpsg-l/attachments/20141001/3461a87a/attachment.htm>
-------------- next part --------------



More information about the HPSG-L mailing list