ELRA - Language Resources Catalogue - Update
ELRA ELDA Information
info at ELDA.ORG
Wed Oct 1 13:40:59 UTC 2014
Our apologies if you have received multiple copies of this announcement.
*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************
We are happy to announce that 1 new Speech Resource and 3 new Written
Corpora are now available in our catalogue.
*ELRA-S0371 PortMedia French and Italian corpus*
This corpus contains 700 transcribed dialogues from about 140 French
speakers and 604 transcribed dialogues from about 150 Italian speakers
(several dialogues per speaker). The method chosen for the corpus
construction process is that of a 'Wizard of Oz' (WoZ) system. This
consists of simulating a natural language man-machine dialogue. The
scenario was built in the domain of touristic information and
reservation. A manual transcription and semantic annotation of the
corpus are provided with corresponding wave files.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=1224&language=en
*ELRA-W0078 NE3L named entities Arabic corpus*
The Arabic corpus contains 103,363 words coming from articles extracted
from "Le Monde Diplomatique" newspaper, and published in 2004. 2 named
entity categories were taken into account: Time and Amount.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=1226
<http://catalog.elra.info/product_info.php?products_id=1226&language=en>&language=en
<http://catalog.elra.info/product_info.php?products_id=1226&language=en>
*ELRA-W0079 NE3L named entities Chinese corpus*
The Chinese corpus contains 79,302 words coming from articles extracted
from "Le Monde Diplomatique" newspaper, and published in 2001. 3 named
entity categories were taken into account: Person, Place and Organisation.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=1227
<http://catalog.elra.info/product_info.php?products_id=1227&language=en>&language=en
<http://catalog.elra.info/product_info.php?products_id=1227&language=en>
*ELRA-W0080 NE3L named entities Russian corpus*
The Russian corpus contains 75,784 words coming from articles extracted
from "Izvestia" newspaper, and published in 1995. 2 named entity
categories were taken into account: Time and Amount.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=1228
<http://catalog.elra.info/product_info.php?products_id=1228&language=en>&language=en
<http://catalog.elra.info/product_info.php?products_id=1228&language=en>
For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org
Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates:
http://www.elra.info/LRs-Announcements.html
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/hpsg-l/attachments/20141001/3461a87a/attachment.htm>
-------------- next part --------------
More information about the HPSG-L
mailing list