Ressources: ELRA - Language Resources Catalogue - Update

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Sat Feb 16 20:01:08 UTC 2013


Date: Thu, 14 Feb 2013 10:45:27 +0100
From: ELRA ELDA Information <info at elda.org>
Message-ID: <511CB237.1020004 at elda.org>
X-url: http://catalog.elra.info/product_info.php?products_id=1194&language=en
X-url: http://catalog.elra.info/product_info.php?products_id=1195&language=en

Our apologies if you have received multiple copies of this announcement.

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

ELRA is happy to announce that QUAERO Structured Named Entity Language
Resources are now available in its catalogue.
A Written Corpus and a Broadcast Resource annotated with Structured
Named Entities from the QUAERO Programme are now being released (free
for academic research):

*ELRA-W0073 Quaero Old Press Extended Named Entity corpus*
This corpus consists of the manual annotation of 76 newspaper issues 
published in 1890-1891 and provided by the French National Library 
(Bibliothèque Nationale de France). Three different titles are used (Le 
Temps, La Croix and Le Figaro) for a total of 295 pages. The corpus is 
fully manually annotated according to the Quaero extended and structured 
named entity definition.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1194&language=en

*ELRA-S0349 Quaero Broadcast News Extended Named Entity corpus*
This corpus consists of the manual annotation of (i) the ESTER 2 (see 
also ELRA-S0338) manual transcription corpus and (ii) the Quaero Speech 
Recognition Evaluation corpus (manual and automatic transcriptions 
coming from 3 different ASR systems). The corpus is fully manually 
annotated according to the Quaero extended and structured named entity 
definition.
For more information, see: 
http://catalog.elra.info/product_info.php?products_id=1195&language=en

These two corpora are described in :
S. Rosset, C. Grouin, K. Fort, O. Galibert, J. Kahn, P. Zweigenbaum. 
Structured Named Entities in two distinct press corpora: Contemporary 
Broadcast News and Old Newspapers. In Proc. of LAW VI, 2012.

QUAERO is a research and innovation program adressing automatic
processing of multimedia and multilingual content aiming at the
development of new tools for navigating in large volumes of text and
audiovisual content. The research and development undertaken covers
automatic information retrieval, analysis, segmentation and
classification of text, speech, music, image and video. The program,
supported by OSEO, gathers 32 French and German partners -- large
groups, small and medium size enterprises, research laboratories and
public organizations.  The program consists of a number of application
projects aiming at industrial targets and markets that are supported by
a common shared research structure. Real world data sets (corpora) are
used to define the evaluation tasks and to conduct the research
challenges between partners. The use of systematic periodic technology
evaluation allows to assess progress made and to select the most
promising technical and scientific approaches. After nearly five years
of existence, Quaero is a very active eco-system that has produced in
excess of 700 scientific publications, more than 25 awards, numerous top
3 rankings in technology evaluation campaigns, 31 patent applications
and many innovative prototypes.

To find out more about QUAERO, please visit the following website: 
http://www.quaero.org

For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: 
http://www.elra.info/LRs-Announcements.html

-------------------------------------------------------------------------
Message diffuse par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.atala.org/article.php3?id_article=48
English version       : 
Archives                 : http://listserv.linguistlist.org/archives/ln.html
                                http://liste.cines.fr/info/ln

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  : http://www.atala.org/
-------------------------------------------------------------------------



More information about the Ln mailing list