[Corpora-List] Compilation of language resources for French - ELRA

Khalid CHOUKRI choukri at elda.org
Tue Apr 12 16:10:32 UTC 2011


Dear Yannick,

Yannick Versley wrote, On 12/04/2011 12:56:
> Dear Valérie,
>
> The "free" price point is interesting especially for masters students who
> have the desire to do actual research, but may have to provide the materials
> out of their own pocket. For these people, a price of 200-500EUR is
> definitely out of reach (some would balk at 50EUR, which I'd understand if
> you need to combine data from multiple resources to carry out your
> research), and labeling these "at media cost" will not change this.
>
It is also the role of ELRA (and LDC) to make sure that researchers have
access to the resources they need to carry out their research.
Students who are prevented from doing so because of the resource cost
should talk to us, and I am sure we will find an efficient way to deal
with that.

> I do think that LDC and ELRA play an important role in the ecosystem around
> language resources, but I am also sure that, to ensure the widest possible
> use of a resource in academic research, the most effective way is to make it
> available free of cost and under a liberal license, as has been done with
> the Lefff. I understand that this is not always possible, but I applaud the
> people behind Lefff (and similar resources) for making it a possibility.
Many of us, including ELRA, are working toward this. As an example, you
may want to look at the work being carried out in the EC project
META-NET (www.meta-net.eu), which aims at this goal.
Unfortunately, many resources will remain governed by "market"
conditions, despite all our efforts.

Best regards
Khalid
P.S. Resources that are donated to ELRA for free (no royalties to be
paid back to the owners) will be distributed to the community free of
charge.



>
> Best wishes,
> Yannick Versley
>
> On Tue, Apr 12, 2011 at 12:08 PM, Valérie Mapelli <mapelli at elda.org> wrote:
>
>      Dear Corpora readers,
>
>     Recently, Ineta Sejane circulated a message listing a number of
>     French language resources.
>     With the aim to contribute to the enrichment of this list, we
>     identified some language resources, with a French component,
>     available in the ELRA Catalogue, which are either free or at media
>     cost for research purposes. These are distributed as follows:
>
>     *Written Corpora:*
>     W0003    CRATER corpus
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=84>
>     W0004    ECI/MCI (European Corpus Initiative/Multilingual Corpus
>     I)
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=85>
>     W0013    TSNLP (Test Suites for NLP Testing)
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=51>
>     W0015    Text corpus of "Le Monde"
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=438>
>     W0017    MULTEXT JOC Corpus
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=534>
>     W0018    ARCADE/ROMANSEVAL corpus
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=535>
>     W0023    MLCC Multilingual and Parallel Corpora
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=764>
>     W0025-01    A "scientific" corpus of modern French ("La Recherche"
>     magazine) - Raw data
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=594>
>     W0025-02    A "scientific" corpus of modern French ("La Recherche"
>     magazine) - Complete version
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=595>
>     W0032    Modern French Corpus including Anaphors Tagging
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=634>
>     W0033    CRATER 2 Corpus
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=636>
>     W0036-01    "Le Monde Diplomatique" Text corpus in French -
>     archives 1980-1998
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=7>
>     W0036-02    "Le Monde Diplomatique" Text corpus in French -
>     archives from 1999
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=9>
>
>     *Lexicons:*
>     L0010    MULTEXT Lexicons
>     <http://catalog.elra.info/product_info.php?products_id=29>
>     M0020    EuroWordNet French
>     <http://catalog.elra.info/product_info.php?products_id=550>
>
>     *Speech LRs:*
>     S0006    BREF-80
>     <http://catalog.elra.info/product_info.php?products_id=36>
>     S0007    BREF-POLYGLOT
>     <http://catalog.elra.info/product_info.php?products_id=37>
>     S0021    M2VTS Speaker Verification Database
>     <http://catalog.elra.info/product_info.php?products_id=758>
>     S0033    BDBRUIT
>     <http://catalog.elra.info/product_info.php?products_id=80>
>     S0060    MULTEXT Prosodic database
>     <http://catalog.elra.info/product_info.php?products_id=530>
>     S0088    Twin database - TWINDB1
>     <http://catalog.elra.info/product_info.php?products_id=579>
>     S0163    ILPho phonetic lexicon
>     <http://catalog.elra.info/product_info.php?products_id=760>
>     S0238    MIST Multi-lingual Interoperability in Speech Technology
>     database <http://catalog.elra.info/product_info.php?products_id=988>
>     S0241    ESTER Corpus
>     <http://catalog.elra.info/product_info.php?products_id=999>
>     S0305    EPAC Corpus: orthographic transcriptions
>     <http://catalog.elra.info/product_info.php?products_id=1119>
>
>     *Evaluation Packages:*
>     E0008    The CLEF Test Suite for the CLEF 2000-2003 Campaigns -
>     Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=888>
>     E0018    ARCADE II Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=992>
>     E0019    CESART Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=993>
>     E0020    CESTA Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=994>
>     E0021    ESTER Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=995>
>     E0022    EQueR Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=996>
>     E0023    EvaSy  Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=997>
>     E0024    MEDIA Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=998>
>     E0034    EASy Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=1112>
>     E0036    CLEF AdHoc-News Test Suites (2004-2008) - Evaluation
>     Package <http://catalog.elra.info/product_info.php?products_id=1127>
>     E0038    CLEF Question Answering Test Suites (2003-2008) -
>     Evaluation Package
>     <http://catalog.elra.info/product_info.php?products_id=1129>
>     W0029    Amaryllis Corpus - Evaluation Package
>     <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=626>
>
>     Other French language resources and many other languages are
>     available both for research and commercial communities in our
>     catalogue that you may visit at:
>     http://catalogue.elra.info
>
>     Best regards,
>
>     Valérie Mapelli
>
>
>     _______________________________________________
>     UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>     Corpora mailing list
>     Corpora at uib.no <mailto:Corpora at uib.no>
>     http://mailman.uib.no/listinfo/corpora
>
>
>