[Corpora-List] Compilation of language resources for French - ELRA
Khalid CHOUKRI
choukri at elda.org
Tue Apr 12 16:10:32 UTC 2011
Dear Yannick,
Yannick Versley wrote, On 12/04/2011 12:56:
> Dear Valérie,
>
> The "free" price point is interesting especially for masters students who
> have the desire to do actual research, but may have to provide the
> materials out of their own pocket. For these people, a price of 200-500EUR
> is definitely out of reach (some would balk at 50EUR, which I'd understand
> if you need to combine data from multiple resources to carry out your
> research), and labeling these "at media cost" will not change this.
>
It is also the role of ELRA (and LDC) to make sure that researchers have
access to the resources they need to carry out their research.
So, students who are prevented from doing so (because of the resource
cost) should talk to us, and I am sure we will find an efficient way to
deal with that.
> I do think that LDC and ELRA play an important role in the ecosystem
> around language resources, but I am also sure that, to ensure the widest
> possible use of a resource in academic research, the most effective way is
> to make it available free of cost and under a liberal license, as has been
> done with the Lefff. I understand that this is not always possible, but I
> applaud the people behind Lefff (and similar resources) for making it a
> possibility.
Many of us, including ELRA, are working toward this (as an example, you
may want to look at the work being carried out in the EC project
META-NET (www.meta-net.eu), which pursues this aim).
Unfortunately, many resources will remain governed by "market"
conditions, despite all our efforts.
Best regards
Khalid
P.S. Resources that are donated to ELRA for free (no royalties to be
paid back to the owners) will be distributed to the community free of
charge.
>
> Best wishes,
> Yannick Versley
>
> On Tue, Apr 12, 2011 at 12:08 PM, Valérie Mapelli <mapelli at elda.org> wrote:
>
> Dear Corpora readers,
>
> Recently, Ineta Sejane circulated a message listing a number of
> French language resources.
> With the aim to contribute to the enrichment of this list, we
> identified some language resources, with a French component,
> available in the ELRA Catalogue, which are either free or at media
> cost for research purposes. These are distributed as follows:
>
> *Written Corpora:*
> W0003 CRATER corpus
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=84>
> W0004 ECI/MCI (European Corpus Initiative/Multilingual Corpus I)
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=85>
> W0013 TSNLP (Test Suites for NLP Testing)
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=51>
> W0015 Text corpus of "Le Monde"
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=438>
> W0017 MULTEXT JOC Corpus
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=534>
> W0018 ARCADE/ROMANSEVAL corpus
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=535>
> W0023 MLCC Multilingual and Parallel Corpora
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=764>
> W0025-01 A "scientific" corpus of modern French ("La Recherche"
> magazine) - Raw data
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=594>
> W0025-02 A "scientific" corpus of modern French ("La Recherche"
> magazine) - Complete version
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=595>
> W0032 Modern French Corpus including Anaphors Tagging
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=634>
> W0033 CRATER 2 Corpus
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=636>
> W0036-01 "Le Monde Diplomatique" Text corpus in French -
> archives 1980-1998
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=7>
> W0036-02 "Le Monde Diplomatique" Text corpus in French -
> archives from 1999
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=9>
>
> *Lexicons:*
> L0010 MULTEXT Lexicons
> <http://catalog.elra.info/product_info.php?products_id=29>
> M0020 EuroWordNet French
> <http://catalog.elra.info/product_info.php?products_id=550>
>
> *Speech LRs:*
> S0006 BREF-80
> <http://catalog.elra.info/product_info.php?products_id=36>
> S0007 BREF-POLYGLOT
> <http://catalog.elra.info/product_info.php?products_id=37>
> S0021 M2VTS Speaker Verification Database
> <http://catalog.elra.info/product_info.php?products_id=758>
> S0033 BDBRUIT
> <http://catalog.elra.info/product_info.php?products_id=80>
> S0060 MULTEXT Prosodic database
> <http://catalog.elra.info/product_info.php?products_id=530>
> S0088 Twin database - TWINDB1
> <http://catalog.elra.info/product_info.php?products_id=579>
> S0163 ILPho phonetic lexicon
> <http://catalog.elra.info/product_info.php?products_id=760>
> S0238 MIST Multi-lingual Interoperability in Speech Technology
> database <http://catalog.elra.info/product_info.php?products_id=988>
> S0241 ESTER Corpus
> <http://catalog.elra.info/product_info.php?products_id=999>
> S0305 EPAC Corpus: orthographic transcriptions
> <http://catalog.elra.info/product_info.php?products_id=1119>
>
> *Evaluation Packages:*
> E0008 The CLEF Test Suite for the CLEF 2000-2003 Campaigns -
> Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=888>
> E0018 ARCADE II Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=992>
> E0019 CESART Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=993>
> E0020 CESTA Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=994>
> E0021 ESTER Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=995>
> E0022 EQueR Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=996>
> E0023 EvaSy Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=997>
> E0024 MEDIA Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=998>
> E0034 EASy Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=1112>
> E0036 CLEF AdHoc-News Test Suites (2004-2008) - Evaluation
> Package <http://catalog.elra.info/product_info.php?products_id=1127>
> E0038 CLEF Question Answering Test Suites (2003-2008) -
> Evaluation Package
> <http://catalog.elra.info/product_info.php?products_id=1129>
> W0029 Amaryllis Corpus - Evaluation Package
> <http://catalog.elra.info/product_info.php?cPath=42_43&products_id=626>
>
> Other French language resources and many other languages are
> available both for research and commercial communities in our
> catalogue that you may visit at:
> http://catalogue.elra.info
>
> Best regards,
>
> Valérie Mapelli
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora