[Corpora-List] about French corpus & tools

cyrille cyrilleml at kbuilder.net
Tue Apr 22 07:30:41 UTC 2003


cyrille a écrit:

> Hello,
>
> Perhaps I'm a little bit out of subject in this list, but I would like
> to ask to specialized people if they now free corpus & tools to work
> on French language.
> For my brain, I would like to write some tools, but not make years of
> studies ;o)
>
> So, I'm waiting for all links you can know or remember about French
> language extraction, interpretation & manipulation tools.
>
> Thanks

Thanks again for all your answers.

I made a html page with all of them. I did not yet make a summary, but
only a copy and paste ;o{
Feel free to help me for make it better !

    http://ktaland.com/french-language/french-tools.html

regards,
Cyrille

here it is :


  [en] Tools for extracting, interpreting andmanipulating French Language.
  [fr] Outils pour extraire, interpréter et manipuler la langue française.

[en] This page contains answers for da following question posted on
Corpora-List <CORPORA at HIT.UIB.NO> :
[fr] Cette page contient les réponses données à la question suivante
posée sur Corpora-List <CORPORA at HIT.UIB.NO> :

    Hello,

    Perhaps I'm a little bit out of subject in this list, but I would
    like to ask to specialized people if they now free corpus & tools to
    work on French language.
    For my brain, I would like to write some tools, but not make years
    of studies ;o)

    So, I'm waiting for all links you can know or remember about French
    language extraction, interpretation & manipulation tools.

    Thanks,
    Cyrille Giquello

[en] I've dropped senders email, because they have answered to me on my
private email, not on da list,
so perhaps they does not want to see their name and email on a public place.

[en] Please, fill free to complete this page by mailing to me at
<cyrille at ktaland.com>.
[fr] S'il vous plait, n'hésitez pas à compléter ou corriger cette page
en me contactant à <cyrille at ktaland.com>.

------------------------------------------------------------------------

You may find free tools to work on french (as well as on other laguages)
in the following URLs:

INTEX: http://www.nyu.edu/pages/linguistics/intex
UNITEX: http://www-igm.univ-mlv.fr/~unitex/
GLOSSANET: http://glossa.fltr.ucl.ac.be/ (search engine on online
newspapers based on Unitex)

------------------------------------------------------------------------

one quite well known tool is Intex, which runs under Windows, see
http://www.nyu.edu/pages/linguistics/intex/ for more information and
download.

There exists a reimplementation now that runs under Linux as well as under
Windows and that uses Unicode. Its name is Unitex, for more information and
for download see http://www-igm.univ-mlv.fr/~unitex/.

------------------------------------------------------------------------

BTW: http://wall.jussieu.fr/dyn/Context2
will provide you with some French and English corpora, parallel or
comparable. If you need more info, you can write me directly (et
peut-être en français?

------------------------------------------------------------------------

Here you can get the Brill tagger trained for french and some other tools :
http://jupiter.inalf.cnrs.fr/cgi-bin/mep.exe?HTML=mep_winbrill.txt

Fastr is a parser for term and variant recognition. Fastr take as input a
corpus and a list of terms and ouputs the indexed corpus in which terms
and variants are recognize. You will find it at on the Home page of
Chritian Jacquemin somewhere on the site :
http://www.limsi.fr/Recherche/AMI/PageAMI.html

The Jean Veronis Page :
http://www.up.univ-mrs.fr/~veronis/ with tools and data.

------------------------------------------------------------------------

hyperonyms and hyponyms : EuroWordNet for French. EuroWordNet (not free
for languages other than Spanish):
http://www.illc.uva.nl/EuroWordNet/

------------------------------------------------------------------------



More information about the Corpora mailing list