[Corpora-List] The ARTFL Project profiled
Angus B. Grieve-Smith
grvsmth at panix.com
Mon Mar 8 13:51:28 UTC 2010
On pages 10-14 of /Tableau/, there is a profile of the ARTFL project
at the University of Chicago, which began as a collaboration with the
French CNRS to clean up the FRANTEXT corpus, and has since been extended
to other corpora.
http://humanities.uchicago.edu/tableau/issues/fall_2009.pdf
The FRANTEXT corpus of canonical French works was compiled in the
1960s for the /Trésor de la langue française/ dictionary, entered from
first editions on punch cards and paper tape. The ARTFL website offers
several tools for searching the corpus.
http://artfl-project.uchicago.edu/
Most of the cleaned-up text files are available for download from
Gallica, as are scanned images of many of the documents.
http://gallica.bnf.fr/
I worked as a student employee at ARTFL in 1993 and 1994; for some
reason I particularly remember cleaning up Nerval's translation of
/Faust/, and Sainte-Beuve's history of Port-Royal. Since then I have
used the FRANTEXT corpus for several projects, including my
dissertation. I highly recommend it to anyone interested in written
French up to 1950, with the caveat that it is strongly biased towards
the canon.
--
-Angus B. Grieve-Smith
grvsmth at panix.com