[Corpora-List] The ARTFL Project profiled

Angus B. Grieve-Smith grvsmth at panix.com
Mon Mar 8 13:51:28 UTC 2010


    On pages 10-14 of /Tableau/, there is a profile of the ARTFL Project 
at the University of Chicago.  The project began as a collaboration with 
the French CNRS to clean up the FRANTEXT corpus, and has since been 
extended to other corpora.

http://humanities.uchicago.edu/tableau/issues/fall_2009.pdf

    The FRANTEXT corpus of canonical French works was compiled in the 
1960s for the /Trésor de la langue française/ dictionary; the texts were 
taken from first editions and stored on punch cards and paper tape.  The 
ARTFL website offers several tools for searching the corpus.

http://artfl-project.uchicago.edu/

    Most of the cleaned-up text files are available for download from 
Gallica, as are scanned images of many of the documents.

http://gallica.bnf.fr/

    I worked as a student employee at ARTFL in 1993 and 1994; for some 
reason I particularly remember cleaning up Nerval's translation of 
/Faust/, and Sainte-Beuve's history of Port-Royal.  Since then I have 
used the FRANTEXT corpus for several projects, including my 
dissertation.  I highly recommend it to anyone interested in written 
French up to 1950, with the caveat that it is strongly biased towards 
the canon.

-- 
				-Angus B. Grieve-Smith
				grvsmth at panix.com

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

