Corpora: size of French corpora in words

Jean Veronis Jean.Veronis at newsup.univ-mrs.fr
Wed Apr 24 14:06:26 UTC 2002


At 08:36 24/04/2002 -0300, Tony Berber Sardinha wrote:

>Dear colleagues
>
>Does anyone know the size of the Tresor de la Langue Française and of Frantext
>in running words? I found out how many texts there are in Frantext, roughly.
>I need this information for a text I'm writing.

Frantext:

* ca. 3500 texts from 16th to 20th century
* ca. 190 million tokens
* 80% literature 20% science & technology

--jv



More information about the Corpora mailing list