Corpora: Relative text length

Jean Veronis Jean.Veronis at newsup.univ-mrs.fr
Wed Apr 24 17:39:41 UTC 2002


French is slightly longer than English. The French-to-English ratio in words 
varies from 1.08 to 1.16, measured on 6 different text types.
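
For anyone wanting to check the ratio on their own aligned texts, here is a 
minimal sketch. It assumes plain whitespace tokenization, which may not match 
the word-counting method used for the published figures; the function name 
word_ratio and the sentence pair are made up for illustration and are not 
taken from the ARCADE data.

```python
def word_ratio(french_text: str, english_text: str) -> float:
    """Return the French word count divided by the English word count.

    Words are counted by splitting on whitespace (an assumption; the
    published figures may rely on a different tokenization).
    """
    fr_words = len(french_text.split())
    en_words = len(english_text.split())
    if en_words == 0:
        raise ValueError("English text contains no words")
    return fr_words / en_words


# Toy sentence pair, invented for illustration only.
french = "Le chat est assis sur le tapis de la cuisine"
english = "The cat is sitting on the kitchen mat"

ratio = word_ratio(french, english)
print(f"French/English word ratio: {ratio:.2f}")
```

A single sentence pair can fall well outside the 1.08 to 1.16 range; the ratio 
only becomes meaningful over a whole text or corpus.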

See details in:

Véronis, J. & Langlais, Ph. (2000). Evaluation of parallel text alignment 
systems: the ARCADE project. In J. Véronis (Ed.), Parallel text processing: 
Alignment and use of translation corpora (pp. 369-388). Dordrecht: Kluwer 
Academic Publishers.

[Table 1, p. 374]


http://www.up.univ-mrs.fr/veronis/parallel-book.html


