Corpora: Relatve text length
Jean Veronis
Jean.Veronis at newsup.univ-mrs.fr
Wed Apr 24 17:39:41 UTC 2002
French is slightly longer that English. Ratio in words varies from 1.08 to
1.16, measured on 6 different text types.
See details in:
Véronis, J. & Langlais, Ph. (2000). Evaluation of parallel text alignment
systems: the ARCADE project. In J. Véronis (Ed.), Parallel text processing:
Alignment and use of translation corpora (pp. 369-388). Dordrecht: Kluwer
Academic Publishers.
[Table 1, p. 374]
http://www.up.univ-mrs.fr/veronis/parallel-book.html
More information about the Corpora
mailing list