[Corpora-List] quantities of publicly available parallel text?

Chris Dyer redpony at umd.edu
Wed Feb 27 02:50:15 UTC 2008


Dear colleagues,

Is anyone aware of attempts to estimate how much machine-readable
parallel text is publicly available?  I'm trying to get a general
sense of the scale of parallel data we currently have (and are likely
to have in the future, assuming current growth trends).  Does anyone
have any statistics on this sort of thing?

Many thanks--
Chris

------------------------
Chris Dyer
Dept. of Linguistics
University of Maryland

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list