[Corpora-List] Parallel corpora that are directly searchable on the web?

Michal Kren kren at trnka.ff.cuni.cz
Tue Nov 9 13:30:51 UTC 2010


You can also use InterCorp, which is being built at the Institute of the
Czech National Corpus. InterCorp currently covers 20 languages; most
of the texts are semi-automatically aligned fiction (i.e. no EU documents
or technical manuals). The current size is 44 million words, and a large
addition is underway. The data are lemmatised and morphosyntactically
annotated where available, which is the case for about half of the
languages.

More info can be found at http://www.korpus.cz/english/intercorp-info.php

Prior registration is required, but it is free of charge:
http://www.korpus.cz/english/prohlaseni-aj.php

The access point for registered users is http://www.korpus.cz/Park/

Regards

Michal Kren


_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora