[Corpora-List] Parallel corpora that are directly searchable on the web?
Michal Kren
kren at trnka.ff.cuni.cz
Tue Nov 9 13:30:51 UTC 2010
You can also use InterCorp that is being built at the Institute of the
Czech National Corpus. The InterCorp currently covers 20 languages, most
of the texts are semi-automatically aligned fiction (i.e. no EU documents
or technical manuals). The current size is 44 million words, and a large
addition is underway. The data are lemmatised and morphosyntactically
annotated where available, which means about a half of the languages.
More info can be found at http://www.korpus.cz/english/intercorp-info.php
Prior registration is required, but it is free of charge:
http://www.korpus.cz/english/prohlaseni-aj.php
The access point for registered users is http://www.korpus.cz/Park/
Regards
Michal Kren
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list