[Corpora-List] Free German Corpus or lexicon

Adrien Barbaresi adrien.barbaresi at ens-lyon.fr
Fri Jul 13 11:54:50 UTC 2012


Dear Sofia,

Here are several resources that may interest you :

1. Corpora

⋅ I released a (freely available) corpus of German political speeches :
http://purl.org/corpus/german-speeches

⋅ As I cannot republish what I have due to copyright issues, I recently
open-sourced a specialized corpus-building tool that enables you to
gather about 130.000 articles (i.e. more than 100 millions of tokens)
from the newspaper 'Die Zeit' :
http://code.google.com/p/zeitcrawler/


2. Word lists

⋅ DeReWo – corpus-based wordlists, a project of the IDS Mannheim :
http://www.ids-mannheim.de/kl/projekte/methoden/derewo.html



Best regards,

-- 
Adrien Barbaresi <adrien.barbaresi at ens-lyon.fr>
http://perso.ens-lyon.fr/adrien.barbaresi




_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list