[Corpora-List] 2nd release of the German Political Speeches Corpus
Adrien Barbaresi
adrien.barbaresi at ens-lyon.fr
Tue Mar 20 13:47:28 UTC 2012
Dear all,
I released the second version of the German Political Speeches Corpus
and Visualization on the occasion of the DGfS-CL Poster-Session in
Frankfurt, where I presented a poster.
The changes are detailed in this blog post :
http://perso.ens-lyon.fr/adrien.barbaresi/blog/?p=1045
A brief summary :
The resource consists of speeches by the last German Presidents and
Chancellors as well as a few ministers, all gathered from official
sources. It provides raw data, metadata and tokenized text with
part-of-speech tagging and lemmas in XML TEI format for researchers that
are able to use it and a simple visualization interface for those who
want to get a glimpse of what is in the corpus before downloading it or
thinking about using more complete tools.
The visualization output is in valid CSS/XHTML format, it takes
advantage of recent standards. The purpose is to give a sort of
Zeitgeist, an insight on the topics developed by a government official
and on the evolution in the use of general concepts.
This resource is freely available under a CC BY-SA licence :
http://purl.org/corpus/german-speeches
Regards,
--
Adrien Barbaresi <adrien.barbaresi at ens-lyon.fr>
http://perso.ens-lyon.fr/adrien.barbaresi
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list