[Corpora-List] Workshop Web-based Corpora and Lexicology

Roland Meyer roland.meyer at sprachlit.uni-regensburg.de
Wed Sep 13 15:13:19 UTC 2006


The internet is increasingly being used as a source of linguistic information, both as data for linguistics proper and for numerous applications in information science. The Department of Information Science/Media Informatics and the Department of Slavic Languages and Literatures at Regensburg University/Germany are pleased to announce a workshop on "Web-based Corpora and Lexicology", featuring presentations on the construction, annotation, and use of large, uncontrolled corpora, in order to provide a basis for a discussion of their potential benefits in various scientific domains.

Workshop "Web-based Corpora and Lexicology"

Regensburg University
Tuesday, September 19, 2006
HS 5, Central Lecture Hall
http://www.medieninformatik.it

Programme:
09.30 Welcome Address by Ingrid Neumann-Holzschuh, Dean, Philosophical Faculty IV, Regensburg University
09.45 Adam Kilgarriff, Lexicography MasterClass Ltd: "Googleology is Bad Science"
10.30 Christian Wolff, Regensburg University: "Corpus Comparison and Time Series Analysis"
11.15 Coffee Break
11.30 Jürgen Reischer, Regensburg University: "Information Mining in WordNet"
12.15 Alexander Mehler, Bielefeld University: "Automatic Classification of Hypertext Graphs. Toward a Structural Model of Webgenres"
13.00 Lunch Break
14.00 Serge Sharoff, Leeds University: "Turning the Web into the BNC: collecting and using Internet corpora"
14.45 Uwe Quasthoff, Leipzig University: "The production process of the Leipzig Corpora Collection"
15.30 Coffee Break
15.45 Adam Przepiórkowski, Polish Academy of Sciences: "Shallow Analysis for the Syntactic Annotation of a Large Polish Corpus"
16.30 Christian Biemann, Leipzig University: "Unsupervised POS-Tagging"
17.15 Closing Statement

We cordially invite students and researchers working on the above topics to join and discuss.

Sincerely,
Dr. Roland Meyer and Prof. Dr. Christian Wolff, Regensburg University



More information about the Corpora mailing list