[Corpora-List] Wikipedia Talk Page Conversations Corpus

Cristian Danescu-Niculescu-Mizil cristiand at cs.stanford.edu
Tue Sep 18 16:52:19 UTC 2012


Announcing the availability of the Wikipedia Talk Page Conversations Corpus, a large collection of conversations extracted from Wikipedia editors' talk pages. The data includes over 125,000 conversations involving about 30,000 editors. Metadata such as each editor's status, the time of any status change, and gender is included. This corpus is released together with the paper:

"Echoes of power: Language effects and power differences in social interaction"
Cristian Danescu-Niculescu-Mizil, Lillian Lee, Bo Pang, and Jon Kleinberg
WWW 2012

The download site is:
http://www.mpi-sws.org/~cristian/Echoes_of_power.html


Cristian Danescu-Niculescu-Mizil, Lillian Lee, Bo Pang, and Jon Kleinberg