[Corpora-List] 3rd CfP / extended deadline: KONVENS 2014 Workshop: NLP 4 CMC: Natural Language Processing for Computer-Mediated Communication / Social Media

"Michael Beißwenger" michael.beisswenger at uni-dortmund.de
Sat Jun 14 11:19:49 UTC 2014


EXTENDED DEADLINE -- 3rd CALL FOR PAPERS:

===================================================================
Workshop:
NLP 4 CMC: Natural Language Processing for Computer-Mediated
Communication /Social Media
===================================================================

Pre-conference workshop at KONVENS2014
Hildesheim/Germany
October 6, 2014.

https://sites.google.com/site/nlp4cmc/

Submission Deadline (EXTENDED): July 15, 2014


TOPIC AND SCOPE OF THE WORKSHOP:

Over the past decade, there has been a growing interest in collecting,
processing and analyzing data from genres of social media and
computer-mediated communication (CMC): As part of large corpora which have
been automatically crawled from the WWW, CMC data are often regarded as an
unloved "bycatch" which is difficult to handle with NLP tools that have
been optimized for processing edited text; on the other hand, these data
are important parts of web corpora for all research and application
contexts which require data sets that represent the diversity of genres
and linguistic variation on the web. For corpus-based variational
linguistics, CMC corpora are an important resource for closing the "CMC
gap" both in corpora of contemporary written language and in corpora of
spoken language: Since CMC and social media make up an important part of
everyday communication, investigations into language change and linguistic
variation need to be able to include CMC and social media data into their
empirical analyses.

Nevertheless, the development of approaches and tools for processing the
linguistic and structural peculiarities of CMC genres and for building CMC
corpora is lacking behind the interest of dealing with these types of data
in the field of language technology, corpus-based linguistics and web
mining.

The goal of this workshop is to provide a platform for the presentation of
results and ongoing work in adapting NLP tools for processing CMC / social
media data. The focus of the workshop is on German data, but submissions
on NLP approaches, annotation experiments etc. for data of other European
languages are also welcome as long as they can make a significant
contribution to the further development of the processing of CMC
phenomena.

TOPICS OF INTEREST:

We encourage the submission of long and short research and demo papers
including, but not restricted to the following topics related to social
media / computer-mediated communication:

*  Corpora and lexical semantic resources for the analysis of social media
/ computer-mediated communication

*  Normalization (spelling correction, ...)

*  Automatic preprocessing (tokenization, POS tagging, lemmatization,
parsing, word sense disambiguation)

*  Annotation of linguistic and structural features in social media / CMC
data (annotation schemas, annotation experiments, ...)

*  Domain adaptation

*  Automatic methods in corpus-based CMC / social media analysis
(sentiment, summarization, trend detection, ...)

*  Big-data social media analysis


IMPORTANT DATES

*  Submissions due: 15 July 2014

*  Notification: 08 August 2014

*  Camera-ready papers due: 30 August 2014

*  Workshop: 6 October 2014


SUBMISSIONS:

Submissions should include the names and addresses of all authors and meet
the following requirements:

*  Full Papers (8 pages)

*  Short Papers (2-4 pages) or Extended Abstracts (500-1000 words):
position papers or work in progress

*  Demonstrations (2-4 pages): presentation of systems or prototypes

*  Submissions need to be made in English and should be in PDF format

*  Submissions need to follow the KONVENS format
(http://www.uni-hildesheim.de/konvens2014/pages/Submissions.html)

Submissions will be accepted via the Easychair system:
https://www.easychair.org/conferences/?conf=nlp4cmc

ORGANIZERS:

*  Michael Beißwenger (TU Dortmund University)
*  Torsten Zesch (University of Duisburg-Essen)

The workshop is organized by the special interest group "Social Media /
Computer-Mediated Communication" of the German Scoiety for Computational
Linguistics & Language Technology (GSCL) (http://gscl.org/ak-ibk.html).

PROGRAM COMITEE:

* Sabine Bartsch (TU Darmstadt)
* Thomas Bartz (TU Dortmund)
* Michael Beißwenger (TU Dortmund)
* Thierry Chanier (Université Blaise Pascal, Clermont-Ferrand)
* Isabella Chiari (Università "La Sapienza", Rome)
* Stefanie Dipper (Ruhr-Universität Bochum)
* Stefan Evert (Universität Erlangen)
* Iris Hendrickx (Radboud University Nijmegen)
* Verena Henrich (Universität Tübingen)
* Lothar Lemnitzer (BBAW, Berlin)
* Anke Lüdeling (Humboldt-Universität Berlin)
* Harald Lüngen (IDS, Mannheim)
* Preslav Nakov (Qatar QCRI)
* Günter Neumann (DFKI, Saarbrücken)
* Melanie Neunerdt (RWTH Aachen)
* Nelleke Oostdijk (Radboud University Nijmegen)
* Ines Rehbein (Universität Potsdam)
* Benoît Sagot (Université Paris 13, Sorbonne Paris Cité)
* Roman Schneider (IDS, Mannheim)
* Egon W. Stemle (EURAC, Bozen)
* Angelika Storrer (Universität Mannheim)
* Kay-Michael Würzner (Universität Potsdam)
* Torsten Zesch (Universität Duisburg-Essen)

WORKSHOP WEBSITE:

https://sites.google.com/site/nlp4cmc/



__________________________________________
Priv.-Doz. Dr. Michael Beißwenger
Technische Universität Dortmund
Institut für deutsche Sprache und Literatur
D-44221 Dortmund
Fon: +49 (0)231 755 2902
Mail: michael.beisswenger at tu-dortmund.de
http://www.michael-beisswenger.de
http://www.germanistik.tu-dortmund.de
empirikom: http://www.empirikom.net


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list