24.23, Confs: Computational Linguistics, Text/Corpus Linguistics/Germany

linguist at linguistlist.org
Mon Jan 7 19:17:37 UTC 2013


LINGUIST List: Vol-24-23. Mon Jan 07 2013. ISSN: 1069 - 4875.

Subject: 24.23, Confs: Computational Linguistics, Text/Corpus Linguistics/Germany

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin Madison
         Monica Macaulay, U of Wisconsin Madison
         Rajiv Rao, U of Wisconsin Madison
         Joseph Salmons, U of Wisconsin Madison
         Anja Wanner, U of Wisconsin Madison
         <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Editor for this issue: Xiyan Wang <xiyan at linguistlist.org>
================================================================  


Date: Mon, 07 Jan 2013 14:15:44
From: Stefanie Dipper [dipper at linguistics.rub.de]
Subject: DGfS Workshop: Modelling Non-Standardized Writing

DGfS Workshop: Modelling Non-Standardized Writing 

Date: 13-Mar-2013 - 15-Mar-2013 
Location: Potsdam, Germany 
Contact: Michael Beißwenger 
Contact Email: nonstandard at linguistics.rub.de 
Meeting URL: http://empirikom.net/bin/view/Aktivitaeten/DgfsNonstandard 

Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics 

Meeting Description: 

Modelling Non-Standardized Writing
Workshop at the 35th Annual Conference of the German Linguistic Society (DGfS)
Potsdam, Germany 

Invited Speakers:

Jennifer Foster (Dublin City University, Ireland)
Christian Mair (University of Freiburg, Germany)

Non-standardized writing can be found in various contexts: for instance, when there are no generally established written language standards (e.g. in languages with an emergent writing system); when existing standards are deliberately ignored by the authors (in creative writing) or replaced by principles of verbalization that are borrowed from oral communication (e.g. when passing notes in a classroom); or when the norms of standard written language have not yet been fully acquired (as in learner texts). In particular, non-standardized writing plays an important role in research on Internet-based communication. The language used in emails, online forums, chats, blogs, etc. sometimes shows significant deviations from standard written language, while exhibiting characteristics of oral language at the levels of lexicon, morphology, and syntax. Well-known phenomena include nonstandard and creative spellings, ad-hoc speedwriting, graphic emulation of prosody and emphasis, acronyms, emoticons, and written representation of non-verbal behavior.

The systematic description of non-standardized writing and its structural and functional analysis are key challenges not only for research on Internet-based communication but also for data-based linguistic studies and corpus creation in many other research areas. The goal of the workshop is to shed light on this problem from the perspective of various research fields and to explore common issues and possible solutions. The main focus is on non-standardized writing phenomena in:

Genres of Internet-based communication
Language communities with emergent literacy
Historic varieties of language
Learner texts

Workshop Organizers:

Michael Beißwenger, TU Dortmund
Stefanie Dipper, Ruhr-Universität Bochum
Stefan Evert, TU Darmstadt
Bianka Trevisan, RWTH Aachen 

Wednesday, March 13 2013:

14.00-14.30
Michael Beißwenger / Stefanie Dipper / Stefan Evert / Bianka Trevisan: Introduction

14.30-15.00
Felix Bildhauer: Majuskelschreibung in Webkorpora: Verteilung und Funktion 

15.00-15.30
Ulrike Sayatz / Roland Schäfer: Klitika und Apostrophschreibung in Webkorpora zwischen Graphematik und Registerklassifikation

15.30-16.00
Klaus Geyer: Dialektgraphien in Mundart-Ausgaben von Asterix-Comics: Systematische, kreative und Mündlichkeitsaspekte in Interaktion

16.30-17.00
Sandra Waldenberger: Schriftsprachliche Variation und emergenter Standard im Übergang zur nhd. Orthographie

17.00-17.30
Michael Piotrowski: Towards Computational Graphemics

17.30-18.00
Ines Rehbein / Sören Schalowski / Nadja Reinhold / Emiel Visser: Äh... Ähm... Filled Pauses in Computer-Mediated Communication 

18.00-18.30
Stella Neumann / Paula Niemietz / Tatiana Serbina: Linguistic Annotation of Text Fragments in a Keystroke Logged Translation Corpus

Thursday, March 14 2013:

09.00-10.00
Jennifer Foster (keynote): #hardtoparse: The Challenges of Parsing the Language of Social Media

10.00-10.30
Markus Dickinson / Mohammad Khan / Sandra Kübler: Towards Parsing YouTube Comments 

10.30-11.00
Marc Reznicek / Heike Zinsmeister: Why Learner Texts are Easy to Tag. A Comparative Evaluation of Part-of-Speech Tagging of Kobalt

11.30-12.00
Aivars Glaznieks / Egon Stemle / Andrea Abel / Verena Lyding: Herausforderungen bei der Erstellung eines L1-Lernerkorpus: Lösungsvorschläge aus dem Projekt "KoKo"

12.00-12.30
Thomas Schmidt: Orthographische Normalisierung und PoS-Tagging von Transkriptionen gesprochener Sprache

12.30-13.00
Meikal Mumin: Explaining the Unexplainable -- On the Challenges of Transliterating Arabic Based Script

Friday, March 15 2013:

11.30-12.30
Christian Mair (keynote): From Transcribed Speech to Visual Ethnolinguistic Repertoires in Four Stages: Writing Pidgin and Creole Languages in Diasporic Web Forums

12.30-13.00
Thomas Bartz / Angelika Storrer: Korpusbasierte Analyse internetbasierter Kommunikation: Phänomene und Herausforderungen

13.00-13.30
Kay-Michael Würzner / Lothar Lemnitzer / Alexander Geyken / Bryan Jurish: Linguistische Annotation von Dokumenten internetbasierter Kommunikation - eine explorative Analyse

13.30-14.00
Christa Dürscheid / Simone Ueberwasser: (Kein) liberaler Umgang mit der orthographischen Norm. Empirische Befunde zur schriftlichen Alltagskommunikation

Reserve list:

1: Luc Belling: Andersschreibungen luxemburgischer Jugendlicher auf digitalen Pinnwänden
2: Monika Eller: Oral Strategies in Online User Feedback 
3: Hagen Hirschmann: Korpusgesteuerte Syntaxanalysen von Lernersprache
4: Armin Hoenen: Schreibererziehung und nicht standardisierte Schriftlichkeit

----------------------------------------------------------
LINGUIST List: Vol-24-23	
----------------------------------------------------------