23.3863, FYI: New Falko German Learner Corpus Release

linguist at linguistlist.org linguist at linguistlist.org
Mon Sep 17 17:10:24 UTC 2012


LINGUIST List: Vol-23-3863. Mon Sep 17 2012. ISSN: 1069 - 4875.

Subject: 23.3863, FYI: New Falko German Learner Corpus Release

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin Madison
Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Do you want to donate to LINGUIST without spending an extra penny? Bookmark
the Amazon link for your country below; then use it whenever you buy from
Amazon!

USA: http://www.amazon.com/?_encoding=UTF8&tag=linguistlist-20
Britain: http://www.amazon.co.uk/?_encoding=UTF8&tag=linguistlist-21
Germany: http://www.amazon.de/?_encoding=UTF8&tag=linguistlistd-21
Japan: http://www.amazon.co.jp/?_encoding=UTF8&tag=linguistlist-22
Canada: http://www.amazon.ca/?_encoding=UTF8&tag=linguistlistc-20
France: http://www.amazon.fr/?_encoding=UTF8&tag=linguistlistf-21

For more information on the LINGUIST Amazon store please visit our
FAQ at http://linguistlist.org/amazon-faq.cfm.

Editor for this issue: Brent Miller <brent at linguistlist.org>
================================================================  


Date: Mon, 17 Sep 2012 13:10:15
From: Marc Reznicek [marc.reznicek at staff.hu-berlin.de]
Subject: New Falko German Learner Corpus Release

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=23-3863.html&submissionid=4553626&topicid=6&msgnumber=1
 The error-annotated German learner corpus Falko has released a new
subcorpus: FalkoEssayL2WHIGv2.0 including 195 argumentative essays by
advanced learners of German (117,189 tokens).

For each text two full-text target hypotheses (a minimal morphosyntactic
normalization and an extended semantic-pragmatic version) have been manually
annotated. 

Each representation has been POS-tagged and lemmatized (Treetagger &
rfTagger). rfTagger morphological annotation has been integrated as well.

On this basis, tags indicating differences between the learner text and its
POS and lemma annotations and the respective target hypotheses (POS & lemma)
have been added.

The corpus is freely available under the following link:

http://korpling.german.hu-berlin.de/falko-suche

The annotation guidelines can be found here:
http://www.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/for
schung/falko/Falko-Handbuchv2.0.pdf 



Linguistic Field(s): Language Acquisition
                     Text/Corpus Linguistics

Subject Language(s): German (deu)



----------------------------------------------------------
LINGUIST List: Vol-23-3863	
----------------------------------------------------------



More information about the LINGUIST mailing list