22.4529, FYI: German SALSA Corpus Release 2.0

linguist at LINGUISTLIST.ORG linguist at LINGUISTLIST.ORG
Sat Nov 12 18:55:46 UTC 2011


LINGUIST List: Vol-22-4529. Sat Nov 12 2011. ISSN: 1069 - 4875.

Subject: 22.4529, FYI: German SALSA Corpus Release 2.0

Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin-Madison
Monica Macaulay, U of Wisconsin-Madison
Rajiv Rao, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.

Editor for this issue: Brent Miller <brent at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.

===========================Directory==============================  

1)
Date: 11-Nov-2011
From: Josef Ruppenhofer [josefr at coli.uni-saarland.de]
Subject: German SALSA Corpus Release 2.0


-------------------------Message 1 ---------------------------------- 
Date: Sat, 12 Nov 2011 13:55:35
From: Josef Ruppenhofer [josefr at coli.uni-saarland.de]
Subject: German SALSA Corpus Release 2.0

E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=22-4529.html&submissionid=4535712&topicid=6&msgnumber=1
 
The second and final release of the SALSA corpus, a German corpus 
with semantic role annotations in the Berkeley FrameNet paradigm is 
available for download at http://www.coli.uni-
saarland.de/projects/salsa/corpus/.

The corpus was created by the SALSA project at Saarland University 
under the direction of Manfred Pinkal. Work on the corpus was 
supported by funds from a Leibniz prize awarded to Manfred Pinkal 
and by the German Science Foundation (DFG; grants PI  154/9-3, PI 
154/8-1).

The frame semantic annotations are applied on top of the TIGER 
treebank, a syntactically annotated German newspaper corpus. Salsa 
release 2 references TIGER version 2.1.

More information on TIGER and FrameNet can be found here:

http://www.ims.uni-stuttgart.de/projekte/TIGER/
https://framenet.icsi.berkeley.edu/fndrupal/

SALSA uses the frames of FrameNet releases 1.2 and 1.3 for the 
German annotation, wherever available and appropriate. In addition, 
SALSA has developed a number of ''proto-frames'', i.e., predicate-
specific frames, to provide coverage for predicate instances currently 
not covered by FrameNet. The total size of the annotation is roughly 
20.000 verbal target instances and, new in Salsa release 2, more than 
17.000 nominal target instances.

More information on SALSA can be found on the website:

http://www.coli.uni-saarland.de/projects/salsa/

The annotation scheme is described in:

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado and M. Pinkal. The 
SALSA Corpus: a German Corpus Resource for Lexical Semantics. In: 
Proceedings of LREC 2006, Genoa, Italy.

If you have any questions, feel free to send an email to

salsa-mit at coli.uni-sb.de 



Linguistic Field(s): Computational Linguistics
                     Semantics
                     Text/Corpus Linguistics





 





-----------------------------------------------------------
LINGUIST List: Vol-22-4529	
----------------------------------------------------------



More information about the LINGUIST mailing list