22.4529, FYI: German SALSA Corpus Release 2.0
linguist at LINGUISTLIST.ORG
linguist at LINGUISTLIST.ORG
Sat Nov 12 18:55:46 UTC 2011
LINGUIST List: Vol-22-4529. Sat Nov 12 2011. ISSN: 1069 - 4875.
Subject: 22.4529, FYI: German SALSA Corpus Release 2.0
Moderators: Anthony Aristar, Eastern Michigan U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews: Veronika Drake, U of Wisconsin-Madison
Monica Macaulay, U of Wisconsin-Madison
Rajiv Rao, U of Wisconsin-Madison
Joseph Salmons, U of Wisconsin-Madison
Anja Wanner, U of Wisconsin-Madison
<reviews at linguistlist.org>
Homepage: http://linguistlist.org
The LINGUIST List is funded by Eastern Michigan University,
and donations from subscribers and publishers.
Editor for this issue: Brent Miller <brent at linguistlist.org>
================================================================
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.cfm.
===========================Directory==============================
1)
Date: 11-Nov-2011
From: Josef Ruppenhofer [josefr at coli.uni-saarland.de]
Subject: German SALSA Corpus Release 2.0
-------------------------Message 1 ----------------------------------
Date: Sat, 12 Nov 2011 13:55:35
From: Josef Ruppenhofer [josefr at coli.uni-saarland.de]
Subject: German SALSA Corpus Release 2.0
E-mail this message to a friend:
http://linguistlist.org/issues/emailmessage/verification.cfm?iss=22-4529.html&submissionid=4535712&topicid=6&msgnumber=1
The second and final release of the SALSA corpus, a German corpus
with semantic role annotations in the Berkeley FrameNet paradigm is
available for download at http://www.coli.uni-
saarland.de/projects/salsa/corpus/.
The corpus was created by the SALSA project at Saarland University
under the direction of Manfred Pinkal. Work on the corpus was
supported by funds from a Leibniz prize awarded to Manfred Pinkal
and by the German Science Foundation (DFG; grants PI 154/9-3, PI
154/8-1).
The frame semantic annotations are applied on top of the TIGER
treebank, a syntactically annotated German newspaper corpus. Salsa
release 2 references TIGER version 2.1.
More information on TIGER and FrameNet can be found here:
http://www.ims.uni-stuttgart.de/projekte/TIGER/
https://framenet.icsi.berkeley.edu/fndrupal/
SALSA uses the frames of FrameNet releases 1.2 and 1.3 for the
German annotation, wherever available and appropriate. In addition,
SALSA has developed a number of ''proto-frames'', i.e., predicate-
specific frames, to provide coverage for predicate instances currently
not covered by FrameNet. The total size of the annotation is roughly
20.000 verbal target instances and, new in Salsa release 2, more than
17.000 nominal target instances.
More information on SALSA can be found on the website:
http://www.coli.uni-saarland.de/projects/salsa/
The annotation scheme is described in:
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado and M. Pinkal. The
SALSA Corpus: a German Corpus Resource for Lexical Semantics. In:
Proceedings of LREC 2006, Genoa, Italy.
If you have any questions, feel free to send an email to
salsa-mit at coli.uni-sb.de
Linguistic Field(s): Computational Linguistics
Semantics
Text/Corpus Linguistics
-----------------------------------------------------------
LINGUIST List: Vol-22-4529
----------------------------------------------------------
More information about the LINGUIST
mailing list