24.2250, FYI: 40474 Split Compounds from GermaNet Available

linguist at linguistlist.org
Fri May 31 14:03:47 UTC 2013

LINGUIST List: Vol-24-2250. Fri May 31 2013. ISSN: 1069 - 4875.

Subject: 24.2250, FYI: 40474 Split Compounds from GermaNet Available

Moderator: Damir Cavar, Eastern Michigan U <damir at linguistlist.org>

Reviews: Veronika Drake, U of Wisconsin Madison
Monica Macaulay, U of Wisconsin Madison
Rajiv Rao, U of Wisconsin Madison
Joseph Salmons, U of Wisconsin Madison
Mateja Schuck, U of Wisconsin Madison
Anja Wanner, U of Wisconsin Madison
       <reviews at linguistlist.org>

Homepage: http://linguistlist.org

Editor for this issue: Brent Miller <brent at linguistlist.org>

Date: Fri, 31 May 2013 10:03:44
From: Verena Henrich [verena.henrich at uni-tuebingen.de]
Subject: 40474 Split Compounds from GermaNet Available

We are happy to announce the availability of 40474 German nominal compounds
from GermaNet release 8.0 that have been split into their constituent parts,
i.e., modifier and head. The dataset was constructed semi-automatically, and
all compound splits were manually post-corrected.

The list of split compounds is freely available for download at

For many applications, it is helpful to know the parts of a compound, since
its semantic interpretation is usually based on the meanings of its parts.
Compound splitting is challenging for German because compounding, a very
productive word-formation process in German, is not always simple string
concatenation. It often involves intervening linking elements or the elision
of word-final characters in the modifier constituent of a compound.
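
To illustrate why plain concatenation is not enough, here is a minimal
sketch of a lexicon-based splitter. It is not the procedure used to build
the GermaNet dataset; the toy lexicon, the inventory of linking elements,
and the function name split_compound are assumptions made purely for
illustration, and only elision of a word-final "e" is handled.

```python
# Hypothetical toy lexicon of known simplex nouns (lowercased).
TOY_LEXICON = {"arbeit", "zimmer", "schule", "buch",
               "sonne", "schein", "haus", "tür"}

# Assumed inventory of linking elements; "" means plain concatenation.
LINKING_ELEMENTS = ("", "s", "es", "n", "en", "er", "e")


def split_compound(word, lexicon):
    """Return candidate (modifier, head) pairs, lowercased.

    For every split point whose right part is a known head, the left
    part is accepted as a modifier if it is
      1. a lexicon word followed by an optional linking element, or
      2. a lexicon word whose final "e" was elided.
    """
    word = word.lower()
    candidates = []
    for i in range(1, len(word)):
        left, head = word[:i], word[i:]
        if head not in lexicon:
            continue
        # Case 1: modifier + optional linking element (e.g. Arbeit+s+Zimmer).
        for link in LINKING_ELEMENTS:
            if link and not left.endswith(link):
                continue
            stem = left[:len(left) - len(link)] if link else left
            if stem in lexicon:
                candidates.append((stem, head))
        # Case 2: elided word-final "e" (e.g. Schule + Buch -> Schulbuch).
        if left + "e" in lexicon:
            candidates.append((left + "e", head))
    return candidates


if __name__ == "__main__":
    for w in ("Haustür", "Arbeitszimmer", "Sonnenschein", "Schulbuch"):
        print(w, split_compound(w, TOY_LEXICON))
```

With this toy lexicon, "Haustür" splits by simple concatenation,
"Arbeitszimmer" and "Sonnenschein" need a linking element ("s" and "n"),
and "Schulbuch" is only found once the elided "e" of "Schule" is restored.
A realistic lexicon would yield multiple competing candidates for many
words, which is one reason manual post-correction is valuable.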

For more information about GermaNet, please consult the project website:

Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics

Subject Language(s): German (deu)

