[Corpora-List] New resources and tools for GermaNet
Verena Henrich
verena.henrich at uni-tuebingen.de
Mon Jan 7 14:04:21 UTC 2013
Apologies for cross postings
------------------------
We are happy to announce the availability of three new resources for the
German wordnet GermaNet:
1) WebCAGe (short for: Web-Harvested Corpus Annotated with GermaNet
Senses) is a domain-independent web-harvested corpus that has been
semi-automatically annotated with senses from GermaNet. The corpus has
been constructed on the basis of a the sense alignment of GermaNet
senses with senses from the online dictionary Wiktionary. Wiktionary
senses are frequently illustrated by one or more example sentences,
which in turn are often linked to external references, including
Wikipedia articles and other textual web sources. Therefore, the
GermaNet-Wiktionary alignment and the various pointers contained in
Wiktionary example sentences made it possible to automatically assemble
a corpus annotated with GermaNet senses. In order to assure good
quality, all automatic annotations have been manually verified. WebCAGe
is freely available for download at:
http://www.sfs.uni-tuebingen.de/en/ascl/resources/corpora/webcage.html
2) Semantic Relatedness API: To calculate semantic relatedness between
two word senses in GermaNet, we have reimplemented a suite of semantic
relatedness algorithms for German that are well-known for English,
including the methods proposed by Leacock and Chodorow (1998), Wu and
Palmer (1994), Hirst and St-Onge (1998), Resnik (1995), Jiang and
Conrath (1997), Lin (1998), and Lesk (1986):
http://www.sfs.uni-tuebingen.de/GermaNet/tools.shtml#SemRelAPI
3) GernEdiT (GermaNet Editing Tool) is a graphical editor that supports
several ways to search and visualize GermaNet data. It can be used to
browse through the GermaNet graph, to list all senses of a word, and to
view all properties of a particular word sense, including relations to
other word senses, Wiktionary paraphrases, and English translations.
GernEdiT is used by the lexicographers to maintain and extend the
GermaNet database in a user-friendly way. Now we made GernEdiT available
for interested users:
http://www.sfs.uni-tuebingen.de/GermaNet/tools.shtml#GernEdiT
GermaNet is a lexical-semantic net that relates German nouns, verbs, and
adjectives semantically by grouping lexical units that express the same
concept into synsets and by defining semantic relations between these
synsets. GermaNet has much in common with the English WordNet
(http://wordnet.princeton.edu) and can be viewed as an on-line thesaurus
or a light-weight ontology.
GermaNet has been developed and maintained within various projects by
the research group for General and Computational Linguistics (Director:
Prof. Dr. Erhard Hinrichs) at the University of Tübingen since 1997.
For more information about GermaNet, please consult the project website:
http://www.sfs.uni-tuebingen.de/GermaNet/
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list