[Corpora-List] Faster tool for WordNet Similarity measures
Mark Sammons
mssammon at illinois.edu
Tue Feb 1 21:28:22 UTC 2011
Hi, Suzan.
If what you are after is just a similarity score, you could
try the Cognitive Computation Group's WordNet-based similarity metric,
written in c++ and -- anecdotally -- pretty fast.
It runs as an xmlrpc service, which imposes a certain network latency
overhead, *but* is language neutral, which is one appealing feature. Within
our research group, many users call it and cache the response to reduce
processing time still further. If you are coding in c++, then you could of course
call WNSim directly.
We've used WNSim in a number of research projects, including our work on
Recognizing Textual Entailment and on Distant Supervision. Here's the page for
the WNSim code:
http://cogcomptest.cs.illinois.edu/page/software_view/21
There's also a link to a technical report that describes the underlying methodology.
You can take a look at the output using the demo here:
http://cogcomp.cs.illinois.edu/demo/wnsim/
Regards,
Mark
---- Original message ----
>Date: Tue, 1 Feb 2011 10:25:23 +0100
>From: Suzan Verberne <s.verberne at let.ru.nl>
>Subject: [Corpora-List] Faster tool for WordNet Similarity measures
>To: Corpora List <corpora at uib.no>
>
>Hi all,
>
>I have previously been using Pedersen's WordNet Similarity module (
>http://search.cpan.org/dist/WordNet-Similarity/lib/WordNet/Similarity.pm
>) for calculating the similarity or relatedness between pairs of
>words. Now I started to use it again but I noticed that it is way too
>slow for a real-time application (which is what I need now).
>
>I originally wrote a simple Perl script that calls the module (shown
>below) but it takes almost five seconds to run. Almost all this time
>is spent on calling the module so for batch scripts it is fine (then
>the module is only called once for multiple requests), but I need it
>to work in real time in a retrieval experiment and then 5 seconds is
>too long.
>
>Does anyone know an alternative (fast!) tool for calculating
>Similarity and/or Relatedness between two words? It might be using
>either a Wu & Palmer-like measure or a Lesk-type measure.
>
>Thanks!
>Suzan Verberne
>
>#! /usr/bin/perl
> use WordNet::QueryData;
> use WordNet::Similarity::path;
> my $wn = WordNet::QueryData->new;
> my $measure = WordNet::Similarity::path->new ($wn);
> my $value = $measure->getRelatedness("car#n#1", "bus#n#2");
> print "car (sense 1) <-> bus (sense 2) = $value\n";
>
>
>--
>Suzan Verberne, postdoctoral researcher
>Centre for Language and Speech Technology
>Radboud University Nijmegen
>Tel: +31 24 3611134
>Email: s.verberne at let.ru.nl
>http://lands.let.ru.nl/~sverbern/
>--
>
>_______________________________________________
>Corpora mailing list
>Corpora at uib.no
>http://mailman.uib.no/listinfo/corpora
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list