[Corpora-List] Algorithm for hypernym chaining?

Adam Funk a.funk at dcs.shef.ac.uk
Fri Jun 25 13:41:54 UTC 2010


Sorry if this is a bit off-topic, but at least it's for an NLP
application.  I have a file of pairs of WordNet synset numbers that
represent hypernym-hyponym pairs, for example:

100001930,100001740
100002137,100001740
100002452,100001930
100002684,100001930
100003553,100002684
100003993,100003553
100004258,100003553
100004475,100004258
100005787,100004475
100005930,100004475

and I want to produce all the longest possible chains of them.  The
output for that list is as follows:

100002137,100001740
100002452,100001930,100001740
100003993,100003553,100002684,100001930,100001740
100005787,100004475,100004258,100003553,100002684,100001930,100001740
100005930,100004475,100004258,100003553,100002684,100001930,100001740

I have a program that does this correctly for small samples of the data
I'm supposed to process, but I've estimated (using a log plot) that it
will take about 10^595 seconds to run over the whole list (about 89000
pairs), so I think I'm doing it wrong.

I haven't been able to find an algorithm for this, because I don't know
what the computational problem is called.  Can anyone point me in the
direction of something suitable?

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list