[Corpora-List] New resource: Semcor 1.6 features for WSD

Eneko Agirre e.agirre at ehu.es
Thu Feb 22 13:59:19 UTC 2007


Dear list members,

in the context of the Semeval 2007 competition
(http://nlp.cs.swarthmore.edu/semeval/), the IXA NLP group has released
machine learning features for all content words with more than 10
occurrences in SemCor.

These features can be freely used for developing all-words supervised
Word Sense Disambiguation systems. The sense tags correspond to synsets of
WordNet v. 1.6, but the senses can be easily mapped to other versions (see
for instance http://www.lsi.upc.es/~nlp/tools/mapping.html).

You can download it from the Semeval WSD-CLIR task website:

http://ixa2.si.ehu.es/semeval-clir/

or directly from:

http://ixa2.si.ehu.es/semeval-clir/index_fitxategiak/task1.semcor1.6feats.v2.tar.gz

best

eneko, oier and david

-- 
---------------------http://ji.ehu.es/eneko------------------
Eneko Agirre                          PLEASE NOTE NEW E-MAIL:
Informatika Fakultatea                mailto: e.agirre at ehu.es
649 p.k. - 20.080 Donostia              fax: (+34) 943 015590
Euskal Herria / Basque Country          tel: (+34) 943 015019



More information about the Corpora mailing list