[Corpora-List] KILO

Adam Kilgarriff adam at lexmasterclass.com
Fri Aug 12 06:11:59 UTC 2005


HECTOR was a research project between Oxford University Press (for
lexicography) and DEC (for computing) in the early 1990s.  It explored the
potential for improving dictionaries and computational lexicons through
close engagement with corpus data and use of sophisticated computing and NLP
software.  Lexicographers manually sense-tagged large sets of corpus
examples for a few hundred words, using hardware and software with what was,
for the time, a ground-breaking corpus search facility (and later turned
into Altavista, the Google of its day).  The project was the inspiration for
my work in the WASPS project (http://wasps.itri.bton.ac.uk
<http://wasps.itri.bton.ac.uk/>  ) and, more recently, the Sketch Engine
(http://sketchengine.co.uk <http://sketchengine.co.uk/>  )  Project leaders
from OUP's side were Sue Atkins (http://www.lexmasterclass.com
<http://www.lexmasterclass.com/>  ) and Patrick Hanks
http://www.patrickhanks.com/ 

 

It provided the English SENSEVAL-1 dictionary and corpus, used to evaluate
Word Sense Disambiguation (WSD) systems (http://www.senseval.org
<http://www.senseval.org/>  )

 

SEMCOR is a manually sense-tagged corpus produced by the Princeton WordNet
team, based on the WordNet sense inventory, freely available and which has
been very widely used for WSD and related tasks.  KILO was also a WordNet
team project - see http://wordnet.princeton.edu
<http://wordnet.princeton.edu/>  for more info (to state the obvious)

 

Adam

 

 

-----Original Message-----
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of InuH
Sent: 11 August 2005 23:22
To: CORPORA at UIB.NO
Subject: [Corpora-List] KILO

 

Hi list members,

I wonder if anyone could give me detailed information about KILO, HECTOR and
SEMCOR?

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20050812/d1a391ba/attachment.htm>


More information about the Corpora mailing list