[Corpora-List] content based categories

Evgeniy Gabrilovich gabr at cs.technion.ac.il
Tue Dec 23 12:27:18 UTC 2003


Dear Martha Hofman,

There is an additional resource built around WordNet that labels word senses
with broad categories such as Economy, Medicine or Geography.
You can find more info about this project (called "WordNet domains") at
http://tcc.itc.it/research/textec/topics/disambiguation/wordnetdomains.html
According to your description, WordNet synsets might present too
fine-grained
distinctions for your task, while these broader categories might be more
appropriate.

Evgeniy.
--
Evgeniy Gabrilovich
Ph.D. student in Computer Science
Department of Computer Science, Technion - Israel Institute of Technology
Technion City, Haifa 32000, Israel
E-mail: gabr at cs.technion.ac.il WWW: http://www.cs.technion.ac.il/~gabr
Phone: (office) +972-4-8294948


  -----Original Message-----
  From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no]On
Behalf Of Martha Hofman
  Sent: Monday, December 22, 2003 18:16
  To: corpora at hd.uib.no
  Subject: [Corpora-List] content based categories



  Dear list members,

  To my question concerning content based word categories, I received the
following suggestions.
  My question was: For my thesis concerning an electronic dictionary for
second language learners, I am looking for content based categories of
nouns, verbs, adjectives and nouns. Who knows where I can find such
categories?

  Thanks for your reactions,

  Martha Hofman



  Suggestions:

  <http://dictionary.cambridge.org/researchers.htm>
  All senses are coded into semantic categories as part of a
"SmartThesaurus", and naturally all senses are also categorised
grammatically.

  For reviewing an electronic dictionary containing such data: for £15 buy
the CD-ROM, see <http://dictionary.cambridge.org/cald/cdrom>.

  Wordsmyth Children's Dictionary, at http://www.wordsmyth.net.  Some
entries have word relations data, called "Word Explorer" - for example, the
entry for "emotion".

  CELEX. has few types of nouns and verbs, but gives frequency estimates for
each.

  Verbs and nouns divided into "synsets" in the WordNet files:
http://www.cogsci.princeton.edu/~wn/

  Princeton WordNet (English)
  GermaNet (German)
  BalkaNet (several Middle and East European languages)
  EuroWordNet (multi-lingual)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20031223/732debc0/attachment.htm>


More information about the Corpora mailing list