[Corpora-List] Seeking new results on WSD in applications

Philip Resnik resnik at umiacs.umd.edu
Fri Oct 21 16:28:10 UTC 2005


I recently posted a query to this list asking for pointers to "any
recent (2004 or later) results bearing on the usefulness of word sense
disambiguation (WSD) for end-user applications (such as machine
translation or question answering, and excluding WSD in support of
NLP-internal tasks such as parsing, semantic role labeling, learning
selectional preferences, etc.)."  I'd like to express my thanks to
Chris Brew, Chris Callison-Burch, Katrin Erk, Jesus Jimenez, Rada
Mihalcea, Paul Rayson, and Luis Villarejo for their replies.  Here's
my attempt at organizing/summarizing.

One category of responses concerned recent work on using WSD
techniques in connection with lexical selection in MT (with or without
an explicit sense inventory).  Relevant references include:

  Clara Cabezas and Philip Resnik, ``Using WSD Techniques for Lexical
  Selection in Statistical Machine Translation'', Technical report
  CS-TR-4736/LAMP-TR-124/UMIACS-TR-2005-42, July 2005.
  http://lampsrv01.umiacs.umd.edu/pubs/TechReports/LAMP_124/LAMP_124.pdf

  Marine Carpuat and Dekai Wu. "Word Sense Disambiguation
  vs. Statistical Machine Translation". 43rd Annual Meeting of the
  Association for Computational Linguistics (ACL-2005). Ann Arbor, MI:
  Jun 2005.

  Marine Carpuat and Dekai Wu. "Evaluating the Word Sense
  Disambiguation Performance of Statistical Machine
  Translation". Second International Joint Conference on Natural
  Language Processing (IJCNLP-2005). Jeju, Korea: Oct 2005.

  Je'su's Gime'nez, Llui's Ma'rquez, and German Rigau, Automatic
  Translation of WordNet Glosses. (EUROLAN'05 Cross-Language Knowledge
  Induction Workshop), July 2005.
  http://www.lsi.upc.edu/~jgimenez/PUBS/eurolan2005-gmr.pdf

  D. Vickrey, L. Biewald, M. Teyssier, and D. Koller. "Word-Sense
  Disambiguation for Machine Translation." Conference on Empirical
  Methods in Natural Language Processing (EMNLP), October 2005.
  http://ai.stanford.edu/~koller/papers.cgi?topic=9

Another set of references concerns use of WSD-like tagging of semantic
categories in applications involving "interpretation of results" by
the user, such as requirements engineering, content analysis,
dictionary search tools, and providing assistance for human
translators.

  Lo"fberg L, Juntunen J-P, Nykanen A, Varantola K, Rayson P, Archer
  D. (2004). Using a semantic tagger as dictionary search tool. In
  Williams G. and Vessier S. (eds.) Proceedings of the 11th EURALEX
  (European Association for Lexicography) International Congress
  (Euralex 2004), Lorient, France, 6-10 July 2004. Universite' de
  Bretagne Sud. Volume I, pp. 127-134.

  S. Sharoff, P. Rayson, O. Mudraya, A. Wilson and T. McEnery
  (2004). A tool for assisting translators using automatic semantic
  annotation. Presented at Corpus Use and Learning to Translate
  (CULT-BCN) Barcelona, January 22nd-24th 2004.

  Sawyer, P., Rayson, P. and Cosh, K. (forthcoming) Shallow Knowledge
  as an Aid to Deep Understanding in Early-Phase Requirements
  Engineering. IEEE Transactions on Software Engineering.

Two replies related to enabling tasks, not end-user applications per se:

  Colin Bannard and Chris Callison-Burch.  "Paraphrasing with bilingual
  parallel corpora." ACL-2005.
  http://www-csli.stanford.edu/~ccb/publications/ACL_2005a.pdf. They
  "found that WSD played an important role, though only did simulated
  WSD rather than doing it properly" (Chris C-B, p.c.)

  Katrin Erk is "working on the automatic assignment of FrameNet
  frames, which can be seen as a WSD task", and she suggested looking
  at Tatu and Moldovan, "A Semantic Approach to Recognizing Textual
  Entailment" (EMNLP-2005).  She also mentioned related work aimed at
  using FrameNet for Question Answering:
  http://www.springerlink.com/(k23eicebjg3n5w45qp0frc45)/app/home/contribution.asp?referrer=parent&backto=issue,6,42;journal,579,2263;linkingpublicationresults,1:105633,1

Finally, Chris Brew kindly suggested four other places/directions in
which to look for applications: ICALL (Intelligent Computer-Aided
Language Learning); support for lexicographers; support for librarians
assigning Library of Congress labels (or similar) to books (or
similar); disambiguation of user queries to search engines in order to
place ads within the search results.  (If anyone has specific recent
references related to these suggestions I'd love to know what they
are.)

Thanks again to everyone for their responses!

  Philip

  ----------------------------------------------------------------
  Philip Resnik, Associate Professor
  Department of Linguistics and Institute for Advanced Computer Studies

  1401 Marie Mount Hall            UMIACS phone: (301) 405-6760       
  University of Maryland           Linguistics phone: (301) 405-8903
  College Park, MD 20742 USA	   Fax: (301) 314-2644 / (301) 405-7104
  http://umiacs.umd.edu/~resnik	   E-mail: resnik at umiacs.umd.edu



More information about the Corpora mailing list