Corpora: NAACL-01 WordNet and Other Lexical Resources Workshop

Wed Jan 17 18:32:28 UTC 2001

________________________________________________________________

WordNet and Other Lexical Resources:
Applications, Extensions and Customizations

* Please note merger and extended deadline! *

NAACL 2001 Workshop

Carnegie Mellon University, Pittsburgh

3 and 4 June, 2001

Sponsored by the Association for Computational Linguistics Special
Interest Group on the Lexicon.

Previously announced as two different workshops:
- WordNet: Extensions and NLP Applications
- Customizing Lexical Resources

Lexical resources have become important basic tools within NLP and
related fields. The range of resources available to the researcher is
diverse and vast - from simple word lists to complex MRDs and
thesauruses. The resources contain a whole range of different types of
explicit linguistic information presented in different formats and at various
levels of granularity. Also, much information is left implicit in the
description, e.g. the definition of lexical entries generally contains
genus, encyclopaedic and usage information.

The majority of resources used by NLP researchers were not intended
for computational uses. For instance, MRDs are a by-product of the
dictionary publishing industry, and WordNet was an experiment in
modelling the mental lexicon.

In particular, WordNet has become a valuable resource in human
language technology and artificial intelligence. Due to its vast
coverage of English words, WordNet provides the general
lexico-semantic information on which open-domain text processing is
based. Furthermore, the development of WordNets in several other
languages extends this capability to trans-lingual applications,
enabling text mining across languages. For example, in Europe, WordNet
has been used as the starting point for the development of a
multilingual database for several European languages (the EuroWordNet
project).
Other resources such as the Longman Dictionary of Contemporary English
and Roget's Thesaurus have also been used for various NLP tasks.

The topic of this workshop is the exploitation of existing resources
for particular computational tasks such as Word Sense Disambiguation,
Generation, Information Retrieval, Information Extraction, Question
Answering and Summarization. We invite paper submissions that include
but are not limited to the following topics:

- Resource usage in NLP and AI;

- Resource extension in order to reflect the lexical coverage within a
  particular domain;

- Resource augmentation, e.g. by adding extra word senses or enriching
  the information associated with existing entries. For instance,
  several extensions of the WordNet lexical database have recently
  been initiated, in the United States and abroad, with the goal of
  providing the NLP community with additional knowledge that models
  pragmatic information not always present in texts but required by
  document processing;

- Improvement of the consistency or quality of resources by
  e.g. homogenizing lexical descriptions, making implicit lexical
  knowledge explicit and clustering word senses;

- Merging resources, i.e. combining the information in more than one
  resource e.g. by producing a mapping between their senses. For
  instance, WordNet has been incorporated in several other linguistic
  and general knowledge bases (e.g. FrameNet and CYC);

- Corpus-based acquisition of knowledge;

- Mining common sense knowledge from resources;

- Multilingual WordNets and applications.

Paper submission

 Submissions must use the NAACL LaTeX style or Microsoft Word style. Paper
 submissions should consist of a full paper (6 pages or fewer).

 NAACL style file
 NAACL bibliography style file
 LaTeX sample file
 Microsoft Word Template file

Submission procedure

Electronic submission only. For U.S. papers, please send the PDF or
PostScript file of your paper to: moldovan at seas.smu.edu. Please submit
papers from other countries to w.peters at dcs.shef.ac.uk.
Because review is blind, no author information should be included as
part of the paper.
A separate identification page must be sent by email including title,
all authors, theme area, keywords, word count, and an abstract of no
more than 5 lines. Late submissions will not be accepted. Notification
of receipt will be e-mailed to the first author shortly after
receipt.
Please address any questions to moldovan at seas.smu.edu or
w.peters at dcs.shef.ac.uk.

Important dates

 Paper submission deadline: February 20, 2001

 Notification of acceptance: March 10, 2001

 Camera ready due: March 25, 2001

 Workshop date: June 3 and 4, 2001

Organizers

Sanda Harabagiu, SMU, sanda at seas.smu.edu
Dan Moldovan, SMU, moldovan at seas.smu.edu
Wim Peters, University of Sheffield, wim at dcs.shef.ac.uk
Mark Stevenson, University of Sheffield, marks at dcs.shef.ac.uk
Yorick Wilks, University of Sheffield, yorick at dcs.shef.ac.uk

Programme Committee

Roberto Basili (Universita di Roma Tor Vergata)
Martin Chodorow (Hunter College of CUNY)
Christiane Fellbaum (Princeton University)
Ken Haase (MIT)
Sanda Harabagiu (SMU)
Graeme Hirst (University of Toronto)
Robert Krovetz (NEC)
Claudia Leacock (ETS)
Steven Maiorano (AAT)
Rada Mihalcea (SMU)
Dan Moldovan (SMU)
Simonetta Montemagni (Istituto di Linguistica Computazionale, Pisa)
Martha Palmer (University of Pennsylvania)
Maria Teresa Pazienza (Universita di Roma Tor Vergata)
Wim Peters (University of Sheffield)
German Rigau (Universitat Politecnica de Catalunya)
Mark Stevenson (University of Sheffield)
Randee Tengi (Princeton University)
Paola Velardi (University of Roma "La Sapienza")
Ellen Voorhees (NIST)
Piek Vossen (Sail Labs)
Yorick Wilks (University of Sheffield)

Workshop URL:
http://www.seas.smu.edu/~moldovan/mwnw/