Corpora: NAACL-01 WordNet and Other Lexical Resources Workshop
Priscilla Rasmussen
rasmusse at cs.rutgers.edu
Wed Jan 17 18:32:28 UTC 2001
________________________________________________________________
WordNet and Other Lexical Resources:
Applications, Extensions and Customizations
* Please note merger and extended deadline! *
NAACL 2001 Workshop
Carnegie Mellon University, Pittsburgh
3 and 4 June, 2001
Sponsored by the Association for Computational Linguistics Special
Interest Group on the Lexicon.
Previously announced as two different workshops:
- WordNet: Extensions and NLP Applications
- Customizing Lexical Resources
Lexical resources have become important basic tools within NLP and
related fields. The range of resources available to the researcher is
diverse and vast - from simple word lists to complex MRDs and
thesauruses. The resources contain a whole range of different types of
explicit linguistic information presented in different formats and at various
levels of granularity. Also, much information is left implicit in the
description, e.g. the definition of lexical entries generally contains
genus, encyclopaedic and usage information.
The majority of resources used by NLP researchers were not intended
for computational uses. For instance, MRDs are a by-product of the
dictionary publishing industry, and WordNet was an experiment in
modelling the mental lexicon.
In particular, WordNet has become a valuable resource in human
language technology and artificial intelligence. Due to its vast
coverage of English words, WordNet provides general
lexico-semantic information on which open-domain text processing is
based. Furthermore, the development of WordNets in several other
languages extends this capability to trans-lingual applications,
enabling text mining across languages. For example, in Europe, WordNet
has been used as the starting point for the development of a
multilingual database for several European languages (the EuroWordNet
project).
Other resources such as the Longman Dictionary of Contemporary English
and Roget's Thesaurus have also been used for various NLP tasks.
The topic of this workshop is the exploitation of existing resources
for particular computational tasks such as Word Sense Disambiguation,
Generation, Information Retrieval, Information Extraction, Question
Answering and Summarization. We invite paper submissions that include
but are not limited to the following topics:
- Resource usage in NLP and AI;
- Resource extension in order to reflect the lexical coverage within a
particular domain;
- Resource augmentation, e.g. by adding extra word senses or enriching
the information associated with the existing entries.
For instance, recently, several extensions of the WordNet lexical
database have been initiated, in the United States and abroad, with
the goal of providing the NLP community with additional knowledge that
models pragmatic information not always present in the texts but
required by document processing;
- Improvement of the consistency or quality of resources by
e.g. homogenizing lexical descriptions, making implicit lexical
knowledge explicit and clustering word senses;
- Merging resources, i.e. combining the information in more than one
resource e.g. by producing a mapping between their senses. For
instance, WordNet has been incorporated in several other linguistic
and general knowledge bases (e.g. FrameNet and CYC);
- Corpus-based acquisition of knowledge;
- Mining common sense knowledge from resources;
- Multilingual WordNets and applications.
Paper submission
Submissions must use the NAACL LaTeX style or Microsoft Word style. Paper
submissions should consist of a full paper (6 pages or fewer).
NAACL style file
NAACL bibliography style file
LaTeX sample file
Microsoft Word Template file
Submission procedure
Electronic submission only. For U.S. papers please send the PDF or
PostScript file of your paper to: moldovan at seas.smu.edu. Please submit
papers from other countries to w.peters at dcs.shef.ac.uk.
Because review is blind, no author information should be included in
the paper.
A separate identification page must be sent by email including title,
all authors, theme area, keywords, word count, and an abstract of no
more than 5 lines. Late submissions will not be accepted. Notification
of receipt will be e-mailed to the first author shortly after
receipt.
Please address any questions to moldovan at seas.smu.edu or
w.peters at dcs.shef.ac.uk
Important dates
Paper submission deadline: February 20, 2001
Notification of acceptance: March 10, 2001
Camera-ready copy due: March 25, 2001
Workshop date: June 3 and 4, 2001
Organizers
Sanda Harabagiu, SMU, sanda at seas.smu.edu
Dan Moldovan, SMU, moldovan at seas.smu.edu
Wim Peters, University of Sheffield, wim at dcs.shef.ac.uk
Mark Stevenson, University of Sheffield, marks at dcs.shef.ac.uk
Yorick Wilks, University of Sheffield, yorick at dcs.shef.ac.uk
Programme Committee
Roberto Basili (Universita di Roma Tor Vergata)
Martin Chodorow (Hunter College of CUNY)
Christiane Fellbaum (Princeton University)
Ken Haase (MIT)
Sanda Harabagiu (SMU)
Graeme Hirst (University of Toronto)
Robert Krovetz (NEC)
Claudia Leacock (ETS)
Steven Maiorano (AAT)
Rada Mihalcea (SMU)
Dan Moldovan (SMU)
Simonetta Montemagni (Istituto di Linguistica Computazionale, Pisa)
Martha Palmer (University of Pennsylvania)
Maria Teresa Pazienza (Universita di Roma Tor Vergata)
Wim Peters (University of Sheffield)
German Rigau (Universitat Politecnica de Catalunya)
Mark Stevenson (University of Sheffield)
Randee Tengi (Princeton University)
Paola Velardi (University of Rome "La Sapienza")
Ellen Voorhees (NIST)
Piek Vossen (Sail Labs)
Yorick Wilks (University of Sheffield)
Workshop URL:
http://www.seas.smu.edu/~moldovan/mwnw/