Corpora: Coling Grammar Engineering & Evaluation Workshop

Richard.Sutcliffe Richard.Sutcliffe at ul.ie
Fri Mar 22 13:19:31 UTC 2002


Call for Papers

Grammar Engineering and Evaluation
Sunday 1 September 2002

Workshop to be held in conjunction with Coling 2002, Taipei
24 August - 1 September 2002

www.coling2002.sinica.edu.tw/
www.csis.ul.ie/gee02

Overview

Grammars are central components of many types of NLP system. The
workshop will be concerned with methods for the effective
engineering and evaluation of grammars with particular emphasis
on their use in real-world applications.

Background

Recent years have seen the development of techniques and
resources to support robust, deep grammatical analysis of
language in real-world domains, for instance in flexible
human-computer dialog systems (e.g.  the Dutch OVIS prototype
train information system) and speech-to-speech translation
(e.g. the Verbmobil system). The demands of these types of tasks
have driven significant advances in areas such as parser
efficiency, hybrid statistical / symbolic approaches to
disambiguation, and the acquisition of large-scale lexicons. In
response to these successes deep language processing is starting
to be deployed in commercial applications such as automated email
response.

The effective development, maintenance and enhancement of
grammars is a central issue in such efforts, and the size and
complexity of realistic grammars forces these processes to be
tackled in ways that have much in common with software
engineering. Thus, two common metrics defined over grammars are
coverage and degree of overgeneration; these can be evaluated by
applying the grammar to manually-constructed test suites of
grammatical and ungrammatical inputs, ideally supported by
automated profiling and visualisation tools. Examples of test
suites include those that have been produced on the TSNLP, DiET
and Verbmobil projects, while the Saarbruecken [incr tsdb()]
system is one of the established profiling tools. Since grammars
are expensive to develop, another important concern is the
effective re-use of existing grammatical resources: some grammar
formalisms facilitate this by for example allowing grammar
writers to structure the grammar hierarchically or in terms of
individual classes with modularised behaviour. A further issue is
how to support a team of grammarians working on the same or
related grammars; a notable effort in this area is the Xerox-led
collaborative ParGram project developing parallel grammars for
several different languages.


Objectives

The objectives of the workshop will be to summarise what has been
achieved in the areas of grammar engineering and evaluation, to
establish the common themes between different approaches and to
discuss future trends, with particular emphasis on real-world
applications. The focus will be on grammars rather than parsing
algorithms or the accuracy of parsing systems, on approaches
which enable re-use of resources, and on methods which are
suitable for multilingual systems.

In particular, contributions are solicited in the following areas:

  * Methods of grammar development and discussions of their strengths
  and weaknesses;

  * Standards for encoding grammatical information in a theory-neutral
  fashion;

  * Comparisons of manual techniques with those involving learning from
  treebanks;

  * Techniques for establishing the effectiveness, coverage or quality
  of a grammar;

  * The determination of time or effort required to achieve a level of
  performance or to adapt an existing grammar to a new application
  domain;

  * The application of a grammatical formalism to widely different
  languages; and

  * Issues in porting grammars between languages.


Submissions

Abstracts for workshop contributions should not exceed two A4 pages
(excluding references). An additional title page should state: the
title; author(s); affiliation(s); and contact author's e-mail address,
as well as postal address, telephone and fax numbers.

Submission is to be sent by email, preferably in Postscript or PDF
format, to Richard Sutcliffe by Friday 26 April 2002. Abstracts
will be reviewed by at least 3 members of the program committee.

Formatting instructions for the final full version of papers will be
sent to authors after notification of acceptance.

Accepted papers will appear in the printed proceedings which will be
available to all those who register for the workshop.

The proceedings of all workshops will also be included in the Coling
CD ROM along with the tutorials and the proceedings of the main
conference.

Important Dates

Deadline for Submissions: Fri 26 April 2002
Notification of Acceptance: Fri 24 May 2002
Final Versions of Papers Due: Fri 28 June 2002
Workshop: Sun 1 September 2002


Workshop Chairs

John A. Carroll
Cognitive and Computing Sciences
University of Sussex
Falmer, Brighton BN1 9QH
UK

johnca at cogs.susx.ac.uk
www.cogs.susx.ac.uk/lab/nlp/carroll/

Nelleke H. J. Oostdijk
Department of Language and Speech
University of Nijmegen
P.O. Box 9103
6500 HD Nijmegen
The Netherlands

n.oostdijk at let.kun.nl
lands.let.kun.nl/TSpublic/tosca/

Richard F. E. Sutcliffe (Contact Person)
Department of Computer Science
and Information Systems
University of Limerick
Limerick, Ireland

Richard.Sutcliffe at ul.ie
www.csis.ul.ie/staff/richard.sutcliffe


Programme Committee

Rens Bod, University of Amsterdam
Ted Briscoe, University of Cambridge
John Carroll, University of Sussex
Anette Frank, DFKI Saarbruecken
Gregory Grefenstette, Clairvoyance, Pittsburgh
Claire Grover, University of Edinburgh
Sadao Kurohashi, The University of Tokyo
Stephan Oepen, CSLI Stanford
Nelleke Oostdijk, University of Nijmegen
Richard Sutcliffe, University of Limerick
Atro Voutilainen, Conexor oy



More information about the Corpora mailing list