12.949, Confs: Linguistic Databases (IRCS)

The LINGUIST Network linguist at linguistlist.org
Thu Apr 5 16:39:30 UTC 2001


LINGUIST List:  Vol-12-949. Thu Apr 5 2001. ISSN: 1068-4875.

Subject: 12.949, Confs: Linguistic Databases (IRCS)

Moderators: Anthony Aristar, Wayne State U.<aristar at linguistlist.org>
            Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
            Andrew Carnie, U. of Arizona <carnie at linguistlist.org>

Reviews (reviews at linguistlist.org):
	Simin Karimi, U. of Arizona
	Terence Langendoen, U. of Arizona

Editors (linguist at linguistlist.org):
	Karen Milligan, WSU 		Naomi Ogasawara, EMU
	Lydia Grebenyova, EMU		Jody Huellmantel, WSU
	James Yuells, WSU		Michael Appleby, EMU
	Marie Klopfenstein, WSU		Ljuba Veselinova, Stockholm U.

Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
          Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>

Home Page:  http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.



Editor for this issue: Jody Huellmantel <jody at linguistlist.org>
 ==========================================================================
Please keep conferences announcement as short as you can; LINGUIST
will not post conference announcements which in our opinion are
excessively long.


=================================Directory=================================

1)
Date:  Thu, 05 Apr 2001 11:35:09 EDT
From:  Steven Bird <sb at unagi.cis.upenn.edu>
Subject:  Linguistic Databases (IRCS)

-------------------------------- Message 1 -------------------------------

Date:  Thu, 05 Apr 2001 11:35:09 EDT
From:  Steven Bird <sb at unagi.cis.upenn.edu>
Subject:  Linguistic Databases (IRCS)



		   IRCS WORKSHOP ON LINGUISTIC DATABASES

			University of Pennsylvania
			     Philadelphia, USA
			    11-13 December 2001

	       http://www.ldc.upenn.edu/annotation/database/


	       Sponsored by the National Science Foundation
	    and the Institute for Research in Cognitive Science

			       Organized by:
	       Steven Bird, Peter Buneman and Mark Liberman
	      Department of Computer and Information Science,
       Department of Linguistics, and the Linguistic Data Consortium
			University of Pennsylvania


Linguistic databases are digital repositories of structured information
intended to document natural language and natural communicative
interaction.  Over the last decade, linguistic databases have come to stand
at the center of empirical research in the language sciences, and in the
development of new human language technologies.  Like genomic databases,
linguistic databases are complex, evolving and richly annotated
repositories, and pose interesting challenges for efficient representation,
indexing and query.  And like most scientific databases, linguistic
databases have made little use of standard database technology.

The goals of the workshop are to take stock of existing research in
linguistic databases, to identify the key problems, and to explore
applications of current database research to these problems.  More broadly,
the workshop will help define the research questions of a new "linguistic
database community" and initiate the ongoing interchange of relevant
problems and results between this community and the database community at
large.

The workshop will address a selection of the following topics:

MODELS:
* models for text databases, speech databases, multimodal databases,
  typological databases, geographical databases (language maps),
  and metadata repositories
* relational, object-oriented and semi-structured models for
  representing linguistic annotations
* representations for specific linguistic datatypes (e.g. databases of
  aligned parallel text)
* modelling temporal and (geo)spatial structure
* critical analysis of existing linguistic databases

LANGUAGES:
* query of multilayer annotations
* linguistic applications/extensions of XML query languages
* analysis of existing ad hoc query languages
* queries over temporal and (geo)spatial structure

OTHER TOPICS:
* database support (e.g. what standard database technology has proven
  worthwhile for linguistic databases?)
* appropriate indexing methods for linguistic strings and structures
* archiving and preservation
* metadata standards serving as finding aids for linguistic databases
* data provenance / data lineage
* annotation servers

Provisional Timetable

Call for papers:    posted in May
Extended abstracts: due in August
Final papers:       due in November

Website and Mailing List

Subsequent announcements will be posted to this list, and on the workshop
website: http://www.ldc.upenn.edu/annotation/database/

Steven Bird, Peter Buneman and Mark Liberman

-
Steven Bird     http://www.ldc.upenn.edu/sb/       sb at ldc.upenn.edu
Peter Buneman   http://www.cis.upenn.edu/~peter/   peter at cis.upenn.edu
Mark Liberman   http://www.ldc.upenn.edu/myl/      myl at unagi.cis.upenn.edu

---------------------------------------------------------------------------
LINGUIST List: Vol-12-949



More information about the LINGUIST mailing list