[Corpora-List] Extended Deadline (3 March): Language Technology for Cultural Heritage Data (LaTeCH-08)

Caroline Sporleder csporled at CoLi.Uni-SB.DE
Mon Feb 18 17:34:42 UTC 2008


Due to several requests we have extended the deadline for LaTeCH-08 and
we now also invite short papers and demos.

*********************************************************

                       CALL FOR PAPERS

 ***DEADLINE EXTENSION AND INVITATION FOR SHORT PAPERS/DEMOS***

                    LREC 2008 Workshop on

       Language Technology for Cultural Heritage Data
                         (LaTeCH 2008)

                         Special Theme:
          "Resources and Tools for Studying Language
                     Variety and Change"

               1 June, 2008, Marrakech, Morocco

                 http://ilk.uvt.nl/latech08/

       *Extended* submission deadline: *3 March 2008*


*********************************************************

The Second Workshop on Language Technology for Cultural Heritage Data
(LaTeCH 2008) will be held in conjunction with LREC 2008, and will
take place on June 1 in Marrakech, Morocco.

* SCOPE AND TOPICS

Museums, archives, and libraries around the world maintain large
collections of cultural and scientific heritage objects, such as
archaeological artefacts, audio and video recordings, or manuscripts,
archival documents and other written sources. Such collections are a
potentially very valuable resource for specialists and laypersons
alike, provided they can be easily accessed and automatically
processed. Furthermore, textual cultural heritage resources, such as
old manuscripts and early printed books, are not only interesting for
their information content, but are also an invaluable source for
linguistic research on diachronic and synchronic language variety and
change. While several large scale digitisation projects are currently
underway to make cultural heritage resources more accessible, it is
equally important to develop powerful tools to search, link, enrich,
and mine the digitised data. Language technology has a crucial role to
play in this, even for collections which are primarily non-textual,
since text is the pervasive medium used for meta-data. At the same
time, the cultural heritage domain poses special challenges for the
NLP community, including the use of historic or non-standard language,
the presence of OCR or transcription errors in the data, and the
necessity to deal with data from various media.

For LaTeCH 2008, we invite papers on language technology for cultural
heritage data in general and on the special theme of "Studying
Language Variety and Change". Topics of interest include, but are not
limited to, the following:

    - enriching cultural heritage data by inducing meta-data
    - dealing with linguistic variation and non-standard or
      non-contemporary use of language
    - automatic error detection and cleaning
    - adapting existing NLP tools for the cultural heritage domain
    - linking and retrieving information from different sources, media,
      and languages
    - representing cultural heritage data to different audiences
      (personalisation, text simplification, text summarisation, text
      generation from databases, hypertext generation)
    - knowledge discovery in cultural heritage data
    - complex annotation tools
    - determination of word and sentence boundaries within manuscripts
    - resources for and treatment of dialects (general solutions)
    - annotations of language variety on the orthographic, morphological,
      and syntactic level
    - global language resource management systems
    - repositories of cultural and scientific heritage data


* SUBMISSIONS

Authors are invited to submit papers on original, unpublished
work in the topic area of this workshop. In addition to long papers
presenting completed work, we also invite short papers and demos:

 - Long papers should present completed work and should not exceed 8
   pages.

 - Short papers/demos can present work in progress or the description
   of a system, and should not exceed 5 pages.

All submissions should be formatted using the LREC 2007 stylefiles,
which are available at:

http://www.lrec-conf.org/lrec2008/Download-Templates.html

The reviewing process will be blind; the papers should not
include the authors' names and affiliations or any references to web
sites, project names etc. revealing the authors' identity. Each
submission will be reviewed by at least two members of the program
committee. Accepted papers will be published in the workshop proceedings.

Papers should be submitted electronically, no later than March 3,
2008 via the following website:
https://www.softconf.com/LREC2008/LaTeCH2008/submit.html

The only accepted format for submitted papers is Adobe PDF.


* IMPORTANT DATES

3  March  - Extended deadline for workshop papers
28 March  - Notification of acceptance
 9 April  - Camera-ready papers due
 1 June   - Workshop held at LREC 2008


* PROGRAM COMMITTEE

Ion Androutsopoulos, Athens University of Economics and Business, Greece
Timothy Baldwin, University of Melbourne, Australia
David Bamman, Perseus, USA
David Birnbaum, University of Pittsburgh, USA
Antal van den Bosch, Tilburg University, The Netherlands
Andrea Bozzi, ILC-CNR, Pisa, Italy
Kate Byrne, University of Edinburgh, Scotland
Paul Clough, Sheffield University, UK
Greg Crane, Perseus, USA
Vania Dimitrova, University of Leeds, UK
Milena P. Dobreva, HATII, University of Glasgow, Scotland
Mick O'Donnell, Universidad Autonoma de Madrid, Spain
Matthew James Driscoll, Kobenhavns Universitet, Denmark
Franciska de Jong, University of Twente, The Netherlands
Claire Grover, University of Edinburgh, Scotland
Ben Hachey, University of Edinburgh, Scotland
Djoerd Hiemstra, University of Twente, The Netherlands
Dolores Iorizzo, Imperial College London, UK
Christer Johansson, University of Bergen, Norway
Jaap Kamps, University of Amsterdam, The Netherlands
Vangelis Karkaletsis, NCSR "Demokritos", Greece
Piroska Lendvai, Tilburg University, The Netherlands
Anke Luedeling, Humboldt-Universitaet, Germany
Roland Meyer, University of Regensburg, Germany
Maria Milosavljevic, University of Edinburgh, Scotland
Marie-Francine Moens, Katholieke Universiteit Leuven, Belgium
John Nerbonne, Rijksuniversiteit Groningen, The Netherlands
Marco Passarotti, Universita Cattolica del Sacro Cuore, Italy
Martin Reynaert, Tilburg University, The Netherlands
Kiril Ribarov, Charles University, Czech Republic
Maarten de Rijke, University of Amsterdam, The Netherlands
Peter Robinson, ITSEE, UK
Maria Simi, University of Pisa, Italy
Caroline Sporleder, Saarland University, Germany


* ORGANISING COMMITTEE

Caroline Sporleder (Co-Chair), Saarland University, Germany
Kiril Ribarov (Co-Chair), Charles University, Czech Republic
Antal van den Bosch, Tilburg University, The Netherlands
Milena P. Dobreva, HATII, University of Glasgow, Scotland
Matthew James Driscoll, Kobenhavns Universitet, Denmark
Claire Grover, University of Edinburgh, Scotland
Piroska Lendvai, Tilburg University, The Netherlands
Anke Luedeling, Humboldt-Universitaet, Germany
Marco Passarotti, Universita Cattolica del Sacro Cuore, Italy


* FURTHER INFORMATION

Workshop web page:      http://ilk.uvt.nl/latech08/

LREC 2008 web page:     http://www.lrec-conf.org/lrec2008/



* CONTACT INFORMATION

Caroline Sporleder, csporled (at) coli.uni-sb.de
Kiril Ribarov, ribarov (at) ufal.mff.cuni.cz


-- 
---------------------------------------------------------
Caroline Sporleder
Computational Linguistics, Saarland University
csporled at coli.uni-sb.de
http://www.coli.uni-saarland.de/~csporled
---------------------------------------------------------

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list