[Corpora-List] CFP: Workshop on "Strategies for developing machine translation for minority languages"

Briony Williams b.williams at bangor.ac.uk
Mon Jan 9 14:57:37 UTC 2006


                  FIRST CALL FOR PAPERS

      "Strategies for developing machine translation for minority languages"
      ======================================================================

            5th SALTMIL Workshop on Minority Languages
                 on Tuesday May 23rd 2006 (morning)
            Magazzini del Cotone Conference Centre, Genoa, Italy

    Organised in conjunction with LREC 2006: Fifth International Conference on
      Language Resources and Evaluation, Genoa, Italy, 24-26 May 2006

Background
==========

This workshop continues the series of LREC workshops organized by SALTMIL 
(SALTMIL is the ISCA Special Interest Group for Speech And Language 
Technology for Minority Languages:  http://isl.ntf.uni-lj.si/SALTMIL/ ):

The minority or "less resourced" languages of the world are under increasing 
pressure from the major languages (especially English), and many of them lack 
full political recognition.  Some minority languages have been well 
researched linguistically, but most have not, and the vast majority do not 
yet possess basic speech and language resources (such as text and speech 
corpora, lexicons, POS taggers, etc) which would enable the commercial 
development of products.

The workshop aims to share information on tools and best practice, so that 
isolated researchers will not need to start from nothing.  An important 
aspect will be the forming of personal contacts, which can minimise 
duplication of effort.  There will be a balance between presentations of 
existing language resources, and more general presentations designed to give 
background information needed by all researchers present.

Format
======

The workshop will begin with the following presentations from invited speakers:

  * Delyth Prys (University of Wales, Bangor):  "The BLARK matrix and its 
relation to the language resources situation for the Celtic languages."
  * Hermann Ney (Rheinisch-Westfälische Technische Hochschule, Aachen, 
Germany):  "Statistical Machine Translation with and without a bilingual 
training corpus"
  * Mikel Forcada (Universitat d’Alacant, Spain):  "Open source machine 
translation: an opportunity for minor languages"
  * Lori Levin (Carnegie Mellon University, USA):  "Omnivorous MT: Using 
whatever resources are available."
  * Anna Sågvall Hein (University of Uppsala, Sweden):  "Approaching new 
languages in machine translation."

These talks will then be followed by a poster session featuring contributed 
papers.

Papers
======

Papers are invited that describe research and development in the following areas:

  * The BLARK (Basic Language Resource Kit) matrix at ELDA, and how it 
relates to minority languages.
  * The advantages and disadvantages of different corpus-based strategies for 
developing MT, with reference to a) speed of development, and b) level of 
researcher expertise required.
  * What open-source or free language resources are available for developing MT?
  * Existing resources for minority languages, with particular emphasis on 
software tools that have been found useful.

All contributed papers will be presented in poster format. All contributions 
will be printed in the workshop proceedings (CD). They will also be published 
on the SALTMIL website.

Timetable:
=========

  * Paper submission deadline:   Feb 17, 2006
  * Notification of acceptance:  March 10, 2006
  * Final version of paper:      April 10, 2006
  * Workshop:                    May 23, 2006 (morning)

Submissions:
===========

Abstracts should be in English, and up to 4 pages long. The submission format 
is PDF.

Papers will be reviewed by members of the organising committee. The reviews 
are not anonymous.

Accepted papers may be up to 6 pages long. The final papers should be in the 
format specified for the proceedings by the LREC organisers.

Each submission should include: title; author(s); affiliation(s); and contact 
author's e-mail address, postal address, telephone and fax numbers.

Abstracts should be sent via e-mail to Briony Williams at b.williams @ 
bangor.ac.uk.  The deadline for submission is February 17th.

Organising committee
====================

  * Briony Williams (University of Wales, Bangor, UK:  b.williams @ bangor.ac.uk)
  * Kepa Sarasola (University of the Basque Country: ksarasola @ si.ehu.es)
  * Bojan Petek (University of Ljubljana, Slovenia: bojan.petek @ uni-lj.si)
  * Julie Berndsen (University College Dublin, Ireland: julie.berndsen @ ucd.ie)
  * Atelach Alemu Argaw (University of Stockholm, Sweden: atelach @ dsv.su.se)



More information about the Corpora mailing list