Appel: HLT-NAACL 2003 : Building and Using Parallel Texts

alexis.nasr at LINGUIST.JUSSIEU.FR alexis.nasr at LINGUIST.JUSSIEU.FR
Mon Mar 31 16:59:37 UTC 2003


                   == Apologies for multiple postings ==

Notice the April 1 Deadline for late-breaking short papers submissions!

                  -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

                       FINAL CALL FOR LATE-BREAKING PAPERS


                      Building and Using Parallel Texts:
                   Data Driven Machine Translation and Beyond

                          An HLT-NAACL 2003 Workshop
                              Edmonton, Alberta
                                May 31, 2003

                        http://www.cs.unt.edu/~rada/wpt

                  -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-


Short (Late-Breaking) Papers at the HLT-NAACL Workshop on Building and
Using Parallel Text provide a venue for authors to present
late-breaking results. Submitted Short Papers will be carefully
evaluated on the basis of originality, significance, technical
soundness, and clarity of exposition.

Short papers are due on April 1, 2003 5pm your local time.

Short papers are restricted to 4 pages in length. They must be
submitted in camera-ready format; see
www.hlt-naacl03.org/format.html. Short papers that are not in PDF or
are incorrectly formatted may be rejected on that basis. Authors are
strongly encouraged to use the LaTeX style files or MSWord equivalents
available on the website.

Submissions must describe original, completed, unpublished work, and
include concrete evaluation results when appropriate. See full paper
submission information for topics of interest.

Reviewing will not be blind. Because we need camera-ready formatted
papers, authors must include their identifying information on the
paper.  This is to accommodate the late-breaking format; we need time
to review the papers and get the accepted papers to the printer in
time.

Note that notification date for acceptance and rejection is April 7,
with final camera ready copy due on April 10.


                  -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

                         GENERAL WORKSHOP INFORMATION

The goal of this workshop is to provide a forum for researchers
working on problems related to the creation and use of parallel
text. Recent events have demonstrated once again the importance of
inter-language communication, and reinforce the need for advances in
machine translation (MT) and multi-lingual processing tools.

The workshop will be centered around the problem of building and using
parallel corpora, which are vital resources for efficiently deriving
multi-lingual text processing tools. In addition to regular papers,
the workshop also includes a shared task that will result in a
comparative evaluation of word alignment techniques.

We invite submissions of papers addressing any of the following
issues:

- Construction of parallel corpora, including the automatic
identification and harvesting of parallel corpora from the Web.

- Methods to evaluate the quality of parallel corpora and word
alignments

- Tools for processing parallel corpora, including automatic sentence
alignment, word alignment, phrase alignment, detection of omissions
and gaps in translations, and others

- Using parallel corpora for data driven Machine Translation

- Using parallel corpora for the derivation of language processing
tools in new languages

- Using parallel corpora for automatic corpora annotation

- Language learning applied to parallel corpora

- Translation memory systems as a source of aligned corpora

While we invite submissions addressing any of the above topics, or
related issues, we particularly welcome work involving parallel
corpora addressing languages with scarce resources.

We expect to make arrangements with a journal in Natural Language
Processing or Computational Linguistics for a special issue that will
include selected papers from this workshop.

Invited Speaker:
-=-=-=-=-=-=-=-=

Elliot Macklovitch, University of Montreal

Shared Task:
-=-=-=-=-=-=

All researchers who have a word alignment system available are invited
to participate in the shared task, individually or as part of a team.

Participants in the shared task will be provided with common sets of
training data, consisting of Romanian-English and French-English
parallel texts. Participants will be given approximately one month to
train their systems with this data, and then previously held out test
data will be released. Participants will run their alignment system on
this test data and submit their results, which will be evaluated using
a common set of metrics. See the workshop website for details
regarding the shared task.


Submission format:
-=-=-=-=-=-=-=-=-=

Submissions should consist of late-breaking papers of max. 4 pages,
formatted following the NAACL 2003 guidelines. In addition, teams
participating in the word alignment shared task are invited to submit
short papers (max. 4 pages) describing their systems and/or evaluation
methodology.

Send your submission (a ps or pdf file), prepared for anonymous
review, to both:

Rada Mihalcea, University of North Texas, rada at cs.unt.edu
and
Ted Pedersen, University of Minnesota, Duluth, tpederse at d.umn.edu

Important dates:
-=-=-=-=-=-=-=-=

Deadline for regular paper submissions:         March 10 (passed)
Deadline for results submissions:               March 25 (shared
task, passed)
Deadline for late-breaking papers submissions:  April 1
Deadline for short paper submissions:           April 1 (shared
task)
Notification of acceptance - for regular papers:April 1
             - for late-breaking papers:        April 7
Deadline for camera-ready papers:               April 10

Organization Committee:
-=-=-=-=-=-=-=-=-=-=-=-

Rada Mihalcea, University of North Texas
Ted Pedersen, University of Minnesota, Duluth

Program Committee:
-=-=-=-=-=-=-=-=-=

Lars Ahrenberg, Linkoping University
Nicoletta Calzolari, University of Pisa
Tim Chklovski, Massachusetts Institute of Technology
Mona Diab, University of Maryland
Ulrich Germann, Information Sciences Institute
Daniel Gildea, University of Pennsylvania
Maria das Gracas Volpe Nunes, University of Sao Paulo
Nancy Ide, Vassar College
Lucia Helena Machado Rino, Federal University of Sao Carlos
Eduard Hovy, University of Southern California / Information
Sciences Institute
Philippe Langlais, University of Montreal
Elliot Macklovitch, University of Montreal
Daniel Marcu, University of Southern California / Information
Sciences Institute
Dan Melamed, New York University
Magnus Merkel, Linkoping University
Ruslan Mitkov, University of Wolverhampton
Grace Ngai, Hong Kong Polytechnic University
Hermann Ney, RWTH Aachen
Franz Och, Information Sciences Institute
Kemal Oflazer, Sabanci University
Kishore Papineni, IBM
Jessie Pinkham, Microsoft Research
Andrei Popescu-Belis, ISSCO/TIM/ETI University of Geneva
Florence Reeder, MITRE
Philip Resnik, University of Maryland
Antonio Ribeiro, Joint Research Centre, Ispra, Italy
Michel Simard, University of Montreal
Harold Somers, University of Manchester Institute of Science and
Technology
Arturo Trujillo, Canon Research Centre Europe
Jean Veronis, University of Provence
Clare Voss, Army Research Lab
Dan Tufis, RACAI Romania
Yorick Wilks, University of Sheffield
-------------------------------------------------------------------------
Message diffusé par la liste Langage Naturel <LN at cines.fr>
Informations, abonnement : http://www.biomath.jussieu.fr/LN/LN-F/
English version          : http://www.biomath.jussieu.fr/LN/LN/
Archives                 : http://listserv.linguistlist.org/archives/ln.html

La liste LN est parrainée par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhésion  : http://www.atala.org/
-------------------------------------------------------------------------



More information about the Ln mailing list