Corpora: ACL-2001 Workshop on Data-driven Call for Papers

Priscilla Rasmussen rasmusse at cs.rutgers.edu
Tue Feb 20 23:29:09 UTC 2001


 	Call for papers
 	Workshop on Data-driven MT
 	ACL'2001 Conference
 	Toulouse, France
 	Invited speaker:  Hermann Ney, RWTH Aachen
 	Deadline for paper submissions:			April 6, 2001
 	Deadline for notification of paper acceptance: 	April 27, 2001
 	Deadline for camera-ready papers:   		 May 16, 2001
 	
 	Workshop Date:                         		July 7, 2001

 	Details on submissions listed below.

 	With the increased availability of online corpora, data-driven
 approaches have become central to the NL community.  A variety of
 data-driven approaches have been used to help build Machine Translation
 systems -- example-based, statistical  MT, and other machine learning
 approaches - and there are all sorts of possibilities for hybrid systems.
 We wish to bring together proponents of as many techniques as possible to
 engage in a discussion of which combinations will yield maximal success in
 translation.

 	We propose to center the workshop on Data Driven MT, by which we
 mean all approaches which develop algorithms and programs to exploit data
 in the development of MT, primarily the use of large bilingual corpora
 created by human translators, and serving as a source of training data for
 MT systems. We are specifically interested in papers about

 		*	statistical machine translation (modeling, training,
 search)
 		*	machine-learning in translation
 		*	example-based machine translation
 		*	acquisition of multilingual training data
 		*	evaluation of data driven methods (also with
 rule-based methods)
 		*	combination of various translation systems;
 integration of classical rule-based and data driven approaches
 		*	word/sentence alignment  methods

 	An especially important question that we wish to address is which
 techniques are best for each of the subparts of a complete MT system -
 e.g. learning grammars, building lexicons, parsing  input data,
 determining transfer principles, generating target text, etc.

 	We will strongly encourage papers on  systems which show
 demonstrable progress over previously chosen methods, and which have been
 integrated in an actual end-to-end system. Test results or demos will be
 given strongest preference for participation.

 	Organizers:
 	Jessie Pinkham, Microsoft Research jessiep at microsoft.com
 <mailto:jessiep at microsoft.com
 	http://research.microsoft.com/~jessiep/

 	Kevin Knight  USC/ISI; knight at isi.edu <mailto:knight at isi.edu
 	Web page http://www.isi.edu/~knight/

 	Franz Josef Och, RWTH Aachen; och at informatik.rwth-aachen.de
 	http://www-i6.informatik.rwth-aachen.de/Colleagues/och/

 	SUBMISSION FORMAT AND INSTRUCTIONS:
 	Electronic submissions only; send the postscript or pdf form of your
 submission to: Deborah Coughlin   deborahc at microsoft.com .

 	Submissions should follow the two-column format of ACL proceedings
 and should not exceed eight (8) pages, including references. We
 	strongly recommend the use of ACL LaTeX style files or Microsoft
 Word Style files tailored for this year's conference. They are
 	available from the ACL-2001 program committee Web-site at
 <http://acl2001.dfki.de/style/ .

 	As reviewing will be blind, a separate identification page must be
 sent by email. The identification page should include the paper title,
 	authors' names, affiliations, and email addresses, up to 5 keywords
 specifying the subject area, and a short summary (up to 5 lines).
 	The paper should not include the authors' names and affiliations.



More information about the Corpora mailing list