[Corpora-List] Call for Participation: Workshop on Using Linguistic Information for Hybrid Machine Translation

Gorka Labaka gorka.labaka at ehu.es
Wed Nov 9 13:32:58 UTC 2011


==============================================================

  

                                               CALL FOR PARTICIPATION

  

Workshop on Using Linguistic Information for Hybrid Machine Translation

  

Friday, November 18, 2011. Barcelona, Spain.

http://ixa2.si.ehu.es/lihmt2011  

==============================================================

  

Background

=========

Following on the OpenMT Workshop on Mixing Approaches to Machine Translation in 2008 (http://ixa2.si.ehu.es/matmt-2008  <http://ixa.si.ehu.es/mammt-2008>), the aim of this OpenMT-2 Workshop on Using Linguistic Information for Hybrid Machine Translation (HMT) is to promote corpus-based methods and technologies that combine resources and algorithms from the three general approaches to MT: rule-based (RBMT), example-based (EBMT) and statistical (SMT).

  

Corpus-driven Machine Translation (EBMT and SMT) is nowadays prevalent within the MT research community and translation results obtained using these approaches have now reached a reasonably useful level of quality, especially for assimilation and when the target language is English. But such data-driven MT systems base their knowledge on aligned bilingual corpora, and the accuracy of their output depends heavily on the quality and size of that corpora and the quality of the alignment. Large and reliable bilingual corpora are unavailable for many language pairs. In addition, translating into morphologically rich target languages makes the training of data-driven systems rather more difficult.

  

Workshop Programme

================

The central issues to be addressed during the one-day workshop include: methods and techniques for integrating structural information (syntactic and semantic) into HMT, methods and techniques for handling morphologically rich languages (e.g. Basque) within HMT, alternative approaches to automatic MT evaluation which rely on linguistic criteria.

  

The programme will include three invited plenary talks, each addressing one of the central issues above, and the presentation of nine refereed contributions on related topics. The invited speakers include:

·         Ondřej Bojar (Charles University, Czech Republic)

       /Rich Morphology and What We Can Expect from Hybrid Approaches to MT,/

·         Alon Lavie (Carnegie Mellon University, USA)

       /Statistical MT with Syntax and Morphology: Challenges and Some Solutions/,//

·         Lucia Specia (University of Wolverhampton, UK)

       /Linguistic Indicators for Quality Estimation of Machine Translation/.

  

On Saturday, November 19, the Shared Task on Applying Machine Learning

Techniques to Optimise the Division of Labour in Hybrid Machine 
Translation (ML4HMT 2011) will take place at the Barcelona Media, Av. 
Diagonal, 177.

Programme committee

================

  

Co-Chair: David Farwell (Technical University of Catalonia, TALP, Barcelona)

Co-Chair: Gorka Labaka (University of the Basque Country, Donostia)

  

Iñaki Alegria (University of the Basque Country, Donostia)

Ondřej Bojar (Charles University, Czech Republic)

Arantza Díaz de Ilarraza (University of the Basque Country, Donostia)

Chris Dyer (Carnegie Mellon University, US)

Cristina España (Technical University of Catalonia, TALP, Barcelona)

Marcello Federico (Fondazione Bruno Kessler, Italy)

Mikel Forcada (University of Alacant, Alicante)

Adrià de Gispert (University of Cambridge, UK)

Kevin Knight (Information Sciences Institute, US)

Phillip Koehn (University of Edinburgh, UK)

José Mariño (Technical University of Catalonia, TALP, Barcelona)

Lluís Màrquez (Technical University of Catalonia, TALP, Barcelona)

Hermann Ney (RWTH-Aachen, Germany)

Daniele Pighin (Technical University of Catalonia, TALP, Barcelona)

Aarne Ranta (Chalmers University of Technology, Gothenburg, Sweden)

Marta R. Costa-jussà (Barcelona Media, Spain)

Felipe Sánchez-Martínez (University of Alacant, Alicante)

Kepa Sarasola (University of the Basque Country, Donostia)

Lucia Specia (University of Wolverhampton, UK)

Dekai Wu (Hong Kong University of Science and Technology, China)

  

Local organization

==============

Centre for Speech and Language Applications and Technologies (TALP), Technical University of Catalonia (UPC).

Committee members: David Farwell (Chair), Cristina España, Meritxell Gonzàlez, Lluís Màrquez, Daniele Pighin.

  

  

About the OpenMT-2 Project

=====================

  

The main innovative points of the OpenMT-2 project are:

·         The design of hybrid systems combining traditional linguistic rules, example-based methods and statistical methods.

·         The development of MT evaluation methods based on linguistically motivated metrics.

·         The implementation of Open Source Systems.

·         The use of advanced syntactic and semantic processing in MT.

  

For further details, see the OpenMT-2 website:http://ixa.si.ehu.es/openmt2

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20111109/da512ff2/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list