16.258, Diss: Text/Corpus Ling: Zimina-Poirot: ' Quantitative..'

Wed Jan 26 19:40:32 UTC 2005

LINGUIST List: Vol-16-258. Wed Jan 26 2005. ISSN: 1068 - 4875.

Subject: 16.258, Diss: Text/Corpus Ling: Zimina-Poirot: ' Quantitative..'

Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
            Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>

Reviews (reviews at linguistlist.org) 
        Sheila Collberg, U of Arizona  
        Terry Langendoen, U of Arizona  

Homepage: http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.

Editor for this issue: Takako Matsui <tako at linguistlist.org>
================================================================  

To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.

===========================Directory==============================  

1)
Date: 26-Jan-2005
From: Maria Zimina-Poirot < zimina at msh-paris.fr >
Subject: Quantitative Approaches of Extracting Translation Resources from Parallel Corpora 

-------------------------Message 1 ---------------------------------- 
Date: Wed, 26 Jan 2005 14:33:26
From: Maria Zimina-Poirot < zimina at msh-paris.fr >
Subject: Quantitative Approaches of Extracting Translation Resources from Parallel Corpora 

Institution: University Paris 3, Sorbonne nouvelle 
Program: Doctorat en Sciences du langage 
Dissertation Status: Completed 
Degree Date: 2004 

Author: Maria Zimina-Poirot

Dissertation Title: Quantitative Approaches of Extracting Translation Resources 
from Parallel Corpora 

Dissertation URL:  http://www.univ-paris3.fr/recherche/sites/edll/student/stmz/index.htm

Linguistic Field(s): Text/Corpus Linguistics
                     Translation

Dissertation Director(s):
André Salem

Dissertation Abstract:

This research work presents the results of a series of experiments devoted
to the development of new tools for intertextual textometric exploration of
translation corpora. Various methods of textual statistics have been
adapted for use in a multilingual context and put into practice for
parallel text processing, such as: repeated segments extraction,
characteristic elements computation, bi-textual topography, multiple
co-occurrences, factorial analysis, automatic classification, etc. Examples
of concrete applications illustrate the use of each of these methods in a
multilingual context. These examples are accompanied by sample translation
resources obtained on quantitative bases from the parallel French/English
corpus of the Convention for the Protection of Human Rights. The suggested
approach opens up new horizons for automatic exploration of lexical
equivalences of translation corpora by a variety of users: translators,
foreign language teachers, terminologists, lexicographers, etc. 

-----------------------------------------------------------
LINGUIST List: Vol-16-258