Corpora: Book: Parallel text processing

Jean Veronis Jean.Veronis at newsup.univ-mrs.fr
Fri Oct 6 07:14:39 UTC 2000


**** NEW BOOK *** NEW BOOK *** NEW BOOK *** NEW BOOK *** NEW BOOK ****



                        KLUWER ACADEMIC PUBLISHERS
                   TEXT, SPEECH AND LANGUAGE TECHNOLOGY
                               Volume 13
              Series editors: Nancy Ide and Jean Véronis




                            PARALLEL TEXT PROCESSING
                    Alignment and Use of Translation Corpora

                                    edited by
                                   JEAN VÉRONIS
                 Université de Provence, Aix-en-Provence, France


With the rising importance of multilingualism in language industries, 
brought about by global markets and world-wide information exchange, 
parallel corpora, i.e. corpora of texts accompanied by their translation, 
have become key resources in the development of natural language processing 
tools. The applications based upon parallel corpora are numerous and 
growing in number: multilingual lexicography and terminology, machine and 
human translation, cross-language information retrieval, language learning, 
etc.

The book's chapters have been commissioned from major figures in the field 
of parallel corpus building and exploitation, with the aim of showing the 
state of the art in parallel text alignment and use ten to fifteen years 
after the first parallel-text alignment techniques were developed. Within 
the book, the following broad themes are addressed: (i) techniques for the 
alignment of parallel texts at various levels such as sentence, clause, and 
word; (ii) the use of parallel texts in fields as diverse as translation, 
lexicography, and information retrieval; (iii) available corpus resources 
and the evaluation of alignment methods.

The book will be of interest to researchers and advanced students of 
computational linguistics, terminology, lexicography and translation, both 
in academia and industry.


Kluwer Academic Publishers, Dordrecht

      Hardbound, ISBN 0-7923-6546-1
      August 2000, 428 pp.
      NLG 300.00 / USD 160.00 / GBP 99.00

---------------------------------------------------------------------


                               CONTENTS AND CONTRIBUTORS


Foreword.

Terminological note.

Preface by M. Kay.

Contributors.

Introduction.

     1. From the Rosetta stone to the information society.
        A survey of parallel text processing; J. Véronis.

Alignment Methodology.

     2. Pattern recognition for mapping bitext correspondence;
        I.D. Melamed.

     3. Multilingual text alignment. Aligning three or more
        versions of a text; M. Simard.

     4. A comprehensive bilingual word alignment system.
        Application to disparate languages: Hebrew and English;
        Y. Choueka, E. S. Conley, I. Dagan

     5. A knowledge-lite approach to word alignment;
        L. Ahrenberg, M. Andersson, M. Merkel

     6. From sentences to words and clauses; S. Piperidis, H. Papageorgiou,
        S. Boutsis

     7. Bracketing and aligning words and constituents in parallel
        text using Stochastic Inversion Transduction Grammars; D. Wu.

     8. The translation network. A model for a fine-grained description
        of translations; D. Santos.

     9. Parallel text alignment using crosslingual information retrieval
        techniques; C. Fluhr, F. Bisson, F. Elfateb

     10. Parallel alignment of structured documents;
         L. Romary, P. Bonhomme.

Applications.

     11. A statistical view on bilingual lexicon extraction.
         From parallel corpora to non-parallel corpora; P. Fung.

     12. Terminology extraction from parallel technical texts; I. Blank.

     13. Term alignment in use. Machine-aided human translation;
         É. Gaussier, D. Hull, S. Ait-Mokhtar

     14. Automatic dictionary extraction for cross-language information
         retrieval; R.D. Brown, J. Carbonell, Y. Yang

     15. Parallel text in computer-assisted language learning;
         J. Nerbonne.

Resources and Evaluation.

     16. Japanese-English aligned bilingual corpora; H. Isahara,
         M. Haruno.

     17. Building a parallel corpus of English/Panjabi; S. Singh,
         T. McEnery, P. Baker

     18. Sharing of translation memory databases derived from aligned
         parallel text; A.K. Melby.

     19. Evaluation of parallel text alignment systems.
         The ARCADE project; J. Véronis, P. Langlais.

Index of terms.

Index of authors.

Index of languages and writing systems.

---------------------------------------------------------------------

                            PREVIOUS VOLUMES


    Volume 1:  Recent Advances in Parsing Technology
               Harry Bunt, Masaru Tomita (Eds.)
               Hardbound, ISBN 0-7923-4152-X, 1996

    Volume 2:  Corpus-Based Methods in Language and Speech Processing
               Steve Young, Gerrit Bloothooft (Eds.)
               Hardbound, ISBN 0-7923-4463-4, 1997

    Volume 3:  An introduction to text-to-speech synthesis
               Thierry Dutoit
               Hardbound, ISBN 0-7923-4498-7, 1997

    Volume 4:  Exploring textual data
               Ludovic Lebart, André Salem and Lisette Berry
               Hardbound, ISBN 0-7923-4840-0, December 1997

    Volume 5:  Time Map Phonology:
               Finite State Models and Event Logics in Speech
               Recognition
               Julie Carson-Berndsen
               Hardbound, ISBN 0-7923-4883-4, 1997

    Volume 6:  Predicative Forms in Natural Language and in
               Lexical Knowledge Bases
               Patrick Saint-Dizier (Ed.)
               Hardbound, ISBN 0-7923-5499-0, December 1998

    Volume 7:  Natural Language Information Retrieval
               Tomek Strzalkowski (Ed.)
               Hardbound, ISBN 0-7923-5685-3, April 1999

    Volume 8:  Techniques in Speech Acoustics
               Jonathan Harrington, Steve Cassidy
               Hardbound, ISBN 0-7923-5731-0, July 1999

    Volume 9:  Syntactic Wordclass Tagging
               Hans van Halteren (Ed.)
               Hardbound, ISBN 0-7923-5896-1, August 1999

    Volume 10: Breadth and Depth of Semantic Lexicons
               Viegas, E. (Ed.)
               Hardbound, ISBN 0-7923-6039-7, November 1999

    Volume 11: Natural Language Processing Using Very Large Corpora
               Armstrong, S., Church, K.W., Isabelle, P.,
               Manzi, S., Tzoukermann, E., Yarowsky, D. (Eds.)
               Hardbound, ISBN 0-7923-6055-9, November 1999

    Volume 12: Lexicon Development for Speech and Language Processing
               Frank van Eynde & Dafydd Gibbon (Eds.)
               Hardbound, ISBN 0-7923-6368-X, April 2000.


Check the series Web page for order information:

    http://www.wkap.nl/series.htm/TLTB



More information about the Corpora mailing list