Corpora: Book: Parallel text processing
Jean Veronis
Jean.Veronis at newsup.univ-mrs.fr
Fri Oct 6 07:14:39 UTC 2000
**** NEW BOOK *** NEW BOOK *** NEW BOOK *** NEW BOOK *** NEW BOOK ****
KLUWER ACADEMIC PUBLISHERS
TEXT, SPEECH AND LANGUAGE TECHNOLOGY
Volume 13
Series editors: Nancy Ide and Jean Véronis
PARALLEL TEXT PROCESSING
Alignment and Use of Translation Corpora
edited by
JEAN VÉRONIS
Université de Provence, Aix-en-Provence, France
With the rising importance of multilingualism in language industries,
brought about by global markets and world-wide information exchange,
parallel corpora, i.e. corpora of texts accompanied by their translation,
have become key resources in the development of natural language processing
tools. The applications based upon parallel corpora are numerous and
growing in number: multilingual lexicography and terminology, machine and
human translation, cross-language information retrieval, language learning,
etc.
The book's chapters have been commissioned from major figures in the field
of parallel corpus building and exploitation, with the aim of showing the
state of the art in parallel text alignment and use ten to fifteen years
after the first parallel-text alignment techniques were developed. Within
the book, the following broad themes are addressed: (i) techniques for the
alignment of parallel texts at various levels such as sentence, clause, and
word; (ii) the use of parallel texts in fields as diverse as translation,
lexicography, and information retrieval; (iii) available corpus resources
and the evaluation of alignment methods.
The book will be of interest to researchers and advanced students of
computational linguistics, terminology, lexicography and translation, both
in academia and industry.
Kluwer Academic Publishers, Dordrecht
Hardbound, ISBN 0-7923-6546-1
August 2000, 428 pp.
NLG 300.00 / USD 160.00 / GBP 99.00
---------------------------------------------------------------------
CONTENTS AND CONTRIBUTORS
Foreword.
Terminological note.
Preface by M. Kay.
Contributors.
Introduction.
1. From the Rosetta stone to the information society.
A survey of parallel text processing; J. Véronis.
Alignment Methodology.
2. Pattern recognition for mapping bitext correspondence;
I.D. Melamed.
3. Multilingual text alignment. Aligning three or more
versions of a text; M. Simard.
4. A comprehensive bilingual word alignment system.
Application to disparate languages: Hebrew and English;
Y. Choueka, E. S. Conley, I. Dagan
5. A knowledge-lite approach to word alignment;
L. Ahrenberg, M. Andersson, M. Merkel
6. From sentences to words and clauses; S. Piperidis, H. Papageorgiou,
S. Boutsis
7. Bracketing and aligning words and constituents in parallel
text using Stochastic Inversion Transduction Grammars; D. Wu.
8. The translation network. A model for a fine-grained description
of translations; D. Santos.
9. Parallel text alignment using crosslingual information retrieval
techniques; C. Fluhr, F. Bisson, F. Elfateb
10. Parallel alignment of structured documents;
L. Romary, P. Bonhomme.
Applications.
11. A statistical view on bilingual lexicon extraction.
From parallel corpora to non-parallel corpora; P. Fung.
12. Terminology extraction from parallel technical texts; I. Blank.
13. Term alignment in use. Machine-aided human translation;
É. Gaussier, D. Hull, S. Ait-Mokhtar
14. Automatic dictionary extraction for cross-language information
retrieval; R.D. Brown, J. Carbonell, Y. Yang
15. Parallel text in computer-assisted language learning;
J. Nerbonne.
Resources and Evaluation.
16. Japanese-English aligned bilingual corpora; H. Isahara,
M. Haruno.
17. Building a parallel corpus of English/Panjabi; S. Singh,
T. McEnery, P. Baker
18. Sharing of translation memory databases derived from aligned
parallel text; A.K. Melby.
19. Evaluation of parallel text alignment systems.
The ARCADE project; J. Véronis, P. Langlais.
Index of terms.
Index of authors.
Index of languages and writing systems.
---------------------------------------------------------------------
PREVIOUS VOLUMES
Volume 1: Recent Advances in Parsing Technology
Harry Bunt, Masaru Tomita (Eds.)
Hardbound, ISBN 0-7923-4152-X, 1996
Volume 2: Corpus-Based Methods in Language and Speech Processing
Steve Young, Gerrit Bloothooft (Eds.)
Hardbound, ISBN 0-7923-4463-4, 1997
Volume 3: An introduction to text-to-speech synthesis
Thierry Dutoit
Hardbound, ISBN 0-7923-4498-7, 1997
Volume 4: Exploring textual data
Ludovic Lebart, André Salem and Lisette Berry
Hardbound, ISBN 0-7923-4840-0, December 1997
Volume 5: Time Map Phonology:
Finite State Models and Event Logics in Speech
Recognition
Julie Carson-Berndsen
Hardbound, ISBN 0-7923-4883-4, 1997
Volume 6: Predicative Forms in Natural Language and in
Lexical Knowledge Bases
Patrick Saint-Dizier (Ed.)
Hardbound, ISBN 0-7923-5499-0, December 1998
Volume 7: Natural Language Information Retrieval
Tomek Strzalkowski (Ed.)
Hardbound, ISBN 0-7923-5685-3, April 1999
Volume 8: Techniques in Speech Acoustics
Jonathan Harrington, Steve Cassidy
Hardbound, ISBN 0-7923-5731-0, July 1999
Volume 9: Syntactic Wordclass Tagging
Hans van Halteren (Ed.)
Hardbound, ISBN 0-7923-5896-1, August 1999
Volume 10: Breadth and Depth of Semantic Lexicons
Viegas, E. (Ed.)
Hardbound, ISBN 0-7923-6039-7, November 1999
Volume 11: Natural Language Processing Using Very Large Corpora
Armstrong, S., Church, K.W., Isabelle, P.,
Manzi, S., Tzoukermann, E., Yarowsky, D. (Eds.)
Hardbound, ISBN 0-7923-6055-9, November 1999
Volume 12: Lexicon Development for Speech and Language Processing
Frank van Eynde & Dafydd Gibbon (Eds.)
Hardbound, ISBN 0-7923-6368-X, April 2000.
Check the series Web page for order information:
http://www.wkap.nl/series.htm/TLTB
More information about the Corpora
mailing list