12.58, FYI: Portuguese-English Parallel Corpus: COMPARA

The LINGUIST Network linguist at linguistlist.org
Thu Jan 11 21:26:44 UTC 2001


LINGUIST List:  Vol-12-58. Thu Jan 11 2001. ISSN: 1068-4875.

Subject: 12.58, FYI: Portuguese-English Parallel Corpus: COMPARA

Moderators: Anthony Aristar, Wayne State U. <aristar at linguistlist.org>
            Helen Dry, Eastern Michigan U. <hdry at linguistlist.org>
            Andrew Carnie, U. of Arizona <carnie at linguistlist.org>

Reviews: Terence Langendoen, U. of Arizona <terry at linguistlist.org>
         Simin Karimi, U. of Arizona <simin at linguistlist.org>

Editors: Karen Milligan, Wayne State U. <karen at linguistlist.org>
         Michael Appleby, E. Michigan U. <michael at linguistlist.org>
         Lydia Grebenyova, E. Michigan U. <lydia at linguistlist.org>
         Jody Huellmantel, Wayne State U. <jody at linguistlist.org>
         Marie Klopfenstein, Wayne State U. <marie at linguistlist.org>
         Naomi Ogasawara, E. Michigan U. <naomi at linguistlist.org>
         James Yuells, Wayne State U. <james at linguistlist.org>
         Ljuba Veselinova, Stockholm U. <ljuba at linguistlist.org>

Software: John Remmers, E. Michigan U. <remmers at emunix.emich.edu>
          Gayathri Sriram, E. Michigan U. <gayatri at linguistlist.org>

Home Page:  http://linguistlist.org/

The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.


Editor for this issue: Lydia Grebenyova <lydia at linguistlist.org>

=================================Directory=================================

1)
Date:  Mon, 08 Jan 2001 16:22:59 +0100
From:  Diana Maria de Sousa Marques Pinto dos Santos <Diana.Santos at informatics.sintef.no>
Subject:  Portuguese-English Parallel Corpus - COMPARA

-------------------------------- Message 1 -------------------------------

We are pleased to announce that COMPARA, the Portuguese-English parallel
corpus, is now available at http://www.portugues.mct.pt/COMPARA/Welcome.html

COMPARA is open-ended, freely available on the Web, and designed both for
people who have never used corpora and for experienced corpus users.
COMPARA's alignment criteria let users investigate how translators reshape
discourse, such as when and where they have chosen to join, separate,
delete, add, or reorder sentences. Users can also inspect translators'
notes, and the corpus admits more than one translation per source text.

Only six parallel fiction texts have been fully processed so far, but
permission has been obtained to incorporate many more.

COMPARA is encoded according to the IMS Corpus Workbench system, developed
at the University of Stuttgart, and is distributed on the WWW via the
DISPARA interface, developed by the Computational Processing of Portuguese
project.

We would be grateful for any comments and suggestions on both COMPARA and
the DISPARA interface. We also welcome collaboration from anyone interested
in contributing to the COMPARA-DISPARA project.

Ana Frankenberg-Garcia & Diana Santos
compara at informatics.sintef.no


---------------------------------------------------------------------------
LINGUIST List: Vol-12-58


