[Corpora-List] New Portuguese-English corpus available

Diana Santos Diana.Santos at sintef.no
Mon Nov 9 12:06:16 UTC 2009


We are pleased to announce that CorTrad, a subproject of the COMET project, is now available for search on the Web at http://www.fflch.usp.br/dlm/comet/consulta_cortrad.html

Making CorTrad available on the Web is a joint USP (http://www.usp.br/internacional/home.php?&idioma=en), NILC (http://www.nilc.icmc.usp.br/nilc/) and Linguateca (http://www.linguateca.pt) project, using the DISPARA system.

CorTrad features two special properties:
it is multiversion (with several versions of a translated text)
it has specific search capabilities relative to the structure of the specific texts

Currently it comprises three subcorpora:
- technical text, with a Brazilian cookbook translated into English
- scientific magazine, with Brazilian short research news translated into English
- Australian short stories translated into Portuguese

The corpora are annotated with PALAVRAS for Portuguese and with CLAWS for English.

We welcome comments and feedback!

The CorTrad team
Diana Santos, Elisa D. Teixeira, Sandra Aluísio and Stella E.O.Tagnin
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list