[Corpora-List] Portuguese Morpholympics data finally released

Santos Diana Diana.Santos at sintef.no
Tue Dec 9 13:42:19 UTC 2003


Portuguese Morpholympics data released
======================================

Linguateca is pleased to announce that data, results and programs of the
first evaluation contest for Portuguese, Morfolimpíadas, are now available
for download at Linguateca's site. www.linguateca.pt -> Avaliação conjunta
-> Morfolimpíadas (and also as a tar file).
  
The 1st Portuguese Morpholympics's last round took place at Avalon'2003 the
28th June 2003, at Faro, Universidade do Algarve. We have finally managed to
create a distribution with everything in place.

The winner of the contest was Eckhard Bick's PALMORF. Seven other systems,
from Portugal and Brazil, participated in the evaluation contest and kindly
gave us the right to distribute the material.

We distribute the golden list used, the input texts, the output of every
system (after anonymization) and the programs used to compute the results.
We also provide extensive documentation (in Portuguese), including the
actual results made available already in June.

We believe that a corpus of differently tokenized and morhologically
analysed running Portuguese text is interesting for further research in
Portuguese morphology, tokenization and to improve evaluation setups.

URLs:
www.linguateca.pt/Morfolimpiadas/
www.linguateca.pt/avalon2003/

For the organizing committee (Luis Costa, Paulo Rocha and myself)
Diana Santos
====================================
Diana Santos, Diana.Santos at sintef.no
Linguateca, http://www.linguateca.pt
Oslo node: SINTEF Telecom & Informatics
Pb 124 Blindern, N-0314 Oslo Norway



More information about the Corpora mailing list