[Corpora-List] Tagged corpus for Galician language

Fco. Mario Barcala Rodríguez mario.barcala at mundo-r.com
Fri Nov 6 08:50:31 UTC 2009


Dear all:

I am pleased to announce a new linguistic resource developed at Centro Ramón Piñeiro para a Investigación en Humanidades (http://www.cirp.es).

It is a tagged corpus for Galician language revised by hand which include more than 300.000 gramatical elements extracted from texts of newspapers and journals. So, it is suitable to be used to train different statistical linguistic tools.

You can find more information and a link to download it at:

http://corpus.cirp.es/xiada

(Descargas/Corpus de adestramento section).

It is released under the LGPLLR license (see COPYING file of the package for details).

Regards,

--
Fco. Mario Barcala Rodríguez
Computing manager of CORGA project

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list