[Corpora-List] Tagged corpus for Galician language
Fco. Mario Barcala Rodríguez
mario.barcala at mundo-r.com
Fri Nov 6 08:50:31 UTC 2009
Dear all:
I am pleased to announce a new linguistic resource developed at the Centro Ramón Piñeiro para a Investigación en Humanidades (http://www.cirp.es).
It is a hand-revised tagged corpus of the Galician language that includes more than 300,000 grammatical elements extracted from newspaper and journal texts. It is therefore suitable for training statistical linguistic tools.
You can find more information and a link to download it at:
http://corpus.cirp.es/xiada
(Descargas/Corpus de adestramento section).
It is released under the LGPLLR license (see the COPYING file in the package for details).
Regards,
--
Fco. Mario Barcala Rodríguez
Computing manager of CORGA project
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora