<html>

<body>

Dear all:<br />

<br />

I am pleased to announce several computational resources for the Galician language, developed at the Centro Ramón Piñeiro para a Investigación en Humanidades (http://www.cirp.es).<br />

<br />

First, an updated version of CORGA (Reference Corpus of Present-day Galician Language). This latest version (1.4) reaches 25 million words and includes two new features:<br />

<br />

- Queries can now be limited to specific newspaper sections.<br />

<br />

- A word frequency list is available for download.<br />

<br />

<br />

It is available at the usual location:<br />

<br />

http://corpus.cirp.es/corga<br />

<br />

<br />

Second, an online tagger/lemmatizer for the Galician language (XIADA), which has a low error rate and can be used to tag sentences.<br />

<br />

It is available at:<br />

<br />

http://corpus.cirp.es/xiada<br />

<br />

The generic lexicon used by the tagger can be downloaded from the same URL. It consists of 724,000 entries, each including a tag, lemma, normative indication and source.<br />

<br />

Both this lexicon and the frequency list are distributed under the Lesser General Public License For Linguistic Resources (LGPLLR). See http://sanskrit.inria.fr/DATA/LGPLLR.html for details.<br />

<br />

Finally, a 250,000-form subcorpus (300,000 grammatical elements), automatically tagged by XIADA and manually revised. It is published online through a new search system that allows querying by form, tag and/or lemma.<br />

<br />

This new search system is available at:<br />

<br />

http://corpus.cirp.es/corgaetq<br />

<br />

Our long-term aim is to offer all CORGA texts through this system, which should substantially improve both queries and results.<br />

<br />

Regards,<br />

<br />

Fco. Mario Barcala Rodríguez<br />

Computing manager of the CORGA project

</body>

</html>