Resources: ELRA - Language Resources Catalogue - Update

Thierry Hamon thierry.hamon at UNIV-PARIS13.FR
Sat Jun 13 19:19:37 UTC 2009

Date: Thu, 11 Jun 2009 11:37:09 +0200
From: info at
Message-ID: <4A30D045.8060003 at>

Our apologies if you have received multiple copies of this announcement.

ELRA - Language Resources Catalogue - Update

ELRA is happy to announce that 1 new Written Corpus is now available
in its catalogue:

ELRA-W0050 The CINTIL Corpus - International Corpus of Portuguese
CINTIL-Corpus Internacional do Português is a linguistically
interpreted written and spoken corpus of European Portuguese. It is
composed of one million annotated tokens, each of which has been verified
by human expert annotators. The annotation comprises information on
part-of-speech, open class lemma and inflection, multi-word
expressions pertaining to the class of adverbs and to the closed POS
classes, and multi-word proper names (for named entity
recognition). The corpus was built from raw textual materials of
several types, of which 30% are spoken materials.
For more information, see:

For more information on the catalogue, please contact Valérie Mapelli
mapelli at

Visit our On-line Catalogue:
Visit the Universal Catalogue:
Archives of ELRA Language Resources Catalogue Updates:

Message distributed by the Langage Naturel list <LN at>
Information, subscription:
English version       : 
Archives                 :

The LN list is sponsored by ATALA (Association pour le Traitement
Automatique des Langues)
Information and membership:
