Ressources: ELRA - Language Resources Catalogue - Update

Thierry Hamon thierry.hamon at LIPN.UNIV-PARIS13.FR
Tue Mar 31 15:52:41 UTC 2009

Date: Tue, 31 Mar 2009 16:07:50 +0200
From: info at
Message-ID: <49D223B6.4020200 at>

Our apologies if you have received multiple copies of this

ELRA - Language Resources Catalogue - Update

ELRA is happy to announce that 1 new Written Corpus is now available
in its catalogue:

**ELRA-W0049 "Le Monde Diplomatique" Arabic tagged corpus*
This corpus contains 102,960 vowelised, lemmatised and tagged words
(58 texts from Le Monde Diplomatique Arabic, see also ELRA-W0036-04).
To each text are associated 3 files: raw text in Arabic, vowelized
text in Arabic, one XML file containing the morphological annotation
of the text.  For more information,

For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at

Visit our On-line Catalogue:
Visit the Universal Catalogue:
Archives of ELRA Language Resources Catalogue Updates:

Message diffuse par la liste Langage Naturel <LN at>
Informations, abonnement :
English version       : 
Archives                 :

La liste LN est parrainee par l'ATALA (Association pour le Traitement
Automatique des Langues)
Information et adhesion  :

More information about the Ln mailing list