[Corpora-List] French corpora for POS tagger evaluation

Djamé Seddah djame.seddah at free.fr
Thu Feb 21 11:06:53 UTC 2013


On 15 Feb 2013, at 10:28, Olivier Austina wrote:

> Hello,
> 
> I am looking for a standard French corpus for POS tagger evaluation. Where
> can I download the corpus, please? Thanks.
> 
> -- 
> Regards
> Austina

Hi,
you can also get the Sequoia Treebank (a freely available treebank for French, with a French Treebank-based annotation scheme).

https://www.rocq.inria.fr/alpage-wiki/tiki-index.php?page=CorpusSequoia

Also, if you're interested in comparing with state-of-the-art POS tagging of French,
I'd suggest you sign a license for the French Treebank (http://www.llf.cnrs.fr/Gens/Abeille/French-Treebank-fr.php) and
contact Marie Candito (marie.candito at linguist.jussieu.fr) to get the data set that has been used the most in the literature.

(see http://aclweb.org/aclwiki/index.php?title=POS_Tagging_(State_of_the_art) )

Best,
Djamé

