[Corpora-List] Automatically checking a treebank for errors

DJamé Seddah djame.seddah at free.fr
Fri Jun 17 20:36:44 UTC 2011


Dear Kevin,

you may have a look to the work done on the French Treebank by Natalie Schuter and Josef van Genabith
on restructing and correcting a treebank for French (http://www.itu.dk/people/nael/Publications.html, NLP section ).


Best,

Djamé




Le 17 juin 2011 à 21:47, Kevin B. Cohen a écrit :

> Does anyone know of any tricks for automatically checking a Penn
> Treebank-style corpus for obvious errors?  I've done some simple stuff
> in the past for checking POS tags, like looking for punctuation marks
> with non-punctuation tags, which turned out to be really fruitful, but
> I can't think of anything clever to do for the syntactic structures.
> 
> Kev
> 
> -- 
> Kevin Bretonnel Cohen, PhD
> Biomedical Text Mining Group Lead, Computational Bioscience Program,
> U. Colorado School of Medicine
> 303-916-2417 (cell) 303-377-9194 (home)
> http://compbio.ucdenver.edu/Hunter_lab/Cohen
> 
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list