[Corpora-List] Training Corpus for Readability Difficulty

Benjamin Van Durme vandurme at cs.rochester.edu
Thu Oct 16 13:26:58 UTC 2008


Kevyn Collins-Thompson created such a corpus for English while at CMU,
using grade-level categorized material (1-12).  Unfortunately he was
unable to share it the last time I asked, as it was based on
exclusively licensed documents.

If something like this is out there and public, especially if it
differentiates between younger vs. older children, I would also like
to know about it.

ben

---
Benjamin Van Durme
University of Rochester

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list