[Corpora-List] Training Corpus for Readability Difficulty
Benjamin Van Durme
vandurme at cs.rochester.edu
Thu Oct 16 13:26:58 UTC 2008
Kevyn Collins-Thompson created such a corpus for English while at CMU,
using grade-level categorized material (1-12). Unfortunately he was
unable to share it the last time I asked, as it was based on
exclusively licensed documents.
If something like this is out there and public, especially if it
differentiates between younger vs. older children, I would also like
to know about it.
ben
---
Benjamin Van Durme
University of Rochester
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list