[Corpora-List] CCGbank

Julia Hockenmaier juliahr at cis.upenn.edu
Wed Jun 8 18:18:55 UTC 2005


CCGbank is now available from the LDC:
http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13

CCGbank is a translation of the Penn Treebank into a corpus of
Combinatory Categorial Grammar derivations.  It pairs syntactic
derivations with sets of word-word dependencies which approximate the
underlying predicate-argument structure.  CCGbank contains 99.44% of
the sentences in the Penn Treebank, for which it corrects a number of
inconsistencies and errors in the original annotation.
CCGbank can also be searched with Douglas Rohde's TGrep2, version 1.15 or higher.


Julia Hockenmaier and Mark Steedman
juliahr at cis.upenn.edu, steedman at inf.ed.ac.uk

http://groups.inf.ed.ac.uk/ccg



More information about the Corpora mailing list