[Corpora-List] CCGbank
Julia Hockenmaier
juliahr at cis.upenn.edu
Wed Jun 8 18:18:55 UTC 2005
CCGbank is now available from the LDC:
http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13
CCGbank is a translation of the Penn Treebank into a corpus of
Combinatory Categorial Grammar derivations. It pairs syntactic
derivations with sets of word-word dependencies which approximate the
underlying predicate-argument structure. CCGbank contains 99.44% of
the sentences in the Penn Treebank, for which it corrects a number of
inconsistencies and errors in the original annotation.
CCGbank can also be searched with Douglas Rohde's TGrep2, version 1.15 or higher.
Julia Hockenmaier and Mark Steedman
juliahr at cis.upenn.edu, steedman at inf.ed.ac.uk
http://groups.inf.ed.ac.uk/ccg
More information about the Corpora
mailing list