<table cellspacing="0" cellpadding="0" border="0" ><tr><td valign="top" style="font: inherit;">Hi, Kevin,<br><br>There is also some work at LDC that may be relevant; it uses a TAG-based decomposition of the treebank to compare syntactic structures:<br><br>
<a class="fixed" href="http://papers.ldc.upenn.edu/ACL2011/DerivationTrees_TBErrorDetection.pdf" target="_blank">http://papers.ldc.upenn.edu/ACL2011/DerivationTrees_TBErrorDetection.pdf</a><br>
<br>
Seth Kulick, Ann Bies, and Justin Mott<br>
Using Derivation Trees for Treebank Error Detection<br>
ACL 2011, Portland, Oregon, USA, June 19-24, 2011<br>
Available: Paper in PDF<br><br>Thanks,<br><br>Ann<br><br><br>--- On <b>Fri, 6/17/11, Kevin B. Cohen <i>&lt;kevin.cohen@gmail.com&gt;</i></b> wrote:<br><blockquote style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px;"><br>From: Kevin B. Cohen &lt;kevin.cohen@gmail.com&gt;<br>Subject: [Corpora-List] Automatically checking a treebank for errors<br>To: "Corpora List" &lt;corpora@uib.no&gt;<br>Date: Friday, June 17, 2011, 3:47 PM<br><br><div class="plainMail">Does anyone know of any tricks for automatically checking a Penn<br>Treebank-style corpus for obvious errors? I've done some simple stuff<br>in the past for checking POS tags, like looking for punctuation marks<br>with non-punctuation tags, which turned out to be really fruitful, but<br>I can't think of anything clever to do for the syntactic structures.<br><br>Kev<br><br>-- <br>Kevin Bretonnel Cohen, PhD<br>Biomedical Text Mining Group Lead, Computational
Bioscience Program,<br>U. Colorado School of Medicine<br>303-916-2417 (cell) 303-377-9194 (home)<br><a href="http://compbio.ucdenver.edu/Hunter_lab/Cohen" target="_blank">http://compbio.ucdenver.edu/Hunter_lab/Cohen</a><br><br>_______________________________________________<br>UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>Corpora mailing list<br><a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br></div></blockquote></td></tr></table>