[Corpora-List] Off-the-shelf coreference packages
Mark Sammons
mssammon at illinois.edu
Mon Jul 18 19:36:24 UTC 2011
Hi, All.
Craig Pfeifer said:
>
>You could try the coreference module from the U Illinois Cognitive
>Computation Group:
>
>http://cogcomp.cs.illinois.edu/page/software_view/18
>
>one caveat : "To train the coreference classifier, you will need
>annotated training data such as the LDC's ACE 2004 corpus. "
Just in the interest of clarity: the downloadable jar already comes
with a trained classifier, so you should be able to use it as-is without
training it.
Regarding the types of errors it makes: it is somewhat conservative,
according to precision/recall/f1 statistics, in that it tends to make errors of
link omission rather than of adding spurious links. However, it still adds
*some* spurious links.
It also anecdotally performs better on medium-long texts (i.e. more than
just one or two sentences) than on short texts, presumably because
additional context helps.
I hope this helps.
Regards,
Mark
Mark Sammons
Principal Research Scientist
Cognitive Computation Group
University of Illinois, Dept. Computer Science
(217) 265-6759 mssammon at illinois.edu
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list