[Corpora-List] Corpora annotated with coreference relations and word senses?
Josef Meyer
jmeyer at ics.mq.edu.au
Sat Dec 14 04:42:40 UTC 2002
Dear List,
I am a PhD student currently doing research into resolving associative
anaphors. My approach to filtering antecedents involves learning
semantic relationships between concepts from an (automatically) parsed
corpus. Until now, my evaluations have made use of about 1000
instances of associative anaphors drawn from a set of encyclopedia
entries; however, this is clearly inadequate for the sort of
evaluation that I would like to include in my thesis.
I was wondering about the availability of corpora of written
English annotated with the following types of information:
(1) Coreference relationships between NPs
(2) Bridging relationships between NPs, and
(3) Word sense information (WordNet synsets) for NPs
Ideally I would like to find a number of domain-restricted corpora
annotated with all three types of information; however, I am well
aware that it is unlikely that any such corpus currently exists.
What I am hoping for is a pointer to a corpus annotated with (1) and
(3).
Currently I am aware of the following publicly available corpora
that are annotated with WordNet senses:
- SemCor, which contains a subset of the Brown Corpus annotated with
WordNet 1.6 senses (http://www.cogsci.princeton.edu/~wn/wn1.6.shtml)
- The Senseval-1 and Senseval-2 corpora, the latter of which contains
many instances of a limited set of words annotated with WordNet 1.7
senses, and is mainly drawn from the WSJ entries in the Penn
Treebank (http://www.senseval.org/)
I have also come across the following corpora that are annotated with
coreference information:
- A corpus of 7 texts (mainly instructional) from the University of
Wolverhampton (http://clg.wlv.ac.uk/resources/corpus.html)
Some time ago I thought that I had seen a set of WSJ entries annotated
with coreference information, but I wasn't able to locate this in the
quick search that I performed yesterday.
Regards,
- jo
--
+-------------------------------------------------------------------+
| Josef Meyer | http://www.mri.mq.edu.au/~jmeyer |
| Information and Communication | jmeyer at ics.mq.edu.au |
| Sciences, Macquarie University | Phone: +61 2 9850 9571 |
| NSW, Australia 2109 | Fax: +61 2 9850 9542 |
+-------------------------------------------------------------------+