[Corpora-List] ParCor v1.0 - A Parallel Pronoun-Coreference Corpus
Jörg Tiedemann
Jorg.Tiedemann at lingfil.uu.se
Mon May 26 07:15:18 UTC 2014
Dear corpora readers,
We are happy to announce a new public resource which is available from
http://opus.lingfil.uu.se/ParCor/
ParCor 1.0 is a parallel corpus of texts in which pronoun coreference -- reduced coreference in which pronouns are used as referring expressions -- has been annotated. It consists of a collection of parallel English-German documents from two different text genres: TED Talks (transcribed planned speech) and EU Bookshop publications (written text). All documents in the corpus have been manually annotated with respect to the type and location of each pronoun and, where relevant, its antecedent.
The corpus is intended both as a resource for learning systematic differences in pronoun use between languages and, ultimately, as a basis for developing and testing informed Statistical Machine Translation systems that address the problem of pronoun coreference in translation.
If you make use of the ParCor corpus in your work, please cite the following article:
• Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber (2014): ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT. In Proceedings of LREC 2014, Reykjavik, Iceland.
Please come to our poster if you happen to be at LREC this year!
Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber
**********************************************************************************
Jörg Tiedemann jorg.tiedemann at lingfil.uu.se
Dep. of Linguistics and Philology http://stp.lingfil.uu.se/~joerg/
Uppsala University tel: +46 (0)18 - 471 1412
Box 635, SE-751 26 Uppsala/SWEDEN fax: +46 (0)18 - 471 1094