[Corpora-List] ParCor v1.0 - A Parallel Pronoun-Coreference Corpus

Jörg Tiedemann Jorg.Tiedemann at lingfil.uu.se
Mon May 26 07:15:18 UTC 2014


Dear corpora readers,

We are happy to announce a new public resource which is available from
http://opus.lingfil.uu.se/ParCor/

ParCor 1.0 is a parallel corpus of texts in which pronoun coreference -- reduced coreference in which pronouns are used as referring expressions -- has been annotated. It consists of a collection of parallel English-German documents from two different text genres: TED Talks (transcribed planned speech), and EU Bookshop publications (written text). All documents in the corpus have been manually annotated with respect to the type and location of each pronoun and, where relevant, its antecedent.

The corpus is intended to be used both as a resource from which to learn systematic differences in pronoun use between languages and ultimately for developing and testing informed Statistical Machine Translation systems aimed at addressing the problem of pronoun coreference in translation.

If you make use of the ParCor corpus in your work, please cite the following article:

• Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber (2014): ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT, In Proceedings of LREC 2014, Reykjavik, Iceland

Please come to our poster if you happen to be at LREC this year!

Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber



**********************************************************************************
 Jörg Tiedemann                                   jorg.tiedemann at lingfil.uu.se<mailto:jorg.tiedemann at lingfil.uu.se>
 Dep. of Linguistics and Philology           http://stp.lingfil.uu.se/~joerg/
 Uppsala University                                  tel:  +46 (0)18 - 471 1412
 Box 635, SE-751 26 Uppsala/SWEDEN    fax: +46 (0)18 - 471 1094



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140526/6be71fc0/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list