[Corpora-List] CODA Parallel Monologue-Dialogue Corpus

P.Piwek p.piwek at open.ac.uk
Thu Jul 29 16:38:35 UTC 2010


 
  **************************************************************************
  The CODA corpus of parallel annotated monologue and dialogue
 
  Version 1.0 (July 2010) - NOW AVAILABLE FOR DOWNLOAD

  http://computing.open.ac.uk/coda/data.html
   
  **************************************************************************
 
The corpus contains approximately 700 turns of human-authored expository 
dialogue (by Mark Twain and George Berkeley), aligned with monologue that 
expresses the same information. The monologue side is annotated with 
coherence relations from Rhetorical Structure Theory (RST), and the 
dialogue side with dialogue act tags. All annotations are in XML format.
See the CODA Corpus Annotation Manual at 
http://computing.open.ac.uk/coda/publications.html
for further details.

The corpus is available under a Creative Commons 
Attribution-NonCommercial-ShareAlike License.

Please consult the CODA website (http://computing.open.ac.uk/coda/) or write to 
MCT-CODA-Project at open.ac.uk for additional information. See also:

S. Stoyanchev and P. Piwek. Constructing the CODA Corpus: A Parallel Corpus 
of Monologues and Expository Dialogues. 7th International Conference on 
Language Resources and Evaluation (LREC 2010), Malta. 
http://oro.open.ac.uk/20919/1/LREC2010-CODA-final.pdf
 
                         ******************************************

The CODA project is supported by the EPSRC under grant EP/G020981/1.
