a new dialogue data Corpus: Dialogue Diversity Corpus (DDC)

William Mann bill_mann at SIL.ORG
Wed Oct 9 18:49:00 UTC 2002


  
Announcement

DIALOGUE DIVERSITY CORPUS

http://www-rcf.usc.edu/~billmann/diversity

(apologies if you receive multiple copies)

A new corpus is available for facilitating research on human dialogue. 

The Dialogue Diversity Corpus (DDC) gives direct access to a set of dialogue transcripts (13 sources, more than 12 hours of dialogue, all in English.). It also gives a set of links and methods for accessing hundreds of additional dialogues (principally in English.) Several sources provide speech data as well as transcripts.

The dialogues in this corpus occurred in a very diverse collection of interactive situations. Thus it is a data resource for studies of the breadth of coverage of particular dialogue models, and for studies that compare dialogue from different situations. 

For smaller projects such as pilot studies, program testing and even some term papers, the direct access portion will be sufficient. The access methods may yield enough dialogue data for some much larger studies. 

The corpus is designed for data finding rather than for bulk processing. Taken as a whole, it is irregular and not homogeneous in any way. It is generally unsuitable for drawing any conclusions about dialogue taken as a single category.

===============
William C. Mann
 
bill_mann at sil.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/discours/attachments/20021009/7ed32356/attachment.htm>


More information about the Discours mailing list