[Corpora-List] release : HESITA Corpus

Sara Candeias saracandeias at co.it.pt
Tue Jun 11 11:32:20 UTC 2013


We are proud to announce the release of the HESITA Corpus, a large collection of annotations of hesitations/disfluencies events, acoustical environments, speaking styles, speaker characteristics and respiratory events, among other characteristic sounds.

The corpus consists of television daily news broadcasts audio collected over a month (about 27 hours of European Portuguese continuous speech), out of which a total of 4608 hesitation events are annotated.


This corpus is accompanied by the paper:

"HESITA(tions) in Portuguese: a database."

Candeias, S., Celorico, D., Proença, J., Perdigão, F.

DiSS 2013, ISCA endorsed Interspeech 2013 satellite workshop, August  21-23, 2013, KTH Royal Institute of Technology, Stockholm, Sweden.

 
The download site is:

http://lsi.co.it.pt/spl/hesitation/downloads.html

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20130611/da689449/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list