[Corpora-List] are there corpora of fast speech?

Ute Römer ute.roemer at uni-koeln.de
Wed Jan 15 08:09:36 UTC 2003


Dear Dinoj and others,

With the Bergen Corpus of London Teenage Language (COLT) Eric Atwell
mentioned you can get both transcripts and MP3 files (55 hours of
spontaneous conversation on CDs -- don't know how many words per minute you
get). Also, there is a sound-text-alignment, so you can search the corpus
and get hyperlinks to the sound files. The texts are orthographically
transcribed (and word class tagged); I doubt that you will manage to find
many phonemically (or even phonetically) transcribed corpora (if any at
all -- who would want to phonetically transcribe 500,000 words or more?).
For more information on COLT see http://www.hit.uib.no/colt/

Good luck with your research!
Best wishes... Ute

_______________________

Ute Römer
English Department
University of Cologne
Albertus-Magnus-Platz 1
50923 Köln
Germany

Phone: 0049 (0)221 470 3038
Email: ute.roemer at uni-koeln.de
_______________________


----- Original Message -----
From: "Dinoj Surendran" <dinoj at cs.uchicago.edu>
To: <CORPORA at HIT.UIB.NO>
Sent: Tuesday, January 14, 2003 7:43 PM
Subject: [Corpora-List] are there corpora of fast speech?


> Dear list members,
>
> Does anyone know if there is a (at least) phonetically transcribed corpus
> of fast English speech? A corpus of spontaneous speech known to have
> several fast speakers could also work. And while I would prefer to have
> both the sound files and the transcription files, the latter only will
> still be of use.
>
> Thanks,
>
> Dinoj Surendran
> Graduate Student
> Computer Science Dept
> University of Chicago
>
>



More information about the Corpora mailing list