[Corpora-List] Spoken corpus with sound files?

Bernard Bel bernarbel at gmail.com
Sat Feb 18 13:14:10 UTC 2012


You could try EUROM1

http://sldr.org/sldr000034

which is freely accessible to scholars.

Bernard Bel <bernard.bel at lpl-aix.fr>
Laboratoire Parole et Langage
http://lpl-aix.fr
13604 Aix-en-Provence Cedex 1 (France)

Speech & Language Data Repository (SLDR, formerly CRDO-Aix)
http://sldr.org



>Dear All,
> 
>Might anyone be able to point me in the direction of a spoken corpus that fulfils the following criteria:
> 
>¾    Freely available;
>¾    With access to the speech files;
>¾    Of reasonable size – a student of mine is looking to study fairly infrequent patterns so we’re talking in excess of a million words required, certainly.
> 
>I’ve tried MICASE, but given that phonological analysis through PRAAT or some like software is involved, the quality of the recordings aren’t fit for the required purpose. I know some of the BNC spoken data is available from Oxford Uni’s webpages. The only probably with this is sound files are very long (three plus hours) and this doesn’t make the task at hand at all easy.
> 
>Any suggestions gratefully received.
> 
>Best wishes,
>Ben


_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list