[Corpora-List] tip request

suhel2 at tin.it suhel2 at tin.it
Fri Aug 1 17:43:09 UTC 2008


Dear all,
I have been looking for some time, without success, for a large corpus
in 'canonical' phonetic transcription (canonical for the language in
question). It needs to be canonical, that is, transcribed from
orthography rather than from actual pronunciation, to avoid idiolectal
and dialectal variation. It also needs to represent connected speech,
so preferably no word lists. I need it to test an unsupervised, generic
syllabification algorithm of mine, so any language would do.
Your tips are my last resort!
Thank you so much,

Suhel Jaber
Ca' Foscari University of Venice

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


