[Corpora-List] Croatian Corpus

Anthony Weaver aweaver at cs.sunysb.edu
Wed Apr 9 16:13:41 UTC 2003


	I am doing some speech recognition for Croatian and I would like
to know if there is a freely available corpus for Croatian?  I am
specifically looking for a text corpus, maybe no smaller than about 20K
words.

 I would also like to know if there are any papers discussing Croatian pronunciation?
More specifically, it has been explained to me by a native speaker, and on
various sites on the web that Croatian pronunciation is mostly
unambiguous, but I have been unable to find any papers/research that would
support or refute this claim.  In English, each letter can have multiple
pronunciations, but this does not seem to occur for Croatian.  All help is
greatly appreciated.

Tony



More information about the Corpora mailing list