Dense corpora

Hao Wang haowang at usc.edu
Wed Apr 27 07:35:24 UTC 2011


Thank everyone who replied. Here is a quick summary of the dense corpora 
that now I am aware of.

Available on CHILDES
Estonian by Maigi Vija
at age 2 and also 3 (6 weeks of one-hour recordings, 5 days per week

Naima in the Providence Corpus
about 1.5 hrs. a week from around 1;4 years until around 3.

Thomas (UK English)
Five one-hour sessions per week, from 2 to 3.

Lara (UK English)
1;9 to 3;3, 119 hours and 50 mins, 48,940 child utterances,
80,397 mother utterances, 13,767 father utterances

Leo (German)

Anne and Aran in Manchester corpus also have fair large number of utterances
anne 1;10.07-2;9.10, 19868 child utterances, 36220 mother utterances
aran 1;11.12-2;10.28, 17111 child utterances, 34487 mother utterances

Others
Orit Ashkenazi from Tel aviv university
2 Hebrew speaking toddlers 1;8-2;3 which is being coded and analyzed.

Deb Patel at MIT media lab

Best,
Hao
--
Graduate Student
Department of Psychology
University of Southern California

-- 
You received this message because you are subscribed to the Google Groups "Info-CHILDES" group.
To post to this group, send email to info-childes at googlegroups.com.
To unsubscribe from this group, send email to info-childes+unsubscribe at googlegroups.com.
For more options, visit this group at http://groups.google.com/group/info-childes?hl=en.



More information about the Info-childes mailing list