New French and English Comparable Corpora
Brian MacWhinney
macw at cmu.edu
Wed Aug 15 18:42:28 UTC 2007
Dear Info-CHILDES,
I am happy to announce the addition to CHILDES of two new corpora
designed to provide a direct comparison between French and English,
while also providing excellent material for the study of the learning
of each language separately. The two corpora each include six
children (3 male, 3 female) studied intensively between the ages of
one and three. The English data are from Katherine Demuth's group at
Brown University in Providence RI and the French data are from
Harriet Jisa's group at the University of Lyon 2. The files are all
linked to video and the video for 10 of the children is available
from the web. For easier web playback, we will eventually extract
MP3 and Wave files from the video. A few of the segments of the
French data are not yet in place, but will be configured shortly.
All of the data are also transcribed throughout in IPA on the %pho
line, making this now the largest database available for the study of
child phonology.
The English corpus can be found under /Eng-USA/Providence and the
French corpus can be found under /Romance/French/Lyon. Fuller
documentation can be found in the database manuals.
Many thanks to Katherine, Harriet, and their coworkers for this
enormously impressive contribution. I believe this corpus will have
a major impact on a wide variety of issues in the study of language
development.
--Brian MacWhinney
More information about the Info-childes
mailing list