New Cantonese corpus

Brian MacWhinney macw at cmu.edu
Sat Nov 8 18:45:40 UTC 2003


Dear Info-CHILDES,

  I am happy to announce the addition to CHILDES of a new cross-sectional
corpus from Cantonese.  The corpus include 70 files of interviews with 70
children ages 2;6 to 5;6.  The main lines are romanized and there is an
accompanying %mor line.  We plan to add a line with Chinese characters soon.
Digitized audio is also available, although it is not yet linked to the
files.

The construction of this database was funded by a grant from the Hong Kong
Government Language Fund to Paul Fletcher, Thomas H-T. Lee, Samuel Leung and
Stephanie Stokes and from a Hong Kong University Grant to Zehava Weizman and
Paul Fletcher. The present form of the database was produced by
Dr Zehava Weizman and Emily Ma. Research using these data should cite these
sources:

Milestones in the learning of spoken Cantonese by pre-school children.
Language Fund, Hong Kong. Paul Fletcher, Thomas H.T. Lee, Samuel Leung, and
Stephanie Stokes (1996-1999).

Fletcher, P., Leung, S. C-S., Stokes, S. F., & Weizman, Z.
O. (2000). Cantonese pre-school language development: A guide. Hong Kong:
Department of Speech and Hearing Sciences.

Weizman, Z and Fletcher, P. A comparative study of language development:
English and Cantonese pre-schoolers in Hong Kong. Committee on Research and
Conference Grants, Unoversity of Hong Kong. (2000).

Thanks to all the members of the Hong Kong University team for the
contribution of this important data set.  In the database, the folder name
for this corpus is simply "HKU" for Hong Kong University.

--Brian MacWhinney



More information about the Info-childes mailing list