[Corpora-List] Tips for syntactically annotated corpora of child speech?

Tomaz Erjavec tomaz.erjavec at ijs.si
Sat Jun 23 10:17:27 UTC 2012


Thanks to all for tips about using CHILDES as a treebank. Below a summary, in case anybody else will find this useful:

- The Brown dataset in CHILDES is annotated with syntactic dependencies (part of them are even manually assigned dependencies):
http://childes.psy.cmu.edu/data/Eng-USA/Brown.zip  /Grzegorz Chrupała/

- There is a Penn-treebank version of (part of ) CHILDES; it was parsed with the Charniak parser and somewhat manually corrected, and can be search with Standford's Tregex:
http://www.socsci.uci.edu/~lpearl/CoLaLab/TestingUG/childestreebank.html   /Bob Berwick/

- Sketch Engine contains most CHILDES/TalkBank data and its and CQL language can be used to formulate complex queries:
http://the.sketchengine.co.uk   /Adam Kilgarriff/

- There is a Google Group called CHIBOLTS for technical questions about using the CLAN program with CHILDES:
http://groups.google.com/group/chibolts/about  /Jean Crawford/

Given the time constraints on the seminar (and the fact that the University of Nova Gorica where Calum is studying has SkE licences) we will go for exploring CHILDES in SkE, but the other answers are very much appreciated too, esp. if we take this research further.

Best,
Tomaž

-----Original Message-----
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Tomaz Erjavec
Sent: Tuesday, June 19, 2012 1:48 PM
To: Corpora at uib.no
Subject: [Corpora-List] Tips for syntactically annotated corpora of child speech?

Dear all,
I'm posting this on behalf of a student of mine - any tips gratefully received:

Dear All,
I am trying to use CHILDES to determine whether children acquire quantifiers in subject position before object position, but I am not sure whether I can use CHILDES to perform such a search. Does anyone know whether this sort of search is supported? I am finding it rather difficult to determine whether such a search is possible from reading the manual.
If anyone has any suggestions re an alternative corpus of spontaneous kid's speech that allows searching by subject/object with an easy-to-use concordancer, I would greatly appreciate hearing about it!
Best regards,
Calum Riach

-- 
Tomaž Erjavec, http://nl.ijs.si/et/
Dept. of Knowledge Technologies, Jožef Stefan Institute, Ljubljana



_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list