Searching for Double Object Ditransitives
Brian MacWhinney
macw at cmu.edu
Wed Dec 21 20:39:35 UTC 2011
Dear Liam,
The best way to do this would be to create a test file. That file would include
as much variation in the configuration of double object sentences as you can think of.
You would start by collecting about 60 such sentences by hand and eye from
various corpora. Then perhaps you would imagine some other possible combinations.
Then you would see if your search strings correctly located each occurrence.
If you can first do the work of composing a test file, we could go from there.
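
As a rough illustration of how such a test file could be used, here is a minimal Python sketch. It assumes you have given each test sentence an identifier, marked by hand whether it is a double object sentence, and collected the identifiers of the sentences your search matched. The function name, the identifiers, and the data layout are all hypothetical; none of this is part of CLAN.

```python
def score_search(gold, hits):
    """Compare search hits against hand-labelled test sentences.

    gold: dict mapping sentence id -> True if the sentence is a
          double object sentence (hand-labelled, hypothetical layout)
    hits: set of sentence ids that the search string matched
    """
    targets = {sid for sid, is_target in gold.items() if is_target}
    found = targets & hits
    return {
        # share of true double object sentences the search located
        "recall": len(found) / len(targets) if targets else 0.0,
        # share of hits that really are double object sentences
        "precision": len(found) / len(hits) if hits else 0.0,
        "missed": sorted(targets - hits),
        "false_alarms": sorted(hits - targets),
    }


# Toy data for illustration only; these are not real corpus results.
gold = {"s1": True, "s2": True, "s3": False, "s4": True}
hits = {"s1", "s3", "s4"}
report = score_search(gold, hits)
print(report["missed"], report["false_alarms"])
```

The "missed" and "false_alarms" lists are the useful part: each entry points to a sentence configuration that the search string needs to be revised to handle.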
Regarding your %mor line attempt, I can easily think of many cases it would miss, such
as sentences with two nouns as objects. In theory the %gra line should be more definitive,
but the accuracy of object tagging there is only about 90%, so the GRASP tagger
is itself going to miss some things.
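
One way to avoid depending on word positions in the %gra line is to look only at the relations themselves. The Python sketch below is an assumption-laden illustration, not a CLAN feature: it treats a %gra line as space-separated index|head|RELATION items and reports any head that governs both an OBJ and an OBJ2. The labels OBJ and OBJ2 are taken from the search strings in the quoted message, and the function name is hypothetical; check the labels against the files you are actually using.

```python
from collections import defaultdict


def double_object_heads(gra_line, first="OBJ", second="OBJ2"):
    """Return the head indices that govern both a `first` and a `second` relation.

    gra_line: one %gra tier, assumed to be index|head|RELATION items
              separated by whitespace (the tier label is optional)
    """
    body = gra_line.split(":", 1)[1] if gra_line.startswith("%gra:") else gra_line
    relations_by_head = defaultdict(set)
    for item in body.split():
        parts = item.split("|")
        # Skip malformed items rather than guessing what they meant.
        if len(parts) != 3 or not parts[1].isdigit():
            continue
        _index, head, relation = parts
        relations_by_head[int(head)].add(relation)
    return sorted(
        head
        for head, relations in relations_by_head.items()
        if first in relations and second in relations
    )


# Hand-built example tiers for illustration; not taken from a corpus.
ditransitive = "%gra:\t1|2|SUBJ 2|0|ROOT 3|2|OBJ 4|5|DET 5|2|OBJ2"
prepositional = "%gra:\t1|2|SUBJ 2|0|ROOT 3|4|DET 4|2|OBJ 5|2|JCT 6|5|POBJ"
print(double_object_heads(ditransitive), double_object_heads(prepositional))
```

Because this checks heads rather than positions, it does not matter whether a subject, determiner, or quantifier shifts the word indices. It is still limited by the tagging accuracy mentioned above, so its output would need the same checking against a test file.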
Generally, this is probably going to take repeated work and testing.
-- Brian MacWhinney
On Dec 21, 2011, at 1:58 PM, Liam Considine wrote:
> Hey Chibolts Community,
>
> I am working on extracting double object ditransitive occurrences from
> the CHILDES corpus.
>
> "John give me the cookie"
>
> I've tried a handful of different searches on the %mor and %gra line.
> I would really like some other people who are familiar with CLAN
> syntax to check out my searches. I have already made a search for the
> prepositional dative, so I want this search to exclude those
> instances.
>
> Here is my %mor line attempt:
> combo +t*CHI +t%mor +sv*^(pro*)^(det*+qn*+pro*)^(n*+pro*) +k +r2 +u *.cha
>
> My first %gra line form:
> combo +t*CHI +t%gra +s"1|0|ROOT^2|1|OBJ^((3|4|DET^ 4|1|OBJ2)+3|1|OBJ2)" +k +r2 +u *.cha
>
> My best effort %gra line:
> combo +t*CHI +t%gra +s"(1|2|SUBJ^2|0|ROOT^3|2|OBJ^((4|2|OBJ2)+(4|5DET^5|2|OBJ2)))+(1|0|ROOT^2|1|OBJ^((3|4|DET^ 4|1|OBJ2)+3|1|OBJ2))" +k +r2 +u *.cha
>
> I've selected the same data files from CHILDES as Anat Ninio does in
> the book "Syntactic Development, Its Input and Output". This seems to
> be about 75% of all the files available.
>
> With my bigger %gra search I'm getting about 1075 hits. Is this
> consistent with the frequency of occurrence others have seen? Does my
> syntax have any glaring errors?
>
> Thanks for all the time and energy,
> Liam Considine
>
> --
> You received this message because you are subscribed to the Google Groups "chibolts" group.
> To post to this group, send email to chibolts at googlegroups.com.
> To unsubscribe from this group, send email to chibolts+unsubscribe at googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/chibolts?hl=en.
>
>