Excluding stories and songs from corpus

sit591 at g.harvard.edu sit591 at g.harvard.edu
Wed Apr 10 00:38:46 UTC 2019


Hi Prof. MacWhinney,

Thanks for your reply! Well, I guess it will take me a while to do this.

I have another question regarding the same study. Right now, I am using the 
code *kwal +sX -w10 +w5 -t*CHI, *where X is meant to be a placeholder for 
the words that I am interested in searching in the input. Ideally, however, 
I would prefer selecting a stretch of talk like this only if the target 
utterance that contains the word X does not constitute a repetition of the 
immediately preceding line (e.g., the parent only uses X because another 
speaker said X in the immediately preceding line). My question is pretty 
much the same as above: is there a practical way to exclude repetitive 
utterances of this sort?

Thank you so much!

Simge



On Monday, April 8, 2019 at 5:00:42 PM UTC-4, sit... at g.harvard.edu wrote:
>
> Hi all,
>
> I am doing a corpus study using the Providence corpus right now. For the 
> purposes of this study, I am interested in analyzing only the utterances 
> that are produced by the speakers during their natural conversational 
> exchanges, but the corpus also includes many stretches of talk that consist 
> of the stories that parents read to the children, or songs and nursery 
> rhymes they sing, etc. Is there a practical way to weed out these parts 
> from the corpus or do I have to face the gargantuan task of eliminating 
> them manually?
>
> Thanks in advance for your help!
>
> Simge Topaloglu
>
>

-- 
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To post to this group, send email to chibolts at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/53006962-d369-49f6-9e9e-809ca73708b5%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20190409/2b8bf74e/attachment.htm>


More information about the Chibolts mailing list