Running FREQ for bilingual transcripts
Brian MacWhinney
macw at cmu.edu
Wed Jun 29 18:41:45 UTC 2016
Dear Lulu,
I think you want +s"[- zho]" in this case, not -s"[- zho]". When I run
freq +s"[- yue]" +t*CHI *.cha +u
on CharlotteEng, I get both the English words marked as @s and the Cantonese.
--Brian
From: ChiBolts <chibolts at googlegroups.com> on behalf of Lulu <lulusong at gmail.com>
Reply-To: ChiBolts <chibolts at googlegroups.com>
Date: Tuesday, June 28, 2016 at 10:34 PM
To: ChiBolts <chibolts at googlegroups.com>
Subject: Re: Running FREQ for bilingual transcripts
Hi Brian,
I tried to run the reverse command on the same transcript (mostly English with a dozen words in Chinese)
freq +tTCH -s"[- zho]" +s"*@s" *.cha (I added * after @s because my transcript also tags whether the @s word is a noun or a verb)
hoping to add the few @s English words embedded in [- zho] lines to the English word counts, but only got 0's. With +s"*@s*" removed, I get good results which don't include the @s English words. Not sure how I can fix this.
Thanks!
Lulu
On Tuesday, June 28, 2016 at 10:22:40 PM UTC-4, Lulu wrote:
Dear Brian,
That just did magic! Thank you so much!
Best,
Lulu
On Tuesday, June 28, 2016 at 10:15:26 PM UTC-4, Brian MacWhinney wrote:
Dear Lulu,
Without seeing your transcripts, I can’t say exactly what is wrong. However, if you run this similar command on the CharlotteEng folder in the YipMatthews corpus, you get good results:
freq +t*CHI +s"[- yue]" *.cha
The idea is that this will include all words on the [- yue] lines, including those with @s, although the latter are pretty rare. If you want to exclude those, just add -s"*@s".
-- Brian MacWhinney
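
For readers who want the selection logic spelled out, here is a minimal Python sketch of what the message above describes: keep only one speaker's utterances that begin with a language precode such as [- yue], count every word on them, and optionally drop words carrying an @s marker. It is an illustration only and not how CLAN is implemented. The helper name count_precode_words, the sample lines, and the simplified tokenization (whitespace split, bare punctuation dropped, single-line utterances, no dependent tiers) are all assumptions.

```python
from collections import Counter


def count_precode_words(lines, speaker, precode, exclude_marked=False):
    """Count words on one speaker's utterances that start with a precode.

    Hypothetical helper for illustration; it does not reproduce CLAN's
    tokenization or its handling of continuation lines and dependent tiers.
    """
    prefix = f"*{speaker}:"
    counts = Counter()
    for line in lines:
        if not line.startswith(prefix):
            continue  # other speakers, headers (@...), dependent tiers (%...)
        tokens = line[len(prefix):].split()
        # A precode such as [- yue] splits into two tokens: "[-" and "yue]".
        if tokens[:2] != ["[-", precode + "]"]:
            continue  # utterance is not tagged with the requested language
        for token in tokens[2:]:
            if not any(ch.isalnum() for ch in token):
                continue  # skip bare punctuation such as "." or "?"
            if exclude_marked and "@s" in token:
                continue  # roughly the effect described for -s"*@s"
            counts[token] += 1
    return counts


# Made-up sample data, not taken from any corpus.
sample = [
    "@Begin",
    "*CHI:\t[- yue] ni1 book@s ni1 .",
    "*CHI:\tmore juice .",
    "*MOT:\t[- yue] hai6 .",
]

all_words = count_precode_words(sample, "CHI", "yue")
unmarked = count_precode_words(sample, "CHI", "yue", exclude_marked=True)
print(dict(all_words))
print(dict(unmarked))
```

In this sketch the first call counts the @s-marked word along with the rest of the [- yue] line, and the second call leaves it out, which mirrors the difference between running with and without the extra -s option.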
From: ChiBolts <chib... at googlegroups.com> on behalf of Lulu <lulu... at gmail.com>
Reply-To: ChiBolts <chib... at googlegroups.com>
Date: Tuesday, June 28, 2016 at 5:10 PM
To: ChiBolts <chib... at googlegroups.com>
Subject: Running FREQ for bilingual transcripts
Hi Brian and team members,
I ran the freq command
freq +tTCH +s"[- zho]" *.cha
for transcripts that contain bilingual utterances (e.g., *TCH: [- zho] this@s$n 星). The dominant language of the transcripts was English so we marked utterances that contained Chinese with [- zho]. The output types and tokens included all the English words that were marked @s. I thought I would get the types and tokens of all the Chinese words by running the above command. Is the problem with the transcript or the command?
Thank you!
Lulu
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+u... at googlegroups.com.
To post to this group, send email to chib... at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/0e5d867c-79b1-4d36-87be-1303f390a83b%40googlegroups.com<https://groups.google.com/d/msgid/chibolts/0e5d867c-79b1-4d36-87be-1303f390a83b%40googlegroups.com?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.