<div dir="ltr">Hi Brian,<br><br>I tried to run the reverse command on the same transcript (mostly English with a dozen words in Chinese)<br><br>freq +tTCH -s"[- zho]" +s”*@s” *.cha (I added * after @s because my transcript also tags if the @s word is a noun or a verb)<br><br>hoping to add the few @s English words embedded in [- zho] lines to the English word counts, but only got 0's. With +s"*@s*" removed, I get good results which don't include the @s English words. Not sure how I can fix this.<br><br>Thanks!<br><br>Lulu<br><br>On Tuesday, June 28, 2016 at 10:22:40 PM UTC-4, Lulu wrote:<blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;"><div dir="ltr">Dear Brian,<br><br>That just did magic! Thank you so much!<br><br>Best,<br>Lulu<br><br>On Tuesday, June 28, 2016 at 10:15:26 PM UTC-4, Brian MacWhinney wrote:<blockquote class="gmail_quote" style="margin:0;margin-left:0.8ex;border-left:1px #ccc solid;padding-left:1ex">
<div bgcolor="white" link="blue" vlink="purple" lang="EN-US">
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri">Dear Lulu,</span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri">Without seeing your transcripts, I can’t say exactly what is wrong.  However, if you run this similar command on the CharlotteEng folder in the YipMatthews corpus,
 you get good results:</span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri">freq +t*CHI +s"[- yue]" *.cha</span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri">The idea is that this will include all words on the [- yue] lines including those with @s, although the latter are pretty rare.  If you want to exclude those,
 just add –s”*@s”</span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
<p class="MsoNormal" style="text-indent:9.0pt"><span style="font-size:11.0pt;font-family:Calibri">-- Brian MacWhinney</span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:Calibri"> </span></p>
<div style="border:none;border-top:solid #b5c4df 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-family:Calibri;color:black">From: </span>
</b><span style="font-family:Calibri;color:black">ChiBolts <<a rel="nofollow">chib...@googlegroups.com</a>> on behalf of Lulu <<a rel="nofollow">lulu...@gmail.com</a>><br>
<b>Reply-To: </b>ChiBolts <<a rel="nofollow">chib...@googlegroups.com</a>><br>
<b>Date: </b>Tuesday, June 28, 2016 at 5:10 PM<br>
<b>To: </b>ChiBolts <<a rel="nofollow">chib...@googlegroups.com</a>><br>
<b>Subject: </b>Running FREQ for bilingual transcripts</span></p>
</div>
<div>
<p class="MsoNormal"> </p>
</div>
<div>
<div>
<div>
<p class="MsoNormal">Hi Brian and team members,<br>
<br>
I ran the freq command<br>
<br>
freq +tTCH +s"[- zho]" *.cha<br>
<br>
for transcripts that contain bilingual utterances (e.g., *TCH:    [- zho] this@s$n
<span style="font-family:"MS Mincho"" lang="ZH-CN">星</span>). The dominant language of the transcripts was English so we marked utterances that contained Chinese with [- zho]. The output types and tokens included all the English words that were marked @s. I
 thought I would get the types and tokens of all the Chinese words by running the above command. Is the problem with the transcript or the command?<br>
<br>
Thank you!<br>
<br>
Lulu</p>
</div>
<p class="MsoNormal">-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to
<a rel="nofollow">chibolts+u...@googlegroups.com</a><wbr>.<br>
To post to this group, send email to <a rel="nofollow">chib...@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/0e5d867c-79b1-4d36-87be-1303f390a83b%40googlegroups.com?utm_medium=email&utm_source=footer" rel="nofollow" target="_blank" onmousedown="this.href='https://groups.google.com/d/msgid/chibolts/0e5d867c-79b1-4d36-87be-1303f390a83b%40googlegroups.com?utm_medium\x3demail\x26utm_source\x3dfooter';return true;" onclick="this.href='https://groups.google.com/d/msgid/chibolts/0e5d867c-79b1-4d36-87be-1303f390a83b%40googlegroups.com?utm_medium\x3demail\x26utm_source\x3dfooter';return true;">
https://groups.google.com/d/<wbr>msgid/chibolts/0e5d867c-79b1-<wbr>4d36-87be-1303f390a83b%<wbr>40googlegroups.com</a>.<br>
For more options, visit <a href="https://groups.google.com/d/optout" rel="nofollow" target="_blank" onmousedown="this.href='https://groups.google.com/d/optout';return true;" onclick="this.href='https://groups.google.com/d/optout';return true;">https://groups.google.com/d/<wbr>optout</a>.</p>
</div>
</div>
</div>
</div>
</blockquote></div></blockquote></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href="mailto:chibolts@googlegroups.com">chibolts@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/56bb5824-2983-4e64-9111-42841037333f%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/56bb5824-2983-4e64-9111-42841037333f%40googlegroups.com</a>.<br />
For more options, visit <a href="https://groups.google.com/d/optout">https://groups.google.com/d/optout</a>.<br />