<div dir="auto">I'll try that. Thank you!</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, 2 Jul 2020, 18:17 Leonid Spektor, <<a href="mailto:spektor@andrew.cmu.edu">spektor@andrew.cmu.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space">Cynthia,<div><br></div><div><span style="white-space:pre-wrap"> </span>I would suggets the following commands to extract the right gems from your data files:</div><div><br></div><div><div><font color="#ff2600">gem +d +t@ID +sdisputa_1 +f<span>disputa_1</span> filenames.cha</font></div><div><font color="#ff2600">gem +d +t@ID +sdisputa_2 +fdisputa_2 filenames.cha</font></div><div> etc...</div><div><br></div><div>Next run <font color="#ff2600"><font face="Lucida Grande">freq +d3 *.disputa*.cex</font></font><font face="Lucida Grande"> on resulting GEM output files. The Excel output will be called <font color="#ff2600">stat.frq.xls</font>.</font><br><br>Leonid.
</div>
<div><br><blockquote type="cite"><div>On Jul 2, 2020, at 16:49, Cynthia Audisio <<a href="mailto:cpaudisio@gmail.com" target="_blank" rel="noreferrer">cpaudisio@gmail.com</a>> wrote:</div><br><div><div dir="ltr"><div><div><div>Hello Leonid,</div><div><br></div><div>Our data have @ID: headers and we want to get type/token totals and ratio for *speaker words*. It would be of great help to get the output as a spreadsheet. Gemfreq does almost all the work. The only missing information is the one in blue in the following sample output:</div></div><div><br></div><div>From file <........></div><div> 3 tiers in gem "disputa_1":</div><div> 1 abuela</div><div> 2 dije</div><div> 1 duele</div><div> 1 eso</div><div> 1 heladito</div><div> 1 idea</div><div> 2 la</div><div> 2 mamá</div><div> 1 me</div></div><div><span style="color:rgb(4,51,255)">------------------------------</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)"> 9 Total number of different item types used</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)"> 11 Total number of items (tokens)</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)">0.818 Type/Token ratio</span><br></div><div><span style="color:rgb(4,51,255)"><br></span></div><div><div><div>From file <........></div><div> 4 tiers in gem "disputa_2":</div><div> 3 no</div><div> 1 porque</div><div> 1 sí</div><div> 2 tenía</div><div> 1 un</div><div> 1 yo</div></div><div><span style="color:rgb(4,51,255)">------------------------------</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)"> 9 Total number of different item types used</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)"> 11 Total number of items (tokens)</span><br style="color:rgb(4,51,255)"><span style="color:rgb(4,51,255)">0.818 Type/Token ratio</span></div></div><div><br></div><div><br></div><div>etc.</div><div><br></div><div><br></div><div><br></div><br><br>El jueves, 2 de julio de 2020, 17:05:33 (UTC-3), Leonid Spektor escribió:<blockquote class="gmail_quote" style="margin:0;margin-left:0.8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space">Cynthia,<div><br></div><div><span style="white-space:pre-wrap"> </span>First I need to get more information from you. Do your data files have an @ID: headers? Do you want to get type/token and type/token ratio for speaker words or for morphological analysis words or for lemmas? Do you want the output in plain readable text format or in Excel format? Different answers to those questions will require different commands to get the exact result that you want.</div><div><br></div><div>Please allow me to explain the reason for my second question. For example, if you have the following sentence:</div><div><br></div><div><font color="#ff2600">*MOT: you can't put it on the table and table it.</font><br><div><br></div><div>If you run FREQ on <font color="#ff2600">speaker words</font>, then you will get result:</div><div><br></div><div><div><font color="#0433ff"> 1 and</font></div><div><font color="#0433ff"> 1 can't</font></div><div><font color="#0433ff"> 2 it</font></div><div><font color="#0433ff"> 1 on</font></div><div><font color="#0433ff"> 1 put</font></div><div><font color="#0433ff"> 2 table</font></div><div><font color="#0433ff"> 1 the</font></div><div><font color="#0433ff"> 1 you</font></div><div><font color="#0433ff">------------------------------</font></div><div><font color="#0433ff"> 8 Total number of different item types used</font></div><div><font color="#0433ff"> 10 Total number of items (tokens)</font></div><div><font color="#0433ff">0.800 Type/Token ratio</font></div></div><div><br></div><div><div>If you run FREQ on <font color="#ff2600">morphological analysis words</font>, then you will get result:</div></div><div><br></div><div><font color="#0433ff"> 1 coord|and<br> 1 det:art|the<br> 1 mod|can<br> 1 neg|not<br> 1 n|table<br> 1 prep|on<br> 2 pro:per|it<br> 1 pro:per|you<br> 1 v|put&ZERO<br> 1 v|table<br>------------------------------<br> 10 Total number of different item types used<br> 11 Total number of items (tokens)<br>0.909 Type/Token ratio<br></font><br></div><div>
<div>If you run FREQ on <font color="#ff2600">lemmas</font>, then you will get result:</div><div><br></div><font color="#0433ff"> 1 and<br> 1 can<br> 2 it<br> 1 not<br> 1 on<br> 1 put<br> 2 table<br> 1 the<br> 1 you<br>------------------------------<br> 9 Total number of different item types used<br> 11 Total number of items (tokens)<br>0.818 Type/Token ratio<br></font><br><br></div><div><br></div><div>Leonid.
</div>
<div><br><blockquote type="cite"><div>On Jul 2, 2020, at 14:13, Cynthia Audisio <<a rel="nofollow noreferrer">cpau...@gmail.com</a>> wrote:</div><br><div><div dir="ltr">Hello Chibolts,<div><br></div><div>I've got a group of files, each of them's got several "gems" with play situations. Is it possible to get separate type/token totals and ratios for each of the gems in a file ?</div><div>This is how the file looks:</div><div><br></div><div>.</div><div>.</div><div>@Bg: play1</div><div>.</div><div>.</div><div>.</div><div>@Eg: play1</div><div>.</div><div>.</div><div>.</div><div>.</div><div><div>@Bg: play2</div><div>.</div><div>.</div><div>.</div><div>@Eg: play2</div></div><div>.</div><div>.</div><div><br></div><div>and what I need is individual type/token counts and ratio for each play situation (play 1, play 2, etc). Up to now I've run gemfreq which yields a freq list (not total number of type/token and type/token ratio, which is what i need).</div><div>Thanks,</div></div><div><br></div>
-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a rel="nofollow noreferrer">chib...@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/65a859f6-26e9-487f-9246-761b46de9bb3o%40googlegroups.com?utm_medium=email&utm_source=footer" rel="nofollow noreferrer" target="_blank">https://groups.google.com/d/msgid/chibolts/65a859f6-26e9-487f-9246-761b46de9bb3o%40googlegroups.com</a>.<br>
</div></blockquote></div><br></div></div></blockquote></div><div><br></div>
-- <br>
You received this message because you are subscribed to the Google Groups "chibolts" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank" rel="noreferrer">chibolts+unsubscribe@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/7834c54f-e966-4f88-95db-2d76d2e0e743o%40googlegroups.com?utm_medium=email&utm_source=footer" target="_blank" rel="noreferrer">https://groups.google.com/d/msgid/chibolts/7834c54f-e966-4f88-95db-2d76d2e0e743o%40googlegroups.com</a>.<br>
</div></blockquote></div><br></div></div>
<p></p>
-- <br>
You received this message because you are subscribed to a topic in the Google Groups "chibolts" group.<br>
To unsubscribe from this topic, visit <a href="https://groups.google.com/d/topic/chibolts/2r8Ifpxd-44/unsubscribe" target="_blank" rel="noreferrer">https://groups.google.com/d/topic/chibolts/2r8Ifpxd-44/unsubscribe</a>.<br>
To unsubscribe from this group and all its topics, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com" target="_blank" rel="noreferrer">chibolts+unsubscribe@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/239D98C6-A98A-46BF-A7AE-A0C68A9841C8%40andrew.cmu.edu?utm_medium=email&utm_source=footer" target="_blank" rel="noreferrer">https://groups.google.com/d/msgid/chibolts/239D98C6-A98A-46BF-A7AE-A0C68A9841C8%40andrew.cmu.edu</a>.<br>
</blockquote></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/chibolts/CAEACiGmsaW5rjfYkS%3D0s8O1bsttOBM8gY4S3%2By2cFUc1dO4yoA%40mail.gmail.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/CAEACiGmsaW5rjfYkS%3D0s8O1bsttOBM8gY4S3%2By2cFUc1dO4yoA%40mail.gmail.com</a>.<br />