gemfreq type/token counts ratios

Cynthia Audisio cpaudisio at gmail.com
Thu Jul 2 20:49:36 UTC 2020


Hello Leonid,

Our data have @ID: headers and we want to get type/token totals and the ratio 
for *speaker words*. It would be a great help to get the output as a 
spreadsheet. Gemfreq does almost all the work. The only missing information 
is the part shown in blue in the following sample output:

From file <........>
  3 tiers in gem "disputa_1":
      1 abuela
      2 dije
      1 duele
      1 eso
      1 heladito
      1 idea
      2 la
      2 mamá
      1 me
------------------------------
    9  Total number of different item types used
   11  Total number of items (tokens)
0.818  Type/Token ratio

From file <........>
  4 tiers in gem "disputa_2":
      3 no
      1 porque
      1 sí
      2 tenía
      1 un
      1 yo
------------------------------
    9  Total number of different item types used
   11  Total number of items (tokens)
0.818  Type/Token ratio


etc.
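
For reference, the missing totals can also be recomputed outside CLAN from a saved frequency listing. The following is only a minimal sketch, assuming the listing has the layout of the sample above (a `N tiers in gem "name":` header followed by `count word` lines); the names `summarize_gems` and `to_csv` are hypothetical helpers, not part of CLAN. The totals are derived from the listed counts only, so any footer lines already present are ignored.

```python
import csv
import io
import re

# Assumed layout, taken from the sample listing (not from CLAN documentation):
#   3 tiers in gem "disputa_1":
#       1 abuela
GEM_HEADER = re.compile(r'^\s*\d+\s+tiers in gem "([^"]+)":')
FREQ_LINE = re.compile(r'^\s*(\d+)\s+(\S+)\s*$')


def summarize_gems(text):
    """Return one (gem, types, tokens, ttr) tuple per gem block in `text`."""
    blocks = []      # list of (gem name, list of per-word counts)
    current = None   # counts list of the block being read, if any
    for line in text.splitlines():
        header = GEM_HEADER.match(line)
        if header:
            current = []
            blocks.append((header.group(1), current))
            continue
        if line.startswith("-----"):
            # Separator: anything after it is a totals footer, not a word.
            current = None
            continue
        freq = FREQ_LINE.match(line)
        if freq and current is not None:
            current.append(int(freq.group(1)))
    rows = []
    for gem, counts in blocks:
        types, tokens = len(counts), sum(counts)
        ratio = round(types / tokens, 3) if tokens else 0.0
        rows.append((gem, types, tokens, ratio))
    return rows


def to_csv(rows):
    """Render the rows as CSV text that a spreadsheet program can open."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["gem", "types", "tokens", "ttr"])
    writer.writerows(rows)
    return buffer.getvalue()
```

Saving the string returned by `to_csv(summarize_gems(listing))` to a `.csv` file gives one row per gem. Gems with the same name in different files stay on separate rows, because each header starts a new block.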





On Thursday, July 2, 2020, 17:05:33 (UTC-3), Leonid Spektor wrote:
>
> Cynthia,
>
> First I need to get more information from you. Do your data files have 
> @ID: headers? Do you want to get type/token counts and the type/token ratio 
> for speaker words, for morphological analysis words, or for lemmas? Do you 
> want the output in plain readable text format or in Excel format? Different 
> answers to those questions will require different commands to get the exact 
> result that you want.
>
> Please allow me to explain the reason for my second question. For example, 
> if you have the following sentence:
>
> *MOT: you can't put it on the table and table it.
>
> If you run FREQ on speaker words, you will get this result:
>
>   1 and
>   1 can't
>   2 it
>   1 on
>   1 put
>   2 table
>   1 the
>   1 you
> ------------------------------
>     8  Total number of different item types used
>    10  Total number of items (tokens)
> 0.800  Type/Token ratio
>
> If you run FREQ on morphological analysis words, you will get this result:
>
>   1 coord|and
>   1 det:art|the
>   1 mod|can
>   1 neg|not
>   1 n|table
>   1 prep|on
>   2 pro:per|it
>   1 pro:per|you
>   1 v|put&ZERO
>   1 v|table
> ------------------------------
>    10  Total number of different item types used
>    11  Total number of items (tokens)
> 0.909  Type/Token ratio
>
> If you run FREQ on lemmas, you will get this result:
>
>   1 and
>   1 can
>   2 it
>   1 not
>   1 on
>   1 put
>   2 table
>   1 the
>   1 you
> ------------------------------
>     9  Total number of different item types used
>    11  Total number of items (tokens)
> 0.818  Type/Token ratio
>
>
>
> Leonid. 
>
> On Jul 2, 2020, at 14:13, Cynthia Audisio <cpau... at gmail.com> wrote:
>
> Hello Chibolts,
>
> I've got a group of files, each of which has several "gems" with play 
> situations. Is it possible to get separate type/token totals and ratios for 
> each of the gems in a file?
> This is how the file looks:
>
> .
> .
> @Bg:    play1
> .
> .
> .
> @Eg:    play1
> .
> .
> .
> .
> @Bg:    play2
> .
> .
> .
> @Eg:    play2
> .
> .
>
> and what I need is individual type/token counts and the ratio for each play 
> situation (play1, play2, etc.). So far I've run gemfreq, which yields a 
> frequency list (not the total number of types and tokens and the type/token 
> ratio, which is what I need).
> Thanks,
>
> -- 
> You received this message because you are subscribed to the Google Groups 
> "chibolts" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to chib... at googlegroups.com.
> To view this discussion on the web visit 
> https://groups.google.com/d/msgid/chibolts/65a859f6-26e9-487f-9246-761b46de9bb3o%40googlegroups.com.
>
>
>

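The @Bg:/@Eg: layout quoted above can also be tallied directly, without going through a frequency listing. This is only a rough sketch under loud assumptions: it treats every line starting with `*` as a speaker tier, splits on whitespace, strips basic punctuation, and lowercases. It does not handle CHAT codes, continuation lines, or nested gems, and it will not reproduce FREQ's own exclusion rules. The name `ttr_by_gem` is hypothetical.

```python
from collections import Counter


def ttr_by_gem(lines):
    """Map each gem label to (types, tokens, ratio) over its speaker tiers."""
    words = {}       # gem label -> Counter of word forms
    current = None   # label of the gem we are currently inside, if any
    for line in lines:
        if line.startswith("@Bg:"):
            current = line[4:].strip()
            words.setdefault(current, Counter())
        elif line.startswith("@Eg:"):
            current = None
        elif line.startswith("*") and current is not None:
            # Keep only the text after the "*SPK:" speaker code.
            _, _, utterance = line.partition(":")
            for token in utterance.split():
                # Simplification: drop surrounding punctuation, ignore case.
                token = token.strip(".,!?;:").lower()
                if token:
                    words[current][token] += 1
    result = {}
    for gem, counter in words.items():
        types, tokens = len(counter), sum(counter.values())
        ratio = round(types / tokens, 3) if tokens else 0.0
        result[gem] = (types, tokens, ratio)
    return result
```

Dependent tiers (lines starting with `%`) and anything outside a gem are skipped, so the counts correspond to speaker words only, in the sense of the first FREQ example.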