Type/token ratio question

E C Kelty eckelty at gmail.com
Fri May 29 15:38:43 UTC 2009


Hello,
I have a question about measuring type/token ratios of linguistic
forms using CLAN.  I have transcripts of mother/child speech, and I am
trying to calculate MLUs as well as frequencies of wh-questions,
auxiliary verbs, and a few other things.  I am wondering if there is a
way to run these commands so that, for example, "did" and "didn't" are
counted as two different tokens of the same word type.

Here is the output I got when searching for auxiliary verbs:

freq +t*MOT +t%mor +saux|* A.Z.visit3.mor.pst.cex
Thu May 28 15:12:26 2009
freq (18-Jul-2008) is conducting analyses on:
  ONLY speaker main tiers matching: *MOT;
	and those speakers' ONLY dependent tiers matching: %MOR;
****************************************
From file <A.Z.visit3.mor.pst.cex>
  2 aux|be&PRES
  4 aux|can
 10 aux|do
  8 aux|do&3S
  1 aux|do&3S~neg|not
  4 aux|do&PAST
  3 aux|do~neg|not
------------------------------
    7  Total number of different word types used
   32  Total number of words (tokens)
0.219  Type/Token ratio

Could I write a command that would count all those examples of "do" as
the same type, thus giving me an overall type count of 3 and token
count of 32?
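
To make the numbers I am after concrete, here is a rough sketch in Python (not a CLAN command) of the collapsing I have in mind, applied to the freq output pasted above. The helper names stem_of and collapsed_ttr are made up for illustration, and the sketch assumes that the characters &, - and ~ always separate the stem from inflections and clitics on the %mor tier, which may not hold for every item.

```python
import re

# Counts copied from the freq output above: (count, %mor item) per line.
FREQ_LINES = """\
  2 aux|be&PRES
  4 aux|can
 10 aux|do
  8 aux|do&3S
  1 aux|do&3S~neg|not
  4 aux|do&PAST
  3 aux|do~neg|not
"""

def stem_of(item):
    """Return the 'pos|stem' part of a %mor item (hypothetical helper).

    Assumption: everything from the first &, - or ~ onward is an
    inflection or clitic and can be dropped.
    """
    return re.split(r"[&\-~]", item, maxsplit=1)[0]

def collapsed_ttr(freq_text):
    """Sum token counts per stem and recompute the type/token ratio."""
    counts = {}
    for line in freq_text.splitlines():
        parts = line.split()
        # Skip anything that is not a "<count> <item>" line.
        if len(parts) != 2 or not parts[0].isdigit():
            continue
        key = stem_of(parts[1])
        counts[key] = counts.get(key, 0) + int(parts[0])
    types = len(counts)
    tokens = sum(counts.values())
    ttr = types / tokens if tokens else 0.0
    return counts, types, tokens, ttr

counts, types, tokens, ttr = collapsed_ttr(FREQ_LINES)
print(counts)                     # tokens per collapsed type
print(types, tokens, round(ttr, 3))
```

With the counts above this gives 3 types (be, can, do) and 32 tokens, with all 26 "do" tokens grouped under aux|do.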

Any information or direction towards other places where I could learn
about this would be appreciated.

Thanks very much,
Emma Kelty
emma.kelty at uconn.edu
