MLU giving odd results
Sarah Surrain
sarahsurrain at gmail.com
Wed Oct 9 22:18:09 UTC 2024
I tried the MLU command with the same file (020304) and replicated the same result as Peter.
Could it have to do with how the MLU command handles utterances with repeated words?
For example, these are some of the longest utterances in this transcript:
*CHI: bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer bulldozer .
*CHI: put dirt up (.) put dirt up (.) put dirt up .
*CHI: look look look look .
Sarah Surrain, Ph.D.
Postdoctoral Research Fellow (she/her/ella)
Children’s Learning Institute
McGovern Medical School at UTHealth
7000 Fannin St | 2460A | Houston, TX 77030
713-500-3826
www.childrenslearninginstitute.org<http://www.childrenslearninginstitute.org>
https://sarahsurrain.com/
From: chibolts at googlegroups.com <chibolts at googlegroups.com> on behalf of Gordon, Peter <pgordon at tc.edu>
Date: Wednesday, October 9, 2024 at 5:15 PM
To: Nan Bernstein Ratner <nratner at umd.edu>
Cc: chibolts at googlegroups.com <chibolts at googlegroups.com>
Subject: Re: MLU giving odd results
Yes it does them sequentially. I've always done it that way.
On Wed, Oct 9, 2024 at 6:01 PM Nan Bernstein Ratner <nratner at umd.edu<mailto:nratner at umd.edu>> wrote:
I could be wrong but it looks like your command put in ALL of Adam's files? *.cha?
On Wed, Oct 9, 2024, 5:47 PM Gordon, Peter <pgordon at tc.edu<mailto:pgordon at tc.edu>> wrote:
I just taught a class where students do a simple MLU analysis to get used to CHILDES. As I was doing it in class I noticed that the MLUs for Adam did not look right. His MLU for the first sample was 4.176, despite having mostly single word utterances. Any thoughts?
Peter
mlu +tchi childes/Eng-NA/Brown/Adam/*.cha
Wed Oct 9 17:38:29 2024
mlu (29-Oct-2020) is conducting analyses on:
ONLY dependent tiers matching: %MOR;
****************************************
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020304.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 1239, morphemes = 5174
Ratio of morphemes over utterances = 4.176
Standard deviation = 2.946
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020318.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 1272, morphemes = 5062
Ratio of morphemes over utterances = 3.980
Standard deviation = 2.767
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020403.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 830, morphemes = 3964
Ratio of morphemes over utterances = 4.776
Standard deviation = 3.062
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020415.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 774, morphemes = 2870
Ratio of morphemes over utterances = 3.708
Standard deviation = 2.546
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020430.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 837, morphemes = 3679
Ratio of morphemes over utterances = 4.395
Standard deviation = 3.146
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020512.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 810, morphemes = 3392
Ratio of morphemes over utterances = 4.188
Standard deviation = 3.216
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020603.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 849, morphemes = 4548
Ratio of morphemes over utterances = 5.357
Standard deviation = 4.005
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020617.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 635, morphemes = 4197
Ratio of morphemes over utterances = 6.609
Standard deviation = 4.429
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020701.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 853, morphemes = 4596
Ratio of morphemes over utterances = 5.388
Standard deviation = 3.860
_________________________________________________________________
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
>From file <childes/Eng-NA/Brown/Adam/020714.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):
Number of: utterances = 912, morphemes = 5096
Ratio of morphemes over utterances = 5.588
Standard deviation = 4.284
--
Peter Gordon
Pronouns: He/His/Him
Associate Professor
Biobehavioral Sciences and Human Development
Teachers College, Columbia University
525 West 120th Street<https://www.google.com/maps/search/525+West+120th+Street?entry=gmail&source=g>, Box 306
New York, NY 10027
Email pgordon at tc.edu<mailto:pgordon at tc.edu> | p: (212) 678-8162
[Image removed by sender.]
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com<mailto:chibolts+unsubscribe at googlegroups.com>.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/CAJE3P%2B_TiyPA_-oQBAMppkUF6mP6OBpSZUjK3W_KoJDK5BcK7g%40mail.gmail.com<https://groups.google.com/d/msgid/chibolts/CAJE3P%2B_TiyPA_-oQBAMppkUF6mP6OBpSZUjK3W_KoJDK5BcK7g%40mail.gmail.com?utm_medium=email&utm_source=footer>.
--
Peter Gordon
Pronouns: He/His/Him
Associate Professor
Biobehavioral Sciences and Human Development
Teachers College, Columbia University
525 West 120th Street, Box 306
New York, NY 10027
Email pgordon at tc.edu<mailto:pgordon at tc.edu> | p: (212) 678-8162
[Image removed by sender.]
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com<mailto:chibolts+unsubscribe at googlegroups.com>.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/CAJE3P%2B_rk0iuP3MRbVei8TH923jtAx3%2BtJdjddH3%2Bhj8MKdfMg%40mail.gmail.com<https://groups.google.com/d/msgid/chibolts/CAJE3P%2B_rk0iuP3MRbVei8TH923jtAx3%2BtJdjddH3%2Bhj8MKdfMg%40mail.gmail.com?utm_medium=email&utm_source=footer>.
--
You received this message because you are subscribed to the Google Groups "chibolts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to chibolts+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/chibolts/SA1P220MB1343847D83561EFF8A12BEB6A77F2%40SA1P220MB1343.NAMP220.PROD.OUTLOOK.COM.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/chibolts/attachments/20241009/c245555c/attachment-0001.htm>
More information about the Chibolts
mailing list