Hello again,<div>Some of my students pointed out today that they are getting different MLU results when they run it within the browser versus in CLAN. The effect seems to be widespread--not just one corpus. They noticed discrepancies with the Tardif corpus at first but then found more.</div><div><br /></div><div>Taking Eve (Brown corpus) file 020000a in the browser as an example, the command</div><div>mlu +t*CHI 020000a.cha yields:</div><div><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: rgb(245, 245, 245);">From file <childes/Eng-NA/Brown/Eve/020000a.cha>
MLU for Speaker: *CHI:
MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts): </span></div><div><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: rgb(245, 245, 245);">Number of: utterances = 424, morphemes = </span><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: yellow;">3687 </span></div><div><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: rgb(245, 245, 245);">Ratio of morphemes over utterances = </span><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: yellow;">8.696 </span></div><div><span style="box-sizing: border-box; font-family: "Courier New", Courier, monospace; color: rgb(0, 0, 0); background-color: rgb(245, 245, 245);">Standard deviation = 5.953</span></div><div><br /></div><div><div>That can't be correct. </div><div><br /></div><div>In downloaded transcripts using CLAN, the same command yields:</div><div><font face="Courier New">From file <C:\talkbank\clan\Brown\Eve\020000a.cha></font></div><div><font face="Courier New">MLU for Speaker: *CHI:<br /> MLU (xxx, yyy and www are EXCLUDED from the utterance and morpheme counts):<br /><span style="white-space: pre;"> </span>Number of: utterances = 424, morphemes = <span style="background-color: yellow;">1468</span><br /><span style="white-space: pre;"> </span>Ratio of morphemes over utterances = <span style="background-color: yellow;">3.462</span><br /><span style="white-space: pre;"> </span>Standard deviation = 1.975</font></div></div><div><br /></div><div>Any advice would be appreciated.</div><div><br /></div><div>Thanks,</div><div>Jenny</div><div><br /></div><div><br /></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "chibolts" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:chibolts+unsubscribe@googlegroups.com">chibolts+unsubscribe@googlegroups.com</a>.<br />
To view this discussion visit <a href="https://groups.google.com/d/msgid/chibolts/cafe4c39-c9f5-44d6-aae3-3d547b810828n%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/chibolts/cafe4c39-c9f5-44d6-aae3-3d547b810828n%40googlegroups.com</a>.<br />