<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=iso-8859-1" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.18928">
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT size=2 face=Arial>Dear Corpora colleagues, some American linguists
e.g.</FONT></DIV>
<DIV><FONT size=2 face=Arial><FONT size=3 face="Times New Roman">Rob
Malouf and </FONT></FONT><FONT size=2 face=Arial><FONT size=3
face="Times New Roman">Stefan Th. Gries University of California, Santa Barbara
wrote:</FONT></FONT></DIV>
<DIV><FONT size=2 face=Arial><FONT size=3 face="Times New Roman">This is
especially true when you're comparing really big counts with really small
counts, which is I think what Adam's rule of thumb is meant to address.
Once you've decided that applying the chi-square test even makes sense, then
questions like significance levels and Bonferroni corrections come into play.
Rob Malouf <BR>Department of Linguistics and Asian / Middle Eastern Languages
San Diego State University<BR>I wonder if all the linguists on the Corpora list
are so advanced in math. statistics. Being a simple linguist I did not
understand anithing. I mean why it is not possible to use Chi-square criterion
when the samples are different in size. On the contrary, I read in the books on
Chi-square that it is also possible to use it when the samples are not equal.
However, I want to be on the safe side, so I take the equal samples when
comparing two transcribed texts. I usually take a sample of 10000 speech sounds
from longer texts. I take the sentences from the long texts at random. When the
sample is 10000 I stop. Is it not possible to use the Chi-square in this way? I
am sure the discussion of how to use and how not to use the Chi-square criterion
and other math. statistics criteria in linguistics is very important. Looking
forward to hearing for your advice to <A
href="mailto:yutamb@mail.ru">yutamb@mail.ru</A> Remain yours sincerely
Yuri Tambovtsev, Novosibirsk, Russia</FONT></DIV></FONT></BODY></HTML>