<div dir="ltr">Hi Brian,<div>Jurafsky & Martin discuss term frequency and document frequency in their textbook; others may know of specific research papers that have investigated this.</div><div>Andrew</div></div><div class="gmail_extra">
<br><br><div class="gmail_quote">On 28 February 2014 16:16, Brian Schanding <span dir="ltr"><<a href="mailto:bschanding@gmail.com" target="_blank">bschanding@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div dir="ltr"><div>Hello,</div><div><br></div><div>I'm working on research with learner corpora. My corpora aren't that big (approx. 250,000 wds with about 300-400 text files). I wonder what research/textbook sources anyone can point me to that discuss the importance of considering how many texts in the corpus a language feature occurs in (as opposed to merely considering overall frequency of a language feature within a corpus). </div>
<div><br></div><div>Many Thanks!</div><span class="HOEnZb"><font color="#888888"><div>Brian </div></font></span></div>
<br>_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br></div>