[Corpora-List] Considering Distributions Across Texts

Andrew Caines andrewcaines7 at gmail.com
Mon Mar 3 10:38:08 UTC 2014


Hi Brian,
Jurafsky & Martin discuss term frequency and document frequency in their
textbook; others may know of specific research papers that have
investigated this.
Andrew


On 28 February 2014 16:16, Brian Schanding <bschanding at gmail.com> wrote:

> Hello,
>
> I'm working on research with learner corpora. My corpora aren't that big
> (approx. 250,000 wds with about 300-400 text files). I wonder what
> research/textbook sources anyone can point me to that discuss the
> importance of considering how many texts in the corpus a language feature
> occurs in (as opposed to merely considering overall frequency of a language
> feature within a corpus).
>
> Many Thanks!
> Brian
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20140303/c6447caf/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


More information about the Corpora mailing list