[Corpora-List] Frequency of the pronoun I
Adam Funk
a.funk at dcs.shef.ac.uk
Wed Sep 14 08:58:06 UTC 2011
[13/09/11 19:19] Rich Cooper wrote:
> Using "the/I" can lead to infinite values in
> corpora (scientific lit, patents) that never use
> the pronoun "I".
The usual ;-) way to get around that problem is to calculate
(count("the") + 1) / (count("I") + 1)
instead.
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list