[Corpora-List] Frequency of the pronoun I

Adam Funk a.funk at dcs.shef.ac.uk
Wed Sep 14 08:58:06 UTC 2011


[13/09/11 19:19] Rich Cooper wrote:
> Using "the/I" can lead to infinite values in
> corpora (scientific lit, patents) that never use
> the pronoun "I".  

The usual ;-) way to get around that problem is to calculate

(count("the") + 1) / (count("I") + 1)

instead.
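
A minimal sketch of that add-one ratio in Python, assuming the corpus is already available as a list of tokens and that matching is case-sensitive (so "I" is the pronoun and "The" is not counted as "the"). The function name smoothed_ratio and the sample sentence are made up for illustration only:

```python
from collections import Counter

def smoothed_ratio(tokens, numerator="the", denominator="I"):
    """Return (count(numerator) + 1) / (count(denominator) + 1).

    Adding 1 to both counts keeps the ratio finite even when the
    denominator word never occurs in the corpus.
    """
    counts = Counter(tokens)  # missing words count as 0
    return (counts[numerator] + 1) / (counts[denominator] + 1)

# Hypothetical patent-style text that never uses the pronoun "I".
tokens = "the method of the claim reduces the cost".split()
ratio = smoothed_ratio(tokens)  # (3 + 1) / (0 + 1), no division by zero
```

Note that the added 1 pulls the ratio towards 1 for very small corpora, so values are only comparable between corpora of broadly similar size.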
