Hello dear list members,

I have an arithmetic question. If a particular expression occurs, let's say, 500 times in a 5-million-word corpus, can I assume that it will occur 100 times in a one-million-word corpus, or is there a statistical (probability) formula that I should apply?

Cheers,
Helene Stengers
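
To make the arithmetic in the question concrete, here is a minimal sketch of the proportional scaling, together with a rough idea of how much a count in a new corpus might vary around the scaled figure. It rests on assumptions that are not in the original post: that occurrences are independent and evenly spread through the text (a Poisson model, which real corpora often violate because expressions cluster by text and topic), and that the rate estimated from the larger corpus is taken as exact. The function names `scale_count` and `rough_poisson_range` are hypothetical, chosen only for illustration.

```python
import math


def scale_count(count, corpus_size, target_size):
    """Proportionally scale a raw count to a corpus of a different size."""
    return count * target_size / corpus_size


def rough_poisson_range(expected, z=1.96):
    """Approximate range around an expected count.

    Assumption: the count behaves like a Poisson variable, so its standard
    deviation is sqrt(expected); a normal approximation gives the range.
    """
    spread = z * math.sqrt(expected)
    return expected - spread, expected + spread


# Figures from the question: 500 occurrences in 5 million words,
# scaled to a corpus of 1 million words.
expected = scale_count(500, 5_000_000, 1_000_000)
low, high = rough_poisson_range(expected)

print(f"expected count: {expected:.1f}")
print(f"rough 95% range under the Poisson assumption: {low:.1f} to {high:.1f}")
```

Under these assumptions, 100 is the expected count rather than a guaranteed one, and a count anywhere from roughly 80 to 120 in a particular one-million-word corpus would be unsurprising. Clustering of the expression in particular texts would widen that range further.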