[Corpora-List] rare words

N M Chipere n.chipere at reading.ac.uk
Wed Jun 18 09:44:13 UTC 2003


Dear all,

Is anyone familiar with the issues surrounding the definition and
measurement of word rarity? My colleagues and I are currently treating
the two thousand most frequent words in English as common words and
the rest as rare (excluding proper nouns, numerals, etc.). Apart from the
question of where to put the cut-off point, there is an obvious problem
with homographs, for which we don't have a simple solution.
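
To make the scheme concrete, here is a minimal sketch in Python. It is
only an illustration under stated assumptions: the function names, the
`cutoff` parameter, and the `excluded` set of proper nouns are all
hypothetical, the frequency list is built from whatever reference tokens
are passed in, and numerals are crudely detected as any token containing
a digit.

```python
from collections import Counter


def build_common_set(reference_tokens, cutoff=2000):
    """Return the `cutoff` most frequent word forms in a reference token list.

    Counting is by lowercased spelling only, so homographs (e.g. the two
    senses of "bank") share a single count. That is the unsolved problem
    mentioned above, not something this sketch fixes.
    """
    counts = Counter(tok.lower() for tok in reference_tokens)
    return {word for word, _ in counts.most_common(cutoff)}


def classify(tokens, common, excluded=frozenset()):
    """Label each token as 'common', 'rare', or 'excluded'.

    `excluded` is an assumed, externally supplied set of proper nouns;
    tokens containing a digit are treated as numerals and also excluded.
    """
    labels = []
    for tok in tokens:
        if tok in excluded or any(ch.isdigit() for ch in tok):
            labels.append((tok, "excluded"))
        elif tok.lower() in common:
            labels.append((tok, "common"))
        else:
            labels.append((tok, "rare"))
    return labels
```

Moving the cut-off is then just a matter of changing `cutoff`, which
makes it easy to check how sensitive the common/rare split is to that
choice.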

I'd be grateful for any feedback.

Ngoni

*********************************************************************
Dr Ngoni Chipere
Institute of Education
The University of Reading
Reading
Berkshire RG6 1HY

tel: 0118 987 5123 x 4943
**********************************************************************


