list of common words

Hedvig Skirgård hedvig.skirgard at gmail.com
Thu Jun 5 15:33:15 UTC 2014


COCA and the BNC both follow the Brown corpus structure and are much larger.

/Hedvig
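
A minimal sketch, in Python, of the count-by-exclusion idea Philip asks about further down the thread: tokenize a passage, then treat every token that is not on a common-word list as "rare". The function names and the tiny TOY_COMMON set are made up for illustration; in practice you would load a real frequency list (for example the top 3000 entries from one of the corpora mentioned here). The sketch does no lemmatization, so an inflected form such as "dogs" counts as rare unless the list itself contains it, and it only recognizes the plain ASCII apostrophe.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens.

    An internal ASCII apostrophe is kept, so "don't" stays one token.
    """
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def rare_word_stats(passage, common_words):
    """Count common vs. rare tokens, where 'rare' means 'not on the list'."""
    tokens = tokenize(passage)
    common = {w.lower() for w in common_words}
    rare_counts = Counter(t for t in tokens if t not in common)
    n_rare = sum(rare_counts.values())
    return {
        "total_tokens": len(tokens),
        "common_tokens": len(tokens) - n_rare,
        "rare_tokens": n_rare,
        "rare_types": sorted(rare_counts),
    }


# Hypothetical stand-in for a real common-word list.
TOY_COMMON = {"the", "a", "dog", "saw", "in", "garden"}

stats = rare_word_stats("The dog saw a hedgehog in the garden.", TOY_COMMON)
print(stats)
```

Dividing rare_tokens by total_tokens gives a rare-word proportion that can be compared across passages of different lengths.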


2014-06-05 17:30 GMT+02:00 Shelley Brundage <shelley.brundage at gmail.com>:

> Hi folks,
> I was remembering something by Thorndike and Lorge.  Here is some info
> from Wikipedia regarding "traditional lists of word frequency":
>
> Traditional lists
>
> The Teachers Word Book of 30,000 words (Thorndike and Lorge, 1944)
>
> The TWB contains 30,000 lemmas or ~13,000 word families (Goulden, Nation
> and Read, 1990). A corpus of 18,000,000 written words was hand analysed.
> The size of its source corpus increased its usefulness, but its age and
> language change have reduced its applicability (Nation 1997
> <http://en.wikipedia.org/wiki/Word_lists_by_frequency#CITEREFNation1997>).
>
> The General Service List
> <http://en.wikipedia.org/wiki/General_Service_List> (West, 1953)
>
> The GSL contains 2,000 headwords divided into two sets of 1,000 words. A
> corpus of 5,000,000 written words was analysed in the 1940s. Rates of
> occurrence (%) are provided for the different meanings and parts of speech
> of each headword, and the list reflects a careful application of criteria
> other than frequency and range. Thus, despite its age, some errors, and
> its solely written base, it is still an excellent database (word
> frequency, frequency of meanings, reduction of noise) (Nation 1997).
>
> The American Heritage Word Frequency Book (Carroll, Davies and Richman,
> 1971)
>
> A corpus of 5,000,000 running words, from written texts used in United
> States schools (various grades, various subject areas). Its value is in its
> focus on school teaching materials, and its tagging of words, namely the
> frequency of each word in each of the school grade levels and in each of
> the subject areas (Nation 1997).
>
> The Brown (Francis and Kucera, 1982), LOB and related corpora
>
> These each contain 1,000,000 words from written corpora representing
> different dialects of English. These sources are used to produce frequency
> lists (Nation 1997).
>
>
> On Thu, Jun 5, 2014 at 10:45 AM, Nan Bernstein Ratner <nratner at umd.edu>
> wrote:
>
>>  I think this is what folks want. My memory is that it was prepared by
>> Patton Tabors of Harvard Grad School of Ed; we have used it in a few
>> studies. I hope this attachment will go through. I do not know if there are
>> others; this is the one we have used.
>>
>>
>>
>> Best to all,
>>
>> Nan
>>
>>
>>
>>
>>
>> *From:* info-childes at googlegroups.com [mailto:
>> info-childes at googlegroups.com] *On Behalf Of *Erika Hoff
>> *Sent:* Thursday, June 05, 2014 10:30 AM
>> *To:* info-childes at googlegroups.com
>> *Subject:* Re: list of common words
>>
>>
>>
>> Me too.
>>
>>
>>
>> Erika
>>
>>
>>
>> On Thu, Jun 5, 2014 at 10:25 AM, Tager-Flusberg, Helen B <htagerf at bu.edu>
>> wrote:
>>
>>  Philip,
>>
>> I too am interested in this information ---can you send me useful replies?
>>
>>
>>
>> thanks!
>>
>> Helen
>>
>> _________________________________
>>
>> Helen Tager-Flusberg, Ph.D.
>>
>> Professor of Psychological & Brain Sciences, Boston University
>>
>> Professor of Anatomy & Neurobiology and Pediatrics, BUSM
>>
>>
>>
>> Address
>>
>> Department of Psychological & Brain Sciences
>>
>> Center for Autism Research Excellence
>>
>> 100 Cummington Mall
>>
>> Boston MA 02215
>>
>> T:  617-358-5919
>>
>> htagerf at bu.edu
>>
>> www.bu.edu/autism
>>
>> On Jun 5, 2014, at 10:19 AM, Philip Dale <dalep at unm.edu> wrote:
>>
>>
>>
>>    I have the memory that there is a list somewhere of the 3000 (?) most
>> common words in English, which can be used by exclusion to measure the use
>> of ‘rare’ words. Can someone point me to that list? Even better, is there
>> software available to scan a passage and compute the number of common vs.
>> rare words?  Many thanks.
>>
>> Philip Dale
>>
>>
>>
>>
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Info-CHILDES" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to info-childes+unsubscribe at googlegroups.com.
>> To post to this group, send email to info-childes at googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/info-childes/2e869b16c09143509f191dce9cb13bb3%40BN1PR07MB326.namprd07.prod.outlook.com
>> <https://groups.google.com/d/msgid/info-childes/2e869b16c09143509f191dce9cb13bb3%40BN1PR07MB326.namprd07.prod.outlook.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>>
>>
>>
>>
>>
>>
>>
>> --
>>
>> Erika Hoff, Professor
>>
>> Department of Psychology
>>
>> Florida Atlantic University
>>
>> 3200 College Ave.
>>
>> Davie, FL 33314
>>
>>
>
>
>
> --
> Shelley B. Brundage, Ph.D., CCC-S
> Associate Professor and Graduate Program Director
> ASHA Fellow
> Board Recognized Specialist and Mentor-Fluency Disorders
> Speech and Hearing Science department
> George Washington University
> 2115 G St NW Suite 201
> Washington, D.C. 20052
> (202) 994-5008 office
> (202) 994-2205 lab
> (202) 994-2589 fax
>
>
