National Dictionary Day on ABC World News (must-see!)

Laurence Horn laurence.horn at YALE.EDU
Fri Oct 19 02:42:41 UTC 2007


At 9:00 PM -0500 10/18/07, Gordon, Matthew J. wrote:
>He was referring to the size of the corpus:
>
>"Because the corpus is a collection of texts,
>there are not two billion different words: the
>humble word 'the', the commonest in the written
>language, accounts for almost 100 million of all
>the words in the corpus!"
>http://www.askoxford.com/oec/mainpage/?view=uk

Maybe we need to introduce a unit on the type/token distinction.
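
For anyone who wants that unit in miniature, here is a rough Python sketch. The function name, the crude tokenizing regex, and the sample sentence are all made up for illustration; this says nothing about how the Oxford English Corpus is actually tallied. Tokens are running words; types are distinct word forms. On this picture the corpus's two billion is a token count, and 'the' is a single type that contributes almost 100 million of those tokens.

```python
import re
from collections import Counter


def type_token_counts(text):
    """Return (number of tokens, number of types, per-type counts).

    Tokenization here is a deliberately crude assumption: lowercase the
    text and treat each run of letters/apostrophes as one word.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    return len(tokens), len(counts), counts


# Hypothetical sample sentence, not drawn from any corpus.
sample = "The cat sat on the mat because the mat was warm."
n_tokens, n_types, counts = type_token_counts(sample)

print("tokens:", n_tokens)            # running words
print("types:", n_types)              # distinct word forms
print("'the' tokens:", counts["the"])  # one type, several tokens
```

A corpus of N tokens can therefore contain far fewer than N types, which is the point of the passage quoted above.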

LH

>-----Original Message-----
>From: American Dialect Society on behalf of Tom Zurinskas
>Sent: Thu 10/18/2007 8:56 PM
>To: ADS-L at LISTSERV.UGA.EDU
>Subject:      Re: National Dictionary Day on ABC World News (must-see!)
>
>So Ben, does English have 1 billion or 2 billion
>words?  And what does "word" mean?
>Say 100 words with definitions would fit on a
>page; then it would take 10 million pages to list 1
>billion.
>
>Tom Zurinskas, USA - CT20, TN3, NJ33, FL5+
>See truespel.com - and the 4 truespel books plus
>"Occasional Poems" at authorhouse.com.
>
>
>
>
>>  Date: Thu, 18 Oct 2007 01:23:48 -0400
>>  From: bgzimmer at BABEL.LING.UPENN.EDU
>>  Subject: Re: National Dictionary Day on ABC World News (must-see!)
>>  To: ADS-L at LISTSERV.UGA.EDU
>>
>>  ---------------------- Information from the mail header -----------------------
>>  Sender: American Dialect Society
>>  Poster: Benjamin Zimmer
>>  Subject: Re: National Dictionary Day on ABC World News (must-see!)
>>
>>-------------------------------------------------------------------------------
>>
>>  On 10/17/07, Tom Zurinskas wrote:
>>>
>>>>  From: bgzimmer at BABEL.LING.UPENN.EDU
>>>>
>>>>  On 10/17/07, Tom Zurinskas wrote:
>>>>>
>>>>>  Good job, Ben. Is it 2 billion words in English? I thought I read
>>>>>  somewhere 1 billion.
>>>>
>>>>  I was talking about the two billion words in the Oxford English Corpus
>>>>  (which was only discussed obliquely in the snippets of the interview
>>>>  that aired). More here:
>>>>
>>>>  http://www.askoxford.com/oec/
>>>
>>>  I found nothing at the site address you gave. Too general.
>>
>>  If you click through to the links on that page, you'll find plenty of
>>  specific information. I also frequently write about the Corpus on
>>  OUPblog:
>>
>>  http://blog.oup.com/category/reference/a_to_zimmer/
>>
>>>  Regarding the number of words, I found this below. Turns out the
>>>  1 billion is overstated as it includes phrases. See
>>>
>>>  http://www.cbsnews.com/stories/2006/04/26/ap/strange/mainD8H7NGDG0.shtml
>>>
>>>  English Language Hits 1 Billion Words
>>
>>  That was a laughably bad headline that I wrote about on Language Log
>>  even before I began my OUP affiliation:
>>
>>  http://itre.cis.upenn.edu/~myl/languagelog/archives/003073.html
>>
>>
>>  --Ben Zimmer
>>
>>  ------------------------------------------------------------
>>  The American Dialect Society - http://www.americandialect.org
>

------------------------------------------------------------
The American Dialect Society - http://www.americandialect.org


