[Corpora-List] Re: Chomsky

Shlomo Izre'el izreel at post.tau.ac.il
Thu Oct 14 16:00:14 UTC 2004


I don't have the original by Leech, but here is what I have in my files:
"Any natural corpus will be skewed. Some sentences won't occur because 
they are obvious, others because they are false, still others because 
they are impolite. The corpus, if natural, will be so wildly skewed 
that the description would be no more than a mere list."
(Chomsky in Leech, The State of the Art in Corpus Linguistics, 1991, p. 8)
Shlomo Izre'el

On Oct 14, 2004, at 4:08 PM, Bob Knippen wrote:

>
>
> Mª Belén Díez Bedmar wrote:
>
>  > I'm looking for the exact bibliographical reference where we can find
>  > Chomsky's idea that a corpus presents a language that is defective or
>  > corrupted.
>
> To my knowledge, he never says any such thing.
>
> He does say, in several places (Syntactic Structures, 1957 comes to
> mind), that corpora do not provide the kind of information about
> linguistic competence that Linguistics ought to be after.
>
> In particular, he says that corpora do not provide information about
> what is ungrammatical, and he says something to the effect that
> corpora, being finite, do not shed light on the infinite generative
> capacity of language.  (That is, a statistical model based on a
> particular corpus is not a model of the language in general).
>
> I very much doubt he wrote that a corpus presents a language that is
> defective or corrupted.
>
> Bob
>
>
> -- 
> Bob Knippen
> Computer Science Department
> 110 Volen Center
> Mail Stop 018
> Brandeis University
> 415 South Street
> Waltham, MA 02254-9110
> 781-736-2745
> http://www.cs.brandeis.edu/~knippen
>
_______________________________________________________
Shlomo Izre'el
Professor of Semitic Linguistics
Department of Hebrew and Semitic Languages
Webb Building #516
Tel Aviv University                      Home address:
POB 39040                                   Simtat Neve-Tsedek 7
IL-61390 Tel Aviv                        IL-65154 Tel Aviv
Israel                                              Israel
Tel. +972-3-640 5016                 Tel. +972-3-517 5341
Fax. +972-3-640 7031                Fax. +972-3-510 1867
        +972-3-640 9457
izreel at post.tau.ac.il
http://www.tau.ac.il/humanities/semitic/izreel.html

The Corpus of Spoken Israeli Hebrew:
http://www.tau.ac.il/humanities/semitic/maamad.html (Hebrew text)
http://www.tau.ac.il/humanities/semitic/cosih.html (English text)