[Corpora-List] RE : Chomsky and computational linguistics

Santos Diana Diana.Santos at sintef.no
Sat Sep 1 13:48:15 UTC 2007


Although I have not participated -- mainly because I came from holidays already in the middle of it -- I tremendously enjoyed it, and agree with Cécile and Rob that this is the (one) right place to discuss such issues.

So please go on! 

I avow that discussions of personal atitudes by Chomsky (such as whether he recognized he was worong or not) strike me as uninteresting. Much has already been written about the (science) political atitudes by Chomsky as a scholar (see e.g. Geoffrey Sampson's books) and his influence in the linguistics discipline. But a question that may still be pertinent to ask -- and discuss -- was whether anything he suggested is relevant to corpus linguistics.
 
(Incidentally, I hate this designation, our discipline should be called "empirical linguistics" or at least "linguistics using corpora" or "corpus-based NLP"...)

To add my own bit of discussion, the "corpus of a language" is not the reason we do corpus linguistics -- my impression being that a corpus is to be thought as a sample, a sample that we can manipulate and observe externally (and, therefore, discuss with others our findings on that corpus, and replicate them.) 

I am not sure, either, that anyone is looking for a complete grammar - or the most compact description of a corpus (this last one seems to me VERY suspicious, if a corpus is a sample).

I think that most people doing corpus linguistics see a corpus as a (near) perfect exploratory testbed, where instantaneous access to a lot of intuitions and speech practices can be found, as well as a good (although carefully dealt with) testbed for more developed hypotheses. (For this one you might require carefully designed new corpora, in fact...)

There is also another branch (flavour) of corpus linguistics (?) where you just test and train your own systems, of course, and then the goal is to aid system development. This is the engineering side of corpus linguistics, that again is not well described by its name. "Corpus-based testing & development" might a better name?

In any case, if the corpora-list only had conference announcements and requests for particular applications for particular languages, it would not be half as interesting (IMO) as it is now, thanks to Mike Maxwell, Rob Freeman and others:-)

Diana


________________________________

	From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Rob Freeman
	Sent: 1. september 2007 05:48
	To: Mike Maxwell; CORPORA at UIB.NO
	Subject: Re: [Corpora-List] RE : Chomsky and computational linguistics
	
	
	For me the question of interest is the complexity of descriptions of corpora. Is a complete grammar possible, or is the corpus of a language is the most compact description of itself?
	
	If Chomsky's work is relevant to that, why not talk about it? 
	
	Personally I could do without the "he was right"/"he was wrong" stuff too. It buries the interesting issues and is meaningless in the abstract.
	
	So let's keep it tight to the science/engineering issue. 
	
	But let's talk about it.
	
	If the completeness of grammatical descriptions of corpora is not an appropriate topic for the Corpora list, what is?
	
	-Rob
	
	
	On 8/31/07, Mike Maxwell <maxwell at umiacs.umd.edu> wrote: 

		Cécile Yousfi wrote:
		> I'm a mere user of the BNC and I'm no Chomsky specialist, but I do enjoy
		> reading interesting discussion on the subject. So please go on
		> discussing the matter on the list.
		> 
		> It's intellectually stimulating to have a genuine dialogue on a
		> theoretical subject, and to be confronted to different points of view.
		
		As one who posted on this subject a couple months ago (and probably 
		posted too much :-), not to mention representing the strident minority
		on this list), I have to say that there's probably a better forum.  This
		list is, after all, about corpora; and while the discussion could have 
		been about whether modeling corpora is about science or engineering, it
		tended to be more about whether Chomsky's approach had any validity, or
		whether he should have admitted defeat, or about other issues that (at 
		least IMO) have less to do with corpora.
		
		I would welcome suggestions for a more appropriate forum.
		--
		        Mike Maxwell
		        maxwell at umiacs.umd.edu
		        "Theorists...have merely to lock themselves in a room
		        with a blackboard and coffee maker to conduct their business."
		        --Bruce A. Schumm, Deep Down Things
		
		_______________________________________________ 
		Corpora mailing list
		Corpora at uib.no
		http://mailman.uib.no/listinfo/corpora
		



_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list