[Corpora-List] history of corpus linguistics
Adam Kilgarriff
adam at lexmasterclass.com
Sat Jan 6 10:10:34 UTC 2007
As well as Corpus Linguistics's "own" history (Brown, LOB, ICAME,
anti-Chomsky), two external influences need mentioning:
* lexicography - different agenda but responsible for lots of the
actual corpus-building work and innovation, at least in UK. BNC was
lexicography-led.
* NLP / computational linguistics, which has come into the field
like a schoolyard bully, forcing everything that's not computational into
submission, collusion or the margins. For history of this aspect, Church
and Mercer's Intro to the 1993 Sp Issue of Computational linguistics on
using large corpora (19 (1)) is great reading
Adam
-----Original Message-----
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Geoffrey Williams
Sent: 06 January 2007 08:29
To: Florian Petran
Cc: CORPORA at uib.no
Subject: Re: [Corpora-List] history of corpus linguistics
As far as I know there is no complete and unexpurgated history as yet, b
ut I very good background is given from in Graeme Kennedy's introduction
to corpus linguistics. I recently wrote an explanation of the origins
contextualist corpus linguistics for a French audience, who often seem
to confuse corpus linguistics and NLP, and also literary analysis with
quantitive methods. In looking for information John Sinclair pointed me
in the direction of a very interesting article by Léon (2005) which
kicks into touch the boring litany of Chomskyan influence on linguistics
this side of the pond. Given that John was a prime mover in the
development of corpus studies in the UK, the interview of with Wolfgang
Teubert in the introduction the recent republication of the OSTI report
by Ramesh Krishnamurthy (Sinclair et al 2004) is worth reading, as is
the report itself as it is a good lesson in humility with so much done
to lay the foundations of current methodology. Another source in the
same ilk is Sampson and McCarthy (2004) as this has texts from the pre
computer period as well as some foundational texts that are no longer
easily available.
Hope this helps
Best
Geoffrey
*Kennedy G.*1998. /An introduction to corpus linguistics./ London & New
York: Longman
*Léon, J*. 2005. Claimed and unclaimed sources of /Corpus
Linguistics/. /Henry Sweet Society Bulletin/. N°44. pp.36-50.
*G. Sampson and D. McCarthy (eds). */Corpus Linguistics: Readings// in a
widening discipline/. London and New York: Continuum, 2004
*Sinclair J. McH., Jones S., Daley R.* 2004. /English Collocation
Studies: The OSTI Report/. Londres - New York : Continuum.
Geoffrey Williams
Professeur des Universités en Sciences du Langage
Université de Bretagne Sud, Lorient, France
geoffrey.williams at univ-ubs.fr
Florian Petran a écrit :
> McEnery/Wilson : Corpus Linguistics. An introduction, Edinburgh: EUP
> 2005 have a chapter on the topic.
>
> Harris: The linguistics wars, Oxford: OUP 1993
> covers the debate with Chomsky, though does not deal explicitly with
> corpus linguistics.
>
> 2007/1/5, Ronald P. Reck <rreck at rrecktek.com>:
>
>> Can someone recommend sources for a history of corpus linguistics, and
>> more specifically string frequency analysis?
>>
>> Right now I have:
>>
>> Hockney, Susan, 2000. Electronic Texts in the Humanities Oxford
>> University Press
>>
>>
>> Thanks.
>>
>>
>
>
>
More information about the Corpora
mailing list