[Corpora-List] Google searches as linguistic evidence
William Fletcher
fletcher at usna.edu
Thu Dec 7 13:39:37 UTC 2006
I too was amazed that a number of _an workshop_ hits may be from native speakers.
Google advanced search specifying English as language and UK as domain drastically reduces the hit count for the an-variant. With these filters it is immediately obvious which usage predominates, by a factor of 20,000:1. (Some webpages with German text did slip by the filters; search engines often mislabel the language of a document, and have no way to identify multilingual text.)
All the examples of _an w*_ I found in the BNC seem to be _an'_ = _and_.
Regards,
Bill Fletcher
---- Original message ----
>Date: Thu, 07 Dec 2006 12:58:46 +0000
>From: Diana Maynard <d.maynard at dcs.shef.ac.uk>
>Subject: Re: [Corpora-List] Google searches as linguistic evidence
>To: Fanny Meunier <fanny.meunier at uclouvain.be>
>Cc: corpora at lists.uib.no
>
>Indeed. I looked through some of them and there were some like that, but
>many genuine ones too
>Diana
>
>Fanny Meunier wrote:
>> Hi there,
>>
>> Your question puzzled me and I googled "a worshop" (7840000 hits) vs
>> "an workshop" (21500 hits).
>>
>> It struck me that they were quite a lot of German refs such as
>> Sie bitte *an workshop*@... (= sthg like: please see workshop at ...)
>> schicken Sie bitte eine Email *an workshop* (= sthg like: please send
>> an e-mail to workshop at ...)
>> direkt per E-Mail *an workshop*@... (= directly via e-mail to
>> workshop at ...)
>>
>> Food for thought...
>>
>> All the best,
>> Fanny
>>
>>
>>
>
More information about the Corpora
mailing list