[Corpora-List] Deviations in language models on the web
Stefan Evert
stefanML at collocations.de
Wed Nov 17 12:11:22 UTC 2010
On 16 Nov 2010, at 22:55, Adam Kilgarriff wrote:
> One is to piggyback on the large amounts of work that Google and Bing do to stay ahead of the spammers, eg by using BootCaT. They are putting lots of effort into not giving spam as top search hits ...
... and in my experience (as a Google end-user) they are failing very badly at it. If I search for anything that's even remotely sellable, it feels like 80% or so of the top Google hits are pure spam.
-Stefan
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora