[Corpora-List] Deviations in language models on the web

Stefan Evert stefanML at collocations.de
Wed Nov 17 12:11:22 UTC 2010


On 16 Nov 2010, at 22:55, Adam Kilgarriff wrote:

> One is to piggyback on the large amounts of work that Google and Bing do to stay ahead of the spammers, e.g. by using BootCaT.  They are putting lots of effort into not giving spam as top search hits ...

... and in my experience (as a Google end-user) they are failing very badly at it.  If I search for anything that's even remotely sellable, it feels like 80% or so of the top Google hits are pure spam.

-Stefan




