[Corpora-List] word frequencies on the web
    William Fletcher 
    fletcher at usna.edu
       
    Fri Dec  8 18:54:00 UTC 2006
    
    
  
Dear Tony,
I have lists of words occurring 100 or more and 10 or more times
respectively in the preliminary version of a dynamic Web Corpus I am
compiling for "Phrases in English".  Since you cannot reach PIE directly, I
put them on my KWiCFinder site:
http://www.kwicfinder.com/WebCorpus2006_min100.html 
tab-separated text files
http://www.kwicfinder.com/WebCorpus2006_min100.txt 
http://www.kwicfinder.com/WebCorpus2006_min10.txt
Corpus currently has 97,198,272 tokens and 525,509 types, of which 30,524
occur 100 or more times 104,675 tokens occur 10 or more times 
Regards,
Bill Fletcher
-----Original Message-----
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Tony Berber Sardinha
Sent: Friday, December 08, 2006 11:44 AM
To: CORPORA
Subject: [Corpora-List] word frequencies on the web
Dear all, does anyone know of ways to estimate the frequency of words on the
web, or if there're search engines that supply this info (as Altavista used
to do)?
thank you!
tony
www2.lael.pucsp.br/~tony
    
    
More information about the Corpora
mailing list