[Corpora-List] IDF values

Clive De Silva cd334 at cam.ac.uk
Wed May 12 08:24:24 UTC 2004


Hi all.

I need to get IDF values for an American corpus of at least 100MW words. I have access to TREC4 and TREC5 corpus but would prefer to not have to extract the information 'manually' and was wondering if there are IDF values out there already calculated from a large corpus. If not, are there any tools for extracting IDFs efficiently?

Regards,

Clive De Silva
MPhil student at the Computing Lab
University of Cambridge, UK
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20040512/e0a208b5/attachment.htm>


More information about the Corpora mailing list