[Corpora-List] Package for LSA, tfidf, etc

Catherine Havasi havasi at mit.edu
Sun Oct 18 06:13:51 UTC 2009


Well, there's Divisi for lsa (or any kind of SVD stuff) in Python:
http://divisi.media.mit.edu/

- Catherine

On Thu, Oct 15, 2009 at 6:09 AM, Stephan Gouws <gouwsmeister at gmail.com> wrote:
> Hi,
>
>  I'm looking for a software package that I can use to generate the document
> similarity matrix for a small corpus of 50 documents, using various of the
> standard algorithms like tfidf, okapi, language models, cosine, lsa, etc.
>
>  Research code is fine I just want a trusted implementation of these
> algorithms, languages in order of preference are [Python, C, C++] , [Java],
> Perl], and from there it's not really preferred anymore but fine nonetheless
> :)
>
>  I want to correlate these with human ratings in a research setting.
>
>  Thank you very much!
>  Stephan.
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list