[Corpora-List] SVD on high-dimension data

David Reitter david.reitter at gmail.com
Tue Mar 6 15:38:58 UTC 2007


Jamie,

On 6 Mar 2007, at 14:59, Jamie Smith wrote:

> I have large (1 million by 1 million) term-term matrices. What SVD
> packages work with such massive datasets? I have tried Matlab and
> SVDPACKC without much success.

Have a look at Infomap,

http://infomap-nlp.sourceforge.net/
http://infomap.stanford.edu/

we've used it successfully on the Aquaint  and DUC2005 data (100+  
million words).


--
David Reitter
ICCS/HCRC, Informatics, University of Edinburgh
http://www.david-reitter.com



More information about the Corpora mailing list