[Corpora-List] semantic similarity

Leonid Kontorovich lkontoro at andrew.cmu.edu
Thu Jan 20 16:57:48 UTC 2005


Hi Jana,

have you looked at Latent Dirichlet Allocation, developed by Blei, Jordan
and Ng? Take a look at Blei's homepage:
http://www.cs.berkeley.edu/~blei/

in particular,
Latent Dirichlet allocation. D. Blei, A. Ng, and M.
Jordan. Journal of Machine Learning Research, 3:993-1022, January 2003.

Dave Blei is now a postdoc at CMU, and I'm a grad student here -- so feel
free to stop by.

Best,
-Leo

On Thu, 20 Jan 2005, Jana Diesner wrote:

> Dear list members,
>
> We are looking for strategies, algorithms or code to automatically find
> single terms or multiple adjacent terms that are semantically similar within
> and across documents. The approach must not require POS tagging or an
> initial input of a reference term to the system. The resulting clusters of
> semantically similar terms suggested by the system do not need to be
> exclusive. We are familiar with secondstring, the software developed by
> William Cohen, and semantic similarity based on string-edit distances.
>
>
>
> Thank you very much.
>
> Jana
>
>
>
> ____________________
>
> Jana Diesner
> Carnegie Mellon University
>
> jdiesner at andrew.cmu.edu



More information about the Corpora mailing list