[Corpora-List] semantic similarity
Leonid Kontorovich
lkontoro at andrew.cmu.edu
Thu Jan 20 16:57:48 UTC 2005
Hi Jana,
have you looked at Latent Dirichlet Allocation, developed by Blei, Jordan
and Ng? Take a look at Blei's homepage:
http://www.cs.berkeley.edu/~blei/
in particular,
Latent Dirichlet allocation. D. Blei, A. Ng, and M.
Jordan. Journal of Machine Learning Research, 3:993-1022, January 2003.
Dave Blei is now a postdoc at CMU, and I'm a grad student here -- so feel
free to stop by.
Best,
-Leo
On Thu, 20 Jan 2005, Jana Diesner wrote:
> Dear list members,
>
> We are looking for strategies, algorithms or code to automatically find
> single terms or multiple adjacent terms that are semantically similar within
> and across documents. The approach must not require POS tagging or an
> initial input of a reference term to the system. The resulting clusters of
> semantically similar terms suggested by the system do not need to be
> exclusive. We are familiar with secondstring, the software developed by
> William Cohen, and semantic similarity based on string-edit distances.
>
>
>
> Thank you very much.
>
> Jana
>
>
>
> ____________________
>
> Jana Diesner
> Carnegie Mellon University
>
> jdiesner at andrew.cmu.edu
More information about the Corpora
mailing list