[Corpora-List] Distributional and Morphological Word Clustering

manaal faruqui manaalfar at gmail.com
Sat Feb 11 11:23:07 UTC 2012


Hi,

I need software (even a raw piece of code) that can cluster words from a
large untagged corpus into groups based on their distributional and
morphological similarity.
One such tool is provided by Alexander Clark (
http://www.cs.rhul.ac.uk/home/alexc/), but his code works only with ASCII
characters. I have used it before and it works quite well.

I need something that works with Unicode text.
I can manage even if the software doesn't take morphological information into
account.

Thanks!
Manaal Faruqui
IIT Kharagpur, India