[Corpora-List] Stemmer for Hindi
Daniel Zeman
zeman at ufal.mff.cuni.cz
Wed Nov 24 10:46:14 UTC 2010
Dne 18.11.2010 13:53, Francis Tyers napsal(a):
> ...
> You could try the morphological analyser from IIIT if you can manage
> with the WX notation:
>
> http://ltrc.iiit.ac.in/showfile.php?filename=onlineServices/morph/index.htm
Thanks for the link, Fran. We too have been trying to locate a Hindi MA
for translation experiments. We got something from Hyderabad a year ago
but we did not manage to get it running then. Their current version is
much better - great thanks to IIIT!
For Luís and others, I have a convertor from UTF-8-encoded Devanagari to
WX and back. I also rewrote the spell_variation.pl script because the
original version was incredibly slow. Contact me if you are interested.
Dan
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list