[Corpora-List] Korean and Japanese stemming

Michal Ptaszynski ptaszynski at media.eng.hokudai.ac.jp
Fri Mar 2 12:28:50 UTC 2012


Dear Stefan

Pontus has already mentioned this, but MeCab gives you the word stem (base
form) as one of the fields in its output.
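
To illustrate, here is a minimal Python sketch of pulling the base form out of
MeCab output. It assumes the default output format with the IPADIC dictionary,
where each line is "surface<TAB>comma-separated features" and the seventh
feature is the dictionary form; other dictionaries lay out the fields
differently. The helper name base_forms and the sample string are made up for
illustration, and the sample is typed by hand rather than captured from a real
run.

```python
def base_forms(mecab_output):
    """Return (surface, base form) pairs from IPADIC-style MeCab output."""
    pairs = []
    for line in mecab_output.splitlines():
        # Skip blank lines and the end-of-sentence marker.
        if not line or line == "EOS":
            continue
        surface, _, features = line.partition("\t")
        fields = features.split(",")
        # Assumption: with IPADIC, the 7th feature field is the base form.
        # Unknown words carry "*" (or fewer fields), so fall back to the surface.
        if len(fields) > 6 and fields[6] != "*":
            base = fields[6]
        else:
            base = surface
        pairs.append((surface, base))
    return pairs


# Hand-written sample in the assumed IPADIC layout for the input "食べた".
SAMPLE = (
    "食べ\t動詞,自立,*,*,一段,連用形,食べる,タベ,タベ\n"
    "た\t助動詞,*,*,*,特殊・タ,基本形,た,タ,タ\n"
    "EOS\n"
)

for surface, base in base_forms(SAMPLE):
    print(surface, "->", base)
```

In a real pipeline the string would come from MeCab itself, for example from
the command-line tool or from the Python binding's MeCab.Tagger().parse(text),
provided the binding and a dictionary are installed.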

Also, you may want to try the new version of JUMAN,
http://nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN
which was released just a couple of weeks ago and seems to be quite
awesome.
Perhaps for the first time in the history of Japanese NLP, the JUMAN family
has a chance to beat the MeCab/ChaSen family.

Best,

Michal

--------------------------------
From: Stefan Bordag <sbordag at informatik.uni-leipzig.de>
To: corpora at uib.no
Date: Fri, 02 Mar 2012 10:16:32 +0100
Subject: [Corpora-List] Korean and Japanese stemming

Dear all,

Does anyone know whether someone has written a simple Porter stemmer or a
similar set of rules for stemming Korean texts? The same goes for Japanese
texts. It doesn't need to be anything fancy. But using Google Translate and
search engine results has not led anywhere, or I am looking in the wrong
places.

Thank you very much in advance,
Stefan Bordag

-- 
--
---------------------------------------------
- Dr. Stefan Bordag                         -
- 0341 49 26 196                            -
- sbordag at informatik.uni-leipzig.de         -
---------------------------------------------
 
