<html><body><div style="color:#000; background-color:#fff; font-family:times new roman, new york, times, serif;font-size:12pt"><div>Dear Amanda,</div>
<div> </div>
<div>You could check out NooJ linguistic platform at: <a rel="nofollow" target="_blank" href="http://www.nooj4nlp.net/">www.nooj4nlp.net</a><br><br>It's a freeware corpus processor that includes a morphological parser and linguistic resources for more than 20 languages.</div>
<div> </div>
<div>I already used it to, successfully, tokenize Arabic agglutinated forms.</div>
<div> </div>
<div>Best regards,</div>
<br>Dr. Slim Mesfar<br><div><span></span></div><div><br></div> <div style="font-family: times new roman, new york, times, serif; font-size: 12pt;"> <div style="font-family: times new roman, new york, times, serif; font-size: 12pt;"> <div dir="ltr"> <hr size="1"> <font face="Arial" size="2"> <b><span style="font-weight:bold;">De :</span></b> "Potts, Amanda" <a.potts@lancaster.ac.uk><br> <b><span style="font-weight: bold;">À :</span></b> "Corpora@uib.no" <Corpora@uib.no> <br> <b><span style="font-weight: bold;">Envoyé le :</span></b> Mercredi 19 juin 2013 12h42<br> <b><span style="font-weight: bold;">Objet :</span></b> [Corpora-List] Recommendations for morphological segmenter?<br> </font> </div> <div class="y_msg_container"><br><div id="yiv5108958471">
<style><!--
#yiv5108958471
_filtered #yiv5108958471 {font-family:Calibri;panose-1:2 15 5 2 2 2 4 3 2 4;}
#yiv5108958471
#yiv5108958471 p.yiv5108958471MsoNormal, #yiv5108958471 li.yiv5108958471MsoNormal, #yiv5108958471 div.yiv5108958471MsoNormal
{margin:0cm;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", "sans-serif";}
#yiv5108958471 a:link, #yiv5108958471 span.yiv5108958471MsoHyperlink
{color:blue;text-decoration:underline;}
#yiv5108958471 a:visited, #yiv5108958471 span.yiv5108958471MsoHyperlinkFollowed
{color:purple;text-decoration:underline;}
#yiv5108958471 span.yiv5108958471EmailStyle17
{font-family:"Calibri", "sans-serif";color:windowtext;}
#yiv5108958471 .yiv5108958471MsoChpDefault
{font-family:"Calibri", "sans-serif";}
_filtered #yiv5108958471 {margin:72.0pt 72.0pt 72.0pt 72.0pt;}
#yiv5108958471 div.yiv5108958471WordSection1
{}
--></style>
<div>
<div class="yiv5108958471WordSection1">
<div class="yiv5108958471MsoNormal">Hello everyone,</div>
<div class="yiv5108958471MsoNormal"> </div>
<div class="yiv5108958471MsoNormal">I was wondering whether anyone could recommend a tool for automated morphological segmentation. Either a script or a feature in a programme would be really helpful for me. I’m hoping to perform morpheme splitting on relatively small datasets
(roughly 6 million words total), and would ideally like to be able to generate wordlists of affixes and other morphemes and to search for types of morphemes (either in their lexical forms or tagged by the tool). Has anyone used anything like this with a good
degree with reliability? Thanks very much for any advice you can give me.</div>
<div class="yiv5108958471MsoNormal"> </div>
<div class="yiv5108958471MsoNormal">Amanda Potts</div>
</div>
</div>
</div><br>_______________________________________________<br>UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>Corpora mailing list<br><a ymailto="mailto:Corpora@uib.no" href="mailto:Corpora@uib.no">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br><br></div> </div> </div> </div></body></html>