<div dir="ltr"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Arabic-L: Thu 12 Sep 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Moderator: Dilworth Parkinson <</span><a href="mailto:dilworth_parkinson@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">dilworth_parkinson@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">[To post messages to the list, send them to </span><a href="mailto:arabic-l@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">arabic-l@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">]</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">[To unsubscribe, send message from same address you subscribed from to</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<a href="mailto:listserv@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">listserv@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif"> with first line reading:</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif"> unsubscribe arabic-l ]</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Directory---------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">---------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">1) Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">2) Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font><div><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">3) Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif"><br></span></div><div><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Messages----------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">1)</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Date: </span><span style="font-size:13.63636302947998px;font-family:arial,sans-serif">12 Sep 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">From: </span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Eric Atwell <<a href="mailto:E.S.Atwell@leeds.ac.uk" target="_blank">E.S.Atwell@leeds.ac.uk</a>></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">I recommend you try </span><a href="http://sketchengine.co.uk/" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">http://sketchengine.co.uk/</a><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">, which offers a 30-day free trial.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">This website allows you to upload your own Arabic corpus, or use an existing corpus on the website, or you can even use the web-crawler to collect a corpus from your own chosen websites. Then you can</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">automatically extract wordlists, keywords, terms, and thesauri;</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">compare and contrast usages of words; and extract lexical patterns.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">SketchEngine is used by dictionary publishers (Oxford University Press,</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Le Robert, Cornelsen, Collins, Macmillan, etc.) but is also useful for individual Arabic language teachers and researchers.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Eric</span></div><div><br></div><div><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Dear Sohaib,</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">I suggest you also contact your colleagues at Taibah University</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">in the College of Computer Science and Engineering, who are also researching Arabic text analysis, particularly religious texts.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">They may be interested in collaboration on Arabic text corpus analysis.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">I have met Dr Mohamed Menacer of the NOOR research centre at Taibah</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">University, who is helping to organise a conference around this topic in December 2013: International Conference on Advances in Information Technology for the Holy Quran and Its Sciences. </span><a href="http://www.taibahu.edu.sa/pages.aspx?pid=11438&ln=en" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">http://www.taibahu.edu.sa/pages.aspx?pid=11438&ln=en</a><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">He can be contacted at:</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Dr Mohamed Menacer </span><a href="mailto:eazmm@hotmail.com" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">eazmm@hotmail.com</a><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Department of Computer Science</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">College of Computer Science and Engineering, Taibah University,</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">P.O. Box 30002, Madinah Munawarrah,</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Kingdom of Saudi Arabia</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Mobile: +966-530943483</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">regards</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Eric</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<br><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">--------------</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">2)</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Date: </span><span style="font-size:13.63636302947998px;font-family:arial,sans-serif">12 Sep 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">From: </span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px"> "Jiří Milička" <<a href="mailto:milicka@centrum.cz" target="_blank">milicka@centrum.cz</a>></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font></div><div><br style="font-family:arial,sans-serif"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Hello Sohaib</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Try TypeTokener (</span><a href="http://milicka.cz/en/typetokener" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">milicka.cz/en/typetokener</a><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">); it is freeware.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Just give it the names of the files you want to process, in plain-text format (UTF-8); it can remove vocalisation if you wish. It then gives you the set of word types (i.e. the set of distinct words), the rank-frequency relation ("Zipf's law"), the number of word types, the type-token relation (Herdan's/Heaps' law), and a combinatorial model of the type-token relation, which can help you discover inhomogeneities in the text.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px"> </span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Let me know if you run into any problems.</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-family:arial,sans-serif;font-size:13.333333969116211px"> </span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Jiří Milička</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<div style="font-size:13.333333969116211px;font-family:arial,sans-serif"><br></div><div style="font-size:13.333333969116211px;font-family:arial,sans-serif"><div style="font-family:arial;font-size:small"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">--------------</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">3)</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Date: </span><span style="font-size:13.63636302947998px;font-family:arial,sans-serif">12 Sep 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">From: </span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">hussein hiyassat <<a href="mailto:hiyassat@gmail.com" target="_blank">hiyassat@gmail.com</a>></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Subject: </span><font face="arial, sans-serif">Arabic text analysis software response</font></div><div><font face="arial, sans-serif"><br></font></div>
</div><div style="font-size:13.333333969116211px;font-family:arial,sans-serif">Please try the CMU language modeling toolkit:<br><br><a href="http://www.speech.cs.cmu.edu/SLM_info.html" target="_blank">http://www.speech.cs.cmu.edu/SLM_info.html</a><br>
<br></div><div style="font-size:13.333333969116211px;font-family:arial,sans-serif"><br></div><div style="font-size:13.333333969116211px;font-family:arial,sans-serif">--------------------------------------------------------------------------<br>
End of Arabic-L: 12 Sep 2013</div></div></div>