<div dir="ltr"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Arabic-L: Fri 20 Dec 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Moderator: Dilworth Parkinson <</span><a href="mailto:dilworth_parkinson@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">dilworth_parkinson@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">[To post messages to the list, send them to </span><a href="mailto:arabic-l@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">arabic-l@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">]</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">[To unsubscribe, send message from same address you subscribed from to</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<a href="mailto:listserv@byu.edu" style="font-size:13.333333969116211px;font-family:arial,sans-serif" target="_blank">listserv@byu.edu</a><span style="font-size:13.333333969116211px;font-family:arial,sans-serif"> with first line reading:</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif"> unsubscribe arabic-l ]</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Directory---------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">---------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">1) Subject: </span><font face="arial, sans-serif">Workshop on Free/OpenSource Arabic Corpora</font><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Messages----------------------</span><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">-------------</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">1)</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif"><span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Date: </span><span style="font-size:13px;font-family:arial,sans-serif">20 Dec 2013</span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">From: </span><span name="AbdulMohsen Al-Thubaity PhD, PMP" style="font-size:13px;font-family:arial,sans-serif">AbdulMohsen Al-Thubaity PhD, PMP</span><span style="font-family:arial,sans-serif;font-size:13px;white-space:nowrap"> </span><span style="font-family:arial,sans-serif;font-size:13px;white-space:nowrap"><<a href="mailto:althubaity@gmail.com" target="_blank">althubaity@gmail.com</a>></span><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<span style="font-size:13.333333969116211px;font-family:arial,sans-serif">Subject: </span><font face="arial, sans-serif">Workshop on Free/OpenSource Arabic Corpora</font><br style="font-size:13.333333969116211px;font-family:arial,sans-serif">
<br>Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools<br><br>Workshop URL: <a href="http://www.kacstac.org.sa/osact/index.html" target="_blank">http://www.kacstac.org.sa/osact/index.html</a><br><br>
Workshop description<br>
<br><div>In the Natural Language Processing (NLP) and Computational Linguistics (CL) communities, Arabic has long been regarded as a resource-poor language. This was thought to be the reason for the lack of corpus-based studies of Arabic. However, recent years have witnessed the emergence of a considerable number of new, free Arabic corpora and, to a lesser extent, Arabic corpora processing tools.<br>
<br>Freely available Arabic corpora can be divided into two groups. The first group contains large Arabic corpora designed and constructed primarily for Arabic linguistics research and activities, and possibly for Arabic NLP. These corpora cover diverse genres, and their sizes range from one million words to 700 million words. The second group contains corpora designed primarily for Arabic text classification and clustering; they mainly contain newspaper articles and range from less than 1 million words to 11 million words.</div>
<div><br>Some Arabic corpora, mainly the large ones, can be explored on the web using different tools, while others are available only for download. For downloadable corpora, the user may need standalone corpus processing tools. These tools offer many functions, such as word frequency, concordance, and collocation. Yet even with the availability of large and diverse Arabic corpora, the situation has not changed: there is still a lack of Arabic corpus-based studies. Is this because of the representativeness of these corpora? The available functions and tools associated with them? Or is it because they are not well enough known in the Arabic linguistics community?<br>
<br>Motivation and topics of interest<br><br>This half-day workshop aims to encourage researchers and developers to foster the use of freely available Arabic corpora and open-source Arabic corpora processing tools, to help highlight the drawbacks of these resources, and to discuss techniques and approaches for improving them. The workshop topics include, but are not limited to:<br>
<br>1. Surveying and critiquing the design of freely available Arabic corpora, their associated tools, and standalone Arabic corpora processing tools.<br><br>2. The applications and uses of freely available Arabic language resources in fields such as Arabic language education, e.g. L1 and L2.<br>
<br>3. Arabic language modeling.<br><br>4. Corpus-based Arabic lexicography.<br><br>5. Lexical semantics and word sense.<br><br>6. Corpus-based Arabic syntax.<br><br>7. Corpus-based Arabic morphology.<br>
<br>8. Development of Arabic mobile applications based on the available Arabic language resources.<br><br>9. Evaluation and assessment of Arabic Corpora and Corpora Processing Tools.<br><br>10. Future directions of Free/Open Arabic Corpora and Corpora Processing Tools.<br>
<br><br>Important Dates<br><br>Submission deadline: 10 February 2014<br><br>Notification of acceptance: 10 March 2014<br><br>Final submission of manuscripts: 21 March 2014<br><br>Workshop date: 27 May 2014 (morning session)<br>
<br><br>Submission guidelines<br><br>The language of the workshop is English, and submissions should follow the LREC 2014 paper submission instructions. All papers will be peer reviewed, possibly by three independent referees. Papers must be submitted electronically in PDF format through the START system. When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of their research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and the replicability of experiments, including evaluation ones.<br>
<br>Organising Committee<br><br>Hend Al-Khalifa, King Saud University, KSA<br><br>Abdulmohsen Al-Thubaity, King Abdul Aziz City for Science and Technology, KSA<br><br>Program Committee<br><br>Eric Atwell, University of Leeds, UK<br>
<br>Khaled Shaalan, The British University in Dubai (BUiD), UAE<br><br>Dilworth Parkinson, Brigham Young University, USA<br><br>Nizar Habash, Columbia University, USA<br><br>Khurshid Ahmad, Trinity College Dublin, Ireland<br>
<br>Abdulmalik AlSalman, King Saud University, KSA<br><br>Maha Alrabiah, King Saud University, KSA<br><br>Saleh Alosaimi, Imam University, KSA<br><br>Sultan Almujaiwel, King Saud University, KSA<br><br>Adam Kilgarriff, Lexical Computing Ltd, UK<br>
<br>Amal AlSaif, Imam University, KSA<br><br>Maha AlYahya, King Saud University, KSA<br><br>Auhood AlFaries, King Saud University, KSA<br><br>Salwa Hamada, Taibah University, KSA<br><br>Mansour Algamdi, King Abdul Aziz City for Science and Technology, KSA<br>
<br>Abdullah Alfaifi, University of Leeds, UK</div><div><br><div style="font-size:13.333333969116211px;font-family:arial,sans-serif">--------------------------------------------------------------------------<br>
End of Arabic-L: 20 Dec 2013</div></div></div>