<div dir="ltr"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">------------</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Arabic-L: Fri 19 Sep 2013</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Moderator: Dilworth Parkinson <</span><a href="mailto:dilworth_parkinson@byu.edu" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">dilworth_parkinson@byu.edu</a><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">></span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">[To post messages to the list, send them to </span><a href="mailto:arabic-l@byu.edu" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">arabic-l@byu.edu</a><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">]</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">[To unsubscribe, send message from same address you subscribed from to</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<a href="mailto:listserv@byu.edu" style="font-family:arial,sans-serif;font-size:13.333333969116211px" target="_blank">listserv@byu.edu</a><span style="font-family:arial,sans-serif;font-size:13.333333969116211px"> with first line reading:</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">           unsubscribe arabic-l                                      ]</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">-------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Directory---------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">---------------</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">1) Subject: </span><font face="arial, sans-serif">Query on Spoken Arabic Corpus tools</font><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">-------------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Messages----------------------</span><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">-------------</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">1)</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Date: </span><span style="font-family:arial,sans-serif;font-size:13.63636302947998px">19 Sep 2013</span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">From: </span><span style="font-family:arial,sans-serif;font-size:13px"> David Wilmsen <<a href="mailto:david.wilmsen@gmail.com" target="_blank">david.wilmsen@gmail.com</a>></span><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">

<span style="font-family:arial,sans-serif;font-size:13.333333969116211px">Subject: </span><font face="arial, sans-serif">Query on Spoken Arabic Corpus tools</font><br style="font-family:arial,sans-serif;font-size:13.333333969116211px">


<br style="font-family:arial,sans-serif;font-size:13.333333969116211px"><span style="font-family:arial,sans-serif;font-size:13px">I see we have a great many tools (or at least some) with which to construct</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">(or attempt to construct) corpora of written Arabic.</span><br style="font-family:arial,sans-serif;font-size:13px"><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">I have a query involving what I think would involve corpora of much greater</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">complexity:</span><br style="font-family:arial,sans-serif;font-size:13px">

<br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">Does anyone know of or is anyone working on programs that can handle spoken</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">Arabic corpora?</span><br style="font-family:arial,sans-serif;font-size:13px"><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">By now, thousands - maybe hundreds of thousands - of hours of spoken</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">language data are available in the form of Arab serials, archived on many</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">web sites, including those of the channels that originally broadcast them.</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">This is to say nothing of the unscripted spoken language available on sites</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">such as Utube.</span><br style="font-family:arial,sans-serif;font-size:13px">

<br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">Some researchers (including myself) are already utilizing language-use data</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">gleaned from Arabic-language serials. Imagine the potential for being able</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">to search and compare thousands of instances of usage of whatever word or</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">construct is under investigation.</span><br style="font-family:arial,sans-serif;font-size:13px"><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">I think I'm too old to begin trying to learn how to construct software that</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">might be able to handle such a task. Is there anyone in our younger</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">generation of scholars with the know-how to approach it?</span><br style="font-family:arial,sans-serif;font-size:13px">

<br style="font-family:arial,sans-serif;font-size:13px"><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">David Wilmsen</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">Associate Professor of Arabic</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">Chair, Department of Arabic and Near Eastern Languages</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">American University of Beirut</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">Bliss Street, Hamra</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">Beirut, Lebanon</span><br style="font-family:arial,sans-serif;font-size:13px"><span style="font-family:arial,sans-serif;font-size:13px">1107 2020</span><br style="font-family:arial,sans-serif;font-size:13px">

<span style="font-family:arial,sans-serif;font-size:13px">tel:  +961-1-350000 ext. 3850/1</span><br style="font-family:arial,sans-serif;font-size:13px"><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">

<br></div><div style="font-family:arial,sans-serif;font-size:13.333333969116211px">--------------------------------------------------------------------------<br>
End of Arabic-L: 19 Sep 2013</div></div>