<br><br><div class="gmail_quote">---------- Forwarded message ----------<br>From: <b class="gmail_sendername">Majdi Sawalha</b> <span dir="ltr"><<a href="mailto:maj_sawalha@yahoo.com">maj_sawalha@yahoo.com</a>></span><br>
Date: Mon, Oct 12, 2009 at 11:40 AM<br>Subject: Re: [Corpora-List] Arabic Corpus work in Python<br>To: Lisa Hesterberg <<a href="mailto:lisahesterberg2013@u.northwestern.edu">lisahesterberg2013@u.northwestern.edu</a>><br>
<br><br><div><div style="font-family: arial,helvetica,sans-serif; font-size: 14pt;"><div></div>
<div>Hi lisa,</div>
<div> </div>
<div>i would suggest to use unicode utf-8 for input and output Arabic text in python. there is a utf-8 copy of the CCA Arabic corpus which u can use. if you mean writing Arabic words inside the code in IDLE, this might not work, and if it is work on one machine, it might cause problem on other machines that do not support Arabic characters. so, the best way is to use a string of unicode characters instead. e.g Alif is equivelant to u"\u0627". Arabic letters starts from u0621 to u0652 including short vowels. </div>
<div> </div>
<div>i hope this will help,</div>
<div> </div>
<div>Majdi<br> </div>
<hr style="width: 100%; min-height: 2px;">
<div style="text-align: left;"><font style="font-weight: bold;" size="4">Majdi Sawalha</font><br></div><font size="4"><font style="font-weight: bold;" size="2"><span style="font-style: italic;">Faculty of Engineering<br>
School of Computing<br>University of Leeds<br>Leeds, LS2 9JT<br>UK<br><a href="http://www.comp.leeds.ac.uk/sawalha" target="_blank">http://www.comp.leeds.ac.uk/sawalha</a><br></span><span style="font-style: italic;"></span></font></font>
<hr style="width: 100%; min-height: 7px;">
<font size="4"><font style="font-weight: bold;" size="2"><span style="font-style: italic;"><br></span></font></font>
<div><br></div>
<div style="font-size: 14pt; font-family: arial,helvetica,sans-serif;"><br>
<div style="font-size: 12pt; font-family: times new roman,new york,times,serif;"><font face="Tahoma" size="2"><div class="im">
<hr size="1">
<b><span style="font-weight: bold;">From:</span></b> Lisa Hesterberg <<a href="mailto:lisahesterberg2013@u.northwestern.edu" target="_blank">lisahesterberg2013@u.northwestern.edu</a>><br><b><span style="font-weight: bold;">To:</span></b> <a href="mailto:CORPORA@uib.no" target="_blank">CORPORA@uib.no</a><br>
</div><b><span style="font-weight: bold;">Sent:</span></b> Mon, October 12, 2009 4:49:49 PM<div class="im"><br><b><span style="font-weight: bold;">Subject:</span></b> [Corpora-List] Arabic Corpus work in Python<br></div></font><div>
<div></div><div class="h5"><br>Hi,<br><br>I'm currently working with Python on the CCA Arabic corpus, and IDLE is giving me problems with the Arabic characters. Does anyone have any experience working with Arabic in IDLE, or is there a better way to deal with Arabic characters in Python? I would very much appreciate any help on this matter.<br>
<br>Thanks,<br><br>Lisa Hesterberg<br>Department of Linguistics<br>Northwestern University<br></div></div></div></div></div><br>
</div></div><br>