Hello,<br><br>Thank a lot Motaz.<br>Are these corpora from UN 2000. And <span id="result_box" class="short_text" lang="en"><span class="hps">they</span> <span class="hps"> </span></span><span id="result_box" class="short_text" lang="en"><span class="hps">are</span></span><span id="result_box" class="short_text" lang="en"><span class="hps"> aligned</span> <span class="hps">manually</span> <span class="hps">or</span> <span class="hps">automatically</span></span>?<br>
I execute the script extract.py to extract text plain UN corpora, and I'm aligning this corpora with hunalign toolkit.<br>I also need Arabic French aligned UN corpora. I wondered if you have these corpora?<br><br>Thanks.<br>
<br><br><br><div class="gmail_quote">2012/9/20 Motaz SAAD <span dir="ltr"><<a href="mailto:motaz.saad@inria.fr" target="_blank">motaz.saad@inria.fr</a>></span><br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div><div style="font-size:12pt;font-family:times new roman,new york,times,serif">Hello,<br><br>Please find the aligned corpora in the attached files,<br><br>best regards,<br>Motaz <br><br><hr><blockquote style="padding-left:5px;font-size:12pt;font-style:normal;margin-left:5px;font-family:Helvetica,Arial,sans-serif;text-decoration:none;font-weight:normal;border-left:2px solid rgb(16,16,255)">
<b>From: </b>"Rahma Sellami" <<a href="mailto:rahma.sellami@gmail.com" target="_blank">rahma.sellami@gmail.com</a>><br><b>To: </b><a href="mailto:corpora@uib.no" target="_blank">corpora@uib.no</a><br><b>Sent: </b>Friday, September 14, 2012 10:27:42 PM<br>
<b>Subject: </b>[Corpora-List] extract.py<div><div class="h5"><br><br>Hi,<br>How can I execute the scripts extract.py to align UN corpora.<br>I use this syntax: "python extract.py en ar" but always "0 documents in all languages" is returned.<br>
arabic files are in the directory: xmlar/2000/ and english files are in:xmlen/2000.<br>
Thanks<br><br>-- <br><div dir="ltr" style="text-align:left"><span></span><span></span></div><div dir="ltr"><br><br></div><div dir="ltr">RAHMA Sellami<br><div style="text-align:left"><span style="font-family:arial,helvetica,sans-serif;border-collapse:collapse">PhD Computer Science Student</span></div>
<div><font face="arial, helvetica, sans-serif"><span style="border-collapse:collapse"><a href="http://sites.google.com/site/rahmasellami/" target="_blank">http://sites.google.com/site/rahmasellami/</a></span></font></div>
<div><font face="arial, helvetica, sans-serif"><span style="border-collapse:collapse"><a href="http://sites.google.com/site/rahmasellami/" target="_blank"></a><br></span></font>Faculty of Economic Sciences and management of Sfax<br>
ANLP Research Group<br><a href="http://sites.google.com/site/anlprg" target="_blank">http://sites.google.com/site/anlprg</a><br><br>MIRACL Laboratory<br><a href="http://www.miracl.rnu.tn" target="_blank">www.miracl.rnu.tn</a><br>
<br>Email: <a href="mailto:rahma.sellami@gmail.com" target="_blank">rahma.sellami@gmail.com</a></div></div><br>
<br></div></div>_______________________________________________<br>UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>Corpora mailing list<br>
<a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br><a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br></blockquote><br></div></div></blockquote>
</div><br><br clear="all"><br>-- <br><div dir="ltr" style="text-align:left"><span></span><span></span></div><div dir="ltr"><br></div><div dir="ltr">RAHMA Sellami<br><div style="text-align:left"><span style="font-family:arial,helvetica,sans-serif;border-collapse:collapse">PhD Computer Science Student</span></div>
<div><font face="arial, helvetica, sans-serif"><span style="border-collapse:collapse"><a href="http://sites.google.com/site/rahmasellami/" target="_blank">http://sites.google.com/site/rahmasellami/</a></span></font></div>
<div><font face="arial, helvetica, sans-serif"><span style="border-collapse:collapse"><a href="http://sites.google.com/site/rahmasellami/" target="_blank"></a><br></span></font>Faculty of Economic Sciences and management of Sfax<br>
ANLP Research Group<br><a href="http://sites.google.com/site/anlprg" target="_blank">http://sites.google.com/site/anlprg</a><br><br>MIRACL Laboratory<br><a href="http://www.miracl.rnu.tn" target="_blank">www.miracl.rnu.tn</a><br>
<br>Email: <a href="mailto:rahma.sellami@gmail.com" target="_blank">rahma.sellami@gmail.com</a></div></div><br>