<div dir="ltr">Thanks Amin,<br>As I know about SCICT corpus, it is a big corpus of collections of classic novels that the project has been finished in summer. I don't think the corpus is completely available but if you live in Iran I think it's easy to obtain that dataset. I think you should contact with people in charge in SCICT.   <br>
Best<br><div class="gmail_quote">On Tue, Dec 18, 2012 at 11:12 AM, amin farajian <span dir="ltr"><<a href="mailto:ma.farajian@gmail.com" target="_blank">ma.farajian@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

  
    
    
  
  <div style="direction:ltr" bgcolor="#FFFFFF" text="#000000">
    <font>Dear Karine,<br>
      the corpus that you talked about (in Payame Noor University of
      Yazd) is actually the one which is available in ELRA. </font><small><font>There
      is also another parallel corpus entitled PEN, developed by myself.
      It is not still publicly available, but I'm going to publish it.
      In the following paper you can find some information about it: <br>
      Mohammad Amin Farajian (2011). <a href="http://world-comp.org/p2011/ICA4953.pdf" rel="nofollow" target="_blank">PEN: Parallel English-Persian News Corpus</a>.
      Proceedings of 2011 International Conference on Artificial
      Intelligence (ICAI'11), Nevada, USA.
      <br><br>
      There are some other researchers (Dr. khadivi in Amirkabir
      University, Dr. Faili in University of Tehran, Dr. Analoui in Iran
      University of Science and Technology) and </font></small><font>research centers (ITRC and SCICT) </font><small><font>
      in Iran which are working on SMT and are building some parallel
      corpora, but as I know their corpora are not available yet.</font>
      <font><br><br>
      Best regards,</font>
      <font><br>
      Amin</font><br>
    </small><div><div class="h5"><br>
    On 12/18/2012 03:33 PM, Megerdoomian, Karine wrote:
    <blockquote type="cite">
      
      
      
      <div>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d">I
            haven’t seen any other parallel English-Persian corpora
            besides the ones already mentioned below. However, I have
            heard about a corpus being developed by the English
            department at Payame Noor University in Yazd, Iran. You may
            want to contact them. Here’s the info online:
            <a href="http://www.eurac.edu/it/newsevents/focus/Newsdetails.html?entryid=22181" target="_blank">http://www.eurac.edu/it/newsevents/focus/Newsdetails.html?entryid=22181</a><u></u><u></u></span></p>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d"><u></u> <u></u></span></p>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d">“</span><span style="font-size:9.0pt;font-family:"Arial","sans-serif";color:#646567">Our
            developmental English-Persian parallel corpus consists of
            about
            <b><span style="font-family:"Arial","sans-serif"">three
                million words</span></b> (more than 50,000
            corresponding sentences in two languages). This is a kind of
            ongoing corpus, that is, an open corpus in which more
            material can be added as the need arises.”<u></u><u></u></span></p>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d"><u></u> <u></u></span></p>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d">Karine<u></u><u></u></span></p>
        <p class="MsoNormal"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d"><u></u> <u></u></span></p>
        <p class="MsoNormal"><a name="13baec9df682f50f_13baec5c3719123c__MailEndCompose"><span style="font-size:10.5pt;font-family:"Sylfaen","serif";color:#1f497d"><u></u> <u></u></span></a></p>
        <p class="MsoNormal" style="margin-left:.5in"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">From:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">
            <a href="mailto:corpora-bounces@uib.no" target="_blank">corpora-bounces@uib.no</a> [<a href="mailto:corpora-bounces@uib.no" target="_blank">mailto:corpora-bounces@uib.no</a>]
            <b>On Behalf Of </b>Hieu Hoang<br>
            <b>Sent:</b> Tuesday, December 18, 2012 7:31 AM<br>
            <b>To:</b> Khamesi Fahime<br>
            <b>Cc:</b> <a href="mailto:corpora@uib.no" target="_blank">corpora@uib.no</a><br>
            <b>Subject:</b> Re: [Corpora-List] machine translation<u></u><u></u></span></p>
        <p class="MsoNormal" style="margin-left:.5in"><u></u> <u></u></p>
        <p class="MsoNormal" style="margin-right:0in;margin-bottom:12.0pt;margin-left:.5in">Hi
          Khamesi<br>
          <br>
          According to this website<br>
             <a href="http://opus.lingfil.uu.se/" target="_blank">http://opus.lingfil.uu.se/</a><br>
          There are 3 freely available parallel corpora for
          persian-english:<br>
            TEP<br>
            KDE<br>
            OpenSubtitles<br>
          <br>
          I've noticed other people, especially in Tehran, are also
          working on MT and collect data, eg.<br>
            <a href="http://ece.ut.ac.ir/iis/resources.html" target="_blank">http://ece.ut.ac.ir/iis/resources.html</a><br>
          <br>
          Kind Regards<br>
          Hieu<br>
          <br>
          <u></u><u></u></p>
        <div>
          <p class="MsoNormal" style="margin-left:.5in">On 12 December
            2012 21:15, Khamesi Fahime <<a href="mailto:khamesi_fahime@yahoo.com" target="_blank">khamesi_fahime@yahoo.com</a>>
            wrote:<u></u><u></u></p>
          <div>
            <div>
              <div>
                <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">Hi,<br>
                    I am student of Linguistics in Iran and i am working
                    on English to Persian statistical machine
                    translation .<u></u><u></u></span></p>
              </div>
              <div>
                <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">unfortunately 
                    I haven't found any EN-PER corpus except TEP and
                    ELRA .<u></u><u></u></span></p>
              </div>
              <div>
                <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">There
                    are many restrictions in Iran(boycott) for ordering
                    ELRA .<br>
                    I appreciate if u can help me in this respect.<u></u><u></u></span></p>
              </div>
              <div>
                <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">I
                    am looking forward to your reply.<u></u><u></u></span></p>
              </div>
              <div>
                <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">Best
                    regards,<u></u><u></u></span></p>
              </div>
              <p class="MsoNormal" style="margin-left:.5in"><span style="font-size:14.0pt;font-family:"Arial","sans-serif"">Khamesi<u></u><u></u></span></p>
            </div>
          </div>
          <p class="MsoNormal" style="margin-right:0in;margin-bottom:12.0pt;margin-left:.5in"><br>
            _______________________________________________<br>
            UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">
              http://mailman.uib.no/options/corpora</a><br>
            Corpora mailing list<br>
            <a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a><br>
            <a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><u></u><u></u></p>
        </div>
        <p class="MsoNormal" style="margin-left:.5in"><u></u> <u></u></p>
      </div>
      <br>
      <fieldset></fieldset>
      <br>
      <pre>_______________________________________________
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a>
Corpora mailing list
<a href="mailto:Corpora@uib.no" target="_blank">Corpora@uib.no</a>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a>
</pre>
    </blockquote>
    <br>
  </div></div></div>

<br>_______________________________________________<br>
UNSUBSCRIBE from this page: <a href="http://mailman.uib.no/options/corpora" target="_blank">http://mailman.uib.no/options/corpora</a><br>
Corpora mailing list<br>
<a href="mailto:Corpora@uib.no">Corpora@uib.no</a><br>
<a href="http://mailman.uib.no/listinfo/corpora" target="_blank">http://mailman.uib.no/listinfo/corpora</a><br>
<br></blockquote></div><br><br clear="all"><br>-- <br><div dir="ltr">Mohammad Sadegh Rasooli<br>PhD Student, Computer Science Department, Columbia University<br>Research Assistant, Center for Computational Learning Systems, Columbia University</div>

</div>