<div dir="ltr">Dear Asad,<div>As far as I know, if your parallel corpus is already sentence-aligned, that is, every sentence in the source language is paired with a sentence in the target language, that should be enough.</div><div>Which language pairs does your corpus cover?</div>
<div><br></div><div>Best,</div><div><br></div><div>Christophe</div></div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-04-18 8:56 GMT+02:00 Asad Abdul Malik <span dir="ltr">&lt;<a href="mailto:asadmalik@kust.edu.pk" target="_blank">asadmalik@kust.edu.pk</a>&gt;</span>:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi All,<br>
<br>
I am currently doing an MS, and for my final research I want to develop a parallel corpus. I have the text in the source language and its translation in the target language. What else do I have to do in order to develop the parallel corpus? Do I have to tokenize this data, or apply any other processing to the text?<br>
-- <br>
<br>
Kind Regards,<br>
<br>
Mr. Asad Abdul Malik<br>
Research Student<br>
Institute of Information Technology,<br>
Kohat University of Science and Technology,<br>
Kohat-26000, K.P.K, Pakistan<br>
<br>
<br>
</blockquote></div><br></div>