[Corpora-List] First Free English-Persian Parallel Corpus
Francis Tyers
ftyers at prompsit.com
Wed Apr 14 23:02:16 UTC 2010
El dc 14 de 04 de 2010 a les 11:40 +0430, en/na Taher Pilevar va
escriure:
> Please send this message to the list for the researches who are
> looking for English-Persian corpora:
>
> First Free English-Persian Parallel Corpus
>
> By Mohammad Taher Pilevar, NLP Lab, University of Tehran, Iran.
>
> 4 million tokens on each side
> Sentence Aligned
> Extracted from movie subtitles
> Text domain: informal/conversational
> Total alinged movie subtitles: 1600
>
> http://ece.ut.ac.ir/NLP/resources.htm
What is the copyright status of the corpus ? Are the subtitles all from
public domain films ?
Fran
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list