[Corpora-List] BILINGUAL PARALLEL CORPORA

Daniel Zeman zeman at ufal.mff.cuni.cz
Mon Nov 13 13:46:15 UTC 2006


The JRC Acquis corpus contains aligned texts from the European 
legislation in 20 or so languages, see http://wt.jrc.it/lt/Acquis/

Cheers
Dan

JLDLME napsal(a):
> Dear Corpora-List members,
>  
> I have three questions...
>  
> Does anyone know if there is any publicly available bilingual, 
> sentence aligned, freely available corpus involving several languages, 
> namely in Scandinavian (Finnish, Norwegian, etc.) or Latin languages 
> (Spanish, Italian, etc.), for bilingual studies ?
>  
> My second question is: Which would be the requirements to create an 
> online/desktop software tool for the whole process of a parallel corpora?
>  
> Finally, do you should consider one million of words (in both 
> languages) a large or a little bilingual corpus?
>  
> Any help will be appreciated.
>  
>  
> Regards,
>  
>  
> J. L. DeLucca (in some place of Spain)
>  
>
> ------------------------------------------------------------------------
> Access over 1 million songs - Yahoo! Music Unlimited. 
> <http://pa.yahoo.com/*http://us.rd.yahoo.com/evt=36035/*http://music.yahoo.com/unlimited/> 



More information about the Corpora mailing list