[Corpora-List] English-Spanish Medical Corpora

Dominic Widdows widdows at maya.com
Wed Feb 7 17:49:45 UTC 2007


Hi Olivier,

I wasn't previously aware of a large collection of reports that  
included versions in Arabic, Chinese, English, French, Russian and  
Spanish. This would be a great resource to use for a variety of  
experiments: do you know if there is some part of these sites where
you can request bulk downloads?

If not, would it be possible for someone to write a spider and host  
the corpora somewhere else for bulk download? Would there be  
copyright issues, and if so could these be negotiated? If you have  
experience of doing this or any suggestions, I would be very interested.

Best wishes,
Dominic

On Feb 7, 2007, at 12:30 PM, Olivier Kraif wrote:

> Hi Mario,
> you can find the WHO reports in both languages (and also in
> Chinese, Arabic, Russian and French). The reports can be
> downloaded as PDFs from this URL:
> http://www.who.int/whr/previous/es/index.html
> If you need already processed and aligned reports in English and  
> French, I can send you some texts.
>
> You may also have a look at the UN records:
> http://unbisnet.un.org:8080/ipac20/ipac.jsp?profile=bib&menu=search&submenu=power#focus
> A lot of texts are available online in the same languages, and
> some of them concern medical subjects.
> Texts can be downloaded in PDF (and even in DOC format, if you  
> change something in the URL :-).
>
> Regards
>
> Olivier
>
>
>> Dear all,
>>
>> I am a student on the MSc in Language Technology at Saarland
>> University. I am looking for English-Spanish medical corpora or,
>> failing that, papers, articles, or any kind of publication where
>> aligned English-Spanish medical texts can be found (for example,
>> abstracts in both languages). I hope someone can help me. Thank
>> you in advance,
>>
>> Mario


