[Corpora-List] English-Spanish Medical Corpora
Dominic Widdows
widdows at maya.com
Wed Feb 7 17:49:45 UTC 2007
Hi Olivier,
I wasn't previously aware of a large collection of reports that
included versions in Arabic, Chinese, English, French, Russian and
Spanish. This would be a great resource to use for a variety of
experiments: do you know if there is a some part of these sites where
you can request bulk downloads?
If not, would it be possible for someone to write a spider and host
the corpora somewhere else for bulk download? Would there be
copyright issues, and if so could these be negotiated? If you have
experience of doing this or any suggestions, I would be very interested.
Best wishes,
Dominic
On Feb 7, 2007, at 12:30 PM, Olivier Kraif wrote:
> Hi Mario,
> you can find the WHO reports in both languages (and even in
> Chinese , Arabic, Russian and French). The reports can be
> downloaded in pdf from this url :
> http://www.who.int/whr/previous/es/index.html
> If you need already processed and aligned reports in English and
> French, I can send you some texts.
>
> You may also have a look to the UN records : http://unbisnet.un.org:
> 8080/ipac20/ipac.jsp?profile=bib&menu=search&submenu=power#focus
> A lot of texts are available online, in the latter languages, and
> some texts concern medical subjects.
> Texts can be downloaded in PDF (and even in DOC format, if you
> change something in the URL :-).
>
> Regards
>
> Olivier
>
>
>> Dear all,
>>
>> I am student of Msc Language Technology in Saarland University. I
>> am looking for a English-Spanish medical corpora or, failing that,
>> papers, articles, any kind of publication... where you can find
>> English-Spanish medical texts aligned (like, for example,
>> abstracts in both languages). I hope someone can help me. Thank
>> you in advance,
>>
>> Mario
>>
>>
>>
>>
>
>
>
More information about the Corpora
mailing list