[Corpora-List] Release 2014 of DGT-TM (parallel corpus in 24 languages)

Jörg Tiedemann Jorg.Tiedemann at lingfil.uu.se
Fri Sep 19 08:33:29 UTC 2014


You can have a look at OPUS (UN data is there as well):
http://opus.lingfil.uu.se

All corpora are available in TMX format. Most of them are not as clean as DGT, though.

Best,
Jörg

**********************************************************************************
 Jörg Tiedemann                                   jorg.tiedemann at lingfil.uu.se
 Dep. of Linguistics and Philology           http://stp.lingfil.uu.se/~joerg/
 Uppsala University                                  tel:  +46 (0)18 - 471 1412
 Box 635, SE-751 26 Uppsala/SWEDEN    fax: +46 (0)18 - 471 1094



On Sep 18, 2014, at 8:47 PM, John F Sowa wrote:

On 9/18/2014 9:35 AM, Ralf Steinberger wrote:
Readers on this list may be interested to hear that the 2014 release
of the DGT-Translation Memory is now available for download.

That is an excellent resource for the languages of the EU.  But it would
be helpful to have at least a subset for languages outside the EU --
especially for the official languages of the UN that are not in the EU.

Are such resources available?

John



_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora


