32.1938, FYI: Danish Gigaword Corpus v1.0 Released

The LINGUIST List linguist at listserv.linguistlist.org
Fri Jun 4 10:49:57 UTC 2021


LINGUIST List: Vol-32-1938. Fri Jun 04 2021. ISSN: 1069 - 4875.

Subject: 32.1938, FYI: Danish Gigaword Corpus v1.0 Released

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Jeremy Coburn, Lauren Perkins
Managing Editor: Becca Morris
Team: Helen Aristar-Dry, Everett Green, Sarah Robinson, Nils Hjortnaes, Joshua Sims, Billy Dickson
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Fri, 04 Jun 2021 06:49:03
From: Leon Derczynski [leod at itu.dk]
Subject: Danish Gigaword Corpus v1.0 Released

 
This week marks the release of Danish Gigaword v1.0, with over 1,000,000,000
words of Danish, spanning centuries, dialects, registers, modalities, and
domains. This marks the largest single collection of openly-licensed documents
in Danish, and we hope helps bring the language up from an underprivileged to
a well-resourced one.

Links:
* The DAGW homepage, https://gigaword.dk/ , where there's a download link and
license information;
* The paper in the ACL anthology,
https://www.aclweb.org/anthology/2021.nodalida-main.46/

Thank you for your interest.

Faithfully,

Leon Derczynski (IT University of Copenhagen)
Manuel R. Ciosici (University of Southern California / IT University of
Copenhagen)
 



Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics

Subject Language(s): Danish (dan)





 



------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-32-1938	
----------------------------------------------------------






More information about the LINGUIST mailing list