33.393, FYI: The DiGreC Treebank

The LINGUIST List linguist at listserv.linguistlist.org
Wed Feb 2 07:15:54 UTC 2022


LINGUIST List: Vol-33-393. Wed Feb 02 2022. ISSN: 1069 - 4875.

Subject: 33.393, FYI: The DiGreC Treebank

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
      Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Everett Green <everett at linguistlist.org>
================================================================


Date: Wed, 02 Feb 2022 02:14:04
From: Morgan Macleod [m.macleod at ulster.ac.uk]
Subject: The DiGreC Treebank

 
We would like to announce the availability of a new corpus resource, the
DiGreC (Diachrony of Greek Case) treebank. This corpus contains selected
sentences from Greek texts ranging from the 8th century BC to the 17th century
AD, with detailed morphosyntactic and semantic annotation. The data were
collected as part of the project ‘Investigating Variation and Change: Case in
Diachrony’, involving researchers at Ulster University and the University of
Crete and funded by the Arts & Humanities Research Council (AH/P006612/1).  A
primary focus of this project was on the changing syntax of Greek ditransitive
verbs; however, the data collected have the potential to be of use for those
seeking attestations of a wide variety of constructions.

The corpus can be searched through our free web interface at
https://cid.ulster.ac.uk.  The raw data are also available in CSV and XML
format, the latter using a version of the PROIEL schema.  In its current form
the corpus comprises excerpts from 655 texts, for a total of 3385 sentences
and 56,440 word tokens; however, this is an evolving resource and the data set
will continue to expand in future versions.  Detailed specifications are found
in our recently published article:

Macleod, Morgan, Elena Anagnostopoulou, Dionysios Mertyris, and Christina
Sevdali. 2021. “The DiGreC Treebank”. Research Data Journal for the Humanities
and Social Sciences 6.1: 1-12. https://doi.org/10.1163/24523666-06010004.

We hope that this resource will be of value for research on the history of the
Greek language and the various morphosyntactic phenomena that it exemplifies.
 



Linguistic Field(s): Text/Corpus Linguistics

Subject Language(s): Greek, Ancient (grc)
                     Greek, Modern (ell)

Language Family(ies): Indo-European





 



------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-33-393	
----------------------------------------------------------






More information about the LINGUIST mailing list