35.1155, FYI: RUEG completed, corpus (with English, German, Greek, Russian, Turkish data) will remain open-access
The LINGUIST List
linguist at listserv.linguistlist.org
Sat Apr 6 22:05:02 UTC 2024
LINGUIST List: Vol-35-1155. Sat Apr 06 2024. ISSN: 1069 - 4875.
Moderators: Malgorzata E. Cavar, Francis Tyers (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Everett Green, Daniel Swanson, Maria Lucero Guillen Puon, Zackary Leech, Lynzie Coburn, Natasha Singh, Erin Steitz
Editor for this issue: Justin Fuller <justin at linguistlist.org>
LINGUIST List is hosted by Indiana University College of Arts and Sciences.
================================================================
Date: 05-Apr-2024
From: Heike Wiese [heike.wiese at hu-berlin.de]
Subject: RUEG completed, corpus (with English, German, Greek, Russian, Turkish data) will remain open-access
After six successful years, the Research Unit "Emerging Grammars in
Language-Contact Situations" (https://hu.berlin/rueg) came to an end
in March 2024.
The RUEG corpus that we created will remain fully open-access. The
corpus contains naturalistic yet systematically comparable language
data produced by a total of 774 heritage and monolingually raised
speakers:
- in English, German, Greek, Russian, and Turkish
- in formal and informal settings, spoken and written
- by multilingual and monolingual speakers, adolescents and adults
- in Germany, Greece, Russia, Turkey, and the US
Data in the corpus were collected via elicited narrations of a short
video clip of a minor car accident. In addition to the basic
transcription (over 550K words), the data are annotated for syntactic
spans, lemmata, language, and part of speech. Subsets of the data are
also annotated for specific phonological, lexical, morphosyntactic,
and discourse-pragmatic phenomena.
We encourage everyone to keep using the RUEG corpus as a resource for
research on language contact; language variation and change; majority
and heritage language use; register differentiation; youth language;
computer-mediated communication (CMC); lexicon, morphosyntax, and
discourse-pragmatics; and much more.
Check out the RUEG corpus at https://hu.berlin/rueg-corpus
Linguistic Field(s): General Linguistics
                     Text/Corpus Linguistics
Subject Language(s): English (eng)
                     German (deu)
                     Greek, Modern (ell)
                     Russian (rus)
                     Turkish (tur)