35.1155, FYI: RUEG completed, corpus (with English, German, Greek, Russian, Turkish data) will remain open-access

The LINGUIST List linguist at listserv.linguistlist.org
Sat Apr 6 22:05:02 UTC 2024


LINGUIST List: Vol-35-1155. Sat Apr 06 2024. ISSN: 1069 - 4875.

Moderators: Malgorzata E. Cavar, Francis Tyers (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Everett Green, Daniel Swanson, Maria Lucero Guillen Puon, Zackary Leech, Lynzie Coburn, Natasha Singh, Erin Steitz
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Editor for this issue: Justin Fuller <justin at linguistlist.org>

LINGUIST List is hosted by Indiana University College of Arts and Sciences.
================================================================


Date: 05-Apr-2024
From: Heike Wiese [heike.wiese at hu-berlin.de]
Subject: RUEG completed, corpus (with English, German, Greek, Russian, Turkish data) will remain open-access


After six successful years, the Research Unit "Emerging Grammars in
Language-Contact Situations" (https://hu.berlin/rueg) came to an end
in March 2024.

The RUEG corpus that we created will remain fully open-access. It
contains naturalistic yet systematically comparable data from the
language productions of a total of 774 heritage and monolingually
raised speakers:
- in English, German, Greek, Russian, and Turkish
- in formal and informal settings, spoken and written
- by multilingual and monolingual speakers, adolescents and adults
- in Germany, Greece, Russia, Turkey, and the US

Data in the corpus were collected via elicited narrations of a short
video clip of a minor car accident. In addition to the basic
transcription (over 550K words), the data are annotated for syntactic
spans, lemmata, language, and part of speech. Subsets of the data are
also annotated for specific phonological, lexical, morphosyntactic,
and discourse-pragmatic phenomena.

We encourage everyone to keep using the RUEG corpus as a resource for
research on language contact; language variation and change; majority
and heritage language use; register differentiation; youth language;
computer-mediated communication (CMC); lexicon, morphosyntax, and
discourse-pragmatics; and much more.

Check out the RUEG corpus at https://hu.berlin/rueg-corpus

Linguistic Field(s): General Linguistics
                     Text/Corpus Linguistics

Subject Language(s): English (eng)
                     German (deu)
                     Greek, Modern (ell)
                     Russian (rus)
                     Turkish (tur)



