37.796, FYI: Publication of the Parallel European Corpus of Informal Interaction (PECI)
The LINGUIST List
linguist at listserv.linguistlist.org
Thu Feb 26 14:05:02 UTC 2026
LINGUIST List: Vol-37-796. Thu Feb 26 2026. ISSN: 1069 - 4875.
Moderator: Steven Moran (linguist at linguistlist.org)
Managing Editor: Valeriia Vyshnevetska
Team: Helen Aristar-Dry, Mara Baccaro, Daniel Swanson
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org
Homepage: http://linguistlist.org
Editor for this issue: Daniel Swanson <daniel at linguistlist.org>
================================================================
Date: 26-Feb-2026
From: Siegwalt Lindenfelser [lindenfelser at ids-mannheim.de]
Subject: Publication of the Parallel European Corpus of Informal Interaction (PECI)
The program area “Oral corpora” at the Leibniz Institute for the
German Language (IDS Mannheim) is pleased to announce that the
“Parallel European Corpus of Informal Interaction” (PECI) has been
released this week as part of version 2.25 of the “Database for Spoken
German” (DGD). The corpus is now available online for scientific
research and academic teaching purposes after registration:
https://dgd.ids-mannheim.de/
The PECI corpus is a multilingual comparative corpus of spoken
everyday social interaction. It was collected as part of the project
“Norms, Rules, and Morality across Languages” (NoRM-aL, 2020-2023, led
by Jörg Zinken, IDS Mannheim, funded by the Leibniz-Gemeinschaft) and
contains audio and video recordings in four languages: German,
(British) English, Italian and Polish. For each language, three social
activities were recorded from two camera angles: family breakfasts,
game nights with friends and relatives, and car rides with
friends.
The PECI corpus comprises 83 recordings with 254 speakers and a total
length of around 77 hours; the transcripts contain around 600,000
tokens. In addition, detailed metadata on the interactions
and the participants is provided in four languages, along with
additional materials (in English and German).
Further details on the new data and functionality included in
Release 2.25 of the DGD are available in the DGD release mail
(in German):
https://agd.ids-mannheim.de/Releasemails/Release-Text_2_25.html
Linguistic Field(s): Pragmatics
Text/Corpus Linguistics
Subject Language(s): English (eng)
German (deu)
Italian (ita)
Polish (pol)
------------------------------------------------------------------------------