35.2580, Confs: Linguistic Data and Language Comparison in Light of the ‘Quantitative Turn’ and ‘Big Data’ – a Workshop and Symposium

The LINGUIST List linguist at listserv.linguistlist.org
Mon Sep 23 19:05:02 UTC 2024


LINGUIST List: Vol-35-2580. Mon Sep 23 2024. ISSN: 1069 - 4875.

Subject: 35.2580, Confs: Linguistic Data and Language Comparison in Light of the ‘Quantitative Turn’ and ‘Big Data’ – a Workshop and Symposium

Moderator: Steven Moran (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Joel Jenkins, Daniel Swanson, Erin Steitz
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Editor for this issue: Erin Steitz <ensteitz at linguistlist.org>

================================================================


Date: 20-Sep-2024
From: Sandra Auderset [sandra.auderset at unibe.ch]
Subject: Linguistic Data and Language Comparison in Light of the ‘Quantitative Turn’ and ‘Big Data’ – a Workshop and Symposium


Linguistic Data and Language Comparison in Light of the ‘Quantitative
Turn’ and ‘Big Data’ – a Workshop and Symposium

Date: 07-May-2025 - 09-May-2025
Location: Department of Linguistics, University of Bern, Switzerland
Contact: Sandra Auderset
Contact Email: data_ws_unibe at gmx.ch

Linguistic Field(s): Discipline of Linguistics; General Linguistics;
Language Documentation; Linguistic Theories; Typology

Meeting Description:

This workshop provides a forum for in-depth discussion and exchange on
theoretical and methodological issues related to linguistic data and
language comparison by exploring the relationship of data gathering,
analysis, and annotation practices in linguistics in light of the
'quantitative turn' and the advent of ‘big data’. A particular focus
lies on synchronic and diachronic comparison and the role of
understudied/endangered languages.
In recent years, linguistics has undergone a ‘quantitative turn’. The
uptake for such approaches has been greater in some subfields than in
others. In subfields focusing on language description and comparison
(both synchronic and diachronic) many remain skeptical, especially
with respect to the integration of understudied and/or endangered
languages. Quantitative methods applied in linguistic typology and
historical linguistics often need relatively ‘big’ data sets resulting
in a rise in large-scale databases including extensive reference
catalogs such as Glottolog (Hammarström et al. 2024), comparative
typological data bases such as the World Atlas of Language Structures
(Dryer & Haspelmath 2013) and more recently GramBank (Skirgård et al.
2023), and large cognate-coded word lists such as IE-Cor (Heggarty et
al. 2023), among many others. These resources are often used to make
broad, universal claims about the interplay of language and cognition
(Hahn 2020), language and social structure (Lupyan & Dale 2010,
Shcherbakova et al. 2023), language and genetics (Dediu 2011), and
language and climate (Everett et al. 2015, Everett et al. 2016), among
others. ‘Big data’ sets all involve standardization, multiple levels
of abstraction, and a view of language as composed of separable,
domain-specific building blocks. The alternative view – of language as
interaction and an interconnected system – has led to lower-level
(regional, family-specific, etc.), but more detailed and less
abstractive micro-typologies. Such studies reveal that there is
considerable internal diversity within language families and
subgroups, which is key to understanding diachronic processes.
The question of how to model diachronic processes has also been at the
center of recent developments in historical linguistics. Bayesian
phylogenetics have found  wider adoption in the past decade but remain
controversial. The main points of skepticism concern whether
biological models of evolution are applicable to languages at all
(e.g. Campbell 2024: 23) and the issue of relying solely on ‘lexical’
data. At the same time, classifications based on expert opinions and
qualitative methods are often accepted without much scrutiny, even if
the data remain inaccessible to other scholars. Thus there has been a
move towards open datasets in historical linguistics, often with
considerable efforts to make the analytical choices, for example in
cognate annotation, transparent.
Finer-grained, family-internal and truly bottom-up approaches and
methodologies are easier to connect with language documentation and
description efforts that have increased over the past decades.
However, the question of how to integrate this data into comparative
studies, both qualitative and quantitative, is not resolved. This is
especially pertinent for spoken language data, which forms the bulk of
language documentation, but so far plays only a minor role in typology
and diachronic linguistics.
In general, discussions on the notion and role of data with respect to
analysis and theory often revolve around how language-specific data
can be related to cross-linguistic definitions and concepts (see e.g.
Alfieri et al. 2021). Much less attention is paid to the ontological
underpinnings of what constitutes (primary/secondary) data and how the
preparation and annotation of this data influences qualitative and
quantitative theories and models.

Full call with references: https://tinyurl.com/45t6ta6y
More info coming soon: https://tinyurl.com/3uh4xwke

Format and target audience:
The workshop/symposium consists of short talks by the participants,
invited keynotes, and discussion sessions. It is aimed primarily at
early career researchers in linguistics or adjacent fields. Preference
will be given to scholars working on endangered and/or understudied
languages and/or on methods and tools that advance research on such
languages.



------------------------------------------------------------------------------

********************** LINGUIST List Support ***********************
Please consider donating to the Linguist List to support the student editors:

https://www.paypal.com/donate/?hosted_button_id=87C2AXTVC4PP8

LINGUIST List is supported by the following publishers:

Bloomsbury Publishing http://www.bloomsbury.com/uk/

Brill http://www.brill.com

Cambridge University Press http://www.cambridge.org/linguistics

De Gruyter Mouton https://cloud.newsletter.degruyter.com/mouton

Equinox Publishing Ltd http://www.equinoxpub.com/

European Language Resources Association (ELRA) http://www.elra.info

John Benjamins http://www.benjamins.com/

Language Science Press http://langsci-press.org

Lincom GmbH https://lincom-shop.eu/

Multilingual Matters http://www.multilingual-matters.com/

Narr Francke Attempto Verlag GmbH + Co. KG http://www.narr.de/

Oxford University Press http://www.oup.com/us

Wiley http://www.wiley.com


----------------------------------------------------------
LINGUIST List: Vol-35-2580
----------------------------------------------------------



More information about the LINGUIST mailing list