36.1324, Summer Schools: Analysing Multimodal Language Data for Quantitative Social Science (United Kingdom)

The LINGUIST List linguist at listserv.linguistlist.org
Wed Apr 23 00:05:07 UTC 2025


LINGUIST List: Vol-36-1324. Wed Apr 23 2025. ISSN: 1069 - 4875.

Subject: 36.1324, Summer Schools: Analysing Multimodal Language Data for Quantitative Social Science (United Kingdom)

Moderator: Steven Moran (linguist at linguistlist.org)
Managing Editor: Justin Fuller
Team: Helen Aristar-Dry, Steven Franks, Joel Jenkins, Daniel Swanson, Erin Steitz
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Editor for this issue: Joel Jenkins <joel at linguistlist.org>

================================================================


Date: 22-Apr-2025
From: Charles Redmon [c.redmon at essex.ac.uk]
Subject: Analysing Multimodal Language Data for Quantitative Social Science


Analysing Multimodal Language Data for Quantitative Social Science

Host Institution: University of Essex
Website:
https://essexsummerschool.com/summer-school-facts/courses/ess-2025-course-list/2r/

Dates: 21-Jul-2025 - 01-Aug-2025
Location: Hybrid: University of Essex / Online, United Kingdom

Minimum Education Level: All welcome (student, faculty, researcher,
etc.)

Focus: Language data analysis and computing, with a particular
emphasis on challenges in handling multimodal (text, audio, video)
data.

Description:

As part of the 58th Annual Essex Summer School in Social Science Data
Analysis, we are hosting a two-week course on the analysis of
multimodal (text, audio, video) language data. This course will be
taught by Drs Nina Markl and Charles Redmon in coordination with wider
work on computational linguistics at the Institute for Analytics and
Data Science and the Department of Language and Linguistics at the
University of Essex.

Summary:
While there is great research interest in multimodal data (e.g.,
social media video), the large-scale analysis of such data is
challenging. Complementing ESS courses on text analysis, we will focus
on state-of-the-art tools to facilitate automatic transcription of
audio and video data and handwritten or printed (non-digitised) text.
We will furthermore briefly explore tools to automatically annotate
video. These systems allow students to make use of complex multimodal
data such as social media videos.

The first half of the course focuses on the theoretical background,
introducing core concepts and techniques to ensure students have a
basic understanding of the underlying technologies. The second half of
the course focuses on the application of state-of-the-art tools. Each
day consists of one theory session and one practical session, and
students will complete in-class programming exercises. During
the second week, students will complete a small data analysis project,
including analysis design, data extraction, data analysis and
visualisation.

Outcomes:
By the end of this course, students will:
- Understand the theoretical foundations of state-of-the-art
  (multimodal) large language models and of more conventional tools
  for automatic speech recognition and optical character recognition
- Understand the limitations of state-of-the-art language technologies
- Be able to design an appropriate data analysis methodology for audio,
  video and text data using state-of-the-art language technologies
- Be able to prepare video, audio and non-digitised text data for
  semi-automated analysis
- Be able to analyse and visualise text data

Prerequisites:
Some familiarity with Python is recommended but not required; attendees
should bring laptops for practical work.

Outline:
Week 1: Introducing multimodal language analysis
This week is focused on the core background theory and methodology,
with sample problem sets after each topic to apply concepts to
concrete data.
    Day 1: Introduction
    Day 2: Optical Character Recognition
    Day 3: Automatic Speech Recognition and Transcription
    Day 4: Image and video analysis
    Day 5: Large Language Models

Week 2: Developing data analysis protocols
This week is focused on applying the previous week's tools to the
students' own data analysis problems, as well as illustrating best
practice for a general project pipeline. Students are encouraged to
bring existing work of their own that they'd like input on and
assistance with.
    Day 6: Designing Data Analysis
        Data Ethics
        Developing a Data Collection Protocol
        Exploratory Analysis
        System validation
    Day 7: Extracting digitised text data from different sources
        Automatic Transcription
        Optical Character Recognition
        Data Wrangling
        Data validation
    Day 8: Text analysis
        Understanding differences between text and transcripts
        Making use of multimodality
    Day 9: Data Visualisation
        Visualising data
        Interpreting data
    Day 10: Future directions and limitations
        Developing independent data analysis protocols

Tuition: Depends on origin
Tuition Explanation: Please see
https://essexsummerschool.com/summer-school-facts/fee-structure/ for
details.

Financial Aid Applications accepted until 15-Apr-2025

Essex Summer School Scholarship for PhD students in the Global South
The Essex Summer School in Social Science Data Analysis is excited to
announce the Essex Summer School Scholarship for PhD students in the
Global South. These awards are intended to provide advanced methodological
training for graduate students in political science in the Global
South. We offer access to the training and the materials used in our
intermediate and advanced methods courses.
The Global South scholarship is open to students currently enrolled in
Political Science PhD programs in institutions located in the Global
South. The scholarship provides a tuition waiver to attend one of our
two-week intermediate or advanced sessions in the 2025 Essex Summer
School in Social Science Data Analysis (ESS). Students can attend in
person or virtually. Introductory courses do not qualify for the
Global South scholarship, but applicants are welcome to enrol in
introductory courses at their own expense.

Financial Aid Instructions:
Please see the following page for more information:
https://essexsummerschool.com/summer-school-facts/scholarship-opportunities/global-south-scholarship/

Linguistic Field(s): Computational Linguistics
                     Language Documentation
                     Lexicography
                     Text/Corpus Linguistics
                     Translation

Registration Open until 04-Jul-2025

Contact Person: Dr Charles Redmon
                Email: c.redmon at essex.ac.uk

Apply by Email: esumsda at essex.ac.uk
Apply on the web: https://essexsummerschool.com/application/

Registration Instructions:
Please see the following page for more details on the application
process:
https://essexsummerschool.com/summer-school-facts/fees/




