37.1172, Confs: Pre-conference Workshop at CLIB 2026 - FAIR Language Resources in NLP: Stewardship, Reuse and Long-Term Sustainability (Bulgaria)
The LINGUIST List
linguist at listserv.linguistlist.org
Mon Mar 23 15:05:02 UTC 2026
LINGUIST List: Vol-37-1172. Mon Mar 23 2026. ISSN: 1069 - 4875.
Subject: 37.1172, Confs: Pre-conference Workshop at CLIB 2026 - FAIR Language Resources in NLP: Stewardship, Reuse and Long-Term Sustainability (Bulgaria)
Moderator: Steven Moran (linguist at linguistlist.org)
Managing Editor: Valeriia Vyshnevetska
Team: Helen Aristar-Dry, Mara Baccaro, Daniel Swanson
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org
Homepage: http://linguistlist.org
Editor for this issue: Valeriia Vyshnevetska <valeriia at linguistlist.org>
================================================================
Date: 21-Mar-2026
From: Milena Dobreva [milena.dobreva at ustrath.ac.uk]
Subject: Pre-conference Workshop at CLIB 2026 - FAIR Language Resources in NLP: Stewardship, Reuse and Long-Term Sustainability
Pre-conference Workshop at CLIB 2026 - FAIR Language Resources in NLP:
Stewardship, Reuse and Long-Term Sustainability
Short Title: Workshop FAIR Language Resources in NLP
Date: 07-Sep-2026 - 07-Sep-2026
Location: Sofia, Bulgaria
Contact: Milena Dobreva
Contact Email: milena.dobreva at ustrath.ac.uk
Meeting URL: https://dcl.bas.bg/clib/
Linguistic Field(s): General Linguistics
Submission Deadline: 22-Apr-2026
Language resources are the foundation of linguistic research and NLP.
Corpora, lexicons, annotated datasets, benchmarks, and models are
produced at an unprecedented pace. Yet their long-term stewardship,
interoperability, and reuse remain inconsistent and often fragile.
Rapid creation has outpaced sustainable design. This workshop aims to
bring together researchers, infrastructure providers, data stewards,
and policy actors who are committed to building durable language
resource ecosystems. We aim to address the pressing challenges of
sustaining datasets used in linguistic research and in the development
of NLP systems—from documentation and versioning to governance,
licensing, and infrastructure support. The workshop will explore how
FAIR principles (Findable, Accessible, Interoperable, Reusable) can be
meaningfully operationalised for language resources in NLP and
computational linguistics.
Important Dates:
Submission deadline: 22 April 2026
Notification of acceptance: 22 May 2026
Camera-ready deadline: To be confirmed
Workshop date: 7 September 2026
Submission Guidelines:
The proposal considers short papers (4-6 pages), which will be
delivered in 10 min slots. Submissions should be made using the CLIB
Word template (camera-ready) available at
https://dcl.bas.bg/clib/instructions-for-authors/.
Submission link: https://easychair.org/conferences/?conf=fairclib2026
List of Topics:
1. Technical Foundations
- Designing language resources so they are interoperable,
transparent, and structurally reusable.
- Domain-specific FAIR implementation strategies for corpora,
lexicons, datasets, and models
- Metadata, paradata, and annotation transparency frameworks
- Repository architectures and infrastructure design for linguistic
data
2. Lifecycle & Reuse
- Еnsuring language resources remain usable, traceable, and
measurable across research cycles.
- Documentation, versioning, and provenance tracking for evolving
resources
- Persistent identifiers and citation mechanisms for language
datasets
- Methods for tracking, measuring, and evidencing reuse
- Critical reflections and lessons learned from implementation
challenges
- From raw data to FAIR-ready assets: preprocessing, cleaning, and
quality assurance workflows
- Replicability of the experiments over the language resources
3. Policy & Sustainability
- Creating the institutional and legal conditions that allow language
resources to endure.
- Legal, ethical, and licensing considerations in sharing and reusing
language data
- Governance structures and sustainability models beyond project
funding
- Raising awareness for and supporting communities in adapting best
practices
Program Committee (under development):
Edward J. Pinot Gray, DARIAH Coordination Office Paris, France
Egon W. Stemle, Institute for Applied Linguistics, Italy
Olha Kanishcheva, Friedrich Schiller University Jena, Germany
Petya Osenova, Faculty of Slavic Studies at Sofia University “St.
Kliment Ohridski” and Department of Linguistic Modelling and Knowledge
Processing at the Institute of Information and Communication
Technologies, Bulgarian Academy of Sciences
Ruslana Margova, GATE Institute, Sofia
Chairs:
Milena Dobreva (University of Strathclyde, IMI BAS)
Ivan Lambov (IMI BAS)
Invited Speakers:
Kaja Dobrovoljc, Research Associate, Laboratory for Machine Learning
and Language Technologies, Faculty of Computer and Information
Science, University of Ljubljana, Slovenia
Mietta Lennes, RI Specialist, FIN-CLARIN & Kielipankki – The Language
Bank of Finland, Department of Digital Humanities, University of
Helsinki, Finland
Beth Knazook, Senior Programme Manager, Research and Engagement,
Digital Repository of Ireland
Publication: We are finalising the proceedings information.
Venue: The conference will be held in Sofia, Bulgaria.
------------------------------------------------------------------------------
********************** LINGUIST List Support ***********************
Please consider donating to the Linguist List, a U.S. 501(c)(3) not for profit organization:
https://www.paypal.com/donate/?hosted_button_id=87C2AXTVC4PP8
LINGUIST List is supported by the following publishers:
Bloomsbury Publishing http://www.bloomsbury.com/uk/
Cambridge University Press http://www.cambridge.org/linguistics
Cascadilla Press http://www.cascadilla.com/
De Gruyter Brill https://www.degruyterbrill.com/?changeLang=en
Edinburgh University Press http://www.edinburghuniversitypress.com
European Language Resources Association (ELRA) http://www.elra.info
John Benjamins http://www.benjamins.com/
Language Science Press http://langsci-press.org
Lincom GmbH https://lincom-shop.eu/
MIT Press http://mitpress.mit.edu/
Multilingual Matters http://www.multilingual-matters.com/
Narr Francke Attempto Verlag GmbH + Co. KG http://www.narr.de/
Netherlands Graduate School of Linguistics / Landelijke (LOT) http://www.lotpublications.nl/
Peter Lang AG http://www.peterlang.com
SIL International Publications http://www.sil.org/resources/publications
----------------------------------------------------------
LINGUIST List: Vol-37-1172
----------------------------------------------------------
More information about the LINGUIST
mailing list