36.2863, Software: Corpus Sense
The LINGUIST List
linguist at listserv.linguistlist.org
Wed Sep 24 14:05:02 UTC 2025
LINGUIST List: Vol-36-2863. Wed Sep 24 2025. ISSN: 1069 - 4875.
Subject: 36.2863, Software: Corpus Sense
Moderator: Steven Moran (linguist at linguistlist.org)
Managing Editor: Valeriia Vyshnevetska
Team: Helen Aristar-Dry, Mara Baccaro, Daniel Swanson
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org
Homepage: http://linguistlist.org
Editor for this issue: Daniel Swanson <daniel at linguistlist.org>
================================================================
Date: 23-Sep-2025
From: Antonio Moreno-Ortiz [amo at uma.es]
Subject: Corpus Sense
Corpus Sense is a new web-based platform for corpus linguistics and
discourse analysis. The application is free to use for academic
purposes, and access can be requested via the Contact page on the
website: https://corpus-sense.app
Corpus Sense has been designed to support both research and teaching,
offering a user-friendly interface alongside powerful NLP tools. Its
main features include:
- Corpus management: Upload, organize, and share corpora
securely.
- Concordancing & KWIC: Explore keyword-in-context lines with
flexible filters.
- Keyword and collocation extraction: Identify salient patterns
and associations.
- Topic modeling (via BERTopic with interpretable LLM-generated
labels).
- Sentiment analysis: Assess evaluative language using advanced
NLP methods.
- Named Entity Recognition & semantic search: Enhance corpus
exploration.
- Visualization tools: Interactive charts, word clouds, and
network diagrams.
- Custom NLP pipelines: Integration with spaCy, embeddings, and
AI-enhanced resources.
A full description of the application, including its development,
architecture, and applications, can be found in:
Moreno-Ortiz, Antonio (2025). Corpus Sense: A Comprehensive Tool for
Advanced Text and Discourse Exploration. Applied Corpus Linguistics,
5, 100145.
https://doi.org/10.1016/j.acorp.2025.100145
Linguistic Field(s): Computational Linguistics
Discourse Analysis
Text/Corpus Linguistics
------------------------------------------------------------------------------
********************** LINGUIST List Support ***********************
Please consider donating to the Linguist List, a U.S. 501(c)(3) not for profit organization:
https://www.paypal.com/donate/?hosted_button_id=87C2AXTVC4PP8
LINGUIST List is supported by the following publishers:
Bloomsbury Publishing http://www.bloomsbury.com/uk/
Cambridge University Press http://www.cambridge.org/linguistics
Cascadilla Press http://www.cascadilla.com/
De Gruyter Brill https://www.degruyterbrill.com/?changeLang=en
Edinburgh University Press http://www.edinburghuniversitypress.com
John Benjamins http://www.benjamins.com/
Language Science Press http://langsci-press.org
MIT Press http://mitpress.mit.edu/
Multilingual Matters http://www.multilingual-matters.com/
Narr Francke Attempto Verlag GmbH + Co. KG http://www.narr.de/
Netherlands Graduate School of Linguistics / Landelijke (LOT) http://www.lotpublications.nl/
Peter Lang AG http://www.peterlang.com
----------------------------------------------------------
LINGUIST List: Vol-36-2863
----------------------------------------------------------
More information about the LINGUIST
mailing list