33.1690, Books: Natural Language Processing for Corpus Linguistics: Dunn

The LINGUIST List linguist at listserv.linguistlist.org
Thu May 12 20:16:29 UTC 2022


LINGUIST List: Vol-33-1690. Thu May 12 2022. ISSN: 1069 - 4875.

Subject: 33.1690, Books: Natural Language Processing for Corpus Linguistics: Dunn

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
      Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Billy Dickson <billyd at linguistlist.org>
================================================================


Date: Thu, 12 May 2022 16:16:14
From: Ellena Moriarty [ellena.moriarty at cambridge.org]
Subject: Natural Language Processing for Corpus Linguistics: Dunn

 


Title: Natural Language Processing for Corpus Linguistics 
Publication Year: 2022 
Publisher: Cambridge University Press
	   http://www.cambridge.org/linguistics
	

Book URL: https://www.cambridge.org/us/academic/subjects/languages-linguistics/applied-linguistics-and-second-language-acquisition/natural-language-processing-corpus-linguistics?format=PB 


Author: Jonathan Dunn

Paperback: ISBN:  9781009074438 Pages:  Price: U.S. $ 20.00
Paperback: ISBN:  9781009074438 Pages:  Price: U.K. £ 15.00
Paperback: ISBN:  9781009074438 Pages:  Price: Europe EURO 17.51


Abstract:

Corpus analysis can be expanded and scaled up by incorporating computational
methods from natural language processing. This Element shows how text
classification and text similarity models can extend our ability to undertake
corpus linguistics across very large corpora. These computational methods are
becoming increasingly important as corpora grow too large for more traditional
types of linguistic analysis. We draw on five case studies to show how and why
to use computational methods, ranging from usage-based grammar to authorship
analysis to using social media for corpus-based sociolinguistics. Each section
is accompanied by an interactive code notebook that shows how to implement the
analysis in Python. A stand-alone Python package is also available to help
readers use these methods with their own data. Because large-scale analysis
introduces new ethical problems, this Element pairs each new methodology with
a discussion of potential ethical implications.
 



Accessing the Code Notebooks; 1. Computational Linguistic Analysis; 2. Text
Classification; 3. Text Similarity; 4. Validation and Visualization; 5.
Conclusions.
 


Linguistic Field(s): Computational Linguistics
                     Text/Corpus Linguistics


Written In: English  (eng)

See this book announcement on our website: 
http://linguistlist.org/pubs/books/get-book.cfm?BookID=161354




------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-33-1690	
----------------------------------------------------------






More information about the LINGUIST mailing list