33.1713, Books: Designing and Evaluating Language Corpora: Egbert, Biber, Gray

The LINGUIST List linguist at listserv.linguistlist.org
Fri May 13 12:17:06 UTC 2022


LINGUIST List: Vol-33-1713. Fri May 13 2022. ISSN: 1069 - 4875.

Subject: 33.1713, Books: Designing and Evaluating Language Corpora: Egbert, Biber, Gray

Moderator: Malgorzata E. Cavar (linguist at linguistlist.org)
Student Moderator: Billy Dickson
Managing Editor: Lauren Perkins
Team: Helen Aristar-Dry, Everett Green, Sarah Goldfinch, Nils Hjortnaes,
      Joshua Sims, Billy Dickson, Amalia Robinson, Matthew Fort
Jobs: jobs at linguistlist.org | Conferences: callconf at linguistlist.org | Pubs: pubs at linguistlist.org

Homepage: http://linguistlist.org

Please support the LL editors and operation with a donation at:
           https://funddrive.linguistlist.org/donate/

Editor for this issue: Billy Dickson <billyd at linguistlist.org>
================================================================


Date: Fri, 13 May 2022 08:16:49
From: Ellena Moriarty [ellena.moriarty at cambridge.org]
Subject: Designing and Evaluating Language Corpora: Egbert, Biber, Gray

 


Title: Designing and Evaluating Language Corpora 
Subtitle: A Practical Framework for Corpus Representativeness 
Publication Year: 2022 
Publisher: Cambridge University Press
	   http://www.cambridge.org/linguistics
	

Book URL: https://www.cambridge.org/us/academic/subjects/languages-linguistics/research-methods-linguistics/designing-and-evaluating-language-corpora-practical-framework-corpus-representativeness?format=PB 


Author: Jesse Egbert
Author: Douglas Biber
Author: Bethany Gray

Hardback: ISBN:  9781107151383 Pages:  Price: U.S. $ 99.99
Hardback: ISBN:  9781107151383 Pages:  Price: U.K. £ 74.99
Hardback: ISBN:  9781107151383 Pages:  Price: Europe EURO 87.52
Paperback: ISBN:  9781316605882 Pages:  Price: U.S. $ 34.99
Paperback: ISBN:  9781316605882 Pages:  Price: U.K. £ 26.99
Paperback: ISBN:  9781316605882 Pages:  Price: Europe EURO 31.50


Abstract:

Corpora are ubiquitous in linguistic research, yet to date, there has been no
consensus on how to conceptualize corpus representativeness and collect corpus
samples. This pioneering book bridges this gap by introducing a conceptual and
methodological framework for corpus design and representativeness. Written by
experts in the field, it shows how corpora can be designed and built in a way
that is both optimally suited to specific research agendas, and adequately
representative of the types of language use in question.  It considers
questions such as 'what types of texts should be included in the corpus?', and
'how many texts are required?' – highlighting that the degree of
representativeness rests on the dual pillars of domain considerations and
distribution considerations. The authors introduce, explain, and illustrate
all aspects of this corpus representativeness framework in a step-by-step
fashion, using examples and activities to help readers develop practical
skills in corpus design and evaluation.
 



1. Introduction; 2. Approaches to representativeness in previous corpus
linguistic research; 3. Corpus representativeness: a conceptual and
methodological framework; 4. Domain considerations; 5. Distribution
considerations; 6. The influence of domain and distribution considerations on
corpus representativeness – bringing it all together; 7. Corpus design and
representativeness in practice; Glossary; Appendix A. Example articles
documenting existing corpora; Appendix B. Survey of corpus design and
compilation practices.
 


Linguistic Field(s): Text/Corpus Linguistics


Written In: English  (eng)

See this book announcement on our website: 
http://linguistlist.org/pubs/books/get-book.cfm?BookID=161297




------------------------------------------------------------------------------

***************************    LINGUIST List Support    ***************************
 The 2020 Fund Drive is under way! Please visit https://funddrive.linguistlist.org
  to find out how to donate and check how your university, country or discipline
     ranks in the fund drive challenges. Or go directly to the donation site:
                   https://crowdfunding.iu.edu/the-linguist-list

                        Let's make this a short fund drive!
                Please feel free to share the link to our campaign:
                    https://funddrive.linguistlist.org/donate/
 


----------------------------------------------------------
LINGUIST List: Vol-33-1713	
----------------------------------------------------------






More information about the LINGUIST mailing list