[Corpora-List] Distribution of tokens by POS in BNC or COCA

Mark Davies Mark_Davies at byu.edu
Sun Mar 23 01:54:15 UTC 2014


>> Is there a table which lists the contents of BNC or COCA by POS - NN, NNP, JJ, VB and their variations?

You can quickly and easily generate the top 4000-5000 word forms for a given PoS (nn2, jjr, etc) via the web interfaces for COCA or the BNC (http://corpus.byu.edu).

Might also look at http://www.wordfrequency.info/100k.asp (a bit pricey, though, unless you need all of that info.)

Best,

Mark Davies

============================================
Mark Davies
Professor of Linguistics / Brigham Young University
http://davies-linguistics.byu.edu/

** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================

________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] on behalf of Khurshid Ahmad [kahmad at scss.tcd.ie]
Sent: Saturday, March 22, 2014 12:15 PM
To: corpora at uib.no
Subject: [Corpora-List] Distribution of tokens by POS in BNC or COCA

Dear All
Is there a table which lists the contents of BNC or COCA by POS - NN,
NNP, JJ, VB and their variations?
Apologies for using the bandwidth for such a simple query.

--
Best wishes

Khurshid Ahmad.
Professor of Computer Science
School of Computer Science and Statistics
Trinity College
Dublin 2
IRELAND

Phone: 00353 1 896 8429 (Labs: 00 353 1 8968435)
Fax 353 1 677 2204
Webpage: www.cs.tcd.ie/khurshid.ahmad

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora

_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list