[Corpora-List] Distribution of tokens by POS in BNC or COCA
Mark Davies
Mark_Davies at byu.edu
Sun Mar 23 01:54:15 UTC 2014
>> Is there a table which lists the contents of BNC or COCA by POS - NN, NNP, JJ, VB and their variations?
You can quickly and easily generate the top 4000-5000 word forms for a given PoS (nn2, jjr, etc) via the web interfaces for COCA or the BNC (http://corpus.byu.edu).
You might also look at http://www.wordfrequency.info/100k.asp (a bit pricey, though, unless you need all of that info).
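Once per-PoS word lists have been collected, a table of tokens by PoS could be tallied with a few lines of Python. This is only a minimal sketch: the tab-separated layout (word form, PoS tag, frequency), the function name `pos_distribution`, and the sample rows are all assumptions made for illustration, not the actual export format or real counts from COCA or the BNC.

```python
from collections import Counter
import csv
import io


def pos_distribution(rows):
    """Sum token frequencies per PoS tag from (word, pos, freq) rows."""
    totals = Counter()
    for word, pos, freq in rows:
        totals[pos.lower()] += int(freq)
    return totals


# Hypothetical sample data, invented for illustration only
# (assumed layout: word form <TAB> PoS tag <TAB> frequency).
SAMPLE = "time\tnn1\t120\nyears\tnn2\t80\nbetter\tjjr\t30\npeople\tnn2\t20\n"

rows = list(csv.reader(io.StringIO(SAMPLE), delimiter="\t"))
totals = pos_distribution(rows)
grand_total = sum(totals.values())

# Print one line per tag: tag, summed frequency, share of all listed tokens.
for pos, count in totals.most_common():
    print(f"{pos}\t{count}\t{count / grand_total:.1%}")
```

Note that totals built this way cover only the word forms present in the lists (for example the top few thousand per tag), so they undercount the true number of tokens for each tag.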
Best,
Mark Davies
============================================
Mark Davies
Professor of Linguistics / Brigham Young University
http://davies-linguistics.byu.edu/
** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================
________________________________________
From: corpora-bounces at uib.no [corpora-bounces at uib.no] on behalf of Khurshid Ahmad [kahmad at scss.tcd.ie]
Sent: Saturday, March 22, 2014 12:15 PM
To: corpora at uib.no
Subject: [Corpora-List] Distribution of tokens by POS in BNC or COCA
Dear All
Is there a table which lists the contents of BNC or COCA by POS - NN,
NNP, JJ, VB and their variations?
Apologies for using the bandwidth for such a simple query.
--
Best wishes
Khurshid Ahmad.
Professor of Computer Science
School of Computer Science and Statistics
Trinity College
Dublin 2
IRELAND
Phone: 00353 1 896 8429 (Labs: 00 353 1 8968435)
Fax 353 1 677 2204
Webpage: www.cs.tcd.ie/khurshid.ahmad
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora