[Corpora-List] Two new COCA-based resources for academic English
Mark Davies
Mark_Davies at byu.edu
Mon Apr 9 13:06:13 UTC 2012
The following announcement is probably most relevant to those who use corpora for language teaching (especially academic English). It's written for a general audience (rather than the more sophisticated computational linguists found here on CORPORA), but it will hopefully still be of interest and use to some people here.
-------------------------------------
If you are interested in academic English - for teaching or learning - there are two new, free corpus-based resources that might be of interest to you. These are based on the 110 million words of academic texts in the Corpus of Contemporary American English<http://corpus.byu.edu/coca/> [COCA] (85 million words in academic journals and 25 million words in more academically-oriented magazine articles).
1. The new site www.academicwords.info<http://www.academicwords.info> contains free COCA-based academic wordlists. There are important differences<http://www.academicwords.info/compare.asp> between these lists and the Academic Word List created by Coxhead (2000), and we believe that the lists that these new lists provide better coverage of academic English and that they have a format that better enhances learning and teaching. The three sets of word lists, which have been created in conjunction with Prof. Dee Gardner of BYU, are:
-- Word families (SAMPLE<http://www.academicwords.info/samples/families.pdf>): The top 1000 word families of academic English (with nearly 3000 words total). Unlike the traditional Academic Word List, ours contain separate entries for different parts of speech, so you know, for example, whether abstract is used more as a noun, verb, or adjective. The words are also color-coded to let you know whether the word is a "general" academic word, or whether it is a more "technical" one that occurs in just a few sub-genres. And most importantly, the entries are listed in order of frequency, to help you focus more on words that you will actually see in the real world -- rather than just having a mass of unorganized words in each word family.
-- General "core" academic English (SAMPLE<http://www.academicwords.info/samples/general-core.pdf>): The top 3500 words (lemmas) in COCA Academic (listed individually, rather than by word family)
-- Technical / sub-genre lists (SAMPLE<http://www.academicwords.info/samples/subgenre-technical.pdf>): The top 1000 words in each of the nine academic sub-genres (Business, Law, Medicine, Science, Humanities, etc)
2. We have created a new interface at www.wordandphrase.info/academic/<http://www.wordandphrase.info/academic/> for just academic English. It has the same features as the general WordAndPhrase<http://www.wordandphrase.info/> site, but all of the data is based strictly on the 110 million words of academic English in COCA.
-- Frequency listing: Browse through these lists (including word families) to see detailed information (all on one screen, with extensive links between resources): definition, frequency by academic sub-genre (e.g. Medicine, Business, Humanities), synonyms, and collocates and concordance lines (based just on academic English).
-- Input texts: As with the general interface, you can input an entire text (such as a journal article, or an academic paper that you have written) and it will give you detailed information about the words and phrases in the text. You can download word lists based on your text, and you can click on phrases in your text to see related phrases from COCA.
(By the way, if you previously had trouble accessing WordAndPhrase<http://www.wordandphrase.info/> with your account, please try again; we've fixed a few bugs.)
I hope that these two new corpus-based resources on academic English will be of interest and value to you for teaching, learning, and research.
Best,
Mark Davies
Brigham Young University
============================================
Mark Davies
Professor of (Corpus) Linguistics
Brigham Young University
(phone) 801-422-9168 / (fax) 801-422-0906
http://davies-linguistics.byu.edu
** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20120409/bc5e0a8e/attachment.htm>
-------------- next part --------------
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list