[Corpora-List] BNC word list

Rayson, Paul rayson at exchange.lancs.ac.uk
Wed Jul 2 10:29:41 UTC 2003

Dear Sylvana,

Please see the page: http://www.comp.lancs.ac.uk/ucrel/bncfreq/flists.html which is part of the Companion website for the book "Word Frequencies in Written and Spoken English: based on the British National Corpus."
The full frequency list is available there in two versions: Unix compressed and WinZip compressed.
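
If you would still like to build a list yourself rather than download one, the counting step can be sketched in a few lines of Python. This is only a rough illustration, not how the published lists were produced: it assumes the corpus markup has already been stripped to plain text, and the tokenising pattern and the name `word_frequencies` are my own choices for the example. A different definition of "word" will give a different number of entries.

```python
import re
from collections import Counter

# Assumption: a "word" is a run of ASCII letters, optionally followed by an
# apostrophe and more letters (e.g. "didn't"). Numbers and hyphens are ignored.
TOKEN = re.compile(r"[a-z]+(?:'[a-z]+)?")

def word_frequencies(texts):
    """Count lower-cased word forms across an iterable of plain-text strings."""
    counts = Counter()
    for text in texts:
        counts.update(TOKEN.findall(text.lower()))
    return counts

# Tiny stand-in for corpus files that have already been read into memory.
sample = ["The cat sat on the mat.", "The dog didn't sit."]
freqs = word_frequencies(sample)

# Print the list, most frequent word forms first.
for word, n in freqs.most_common():
    print(word, n)
```

Because the counter is updated one text at a time, the same function can be fed files one by one, so the whole corpus never has to be held in memory at once.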


Dr. Paul Rayson
Director of UCREL
University Centre for Computer Corpus Research on Language
Computing Department, Lancaster University,
Lancaster, LA1 4YR, UK.
Web: http://www.comp.lancs.ac.uk/computing/users/paul/
Tel: +44 1524 593786  Fax: +44 1524 593608

-----Original Message-----
From: krausse [mailto:krausse at fh-nordhausen.de]
Sent: 02 July 2003 11:09
To: corpus list
Subject: [Corpora-List] BNC word list

Dear everyone,

A couple of weeks ago I asked whether anybody could point me in the
direction of a word list of the BNC. I had a few answers to that and
followed up some trails but the word lists I found were all restricted
in some way. I tried to load the whole BNC into WordSmith and create my
own version but I didn't manage to create a word list bigger than 300
and something thousand words, which can't be all the words of the BNC
(although I played around with the settings).

So I am just trying again and asking whether anyone knows of the
existence of a word list with the full 940,000 words from the BNC.

Many thanks,
Sylvana Krausse
