[Corpora-List] bnc word list

krausse krausse at fh-nordhausen.de
Tue Jun 17 12:37:33 UTC 2003


Dear list members,

Having followed the discussion on the size of reference corpora, in
which it was mentioned in passing where to get the BNC word list, I am
wondering about a problem that I came across recently.

If I want to compare the word list of my corpus with the BNC, I need a
list that contains just the words, without frequency information and
tags. Being only as computer literate as the average languages person, I
wonder whether such a plain list already exists, or whether there is
another way of removing the numbers and tags than running the list
through the search-and-replace function of my word processor. (I only
want to find out which words in my corpus are not represented in the
BNC.)
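
A minimal sketch of how the task described above could be scripted
instead of done in a word processor. It assumes the reference list has
one entry per line with whitespace-separated fields (here: frequency,
word, tag, file count) and that the corpus list has one word per line.
That layout, the sample data, and the function names are illustrative
assumptions, not the documented format of any particular BNC list; the
word_column argument should be adjusted to the file actually used.

```python
import io


def extract_words(lines, word_column=1):
    """Return the set of lower-cased words found in one column of a list.

    Assumes whitespace-separated fields, e.g. "100 the at0 50"
    (frequency, word, tag, file count). This layout is an assumption.
    """
    words = set()
    for line in lines:
        fields = line.split()
        # Skip blank or short lines that have no field in the word column.
        if len(fields) > word_column:
            words.add(fields[word_column].lower())
    return words


def words_missing_from_reference(corpus_words, reference_words):
    """Return the corpus words absent from the reference list, sorted."""
    return sorted(set(corpus_words) - set(reference_words))


# Tiny in-memory stand-ins for the real files (made-up data).
reference_list = io.StringIO("100 the at0 50\n40 corpus nn1 12\n")
my_corpus_list = io.StringIO("the\ncorpus\nkeyness\n")

reference = extract_words(reference_list, word_column=1)
mine = extract_words(my_corpus_list, word_column=0)
missing = words_missing_from_reference(mine, reference)
print(missing)
```

With real data, the two io.StringIO objects would be replaced by opened
files, for example open("bnc_list.txt", encoding="utf-8") (a hypothetical
file name). Taking the set difference answers the question directly, so
no separate "plain" copy of the reference list is needed.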

Many thanks in advance for any help/advice you might have.

Sylvana Krausse

