[Corpora-List] English compound words

TadPiotr tadpiotr at plusnet.pl
Thu May 19 20:21:39 UTC 2005


There is something called Moby word lists

Moby Words

354,984 single words

Over 354,000 single words, excluding proper names, acronyms, or compound
words and phrases. This list does not exclude archaic words or significant
variant spellings.

256,772 compound words

Over 256,700 hyphenated or other entries containing more than one word as
well as all capitalized words and acronyms. Phrases were considered 'common'
if they or variations of them occur in standard dictionaries or thesauruses.

at

 <http://www.dcs.shef.ac.uk/research/ilash/Moby/>
http://www.dcs.shef.ac.uk/research/ilash/Moby/

I am not quite sure about the quality.

Best wishes

Professor Tadeusz Piotrowski

English Department

Opole University

Opole

 <outbind://7/www.tadeuszpiotrowski.neostrada.pl>
www.tadeuszpiotrowski.neostrada.pl


  _____

From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of John Newman
Sent: Thursday, May 19, 2005 9:33 PM
To: CORPORA at UIB.NO
Subject: [Corpora-List] English compound words


I am making this inquiry to the list on behalf of a colleague who asks:

_____________________________________________________

I'm looking for a reasonably comprehensive list of English compound words,
whether spelled as two words, hyphenated, or spelled as one word. Has anyone
compiled such a list, or does anyone know of corpora from which such a list
might be straightforwardly extracted?

Dr. Robert Kirchner, Linguistics Dept.
4-20 Assiniboia Hall, U. Alberta
Edmonton, AB T6G2E7
(780) 492-3480 (fax 492-0806)
kirchner at ualberta.ca,  <http://www.ualberta.ca/~kirchner>
http://www.ualberta.ca/~kirchner
_______________________________________________________

John Newman,
Department of Linguistics
Faculty of Arts, University of Alberta
4-32 Assiniboia Hall
Edmonton, Alberta, T6G 2E7, CANADA
Tel. (780) 492-5500, Fax. (780) 492-0806
Homepage:  <http://www.ualberta.ca/~johnnewm/>
http://www.ualberta.ca/~johnnewm/




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20050519/1cbd7d49/attachment.htm>


More information about the Corpora mailing list