[Corpora-List] English compound words
TadPiotr
tadpiotr at plusnet.pl
Thu May 19 20:21:39 UTC 2005
There is something called Moby word lists
Moby Words
354,984 single words
Over 354,000 single words, excluding proper names, acronyms, or compound
words and phrases. This list does not exclude archaic words or significant
variant spellings.
256,772 compound words
Over 256,700 hyphenated or other entries containing more than one word as
well as all capitalized words and acronyms. Phrases were considered 'common'
if they or variations of them occur in standard dictionaries or thesauruses.
at
<http://www.dcs.shef.ac.uk/research/ilash/Moby/>
http://www.dcs.shef.ac.uk/research/ilash/Moby/
I am not quite sure about the quality.
Best wishes
Professor Tadeusz Piotrowski
English Department
Opole University
Opole
<outbind://7/www.tadeuszpiotrowski.neostrada.pl>
www.tadeuszpiotrowski.neostrada.pl
_____
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of John Newman
Sent: Thursday, May 19, 2005 9:33 PM
To: CORPORA at UIB.NO
Subject: [Corpora-List] English compound words
I am making this inquiry to the list on behalf of a colleague who asks:
_____________________________________________________
I'm looking for a reasonably comprehensive list of English compound words,
whether spelled as two words, hyphenated, or spelled as one word. Has anyone
compiled such a list, or does anyone know of corpora from which such a list
might be straightforwardly extracted?
Dr. Robert Kirchner, Linguistics Dept.
4-20 Assiniboia Hall, U. Alberta
Edmonton, AB T6G2E7
(780) 492-3480 (fax 492-0806)
kirchner at ualberta.ca, <http://www.ualberta.ca/~kirchner>
http://www.ualberta.ca/~kirchner
_______________________________________________________
John Newman,
Department of Linguistics
Faculty of Arts, University of Alberta
4-32 Assiniboia Hall
Edmonton, Alberta, T6G 2E7, CANADA
Tel. (780) 492-5500, Fax. (780) 492-0806
Homepage: <http://www.ualberta.ca/~johnnewm/>
http://www.ualberta.ca/~johnnewm/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listserv.linguistlist.org/pipermail/corpora/attachments/20050519/1cbd7d49/attachment.htm>
More information about the Corpora
mailing list