[Corpora-List] Full list of irregular English plurals

Mark Davies Mark_Davies at byu.edu
Tue Nov 25 13:56:41 UTC 2008


It's relatively easy to find irregular plurals in the Corpus of Contemporary American English [COCA] (385+ million words, US, 1990-2008), BNC (100m, UK, -1993), or TIME (100m, US, 1920s-2000s) (see http://corpus.byu.edu for links to these three corpora). The search string is:

-*s.[*nn2*]

"-*s" indicates that the word does not end in an [s]
"[*nn2*]" indicates that at least one of the possible tags for the word is "plural noun"
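
For anyone working from a locally tagged word list rather than the web interface, the same two conditions can be approximated offline. The following is only a rough sketch, not how the corpus interface itself works: the function name, the (word, tag) input format, and the underscore-joined ambiguity tag in the sample data are all assumptions made for illustration.

```python
from collections import Counter
from fnmatch import fnmatchcase


def irregular_plural_candidates(tagged_tokens, limit=100):
    """Count word forms that carry a plural-noun tag but do not end in 's'.

    tagged_tokens: iterable of (word, tag) pairs. The tag format is an
    assumption; any string containing "nn2" is treated as a plural-noun
    reading, which also covers ambiguity tags such as "nn1_nn2".
    """
    counts = Counter()
    for word, tag in tagged_tokens:
        w = word.lower()
        t = tag.lower()
        # "*nn2*": at least one possible tag is "plural noun"
        # not "*s": the word form does not end in s
        if fnmatchcase(t, "*nn2*") and not fnmatchcase(w, "*s"):
            counts[w] += 1
    # limit plays the role of the "number of results" setting
    return counts.most_common(limit)


# Hypothetical sample data, for illustration only
sample = [
    ("children", "nn2"),
    ("Children", "nn2"),
    ("dogs", "nn2"),
    ("men", "nn2"),
    ("sheep", "nn1_nn2"),
    ("run", "vv0"),
    ("criteria", "nn2"),
]
result = irregular_plural_candidates(sample)
```

As with the online query, the output is a list of candidates and still needs checking by hand, since any tagging errors in the input carry straight through.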

You might also want to set the number of results to something higher than the default of 100. Also, remember that the results are only as accurate as the CLAWS tagger, which was used to tag all three corpora.

Best,

Mark Davies

============================================
Mark Davies
Professor of (Corpus) Linguistics
Brigham Young University
(phone) 801-422-9168 / (fax) 801-422-0906
Web: davies-linguistics.byu.edu

** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================

> -----Original Message-----
> From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Yorick
> Wilks
> Sent: Monday, November 24, 2008 9:03 AM
> To: corpora at lists.uib.no
> Subject: Re: [Corpora-List] Full list of irregular English plurals
> 
> There are on the web samples of the major types of irregular plurals
> in English, but nothing that has any claim to completeness. Does
> anyone know of anything out there reasonably available and complete?
> Yorick Wilks
> Sheffield
> 
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
