[Corpora-List] List of mass and count nouns?

Francis Bond fcbond at gmail.com
Thu Dec 20 08:23:59 UTC 2007


G'day,

COMLEX and CELEX both have countability as a feature in their
lexicons, if you have access to them.

The largest freely available lexicon I know of is in the ERG,  which
you can download from http://www.delph-in.net/erg/.
The lexicon is in the file lexicon.tdl, and there is an explanation of
the lexical types at: http://wiki.delph-in.net/moin/ErgLeTypes.
Basically lexical types that match n_-_c_*le are count and those that match
n_-_m_*le are mass, while n_-_mc* can be either.

There are some papers on determining mass/noun from corpora (as Adam
suggests) or ontologies (as Philip suggests), or by looking at
countability of translation equivalents
in similar languages  at my or Tim Baldwin's web sites:
http://www2.nict.go.jp/x/x161/en/member/bond/bib.html
http://www.cs.mu.oz.au/~tim/publications.html

Hope this is useful,

-- 
Francis Bond <http://www2.nict.go.jp/x/x161/en/member/bond/>
NICT Computational Linguistics Group

_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list