[Corpora-List] List of mass and count nouns?
Francis Bond
fcbond at gmail.com
Thu Dec 20 08:23:59 UTC 2007
G'day,
COMLEX and CELEX both have countability as a feature in their
lexicons, if you have access to them.
The largest freely available lexicon I know of is in the ERG, which
you can download from http://www.delph-in.net/erg/.
The lexicon is in the file lexicon.tdl, and there is an explanation of
the lexical types at: http://wiki.delph-in.net/moin/ErgLeTypes.
Basically lexical types that match n_-_c_*le are count and those that match
n_-_m_*le are mass, while n_-_mc* can be either.
There are some papers on determining mass/noun from corpora (as Adam
suggests) or ontologies (as Philip suggests), or by looking at
countability of translation equivalents
in similar languages at my or Tim Baldwin's web sites:
http://www2.nict.go.jp/x/x161/en/member/bond/bib.html
http://www.cs.mu.oz.au/~tim/publications.html
Hope this is useful,
--
Francis Bond <http://www2.nict.go.jp/x/x161/en/member/bond/>
NICT Computational Linguistics Group
_______________________________________________
Corpora mailing list
Corpora at uib.no
http://mailman.uib.no/listinfo/corpora
More information about the Corpora
mailing list