Corpora: Collaborative effort

E S Atwell eric at scs.leeds.ac.uk
Tue Jun 13 10:36:08 UTC 2000


>I agreed if the sense tags have completely different meaning. However,
>the differences in meaning between tags may be in shades of meaning
>rather than the crisp decision that they are or not same.
Surely this is the underlying flaw in the concept of "semantic tagging".
My background is in PoS-tagging, where there are several rival tagsets and
tagging schemes, but nevertheless there is general consensus on PoS
categories (eg noun, verb) and subcats/features (eg singular n/v,
superlative adj) despite having different labels and grey areas in
boundaries.  In contrast, I don't believe there is a clear, "self-evident"
set of semantic tags. Semantic tagging could instead aim to annotate each
word with a SET of semantic features, and "disambiguation" could aim to
eliminate sematic features incompatible with context; this would allow for
overlap and indeterminate sense-tagging. The set of semantic features for
a word could be a bundle of semantic information, for example the
lemma/root, subject-category code, selection restrictions, and meaning
definition from LDOCE; instead of sense-tagging, if the aim was to
eliminate features which were incompatible with context, you should get
more inter-annotator agreement.

 --
Eric Atwell, Distributed Multimedia Systems MSc Tutor & SOCRATES Tutor
Centre for Computer Analysis of Language And Speech (CCALAS)
School of Computer Studies, University of Leeds, LEEDS LS2 9JT
TEL: (44)113-2335430  FAX: (44)113-2335468
WWW: http://www.scs.leeds.ac.uk/eric  EMAIL: eric at scs.leeds.ac.uk



More information about the Corpora mailing list