Corpora: Annotation question

Nancy M. Ide ide at cs.vassar.edu
Tue Mar 14 18:00:46 UTC 2000


Hello,

I would like to know if people (or systems?) are actively using TEI
elements such as <offset> and <distance> in markup of text for
representing the analysis of relative temporal and spatial
expressions, and attributes such as "reg" (for capturing
normalization) and "exact" (for indicating fuzziness).  If they are
being used, then I'd be interested in what they're being used for,
whether the uses require extreme degrees of agreement among annotators
about the guidelines for annotation, what the 'lessons learned' are,
etc.

Also, I would like information on what are *people* and/or *systems*
being expected to annotate, and what do the annotations capture, in
particular for "distinguished expressions" (names, patterns).

Thanks in advance,
Nancy

=======================================================

Nancy Ide

Professor and Chair
Department of Computer Science, Vassar College
Poughkeepsie, NY 12604-0520 USA
Tel: +1 914 437-5988 Fax: +1 914 437-7498
ide at cs.vassar.edu

Chercheur Invite
Equipe Langue et Dialogue, LORIA/CNRS
Campus Scientifique - BP 239
54506 Vandoeuvre-les-Nancy FRANCE
Tel: +33 (0)3 83 59 20 47 Fax: +33 (0)3 83 41 30 79
ide at loria.fr

=======================================================



More information about the Corpora mailing list