16.3481, Review: Semantics/Computational Ling: Mani et al (2005)
LINGUIST List
linguist at LINGUISTLIST.ORG
Wed Dec 7 04:21:38 UTC 2005
LINGUIST List: Vol-16-3481. Tue Dec 06 2005. ISSN: 1068 - 4875.
Subject: 16.3481, Review: Semantics/Computational Ling: Mani et al (2005)
Moderators: Anthony Aristar, Wayne State U <aristar at linguistlist.org>
Helen Aristar-Dry, Eastern Michigan U <hdry at linguistlist.org>
Reviews (reviews at linguistlist.org)
Sheila Dooley, U of Arizona
Terry Langendoen, U of Arizona
Homepage: http://linguistlist.org/
The LINGUIST List is funded by Eastern Michigan University, Wayne
State University, and donations from subscribers and publishers.
Editor for this issue: Lindsay Butler <lindsay at linguistlist.org>
================================================================
What follows is a review or discussion note contributed to our
Book Discussion Forum. We expect discussions to be informal and
interactive; and the author of the book discussed is cordially
invited to join in. If you are interested in leading a book
discussion, look for books announced on LINGUIST as "available
for review." Then contact Sheila Dooley at dooley at linguistlist.org.
===========================Directory==============================
1)
Date: 05-Dec-2005
From: Anil Singh < anil_phdcl at yahoo.co.in >
Subject: The Language of Time: A Reader
-------------------------Message 1 ----------------------------------
Date: Tue, 06 Dec 2005 23:16:47
From: Anil Singh < anil_phdcl at yahoo.co.in >
Subject: The Language of Time: A Reader
EDITORS: Mani, Inderjeet; Pustejovsky, James; Gaizauskas, Robert
TITLE: The Language of Time
SUBTITLE: A Reader
PUBLISHER: Oxford University Press
YEAR: 2005
Announced at http://linguistlist.org/issues/16/16-2411.html
Anil Kumar Singh, Language Technology Research Centre (LTRC),
IIIT, Gachibowli, Hyderabad-500019 (A.P.), India.
OVERVIEW
Natural language (NL) can convey all kinds of information about the
world, abstract or concrete, real or hypothetical. Temporal information
is one of the most important aspects of whatever we want to express
using NL. This book presents a collection of some of the well known
papers on temporal information in NL. The focus seems to be on the
computational point of view. However, papers on linguistic and
philosophical issues have been included to provide enough
background. Since scholars and researchers from diverse fields
(philosophy, linguistics, psychology, natural language processing,
artificial intelligence) have contributed to the study of time in NL, the
editors have made an attempt to represent all of these. The book can
be useful for anyone interested in time and/or language.
The book is divided into four parts and there is a very informative
introduction to each of these parts. These introductions make the
papers accessible to even those without the required background as
they summarize and connect the papers. Some readers who are
looking just for a quick overview of the study of time in NL might want
to read only the introductions.
In all, the book contains 29 important papers. These papers cover
theories as well as computational approaches for studying (or
computing) time in NL. People working in the relevant fields are
already familiar with many of these.
The first part has 8 papers on tense, aspect and event structure. The
second part is about temporal reasoning and it consists of 6 papers.
The third part has 8 papers on temporal structure of discourse. The
fourth part puts together 7 papers on temporal annotation.
SYNOPSIS
Chapter 1
The first part starts with Vendler's well known paper ('Verbs and
Times') on verb classification with respect to time (or tense). Vendler
claims (a claim more or less accepted by many authors of the following
papers) that almost all the verbs can be classified into a few classes,
at least in their most dominant sense. The classes that he comes up
with after a very interesting and enjoyable discussion are those of
states, activities (or processes), accomplishments and achievements.
This four way classification is the starting point of a lot of work done
on time in language. Perhaps this is why Vendler's is the first paper in
the book, although it is not the earliest among those included in the
book.
Chapter 2
The second paper is on 'the syntax of event structure' by Pustejovsky.
He argues that grammatical phenomena make reference to the
internal structure of events, and that a subeventual analysis for
predicates is able to systematically capture these effects. This paper
represents a major stream in semantics. The purpose is to show that
verbs decompose into distinct event types with internal structure. The
primary components of an event structure are the event type (of the
lexical item), the rules of event decomposition and the rules to map to
lexical structure. Pustejovsky considers three basic event types:
states, processes and transitions. He also relates event structure with
lexical conceptual structure (LCS). Problems like adverbial
modification are discussed in terms of event structure. On the whole it
is a long but very readable paper which makes a convincing case for
looking at verbs as events with an internal structure. As the editors
point out in the introduction, this approach can avoid proliferation of
primitives, which is a major problem with schemes like that of Dowty's
(Dowty, 1979). A limitation is that event structure is not directly related
with temporal structure.
Chapter 3
Pustejovsky's paper is followed by Emmon Bach's 'Algebra of Events'.
The starting point for this paper is the close parallels between the
mass count distinction in nominal systems and the aspectual
classification of verbal expressions. Bach expresses one of the
parallels as a proportion: events:processes::things:stuff, drawing on
Link's paper on count-mass-plural domain (Link, 1983). His algebra
deals with the distinctions based mainly on Carlson's classification
(Carlson, 1981), which subdivides achievements in two categories
(happenings and culminations, e.g. recognize and die). It also, in a
way, takes into account the structure of events and processes. Link's
scheme treats sets of individuals ('John and Mary') as super-
individuals formed by a join operation. The super-individual is made of
the stuff that the individuals are made of. Bach's contention is that telic
events are also formed of process 'stuff'.
Chapter 4
The fourth paper is Reichenbach's classic work called 'The Tenses of
Verbs' (Reichenbach, 1947). The primary idea is that the times of
events can be located with respect to a deictic centre, which makes
them similar to pronouns (the anaphoric view of tenses). In
Reichenbach's scheme, there can be references to three time points:
the speech time (S), the event time (E) and the reference time (R).
These three time points can be related by 'precedes' or 'simultaneous'
relations. Depending on the relations between S, E and R, we can
define all the possible tenses. For example, E=R< S would be simple
past, whereas S=R=E would be simple present. Only seven out of the
thirteen possible relations are realized in English. This is perhaps the
most influential paper in the book. This approach to analyze tenses
has come to be known as the Reichenbachian approach. Quite apart
from its influence however, the paper is remarkable for the way it is
written. It is not divided into sections, there is no conclusion, nor are
there any references. One wonders how the reviewers would react if a
paper like this was submitted to a natural language processing (NLP)
conference in this age of LaTex stylesheets. Anyway, I found this
paper to be as enjoyable as it is influential.
Chapter 5
The next paper is 'Tense Logic and the Logic of Earlier and Later'
(Prior, 1968). Prior presents a tense logic which is an extension of E.
J. Lemmon's minimal tense logic Kt. He also relates it to the minimal
calculus of earlier-later relation. He adds two operators G ('is always
going to be') and H ('has always been') to the two operators provided
by Kt: P (past) and F (future). The earlier-later calculus uses
operators T and U such that 'Tap' means 'It is the case at instant a
that p' and 'Uab' means that 'The instant a is earlier than instant b'.
The tense logic is built upon in stages in the paper so as to get four
logical systems of increasing power. This paper is also an important
one, but it is not as easy to read as the some of the other papers in
the book, partly due to the 'somewhat opaque prefix notation'.
Chapter 6
The paper by Moens and Steedman counters the idea that linguistic
categories are related to a linear model of time. Their ontology is
based on a mental representation of events structured on more than
purely temporal principles. Their aspectual categories are a modified
form of the Vendlerian classes. States are distinguished from events,
which in turn can be classified in four categories depending on
whether there are consequences and whether the events are atomic
or extended. Aspectual type of a proposition can change due to
modifiers. Such 'aspectual coercion' can be explained in terms of a
transition network, which represents an aspect calculus. Another
central notion in their ontology is that of 'nucleus' ('a structure
comprising a culmination, an associated preparatory process, and a
consequent state'). This nucleus plays a role in many of the
permissible transitions in the network. However, as the editors point
out, the paper doesn't offer rules for compositional semantics.
Chapter 7
Dorr and Olsen use LCS representation of Levin's classes. The
aspectual classed are defined in terms of three features (telicity,
dynamicity and durativity). In their system, features can only get
added, not deleted. These aspectual features can be used to help in
machine translation and generation.
Chapter 8
The last paper in the first part is by Passonneau, describing the
PUNDIT information extraction system. She focuses only on actual
(realis) events. A three step method is used to find the actual time
associated with an event (if there is any), determine the temporal
structure of the situation, and locating the situation with respect to the
time of text production or to the times of other situations. The features
used for events are situation type (state, process, or transition event),
kinesis (active or stative) and boundedness (bounded, unbounded, or
unspecified).
Chapter 9
The second part begins with a long but enjoyable paper (except
perhaps the example about a Republican President) by McDermott.
He presents a 'temporal logic for processes and plans'. One important
idea in this logic is that of temporal chronicles ('a complete possible
history of the universe'). The chronicles are arranged in the form of a
tree in which branching occurs only towards the future. This is meant
to model future possibility. McDermott defines events as sets of
intervals over which a proposition is minimally (at least once) true.
Chapter 10
'A logic based calculus of events' is presented by Kowalsky and
Sergot. They concentrate on applying event calculus for database
updates as well as simple narratives. Instead of rejecting an update
conflicting with the existing information, the update is accepted and
the conflicting information is withdrawn. Some predicates introduced
are 'Holds', 'HoldsAt', 'initiates', 'terminates' and 'happens'.
Chapter 11
Indeterminant temporal anchoring and granularity is taken into
account by Chittaro and Combi in framework called 'Temporal
Granularity and Indeterminacy Event Calculus (TGIC)'. Event calculus
determines the maximal validity intervals (MVIs) over which
properties hold. Their definition of MVI accommodates a more general
concept of event.
Chapter 12
Here is another very influential paper, this time by James F. Allen. It
presents a 'general theory of action and time'. This theory seeks to
take care of actions that involve non-activity, actions not
decomposable into subactions and actions which occur simultaneously
and interact with others. Allen uses thirteen basic interval relations
like 'before', 'after', 'meet', 'during', etc. There are three classes
(property, event and process) and one metalanguage predicate
(HOLDS, OCCUR, OCCURRING) associated with each of these
classes.
Chapter 13
Galton's paper is a critical revision of Allen's work. He 'reinstates'
instants to give a combined instant-interval scheme in which instants
either fall within or limit intervals. One of the reasons for this revision
of Allen's theory is to accommodate continuous change. The HOLD
and OCCURS predicated are each split into three (HOLDS-ON,
HOLDS-IN, HOLDS-AT).
Chapter 14
The paper by Hobbs and Pustejovsky on TimeML (originally DAML-
Time), a markup language for annotating temporal information in a
discourse. They also relate it to OWL-Time ontology. It takes into
account the formal theories of time that have been suggested during
the last many decades. This paper can be a good introduction to
TimeML, which might become a standard.
Chapter 15
Though Dowty's 1979 paper is not included, there is one by him
on 'the effect of aspectual class on the temporal structure of
discourse'. His analysis is based on a compositional theory of aspect,
i.e., aspectual classes of lexical items combine to give the aspectual
classes of sentences. He also uses a pragmatic principle called
Temporal Discourse Interpretation Principle (TDIP) which basically
says that the reference time advances forward with each utterance,
unless there is a temporal adverbial. He discusses some other
pragmatic principles like the perspective of the narrator, expectations
about discourse conventions (Grice's maxims) and background
knowledge.
Chapter 16
Lascarides and Asher take further the work on including pragmatics in
discourse (with respect to temporal relations). There work is more
formal, based on the notion of 'defeasible reasoning' (based on
default knowledge that can be overridden).
Chapter 17
In this chapter, Bell extends the Labovian analysis of temporal
discourse structure. They focus on news stories and claim that the
narrative in these stories can be segmented into 'abstract', 'attribution'
and 'story', each of which can have further segmentation. The stories
also contain many 'episodes', which in turn contain events.
Chapter 18
Here Webber extends the Reichenbachian notion of tense as
discourse anaphor. She also related this to the work on discourse by
Grosz and Sidner. The main elements of her approach are temporal
focus, a tripartite ontology of events (preparation, culmination and
consequence), 'specification' of entities by anaphors, and a 'focus
stack' mechanism for resumption of dialog segments.
Chapter 19
Song and Cohen present an algorithm for using tense interpretation in
analyzing simple narratives. They also use a modified Reichenbachian
scheme for representing tenses which is more precise and
unambiguous (uses one SRE triple instead of three structures for
future perfect). Their algorithm is based on temporal focus, a tense
hierarchy (for English) and constraints on coherent tense sequences.
Chapter 20
The automatic temporal reference resolution system described by
Wiebe et al. works on the ambiguous output of a semantic parser. The
domain they have considered is scheduling dialogs (''How about
two?'', ''Twelve to two.''). In their model, a 'temporal unit' is associated
with the current and the preceding utterance. A temporal unit has
fields which are partially ordered with respect to specificity. Some
resolution rules are also used.
Chapter 21
Hwang and Schubert use a novel 'fine structure' of discourse,
namely 'tense trees'. Tense trees differ from simple
Reichenbachian 'lists'. Given that the sentences are not 'flat' (as
seems to have assumed by some, as far as studies of temporal
structure in discourse are concerned), tense trees can be used to
more effectively (compositionally) analyze tense and aspect.
Chapter 22
Hitzeman et al. use an HPSG implementation of discourse grammar
for determining the temporal structure of the discourse. They try to
take into account the mutually constraining effect of tense, aspect,
temporal adverbials and rhetorical relations to reduce ambiguity.
Another suggestion is to use underspecified representation of
temporal or rhetorical structure, again for reducing ambiguity.
Note: The page numbers in the references given on the first page of
the book for chapters 23-26 (from the ACL 2001 workshop) seem to
be wrong. See the references below for the correct information.
Chapter 23
Wilson et al. present an annotation framework for automatically
marking up temporal information. The annotation scheme is TIMEX2,
which has been incorporated into TimeML. Their system tries to
achieve cross-lingual reusability. Some updates since the publication
of the paper are also mentioned.
Chapter 24
Another technique for temporal annotation is described by Katz and
Arosio. In their scheme ('a radically simplified semantic formalism'),
each verb is associated with a temporal interval and there are
relations among these intervals. The relations are encoded with
directed 'secondary edges' representing 'precedence' and 'inclusion'
relations and their duals. They also discuss the relations between
annotations, i.e., whether they are equivalent, consistent, or
inconsistent, or whether one subsumes the other.
Chapter 25
Another temporal extraction system has been described by Filatova
and Hovy. They break up news stories into clauses representing
events and then assign time stamps to these events. Tense
information in the clauses is used to help in timestamping.
Chapter 26
This paper (Schilder and Habel) describes a semantic tagging system
for extracting temporal information from news messages. The tagger
looks for dates, prepositional phrases and situational verbs. It marks
chunks of texts containing temporal information and also tries to
extract this information. Temporal relations (based on Allen's theory)
are identified partly by prepositions.
Chapter 27
This chapter presents in detail the specification language TimeML.
While the other paper (Chapter 14) is more theory oriented, this one is
more about the definition of TimeML and guidelines for annotation
using this language. It is compulsory reading for anyone involved with
actual annotation of temporal information, or even with annotation in
general.
Chapter 28
Li et al. present a framework for mapping linguistic patterns to
temporal relations. It uses 'temporal concept frames' (activity related
or time related), temporal relations (relative or absolute), temporal
indicators (time related words), rules for temporal references and
rules for rules for resolving conflicts. They have tried their system for
Chinese with good results.
Chapter 29
The last paper (Setzer et al.) is about comparing and evaluating
temporal annotations using the idea of 'temporal closure', which can
help in deciding whether two annotations are equivalent or not. The
paper also suggests ways to make manual annotation easier by
computing closure of an annotation. A precise way of evaluating
annotations is presented.
EVALUATION
This book brings together a variety of approaches, theoretical as well
practical, for dealing with time in NL. The papers are among the most
relevant. They have been arranged in an order which makes sense.
The introductions are excellent too. Perhaps the division into four
parts and an introduction for each of them is best way to bring
together a lot of diversity. However, one wonders whether a small
introduction to each paper (instead of a long one to each part),
together with a chapter in the beginning looking at the big picture
wouldn't have been better. As it is, the readers might miss the
connections between the successive chapters in some cases. And
one can also complain about some missing papers (like Dowty, 1979),
but that is unavoidable in any collection of papers. On the whole, it
was rewarding to read this book. It might become a compulsory
reading for people working in the relevant disciplines, and there are
quite a few of them (people as well disciplines).
REFERENCE
Dowty D. R. (1979). Word Meaning and Montague Grammar: the
Semantics of Verbs and Times in Generative Semantics and
Montague's PTQ. Reidel.
REFERENCES TO THE ORIGINAL PUBLICATION OF THE
CONTENTS OF THIS BOOK
1. Vendler Z. (1967). Verbs and Times. Ch. 4 of 'Linguistics in
Philosophy', 97-121. Cornell University Press.
2. Pustejovsky James (1991). The Syntax of Event Structure.
Cognition, 41. Elsevier.
3. Bach Emmon (1986). The Algebra of Events. 'Linguistics and
Philosophy', 9, 5-16. Swets and Zeitlinger Publishers.
4. Reichenbach Hans (1947). The Tenses of Verbs. Section 51
of 'Elements of Symbolic Logic', 287-298. The Macmillan Company,
New York.
5. Prior A. N. (1968). Tense Logic and the Logic of Earlier and Later.
Chapter 11 of 'Papers on Time and Tense', 116-134. Oxford
University Press, Oxford.
6. Moens Marc and Steedman Mark (1988). Temporal Ontology and
Temporal Reference. Computational Linguistics 14(2), 15-28.
Association for Computational Linguistics.
7. Dorr Bonnie J. and Olsen Mari Broman (1997). Deriving Verbal and
Compositional Lexical Aspect for NLP Applications. Proceedings of the
35th Annual Meeting of the Association for Computational Linguistics,
151-158. Association for Computational Linguistics.
8. Passonneau Rebecca J. (1988). A Computational Model of the
Semantics of Tense and Aspect. Computational Linguistics, 14(2), 15-
28. Association for Computational Linguistics.
9. McDermott Drew (1982). A Temporal Logic for Reasoning About
Processes and Plans. Cognitive Science 6, 101-155. Cognitive
Science Society.
10. Kowalski Robert and Sergot Marek (1986). A Logic-Based
Calculus of Events. New Generation Computing, 4, 67-94. Ohmsha
Ltd.
11. Chittaro Luca and Combi Carlo (2000). Extending the Event
Calculus with Temporal Granularity and Indeterminacy. In Bettini C.
and Montanari A. (eds.), 'Spatial and Temporal Granularity: Papers
from the AAAI Workshop'. Technical Report WS-00-08. The AAAI
Press.
12. Allen James F. (1984). Towards a General Theory of Action and
Time. Artificial Intelligence, 23, 123-154. Elsevier.
13. Galton Antony (1990). A Critical Examination of Allen's Theory of
Action and Time. Artificial Intelligence, 42, 159-188. Elsevier.
14. Hobbs Jerry R. and Pustejovsky James (2003). Annotating and
Reasoning About Time and Events. Proceedings of the AAAI Spring
Symposium on Logical Formalization of Commonsense Reasoning.
Stanford University, CA.
15. Dowty David R. (1986). The Effects of Aspectual Class on the
Temporal Structure of Discourse: Semantics or Pragmatics?
Linguistics and Philosophy, 9, 37-61. Swets and Zeitlinger.
16. Lascarides Alex and Asher Nicholas, (1993). Temporal Relations,
Discourse Structure, and Commonsense Entailment. Linguistics and
Philosophy, 16, 437-493. Kluwer Academic Publishers.
17. Bell Allan (1999). News Stories as Narratives. In Jaworski A. and
Coupland N. (eds.), 'The Discourse Reader', 236-251. Routledge.
18. Webber Bonnie Lynn (1988). Tense as Discourse Anaphor.
Computational Linguistics, 14(2), 61-73. Association for Computational
Linguistics.
19. Song Fei and Cohen Cohen, (1991). Tense Interpretation in the
Context of Narrative. Proceedings of the Ninth National Conference
on Artificial Intelligence (AAAI-91), 131-136. American Association for
Artificial Intelligence.
20. Wiebe Janyce, O'Hara Tom, McKeever Kenneth, Ohrstrom-
Sandgren Thorsten (1997). An Empirical Approach to Temporal
Reference Resolution. Proceedings of the Second Conference on
Empirical Methods in Natural Language Processing (EMNLP-97), 174-
186. Association for Computational Linguistics and SIGDAT.
21. Hwang Chung Hee and Schubert Lenhart K. (1992). Tense Trees
as the Fine Structure of Discourse. Proceedings of the 30th Annual
Meeting of the Association for Computational Linguistics, 232-240.
Association for Computational Linguistics.
22. Hitzeman Janet and, Moens Mark and Grover Claire (1995).
Algorithms for Analyzing the Temporal Structure of Discourse.
Proceedings of the Annual Meeting of the European Chapter of the
Association for Computational Linguistics, 253-260. Association for
Computational Linguistics.
23. Wilson George, Mani Inderjeet, Sundheim Beth and Ferro Lisa
(2001). A Multilingual Approach to Annotating and Extracting
Temporal Information. Proceedings of the ACL 2001 Workshop
Temporal and Spatial Information Processing, 81-87. Association for
Computational Linguistics.
24. Katz Graham and Arosio Fabrizio (2001). The Annotation of
Temporal Information in Natural Language Sentences. Proceedings of
the ACL 2001 Workshop Temporal and Spatial Information
Processing, 104-111. Association for Computational Linguistics.
25. Filatove Elena and Hovy Eduard, (2001). Assigning Time-Stamps
to Event-Clauses. Proceedings of the ACL 2001 Workshop Temporal
and Spatial Information Processing, 88-95. Association for
Computational Linguistics.
26. Schilder Frank and Habel Christopher (2001). From Temporal
Expressions to Temporal Information: Semantic Tagging of News
Messages. Proceedings of the ACL 2001 Workshop Temporal and
Spatial Information Processing, 65-72. Association for Computational
Linguistics.
27. Pustejovsky James, Ingria Robert, Sauri Roser, Castano Jose,
Littman Jessica, Gaizauskas Rob, Setzer Andrea, Katz Graham, Mani
Inderjeet (2004). The Specification Language TimeML. This volume,
545-557. Oxford University Press.
28. Li Wenjie, Wong Kam-Fai and Yuan Chunfa (2001). A Model for
Processing Temporal References in Chinese. Proceedings of the ACL
2001 Workshop Temporal and Spatial Information Processing, 33-40.
Association for Computational Linguistics.
29. Setzer Andrea, Gaizauskas Robert and Hepple Mark (2003).
Using Semantic Inference for Temporal Annotation Comparison.
Proceedings of the Fourth International Workshop on Inference in
Computational Semantics (ICOS-4), 185-196.
ABOUT THE REVIEWER
Anil Kumar Singh is working towards PhD in Computational Linguistics
at the Language Technology Research Centre (LTRC), IIIT,
Hyderabad, India. His research interests are in dealing with temporal
information in NL, statistical NLP, corpus linguistics and, last but not
the least, NL engineering. But there is much more to life than research
interests.
-----------------------------------------------------------
LINGUIST List: Vol-16-3481
More information about the LINGUIST
mailing list