1
Neat Versus Scruffy: A Review of Computational Models for Spatial Expressions
Amitabha Mukerjee
Indian Institute of Technology Kanpur
This chapter reviews the computational modalities for representing spatial expressions. The focus is on computational or synthetic models, as opposed to cognitive or linguistic aspects. A number of formalisms based on qualitative paradigms such as topology seek to discretize space into regions based on alignment or tangency. This approach may be termed as âneat.â Yet a large class of spatial expressions (e.g., ânear,â âmove away fromâ) do not involve alignments, and even some that do (e.g., âin front ofâ) have gradations. These gradations in a continuum are clearly less âneatâ; we may view these as âscruffyâ; there appears to be a growing trend toward the scruffy in recent times. The distinctions between the different models reviewed are demonstrated using the canonical example of âthe chair in front of the desk.â
1. INTRODUCTION
The philosopher Wittgenstein, when young (Tractatus, 1921), claimed to have provided a âfinal solutionâ to the problem of logic and language, based on a set of primitives that describe the boundaries of what can be expressed. In essence, this approach claims that thought is grounded on a set of distinct symbols, a âneatâ discretization. In later years (Philosophical Investigations, 1953) he himself rejected this view, insisting that language can properly be considered only with respect to its use in specific behavioral contexts (âscruffâ). Nuallain and Smith (1995) compared this famous transformation from neat to scruff to the change of paradigm within the AI (artificial intelligence) community from a primitive-based model of semantics to a more context sensitive one. The contradiction between the neat and the scruffy holds probably over the entire domain of artificial intelligence, and certainly to the topic of this surveyâthe computational modeling of spatial expressions.
Representing space has a rich history in the physical sciences, where many representations were developed for encoding orientations, points, vectors, rigid body motions, and so forth. These representations are, as a class, continuous and serve to locate objects in a quantitative frameworkâfor example, in a multidimensional space of coordinates. Any assignment of values to the coordinates results in a unique and precise location for the entity.
Spatial expressions, on the other hand, operate on a loose partitioning of the domain, a discretization into regions such as âin front/backâ or ânear/far,â where the location, and even the partitioning itself, is specified with a large degree of ambiguity. The task of finding suitable representations for this less precise description has turned out to be considerably more difficult than was initially realized. The emphasis has been on creating discretizations that matched the linguistic partitions. Unfortunately, each discretization that was created, for example, the many versions of the âONâ primitive in the blocks world, attempted to enforce the loose partitioning of a spatial expression onto the straitjacket of strict, numerical distinctions. Representations proliferated, more or less on an ad hoc basis. With the emergence of qualitative reasoning, the partitioning was done more systematically, mapping the complete set of distinctions deriving from a given set of landmarks, and treating all zones between the landmarks as indistinguishable.
1.1. The Evolution of the âNeatâ
Very early AI created discretizations on an ad hoc basis, for example, by defining cells that were sensitive to the choice of origin and frame size. Discretizations mapped the continuous world into a set of discrete symbols, and the partitions were defined by the programmer based on pragmatic considerations of what appeared to suit the theory, as opposed to any systematic basis; such models may be called gerrymandered discretizations and remind one of the classification of the heavens in the work of Ptolemy, say. An example of this is the proliferation of interpretations that ON(A, B) had in the early Blocks World models (see Fig. 1.1).
More systematic discretizations were achieved by defining the complete set of partitioning based on landmarks. For example, in one-dimensional space Allen (1983) used a set of two landmarks (endpoints) in the
reference object (RO); the two endpoints of the located object (LO) were described completely in terms of these as a set of 13 possible orderings or arrangements (see Fig. 1.10, discussed in more detail later). Such thinking extends to spatial situations with a one-dimensional envisionment, as in the road scene of Fig. 1.2. Similarly, in higher dimensions, all possible visibility orderings based on a set of points can be modeled as an arrangement of visible pointsâthis results in a partitioning of the plane into a set of qualitative zones: lines (two or more points aligned on same line), points (two or more lines intersecting), and regions (no two points aligned) (Kuipers, 1977; Schlieder, 1994). For example, in the four-landmark case (Fig. 1.2b), there are 23 lines, 7 points, and 18 regions based on visibility orderings on the landmarks ABCD. The partitioning in these models was based on the alignment between some fundamental entities,
such as two points in the interval model, or two orientations in the visibility-ordering model.
Qualitative reasoning models that create discretizations based on alignment now have a varied set of representational structures, many of them involving considerations of topology, focusing on connectivity rather than position (Cohn, Randell, Cui, & Bennett, 1993; Egenhofer & Franzosa, 1991). Expressions involving tangency, or transition from contact to no-contact situations, can be expressed in terms of such predicates. Other models define local frames on the objects based on intrinsic, extrinsic, or deictic (or other) considerations, which can be used to combine position and orientation (Forbus, Nielsen, & Faltings, 1991; HernĂĄndez, 1994; Mukerjee & Joe, 1990).
Within the class of alignment-based models, the systematic treatment of all possible relations resulted in much better handling of spatial expressions, especially in the zones near contact. For example, the ON relation in the orthogonal blocks world can be handled elegantly by considering a multidimensional projection model, instead of subjectively chosen predicates like ânot-touches.â
To simplify things tremendously, qualitative models may be thought of as multidimensional orderings of point-sets; these are in general very expressive in the region near contact or alignmentâhere slight changes in spatial position or orientation result in changes of order, and generate topologically relevant information. However, qualitative models are not very meaningful for noncontact or nonalignment positions, such as in the spatial prepositions like ânearâ or âbetweenâ or in expressions such as âmoved farther away fromâ or âveer sharply to the left.â These relations involve no changes in the alignments or orderings of the objects, and are changes of measure rather than of quality; consequently they cannot be modeled with purely qualitative formalisms. Experience reveals that many projective spatial prepositions such as âin front ofâ also have a gradation; for example, in Fig. 1.3, the chair A is more âin front ofâ the desk than the chair E, say. In a discretized model for the desk, as in the proj...