<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Semantic Integrity Constraints for Spatial Databases</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Loreto</forename><surname>Bravo</surname></persName>
							<email>lbravo@udec.cl</email>
							<affiliation key="aff0">
								<orgName type="institution">Universidad de Concepción</orgName>
								<address>
									<country key="CL">Chile</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">M</forename><surname>Andrea Rodríguez</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Universidad de Concepción</orgName>
								<address>
									<country key="CL">Chile</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Semantic Integrity Constraints for Spatial Databases</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F22A92934AC580B69743AEC4A8D66D1D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:58+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper introduces a formalization of a set of spatial semantic integrity constraints on an extended-relational database model. The formalization extends traditional notions of functional and inclusion dependencies by adding interaction with spatial attributes. This enables to specify implicit and explicit topological conditions between geometries and impose constraints on thematic attributes that depend on the geometries. We study the consistency problem for this set of integrity constraints, which rises issues about topological consistency and realizability of spatial constraints. We show that the consistency problem is not tractable and provide some conditions under which it is.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Spatial databases systems enhance traditional databases with models and query languages that support spatially referenced data and provide spatial indexing and efficient algorithms for spatial query processing <ref type="bibr" target="#b7">[8]</ref>. The use of spatially referenced data is an essential component of such diverse applications as Geographic Information Systems (GIS), Computer-Aided Design (CAD), multimedia information systems, data warehousing, mobile computing, location-based services, and NASA's Earth Observing System (EOS).</p><p>Fundamental to spatial databases are the spatial data models or geometric models that represent objects in a space. Examples of such models are the spaghetti <ref type="bibr" target="#b17">[18]</ref>, topological <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b17">18]</ref>, and polynomial models <ref type="bibr" target="#b13">[14]</ref>. Most of the current spatial database management systems (SDBMSs) implement extensions to relational databases, so called object-relational or extended-relational databases, by defining spatial data types that represent the geometry of objects in the Euclidean space (e.g. Postgres+Posgis, Oracle Spatial SQL, and MySql Spatial).</p><p>In the context of database consistency, spatial databases offer new challenges due, particularly, to the complex nature of spatial attributes, the derivation of implicit relations from spatial attributes, and the combination of spatial and non-spatial attributes (called thematic attributes). SDBMSs enforce integrity constraints used for preventing structural inconsistency of spatial attributes (i.e. domain constraints in the relational context). For example, they enforce that regions must be represented by closed and simple polylines <ref type="bibr" target="#b1">[2]</ref>. The other important type of spatial integrity constraints, which are not implemented in SDBMs, are the spatial semantic integrity constraints (also known as topo-semantic integrity constraints) that relate geometries with semantic conditions. For example, a semantic constraint can enforce that a city's administrative region must be contained within its corresponding city limits <ref type="bibr" target="#b16">[17]</ref>. These constraints are less formalized and are highly dependent on the application.</p><p>In this work we define a constraint language to specify spatial semantic constraints. This language extends functional and inclusion dependency constraints to combine spatial and thematic attributes.</p><p>Example 1. Consider a spatial database that stores information of land parcels and buildings using the schema R = {Landparcel (id ,owner ,area,g), Building(id , use,g)}. In predicate Landparcel , the thematic attributes are id , owner , and area (in m 2 ), and the only spatial attribute is g that stores the geometry of each land parcel. Similarly for Building, which has id and use as thematic attributes and g as spatial attribute. Figure <ref type="figure">1</ref> shows an instance of this schema. In it, dark rectangles represent buildings and white rectangles represent land parcels. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 1. A spatial database instance</head><p>In this database, we need traditional constraints to enforce that attribute id is unique in both predicates. But, there are other relations that cannot be captured by traditional functional dependencies or inclusion dependencies. For example, we need spatial constraints to ensure that land parcels do not overlap, that each building is inside a land parcel, and that the value in attribute area corresponds to the area of the spatial attribute g in predicate landparcel . 2</p><p>Some studies have addressed the specification of spatial integrity constraints <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b11">12]</ref>; however, they treat spatial data in isolation from other integrity constraints, this is, without considering the interaction with the thematic attributes.</p><p>Other studies have analyzed topological <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b11">12]</ref> and realizability <ref type="bibr" target="#b6">[7]</ref> problems.</p><p>The topological problem analyzes the consistency of topological relations as expressed by inference tables. The realizability problem, on the other hand, analyzes whether or not there is a set of geometries that realizes a set of topological relations. These problems differ from the database consistency problem, or simply consistency problem, in which the goal is to determine if there is a non-empty database instance that satisfies a set of integrity constraints. In the consistency problem the number of geometries in a database instance that must satisfy a set of constraints is unknown. For topological and realizability problems, instead, there exists a pre-defined number of geometries that must satisfy a set of constraints expressed by the possible topological relations between them. To the best of our knowledge, the consistency problem of spatial integrity constraints has not been studied so far. This is why in this paper we formalize a set of spatial semantic integrity constraints that: (i) generalizes traditional functional and inclusion dependencies, (ii) enables to specify topological relations between spatial objects, (iii) enables to relate both spatial and thematic attributes, and (iv) are expressive enough to capture constraints found in practice. Furthermore, we reason about these constraints and determine whether or not the set of integrity constraints is consistent, i.e., if there is a non-empty database instance that satisfies the set of constraints. We show that this problem is not tractable and provide some conditions under which it is.</p><p>The organization of this paper is as follows. In Section 2 we present the data model used to represent spatial databases. Then, in Section 3 we present the set of spatial semantic constraints for which we will study the consistency problem in Section 4. Finally, in Section 5 we briefly discuss related work and give conclusions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">A Data Model for Spatial Databases</head><p>Current models of spatial data are typically seen as extensions of the relational data model (object-relational models), with the definition of abstract data types to specify spatial or geometric attributes. We now introduce a general spatiarelational data model that includes spatia-relational predicates (that can be purely relational) and also spatial ICs. It uses some of the definitions introduced in <ref type="bibr" target="#b14">[15]</ref> and is independent of the geometric data model underlying the representation of spatial data types.</p><p>A spatia-relational database schema is of the form Σ = (U, A, S, R, T , O), where: (a) U is the possibly infinite database domain of atomic thematic values that includes R. (b) A = {A 1 , . . . , A n } where each A i is a thematic attribute which takes values in R. (c) S = {S 1 , . . . , S n } where each S i takes admissible values in P(R 2 ), the power set of R 2 . (d) R is a finite set of spatia-relational predicates each of them with a finite and ordered set of attributes belonging to A or S. We sometimes use R(A 1 , . . . , A n ) to represent the predicate with its set of attributes. (e) T is a fixed set of binary topological predicates. (f) O is a fixed set of geometric operators that take spatial and thematic arguments and return a geometry or a value in U.</p><p>A database instance D of a spatia-relational schema Σ is a finite collection of ground atoms (or spatial database tuples) of the form</p><formula xml:id="formula_0">R(c 1 , . . . , c i , . . . , c n ), where (a) R(A 1 , . . . , A i , . . . , A n ) ∈ R, (b) if A i ∈ A, then c i ∈ U (c) if A i ∈ S, then c i ∈ Ad ⊆ P(R 2 )</formula><p>where Ad is the class of closed regions in the topological space formed by point sets of the Euclidean space. These regions represent real objects that have geographic extent, this is, they are not empty.</p><p>The elements of T are binary topological relations with a fixed semantics. There are eight basic binary relations: Overlaps (O), Equal (EQ ), CoveredBy (CB ), Inside (IS ), Covers (CV ), Includes (IC ), Touches (TO), and Disjoint (D ) <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b4">5]</ref> <ref type="foot" target="#foot_0">1</ref> . In addition to the basic topological relations, SQL languages consider some derived relations that can be logically defined in terms of the other basic predicates: Intersects (IT ), Within (W ), and Contains (C) <ref type="bibr" target="#b12">[13]</ref>. A lattice showing how these relations are grouped to derive new relations is shown in Figure <ref type="figure" target="#fig_0">2</ref>. The eight basic relations are shown in boxes containing prototypical cases and the relations included in current SQL are represented with thick borders. For example, Contains is obtained by grouping Equal, CoveredBy and Includes, this is,</p><formula xml:id="formula_1">A Contains B if either A is Equal to, is CoveredBy or Includes B.</formula><p>Observe that relation IDisjoint (IDC) (internally disjoint) was added even though it is neither basic nor used in SQL. Nonetheless, it is added because it is useful to represent situations found in practice. In what follows we assume that T contains all the relations shown in Figure <ref type="figure" target="#fig_0">2</ref>. Given a database instance, additional spatial information is usually computed from the explicit geometric data by means of the spatial operators in O. Some relevant operators are: Area, Union, Intersection and Difference (cf. <ref type="bibr" target="#b12">[13]</ref> for the complete set of spatial predicates defined within the Open GIS Consortium).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Spatial Semantic Integrity Constraints</head><p>In this section we present a set of spatial semantic integrity constraints that generalizes classic integrity constraints and that are capable of dealing with spatial data.</p><p>To simplify the presentation we will use the following notation: (i) variables x, y, z, x 1 , x ′ , . . . represent values in U or geometries, (ii) variables g, g 1 , g ′ , . . . represent geometries, and (iii) variables u, v, w, u 1 , u ′ , . . . represent values in U. (iv) symbols x, ū, x1 , . . . denote a possibly empty sequence of distinct variables or, with a slight abuse of notation, a set of variables. Definition 1. A spatial semantic integrity constraint (SSIC) is one of the following: Functional Dependency (FD):</p><formula xml:id="formula_2">∀ūȳ 1 ȳ2 v 1 v 2 (P (ū, ȳ1 , v 1 ) ∧ P (ū, ȳ2 , v 2 ) → v 1 = v 2 ) where ȳ1 ∩ ȳ2 = ∅. Inclusion Dependency (IND):</formula><p>∀ūx(P (ū, x) → ∃ȳR(ū, ȳ)) Topological Dependency (TD):</p><formula xml:id="formula_3">∀ūȳ 1 ȳ2 ḡ1 ḡ2 (P (ū, ȳ1 , g 1 ) ∧ R(ū, ȳ2 , g 2 ) n i=1 v i = w i → T (g 1 , g 2 ))</formula><p>where ȳ1 ∩ ȳ2 = ∅ and for every i, variable v i ∈ ȳ1 and variable w i ∈ ȳ2 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Spatial Referential Dependency (RD):</head><p>∀ūxg</p><formula xml:id="formula_4">1 (P (ū, x, g 1 ) → ∃ȳg 2 R(ū, ȳ, g 2 ) ∧ T (g 1 , g 2 )) Comparison Constraint (CC): ∀x(P (x) → t 1 ⊙ t 2 )</formula><p>where ⊙ ∈ {&gt;, =, =} and terms t 1 and t 2 are constructed from variables in x and operators in O.</p><p>2</p><p>FDs and INDs correspond to the traditional constraints since there are no joins nor references between geometrical attributes. Indeed, spatial attributes are not used in keys nor in referenced attributes since that would have a high computational cost associated in terms of space and time. Also, from a practical point of view, geometries by themselves do not identify spatial objects, so they need some kind of thematic key. TDs enforce topological relations between spatial attributes and RDs enforce inclusion dependencies where the spatial attributes of the referenced table should have some particular spatial properties. Finally, comparison constraints can be used to compare values within a tuple in P . Note that none of the constraints considers joins on geometric attributes. Since all the topological relations in T have their converse (or inverse) relation in T , there is no need to have an RD with T (g 2 , g 1 ) in replacement of T (g 2 , g 1 ). Indeed the constraint ∀xg 1 (P (x, g 1 ) → ∃g 2 R(x, g 2 ) ∧ CoveredBy(g 2 , g 1 ) can be replaced by ∀xg 1 (P (x, g 1 ) → ∃g 2 R(x, g 2 ) ∧ Covers(g 1 , g 2 ). Example 2. (example 1 cont.) The following SSICs can be defined:</p><formula xml:id="formula_5">∀ (Landparcel(u, v, w, g) ∧ Landparcel(u, v ′ , w ′ , g ′ ) → v = v ′ ) (1) ∀ (Landparcel(u, v, w, g) ∧ Landparcel(u, v ′ , w ′ , g ′ ) → w = w ′ ) (2) ∀ (Landparcel(u, v, w, g) ∧ Landparcel(u, v ′ , w ′ , g ′ ) → Equal(g, g ′ )) (3) ∀ (Building(u, v, g) ∧ Building(u, v ′ , g ′ ) → v = v ′ ) (4) ∀ (Building(u, v, g) ∧ Building(u, v ′ , g ′ ) → Equal(g = g ′ )) (5) ∀ (Landparcel(u, v, w, g) ∧ Landparcel(u ′ , v ′ , w ′ , g ′ ) ∧ u = u ′ → IDisjoint(g, g ′ )) (6) ∀ (Building(u, v, g) ∧ Building(u ′ , v ′ , g ′ ) ∧ u = u ′ → IDisjoint(g, g ′ ))<label>(7)</label></formula><p>∀ (Building(u, v, g) → ∃xyzg ′ (Landparcel(x, y, z, g ′ ) ∧ Within(g, g ′ )))</p><formula xml:id="formula_6">(8) ∀ (Landparcel(u, v, w, g) → Area(g) = w)<label>(9)</label></formula><p>Functional dependencies (1-2) together with topological dependency (3) ensure that id is a key of table Landparcel. In the same way, constraints (4-5) enforce that id is a key of Building. The topological dependencies ( <ref type="formula">6</ref>) and ( <ref type="formula" target="#formula_5">7</ref>) ensure that land parcels and buildings, respectively, are either adjacent or disjoint. Finally, the comparison constraint (9) forces attribute area to contain the area of the spatial attribute g. 2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Consistency Problem</head><p>Before checking if a database satisfies a set of spatial semantic constraints, we need to make sure that the constraints themselves are not in conflict. Thus, we first need to check if the integrity constraints are consistent.</p><p>The consistency problem for a database schema Σ and a set of ICs IC , consists on determining whether there exists a non-empty database instance D over Σ that satisfies IC . This is, checking if there exists a nontrivial database that satisfies the constraints.</p><p>This consistency problem is related but different from the topological and realizability problems <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>. A set of topological relations defined over a set of objects is said to be topologically consistent if there are no contradictions among the relations. For example three objects A, B and C and the relations Covers(A, B), Covers(B, C), Disjoint(A, C) are topologically inconsistent since the first two relations imply that A and C cannot be disjoint. There are some cases in which even if the topological relations are not inconsistent, the geometries may not be realizable due to reasons of planarity, this is, there are no geometries in R 2 that satisfy those relations. The problem of determining if those geometries exist is the realizability problem.</p><p>Those problems differ from our consistency problem for two reasons. First, since we do not know in which tables we have tuples nor how many tuples there are, it is unknown the number of objects there are in the spatial database. Second, we do not know the topological relations that will be imposed since that will depend on the values not only in the spatial attributes but in the thematic attributes. Even though the problems differ from the consistency problem, some settings of the topological and realizability problem can be reduced to the consistency problem.</p><p>The following example illustrates the case when the topological relations imposed by spatial integrity constraints contradict topological consistency. Example 3. (example 2 cont.) Consider the set of integrity constraints imposed in example 2. The database administrator could have, by mistake, written constraints ( <ref type="formula">6</ref>) and ( <ref type="formula" target="#formula_5">7</ref>) as follows:</p><p>The first constraint will also be checked when u = u ′ and therefore it requires that a land parcel should be disjoint of or adjacent to itself. This, of course, is not possible. The second constraint enforces the same for Building. Since none of the predicates can contain tuples, the set of ICs is inconsistent.</p><p>2</p><p>The following example shows a situation in which, even though the topological relations are consistent, it is not possible to find geometries in R 2 that satisfy them, this is, the geometries are not realizable.</p><formula xml:id="formula_7">X 1 X 3 X 4 X 5 X 2 Y 1 Y 3 Y 2 Y 4 Y 5 Y 7 Y 8 Y 6 Y 9 Y 10</formula><p>Fig. <ref type="figure">3</ref>. The non-planar graph K5</p><p>Example 4. A graph is said to be non-planar if it is not possible to draw it in the plane without edge intersection. A well known example of non-planar graph is K 5 , the complete graph with five nodes (see Figure <ref type="figure">3</ref>). In this graph, even though all the topological relations are satisfied, it is not realizable in the plane. We can map the topological relations on K 5 as a set of spatial integrity constraints IC 5 defined over a schema S 5 in such a way that the set of constraints is consistent iff K 5 can be drawn in the plane. Let S 5 contain five and ten spatia-relational predicates of the form X i (id, g i ) and Y j (id, g j ) respectively, which represent the nodes and edges of K 5 , respectively, and where id is a thematic attribute, and g i and g j are spatial attributes. Now, let IC 5 contain:</p><p>1. For all Z ∈ {X 1 , . . . , X 5 , Y 1 , . . . , Y 10 } ∀ug 1 g 2 (Z(u, g 1 ) ∧ Z(u, g 2 ) → Equal(g 1 , g 2 )) 2. For all X i ∈ {X 1 , . . . , X 5 } and Y j ∈ {Y 1 , . . . , Y 10 }, where X i and Y j are incident in K 5 :</p><formula xml:id="formula_8">∀ug i (X i (u, g i ) → ∃g j Y j (u, g j )) ∀ug j (Y j (u, g j )) → ∃g i X i (u, g i )) ∀ug i g j (X i (u, g i ) ∧ Y j (u, g j ) → Overlaps(g i , g j )) 3. For all X i ∈ {X 1 , . . . , X 5 } and Y j ∈ {Y 1 , . . . , Y 10 }, where X i and Y j are not incident in K 5 ∀ug i g j (X i (u, g i ) ∧ Y j (u, g j ) → Disjoint(g i , g j )) 4. For all X i , X j ∈ {X 1 , . . . , X 5 } with i = j: ∀ug i vg j (X i (u, g i ) ∧ X j (v, g j ) → Disjoint(g i , g j )) 5. For all Y i , Y j ∈ {Y 1 , . . . , Y 10 } with i = j: ∀ug i g j (Y i (u, g i ) ∧ Y j (u, g j ) → Disjoint(g i , g j ))</formula><p>The INDs enforce that if there is a tuple in any predicate, there will be at least one tuple in all the rest. Since a database is consistent if there is at least one tuple in the database, this constraints will ensure that there is at least one tuple in every predicates. The TDs enforce the topological relations between the different nodes and edges. It is easy to see that IC 5 is consistent iff K 5 is planar. Since K 5 is not planar, there is no non-trivial database that satisfies IC 5 . This situation is interesting since the topological relations enforced by the TDs are not contradictory, but it is not possible to find tuples with objects in two dimensions that satisfy them.</p><p>2</p><p>The constraints we have defined are able to express most of the constraints commonly needed in spatial databases. However, this expressiveness comes with a cost: the consistency problem for SSICs is not tractable.</p><p>Proposition 1. The consistency problem for SSIC is np-hard. 2</p><p>Proof. Hardness can be proven by reduction of the explicit low level conjunctive realizability problem <ref type="bibr" target="#b6">[7]</ref> which is np-complete. This problem consists in determining if there exists a planar model that satisfies a conjunctive topological expression like</p><formula xml:id="formula_9">ψ = (A intersects B) ∧ (B disjoint C) ∧ (A disjoint C).</formula><p>More formally it consists on determining if for a set of objects O there exists a planar model for the topological expression ψ</p><formula xml:id="formula_10">= n i=1 (A i • i B i ) where (1) each • i ∈{intersects,disjoint} 2 , (2) if (A i • i B i ) and (A i • j B i ) belong to ψ, • i = • j , and<label>(3)</label></formula><p>for every A, B ∈ O such that A = B there exists • ∈{intersects,disjoint} such that (A • B) ∈ ψ. The problem is said to be "explicit" since for every pair of objects there always exists a relationship which is unique.</p><p>We need to construct a set of constraints IC ψ defined over a schema Σ ψ such that IC ψ is consistent iff there exists a planar model of ψ for objects in O. Let Σ ψ = (R, A, S, R, T , O) where A = {id}, S = {g} and R = {P A (id, g)|A ∈ O}, T = {Intersects, Disjoint} and O = ∅. Let IC ψ contain:</p><p>1. For every P A ∈ R, a topological dependency: ∀ug 1 g 2 (P A (u, g 1 ) ∧ P A (u, g 2 ) → Equal(g 1 , g 2 )) 2. For every A, B ∈ O where A = B, the spatial referential dependency: ∀ug 1 (P A (u, g 1 ) → ∃g 2 (P B (u, g 2 ) ∧ T (g 1 , g 2 )))</p><p>where</p><formula xml:id="formula_11">T = Intersects if (A intersects B) ∈ ψ or (B intersects A) ∈ ψ Disjoint if (A disjoint B) ∈ ψ or (B disjoint A) ∈ ψ Intuitively,</formula><p>each relation represents one of the objects in ψ, the TDs enforce that for each relation and value u in the first attribute there is a unique geometry, and RDs enforce that all the objects sharing the same value in attribute id satisfy the topological constraints given in ψ.</p><p>First let us show that if IC ψ is consistent, then there is a planar model for ψ. If IC ψ is consistent, there exists a database D defined over Σ ψ that satisfies the constraints and that has at least one tuple in one of the relations. Then, from constraints in (2), it follows that every relation will have at least one tuple. Furthermore, for every predicate P A there exists a constant c and a geometry g A such that (c, g A ) ∈ P A . Since this geometries will also satisfy the topological constraints, if we assign to each object A i geometry g Ai , we have found a planar model for ψ. Now, let us prove that if there is planar model of ψ, IC ψ is consistent. For every object A ∈ O, let g A be the geometry of A in the planar model of ψ. Now, choose a constant c ∈ U and let D be such that (c, g A ) ∈ P A for every A ∈ O. Clearly D satisfies IC ψ and IC ψ is consistent.</p><p>2</p><p>It is an open problem to determine if the realizability problem used in the reduction is in np since in some cases the objects need to be of exponential size.</p><p>A natural question is if we can find a set of constraints which are less expressive than SSIC, but that is still useful and tractable. We now explore the use of constraints that do not consider geometric operators O nor built-ins B. Definition 2. A basic spatial integrity constraints (BSIC) is one of the following: Functional Dependency (FD):</p><formula xml:id="formula_12">∀ūȳ 1 ȳ2 v 1 v 2 (P (ū, ȳ1 , v 1 ) ∧ P (ū, ȳ2 , v 2 ) → v 1 = v 2 ) where ȳ1 ∩ ȳ2 = ∅. Inclusion Dependency (IND): ∀ūx(P (ū, x) → ∃ȳR(ū, ȳ)) Topological Dependency (TD 0 ): ∀ūȳ 1 ȳ2 ḡ1 ḡ2 (P (ū, ȳ1 , g 1 ) ∧ R(ū, ȳ2 , g 2 ) → T (g 1 , g 2 )) where ȳ1 ∩ ȳ2 = ∅. Spatial Referential Dependency (RD 0 ): ∀ūxg 1 (P (ū, x, g 1 ) → ∃ȳg 2 R(ū, ȳ, g 2 ) ∧ T (g 1 , g 2 ))<label>2</label></formula><p>Even though these constraints are much simpler, the consistency problem is still np-hard. This follows directly from the proof in the general case since only TD 0 s and RD 0 s are used.</p><p>Corollary 1. The consistency problem for BSIC constraints is np-hard. 2</p><p>However, these constraints have better properties in some cases. For example consistency of a set of FDs and TD 0 s can be checked in polynomial time.</p><p>Proposition 2. Consistency of a set of FDs and TD 0 s can be checked in polynomial time.</p><p>Proof. First, observe that if a set IC of FDs and TD 0 s is consistent, then there will exist a database with only one tuple in one predicate that satisfies the constraints. Thus, a possible algorithm consists on checking in each predicate P if a tuple with fresh constants in each attribute satisfies the FDs and TD 0 s defined over it. If this is true in any of the predicates, then the set IC is consistent. Otherwise, it is not. 2</p><p>A more general an interesting result is that if there are no cycles through INDs and RD 0 s, consistency can be checked in polynomial time.</p><p>Definition 3. Given a schema S and a set of BSIC IC , let the dependency graph G of IC be a digraph where the nodes corresponds to predicates in S with an edge from predicate P to R if there is an IND ∀ūx(P (ū, x) → ∃ȳR(ū, ȳ)) or a RD 0 ∀ūxg 1 (P (ū, x, g 1 ) → ∃ȳg 2 R(ū, ȳ, g 2 ) ∧ T (g 1 , g 2 )) in IC . A set of BSICs is said to be acyclic if its dependency graph is acyclic. 2 Proposition 3. Consistency of an acyclic set of BSICs can be checked in polynomial time.</p><p>Proof. If a set IC of acyclic BSICs is consistent, then there will exist a database with only one tuple in a leaf predicate that satisfies the constraints. Thus, IC is consistent iff one of the leafs satisfies its FDs and the TD 0 . Since consistency of FDs and the TD 0 can be done in polynomial time, consistency of IC can be checked in polynomial time. 2</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Related Work and Conclusion</head><p>In the spatial domain, some related studies address the specification of topological constraints (domain constraints) <ref type="bibr" target="#b0">[1]</ref> and spatial semantic constraints <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b9">10]</ref>.</p><p>There have been also studies concerning models for checking topological consistency at multiple representations and for data integration <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b10">11]</ref>. In all these studies, however, the consistency problem has not been addressed, and spatial integrity constraints have been treated without considering the interaction with thematic attributes in a database model. The specification of constraints involving topological relations and alphanumeric attributes was studied at the conceptual level in <ref type="bibr" target="#b2">[3]</ref>. This study, however, does not address the association of the constraints at the conceptual level with already existing semantic constraints at the logical level. Unlike previous works, we have presented a formalization of integrity constraints that combine thematic with spatial attributes in a database model and that extends classical notions of functional and inclusion dependency. We have analyzed the consistency problem and found that it is not tractable in general. However, we have identified an interesting case for which the problem can be solved in polynomial time.</p><p>We plan to extend our results by analyzing other interesting cases in which the problem is tractable and to develop heuristics for those cases that it is not. In this direction, we would like to study the complexity of the consistency problem if we remove the requirement of realizability and only require topological consistency. This can be easily achieved by assuming that the spatial attributes take values in R 3 instead of R 2 . Since the topological consistency problem is tractable in most settings, this can be a good approximation of the consistency problem.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Subsumption lattice of relations O (Overlaps), CB (CoveredBy), IS (Inside), EQ (Equal), CV (Covers), IC (Includes), TO (Touches), D (Disjoint), IT (Intersects), IDC (IDisjoint), W (Within), and C (Contains).</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The names of relations given here are in agreement with the names used in current SQL languages<ref type="bibr" target="#b12">[13]</ref>, but differ slightly from the ones in the research literature.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_1">∀ (Landparcel(u, v, w, g) ∧ Landparcel(u ′ , v ′ , w ′ , g ′ ) → IDisjoint(g, g ′ )) ∀ (Building(u, v, g) ∧ Building(u ′ , v ′ , g ′ ) → IDisjoint(g, g ′ ))</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">The topological relation called intersects in this paper is referred as overlaps in<ref type="bibr" target="#b6">[7]</ref> </note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements. This work is funded by Bicentenario Program PSD 57. Andrea Rodríguez is also funded by Fondecyt 1080138, and Loreto Bravo by Fondecyt 11080260, Conicyt -Chile.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Spatial Integrity Constraints in Object Oriented Geographic Data Modeling</title>
		<author>
			<persName><forename type="first">K</forename><surname>Borges</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Laender</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Davis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ACM-GIS</title>
				<imprint>
			<date type="published" when="1999">1999</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A Taxonomy of Spatial Integrity Constraints</title>
		<author>
			<persName><forename type="first">S</forename><surname>Cockcroft</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">GeoInformatica</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="327" to="343" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Precise Modeling and Verification of Topological Integrity Constraints in Spatial Databases: From an Expressive Power Study to Code Generation Principles</title>
		<author>
			<persName><forename type="first">M</forename><surname>Duboisset</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Pinet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schneider</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ER</title>
		<title level="s">Springer LNCS</title>
		<imprint>
			<date type="published" when="2005">2005</date>
			<biblScope unit="volume">3716</biblScope>
			<biblScope unit="page" from="465" to="482" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Evaluating Inconsistency among Multiple Representations</title>
		<author>
			<persName><forename type="first">M</forename><surname>Egenhofer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Clementine</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Di Felice</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Spatial Data Handling</title>
				<imprint>
			<date type="published" when="1995">1995</date>
			<biblScope unit="page" from="901" to="920" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Point Set Topological Relations</title>
		<author>
			<persName><forename type="first">M</forename><surname>Egenhofer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Franzosa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IJGIS</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="161" to="174" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Assessing the Consistency of Complete and Incomplete Topological Information</title>
		<author>
			<persName><forename type="first">M</forename><surname>Egenhofer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sharma</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Geographical Systems</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="47" to="68" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Topological Inference</title>
		<author>
			<persName><forename type="first">M</forename><surname>Grigni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Papadias</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">H</forename><surname>Papadimitriou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IJCAI</title>
				<imprint>
			<date type="published" when="1995">1995</date>
			<biblScope unit="page" from="901" to="907" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">An Introduction to Spatial Database Systems</title>
		<author>
			<persName><forename type="first">R</forename><surname>Güting</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">VLDB Journal</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="357" to="399" />
			<date type="published" when="1994">1994</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">GraphDB: Modeling and Querying Graphs in Databases</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">Hartmut</forename><surname>Güting</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">VLDB</title>
				<imprint>
			<date type="published" when="1994">1994</date>
			<biblScope unit="page" from="297" to="308" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A Model for Expressing Topological Integrity Constraints in Geographic Databases</title>
		<author>
			<persName><forename type="first">T</forename><surname>Hadzilacos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tryfona</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Spatio-Temporal Reasoning</title>
		<title level="s">Springer LNCS</title>
		<imprint>
			<date type="published" when="1992">1992</date>
			<biblScope unit="volume">639</biblScope>
			<biblScope unit="page" from="252" to="268" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">On Topological Elementary Equivalence of Spatial Databases</title>
		<author>
			<persName><forename type="first">B</forename><surname>Kuijpers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Paredaens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Van Den Bussche</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICDT, Springer LNCS</title>
				<imprint>
			<date type="published" when="1997">1997</date>
			<biblScope unit="volume">1186</biblScope>
			<biblScope unit="page" from="432" to="446" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Reasoning on Spatial Semantic Integrity Constraints</title>
		<author>
			<persName><forename type="first">S</forename><surname>Mäs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">COSIT, Springer LNCS</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">4736</biblScope>
			<biblScope unit="page" from="285" to="302" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Opengis Simple Features Specification for SQL</title>
		<author>
			<persName><surname>Opengis</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
		<respStmt>
			<orgName>Open GIS Consortium</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical report</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Towards a Theory of Spatial Database Queries</title>
		<author>
			<persName><forename type="first">J</forename><surname>Paredaens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Van Den Bussche</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Van Gucht</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">PODS</title>
				<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="1994">1994</date>
			<biblScope unit="page" from="279" to="288" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Data Models and Query Languages for Spatial Databases</title>
		<author>
			<persName><forename type="first">J</forename><surname>Paredaens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Kuijpers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data and Knowledge Engineering</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="29" to="53" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">A Spatial Logic based on Regions and Connection</title>
		<author>
			<persName><forename type="first">D</forename><surname>Randell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Cohn</surname></persName>
		</author>
		<editor>KR</editor>
		<imprint>
			<date type="published" when="1992">1992</date>
			<biblScope unit="page" from="165" to="176" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">A Methodology for Spatial Consistency Improvement of Geographic Databases</title>
		<author>
			<persName><forename type="first">S</forename><surname>Servigne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ubeda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Puricelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Laurini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">GeoInformatica</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="7" to="34" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Fundamentals of Spatial Information Systems</title>
		<author>
			<persName><forename type="first">D</forename><surname>Thompson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Laurini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Number 37</title>
				<imprint>
			<publisher>APIC. Academic Press</publisher>
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Consistency among Parts and Aggregates: A Computational Model</title>
		<author>
			<persName><forename type="first">N</forename><surname>Tryfona</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Egenhofer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions on GIS</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="189" to="206" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
