<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Turning The Partial-closed World Assumption Upside Down</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Simon</forename><surname>Razniewski</surname></persName>
							<email>razniewski@inf.unibz.it</email>
							<affiliation key="aff0">
								<orgName type="institution">Free University of Bozen-Bolzano</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ognjen</forename><surname>Savković</surname></persName>
							<email>savkovic@inf.unibz.it</email>
							<affiliation key="aff0">
								<orgName type="institution">Free University of Bozen-Bolzano</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Werner</forename><surname>Nutt</surname></persName>
							<email>nutt@inf.unibz.it</email>
							<affiliation key="aff0">
								<orgName type="institution">Free University of Bozen-Bolzano</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Turning The Partial-closed World Assumption Upside Down</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">ACC297C7E4BDA96CD0B5C102F9439E96</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T04:48+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The partial-closed world setting provides an intermediate ground between open-world and closed-world settings. Previous work on the partial-closed world assumption assumes that incompleteness is the default, i.e., that databases in general can be incomplete, and are only definitely complete in specified parts. In this work we turn this assumption around, and study databases where completeness is the default, and incompleteness only occurs in specified parts. We present four languages for describing potentially incomplete parts of databases, full-table statements, patterns, local statements and query statements. We show that except for full-table statements, it is not possible to translate between the two settings without extending the languages. Finally, we present techniques to decide query completeness entailment for each of the languages both on the schema and instance level, finding that for queries under bag semantics, the complexity is sometimes easier than under the setting where complete parts are specified.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Databases are traditionally considered under the closed-world assumption <ref type="bibr" target="#b14">[15]</ref>, while for many knowledge bases and the semantic web, the open-world assumption <ref type="bibr" target="#b9">[10]</ref> is used. In many applications however, only parts of databases are complete, while in other parts, data is known to be missing, or potentially missing. One can call this the partial-closed world assumption (PCWA). A problem for databases under the PCWA is to decide whether a query returns all possible answers, in which case the query is called complete.</p><p>Previous work on query completeness under the PCWA used so-called completeness statements to describe complete parts, while assuming that the undescribed parts of the database can be incomplete <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b13">14,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b3">4,</ref><ref type="bibr" target="#b2">3]</ref>. We call this the incompleteness-as-default (IAD) setting.</p><p>In this paper, we turn this assumption around and instead assume that databases are only incomplete in explicitly described parts, while all other parts are complete. We call this setting completeness-as-default (CAD). The study of this setting is motivated by the observation that in many business applications, data is generally of good quality, while only specific parts are questionable.</p><p>Motivating Example. As an example, consider a university database containing three tables:</p><p>student(name,degree) lecturer(name,faculty) takes(name,course).</p><p>Assume that for all faculties but Computer Science the course enrolment has already finished, so takes is only potentially incomplete for enrolments of CS students. Then clearly, queries for courses of students in the Philosophy degree will still produce complete results, while queries for subjects of students in the Computer Science degree might not.</p><p>In order to translate this situation into the IAD setting, we would need a completeness statement for the takes table for all entries for students that are not in the CS degree, which requires the use of negation.</p><p>Another motivation for incompleteness statements is that they can be naturally translated into tasks. In the research information system of our university, for instance, academics are periodically given tasks to complete information about publications in recent years. These tasks are the same as incompleteness statements for the respective periods. Also, the use of incompleteness statements allows to directly point out why query answers are not complete, which in general, in the incompleteness-as-default setting cannot be done.</p><p>Research questions. Motivated by the example above, we study three questions:</p><p>1. How can one describe potentially incomplete parts of a database? 2. Which language extensions are needed to translate descriptions from the CAD to the IAD setting? 3. How can one decide whether a query answer is complete? Contribution. Our contribution is to study the completeness-as-default setting for partially complete databases. In particular we present <ref type="bibr" target="#b0">(1)</ref> four different languages for describing potential incompleteness, which are not symmetric to the ones used in the IAD setting, (2) we show that except for a trivial case, it is not possible to translate between the two settings without extending the languages, and (3) we present techniques to decide query completeness entailment for each of the languages both on the schema and instance level.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Formalization</head><p>We assume a fixed set of relation symbols Σ, and we deal with conjunctive queries that are satisfiable and do not contain arithmetic comparisons.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Incompleteness Statement Languages</head><p>We study four languages for specifying possibly incomplete parts of a database:</p><p>1. Full-table statements, 2. Pattern statements <ref type="bibr" target="#b12">[13]</ref>, 3. Local statements, similar to the local completeness statements in <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b13">14]</ref>, 4. Query statements, similar to the query completeness statements in <ref type="bibr" target="#b10">[11]</ref>.</p><p>Intuitively, full-table statements can be used to assert that a whole table is potentially incomplete. They are a very coarse way to describe possible incompleteness, nevertheless, they are the only of the languages studied here for which it is possible to translate between completeness as default and incompleteness as default without extending the language. Patterns were introduced in <ref type="bibr" target="#b12">[13]</ref>, while the local statements closely resemble the local completeness statements introduced in <ref type="bibr" target="#b7">[8]</ref>, and the query statements somewhat the query completeness statements introduced in <ref type="bibr" target="#b10">[11]</ref>. We next formalize the statements and their semantics. We next formalize the models of potential incompleteness statements. As common <ref type="bibr" target="#b13">[14]</ref>, an incomplete database is a pair (D i , D a ) of an ideal and an available database with D a ⊆ D i . Given an ideal database D i and a fact f , we say that a potential incompleteness statement explains the absence of f from D a , if</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Potential Incompleteness</head><formula xml:id="formula_0">-Full table statement PotInc(R): f is a fact in relation R. -Pattern statement PotInc(σ p (R)): f is an R-fact that satisfies the selection condition p -Local statement PotInc(R( x); G): f is in Q R;G (D i ), where Q R;G is the query Q R;G ( x) :-R( x), G. -Query statement PotInc(Q):</formula><p>There exists a valuation for Q over D i that maps some atom in the body of Q to f .</p><p>Intuitively, models of a set of incompleteness statements are those incomplete databases where incompleteness occurs only in parts described by incompleteness statements.</p><p>Definition 1. Given a set of incompleteness statements S, models are all incomplete database (D i , D a ) where the absence of each fact f in D i \ D a is explained by some statement in S.</p><p>Example 2. Consider the statement PotInc(student(n, c); takes(n, Databases)) from Ex. 1, which says that records in student of students taking Databases are possibly incomplete. Models of this statement are all incomplete databases (D i , D a ) where the only difference between D i and D a are student facts for students that take Databases according to D i .</p><p>Unlike for instance in propositional logic, models of potential incompleteness statements are monotonic. The incomplete database (D i , D a ) with D i = {student(John, CS), lecturer(Mary, ST)} and D a = ∅ is neither a model of PotInc(student) nor of PotInc(lecturer) alone, but of the two together. In general, supersets of potential incompleteness statements have more models than their subsets. We find the following order of expressivity of languages for potential incompleteness:</p><p>Proposition 1 (Expressiveness of Languages). Proof. Subsumption (2) holds because any pattern statement PotInc(σ p (R)) can be expressed using a query statement PotInc(Q() :-σR). Similarly, (3) holds as any potential incompleteness of a table can be expressed equivalently with a pattern stating that any part of the table is potentially incomplete. Regarding (1), observe that given a query Q :-R 1 , . . . , R n that is potentially incomplete, one can equivalently formulate n local statements which assert PotInc(R i ; R 1 , . . . , R i−1 , R i+1 , . . . , R n ). An illustration is shown in Fig. <ref type="figure" target="#fig_0">1</ref>.</p><p>Query Completeness Under the OWA, conjunctive queries only return certain answers, while under the CWA, the returned answers are also all possible answers. Under the PCWA, one can investigate whether a query returns all possible answers, which is a property called query completeness. Formally, an incomplete database satisfies query completeness under bag (set) semantics, if the bag (set) of answers to the query over the ideal database is the same as over the available database. We then write Compl(Q). Similarly, a set of incomplete databases satisfies query completeness, if each member satisfies query completeness. Note that query completeness inferences are nonmonotonic wrt. potential incompleteness statements: More potential incompleteness statements imply that fewer queries can be inferred to be complete.</p><p>In the rest of the paper, we study three problems.</p><p>Problem 1 (Translation). Given a language of potential incompleteness statements, which additions are needed to translate any set of statements from the CAD to the IAD setting?</p><p>Problem 2 (Schema Reasoning). Given a set S of statements and a query Q, is Q complete over all models of S, i.e., S | = Compl(Q)?</p><p>Problem 3 (Instance Reasoning). Given statements S, a database D a and a query Q, is Q complete over all models of S where D a is fixed, i.e., S | = D a Compl(Q)?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Translation</head><p>In this section we study how to translate from the CAD to the IAD setting.</p><p>Example 3. Recall the full-table statement PotInc(student) from Ex. 1. Given that the database has only three tables, this can be translated into the IAD setting by stating that the tables lecturer and takes are complete, i.e., Compl(lecturer) and Compl(takes). In the motivating example we already saw that negation may be needed.</p><p>We next briefly formalize the semantics of completeness statements. An incomplete database (D i , D a ) satisfies a completeness statement, if:</p><formula xml:id="formula_1">-Full table statement PotInc(R): R(D i ) = R(D a ).</formula><p>-Pattern statement PotInc(σ p (R)) <ref type="bibr" target="#b12">[13]</ref>:</p><formula xml:id="formula_2">σ p (R(D i ) ⊆ R(D a ) -Local statement PotInc(R( x); G) [8,14]: The result of Q R;G ( x) :-R( x), G over D i is contained in R(D a ). -Query statement PotInc(Q) [11]: Q(D i ) = Q(D a ).</formula><p>Unlike potential incompleteness statements, completeness statements are monotonic, i.e., an incomplete database satisfies a set of completeness statements, iff it satisfies each of them. We next show how translation between CAD and IAD can work in principle.</p><p>The next proposition shows that unless for full table statements, translation from the CAD to the IAD setting is not possible without additional assumptions. (2): A translation is possible if instantiated domains are finite, e.g. if gender is either male or female, then PotInc(σ gender=male (person)) is analogue to Compl(σ gender=female (person)), otherwise, we would need an infinite set of statements. If disequality is allowed, one could assert Compl(σ gender male (person)).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Proposition 2 (Translatability from CAD to IAD).</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">For full</head><p>(3): Additionally to finite domains or disequality, negations is needed, as shown in the introductory example.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Completeness Entailment for Queries under Bag Semantics</head><p>In the following, we develop techniques for deciding query completeness entailment for queries under bag semantics. Somewhat surprisingly, for all four languages, schema reasoning has the same complexity (PTIME), and instance reasoning has the same complexity (coNP), which is generally easier than in the IAD setting.</p><p>Let us see first how to reason with full-table and pattern statements:</p><p>Example 4 (Full-table and Pattern Statements). Consider the following query Q Databases (n) :student(n, c), takes(n, Databases), and a potential incompleteness statement for the takes table, that is, PotInc(takes). Then, this statement does not entail query completeness, because missing records in takes could lead to an incomplete query result. Alternatively, assume a potential incompleteness statement PotInc(σ course=Algorithms (takes)). Clearly, this statement has no influence on the query Q Databases that asks for students that take Databases, thus, query completeness is entailed.</p><p>For local statements, we find that, unlike in the IAD setting, even statements that contain relations not appearing in the query can influence query completeness.</p><p>Example 5 (Local Statements). Consider an incompleteness statement for takes for remote students, that is, PotInc(takes(n, d); remoteStudent(n)). This can lead to a potentially incomplete query result for Q Databases , because remote students might take Databases, thus, no completeness holds.</p><p>As query statements are just collections of atoms that might be incomplete, they behave analogously to pattern statements.</p><p>Example 6 (Query Statements). Consider an incompleteness statement for a query for all lecturers that are also students,</p><formula xml:id="formula_3">PotInc(Q(n) :-student(n, d), lecturer(n, f )).</formula><p>Clearly, such students could also take Databases and thus, if missing, could lead to an incomplete query result for Q Databases , thus, no query completeness holds.</p><p>Formally, we find the following:</p><p>Theorem 1 (Schema Reasoning for Bag Semantics). For all languages, the combined complexity of schema reasoning is PTIME.</p><p>Proof. We only discuss local statements, as they entail all other statements. For local statements of the form PotInc(R i ( xi ), G i ), completeness holds if and only if none of the R i ( xi ) atoms can be unified with any atom in Q.</p><p>Wrt. the database instance, in general, more conclusions are possible.</p><p>Example 7 (Instance Reasoning). Consider again the situation from Ex. 4, where PotInc(takes) holds, but assume that we additionally know that student is empty. Then, no matter whether records are missing in takes, the result of Q Databases will always be empty and hence complete.</p><p>Theorem 2 (Instance Reasoning for Bag Semantics). For all languages, instance reasoning is coNP-hard in combined complexity. For full table and pattern statements, the combined complexity is also coNP, and the data complexity is PTIME.</p><p>Proof. (coNP-membership): To show that completeness does not hold, it suffices to guess an instantiation θ for the body B of Q such that at least one fact in θB is not in D a , and such that (D a , D a ∪ θB) satisfies S. As the latter part can be done in PTIME, the entailment checking is in coNP.</p><p>(PTIME data complexity): Observe that for full-table and pattern statements, the guesses are especially easy, as it suffices to consider the constants in the database plus at most as many new ones as there are variables in the query, thus, there are polynomial many possible valuations to consider. For local and query statements, one also has to show that the new facts are explained by potential incompleteness statements, which may increase the complexity.</p><p>(coNP-hardness): We can reduce graph colorability to instance reasoning with full table statements as follows: Consider a graph G. We construct a Boolean query Q() :-R(), G , where G contains for each edge from node x to node y in G an atom E(x, y) (where x and y are variables). Furthermore, let D a = {E(r, g), E(r, b), E(g, r), E(g, b), E(b, r), E(b, g)}, that is, the database contains the six allowed colorings for adjacent vertices. Finally, let S = {PotInc(R)}. Then, S and D a entail Compl(Q) if and only if G is not three-colorable.. Assume G is not colorable. Then there is no homomorphism from G into the (complete) set of allowed colorings in any D i , and hence, Q would return the false also over any D i and hence Q would be complete. On the other hand, if G is three-colorable, a homomorphism from G to D a exists and hence we can construct a D i = D a ∪ R() where Q evaluates to true.</p><p>For local and query statements it is not yet known how to decide entailment, because new facts in θB might require justifications that use again new facts, which require again justifications, and so on, similarly to chase-procedures. All results for queries are summarized in Table <ref type="table" target="#tab_3">2</ref>, where we also list known complexities from the IAD setting.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Completeness Entailment for Queries under Set Semantics</head><p>Queries can be evaluated under bag or set semantics, and it is possible that a query is complete under set, but not under bag semantics. Example 8. Consider the query Q(n) :takes(n, Databases), takes(n, x) that, under bag semantics, asks for the number of courses that students taking databases take. Consider also the incompleteness statement PotInc(σ course=OS (takes)). Then the above query might not return the correct count (bag), but it would still return the correct set of all students that take databases and something else.</p><p>For queries under set semantics, entailment is generally more difficult.  <ref type="figure">PotInc(E(r, b</ref>)) and so on (the six allowed colorings). Then Q is complete exactly if G is not colorable.</p><p>Π P 2 -membership can be shown by the same technique as used in the proof of coNP-membership in Thm. 2. Now however it needs to be shown that the newly retrieved fact is not already returned over D a , thus, after the first guess, a call to an NP-oracle is needed.</p><p>Instance reasoning is hard even for full-table statements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Theorem 4 (Instance Reasoning for Set Semantics).</head><p>Instance reasoning for all languages is Π P 2 -hard in combined complexity, and in Π P 2 for full-table and pattern statements.</p><p>Proof. (Hardness): Can be shown by reduction of 3-coloring extension problem <ref type="bibr" target="#b0">[1]</ref>.</p><p>(Membership): Can be shown using the same technique as in the proof of Thm. 2.</p><p>Interestingly, even reasoning for Boolean queries provides a challenge. Proposition 3 (Boolean Queries). Instance reasoning with full table statements for boolean queries is co-DP-complete.</p><p>Proof. We show that incompleteness is DP-complete. We show hardness by encoding critical 3-colorability problem which is DP-complete <ref type="bibr" target="#b11">[12]</ref>. To show membership we check incompleteness by calling two NP oracles: one that checks if a boolean query evaluates to false over a given database, and the other that checks if the query evaluates to true over a representative ideal database.</p><p>All hardness results also for linear queries (because the complexity comes from the choices for the mapping), as one could reduce satisfiability of quantified boolean formulas (QBF). So all complexities above hold even with fixed database. Also, if the number of variables in the query is bounded, entailment becomes polynomial, as there are only polynomially many possible mappings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Related Work</head><p>The problem of query completeness entailment in the IAD setting has been introduced by <ref type="bibr">Motro [11]</ref> and Halevy <ref type="bibr" target="#b7">[8]</ref>. Most complexity results for this setting appeared in <ref type="bibr" target="#b13">[14]</ref>.</p><p>Query completeness in the CAD setting is closely related to the problem of queries independent of updates (QIU). QIU investigates whether a query result can change as a consequence of a database update. Intuitively, query completeness entailment holds, if the given query is independent from all insertion operations corresponding to the incompleteness statements.</p><p>QIU has been introduced by Blakeley et al. <ref type="bibr" target="#b1">[2]</ref>, who investigated QIU for conjunctive queries without selfjoins and deletion, insertion and modification updates that correspond to patterns. Both queries and statements were allowed to contain comparisons. They showed that deciding QIU wrt. a database instance in these cases is in coNP for selfjoin-free queries that may contain comparisons. Elkan <ref type="bibr" target="#b4">[5]</ref> and Halevy and Sagiv <ref type="bibr" target="#b8">[9]</ref> expanded this work to recursive queries and queries containing negation. They provided sufficient conditions for QIU for such cases in terms of query reachability, and showed that in special cases these conditions were also necessary. Recent work studies QIU also for the SPARQL language <ref type="bibr" target="#b6">[7]</ref>.</p><p>Implementation Techniques Schema reasoning is straightforward for all four languages, each time requiring to check for unifiability between single atoms. Implementing instance reasoning for full table and pattern statements is possible as follows: Consider that the query has the form Q() :-A 1 , . . . , A n , where atoms A 1 to A m are complete, and atoms A to A n are potentially incomplete. If some atom in A m+1 to A n contains a variable that does not appear in A 1 to A m , it suffices to check whether the A 1 to A m can be evaluated over D a . If none of the potentially incomplete atoms has a new variable, one can proceed as follows: For each atom A i (s i ), i = m + 1 . . . n one can build a query Q(s i ) :-A 1 , . . . , A m , ¬A i . If this query returns any answer over D a , this means that the query is not complete. To deal with pattern statements, one would add the condition to the query. Thus, completeness reasoning can be implemented using query evaluation.</p><p>Combining CAD and IAD One can imagine situations where some relations are interpreted under IAD and some under CAD. An example could be internal research evaluation by publication analysis, where publications of the members of an institute are in a local information system and thus complete by default, while publications of coauthors are taken from the web and thus generally incomplete. To reason about query completeness in such settings, one could proceed in three steps: (1) one would assert completeness for all relations under CAD assumption, (2) one would perform completeness reasoning using all completeness statements, and (3) one would perform completeness reasoning using only the potential incompleteness statements. Consequently, the complexity would be the union of the complexities of reasoning under CAD and IAD.</p><p>Incompleteness statements Instead of potential incompleteness statements, one can also image statements that state that a certain part of a database is definitely incomplete, for instance, that the student table is definitely missing some records. For the nonspecified parts, one may either assume completeness or potential incompleteness. In both cases, one could additionally check whether a query is known to be definitely incomplete, which would be a different reasoning problem. However, this inference is not very practical, as it is hardly ever possible. For instance, consider a query Q(x) :-R(x), S(x), and consider that R and S are known to be definitely incomplete (miss some facts). Then this does not allow to conclude that Q is incomplete, because sets of facts missing from R and S might be disjoint. More generally, for pattern statements, inferring definite query incompleteness seems only possible in trivial cases where the query has exactly the same shape as a single definite incompleteness statement.</p><p>Future work will focus filling the gaps in the complexities of completeness entailment, and on the connections to data exchange.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. Entailment of potential incompleteness statement languages.</figDesc><graphic coords="4,376.84,470.12,103.75,130.84" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>Statements.A full table statement has the form PotInc(R), where R is a table.A pattern statement has the form PotInc(σ p (R)), where p is a selection by constants, i.e., p is " x = c", with x being a subset of the attributes of R, and R being a table.A local statement has the form PotInc(R( x); G), where R is a table and G is a conjunction of atoms.A query statement has the form PotInc(Q), where Q is a conjunctive query.</figDesc><table><row><cell>Example 1. Consider a database with three tables, student, lecturer and takes.</cell></row><row><cell>A full-table statement PotInc(student) for student would assert that the student</cell></row><row><cell>table is possibly incomplete.. A pattern statement PotInc(σ degree=CS (student))</cell></row><row><cell>would assert that students in the CS degree are possibly incomplete. A local</cell></row><row><cell>statement PotInc(student(n, d); takes(n, Databases)) would assert that records</cell></row><row><cell>in student of students taking Databases are possibly incomplete. A query</cell></row><row><cell>statement PotInc(Q Databases (n) :-student(n, d), takes(n, Databases)) would as-</cell></row><row><cell>sert that the query for students taking Databases is potentially incomplete,</cell></row><row><cell>which can be either due to missing records in student or to missing records in</cell></row><row><cell>takes.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head></head><label></label><figDesc>table statements, translation is possible. 2. For pattern statements, translation is possible if attribute domains are finite, or disequality is added to the pattern statements. 3. For local and query statements, translation is possible if (1) the domains of all attributes instantiated in the pattern statement are finite or disequality is added, and (2) if negation is added. Proof. (1): Translation is possible by creating a completeness statement for every relation in Σ \ {R | PotInc(R) ∈ S}.</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 2 .</head><label>2</label><figDesc>Combined complexity of completeness entailment under IAD and CAD, and requirements for translation. Straightforward new results for the IAD setting are marked with * and are not explained in the paper.</figDesc><table><row><cell></cell><cell></cell><cell cols="2">Bag semantics</cell><cell></cell><cell></cell><cell cols="2">Set semantics</cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="2">IAD</cell><cell cols="2">CAD</cell><cell cols="2">IAD</cell><cell cols="2">CAD</cell><cell>Translation</cell></row><row><cell></cell><cell>Schema</cell><cell>Instance</cell><cell>Schema</cell><cell>Instance</cell><cell>Schema</cell><cell>Instance</cell><cell>Schema</cell><cell>Instance</cell><cell></cell></row><row><cell></cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell>Reason-</cell><cell></cell></row><row><cell></cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell>ing</cell><cell></cell></row><row><cell>Full-table statements</cell><cell>PTIME*</cell><cell>coNP-complete*</cell><cell>PTIME</cell><cell>coNP-complete</cell><cell>PTIME*</cell><cell>Π P 2 -complete*</cell><cell>PTIME</cell><cell>Π P 2 -complete</cell><cell>Possible</cell></row><row><cell>Pattern statments</cell><cell>PTIME*</cell><cell>coNP-complete*</cell><cell>PTIME</cell><cell>coNP-complete</cell><cell>NP-complete*</cell><cell>Π P 2 -complete*</cell><cell>coNP-hard, in Π P 2</cell><cell>Π P 2 -complete</cell><cell>With finite domains or disequality</cell></row><row><cell>Local statements</cell><cell>NP-complete [14]</cell><cell>in Π P 2 *</cell><cell>PTIME</cell><cell>coNP-hard</cell><cell>NP-complete [14]</cell><cell>Π P 2 -complete [14]</cell><cell>coNP-hard</cell><cell>Π P 2 -hard</cell><cell>With finite domains or disequality, and negation</cell></row><row><cell>Query statements</cell><cell>NP-complete [14]</cell><cell>Π P 2 -complete [6]</cell><cell>PTIME</cell><cell>coNP-hard</cell><cell>Decidab-ility open</cell><cell>Π P 2 -complete [6]</cell><cell>coNP-hard</cell><cell>Π P 2 -hard</cell><cell>With finite domains or disequality, and negation</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Theorem 3 (Schema Reasoning for Set Semantics).</head><label></label><figDesc>Schema reasoning for full table statements in PTIME, for pattern statements in Π P 2 and coNP-hard for pattern, local and query statements in combined complexity. Proof. Membership for full table statements holds as one can just check whether any relation used in the query is potentially incomplete. CoNP-hardness for pattern statements can be shown by reduction of 3colorability. Consider a graph G encoded into a query Q as before. Consider furthermore potential incompleteness statements PotInc(E(r, g)),</figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgment</head><p>This work has been partially supported by the projects "MAGIC", funded by the province of Bozen-Bolzano, and "TaDaQua", funded by the Free University of Bozen-Bolzano.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The closure of monadic NP</title>
		<author>
			<persName><forename type="first">Miklós</forename><surname>Ajtai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ronald</forename><surname>Fagin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Larry</forename><forename type="middle">J</forename><surname>Stockmeyer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Comput. Syst. Sci</title>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="660" to="716" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Updating derived relations: Detecting irrelevant and autonomously computable updates</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Blakeley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Coburn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P.-A</forename><surname>Larson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">VLDB</title>
				<imprint>
			<date type="published" when="1986">1986</date>
			<biblScope unit="page" from="457" to="466" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Towards a logical reconstruction of a theory for locally closed databases</title>
		<author>
			<persName><forename type="first">M</forename><surname>Denecker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Cortés-Calabuig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bruynooghe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Arieli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM TODS</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Efficient reasoning using the local closedworld assumption</title>
		<author>
			<persName><forename type="first">P</forename><surname>Doherty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lukaszewicz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Szalas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">AIMSA</title>
				<imprint>
			<date type="published" when="2000">2000</date>
			<biblScope unit="page" from="49" to="58" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Independence of logic database queries and updates</title>
		<author>
			<persName><surname>Ch</surname></persName>
		</author>
		<author>
			<persName><surname>Elkan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. PODS</title>
				<meeting>PODS</meeting>
		<imprint>
			<date type="published" when="1990">1990</date>
			<biblScope unit="page" from="154" to="160" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Enabling fine-grained RDF data completeness assessment</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Prasojo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ICWE</title>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">On query-update independence for SPARQL</title>
		<author>
			<persName><forename type="first">N</forename><surname>Guido</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Genevès</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Layaïda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Roisin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CIKM &apos;15</title>
				<meeting><address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="1675" to="1678" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Obtaining complete answers from incomplete databases</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Alon</surname></persName>
		</author>
		<author>
			<persName><surname>Levy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">VLDB</title>
				<imprint>
			<date type="published" when="1996">1996</date>
			<biblScope unit="page" from="402" to="412" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Queries independent of updates</title>
		<author>
			<persName><forename type="first">Alon</forename><forename type="middle">Y</forename><surname>Levy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yehoshua</forename><surname>Sagiv</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. VLDB</title>
				<meeting>VLDB</meeting>
		<imprint>
			<date type="published" when="1993">1993</date>
			<biblScope unit="page" from="171" to="181" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">On indefinite databases and the closed world assumption</title>
		<author>
			<persName><forename type="first">J</forename><surname>Minker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Conference on Automated Deduction</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="1982">1982</date>
			<biblScope unit="page" from="292" to="308" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Integrity = Validity + Completeness</title>
		<author>
			<persName><forename type="first">A</forename><surname>Motro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM TODS</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="480" to="502" />
			<date type="published" when="1989">1989</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Computational complexity</title>
		<author>
			<persName><forename type="first">Christos</forename><forename type="middle">M</forename><surname>Papadimitriou</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1994">1994</date>
			<publisher>Addison-Wesley</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Identifying the extent of completeness of query answers over partially complete databases</title>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Korn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Srivastava</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SIGMOD</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Completeness of queries over incomplete databases</title>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">VLDB</title>
				<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">On closed world data bases</title>
		<author>
			<persName><forename type="first">Raymond</forename><surname>Reiter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Logic and Data Bases</title>
				<imprint>
			<date type="published" when="1977">1977</date>
			<biblScope unit="page" from="55" to="76" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
