<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Inconsistency-Tolerant First-Order Rewritability of DL-Lite with Identification and Denial Assertions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Domenico Lembo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maurizio Lenzerini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Rosati</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Ruzzi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Domenico Fabio Savo</string-name>
          <email>savo@dis.uniroma1.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dipartimento di Ingegneria Informatica, Automatica e Gestionale Antonio Ruberti Sapienza Universita` di Roma</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper is motivated by two requirements arising in practical applications of ontology-based data access (OBDA): the need of inconsistencytolerant semantics, which allow for dealing with classically inconsistent specifications; and the need of expressing assertions which go beyond the expressive abilities of traditional Description Logics, namely identification and denial assertions. We consider an extension of DL-Lite (the most used DL in OBDA) which allows for the presence of both the aforementioned kinds of assertions in the TBox. We adopt an inconsistency-tolerant semantics for such a DL, specifically the so-called Intersection ABox Repair (IAR) semantics, and we study query answering in this setting. Our main contribution is a query rewriting technique which is able to take into account both identification and denial assertions. By virtue of this technique, we prove that conjunctive query answering under such semantics is first-order rewritable in the considered extension of DL-Lite.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Ontology-based data access (OBDA) is a computing paradigm which considers the
problem of accessing data stored in autonomous databases through the use of an
ontology, which provides a conceptual and shared representation of the domain of
interest [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The main components of an OBDA system are the ontology, the set of data
sources to be accessed, and the mapping, which specifies the relationship between the
ontology and the data sources. The reasoning service of main interest is query
answering, which amounts to process a query expressed in terms of the ontology alphabet, and
to compute its answer on the basis of the knowledge specified by the ontology, the
mapping, and the data stored at the sources. Various languages for specifying the ontology
in this context have been recently proposed [
        <xref ref-type="bibr" rid="ref2 ref3 ref7">3,2,7</xref>
        ], which are designed in order to allow
for tractable query answering w.r.t. the size of the data (data complexity). Among these,
the members of the DL-Lite family of light-weight Description Logics (DLs) present
the distinguishing feature of first-order rewritable query answering for unions of
conjunctive queries (UCQs), which means that such a reasoning service can be reduced
to the evaluation of a first-order query (directly translatable into SQL) over a database.
This turns out to be a crucial aspect in OBDA, where data are in general not moved from
the sources, and are usually managed by a Relational Database Management System.
      </p>
      <p>
        In the last years we have experimented (see, e.g., [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]) that two important
requirements arise in OBDA applications: (i) the need of declarative mechanisms for dealing
with data that are inconsistent with respect to the intensional knowledge specified by
the ontology, and (ii) the need of expressing assertions which go beyond the expressive
abilities of traditional DLs, namely identification assertions and denial assertions.
Solutions aiming at fulfilling these two requirements should of course still guarantee enough
efficiency. Ideally, they should allow for inconsistency-tolerant first-order rewritable
query answering of UCQs. Devising one such solution is the goal of the present paper.
      </p>
      <p>
        Motivated by the first mentioned requirement, we have recently proposed various
inconsistency-tolerant semantics [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], inspired by the studies on inconsistency handling
in belief revision [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and by the work on consistent query answering in databases [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Later works [
        <xref ref-type="bibr" rid="ref1 ref9">9,1</xref>
        ] have then shown that answering UCQs under the so-called
Intersection ABox Repair (IAR) semantics is first-order rewritable for ontologies specified
in DL-LiteA, a member of the DL-Lite family [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In the present paper we extend the
above result, and show that first-order rewritability of conjunctive query answering still
holds if we enrich DL-LiteA with identification and denial assertions.
      </p>
      <p>
        Identification assertions (IdAs) are mechanisms for specifying a set of properties
that can be used to identify instances of concepts. IdAs we study here have been
originally presented in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Such assertions allow for sophisticated forms of object
identification, which may include paths realized through the chaining of roles, their inverses, and
attributes. Roughly speaking, when we say that instances of a concept C are identified
by a path , we impose that no two instances of C with overlapping sets of -fillers
exist, i.e., in every interpretation I, no two instances o and o0 of C exist with a non-empty
intersection of the set of objects reachable by o and by o0 in I. This naturally extends
to IdAs involving more than one path. Identification assertions are very useful in
practice and are essential to represent n-relations and attributes of roles through reification.
Indeed, reification is the only way to model n-relations and role attributes in ontology
languages, such as OWL, which do not natively allow for such constructs.
      </p>
      <p>
        Denial Assertions (DAs) are instead used to impose that the answer to a certain
boolean conjunctive query over the ontology is false, analogous to negative constraints
in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This is particularly useful to specify general forms of disjointness, which is again
not supported in traditional ontology languages.
      </p>
      <p>
        The query rewriting algorithm given in this paper elegantly deals with all forms of
inconsistency that can arise in DL-LiteA enriched with IdAs and DAs, thus it represents
an alternative to the rewriting algorithm given in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for DL-LiteA only. Moreover, it
properly manages all the inconsistencies caused by erroneous assignments of values to
attributes, i.e., arising when a value of type Ti is assigned to an attribute of value-type
Tj disjoint from Ti. This kind of inconsistency has not been considered in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        In this paper, for the sake of simplicity, we consider the setting of a single ontology
constituted by a TBox and an ABox. However, our technique can be easily extended to
the OBDA setting, where mapping assertions relate the TBox to the data sources [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>The paper is organized as follows. In section 2 we provide some preliminaries. In
Section 3 we provide some general properties of the IAR-semantics, useful for the rest
of the paper. In Section 4, we show first-order rewritability of query answering of UCQs
under the IAR-semantics. In Section 5 we conclude the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>Let be a signature partitioned into P , containing symbols for predicates, i.e.,
atomic concepts, atomic roles, attributes and value-domains, and C , containing
symbols for individual (object and value) constants. Given a DL language L, an L-ontology
O = hT ; Ai over consists of a TBox T , i.e., a set of intensional assertions over
expressed in L, and an ABox A, i.e., a set of extensional assertions over expressed in
L. We assume that ABox assertions are always atomic, i.e., they correspond to ground
atoms, and therefore we omit to refer to L when we talk about ABox assertions.</p>
      <p>The semantics of a DL ontology O is given in terms of first-order (FOL)
interpretations. We denote with Mod (O) the set of models of O, i.e., the set of
FOLinterpretations that satisfy all the assertions in T and A, where the definition of
satisfaction depends on the DL language in which O is specified. An ontology O is satisfiable
if Mod (O) 6= ;, and O entails a FOL-sentence , denoted O j= , if is satisfied by
every I 2 Mod (O). The above definitions naturally apply to a TBox alone.</p>
      <p>
        In the following, we consider DL ontologies hT ; Ai in which the TBox T is always
satisfiable, whereas the ABox A may contradict T . When this happens, we say that A
is T -inconsistent, T -consistent otherwise. For ontologies of this kind, the Intersection
ABox Repair (IAR) semantics introduced in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] allows for meaningful reasoning in the
presence of inconsistency, which is instead trivialized under the FOL-semantics. The
IAR-semantics is defined in terms of A-repairs: given an ontology O = hT ; Ai, an
Arepair for O is a set A0 A such that Mod (hT ; A0i) 6= ;, and there does not exist A00
such that A0 A00 A and Mod (hT ; A00i) 6= ;. In other terms, an A-repair for O is a
maximal T -consistent subset of A. The set of A-repairs for O is denoted by AR-Set(O).
The IAR-semantics is then given in terms of IAR-models, as follows.
      </p>
      <p>Definition 1. Let O = hT ; Ai be a possibly inconsistent DL ontology. The set
of Intersection ABox Repair (IAR)-models of O is defined as ModIAR(O) =
M od(hT ; TAi2AR-Set(O) Aii)</p>
      <p>Notice that if O is satisfiable under FOL-semantics, then Mod (O) = ModIAR(O).
The notion of consistent entailment is then the natural extension of classical
entailment to IAR-semantics: a FOL-sentence is IAR-consistently entailed, or simply
IARentailed, by O, written O j=IAR , if I j= for every I 2 ModIAR(O).</p>
      <p>
        We focus now on DL-LiteA;id, a DL of the DL-Lite family [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] equipped with
identification assertions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Since DL-LiteA;id distinguishes between object and value
constants, we partition the set C into two disjoint sets O, which is the set of constants
denoting objects, and V , which is the set of constants denoting values.
      </p>
      <p>Basic DL-LiteA;id expressions are defined as follows:
B
C
!
!</p>
      <p>A j 9Q j
B j :B
(U )</p>
      <p>Q
R
!
!</p>
      <p>P j P
Q j :Q</p>
      <p>E
F
V
!
!
!</p>
      <p>(U )
T1 j
U j :U
j Tn
A, P , and P denote an atomic concept, an atomic role, and the inverse of an atomic
role; (U ) (resp. (U )) denotes the domain (resp. the range) of an attribute U , i.e.,
the set of objects (resp. values) that U relates to values (resp. objects); T1; : : : ; Tn are
unbounded pairwise disjoint predefined value-domains; B is called basic concept.</p>
      <p>A DL-LiteA;id TBox is a finite set of the following assertions:
B v C</p>
      <p>Q v R</p>
      <p>U v V</p>
      <p>E v F
(funct Q)
(funct U )
(id B
1; : : : ; n)
The assertions above, from left to right, respectively denote inclusions between
concepts, roles, attributes, and value-domains, global functionality on roles and on
attributes, and identification assertions (IdAs). In IdAs, i is a path, which is an
expression built according to the following syntax: ! S j D? j 1 2, where
S denotes an atomic role, the inverse of an atomic role, an attribute, or the inverse of
an attribute, 1 2 denotes the composition of paths 1 and 2, and D?, called test
relation, represents the identity relation on instances of D, which can be a basic
concept or a value-domain. Test relations are used to impose that a path involves instances
of a certain concept or value-domain. In our logic, IdAs are local, i.e., at least one
i 2 f 1; :::; ng has length 1, i.e., it is an atomic role, the inverse of an atomic role,
or an attribute. Intuitively, an IdA of the above form asserts that for any two different
instances o, o0 of B, there is at least one i such that o and o0 differ in the set of their
i-fillers, that is the set of objects that are reachable from o by means of i. For
example, the IdA (id Match homeTeam; visitorTeam) says that there are not two different
matches with the same home team and host team (which is indeed what happens, for
instance, in a season schedule of a football league).</p>
      <p>Inclusion assertions that do not contain (resp. contain) the symbols ’:’ in the
righthand side are called positive inclusions (resp. negative inclusions). The set of positive
(resp., negative) inclusions in T will be denoted by T + (resp., T ).</p>
      <p>A DL-LiteA;id ABox is a finite set of assertions of the form A(a), P (a; b), and
U (a; v), where A is an atomic concept, P is an atomic role, U is an attribute, a and b
are constants of O, and v is a constant of V . In a DL-LiteA;id ontology O = hT ; Ai,
the following condition holds: each role or attribute that either is functional in T or
appears (in either direct or inverse direction) in a path of an IdA in T is not specialized,
i.e., it does not appear in the right-hand side of assertions of the form Q v Q0 or
U v U 0.</p>
      <p>
        The semantics of DL-LiteA;id is given in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. We only recall here that each
valuedomain Ti is interpreted by means of the same function, denoted val (Ti), in every
interpretation, and each constant ci 2 V is interpreted as one specific value, denoted
val (ci). Also, the Unique Name Assumption is adopted.
      </p>
      <p>A boolean conjunctive query (BCQ) over a DL-LiteA ontology is an expression of
the form q = 9~y.conj(~t) where ~y is a set of variables and ~t is a set of terms (i.e.,
constants or variables) such that each variable in ~t is also in ~y, and conj(~t) is a conjunction
of atoms of the form A(z), T (z0), P (z; z0) U (z; z0) where A is an atomic concept, T
is a value-domain, P is an atomic role and U is an attribute, and z, z0 are terms. In the
following, we will mainly consider boolean unions of conjunctive queries (BUCQs),
i.e., first order sentences of the form Q = Wn
i=1 qi where qi = 9y~i.conji(t~i) is a BCQ.</p>
      <p>A BCQ q is satisfied by an ABox A (denoted by h;; Ai j= q) if there exists a
substitution from the variables in q to constants of C such that the formula (q) is
satisfied in the FOL-interpretation structure where the ABox A determines the
extension of concepts, roles and attributes and the function val determines the extension of
the value-domains. With a little abuse of notation we sometimes use (q) to indicate the
set of the atoms in (q). When query q is satisfied by A, we say that A(q) = (q) \ A
is the image of q in A. A BUCQ Q = Wn</p>
      <p>i=1 qi is satisfied by an ABox A if there exists
i 2 f1; : : : ng such that qi is satisfied by A.</p>
      <p>All the results of the present work can be straightforwardly extended to non-boolean
UCQs, by imposing the (natural) condition that every free variable in the query appears
in at least one atom not involving value-domains.</p>
      <p>With the notion of query in place we introduce now denial assertions (DAs). A DA
is an expression of the form 8~y.conj(~t) ! ? where conj(~t) is defined as for BCQs.
A DA is satisfied by an interpretation I if the formula 9~y.conj(~t) evaluates to false
in I. DAs are used to impose general forms of disjointness. For example, the denial
8x; y.Match(x)^homeTeam(x; y)^visitorTeam(x; y) ! ? says that a match cannot
have the same team as both home and visitor team.</p>
      <p>In the following we will consider DL-LiteA;id ontologies whose TBoxes are enriched
with DAs, and call them DL-LiteA;id;den ontologies.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Properties of the IAR-Semantics</title>
      <p>We discuss here some properties of the IAR-semantics that will be used in the rest of
the paper. We recall that the IAR-semantics is defined for DL ontologies composed by a
consistent TBox and an ABox comprising only atomic assertions, therefore we consider
below only this kind of ontologies. We start with the notion of inconsistent set.
Definition 2. Let T be a TBox and let A be an ABox. A is a minimal T -inconsistent set
if A is T -inconsistent, i.e., the ontology hT ; Ai is unsatisfiable, and there is no A0 A
such that A0 is T -inconsistent.</p>
      <p>Let O = hT ; Ai be a possibly inconsistent DL ontology. We denote with
minIncSets (O) the set of minimal T -inconsistent sets contained in A. Notice that O
is satisfiable iff minIncSets (O) = ;. Then, the following property holds.
Proposition 1. Let O = hT ; Ai be a DL ontology such that minIncSets (O) 6= ;, and
let be an ABox assertion in A. An A-repair A0 of O such that 62 A0 exists iff there
exists V 2 minIncSets (O) such that 2 V .</p>
      <p>Let now AR-Set(O) = fA1; : : : Ang be the set of A-repairs of an ontology O =
hT ; Ai, and let q be a BCQ. From Definition 1, we derive that O j=IAR q iff there exists
a subset A0 of A such that A0 Ai for every 1 i n and hT ; A0i j= q. From the
observation above, we derive the following theorem, which characterizes the notion of
IAR-entailment of BCQs.</p>
      <p>Theorem 1. Let O = hT ; Ai be a DL ontology such that minIncSets (O) 6= ;, and let
q be a BCQ. O j=IAR q iff there exists A0 A such that: (i) A0 is T -consistent; (ii)
hT ; A0i j= q; (iii) A0 \ V = ; for every V 2 minIncSets (O).
4</p>
    </sec>
    <sec id="sec-4">
      <title>Query Rewriting under IAR-Semantics in DL-LiteA;id;den</title>
      <p>
        In this section, we focus on DL-LiteA;id;den ontologies, and provide our technique for
computing FOL-rewritings of BUCQs under the IAR-semantics. The notion of
FOLrewritability is essentially the same used under FOL-semantics [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. More precisely,
BUCQs in DL-LiteA;id;den are FOL-rewritable under the IAR-semantics if, for each
BUCQs q and each DL-LiteA;id;den TBox T , there exists a FOL-query qr such that,
for any ABox A hT ; Ai; j=IAR q iff h;; Ai j= qr. Such qr is called the IAR-perfect
reformulation of q w.r.t. T .
      </p>
      <p>To come up with our reformulation method, we exploit Theorem 1. Roughly
speaking, in the reformulation of a BUCQ q over a DL-LiteA;id;den TBox T , we encode into
a FOL-formula all violations that can involve assertions belonging to images of q in
any ABox A. Indeed, this can be done by reasoning only on the TBox, and considering
each query atom separately. Intuitively, we deal with inconsistency by rewriting each
atom of q into a FOL-formula r in such a way that h;; Ai j= r only if there exists
a substitution of the variables in q to constants of C such that q is satisfied by A,
and ( ) does not belong to any minimal T -inconsistent set.</p>
      <p>
        This inconsistency-driven rewriting is then suitably casted into the final
reformulation, which takes into account also positive knowledge of the TBox, i.e., the inclusion
assertions in T +. As we will show later in this section, this can be done by means of a
slight variation of the algorithm PerfectRef of [
        <xref ref-type="bibr" rid="ref10 ref3">3,10</xref>
        ]. To formalize the above idea, we
need to introduce some preliminary definitions.
      </p>
      <p>We consider Boolean queries corresponding to FOL-sentences of the form:
n m
9z1; : : : ; zk: ^ Hi(ti1) ^ ^ :Ti(ti2) ^
i=1 i=1
`
^ Si(ti3; ti4) ^
i=1
h
^ ti5 6= ti6
i=1
(1)
where every Hi is an unary predicate, which is either an atomic concept or a
valuedomain, every Ti is a value-domain, every Si is a binary predicate, which is either an
atomic role or an attribute, every tij is a term (i.e., either a constant or a variable), and
z1; : : : ; zk are all the variables of the query. Notice that sentences of the form (1) are
BCQs enriched with limited forms of inequalities and negation.</p>
      <p>Given a DL-LiteA;id;den TBox T , we denote with contr(T ) the set of all negative
inclusions, functionalities, identifications, denials, and value-domain inclusions
occurring in T . Intuitively, the set contr(T ) contains all those assertions in T which can
be contradicted by assertions in the ABox. Now, to each assertion 2 contr(T ), we
associate a Boolean query, denoted '( ), which encodes the violation of , i.e., it looks
for images of the negation of . Below we provide some of such encodings. Missing
cases are can be obtained in a similar way.
- '((funct P )) :
- '(A1 v :A2) :
- '( (U ) v T ) :
- '(8~x.conj(~t) ! ?) :
9x; x1; x2.P (x; x1) ^ P (x; x2) ^ x1 6= x2
9x.A1(x) ^ A2(x)
9x1; x2.U (x1; x2) ^ :T (x2)
9~x.conj(~t)
For example, '((id Match homeTeam; visitorTeam)) = 9x; x0; y; z.Match(x) ^
homeTeam(x; y) ^ visitorTeam(x; z) ^ Match(x0) ^ homeTeam(x0; y) ^
visitorTeam(x0; z) ^ x 6= x0, and '(8x; y.Match(x) ^ homeTeam(x; y) ^
visitorTeam(x; y) ! ?) = 9x; y.Match(x) ^ homeTeam(x; y) ^ visitorTeam(x; y).</p>
      <p>
        Then, the algorithm unsatQueries(T ) computes the first-order reformulation
under FOL-semantics of '( ) for each 2 contr(T ), and returns the union of all
such reformulations. To this aim, unsatQueries(T ) makes use of the algorithm
PerfectRef6=(T +; '( )), which is a slight variation of the algorithm PerfectRef given
in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In this modified version, inequality is considered as a primitive role and negated
value-domains are considered as primitive concepts, thus inequality and negated atoms
are never rewritten by the algorithm, and the algorithm does not unify query atoms if
this causes a violation of an inequality. Note that the result of PerfectRef6= is a union
of boolean queries of the form (1), represented as a set of queries, as usual.
      </p>
      <p>For example, assume to have a TBox containing the above mentioned IdA 1 :
(id Match homeTeam; visitorTeam) and the inclusion assertion playedMatch v
Match. The algorithm PerfectRef6=(T +; '( 1)) returns, among others, the query:
9 x; x0; y; z. playedMatch(x) ^ homeTeam(x; y) ^ visitorTeam(x; z) ^ playedMatch(x0)^
homeTeam(x0; y) ^ visitorTeam(x0; z) ^ x 6= x0
Notice that the query '( 1) cannot be rewritten by unifying Match(x) and Match(x0)
because of the inequality x 6= x0. Such an inequality actually causes that in this example
the algorithm can never rewrite queries through unification.</p>
      <p>Let O = hT ; Ai be a DL-LiteA;id;den ontology. Since O is satisfiable iff there are
no T -inconsistent sets in A, the following theorem states that the query produced by
the algorithm unsatQueries(T ) can be used to check satisfiability of O.
Theorem 2. Let O = hT ; Ai be a DL-LiteA;id;den ontology. A T -inconsistent set V
A exists iff h;; Ai j= unsatQueries(T ).</p>
      <p>If h;; Ai j= unsatQueries(T ), then there exists q 2 unsatQueries(T ) such that
h;; Ai j= q. This implies that there exists a substitution from the variables in q to
constants of C such that (q) projected over its positive atoms is contained in A.
Similar to BCQs, we call this set the image of q in A, denoted A(q). It is possible
to show that A(q) is a T -inconsistent set contained in A (in fact, this is part of the
proof of Theorem 2). In order to exploit Theorem 1 towards the definition of a
FOLrewriting procedure, we need however to identify those assertions in A that participate
to a minimal T -inconsistent set. From the definition of minimal T -inconsistent set, and
from Theorem 2 it follows that A(q) is a minimal T -inconsistent set iff for every
V A(q) and every query q0 2 unsatQueries(T ), we have that h;; V i 6j= q0.</p>
      <p>Based on the above observations, we introduce the algorithm
minUnsatQueries(T ) which, starting from the set unsatQueries(T ), computes
a new set Qmin of queries, which enjoys the following properties: (i) For each query
q 2 Qmin, h;; Ai j= q iff there exists in unsatQueries(T ) a query q0 such that
h;; Ai j= q0. This guarantees that Theorem 2 also holds with minUnsatQueries(T )
in place of unsatQueries(T ). (ii) For each query q 2 Qmin, if h;; Ai j= q, then for
every image A(q) of q in A, h;; V i 6j= q0, where V A(q) and q0 2 Qmin. This
guarantees that if a query q 2 Qmin is such that h;; Ai j= q, then every image of q in
A is a minimal T -inconsistent set.</p>
      <p>Before presenting the algorithm minUnsatQueries(T ), we need to introduce some
preliminary notions. Given a query q, we say that a term t occurs in an object position
of q if q contains an atom of the form A(t), P (t; t0), P (t0; t), U (t; t0), whereas we say
that t occurs in a value position of q if q contains an atom of the form T (t), U (t0; t),
where A, P , U , and T have the usual meaning. Given two terms t1 and t2 occurring in
a query q, we say that t1 and t2 are compatible in q if either both t1 and t2 appear only
in object positions of q or both t1 and t2 appear only in value positions of q. Moreover,
let q and q0 be two boolean queries. We say that q is a proper syntactical subset of q0,
written q Rn q0 if there exists a renaming function Rn(q; q0) of the variables in q to the
variables in q0, such that every atom S(~t) occurring in Rn(q; q0) occurs also in q0 and an
analogous renaming from q0 to q does not exists. The algorithm minUnsatQueries(T )
is given below.
The algorithm proceeds as follows.</p>
      <p>Step 1 (line 1): the algorithm initializes Q0 to the set unsatQueries(T ).
Step 2 (line 2): the algorithm computes the set Q00 through the algorithm saturate.
Starting from each query q 2 Q0, such an algorithm first unifies pairs of compatible
terms in q in all passible ways; then, for any query q0 computed in this way, for each
pair of terms t1 and t2 occurring in q0 that are syntactically different, it adds the
inequality atom t1 6= t2 to q0; finally it returns the union of Q0 with this new set of
queries. Notice that such an operation is sound and complete with respect to the set of
queries in Q0: in particular, no answer to the set of queries is lost by this
transformation, since, for every pair of compatible terms t1; t2 which are forced to be not equal
in q, there is a query q0 in Q0 where the same terms have been unified. For instance,
assume that Q0 contains only the query q : 9x; y.Match(x) ^ homeTeam(x; y) ^
visitorTeam(x; y). Then, the algorithm saturate returns a set constituted by the
queries q1 : 9x; y.Match(x) ^ homeTeam(x; y) ^ visitorTeam(x; y) ^ x 6= y and
q2 : 9x.Match(x) ^ homeTeam(x; x) ^ visitorTeam(x; x).</p>
      <p>Step 3 (lines 3-12): let fT1 : : : Tng be the set of value-domains. The algorithm
computes the set Q000 by producing from each query q 2 Q00 the following queries: for each
atom U (x; y), where U is an attribute name, if no atoms T (y) appears in q, where T
is a value-domain, then the algorithm builds 2n new queries by substituting the atom
U (x; y) with either the conjunction of atoms U (x; y) ^ Ti(y) or U (x; y) ^ :Ti(y), for
each 1 i n. In this way, every possible combination is computed. It is easy to
verify that such a transformation is also sound and complete. Step 3 is motivated by the
need to provide a complete comparison in Step 5, as explained below.
Step 4 (lines 13-15): the algorithm removes from Q000 every query q in which a term t
occurs in two atoms of the form T1(t) and T2(t) (T1 and T2 are disjoint, and therefore
for each constant c in V , T1(c) ^ T2(c) is a contradiction).</p>
      <p>Step 5 (lines 16-17): the algorithm removes from the set Q000 each query q0 such that
there is in Q000 a different query q whose atoms form, up to renaming of the variables in
q, a proper subset of the atoms appearing in q0. This simplified form of query
containment guarantees that for every ABox A and every query q in Q000, there does not exist a
query q0 in Q000, such that for every image (q) of q in A, h;; V i j= q0 where V (q).
Step 6 (line 18): the algorithm terminates by returning the set Q000.</p>
      <p>Let us now continue the example given at Step 2. Assume that Q0 contains
also the query q3 : 9x.homeTeam(x; x) which is associate to the denial assertion
8x.homeTeam(x; x) ! ?. Obviously, besides query q1 and q2, Q00 contains also q3,
which is natively in a “saturated” form. Step 3 and Step 4 have no effect in this
example, since none of the queries in Q00 has atoms with attributes or value-domains
as predicate. Therefore, Q000 = Q00, and Step 5 eliminates query q2 from Q000, since
q3 Rn q2. Notice that this step is crucial to ensure that every image of a query
in Q000 in any ABox A is a minimal T -inconsistent set. Indeed, let us, for instance,
pose A = fMatch(a); homeTeam(a; a); visitorTeam(a; a)g; q2 has an image in A,
which is A itself, which is also a T -inconsistent set, but not minimal: indeed, the
set fhomeTeam(a; a)g A is a also T -inconsistent set, since it violates the DA
8x.homeTeam(x; x) ! ?. This set is also minimal, and is indeed an image in A
of the query q3.</p>
      <p>The following lemma states that the algorithm minUnsatQueries(T ) can be used
to check if a DL-LiteA;id;den ontology O = hT ; Ai is consistent.</p>
      <p>Lemma 1. Let O = hT ; Ai be a possibly inconsistent DL-LiteA;id;den ontology. Then,
h;; Ai j= minUnsatQueries(T ) iff h;; Ai j= unsatQueries(T ).</p>
      <p>The following crucial lemma guarantees instead that one can use the queries
produced by the algorithm minUnsatQueries(T ) in order to compute every minimal T
inconsistent set in A.</p>
      <p>Lemma 2. Let O = hT ; Ai be a possibly inconsistent DL-LiteA;id;den ontology, and
let q be a query in minUnsatQueries(T ). If h;; Ai j= q, then every image of q in A is
a minimal T -inconsistent set.</p>
      <p>Let , be two atoms. We say that is compatible with if there exists a mapping
of the variables occurring in to the terms occurring in such that ( ) =
(and in this case we denote the above mapping with the symbol = ). Given an
atom and a query q, we denote by CompSet ( ; q) the set of atoms of q which are
compatible with . Then, let T be a DL-LiteA;id;den TBox and let be an atom, we
define MinIncSet T ( ) as follows.</p>
      <p>MinIncSet T ( ) =
_
0
= (q)A
q2minUnsatQueries(T )^CompSet( ;q)6=;</p>
      <p>Theorem 3. Let hT ; Ai be a possibly inconsistent DL-LiteA;id;den ontology, and let
be an ABox assertion. There exists a minimal T -inconsistent set V in A such that 2 V
iff h;; Ai j= MinIncSet T ( ).</p>
      <p>Let T be a DL-LiteA;id;den TBox and let q be a BCQ of the form</p>
      <p>n
9z1; : : : ; zk: ^ Ai(ti1) ^
i=1
m
^ Pi(ti2; ti3) ^
i=1
`
^ Ui(ti4; ti5)
i=1
(2)
where all symbols have the usual meaning. We denote by IncRewr IAR(q; T ) the
following FOL-sentence:</p>
      <p>n
IncRewr IAR(q; T ) = 9z1; : : : ; zk: ^ Ai(ti1) ^ :MinIncSet T (Ai(ti1))^
i=1
m
^ Pi(ti2; ti3) ^ :MinIncSet T (Pi(ti2; ti3)) ^
i=1
`
^ Ui(ti4; ti5) ^ :MinIncSet T (Ui(ti4; ti5))
i=1
Let Q be a set of CQs, we define IncRewrUCQ IAR(Q; T ) =
Wqi2Q IncRewr IAR(qi; T ). We are then able to give our final results on
reformulation of UCQs under the IAR-semantics.</p>
      <p>
        Theorem 4. Let T be a DL-LiteA;id;den TBox and let Q be a UCQ. For every ABox A,
hT ; Ai j=IAR Q iff h;; Ai j= IncRewrUCQ IAR(saturate(PerfectRef(T +; Q)); T ).
In the above theorem, PerfectRef coincides with the algorithm of [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This algorithm
takes as input the set of positive inclusions T + and the query Q, and computes the
perfect reformulation under FOL-semantics of Q w.r.t. T +. PerfectRef(T +; Q)
returns a set of CQs specified over T . Through this reformulation we first preprocess
each query according to “positive” knowledge of the TBox, and then manage it to deal
with possible inconsistency. Then, the algorithm saturate previously described is
applied to the query thus obtained: this step is necessary for technical reasons (roughly
speaking, it is needed in order to exactly identify, for every query atom , the queries
from minUnsatQueries(T ) which correspond to inconsistent sets to which an image
of might belong). This query is finally reformulated according to the definition of
IncRewrUCQ IAR.
      </p>
      <p>The following complexity result is a direct consequence of Theorem 4, since
establishing whether h;; Ai j= IncRewrUCQ IAR(saturate(PerfectRef(T +; Q)); T )
simply amounts to evaluating a FOL-query over the ABox A, which is in AC 0 in data
complexity.</p>
      <p>Corollary 1. Let O be a DL-LiteA;id;den ontology and let Q be a UCQ. Deciding
whether O j=IAR Q is in AC 0 in data complexity.</p>
      <p>We finally notice that the above results directly imply that query answering of UCQs
in DL-LiteA;id;den under standard semantics is FOL-rewritable, and therefore in AC 0 in
data complexity, i.e., it has the same computational behavior of all DLs of the DL-Lite
family.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Motivated by the requirements arising in applications of ontology-based data access,
we have presented a new algorithm for inconsistency-tolerant query answering in a DL
obtained by extending DL-LiteA with identification and denial assertions. The algorithm
is based on a rewriting technique, and shows that query answering under the considered
inconsistency-tolerant semantics in DL-Lite remains first-order rewritable, even when
identification and denial assertions are added to the TBox. We will soon experiment our
new technique in the context of several ongoing OBDA projects. Our main goal is to
devise optimization techniques that allow for simplifying the rewritten query as much
as possible, so that the performance of the query answering process remains acceptable
even under the inconsistency-tolerant semantics.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>First-order expressibility results for queries over inconsistent DL-Lite knowledge bases</article-title>
          .
          <source>In: Proc. of DL</source>
          <year>2011</year>
          .
          <article-title>CEUR, ceur-ws.org</article-title>
          , vol.
          <volume>745</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Cal`ı,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Gottlob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Lukasiewicz</surname>
          </string-name>
          ,
          <string-name>
            <surname>T.</surname>
          </string-name>
          :
          <article-title>A general Datalog-based framework for tractable query answering over ontologies</article-title>
          .
          <source>In: Proc. of PODS 2009</source>
          . pp.
          <fpage>77</fpage>
          -
          <lpage>86</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Tractable reasoning and efficient query answering in description logics: The DL-Lite family</article-title>
          .
          <source>J. of Automated Reasoning</source>
          <volume>39</volume>
          (
          <issue>3</issue>
          ),
          <fpage>385</fpage>
          -
          <lpage>429</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
          </string-name>
          , R.:
          <article-title>Path-based identification constraints in description logics</article-title>
          .
          <source>In: Proc. of KR 2008</source>
          . pp.
          <fpage>231</fpage>
          -
          <lpage>241</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Chomicki</surname>
          </string-name>
          , J.:
          <article-title>Consistent query answering: Five easy pieces</article-title>
          .
          <source>In: Proc. of ICDT 2007. LNCS</source>
          , vol.
          <volume>4353</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . Springer (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. Ga¨rdenfors,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Rott</surname>
          </string-name>
          , H.:
          <article-title>Belief revision</article-title>
          .
          <source>In: Handbook of Logic in Artificial Intelligence and Logic Programming</source>
          , vol.
          <volume>4</volume>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>132</lpage>
          . Oxford University Press (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Kontchakov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zakharyaschev</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The combined approach to query answering in DL-Lite</article-title>
          .
          <source>In: Proc. of KR 2010</source>
          . pp.
          <fpage>247</fpage>
          -
          <lpage>257</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruzzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savo</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          :
          <article-title>Inconsistency-tolerant semantics for description logics</article-title>
          .
          <source>In: Proc. of RR</source>
          <year>2010</year>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruzzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savo</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          :
          <article-title>Query rewriting for inconsistent DL-Lite ontologies</article-title>
          .
          <source>In: Proc. of RR</source>
          <year>2011</year>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Poggi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
          </string-name>
          , R.:
          <article-title>Linking data to ontologies</article-title>
          .
          <source>J. on Data Semantics X</source>
          ,
          <fpage>133</fpage>
          -
          <lpage>173</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Savo</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poggi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>Rodr´ıguez-</article-title>
          <string-name>
            <surname>Muro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Romagnoli</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruzzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stella</surname>
          </string-name>
          , G.:
          <article-title>MASTRO at work: Experiences on ontology-based data access</article-title>
          .
          <source>In: Proc. of DL</source>
          <year>2010</year>
          .
          <article-title>CEUR, ceur-ws.org</article-title>
          , vol.
          <volume>573</volume>
          , pp.
          <fpage>20</fpage>
          -
          <lpage>31</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>