<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Querying Inconsistent Description Logic Knowledge Bases under Preferred Repair Semantics (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Meghyn Bienvenu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Camille Bourgaux</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francois Goasdoue</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IRISA, Universite de Rennes 1</institution>
          ,
          <addr-line>Lannion</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LRI, CNRS &amp; Universite Paris-Sud</institution>
          ,
          <addr-line>Orsay</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Many inconsistency-tolerant semantics for DLs rely on the notion of a repair, i.e., a -maximal subset of the ABox that is consistent with the TBox. This extended abstract presents our investigation [4] of variants of two popular inconsistency-tolerant semantics obtained by replacing classical repairs by various types of preferred repair.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Recent years have seen an increasing interest in performing database-style query
answering over description logic (DL) knowledge bases (KBs). Since scalability
is crucial in data-rich applications, there has been a trend to using lightweight
DLs for which query answering is tractable w.r.t. the size of the ABox.
Particular attention has been paid to DLs of the DL-Lite family [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for which query
answering can be reduced to evaluation of standard database queries.
      </p>
      <p>
        An important issue that arises in the context of DL query answering is how to
handle the case in which the ABox is inconsistent with the TBox. Indeed, while
it may be reasonable to assume that the TBox has been properly debugged, the
ABox will typically be very large and subject to frequent modi cations, both
of which make errors likely. Since it may be too di cult or costly to identify
and x these errors, it is essential to be able to provide meaningful answers to
queries in the presence of such data inconsistencies. To address this issue,
several di erent inconsistency-tolerant semantics have been proposed for querying
inconsistent DL knowledge bases. Among them, the AR and IAR semantics [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
are the most well-known and well-studied. Both semantics are based upon the
notion of a repair, de ned as an inclusion-maximal subset of the ABox which is
consistent with the TBox. The AR semantics, which was inspired by consistent
query answering in relational databases (cf. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] for a survey), amounts to
computing those answers that hold no matter which repair is chosen, whereas the
more cautious IAR semantics queries the intersection of the repairs.
      </p>
      <p>When additional information on the reliability of ABox assertions is available,
it is natural to use this information to identify preferred repairs, and to use the
latter as the basis of inconsistency-tolerant query answering. This motivated us
to study variants of the AR and IAR semantics obtained by replacing the classical
notion of repair by one of four di erent types of preferred repairs.
Cardinalitymaximal repairs, denoted -repairs, are intended for settings in which all ABox
assertions are believed to have the same likelihood of being correct. The other
three types of preferred repairs target the scenario in which some assertions are
considered to be more reliable than others, which can be captured qualitatively
by partitioning the ABox into priority levels (and then applying either the set
inclusion ( P -repairs) or cardinality ( P -repairs) criterion to each level), or
quantitatively by assigning weights to the ABox assertions and selecting those
repairs having the greatest weight ( w-repairs).
2</p>
    </sec>
    <sec id="sec-2">
      <title>Complexity Results</title>
      <p>
        There have been several studies of the complexity of querying DL knowledge
bases under the AR and IAR semantics for the standard notion of repairs [
        <xref ref-type="bibr" rid="ref10 ref14 ref3 ref5">10,
14, 3, 5</xref>
        ]. The complexity of AR query answering with - or w-repairs was rst
studied in the database setting for denial constraints [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and more recently in
the DL setting for the highly expressive SHIQ [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. However, there had been
no systematic study of the computational properties of preferred repairs for
important lightweight DLs. We addressed this gap in the literature by studying
the complexity of answering conjunctive and atomic queries under the eight
preferred repair semantics (cf. Figure 1). We focused on the lightweight logic
DL-LiteR (which underlies the OWL 2 QL pro le [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]), though many of our
results hold also for other data-tractable ontology languages.
      </p>
      <p>Unsurprisingly, the results are mainly negative, showing that adding
preferences increases complexity. For the IAR semantics, we move from polynomial
data complexity in the case of (plain) IAR semantics to coNP-hard data
complexity (or worse) for IAR semantics based on preferred repairs. For the AR
semantics, query answering is known to be coNP-complete in data complexity
already for the standard notion of repairs, but adding preferences often results in
even higher complexities. The sole exception is P -repairs (which combine
priority levels and the set inclusion criterion), for which both AR and IAR query
answering are \only" coNP-complete in data complexity.</p>
      <p>The lower combined complexity of the IAR semantics shows that they still
retain some computational advantage over the AR semantics. Intuitively, this
is because one can precompute the intersection of repairs in an o ine phase
and then utilize standard querying algorithms at query time, whereas no such
pre-computation is possible for the AR semantics.
3</p>
    </sec>
    <sec id="sec-3">
      <title>System and Experiments</title>
      <p>
        We are aware of two systems for inconsistency-tolerant query answering over DL
KBs: the system of [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for querying SHIQ KBs under w-AR semantics, and
the QuID system [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] that handles IAR semantics (without preferences) and
DLLiteA KBs. The latter work left open whether, for the standard notion of repairs,
      </p>
      <p>P
P &amp;
w</p>
      <p>AR
coNP
2p[O(log n)]
coNP
py
2</p>
      <sec id="sec-3-1">
        <title>Data complexity IAR in P</title>
        <p>2p[O(log n)]
coNP
py
2</p>
        <p>P
P &amp;
w</p>
        <p>AR
p
2
p
2
p
2
p
2
IAR</p>
        <p>NP
2p[O(log n)]
2p[O(log n)]
py
2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Combined complexity</title>
        <p>the IAR semantics constitutes a good approximation of the AR semantics, and
whether the AR semantics could be feasibly computed in practice.</p>
        <p>
          Encouraged by the performance of modern-day SAT solvers, we proposed
a practical SAT-based approach for query answering in DL-LiteR under the
(plain) AR, P -IAR, and P -AR semantics. We showed how to encode query
answering under these three semantics in terms of propositional unsatis ability,
using a reachability analysis on the con ict graph (inspired by [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]) to reduce the
size of the encodings. We implemented the approach in our CQAPri system. The
ABox is stored in a PostgreSQL server, while the TBox is kept in-memory together
with the pre-computed set of con icts for the KB. When a query arrives, CQAPri
evaluates it over the ABox using its SQLized rewriting (obtained with the Rapid
query rewriting engine [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]), to get its possible answers and their causes. Among
the possible answers, CQAPri identi es the IAR ones, or an approximation of the
        </p>
        <p>
          P -IAR ones. For those possible answers that are not found to be IAR answers
(resp. in the approximation of the P -IAR answers), deciding whether they are
entailed under the AR semantics (resp. the P -IAR and P -AR semantics), is
done using our encodings and the SAT4J SAT solver [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>
          An experimental evaluation demonstrates the scalability of the approach in
settings we presume realistic. We considered the DL-LiteR LUBM290 TBox from
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] and the associated Extended University Data Generator (EUDG). We extended
LUBM290 with negative inclusions and introduced inconsistencies in the ABox.
Prioritizations of ABox were made to capture a variety of scenarios. The
experiments show that our system scales up to large ABoxes (up to about 2 million
assertions) for the IAR/AR and P -IAR/ P -AR semantics, when the number
of con icting assertions varies from a few percent (for all of these semantics) to
a few tens of percent (only for IAR/AR). This positive empirical result is due in
large part to the e cacy of the incomplete methods, which leave only few cases
to be handled by the SAT solver. In a similar vein, our simple approximation
of the P -IAR semantics often produced a large share of the P -IAR answers,
which themselves constituted a large portion of the P -AR answers.
Acknowledgements This work was supported by contract ANR-12-JS02-007-01.
We acknowledge Despoina Trivela for her work on an early version of the system.
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Berre</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parrain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <source>The sat4j library, release 2.2. JSAT 7</source>
          (
          <issue>2-3</issue>
          ),
          <volume>59</volume>
          {
          <fpage>64</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bertossi</surname>
            ,
            <given-names>L.E.</given-names>
          </string-name>
          :
          <article-title>Database Repairing and Consistent Query Answering</article-title>
          .
          <source>Synthesis Lectures on Data Management</source>
          , Morgan &amp; Claypool Publishers (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>On the complexity of consistent query answering in the presence of simple ontologies</article-title>
          .
          <source>In: Proc. of AAAI</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bourgaux</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goasdoue</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Querying inconsistent description logic knowledge bases under preferred repair semantics</article-title>
          .
          <source>In: Proc. of AAAI</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Tractable approximations of consistent query answering for robust ontology-based data access</article-title>
          .
          <source>In: Proc. of IJCAI</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Tractable reasoning and e cient query answering in description logics: The DL-Lite family</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          <volume>39</volume>
          (
          <issue>3</issue>
          ),
          <volume>385</volume>
          {
          <fpage>429</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chomicki</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marcinkowski</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Staworko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Computing consistent query answers using con ict hypergraphs</article-title>
          .
          <source>In: Proc. of CIKM</source>
          . pp.
          <volume>417</volume>
          {
          <issue>426</issue>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Chortaras</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trivela</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamou</surname>
          </string-name>
          , G.:
          <article-title>Optimized query rewriting for OWL 2 QL</article-title>
          . In
          <source>: Proc. of CADE</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Du</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>Y.D.</given-names>
          </string-name>
          :
          <article-title>Weight-based consistent query answering over inconsistent SHIQ knowledge bases</article-title>
          .
          <source>Knowledge and Information Systems</source>
          <volume>34</volume>
          (
          <issue>2</issue>
          ),
          <volume>335</volume>
          {
          <fpage>371</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruzzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Savo</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          :
          <article-title>Inconsistency-tolerant semantics for description logics</article-title>
          .
          <source>In: Proc. of RR</source>
          . pp.
          <volume>103</volume>
          {
          <issue>117</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Lopatenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertossi</surname>
            ,
            <given-names>L.E.</given-names>
          </string-name>
          :
          <article-title>Complexity of consistent query answering in databases under cardinality-based and incremental repair semantics</article-title>
          .
          <source>In: Proc. of ICDT</source>
          . pp.
          <volume>179</volume>
          {
          <issue>193</issue>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seylan</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>The combined approach to OBDA: Taming role hierarchies using lters</article-title>
          .
          <source>In: Proc. of ISWC</source>
          . pp.
          <volume>314</volume>
          {
          <issue>330</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fokoue</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>OWL 2 Web Ontology Language pro les</article-title>
          .
          <source>W3C Recommendation (11 December</source>
          <year>2012</year>
          ), http://www.w3.org/TR/owl2-profiles/, available at http://www.w3.org/ TR/owl2-profiles/
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Rosati</surname>
          </string-name>
          , R.:
          <article-title>On the complexity of dealing with inconsistency in description logic ontologies</article-title>
          .
          <source>In: Proc. of IJCAI</source>
          . pp.
          <volume>1057</volume>
          {
          <issue>1062</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruzzi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Graziosi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masotti</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Evaluation of techniques for inconsistency handling in OWL 2 QL ontologies</article-title>
          .
          <source>In: Proc. of ISWC</source>
          . pp.
          <volume>337</volume>
          {
          <issue>349</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>