<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Closed Predicates in Description Logics: Results on Combined Complexity (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nhung Ngo</string-name>
          <email>ngo@inf.unibz.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Magdalena Ortiz</string-name>
          <email>ortiz@kr.tuwien.ac.at</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mantas Sˇimkus</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Free University of Bozen-Bolzano</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Vienna University of Technology</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Description Logics (DLs) are a family of languages for knowledge representation,
popular for writing ontologies that describe domain-specific knowledge. As fragments of
classical first-order logic, DLs make the open-world assumption. That is, the semantics
of a knowledge base (KB) is given by the set of all its models. This view naturally
supports modeling of incomplete knowledge. However, it is not always fully adequate, as in
many applications of DLs, incomplete knowledge interacts with complete information.
For example, in the popular paradigm of ontology-based data access, DL ontologies
are advocated as a means to leverage domain knowledge when querying data sources.
These sources often stem from traditional relational databases, which have a
closedworld view and are assumed to fully describe the instances included in a relation. E.g.,
a municipality-provided table of bus routes can be assumed complete. Query answering
algorithms should exploit this to exclude irrelevant models and infer more answers.</p>
      <p>
        Combining open and closed world reasoning is not a new topic in DLs [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], but it has
received renewed attention in recent years [
        <xref ref-type="bibr" rid="ref12 ref13 ref18 ref6 ref7">13, 12, 7, 6, 18</xref>
        ]. A prominent proposal for
achieving partial closed world reasoning is to enrich KBs with a set of predicates that
are to be interpreted as closed [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. In this way, some relations are interpreted under the
closed semantics, while others are considered open.
      </p>
      <p>
        Most work considering closed predicates has focused on the problem of evaluating
a query (usually a conjunctive query) under the standard certain answer semantics, and
on data complexity, i.e. the complexity measured in the size of the data instance to be
queried, while both the ontology and the query are assumed to be fixed. E.g., Franconi
et at. ([
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) show that closed predicates lead to the loss of tractability of query answering
even in the core fragments of DL-Lite, and provided the first matching upper bounds.
An in-depth analysis of the reasons for intractability, and a fine-grained analysis of
tractable classes can be found in [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ].
      </p>
      <p>
        The data complexity is an important indicator of whether algorithms can scale to
large amounts of data. However, it is also fundamental to understand the combined
complexity, which takes the size of the ontology and the query along with the data into
account, as the size of these two components often has a big impact on the performance
of query answering algorithms. Indeed, ontologies can be very large [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], and
disregarding their size easily leads to infeasible algorithms. The query size can also have a
major impact on the query answering techniques, and avoiding the exponential blow-up
in the size of the query was in fact one of the hardest obstacles to overcome for
achieving scalable query answering even for DLs of very low data complexity like dialects of
DL-Lite [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. It is thus surprising that the combined complexity of query answering in the
presence of closed predicates has received so little attention. In this extended abstract,
we summarize several results on the combined complexity of reasoning in the presence
of closed predicates in a wide range of DLs, which are presented in detail in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>DLs with closed predicates and query answering</title>
      <p>We use NC, NR and NI for concept names, role names and individual names,
respectively. DL knowledge bases (KB) take the form K = (T ; ; A). The TBox T is a set of
axioms , which usually include concept inclusions C v D and role inclusions r v s.
The specific syntax of concepts C; D, roles r; s and other possible axioms depends on
the DL in question. is a set of concept and role names, called the closed predicates
of K. Finally, the ABox A is a finite set of assertions of the forms A(b) and r(a; b) with
A 2 NC, r 2 NR and a; b 2 NI.</p>
      <p>The notion of an interpretation I = ( I ; I ) is as usual. We make the standard
name assumption (SNA), i.e., aI = a for all I and individuals a. Concepts C are
interpreted as sets of objects CI I and roles r as binary relations rI I I
in the standard way. The notions of TBox and ABox satisfaction I j= T and I j= A
are also as usual. We say that I respects a given NC [ NR if:
(a) for all A 2 \ NC, if d 2 AI , then A(d) 2 A, and
(b) for all r 2 \ NR, if (d; d0) 2 rI , then r(d; d0) 2 A.</p>
      <p>We call I a model of a KB K = (T ; ; A), and write I j= K, if I j= T , I j= A, and I
respects the closed predicates in .</p>
      <p>The main problem we study here is query answering, which is a generalization of
standard reasoning tasks such as consistency testing and instance checking. Let NV be a
countably infinite set of variables. Boolean conjunctive queries (BCQs), take the form
q = 1 ^ : : : ^ n, where each j is an atom of the form A(t) or r(t; t0) with A 2 NC,
r 2 NR, and ft; t0g NI [ NV. Queries are given the standard first order semantics,
where all variables are existentially quantified: we write I j= q0 if there is an assignment
from the variables in q to objects in I that makes all atoms true. A Boolean union of
conjunctive queries (BUCQ) is of the form q0 = q1 _ : : : _ qn, where each qi is a BCQ.
Given an interpretation I, we write I j= q0 if there is i 2 f1; : : : ; ng such that I j= qi,
and given a KB K we write K j= q if I j= q holds for every model I of K. Given K and
a B(U)CQ q as above, the query entailment problem is to decide whether K j= q.All
results below apply also to query answering (as a decision problem) for queries with
answer variables using the standard reduction to query entailment.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>In the presence of complete data, already consistency checking and instance query
entailment in the basic DL-Litecore are NP-complete. For the hardness, it is not hard to
reduce 3-colorability to deciding consistency of a DL-Litecore KB with closed concepts.</p>
      <p>Without closed predicates</p>
      <p>With closed predicates
Instance queries B(U)CQs</p>
      <p>Instance queries</p>
      <p>B(U)CQs
DL-Litecore
DL-LiteR
EL
ALCO
SHOQ,
SHOI</p>
      <p>
        NL
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
NL
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>For the membership, we provide an algorithm that shows an NP upper bound even for
rich DL-Lite dialects with Boolean combinations of concepts, role inclusions, and
nominals, and for UCQs without existentially quantified variables.</p>
      <p>In E L we can use DBoxes to express disjunction, atomic negation, and nominals,
making E L as expressive as ALCO, and causing consistency testing to require
exponential time in the worst case. Tight upper bounds for instance queries and quantifier
free UCQs for any DL containing E L can then be inferred by a reduction to standard
reasoning in well-known expressive DLs with nominals.</p>
      <p>
        For more expensive queries, we show that entailment of UCQs is 2EXPTIME-hard
for DL-LiteR and E L based on a reduction from the word problem for Alternating
Turing machines (ATMs) with exponential work space [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Intuitively, using closed
concepts can use a simple KB to enforce candidate computations of a given ATM, and
then UCQs can test whether the represented computation is correct and accepting. The
result is in sharp contrast to the NP upper bound in the setting with no DBox predicate,
and coincides with known upper bounds for much richer DLs. A minor variation of the
reduction allows shows coNEXPTIME-hardness for DL-Litecore, but we still don’t know
if this bound is tight.
      </p>
      <p>A similar reduction allows us to prove 2EXPTIME-hardness of CQ entailment in
the standard DL ALCO (with no DBox). This closes an open problem and singles out
nominals as a previously unidentified source of complexity when answering queries
over expressive DLs. A summary of our result and its comparison to the case without
DBox is provided in Table 1.</p>
      <p>In the light of these negative results, we also looked for classes of queries for which
the query answering problem has lower complexity. In particular, we considered a
restriction on variables called K-safety which guarantees that it is sufficient to consider
assignments of the variable to individuals in the KB, and there is no need to consider
other objects in the interpretation domain . A query is called K-safe if all its variables
are K-safe. We also generalize K-safe queries to K-acyclic by allowing for variables
that are not K-safe, but requiring that they induce only acyclic subqueries in the
original query.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>Closed predicates seem to have a great potential in applications of DLs, but as our
results show, they are computationally costly. It remains a challenge to identify
useful restricted settings with better complexity, and to explore other ways for effectively
supporting partial completeness at a lower computational cost.</p>
      <p>We note that all hardness proofs in the paper are given for KBs (T ; ; A) where
contains only a constant number of concept names (which can be further reduced to
a single one in DLs that can express disjointness). Moreover, they also apply to KBs
(T ; ;; A) with no closed predicates, but where axioms A v fa; bg with the so-called
one-of or enum are allowed. Hence our results also shed light on the effect of nominal
enumerations in lightweight DLs.</p>
      <p>Some interesting questions remain open for the DL-Lite family, like the precise
complexity of (U)CQs in DL-Litecore. We remark that inverses play a very limited role
in the hardness results here. They all apply to 1-way DL-Lite KBs, where only role
names r occur in the right-hand side of axioms, and inverse roles r on the left-hand
side; these KBs have directed models in the style of E L and ALC.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work was partially supported by the Austrian Science Fund (FWF) projects T515,
P25207, and the project ORMiE from the Free University of Bozen-Bolzano.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Artale</surname>
          </string-name>
          , Diego Calvanese, Roman Kontchakov, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Zakharyaschev</surname>
          </string-name>
          .
          <article-title>The dl-lite family and relations</article-title>
          .
          <source>J. Artif. Int. Res.</source>
          ,
          <volume>36</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>69</lpage>
          ,
          <year>September 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Franz</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Brand</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Carsten</given-names>
            <surname>Lutz</surname>
          </string-name>
          .
          <article-title>Pushing the EL envelope</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2005</year>
          , pages
          <fpage>364</fpage>
          -
          <lpage>369</lpage>
          . Morgan-Kaufmann Publishers,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Marco</given-names>
            <surname>Cadoli</surname>
          </string-name>
          ,
          <string-name>
            <surname>Francesco M. Donini</surname>
            , and
            <given-names>Marco</given-names>
          </string-name>
          <string-name>
            <surname>Schaerf</surname>
          </string-name>
          .
          <article-title>Closed world reasoning in hybrid systems</article-title>
          .
          <source>In Proc. of ISMIS</source>
          <year>1990</year>
          , pages
          <fpage>474</fpage>
          -
          <lpage>481</lpage>
          . Elsevier Science Publishing,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Diego Calvanese, Thomas Eiter, and
          <string-name>
            <given-names>Magdalena</given-names>
            <surname>Ortiz</surname>
          </string-name>
          .
          <article-title>Regular path queries in expressive description logics with nominals</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2009</year>
          , pages
          <fpage>714</fpage>
          -
          <lpage>720</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ashok</surname>
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Chandra</surname>
          </string-name>
          ,
          <string-name>
            <surname>Dexter C. Kozen</surname>
            , and
            <given-names>Larry J.</given-names>
          </string-name>
          <string-name>
            <surname>Stockmeyer</surname>
          </string-name>
          . Alternation.
          <source>Journal of the ACM</source>
          ,
          <volume>28</volume>
          (
          <issue>1</issue>
          ):
          <fpage>114</fpage>
          -
          <lpage>133</lpage>
          ,
          <year>1981</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Enrico</given-names>
            <surname>Franconi</surname>
          </string-name>
          ,
          <article-title>Yazmin Ange´lica Iba´n˜ez-Garc´ıa, and Inanc¸ Seylan. Query answering with DBoxes is hard</article-title>
          .
          <source>Electr. Notes Theor. Comput. Sci.</source>
          ,
          <volume>278</volume>
          :
          <fpage>71</fpage>
          -
          <lpage>84</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Enrico</given-names>
            <surname>Franconi</surname>
          </string-name>
          , Volha Kerhet, and
          <string-name>
            <given-names>Nhung</given-names>
            <surname>Ngo</surname>
          </string-name>
          .
          <article-title>Exact query reformulation over databases with first-order and description logics ontologies</article-title>
          .
          <source>J. Artif. Intell. Res. (JAIR)</source>
          ,
          <volume>48</volume>
          :
          <fpage>885</fpage>
          -
          <lpage>922</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Birte</given-names>
            <surname>Glimm</surname>
          </string-name>
          , Ian Horrocks, and
          <string-name>
            <given-names>Ulrike</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Deciding shoqˆcap knowledge base consistency using alternating automata</article-title>
          . In Franz Baader, Carsten Lutz, and Boris Motik, editors,
          <source>Proceedings of the 21st International Workshop on Description Logics (DL2008)</source>
          , Dresden, Germany, May 13-16,
          <year>2008</year>
          , volume
          <volume>353</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Birte</given-names>
            <surname>Glimm</surname>
          </string-name>
          , Ian Horrocks, and
          <string-name>
            <given-names>Ulrike</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Unions of conjunctive queries in SHOQ</article-title>
          .
          <source>In Proc. of the 11th International Conference on the Principles of Knowledge Representation and Reasoning (KR-08)</source>
          , pages
          <fpage>252</fpage>
          -
          <lpage>262</lpage>
          . AAAI Press/The MIT Press,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>Jan</given-names>
            <surname>Hladik</surname>
          </string-name>
          .
          <article-title>A tableau system for the description logic SHIO</article-title>
          .
          <source>In IJCAR Doctoral Programme</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Markus</surname>
          </string-name>
          <article-title>Kro¨ tzsch and Sebastian Rudolph</article-title>
          .
          <article-title>Conjunctive queries for E L with composition of roles</article-title>
          . volume
          <volume>250</volume>
          <source>of CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>250</volume>
          /,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Carsten</surname>
            <given-names>Lutz</given-names>
          </string-name>
          , Inanc¸ Seylan, and
          <string-name>
            <given-names>Frank</given-names>
            <surname>Wolter</surname>
          </string-name>
          .
          <article-title>Ontology-based data access with closed predicates is inherently intractable (sometimes)</article-title>
          .
          <source>In Proc. of IJCAI 2013. IJCAI/AAAI</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Carsten</surname>
            <given-names>Lutz</given-names>
          </string-name>
          , Inanc¸ Seylan, and
          <string-name>
            <given-names>Frank</given-names>
            <surname>Wolter</surname>
          </string-name>
          .
          <article-title>Ontology-mediated queries with closed predicates</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2015</year>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Nhung</surname>
            <given-names>Ngo</given-names>
          </string-name>
          , Magdalena Ortiz, and
          <string-name>
            <given-names>Mantas</given-names>
            <surname>Simkus</surname>
          </string-name>
          .
          <article-title>Closed predicates in description logics: Results on combined complexity</article-title>
          .
          <source>In KR</source>
          . AAAI Press,
          <year>April 2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>Cathy</given-names>
            <surname>Price</surname>
          </string-name>
          and
          <string-name>
            <given-names>Kent</given-names>
            <surname>Spackman</surname>
          </string-name>
          .
          <article-title>Snomed clinical terms</article-title>
          .
          <source>British Journal of Healthcare Computing and Information Management</source>
          ,
          <volume>17</volume>
          (
          <issue>3</issue>
          ):
          <fpage>27</fpage>
          -
          <lpage>31</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>Riccardo</given-names>
            <surname>Rosati</surname>
          </string-name>
          . On conjunctive query answering in E L. volume
          <volume>250</volume>
          <source>of CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>250</volume>
          /,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>Klaus</given-names>
            <surname>Schild</surname>
          </string-name>
          .
          <article-title>A correspondence theory for terminological logics: Preliminary report</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>1991</year>
          , pages
          <fpage>466</fpage>
          -
          <lpage>471</lpage>
          . Morgan Kaufmann,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18. Inanc¸ Seylan,
          <string-name>
            <given-names>Enrico</given-names>
            <surname>Franconi</surname>
          </string-name>
          , and Jos de Bruijn.
          <article-title>Effective query rewriting with ontologies over DBoxes</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2009</year>
          , pages
          <fpage>923</fpage>
          -
          <lpage>925</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>Stephan</given-names>
            <surname>Tobies</surname>
          </string-name>
          .
          <article-title>Complexity Results and Practical Algorithms for Logics in Knowledge Representation</article-title>
          .
          <source>PhD thesis</source>
          , LuFG Theoretical Computer Science, RWTH-Aachen, Germany,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>