<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Learning Description Logic Concepts: When can Positive and Negative Examples be Separated? (Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maurice Funk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean Christoph Jung</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carsten Lutz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hadrien Pulcini</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frank Wolter</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bremen</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Liverpool</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>An important challenge for adopting ontologies in practical applications is the knowledge acquisition bottleneck, that is, the signi cant time and e ort it takes to build the required ontologies. As a promising approach to help overcoming this di culty, the varied eld of ontology learning has received a lot of attention in the last two decades, see [15] for a recent overview. A prominent line of research within ontology learning is concerned with learning description logic (DL) concepts from positive and negative examples, given an already available DL ontology that contains background knowledge [12, 14, 16, 20, 7]. Applications include the support of ontology development and the construction of concept based classi ers [4, 18]. The precise formulation is as follows. Given a knowledge base (KB) K = (T ; A) and designated sets of individuals P and N from A that serve as positive and negative examples, nd a concept C formulated in a DL LS that separates the positive from the negative examples, that is, K j= C(a) for all a 2 P and K 6j= C(a) for all a 2 N . In addition to separation, one also wants to achieve that the learned concept C generalizes the positive examples in a meaningful way, classifying new examples accordingly. As a prominent system for DL concept learning, we mention DL Learner. It encompasses several learning algorithms that support a range of DLs, including expressive ones such as ALC and ALCQ, Horn DLs such as EL, and even full OWL 2 [5, 4]. Like competing systems such as DL-Foil, YinYang, and pFOIL-DL [7, 10, 19], DL Learner uses carefully crafted re nement operators [1, 13, 14] along with various heuristics to learn concepts that provide an as good as possible generalization of the examples, avoiding over tting. If possible, renement operators are designed so that the resulting algorithm terminates on any input and is complete in the sense that whenever there is a concept that separates the positive and negative examples in the input, then such a concept is indeed learned. In the paper reported about in this abstract [8], we investigate the fundamental question of when a separating concept exists for a learning instance (K; P; N ), considering the most popular choices of DLs for the TBox language LT and the separation language LS . Our main contributions are model-theoretic characterizations that give important insight into when this is the case and a precise analysis of the computational complexity of separability viewed as a decision problem, which we refer to as (LT ; LS ) concept separability and as L concept separability when LT = LS = L. We also consider concept de nability, the spe-</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>cial case of concept separability in which P [ N comprises all individuals from A.
All our complexity results actually hold for both separability and de nability.</p>
      <p>
        We believe that these problems are relevant both from a practical and from a
theoretical perspective. In fact, complexity lower bounds for concept separability
point to an inherent complexity that no practical system that aims for
completeness can avoid. Undecidability results even mean that there can be no practical
learning system that is both terminating and complete. From the viewpoint of
machine learning theory, concept separability corresponds to the existence of
a consistent hypothesis, the most fundamental problem for exploring the
version space [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The associated decision problem is often called the consistency
problem, and it is known to be closely related to PAC learnability [
        <xref ref-type="bibr" rid="ref11 ref17">17, 11</xref>
        ].
      </p>
      <p>We cover the expressive DLs ALC, ALCI, ALCQ, and ALCQI as well as
the Horn DLs EL and ELI. For the former, over tting is a risk because the
disjunction operator available in such DLs enables the construction of separating
concepts that do not provide the desired generalization of the positive examples.
Nevertheless, most practical systems such as DL Learner work with expressive
DLs and avoid over tting by using appropriate re nement operators. Horn DLs
do not admit disjunction and therefore are not prone to over tting. On the other
hand, they provide less separating power and, as we show, tend to incur higher
computational (worst-case) cost for learning.</p>
      <p>
        For expressive DLs, we start with initial characterizations in terms of (a form
of) bisimulations and then proceed to more re ned characterizations based on
homomorphisms. Interestingly and unexpectedly to us, these establish a tight link
between concept separability and the evaluation of ontology-mediated queries
(OMQs) based on unions of `rooted' conjunctive queries [
        <xref ref-type="bibr" rid="ref2 ref6">6, 2</xref>
        ]. We use this link
to obtain complexity upper and lower bounds. In fact, L concept separability is
NExpTime-complete for L 2 fALC; ALCI; ALCQg while ALCQI concept
separability is only ExpTime-complete. This refers to combined complexity where
all components of the learning instance are part of the input. We also study
data complexity where the ABox is the only input while the TBox is xed. In all
expressive DLs above, concept separability is 2p-complete in data complexity.
      </p>
      <p>For Horn DLs, we establish characterizations based on products of universal
models and simulations. Based on these, we show that (LT ; EL) concept
separability is ExpTime-complete for LT 2 fEL; ELIg, both in combined complexity
and in data complexity. We nd the high data complexity of this problem rather
remarkable. We also prove that ELI concept separability is undecidable, a result
that came as a surprise to us.</p>
      <p>
        We nally consider a mix of expressive DLs and Horn DLs, that is, (LT ; LS )
concept separability where LT is any of the expressive DLs mentioned above and
LS is EL or ELI. These problems again turn out to be undecidable, thus ruling
out terminating and complete learning systems. The proof exploits a connection
to a certain version of query based conservative extensions between ALC KBs [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>We also consider a stronger version of concept separability that is also
considered in the literature requires that K j= :C(a) for all a 2 N , rather than only
K 6j= C(a).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Badea</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , Nienhuys-Cheng, S.:
          <article-title>A re nement operator for description logics</article-title>
          .
          <source>In: Proceedings of ILP</source>
          . pp.
          <volume>40</volume>
          {
          <issue>59</issue>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>ten Cate</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Ontology-based data access: A study through disjunctive datalog, CSP, and MMSNP</article-title>
          .
          <source>ACM Trans. Database Syst</source>
          .
          <volume>39</volume>
          (
          <issue>4</issue>
          ),
          <volume>33</volume>
          :1{
          <fpage>33</fpage>
          :
          <fpage>44</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Botoeva</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontchakov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryzhikov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zakharyaschev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Query inseparability for ALC ontologies</article-title>
          .
          <source>Artif. Intell</source>
          .
          <volume>272</volume>
          ,
          <issue>1</issue>
          {
          <fpage>51</fpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Buhmann, L.,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Westphal</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Dl-learner - A framework for inductive learning on the semantic web</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>39</volume>
          ,
          <issue>15</issue>
          {
          <fpage>24</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Buhmann, L.,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Westphal</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Dl-learner structured machine learning on semantic web data</article-title>
          .
          <source>In: Proceedings of WWW</source>
          . pp.
          <volume>467</volume>
          {
          <issue>471</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Data complexity of query answering in description logics</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>195</volume>
          ,
          <fpage>335</fpage>
          {
          <fpage>360</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Fanizzi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rizzo</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>d'Amato</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Esposito</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Dlfoil: Class expression learning revisited</article-title>
          .
          <source>In: Proceedings of EKAW</source>
          . pp.
          <volume>98</volume>
          {
          <issue>113</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Funk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jung</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pulcini</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Learning description logic concepts: When can positive and negative examples be separated</article-title>
          .
          <source>In: Proceedings of IJCAI</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hirsh</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mishra</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pitt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Version spaces and the consistency problem</article-title>
          .
          <source>Articial Intelligence</source>
          <volume>156</volume>
          (
          <issue>2</issue>
          ),
          <volume>115</volume>
          {
          <fpage>138</fpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Iannone</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmisano</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fanizzi</surname>
            ,
            <given-names>N.:</given-names>
          </string-name>
          <article-title>An algorithm based on counterfactuals for concept learning in the semantic web</article-title>
          .
          <source>Appl. Intell</source>
          .
          <volume>26</volume>
          (
          <issue>2</issue>
          ),
          <volume>139</volume>
          {
          <fpage>159</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kietz</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Some lower bounds for the computational complexity of inductive logic programming</article-title>
          .
          <source>In: Proceedings of ECML</source>
          . pp.
          <volume>115</volume>
          {
          <issue>123</issue>
          (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fanizzi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , Buhmann, L.,
          <string-name>
            <surname>d'Amato</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Concept learning</article-title>
          . In: Lehmann,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Voelker</surname>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.)
          <source>Perspectives on Ontology Learning</source>
          , pp.
          <volume>71</volume>
          {
          <fpage>91</fpage>
          . AKA / IOS Press (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haase</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Ideal downward re nement in the EL description logic</article-title>
          .
          <source>In: Proceedings of ILP</source>
          . pp.
          <volume>73</volume>
          {
          <issue>87</issue>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Concept learning in description logics using re nement operators</article-title>
          .
          <source>Machine Learning</source>
          <volume>78</volume>
          (
          <issue>1-2</issue>
          ),
          <volume>203</volume>
          {
          <fpage>250</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Volker, J.:
          <source>Perspectives on Ontology Learning, Studies on the Semantic Web</source>
          , vol.
          <volume>18</volume>
          . IOS Press (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Lisi</surname>
            ,
            <given-names>F.A.</given-names>
          </string-name>
          :
          <article-title>A formal characterization of concept learning in description logics</article-title>
          .
          <source>In: Proceedings of DL Workshop</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Pitt</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valiant</surname>
            ,
            <given-names>L.G.</given-names>
          </string-name>
          :
          <article-title>Computational limitations on learning from examples</article-title>
          .
          <source>J. ACM</source>
          <volume>35</volume>
          (
          <issue>4</issue>
          ),
          <volume>965</volume>
          {
          <fpage>984</fpage>
          (
          <year>1988</year>
          ), https://doi.org/10.1145/48014.63140
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Sarker</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doran</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raymer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Explaining trained neural networks with semantic web technologies: First steps</article-title>
          .
          <source>In: Proceedings of NeSy</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Straccia</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mucci</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>pFOIL-DL: Learning (fuzzy) EL concept descriptions from crisp OWL data using a probabilistic ensemble estimation</article-title>
          .
          <source>In: Proceedings of SAC</source>
          . pp.
          <volume>345</volume>
          {
          <fpage>352</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ha</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          :
          <article-title>Bisimulation-based concept learning in description logics</article-title>
          .
          <source>Fundam. Inform</source>
          .
          <volume>133</volume>
          (
          <issue>2-3</issue>
          ),
          <volume>287</volume>
          {
          <fpage>303</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>