<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Avoiding Subsumption Tests During Classi cation Using the Atomic Decomposition</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Haoruo Zhao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Uli Sattler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <email>bijan.parsiag@manchester.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>The University of Manchester</institution>
          ,
          <addr-line>Oxford Road, Manchester, M13 9PL</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Classifying a TBox requires many subsumption tests, for example O j=? Animal v P erson. For n the number of concept names in O, a naive algorithm involves a quadratic number of these problems, and various optimisations, called Enhanced Traversal algorithms, have been developed [1, 4]. In DLs, subsumption is decidable but possibly computationally expensive, e.g. for SROIQ(D), it is 2NExpTime [5]. In this paper, we try to modify the standard Enhanced Traversal algorithm to work over a set of modules [3, 9] to optimize this core reasoning task, classi cation. There are two potential bene ts: to shorten the subsumption test times (i.e., easi cation) and to reduce the number of subsumption tests by avoiding some tests (i.e., avoidance).</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In this paper, we explore whether modifying the core Traversal algorithm
to be module aware is worth attempting. Since, touching the internals of
complex and highly tuned systems such as modern description logic reasoners is a
labor intensive and fraught a air, and since existing Enhanced Traversal
algorithms are already highly optimised, our rst aim is to gather data about the
total amount of (further) avoidance possible for a given reasoner (Pellet [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ])
over a signi cant corpus of ontologies (BioPortal). In other words, we want to
verify whether \module aware subsumption avoidance" is meaningful. Hence we
modify Pellet to be moderately module aware: we insert, between its Enhanced
Traversal algorithm and its subsumption checker, a module aware component
that simply determines whether the ontology's Atomic Decomposition1 contains
enough information to answer the subsumption test in question. In this way, we
minimise the overhead of communicating between the core Pellet components
and our avoidance checker and gain insight into the potential avoidance gains of
module awareness for an existing Traversal algorithm.
      </p>
      <p>We nd that in BioPortal, on a signi cant number of ontologies, almost all
subsumption tests performed by Pellet can be avoided{despite the fact that
Pellet uses an Enhanced Traversal algorithm. In Figure 1, we see an overview of
our results: each point represents an ontology, the X-axis the number of
Subsumption Tests (STs) carried out by Pellet and the Y-axis the percentage of STs
avoided.</p>
      <p>More precisely, we nd that for a third of the ontologies, more than 90% of
subsumption tests performed by Pellet during classi cation can be avoided by
such a module aware system. For more than half of the ontologies, more than
70% can be avoided.</p>
      <p>
        These results suggest that designing and implementing a full-blown,
module aware traversal algorithm is likely to be net bene cial. We are currently
investigating module-based avoidance into Traversal algorithm.
1 A modular partition of an ontology computable in polynomial time [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] representing
all locality-based modules.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hollunder</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nebel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pro</surname>
            <given-names>tlich</given-names>
          </string-name>
          , H.J.,
          <string-name>
            <surname>Franconi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>An empirical analysis of optimization techniques for terminological representation systems</article-title>
          .
          <source>In: Proc. of the 3rd Int. Conf. on the Principles of Knowledge Representation and Reasoning (KR92)</source>
          . pp.
          <volume>270</volume>
          {
          <issue>281</issue>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Del</given-names>
            <surname>Vescovo</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>The modular structure of an ontology: Atomic decomposition towards applications</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Del</given-names>
            <surname>Vescovo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Gessler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.D.</given-names>
            ,
            <surname>Klinov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Parsia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Sattler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            ,
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Winget</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Decomposition and modular structure of bioportal ontologies</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>130</volume>
          {
          <fpage>145</fpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shearer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoilos</surname>
          </string-name>
          , G.:
          <article-title>A novel approach to ontology classi cation</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>14</volume>
          ,
          <issue>84</issue>
          {
          <fpage>101</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>RIQ and SROIQ are harder than SHOIQ</article-title>
          .
          <source>International JointConference on Automated Reasoning</source>
          pp.
          <volume>292</volume>
          {
          <issue>297</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Matentzoglu</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>OWL reasoning: Subsumption test hardness and modularity</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          <volume>60</volume>
          (
          <issue>4</issue>
          ),
          <volume>385</volume>
          {
          <fpage>419</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Matentzoglu</surname>
            ,
            <given-names>N.A.</given-names>
          </string-name>
          :
          <article-title>Module-based classi cation of OWL ontologies</article-title>
          . The University of Manchester (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>More: Modular combination of OWL reasoners for ontology classi cation</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>1</volume>
          {
          <fpage>16</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaminski</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Module extraction in expressive ontology languages via datalog reasoning</article-title>
          .
          <source>J. Artif. Intell. Res</source>
          .
          <volume>55</volume>
          ,
          <issue>499</issue>
          {
          <fpage>564</fpage>
          (
          <year>2016</year>
          ). https://doi.org/10.1613/jair.4898, https://doi.org/10.1613/jair.4898
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Pellet: A practical OWLDL reasoner</article-title>
          .
          <source>Web Semantics: science, services and agents on the World Wide Web</source>
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <volume>51</volume>
          {
          <fpage>53</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Tsarkov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palmisano</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Chainsaw: a Meta Reasoner for large ontologies</article-title>
          .
          <source>In: OWL Reasoner Evaluation Workshop (ORE</source>
          <year>2012</year>
          ).
          <source>Citeseer</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>