<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>NOMSA: Automated modularisation for abstraction modules</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Zubeida Casmod Khan</string-name>
          <email>zkhan@csir.co.za</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>C. Maria Keet</string-name>
          <email>mkeet@cs.uct.ac.za</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Council for Scienti c and Industrial Research</institution>
          ,
          <addr-line>Pretoria</addr-line>
          ,
          <country country="ZA">South Africa</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, University of Cape Town</institution>
          ,
          <country country="ZA">South Africa</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Large and complex ontologies lead to di culty in usage by humans and causes processing problems with software agents. Modularity has been proposed to address this problem. Current methods and tools can be used to create only some of the existing types of required modules. To augment options for modularisation, we present novel methods to create ve types of abstraction modules: axiom abstraction, vocabulary abstraction, high-level abstraction, weighted abstraction, and feature expressiveness. They have been implemented in the novel tool NOMSA for automated modularisation, which also o ers a GUI.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>Modularisation methods</title>
      <p>
        We focus on designing algorithms to match those modularisation options that do
not have any means of realising them, as was observed in an existing
evidencebased ontology modularisation framework [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In particular, there are ve types
of abstraction modules that are fall short with respect to algorithms and
implementations, which we summarise here and are illustrated afterward.
{ Axiom abstraction Axiom abstraction generates a module without complex
relations between classes; therefore, the technique decreases the mesh
structure of the ontology (if present) and makes it a `bare' taxonomy of classes
and unused object properties.
{ Vocabulary abstraction Applying this abstraction to an ontology generates a
module where a certain vocabulary element is removed from the ontology,
or a whole group of elements is removed (e.g., all data types, if present).
{ High-level abstraction generates a module where entities at a higher level in
the hierarchy are regarded more important than others. This introduces the
notion of desired depth to specify in the abstraction process.
{ Weighted abstraction deals with removing entities from an ontology that are
deemed less important than others by assigning weight to the classes,
properties, and individuals in an ontology. We determine importance by assessing
entities that other entities are highly dependent on. For instance, in the pizza
ontology, the class TomatoTopping is the most widely used, being referenced
61 times by other entities. Of course, what is used `often' and what is `less
important' may be relative and thus depend on the ontology. Therefore,
the weighted abstraction includes a user-de nable threshold, which may be
absolute or relative.
{ Feature expressiveness modules deal with removing some axioms of the
ontology based on the language features, e.g., cardinality constraints, disjointness,
object property features etc. By manipulating complex constructs of the
ontology language features, the feature expressiveness algorithm results in a
simpli ed model of the ontology. We have designed 7 rules for this. The
algorithm takes these 7 rules, and removes them, from the least important to the
most important. At each rule removal, a `layer' of the ontology is produced
where that ontology is represented in a language of lower expressivity than
the previous layer. Once the algorithm is complete, seven modules (layers)
are produced, each having a lower level of expressivity than the previous.
We summarise the seven rules here, with the type of axioms that are to
be removed. R1: Quali ed cardinality deals with cardinality constraints
between classes and properties. R2: Domain and range pertains to axioms that
have been speci ed using domain and range for object properties. R3: Object
property characteristics are those axioms pertaining to logical characteristics
of object properties such as symmetry and transitivity. R4: Disjointness are
those axioms pertaining to the disjointness of classes. R5: Assertions are
those axioms that are assertions between individuals and classes, or
properties. R6: Atomic equivalence and equality are those axioms that state
equivalence between entities. R7: Complex classes are those axioms that contain
intersection and union logical operators.
      </p>
      <p>Consider the axioms in a toy Burger ontology in Fig. 1 (entity declaration
axioms omitted). Running axiom abstraction on this ontology would remove
the axioms numbered 4, 10, and 24-28, for they involve object properties of
classes. For vocabulary abstraction there are several options. Let us remove all
the instances, as most ontologies focus on the TBox anyway. This would remove
the axioms numbered 30-33 and as knock-on e ect, also axiom numbers 29 and
36. The high level abstraction is, perhaps, not of much interest in this sample
ontology, for there are few hierarchies. Setting the depth at level 1 is the only
way to actually have something removed, as there are only two levels. This would
remove all the types of burgers, of buns, and of llings, i.e., the axioms numbered
2, 3, 5-13, and 15-21.</p>
      <p>To generate a weighted abstraction module, let us assume we wish to create
a module whereby we remove 25% of the entities. To achieve this, we set the
threshold value to 25%. The threshold value represents an amount of the ontology
that is to be removed. First we weigh each class in the ontology with its number
of referencing axiom. Thereafter, 25% of the classes with the lowest values are
removed (amounting to 5), as displayed in Table 1. That is, the classes that are
deemed least important are those with the lowest number of referencing axioms;
e.g., WhiteBun is only referred to in axioms 19 and 21, whereas Filling is referred
to in axioms 4, 6, 12, 16, 17, hence, WhiteBun is removed. We omit the feature
abstraction illustration due to space limitations.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Implementation of NOMSA</title>
      <p>To solve the problem of manual modularisation, we have created the tool NOMSA
to modularise ontologies, which incorporates the ve abstraction algorithms.
NOMSA allows the user to upload an ontology (including any imports), and
select an approach to modularise it. Each approach is satis ed by a novel
algorithm which correspond to the abstractions described in Sect. 2. The algorithms
are available as online supplementary material (http://www.thezfiles.co.za/
modularisation). NOMSA is a stand-alone application that can be downloaded
from the aforementioned URL as well as a screencast.</p>
      <p>
        We have evaluated the algorithms on 128 ontologies. All the generated
modules are notably di erent from the source ontologies, as is the case also for the
Burger example (see Table 2) whose metrics were generated with TOMM [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
We are looking into re ning and module quality metrics to compare modules.
      </p>
      <p>Size No. of axioms Correctness Completeness
Burger (Original) 29 58 -
Burger (WeiAbs) 15 26 True False
Burger (AxAbs) 29 44 True False
Burger (HLAbs level 3) 26 52 True True</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amato</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santo</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moscato</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Persia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Picariello</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poccia</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          :
          <article-title>Partitioning of ontologies driven by a structure-based approach</article-title>
          .
          <source>In: IEEE ICSC'15</source>
          . pp.
          <volume>320</volume>
          {
          <fpage>323</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2015</year>
          ), anaheim, CA, USA, Feb 7-
          <issue>9</issue>
          ,
          <fpage>2015</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          :
          <article-title>Swoop: A Web Ontology Editing Browser</article-title>
          .
          <source>J. of Web Semantics</source>
          <volume>4</volume>
          (
          <issue>2</issue>
          ),
          <volume>144</volume>
          {
          <fpage>153</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>Z.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keet</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          :
          <article-title>Dependencies between modularity metrics towards improved modules</article-title>
          .
          <source>In: Proc. of EKAW'16. LNAI</source>
          , vol.
          <volume>10024</volume>
          , pp.
          <volume>400</volume>
          {
          <fpage>415</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>Z.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keet</surname>
            ,
            <given-names>C.M.:</given-names>
          </string-name>
          <article-title>An empirically-based framework for ontology modularisation</article-title>
          .
          <source>Applied Ontology</source>
          <volume>10</volume>
          (
          <issue>3-4</issue>
          ),
          <volume>171</volume>
          {
          <fpage>195</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cornet</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lau</surname>
          </string-name>
          , F.,
          <string-name>
            <surname>de Keizer</surname>
          </string-name>
          , N.:
          <article-title>A survey of SNOMED CT implementations</article-title>
          .
          <source>J. of Biomedical Informatics</source>
          <volume>46</volume>
          (
          <issue>1</issue>
          ),
          <volume>87</volume>
          {
          <fpage>96</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaminski</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Module extraction in expressive ontology languages via datalog reasoning</article-title>
          .
          <source>J. of Arti cial Intelligence Research</source>
          <volume>55</volume>
          ,
          <volume>499</volume>
          {
          <fpage>564</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Rosse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mejino</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <surname>J.L.V.</surname>
          </string-name>
          :
          <article-title>A reference ontology for biomedical informatics: the foundational model of anatomy</article-title>
          .
          <source>J. of Biomedical Informatics</source>
          <volume>36</volume>
          (
          <issue>6</issue>
          ),
          <volume>478</volume>
          {
          <fpage>500</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>