<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SUBINTERNM: Optimizing the Matching of Networks of Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabio Santos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kate Revoredo</string-name>
          <email>kate.revoredo@wu.ac.at</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernanda Baia˜o</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Industrial Engineering, Pontifical Catholic University of Rio de Janeiro</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Northern Arizona University</institution>
          ,
          <country country="US">United States</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Vienna University of Economics and Business</institution>
          ,
          <addr-line>Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>System of systems (SoS) are interconnected systems that bring value to different domains like health, emergency and crisis management systems. The integration of these SoS creates opportunities to change, validate the information, and add more value to information systems. SoS may have ontologies in their background to support knowledge description and semantic integration. Consequently, the integration of SoSs may benefit from the integration of the network of ontologies behind. However, the task of integrating networks of ontologies, especially the ones describing real-world SoS can be infeasible due to the size of the networks. In this work, we propose an approach, SubInterNM, based on algebraic operations that reduces the number of comparisons needed to match the networks behind the SoSs. We validated our approach using networks of ontologies created from the OAEI ontologies. The SubInterNM combined with Alin and LogMap can overcome these matchers, when running alone in some cases.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        A System-of-Systems (SoS) is defined as a set of independent systems, providing
functionalities derived from the interoperability among them [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Examples of SoS
scenarios are smart cities, health, and emergency response systems, and crisis management
systems [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Current SoS are increasingly supported by networks of ontologies, which
provide a semantic backbone for modeling and reasoning over data.
      </p>
      <p>
        A network of ontologies (NO) is a set of two or more aligned ontologies. The
network can represent a set of domains of their compound ontologies. Each ontology
describes the knowledge of a domain of interest [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Many ontologies networks may be
created inside organizations as a response to the demand for a semantic interoperability
layer among their information systems (IS).
      </p>
      <p>In current scenarios demanding data integration and systems interoperability (such
as company acquisitions) within the same business domain, different SoS must be
integrated. Because of the intrinsically multiple possible relationships inside the SoS that
includes IS from distinct companies, the integration can be challenging and may be
viewed as a matching of ontologies networks, requiring mapping concepts between
both SoSs. This research addresses how to match a network of ontologies.</p>
      <p>
        This study proposes an approach to optimize internetwork matching in the context
of networks of ontologies, by systematically examining the characteristics of the NO
and avoiding computing all possible matchings between entities. More specifically, we
implement the subsumed internetwork matching (SubInterNM) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], which reduces the
number of pairs to be evaluated in the matching process. We evaluated the proposed
approach in a preliminary experiment using an OAEI dataset.
      </p>
      <p>This work addresses the following research question: ”Is it possible to align two
ontology networks without computing all the possible alignments, with viable
computational effort, time and precision, recall and f-measure?”. To answer this question, this
paper contributes by proposing an approach to match network of ontologies and its
implementation into a prototype tool, as well as an empirical study showing the viability
of the approach and its performance gains over state-of-the-art matching tools.</p>
      <p>This work is organized as follows: in section 2, we summarize background.
Evaluation results and a discussion are presented in section 3. Section 4 describes the
limitations, and finally, section 5 concludes and points to future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        Current approaches for ontology matching include pairwise matching [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and holistic
matching [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Both may be adapted to match Networks of Ontologies; however, they
perform an exhaustive checking of every single possible pair of entities for each
ontology that composes the networks. They also have limited scalability, since the required
number of steps for computing all the alignments grows exponentially to the number
and sizes of the ontologies composing each network. Indeed, both pairwise or holistic
approaches are not prepared to match Networks of Ontologies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] since they do not take
into account the structure of the networks and cannot limit the number of comparisons.
      </p>
      <p>
        The ontology matching problem is not new and has been researched for a decade.
However, to our knowledge, the matching of Network of Ontologies has not received
the same attention. Although there are few studies dealing with networks of ontologies
and, consequently, matching networks, there are still open challenges [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussion</title>
      <p>To assess SubInterNM we conducted an experiment using ontologies from the OAEI
conference domain, so as to limit the size and help checking the results manually.</p>
      <p>
        We selected Alin [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and LogMap [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] as the matchers for the experiment. Alin
obtained one of the best metrics in the OAEI initiative, and LogMap is one of the best
to handle large ontologies. The SubInterNM approach uses the definitions in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and is
available online b. For the sake of reprodutibility, the results are also available online c.
      </p>
      <p>We first selected ontologies to compose two networks of ontologies. Since LogMap
and Alin are not able to natively process networks, the baseline submitted as input to
bhttps://github.com/fabiojavamarcos/interNetworkOntologyMatching
czenodo.org (10.5281/zenodo.3977855)
these matchers consisted of the union of all ontologies from each network. The
alternative scenario to be compared consisted of executing SubInterNM and then submitting
its partial results to the LogMap and Alin matchers. Following, each possible pair of
ontologies was submitted to LogMap and Alin alone, with duplication and without
duplication (when pairs of the same ontologies were manually eliminated, i.e. Edas x
Ekaw and Ekaw x Edas; Edas x Edas). For each scenario we collected the following
metrics: processing time, average precision, average recall, and average f-measure.
– 2x2: = fsigkdd, confofg and 0 = fconference, confofg;
– 4x4: = fsigkdd, confof, ekaw, edas g and 0 = fconference, confof, ekaw, edasg;
– 5x5: = fsigkdd, confof, ekaw, edas, iasted g and 0 = fconference, confof, ekaw,
edas, iastedg;
– 5x1: = fsigkdd, confof, ekaw, edas, iastedg and 0 = fconferenceg;
– 5x2: = fsigkdd, confof, ekaw, edas, iasted g and 0 = fconference, confofg;
– 5x3: = fsigkdd, confof, ekaw, edas, iastedg and 0 = fconference, confof, ekawg;</p>
      <p>The results show higher precision, recall, and f-measure using the SubInterNM
combined with the Alin compared with the matcher alone and when combined with
LogMap (Table 1). LogMap combined with SubInterNM or alone was faster than
SubInterNm+Alin or Alin alone, even when the network size grew (Table 2).</p>
      <p>The metrics (Table 1) showed a decrease in the values of the Alin and LogMap
approach as the network grew. It can be explained by the lack of flexibility of the solutions
in understanding a reference alignment that contains concepts coming from different
ontologies. It occurs because when finishing the union operation, the resulting
temporary ontology is composed of the union of concepts from all the ontologies together.
LogMap handled better than complexity and loosed significantly less precision than
Alin. The experiments using the matcher alone started aligning the structure after the
union to standardize the input for all approaches.</p>
      <p>In Table 3, we ran the matcher in all possible combinations, Oi x Oj , (column
”All”) or without duplications (column ”Time”). We computed the sum of processing
time and the average of the quality metrics. Alin obtained higher quality metrics again,
while LogMap continued to be faster. Looking at the 5x5 and 5x1 cases, we observe
that Alin used significantly more time than when combined with the SubInterNM, but
delivered better quality metrics. LogMap outperformed all the options wrt processing
time and obtained similar metrics in the 5x5 case and better ones in the 5x1 case.</p>
      <p>Examining the column ”All” (Table 3), we observe that LogMap processing times
were comparable to SubInterNM + LogMap, but the latter approach produced a result
without duplicated alignments. Alin alone had better results than SubInterNM + Alin
but used significantly more time and delivered solutions with duplications and may cost
more O(nlogn) effort and more time. In networks with many isomorphisms, the
SubInterNM + Alin delivered more balanced results combining metrics and time processing.
On the other hand, classic pairwise approaches have better outcomes when the networks
had few isomorphisms.</p>
      <p>Finally, it is possible to avoid the Cartesian product and keep processing time and
resulting metrics depending on the characteristics of the networks being aligned.
When pruning the Network of Ontologies to reduce the posterior matcher
computations, some entities can be missing. This may impact on how the similarity algorithms
find the alignments, which may lead to different results. The use of a dataset from the
same domain is not a real scenario, as discussed in Section 1. Yet, this enabled us to
manually verify the algebraic operations and the computed metrics, since we needed
many customized reference alignments.</p>
      <p>Because of the many possibilities in the experiments, we needed to create some
new reference alignments based on the existing from OAEI. These were validated by
the research group but are not error-proof.</p>
      <p>Finally, the intrinsic characteristics of the ontologies considered in the experiment
generated sparse graphs, which may have helped the algebraic algorithms. In scenarios
with more dense ontologies (i.e., with more connections among their concepts), we
could have strongly connected graphs and, consequently, worse time processing when
using our proposed SubInterNM.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Future Work and Conclusion</title>
      <p>This paper contributes by presenting a novel concept of a Network of Ontologies Matcher
approach.The proposal was implemented in a prototype to show its feasibility and
confirm (our research question) that it is possible to align two ontology networks without
computing all the possible alignments, with viable computational effort, time and
precision, recall and f-measure. As predicted, the experiment results confirmed that
SubInterNM computed the matching among distinct networks of ontologies more efficiently
than using the traditional pairwise approach in specific cases, due to avoiding
unnecessary comparisons without losing information.</p>
      <p>
        For future work, we aim to run experiments with larger ontologies and networks,
discover the optimal point where the SubInterNM can be used and add a strategy to
retracting/forgetting axioms/entities [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] while preserving entailment.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>de Abreu Santos</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revoredo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Baia˜o,
          <string-name>
            <surname>F.A.</surname>
          </string-name>
          :
          <article-title>Paving a research roadmap on network of ontologies</article-title>
          .
          <source>In: OM@ ISWC</source>
          . pp.
          <fpage>221</fpage>
          -
          <lpage>222</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Boehm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A view of 20th and 21st century software engineering</article-title>
          .
          <source>In: Proceedings of the 28th international conference on Software engineering</source>
          . pp.
          <fpage>12</fpage>
          -
          <lpage>29</lpage>
          . ACM (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Casanova</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          , Magalha˜es, R.C.:
          <article-title>Operations over lightweight ontologies and their implementation</article-title>
          .
          <source>In: Implicit and Explicit Semantics Integration in Proof-Based Developments of Discrete Systems</source>
          , pp.
          <fpage>61</fpage>
          -
          <lpage>82</lpage>
          . Springer (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Da</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          , Baia˜o,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Euzenat</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.:</surname>
          </string-name>
          <article-title>A lin: improving interactive ontology matching by interactively revising mapping suggestions</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          <volume>35</volume>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Fitzgerald</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foster</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingram</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Larsen</surname>
            ,
            <given-names>P.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woodcock</surname>
          </string-name>
          , J.:
          <article-title>Model-based engineering for systems of systems: the compass manifesto</article-title>
          . COMPASS Interest Group,
          <source>Tech. Rep. Manifesto Version</source>
          <volume>1</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Jime</surname>
          </string-name>
          <article-title>´nez-</article-title>
          <string-name>
            <surname>Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Logmap: Logic-based and scalable ontology matching</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <fpage>273</fpage>
          -
          <lpage>288</lpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Megdiche</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teste</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trojahn</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>An extensible linear approach for holistic ontology matching</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <fpage>393</fpage>
          -
          <lpage>410</lpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revoredo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baiao</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A proposal for optimizing internetwork matching of ontologies</article-title>
          . In: Ontology Matching: OM-2018
          <source>: Proceedings of the ISWC Workshop</source>
          . p.
          <volume>71</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Ontology matching: state of the art and future challenges</article-title>
          .
          <source>IEEE Transactions on knowledge and data engineering 25(1)</source>
          ,
          <fpage>158</fpage>
          -
          <lpage>176</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Topor</surname>
            ,
            <given-names>R.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>J.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Antoniou</surname>
          </string-name>
          , G.:
          <article-title>Concept and role forgetting in ALCfnmathcal fALCgg ontologies</article-title>
          .
          <source>In: The Semantic Web - ISWC</source>
          <year>2009</year>
          , 8th International Semantic Web Conference. pp.
          <fpage>666</fpage>
          -
          <lpage>681</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>