<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Proposal for Optimizing Internetwork Matching of Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabio Santos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kate Revoredo?</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernanda Bai~ao??</string-name>
          <email>fernanda.baiaog@uniriotec.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Applied Informatics Federal University of the State of Rio de Janeiro (Unirio)</institution>
          ,
          <addr-line>Rio de Janeiro</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>A System-of-Systems (SoS) is a set of independent information systems that must communicate with each other towards providing a speci c service. Therefore, e ectively integrating these systems is demanding. Considering that each system is conceptually described by a unique ontology, the conceptual support for the whole SoS demands the alignment of all ontologies, deriving a network of ontologies. Existing ontology matching techniques may be used for the task; however, due to the recently increasing size of the ontologies and the potential number of ontologies being aligned, current approaches may su er from scalability and performance issues. In this paper, we introduce an approach to reduce the number of potential correspondences, therefore optimizing the process of creating a network of ontologies. A preliminary experiment was conducted, showing the potential of the proposed approach.</p>
      </abstract>
      <kwd-group>
        <kwd>network of ontologies</kwd>
        <kwd>network matching</kwd>
        <kwd>data integration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        A System-of-Systems (SoS) is de ned as a set of independent information systems
(IS), providing functionalities derived from the interoperability among them [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
The development and research on SoS have been gaining increasing attention
due to the relevance of several domains such as smart cities, health, emergency
response systems, and crisis management systems [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Considering that each
IS within a SoS is conceptually described by a unique ontology describing its
domain, the conceptual support for the whole SoS demands the interoperation of
its composing IS, thus requiring the alignment of all the corresponding ontologies.
Moreover, a single SoS may embrace several domains, thus requiring by itself a
network of ontologies as its conceptual support. Therefore, there is an increasing
need for aligning networks of ontologies, a problem called internetwork matching.
      </p>
      <p>
        Traditional solutions for ontology matching may be applied for solving the
internetwork matching problem, either using a pairwise or a holistic strategy
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. While the former interactively matches one pair of ontologies from di erent
? Partially funded by Unirio (PQ-UNIRIO N01/2018)
?? Partially funded by CNPq (401505/2014-6)
networks at a time, the latter considers all the network at once. In both cases,
all pairs of entities from each ontology that composes the networks are analyzed,
which poses a severe restriction in terms of scalability since the required number
of comparisons for computing the alignments grows exponentially to the number
and size of each ontology. Therefore, there is a need for optimized solutions to
this problem [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        This work proposes an optimized approach for the internetwork matching
challenge [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] that tries to reduce the number of pairs to be evaluated during
the matching process, thus avoiding unneeded computation while preserving
the alignment quality. We evaluated the proposed approach in a preliminary
experiment using an OAEI dataset.
      </p>
      <p>This work is organized as follows: Section 2 de nes the internetwork matching
problem. Our proposed approach is detailed in Section 3. Preliminary evaluation
results are in Section 4. Finally, section 5 concludes and points to future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Problem De nition</title>
      <p>
        A network of ontologies is formally de ned as =&lt; ; &gt;, where is a nite
set of ontologies and (O; O0) is a set of alignments between pairs of ontologies
belonging to [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Given a set of two or more networks of ontologies =
f 1; 2; :::; ng, the internetwork matching problem searches for a nal network
of ontologies f resulting from the alignments of the networks in . For instance,
Figure 1 depicts two networks of ontologies, each one with 3 ontologies, describing
two Systems-of-Systems. The goal is to match these two networks, nding a
unique network of ontologies.
      </p>
      <p>One of the approaches for matching networks of ontologies is pairwise. Given
a set of networks of ontologies, the pairwise internetwork matching sequentially
computes the alignment of each pair of ontologies from each pair of networks from
this set. For example, given two networks of ontologies =&lt; ; &gt; and 0 =&lt;
0; 0 &gt;, in which = fO1; O2g and 0 = fO3g, the pairwise internetwork
matching is obtained by computing (((O1 O2) [ (O1 O3)) [ (O2 O3)). That
is, the pairwise internetwork matching approach computes all matchings between
all pairs of ontologies inside each network that is being aligned.</p>
      <p>Networks frequently have isomorphisms and trivial alignments that may cause
the pairwise approach to nd the same alignments more than once, thus requiring
an additional step to merge the resulting matches at the end. In the case of
isomorphisms, identical correspondences between same entities may be generated
(for instance, in Figure 1 a matcher tool may nd A1;10 = f&lt; O1:a1; O10:a01; =&gt;
; &lt; O1:b1; O10:b01; =&gt;g). The case of a trivial alignment occurs when a group of
entities, that was previously aligned in a network, appears in another network.
For instance, a pairwise matcher that receives the networks and 0 and has
previously computed the intra-network alignments A1;2 =&lt; O1:b1; O2:d2; v&gt;
and A10;20 =&lt; O10:b01; O20:d02; v&gt; will work unnecessarily to produce A1;20 =&lt;
O1:b1; O20:d02; v&gt; and A10;2 =&lt; O10:b01; O2:d2; v&gt;, which are trivial alignments.</p>
      <p>The main weakness of pairwise approach is the number of comparisons needed
to compute all alignments and the lack of ability to handle isomorphisms and
intra-network alignments.
Given the limitations discussed in Section 2, we propose SubInterNM, a new
subsumed approach for the Internetwork matching problem. SubInterNM avoids
unnecessary computation by identifying and reusing trivial and subsumed
alignments already computed in the networks of ontologies that are being aligned. In
such cases, as the intersection among the networks becomes larger, the set of
evaluated correspondences during the alignment process tends to get smaller.</p>
      <p>For instance, consider the example depicted in Figure 1. The networks and
0 were aligned using an internetwork matcher. Since O2 and O20 are identical,
they do not need to be exhaustively compared. Also, both pairs of ontologies
O1 and O10 and O3 and O30 share some subset of entities, thus common parts
could be eliminated to compute the nal network of ontologies resulted from the
internetwork matching problem scenario.</p>
      <p>
        Casanova et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] proposed operations over lightweight ontologies. They
de ne lightweight ontologies as ontologies restricted to DL-Lite core with arbitrary
number restrictions. They use a MEG (minimal equivalent graph) approach to
create a constraint graph in polynomial time if the graph is acyclic. If the graph
is complete, then the problem is NP-Hard. To avoid that, a normalization step is
conducted to simplify the graph structure and keep them lightweight.
      </p>
      <p>
        Our proposed approach SubInterNM uses a combined set of lightweight
operations from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to verify the existence of isomorphisms. We extrapolate the
original idea to use in networks environments, instead of just single ontologies.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Preliminary Results</title>
      <p>Our proposed approach was evaluated in a preliminary experiment using
ontologies from the OAEI conference dataset. We experimented 5 distinct scenarios,
and in each of them we speci ed two networks of ontologies to be aligned. The
experiments were de ned by increasing the number of ontologies in the network,
as well as varying the ontologies and the number of common ontologies between
the networks, in order to assess how well our approach would handle the existence
of trivial and subsumed alignments:
= fconference, cmtg and 0 = fcmt, sigkddg;
= fconference, cmt, ekawg and 0 = fcmt, sigkddg;
= fconference, cmt, ekawg and 0 = fcmt, sigkdd, conferenceg;
= fconference, cmt, dblp, ekawg and 0 = fcmt, sigkdd, conferenceg;
= fconference, cmt, edas, ekawg and 0 = fcmt, sigkdd, conferenceg.</p>
      <p>
        In order to compare our proposed SubInterNM approach against the pairwise
internetwork matching approach, we implemented the pairwise approach using
the existing matching system ALIN [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] in all experiments. ALIN was selected
due to the good results achieved on OEAI 2017, and due to our access to the
code [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. To use ALIN as a blackbox (i.e., without having to change its code),
for each internetwork matching experiment, we built all the pairs of ontologies
to be aligned and interactively invoked ALIN. SubInterNM was implemented
using operations de ned by Casanova et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], initially considering only the
isomorphisms. After, the ALIN matching system[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] was also invoked to compute
the alignments between the results from and 0 . So we computed:
as O1 [ O2::: [ On O1 \ O10 O1 \ O20 ::: On \ On0
0 as O10 [ O20::: [ On0 O10 \ O1 O10 \ O2 ::: On0 \ On
      </p>
      <p>Table 1 shows the number of pairs analyzed (i.e., comparisons) by pairwise
and subsumed approach to compute alignments in each experiment. It is possible
to verify that SubInterNM reduces the number of comparisons needed to the
matching process by at least 24%, therefore representing a successful way to deal
with large networks in the internetwork matching problem.</p>
      <p>
        Future work will try to reduce even more the number of comparisons with the
detection of intra-network alignments. All the data gathered in the experiment is
available at (https://bit.ly/2M3jIYS).
This work addressed the problem of internetwork ontology matching, a natural
evolution of the classical ontology matching problem for highly interconnected
scenarios of Systems of Systems, which are of increasing popularity and relevance.
We proposed SubInterNM, an approach for internetwork ontology matching
that optimizes the required computation of correspondences by identifying and
reusing trivial and subsumed alignments. Preliminary evaluation results showed
the potential of the approach and opportunities for improvement, in scenarios
using lightweight ontologies. The computation of subsumed networks posed an
overhead computation since it is a well-known NP-Hard problem [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Therefore,
further implementations may compute trivial alignments, using it as background
knowledge for the matching process [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>In future work, we expect to improve our implementation, including parallel
programming and infrastructure. We also plan to move forward exploring trivial
alignments in newer versions of the tool.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>de Abreu Santos</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revoredo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Bai~ao,
          <string-name>
            <surname>F.A.</surname>
          </string-name>
          :
          <article-title>Paving a research roadmap on network of ontologies</article-title>
          .
          <source>In: Proceedings of the 12th International Workshop on Ontology Matching co-located with the 16th International Semantic Web Conference (ISWC</source>
          <year>2017</year>
          ), Vienna, Austria, October
          <volume>21</volume>
          ,
          <year>2017</year>
          . pp.
          <volume>221</volume>
          {
          <issue>222</issue>
          (
          <year>2017</year>
          ), http:// ceur-ws.
          <source>org/</source>
          Vol-2032/om2017_poster8.pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Boehm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A view of 20th and 21st century software engineering</article-title>
          .
          <source>In: Proceedings of the 28th international conference on Software engineering</source>
          . pp.
          <volume>12</volume>
          {
          <fpage>29</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Casanova</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          , de Mac^edo,
          <string-name>
            <given-names>J.A.</given-names>
            ,
            <surname>Sacramento</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.R.</given-names>
            ,
            <surname>Pinheiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A^ .M.</given-names>
            ,
            <surname>Vidal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.M.</given-names>
            ,
            <surname>Breitman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.K.</given-names>
            ,
            <surname>Furtado</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.L.</surname>
          </string-name>
          :
          <article-title>Operations over lightweight ontologies</article-title>
          . In:
          <article-title>OTM Confederated International Conferences" On the Move to Meaningful Internet Systems"</article-title>
          . pp.
          <volume>646</volume>
          {
          <fpage>663</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Da</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Baiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.A.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Semantic interactive ontology matching: synergistic combination of techniques to improve the set of candidate correspondences</article-title>
          .
          <source>In: OM 2017-12th ISWC workshop on ontology matching</source>
          . pp.
          <volume>13</volume>
          {
          <fpage>24</fpage>
          . No commercial editor. (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Revision in networks of ontologies</article-title>
          .
          <source>Arti cial intelligence 228</source>
          , 195{
          <fpage>216</fpage>
          (
          <year>2015</year>
          ), ftp://ftp.inrialpes.fr/pub/exmo/publications/euzenat2015a.pdf
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Fitzgerald</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foster</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingram</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Larsen</surname>
            ,
            <given-names>P.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woodcock</surname>
          </string-name>
          , J.:
          <article-title>Model-based engineering for systems of systems: the compass manifesto</article-title>
          . COMPASS Interest Group,
          <source>Tech. Rep. Manifesto Version</source>
          <volume>1</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Hsu</surname>
          </string-name>
          , H.T.:
          <article-title>An algorithm for nding a minimal equivalent graph of a digraph</article-title>
          .
          <source>Journal of the ACM (JACM) 22(1)</source>
          ,
          <volume>11</volume>
          {
          <fpage>16</fpage>
          (
          <year>1975</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Megdiche</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teste</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trojahn</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>An extensible linear approach for holistic ontology matching</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>393</volume>
          {
          <fpage>410</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Euzenat</surname>
          </string-name>
          , J.:
          <article-title>Ontology matching: state of the art and future challenges</article-title>
          .
          <source>IEEE Transactions on knowledge and data engineering 25(1)</source>
          ,
          <volume>158</volume>
          {
          <fpage>176</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. da Silva,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Baiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.A.</given-names>
            ,
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Alin results for oaei 2017</article-title>
          . In: OM-2017
          <source>: Proceedings of the Twelfth International Workshop on Ontology Matching</source>
          . p.
          <volume>114</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>