<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ensuring Semantic Interoperability Based on the Merging of Ontological Models</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Plekhanov Russian University of Economics</institution>
          ,
          <addr-line>Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2087</year>
      </pub-date>
      <fpage>0000</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>The article describes an ontology merging algorithm used to ensure the semantic interoperability of information systems (IS). The algorithm is based on a set-theoretic approach to calculating measures of semantic proximity between vertices of homogeneous ontologies at the level of subject areas and the level of tasks. The measure of semantic proximity takes into account the attributes of the compared concepts and the values of these attributes, the location of the selected nodes within the corresponding ontologies, and the presence and types of links of the evaluated concepts.</p>
      </abstract>
      <kwd-group>
        <kwd>semantic interoperability</kwd>
        <kwd>ontological engineering</kwd>
        <kwd>an algorithm for integrating ontologies</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>At the first two levels of the European Interoperability Framework (EIF) stack, initial
requirements for the design of information systems are set, and organizational measures are
taken to unify the relevant regulatory documents and business processes.</p>
      <p>To support the fourth and fifth levels of the EIF stack, information systems must
include certain software tools, which have to be provided for during design and development.
These levels of interoperability are well studied, and their practical implementation does
not currently cause serious difficulties.</p>
      <p>At present, the problems of greatest scientific and practical significance are those
of ensuring the semantic interoperability of information systems (IS). One reason is
that in recent years the intelligence of IS, including devices operating on
IoT technology, has sharply increased. Information systems are being created that are
capable of replacing a person in many respects, including in the field of making
intelligent decisions. Understanding the meaning of a request (and not just its syntax)
that comes to the IS from another system makes it possible to give a more
correct answer, which, in turn, should be understood as correctly as possible by the
system that generated the request. Ensuring semantic interoperability requires
ontologies of the concepts that are used in the processes of an information system and
that describe how the system functions.</p>
      <p>
        Based on the studies carried out [
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3, 4, 5</xref>
        ], the authors formalized the requirements for
the structure of an ontology that ensures semantic interoperability: the basic concepts
that describe both the static state and the dynamic changes in the states of objects
in the subject area, the sets of attributes (properties) of concepts, the main types of
links between concepts, and the sets of attributes (properties) of links. In particular, it is
proposed to use the following types of concepts: "Object class", "Object" and "Entity".
Concepts can be linked by the following types of unidirectional or
bidirectional relationships: "Inheritance", "Association", and "Action". In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the language
OWL-DL was chosen as the optimal means for describing ontologies, and the
ORACLE-11g DBMS was chosen as the storage medium for ontologies.
Based on the results of the studies carried out [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ], the following ontology
construction algorithm was proposed to ensure the semantic interoperability of IS:
1. Identification of ontology concepts and definition of the semantics of links in
accordance with the rules [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
2. Description of the ontology in the OWL DL language using the Protégé
5.0 ontology editor (creating an OWL file).
3. Creation of structures for storing ontologies in the ORACLE 11g DBMS.
4. Population of these structures in accordance with the description from item 2 (loading the
OWL file into the ORACLE 11g DBMS).
5. Creation of additional user rules for obtaining implicit knowledge in the ORACLE
11g DBMS environment.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Ontology merging algorithm used to ensure the semantic interoperability of information systems</title>
      <p>
        To ensure the semantic interoperability of information systems, it is necessary to
compare the ontologies that underlie them and to identify their common features and
differences. This problem is solved with methods for assessing the semantic proximity
of ontology concepts. Many well-known methods for finding a measure of proximity
between ontology concepts build on Tversky's set-theoretic approach, which
compares the properties of concepts [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In works [
        <xref ref-type="bibr" rid="ref7 ref8">7-12</xref>
        ], the mutual arrangement of
vertices within the ontology is analyzed by calculating the lengths of paths between pairs of
concepts. The length of the shortest path is defined as the number of
concepts in the ontology located between the two connected nodes under consideration.
The shorter the path between two vertices,
the semantically closer the corresponding pair of concepts is considered to be [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In [13],
the measure of semantic proximity of two concepts from different ontologies is based on
the frequency of occurrence of a concept and its subclasses in each
ontology. The methods described above for calculating
proximity measures between ontology nodes are symmetric. The work [14] describes a
recursive calculation method in which the closeness of two concepts
depends on the closeness of the concepts to which they are hierarchically related.
      </p>
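      <p>As a rough illustration of the path-based measures mentioned above, the following Python sketch counts the concepts that lie between two nodes on a shortest path. The adjacency-dictionary representation, the function name concepts_between, and the sample concepts are assumptions made for this illustration; they are not taken from the cited works.</p>
      <preformat>
```python
from collections import deque


def concepts_between(graph, start, goal):
    """Count the concepts strictly between start and goal on a shortest path.

    graph maps a concept name to the names of its directly linked concepts.
    Returns None when the two concepts are not connected.
    """
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])  # (concept, edges walked from start)
    while queue:
        node, edges = queue.popleft()
        for neighbour in graph.get(node, ()):
            if neighbour == goal:
                # edges + 1 links in total, hence `edges` intermediate concepts
                return edges
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, edges + 1))
    return None


# Hypothetical toy ontology, links listed in both directions.
graph = {
    "Vehicle": ["Car", "Truck"],
    "Car": ["Vehicle", "Sedan"],
    "Truck": ["Vehicle"],
    "Sedan": ["Car"],
}
print(concepts_between(graph, "Sedan", "Truck"))
```
      </preformat>
      <p>In this toy graph the concepts "Car" and "Vehicle" lie between "Sedan" and "Truck", so a smaller count would indicate a semantically closer pair under the path-length view.</p>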
      <p>The most promising measures for calculating the semantic
proximity of ontology concepts are the so-called hybrid measures. The hybrid
measure proposed in [15] consists of three parts: taxonomic, relational, and attributive.
The difficulty of comparing different ontologies of subject areas lies in the differences in
the names of concepts and relations, as well as in the approaches to defining
concepts. When two ontologies are mapped, each concept of
one ontology is matched against similar concepts of the other ontology, taking into account the
synonymy of concepts. Works [16, 17] propose a method for calculating a measure that
takes into account the lexical proximity of concepts, properties, domains and ranges
of relations (ranges of values of the arguments of relations), and parent/child concepts.
The main disadvantage of most methods for determining semantic proximity is the
need to involve an expert to confirm that similarities and
differences between concepts have been detected correctly.</p>
      <p>
        Below we consider the problem of integrating ontologies that reflect either
different points of view on the same subject area or different points of view on the same
problem (i.e., we integrate homogeneous ontologies at the level of subject areas
and at the level of tasks). The purpose of the integration is to preserve the existing
semantic dependencies between the concepts contained in both ontologies and to define new ones.
In accordance with the results of works [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ], the following formal definitions can be
given regarding ontologies used to ensure the semantic interoperability of IS.
1) The set of concepts is defined as follows:
C = {C1, C2, C3}, (1)
where: C1 – concept of the "Object class" type;
C2 – concept of the "Object" type;
C3 – concept of the "Entity" type.
      </p>
      <p>2) The set of relationships between concepts is defined as follows:
R = {R1, R2, R3}, (2)
where: R1 – the "Inheritance" relation (the "class-subclass" relation);
R2 – the "Association" relation;
R3 – the "Action" relation.</p>
      <p>3) The ontology used to ensure the semantic interoperability of the IS can be formally
presented in the following form:
O = {Ci (Aij, Sik), Rij, Pm}, (3)
where: Ci – ontology concepts;
Aij – the j-th attribute of the i-th concept;
Sik – the k-th synonym for the i-th concept;
Rij – the relationship between concepts i and j;
Pm – inference rules.</p>
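      <p>The formal definitions (1)-(3) can be mirrored in a small data structure. The Python sketch below is only one possible reading: the class and field names (Concept, Ontology, attributes, synonyms, and so on) are hypothetical, and inference rules are kept as an opaque list because their form is not specified here.</p>
      <preformat>
```python
from dataclasses import dataclass, field
from enum import Enum


class ConceptType(Enum):          # formula (1): C1, C2, C3
    OBJECT_CLASS = "Object class"
    OBJECT = "Object"
    ENTITY = "Entity"


class RelationType(Enum):         # formula (2): R1, R2, R3
    INHERITANCE = "Inheritance"
    ASSOCIATION = "Association"
    ACTION = "Action"


@dataclass
class Concept:
    name: str
    kind: ConceptType
    attributes: dict = field(default_factory=dict)  # A_ij: attribute name to value
    synonyms: set = field(default_factory=set)      # S_ik


@dataclass
class Ontology:                   # formula (3)
    concepts: dict = field(default_factory=dict)    # concept name to Concept
    relations: dict = field(default_factory=dict)   # (source, target) to RelationType
    rules: list = field(default_factory=list)       # P_m, left opaque in this sketch

    def add_concept(self, concept):
        self.concepts[concept.name] = concept

    def relate(self, source, target, kind):
        # Both endpoints must already be concepts of this ontology.
        if source not in self.concepts or target not in self.concepts:
            raise KeyError("both concepts must exist before they are related")
        self.relations[(source, target)] = kind


# Hypothetical usage with made-up concepts.
o1 = Ontology()
o1.add_concept(Concept("Vehicle", ConceptType.OBJECT_CLASS))
o1.add_concept(Concept("Car", ConceptType.OBJECT_CLASS, {"wheels": 4}, {"Automobile"}))
o1.relate("Car", "Vehicle", RelationType.INHERITANCE)
```
      </preformat>
      <p>Storing relations under a (source, target) key keeps unidirectional links explicit; a bidirectional link would simply be stored under both orderings.</p>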
      <p>It is proposed to base the algorithm for integrating ontologies that ensure the semantic
interoperability of the IS on the calculation of the semantic proximity of the
vertices of two ontologies O1 and O2. For each concept C1i of the ontology O1, we
calculate the measures of semantic proximity with the concepts C2j of the ontology
O2.</p>
      <p>
        In the algorithm described below, the calculation of measures of semantic proximity
of ontology concepts used to ensure the semantic interoperability of IS will be based
on the set-theoretic approach [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The main idea of this approach is that, to calculate
the measures of semantic similarity, it is necessary to take into account not only the
common properties of objects, but also their differences. The proposed algorithm
calculates the measures of semantic proximity of homogeneous concepts, that is,
concepts that have the same names or names that are synonyms. The proximity measure
consists of three parts:
• Attributive measure, which is calculated based on the comparison of the attributes of
the compared concepts and the values of these attributes;
• Geometric measure, which is calculated taking into account the location of the
selected vertices within the corresponding ontologies;
• Relational measure, which is calculated on the basis of comparing the presence and
types of relationships of the evaluated concepts with other concepts of the
corresponding ontologies.
      </p>
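      <p>To make the attributive part more concrete, the sketch below applies Tversky's ratio model to attribute-value pairs. This is an illustration, not the formula used by the authors: treating each attribute together with its value as one feature, the weights alpha and beta, and the function name attributive_similarity are all assumptions.</p>
      <preformat>
```python
def attributive_similarity(attrs_a, attrs_b, alpha=0.5, beta=0.5):
    """Tversky ratio model over (attribute, value) pairs.

    similarity = common / (common + alpha * only_a + beta * only_b)
    Attribute values must be hashable. Returns a value between 0 and 1.
    """
    a_items = set(attrs_a.items())
    b_items = set(attrs_b.items())
    common = len(a_items.intersection(b_items))
    only_a = len(a_items - b_items)   # features of the first concept only
    only_b = len(b_items - a_items)   # features of the second concept only
    denominator = common + alpha * only_a + beta * only_b
    if denominator == 0:
        # Assumption: two concepts without attributes are treated as identical.
        return 1.0
    return common / denominator


# Hypothetical attribute sets of two homogeneous concepts.
car_o1 = {"wheels": 4, "fuel": "petrol"}
car_o2 = {"wheels": 4, "fuel": "diesel"}
print(attributive_similarity(car_o1, car_o2))
```
      </preformat>
      <p>With equal weights the function is symmetric; one minus the returned value can be read as a distance when thresholds are expressed as maximum permitted differences.</p>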
      <p>Let us introduce the following characteristics of measures of semantic proximity:
• Equivalence. We will assume that the vertex C1i of the O1 ontology is equivalent to
the vertex C2j of the O2 ontology if: 1) the composition of attributes and their values
coincide or differ by intervals not exceeding the minimum threshold values
(attributive measure); 2) the selected concepts are located in the ontologies in such a way
that the lengths of the minimum chains (bridges) between these concepts and two
other equivalent concepts in each ontology do not exceed the minimum allowable
threshold value, that is, in both ontologies the selected concepts are "surrounded" by
concepts that have already been evaluated as equivalent (geometric measure); 3) the evaluated
concepts are associated with other concepts by the same types of links, or the number of
differing types of links does not exceed a certain minimum threshold value (relational
measure).
• Conformity. Determined according to the rules described above. When
the minimum allowable threshold value of the corresponding measure of proximity is
exceeded, the measure is compared with the maximum allowable threshold value, which
must not be exceeded. Vertices possessing this characteristic of the
proximity measure will be called corresponding.
• Difference. Determined according to the rules described above and applies when the
maximum permissible threshold value of the corresponding proximity measure is
exceeded. Vertices possessing this characteristic of the measure of
proximity will be called different.</p>
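      <p>The three characteristics can be read as a two-threshold classification. The Python sketch below assumes that each measure is expressed as a non-negative difference (smaller means closer) and that a pair of vertices receives the weakest label among its three measures; both points, as well as the function names and sample threshold values, are assumptions of this illustration.</p>
      <preformat>
```python
LABELS = ["equivalent", "corresponding", "different"]  # from closest to farthest


def classify(difference, min_threshold, max_threshold):
    """Label one measure by comparing it with its two thresholds."""
    if min_threshold > max_threshold:
        raise ValueError("min_threshold must not exceed max_threshold")
    if difference > max_threshold:
        return "different"
    if difference > min_threshold:
        return "corresponding"
    return "equivalent"


def classify_pair(differences, thresholds):
    """Combine the attributive, geometric, and relational labels.

    differences maps a measure name to its value; thresholds maps the same
    names to (min_threshold, max_threshold). The weakest label wins.
    """
    labels = [classify(differences[name], *thresholds[name]) for name in thresholds]
    return max(labels, key=LABELS.index)


# Hypothetical threshold values and measurements.
thresholds = {
    "attributive": (0.25, 0.75),
    "geometric": (0.25, 0.75),
    "relational": (0.25, 0.75),
}
pair = {"attributive": 0.0, "geometric": 0.5, "relational": 0.25}
print(classify_pair(pair, thresholds))
```
      </preformat>
      <p>A pair is reported as equivalent only when all three measures stay within their minimum thresholds, which matches the conjunction of conditions 1) to 3) above.</p>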
      <p>To construct the ontology merging algorithm, it is also proposed to use the concept of
a bridge, that is, a chain of ontology vertices that correspond to equivalent concepts;
bridges are used in [18] to establish a mapping between two ontologies.</p>
      <p>To integrate ontologies used to ensure semantic interoperability, the following
sequence of actions is proposed.</p>
      <p>Step 1. In the ontologies O1 and O2, bridges are computed that consist of vertices
whose pairwise proximity measures are equivalent or corresponding. The lengths of the
paired bridges (the number of vertices) must coincide.</p>
      <p>Step 2. Calculate the weight of each bridge. Vertices with an
equivalent measure of proximity are assigned the maximum coefficient, equal to 1, and vertices
with a corresponding measure of proximity are assigned a coefficient in the range from 1 to 0.5,
depending on how close the estimated parameters of the
attributive, geometric, and relational measures of proximity are to the threshold values.</p>
      <p>Step 3. As the base for merging, we choose the ontology with the largest
number of vertices. Let this be the O1 ontology. We select in it all the bridges
defined in the previous steps.</p>
      <p>Step 4. For differing vertices in the O2 ontology, we find the bridges with the largest
weight and the smallest length to the vertices included in the bridges defined in Step
2.</p>
      <p>Step 5. Add the bridges found in Step 4 to O1 ontology.</p>
      <p>Steps 4 and 5 are repeated iteratively. We start looking for bridges with a length equal
to 1, sequentially increasing the length of the bridge by one vertex at each iteration.
Moreover, if the new vertex C2i is already included in the O1 ontology as a result of
performing Steps 3 and 4 (it became a new vertex C1j), then the vertices C2i and C1j
are considered equivalent.</p>
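      <p>Steps 2 and 4 leave some freedom in how coefficients and weights are computed. The sketch below assumes a linear fall from 1 to 0.5 between the minimum and maximum thresholds, takes the bridge weight as the mean of its vertex coefficients, and prefers larger weight before smaller length when choosing a bridge. These three choices and all function names are assumptions of this illustration rather than part of the algorithm as stated.</p>
      <preformat>
```python
def vertex_coefficient(difference, min_threshold, max_threshold):
    """Coefficient of one vertex pair: 1 if equivalent, 1 down to 0.5 if corresponding."""
    if difference > max_threshold:
        raise ValueError("differing vertices do not belong to a bridge")
    if min_threshold >= difference:
        return 1.0
    span = max_threshold - min_threshold  # positive here, since difference lies inside it
    return 1.0 - 0.5 * (difference - min_threshold) / span


def bridge_weight(differences, min_threshold, max_threshold):
    """Assumed aggregate: mean coefficient over the vertex pairs of a bridge."""
    if not differences:
        raise ValueError("a bridge needs at least one vertex pair")
    coefficients = [
        vertex_coefficient(d, min_threshold, max_threshold) for d in differences
    ]
    return sum(coefficients) / len(coefficients)


def best_bridge(candidates):
    """Pick from (weight, length, bridge) tuples: largest weight, then smallest length."""
    return max(candidates, key=lambda c: (c[0], -c[1]))


# Hypothetical bridge of three vertex pairs with thresholds 0.25 and 0.75.
weight = bridge_weight([0.0, 0.5, 0.75], 0.25, 0.75)
print(weight)
```
      </preformat>
      <p>Because the iteration in Steps 4 and 5 starts with bridges of length 1 and grows the length by one vertex at a time, a selection rule of this kind would in practice be applied among candidates of the current length.</p>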
      <p>
        In this case, the algorithm, due to the formalization of the ontology structure (see
formulas 1-3 above), makes it possible to avoid the semantic conflicts described in
[19], which arise during the merging of ontologies at the time of transferring vertices
connected by the types of links of the type ". In ontologies used to ensure semantic
interoperability and constructed in accordance with the rules described in [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ], the
vertices indicated in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] will be linked by links of the "Association" type. This will
allow avoiding semantic conflicts when using the ontology merging algorithm
described above.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Summary</title>
      <p>Currently, there are algorithms and their software implementations for the automatic
merging of ontologies. Information about such algorithms is given, for example, in
[20]. Each of the currently existing algorithms for automatic merging of ontologies
has, along with advantages, a number of significant disadvantages. The disadvantages
are primarily associated with attempts to create a universal algorithm for combining
ontologies that describe concepts of one subject area, but have different structures and
algorithms for their initial construction. The article presents an original algorithm for
the integration of ontologies representing structured knowledge about each of the
interacting ISs, taking into account the fact that their structures and construction
algorithms are clearly defined and unified.
</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>The authors of the article are grateful to the Russian Foundation for Basic Research
for their support in writing the article (grants No. 18-07-01053 and No. 20-07-00926).</p>
      <p>9. Wu Z., Palmer M. Proc. 32nd Annual Meeting of the Association for Comput.
Linguistics. Las Cruces, 2018. p. 133.</p>
      <p>10. Li Y., Bandar Z. A., McLean D. IEEE Trans. on Knowledge and Data Engineering.
2017. p. 871.</p>
      <p>11. Hirst G., St-Onge D. In: WordNet: An electronic lexical database. Cambridge, 2018. p. 305.</p>
      <p>12. Lukashevich N.V. et al. Ontologies for automatic text processing: a description of
concepts and lexical meanings. In: Computational linguistics and intellectual technologies:
proceedings of the international conference "Dialogue 2006". Publishing house of the Russian
State University for the Humanities, Moscow; 2016. pp. 138-142.</p>
      <p>13. Resnik P. Using information content to evaluate semantic similarity in a taxonomy. In:
Proc. 14th Int. Joint Conf. on Artificial Intelligence. Montreal, 2015. p. 448.</p>
      <p>14. Maedche A., Staab S. Proc. 13th EKAW Conf. LNAI. Berlin: Springer, 2018. p. 251.</p>
      <p>15. Maedche A., Zacharias V. Proc. 6th European PKDD Conf. LNCS V. 2431. Berlin:
Springer, 2017. p. 348.</p>
      <p>16. Rodríguez M.A. Thesis for Degree of Doctor of Philosophy. University of Maine, 2018.</p>
      <p>17. Karpenko A.P. et al. Methods for mapping ontologies. Review. Science and Education.
2009. URL: http://technomag.edu.ru/doc/115931.html (accessed 10/02/2020).</p>
      <p>18. Nguyen H.A. Thesis for the Degree Master of Science. University of Houston − Clear
Lake, 2018.</p>
      <p>19. Vostrov A, Kurochkin M. Conflict detection in the integration of expert systems on
ontological models. Scientific and technical bulletin of SPbSPU. Informatics.
Telecommunications. Management. 2018; (2).</p>
      <p>20. Swati Negi, Sanjay Kumar Malik. An Algorithm for Merging Two Ontologies: A
Case Study. International Journal of Applied Engineering Research. 2018; 13(12).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. ISO/IEC 24765 - Systems and Software Engineering - Vocabulary. URL: http://www.cse.msu.edu/~cse435/Handouts/Standards/IEEE24765.pdf, last accessed
          <year>2019</year>
          /03/30.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. EIF - European Interoperability Framework. URL: http://ec.europa.eu/idabc/en/document/2319/5938.html, last accessed
          <year>2019</year>
          /03/30.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Korneev</surname>
            <given-names>D.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gasparian</surname>
            <given-names>M.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiseleva</surname>
            <given-names>I.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikryukov</surname>
            <given-names>A.A.</given-names>
          </string-name>
          <article-title>Ontological engineering of educational programs</article-title>
          .
          <source>Revista Inclusiones</source>
          .
          <year>2020</year>
          ;
          <volume>7</volume>
          (
          <issue>S2</issue>
          -3).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Korneev</surname>
            <given-names>D.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gasparian</surname>
            <given-names>M.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikryukov</surname>
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yaroshenko</surname>
            <given-names>E.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Golkina</surname>
            <given-names>G.E.</given-names>
          </string-name>
          <article-title>The technology for semantic interoperability based on a cognitive approach</article-title>
          .
          <source>International Journal of Advanced Trends in Computer Science and Engineering</source>
          .
          <year>2020</year>
          ;
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <fpage>3637</fpage>
          -
          <lpage>3640</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Korneev</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boichenko</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kazakov</surname>
            <given-names>V.</given-names>
          </string-name>
          <article-title>Warehouse development of ontology for providing semantic interoperability</article-title>
          .
          <source>CEUR Workshop Proceedings. Selected Papers of the 22nd International Conference "Enterprise Engineering and Knowledge Management", EEKM 2019</source>
          .
          <year>2019</year>
          ;
          <volume>2413</volume>
          :
          <fpage>70</fpage>
          -
          <lpage>76</lpage>
          . URL: http://ceur-ws.org/Vol-2413/paper09.pdf
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kuznetsov</surname>
            <given-names>O.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sukhoverov</surname>
            <given-names>V.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shipilina</surname>
            <given-names>L.B.</given-names>
          </string-name>
          <article-title>Ontology as a systematization of scientific knowledge: structure, semantics, tasks</article-title>
          . URL: http://cmm.ipu.ru/proc (date of access: 02.09.
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Rada</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mili</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bicknell</surname>
            <given-names>E.</given-names>
          </string-name>
          et al.
          <source>IEEE Trans. on Systems, Man and Cybernetics</source>
          .
          <year>2018</year>
          ;
          <volume>19</volume>
          :
          <fpage>17</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Leacock</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chodorow</surname>
            <given-names>M.</given-names>
          </string-name>
          <source>WordNet: An electronic lexical database</source>
          . Cambridge,
          <year>2019</year>
          . p.
          <fpage>265</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>