<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ApplyOnTop - Approach for semantic data integration based on Ontop</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mônica Rocha de Ázara</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>José Maria Parente de Oliveira</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Instituto Tecnológico de Aeronaútica (ITA)</institution>
          ,
          <addr-line>São José dos Campos, SP</addr-line>
          ,
          <country country="BR">Brasil</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>With the increase in data, knowledge management and system integration become challenging. Although the use of Virtual Knowledge Graph, via Ontop, is a promising solution, its application is complex. The article introduces ApplyOnTop, an approach for semantic integration of databases based on Ontop. This approach proposes the execution of structured activities that assist in applying Ontop. To validate the applicability of ApplyOnTop, an experiment was conducted, using the approach in a real-world database integration scenario. The results demonstrate improvements in ontology creation and linking with databases, systematizing the use of Ontop for more efective integration.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Ontology</kwd>
        <kwd>Semantics</kwd>
        <kwd>Data Integration</kwd>
        <kwd>Mapping</kwd>
        <kwd>Database</kwd>
        <kwd>Computing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Works</title>
      <p>For this work, 20 studies adhering to its theme were selected. These studies underwent an evaluation
stage, which included the application of six classification criteria listed below.</p>
      <p>1. Did the work use a relational database as a data source?
2. Did the work generate an ontology?
3. Did the work present a detailed process for ontology construction?
4. Did the work link the defined ontology to the database?
5. Did the work enable querying the database from the semantic layer defined by the ontology?
6. Did the work define a detailed process for linking the ontology to the database?</p>
      <p>Among the analyzed studies, all incorporated the use of ontologies. Two used existing ontologies,
while the others developed their own. All allowed querying of data through the generated semantic
layer, with most using a database. However, none detailed the process of ontology construction and its
linkage to the database.</p>
    </sec>
    <sec id="sec-3">
      <title>3. ApplyOnTop - Approach for Semantic Data Integration Based on</title>
    </sec>
    <sec id="sec-4">
      <title>Ontop</title>
      <p>The ApplyOnTop is an approach designed to streamline database integration through the use of Ontop.
It guides users through a set of structured steps aimed at making the mapping process between ontology
elements and data stored in databases more accessible.</p>
      <p>ApplyOnTop is built with a focus on domains with multiple developed information systems. Its
strategy involves constructing the ontology from macro views of the individual ontologies of each
system involved in the domain. Subsequently, these macro views of the systems’ ontologies are merged
and detailed to generate the domain ontology, which is then linked to the systems’ databases.</p>
      <p>ApplyOnTop is divided into two distinct phases, as illustrated in the Figure 1. Phase 1, called Domain
Ontology Construction, encompasses activities related to obtaining domain knowledge and elaborating
its representation in the form of an ontology. Phase 2, Ontology-Database Linkage, includes activities
involved in mapping the ontology and linking it to the data’s physical structure.</p>
      <p>ApplyOnTop allows for evaluation, refinement, and continuous improvement of products developed
in previous stages, flexibilizing their sequence. As domain knowledge progresses, there is a need to
revisit and improve products, especially the ontology, initially constructed with a business perspective
and later enhanced with the technical perspective of databases.</p>
      <sec id="sec-4-1">
        <title>3.1. Phase 1 - Domain Ontology Construction</title>
        <p>
          The ontology construction process is segmented into four distinct phases: Specification,
Conceptualization, Formalization, and Implementation. In the Specification phase, the domain scope is delineated,
the ontology’s purpose is defined, and knowledge sources are identified [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. In the Conceptualization
phase, conceptual maps are developed for each system involved in the domain, providing a clear visual
representation of the acquired knowledge. The Formalization phase focuses on transforming conceptual
maps [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] into a formal ontology, involving detailed design elaboration and axiom definition in triples.
Finally, in the Implementation phase, the ontology is converted into OWL format, making it operational.
This work used Protégé as the OWL editor for ontology construction [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>The ApplyOnTop approach aims to unify the ontologies of diferent systems involved into a single
representation, thus simplifying the integration process. After each stage, a review and refinement are
conducted to ensure that the ontology meets the established requirements. This approach allows for
continuous evolution of the ontology as domain knowledge deepens. At the end of the process, users
have a robust and functional ontology ready to facilitate data integration and querying eficiently and
efectively.</p>
        <p>
          This phase was inspired by Methontology [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. However, modifications were suggested to the original
process to simplify it and adapt it to the objectives of this work. Among the changes made to
Methontology, the focus on building ontologies that encompass various information systems within the same
domain stands out.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Phase 2 - Ontology-Database Linkage</title>
        <p>This phase aims to associate ontology elements with data stored in domain databases, facilitating
data querying and understanding. Linkage is essential for the Ontology-Based Approach and requires
creating mappings between ontology elements and tables. This phase consists of four stages: Data
Identification and Preparation, Data Mapping, Query Development, and Verification.</p>
        <p>
          In the Data Identification and Preparation stage, data sources are made available and analyzed. In
this stage, the Physical Data Model Diagram is used to aid in understanding the data [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. In the Data
Mapping stage, the goal is to establish correspondences between ontology elements and fields in the
databases[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. This stage is carried out using Ontop[
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], which allows linking the ontology to the
database. To assist in mapping, the Linkage Diagram is created, representing how the ontology and
database data are connected.
        </p>
        <p>The Query Development stage aims to formulate queries capable of retrieving information from
the databases and eficiently answering questions defined for the ontology. SQL queries compatible
with the database structure and SPARQL queries to interact with the ontology and associated data are
elaborated. Finally, in the Verification stage, tests are conducted to verify the accuracy of the linkage
between the ontology and the databases. Throughout the process, opportunities may arise for ontology
evolution to a more robust and domain-appropriate representation.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Experiments and Analysis of Results</title>
      <p>For the experiment, the domain of Contracts and Customer Support was chosen. An ontology for
this domain, named Ontology for Contracts and Customer Support Integration, was developed as the
foundation for the experiment. Following ontology construction, steps were taken to link it to the
relevant databases.</p>
      <p>ApplyOnTop proved efective in integrating the systems within the domain, as evidenced by the results
obtained throughout the proposed phases and stages. During the Domain Ontology Construction phase,
a comprehensive view of the entities contained in the systems was obtained. ApplyOnTop introduced
defined stages that enhanced domain understanding, initially encompassing business perspective
through the use of Conceptual Map and Preliminary Ontology Design. This understanding contributed
to the elaboration of the Ontology Design, covering the classes identified in the domain’s systems.</p>
      <p>An important aspect of ApplyOnTop was its flexibility and iterative format, allowing for continuous
adjustments and improvements. The application of proposed diagrams in ApplyOnTop, such as Ontology
Design and Ontology-Database Linkage Diagram, significantly aided in the analysis and construction
of both ontology and mappings. This contribution was crucial due to the ease of understanding and
visualization of the necessary relationships and linkages for enabling integration.</p>
      <p>This outcome validates the developed approach, ofering organizations an option to efectively and
accurately address the complex integration demands present in their businesses. ApplyOnTop provided
a systematic view of the steps required to enable semantic integration using Ontop.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>This work aimed to develop an approach for semantic database integration based on Ontop, named
ApplyOnTop. This approach proposes the definition of structured activities to help users minimize the
complexities inherent in semantic data integration, making the application of Ontop more systematic.</p>
      <p>To validate the functionality of ApplyOnTop, an experiment was conducted, utilizing the need for
integrating two systems within a company. The experiment was successful, demonstrating the clear
creation of the ontology and its linkage to databases through the use of activities proposed in the
approach.</p>
      <p>ApplyOnTop presented significant aspects such as the creation of diagrams aiding analysis and
decision-making during Ontop integration execution. For example, the Conceptual Map plays a
significant role in obtaining initial domain information. Two other diagrams created were the Preliminary
Ontology Design and the Ontology Design, allowing views of partial ontologies and the consolidated
ontology to be built, aiding in analysis and ontology configuration in Protégé. Additionally, the
construction of the Ontology-Database Linkage Diagram assists in mapping ontology elements with stored
data.</p>
      <p>Ultimately, the significant characteristic of the work was the creation of a systematic approach for
Ontop usage in semantic integration. This systematization defines phases and steps to be executed,
allowing for an iterative approach that enables revisiting previous steps to enhance already built
products. Thus, as knowledge about the domain, business, and database is obtained, the ontology may
need to evolve.</p>
      <p>The ApplyOnTop approach presented a clear structure and defined guidelines to simplify Ontop
usage. By establishing specific steps to follow, the process of ontology construction and linkage becomes
more accessible and guided. Therefore, this work can assist professionals interested in working with
Ontop, simplifying and facilitating the complex journey of data integration.
5.1. Contributions
1. Detailing for Ontology Construction Using Menthontology: Although Menthontology
was adopted as the foundation, the ontology construction process has been enhanced. In this
enhancement, the focus is on building ontologies in domains that encompass multiple information
systems. To achieve this, steps have been included to guide the ontology builder from documenting
the acquired knowledge, using conceptual mapping, to the actual construction of the ontology,
supported by Preliminary Ontology Design and Ontology.
2. Visualization Aspect of Ontology Design: The use of Ontology Design facilitates analyzing and
planning the ontology to be built in an easier way. This approach allows for a preview visualization
of classes and relationships before implementation in Protégé, significantly simplifying the process.
Thus, not only does it reduce the efort involved in construction, but it also creates an important
input that can be used alongside the created ontology, contributing to a clearer conception.
3. Integration of Partial Ontologies Based on Databases: The strategy of developing macro
views of the ontologies from each system involved in the domain, to later merge them and
generate the final ontology, aids in the creation process. This approach helps to make the process
clearer and more precise, allowing for a comprehensive understanding of the individual ontologies
before their consolidation.
4. Link Visualization: The use of the Link Diagram enables the analysis and documentation of the
mapping to be established between the ontology and the databases. This tool makes the linking
process more transparent, providing a visual representation that facilitates understanding of the
relationships between the ontology elements and the data in the corresponding bases.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Patrizio</surname>
          </string-name>
          , Idc:
          <article-title>Expect 175 zettabytes of data worldwide by 2025, Network World 3 (</article-title>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H. V.</given-names>
            <surname>Jagadish</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gehrke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Labrinidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Papakonstantinou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ramakrishnan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Shahabi</surname>
          </string-name>
          ,
          <article-title>Big data and its technical challenges</article-title>
          ,
          <source>Communications of the ACM</source>
          <volume>57</volume>
          (
          <year>2014</year>
          )
          <fpage>86</fpage>
          -
          <lpage>94</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Grüninger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Atefi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <article-title>Ontologies to support process integration in enterprise engineering</article-title>
          ,
          <source>Computational &amp; Mathematical Organization Theory</source>
          <volume>6</volume>
          (
          <year>2000</year>
          )
          <fpage>381</fpage>
          -
          <lpage>394</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cogrel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <article-title>Virtual knowledge graphs: An overview of systems and use cases</article-title>
          ,
          <source>Data Intelligence</source>
          <volume>1</volume>
          (
          <year>2019</year>
          )
          <fpage>201</fpage>
          -
          <lpage>223</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Rautenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Todesco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Steil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Gauthier</surname>
          </string-name>
          , Uma metodologia para o desenvolvimento de ontologias,
          <source>RECEN-Revista Ciências Exatas e Naturais</source>
          <volume>10</volume>
          (
          <year>2008</year>
          )
          <fpage>237</fpage>
          -
          <lpage>262</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Novak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Cañas</surname>
          </string-name>
          ,
          <article-title>A teoria subjacente aos mapas conceituais e como elaborá-los e usá-los, Práxis educativa (</article-title>
          <year>2010</year>
          )
          <fpage>09</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Protégé</surname>
          </string-name>
          ,
          <string-name>
            <surname>Protégé</surname>
          </string-name>
          (
          <year>2023</year>
          ). URL: https://protege.stanford.edu/, acessado em 24 de setembro de
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fernández-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gómez-Pérez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Juristo</surname>
          </string-name>
          ,
          <article-title>Methontology: from ontological art towards ontological engineering (</article-title>
          <year>1997</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Elmasri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Navathe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Pinheiro</surname>
          </string-name>
          , et al., Sistemas de banco de dados (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cogrel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Komla-Ebri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kontchakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rezk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodriguez-Muro</surname>
          </string-name>
          , G. Xiao, Ontop:
          <article-title>Answering sparql queries over relational databases</article-title>
          ,
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <year>2017</year>
          )
          <fpage>471</fpage>
          -
          <lpage>487</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Ontop</surname>
            <given-names>Team</given-names>
          </string-name>
          , OntopTutorial: Basic Setup (
          <year>2023</year>
          ). URL: https://ontop-vkg.org/tutorial, acessado em 30 de setembro de
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>