<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>PeTwin: An ontology-supported data access for petroleum production digital twin</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mara Abel</string-name>
          <email>marabel@inf.ufrgs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>João Cesar Netto</string-name>
          <email>netto@inf.ufrgs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabrício Henrique Rodrigues</string-name>
          <email>fabricio.rodrigues@inf.ufrgs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nicolau Oyhenard dos</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Santos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafael Humann Petry</string-name>
          <email>rhpetry@inf.ufrgs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Haroldo Rojas de Souza Silva</string-name>
          <email>hrssilva@inf.ufrgs.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>UFRGS - Federal University of Rio Grande do Sul</institution>
          ,
          <addr-line>Porto Alegre</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>A digital twin is a system framework tightly attached to a physical production plant conceived for monitoring the operation in real time. This framework integrates data from distinct sources and supports data analytics and predictive evaluation of the petroleum flow and the maintenance schedule. The scenario's difficulty includes multiple data suppliers, diverse data sources and platforms, heterogeneous data types and formats, data or unit transformation needs, and multifaceted data semantics. These requirements demand an innovative semantic solution for data integration and processing in the digital twin environment. The PeTwin project looks to define the best practices and software solutions for the development of digital twins for petroleum production plants. The project's central objective is to deal with the semantic complexity of the information and offer a functional framework for machine learning and data analytics to support engineering daily operations in petroleum production surveillance. We have developed a network of BFO-based domain ontologies, an associated knowledge graph, and an application layer that implements the semantic treatment of information in a real scenario of petroleum production wells.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;digital twin</kwd>
        <kwd>ontology</kwd>
        <kwd>semantic data access and integration</kwd>
        <kwd>petroleum production 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The information that feeds a modern oil-field digital twin integrates temporal series and static data,
configuring a complex scenario for data analytics. The data is usually spread across many
applications from several service companies that perform specific tasks during operations. These
systems exchange data in distinct layouts, sometimes in proprietary formats. Even when accessed in
open formats, an integrated digital twin requires uniformization in data meaning, formats, units of
measure, the scale of analysis, and interval of time associated with the track of data provenience.
The integrated operation center receives these data hand-labeled with their source and meaning and
analyses them to support short-term decisions.</p>
      <p>The PeTwin project is a 4- year (2020-2024) joint cooperation between Oslo University and
UFRGS, with the participation of Libra Consortium (Brazil), Equinor (Norway), and Shell (Norway)
that looks to define the best practices for the development of digital twins for petroleum production
plants and for offering data analytic methods based on machine learning.</p>
    </sec>
    <sec id="sec-2">
      <title>2. The ontology for petroleum production plant</title>
      <p>
        One contribution of the UFRGS team to the project was developing a well-founded domain ontology
to document the meaning and logical restriction of the assets and processes involved in petroleum
production and facility maintenance, structuring the framework for the semantic interoperability of
data operated by the digital twin. We proposed the O3PO – Offshore Petroleum Production Plant
Ontology [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], a network of independent domain ontologies dealing with a specific part of the
petroleum production process. The ontology development started with an extensive survey of
existent standards and ontologies in the petroleum industry domain. Our research includes the
resources of ISO 15926; the integrated data platform from the OSDU - Open Subsurface Data Universe
Forum [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]; the standard glossaries of the Professional Petroleum Data Management (PPDM) What
is a Well? [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and What is a facility? [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]; the interoperability standard PRODML [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; and the
equipment specification of CFIHOS [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>We followed the analysis of these resources by a sequence of interviews with petroleum industry
professionals from subsea, reservoir, flow maintenance, production, and integrated process
monitoring areas. The interviews produced a list of digital twin software functional requirements
and raised the terminology adopted for system and data labeling that drives the domain ontology
usage. The whole ontology network followed the same building methodology. It specialized the Basic
Formal Ontology (BFO) top ontology, which guarantees a common conceptual basis for integration
and makes easy alignment with previously developed BFO-derived domain ontologies. The ontology
formalizes the logical definition and textual documentation of each entity representing the
production plant installation assets, including their qualities, domain of qualities, relations, and
associated dependent entities. We manually created a knowledge graph expressing all instances and
relations among them, such as pipe connections, installation sets, and components.</p>
      <p>The network of ontologies provides the semantic framework for the whole digital twin
architecture and software applications with several semantic capabilities. A set of OWL files that
describe the ontology entities, the knowledge graph of ontological relations among ontology
instances, and the mapping component that connects these two components with other applications
compound the semantic framework. We applied this framework for several tasks, such as
information retrieval in the historian system, extracting contextual information about the production
plant installation, and connecting the assets with P&amp;ID (piping and instrumentation diagram) to
explore the oil flow data in the diagrams. This usage led to the development of in-house applications
and the adoption of cloud digital twin solutions.</p>
      <p>
        One in-house solution, focused on the visual exploration of assets, uses the embedded knowledge
in the knowledge graph to find the appropriate time-series data in a historian system while also
using it to provide more context to the system user. The user selects the ontology entities, and our
knowledge graph application retrieves the data associated with these entity instances directly from
the data storage, such as well names, their parts and types of properties, and the references to the
time series in the historian system. We have conceived the whole solution as a microservice
architecture, in which the components play a specific task customized to the environment and
software platform, as shown in Figure 1 (a). This approach allows the easy adaptation of the solution
for other ontology-based tasks and platforms. Figure 1(b), extracted from [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] depicts the application
interface with its main functionalities.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Acknowledgements</title>
      <p>The PeTwin project was financed by FINEP and the Libra Consortium (Petrobras, Shell Brasil, Total
Energies, CNOOC, CNPC). The research group is supported also by CAPES Finance Code 001 and
CNPq, the Brazilian Finance Council.
Conference on Tools with Artificial Intelligence (ICTAI), IEEE, Nevada, 2023. ^DOI
10.1109/ICTAI59109.2023.00023</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Romeu</surname>
          </string-name>
          , G. Nascimento,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Abel, O3PO: A Domain Ontology for Offshore Petroleum Production Plants</article-title>
          ,
          <source>Expert Systems with Applications</source>
          <volume>238</volume>
          (
          <issue>122104</issue>
          ) (
          <year>2024</year>
          ).doi.org/10.1016/j.eswa.
          <year>2023</year>
          .
          <volume>122104</volume>
          :
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.O.G. OSDU</given-names>
            , The Open Group Guide OSDU™
            <surname>System</surname>
          </string-name>
          <string-name>
            <surname>Concept</surname>
          </string-name>
          , The Open Group,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>PPDM</surname>
          </string-name>
          ,
          <article-title>What is a well?</article-title>
          ,
          <source>PPDM Association</source>
          , Houston,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>PPDM</surname>
          </string-name>
          ,
          <article-title>What is a facility?</article-title>
          ,
          <source>PPDM Association</source>
          , Houston,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.J.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.R.</given-names>
            <surname>Ballin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bennis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.E.</given-names>
            <surname>Heath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.D.</given-names>
            <surname>Hiebert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>McKenzie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Rainaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.C.S.P.E.</given-names>
            <surname>Schey</surname>
          </string-name>
          , Reservoir Modeling:
          <article-title>From RESCUE To RESQML, SPE Reservoir Evaluation &amp; Engineering (</article-title>
          <year>2012</year>
          ).
          <volume>10</volume>
          .2118/135280-pa:
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>IOGP</given-names>
            ,
            <surname>Capital Facilities Information Handover Specification (CFIHOS) - Specification Document</surname>
          </string-name>
          , International Association of Oil &amp; Gas
          <string-name>
            <surname>Producers</surname>
          </string-name>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N.O.</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rivera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Petry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          , G. Nascimento,
          <string-name>
            <given-names>J.</given-names>
            <surname>Comba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Abel</surname>
          </string-name>
          , Ontology Explorer:
          <article-title>An Ontology-Based Visual Analytics System for Exploring Time Series Data in Oil and Gas</article-title>
          , in: N.
          <string-name>
            <surname>Aussenac-Gilles</surname>
            , T. Hahmann (Eds.) Formal Ontology in Information Systems, IOS, Sherbrooke,
            <given-names>Canada</given-names>
          </string-name>
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.B.</given-names>
            <surname>Correia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Abel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Becker</surname>
          </string-name>
          ,
          <article-title>Data management in digital twins: a systematic literature review</article-title>
          ,
          <source>Knowledge and Information Systems</source>
          (
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>32</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.P.</given-names>
            <surname>Knebel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Trevisan</surname>
          </string-name>
          , G.S. do Nascimento,
          <string-name>
            <given-names>M.</given-names>
            <surname>Abel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.A.</given-names>
            <surname>Wickboldt</surname>
          </string-name>
          ,
          <article-title>A study on cloud and edge computing for the implementation of digital twins in the Oil &amp; Gas industries</article-title>
          ,
          <source>Computers &amp; Industrial Engineering</source>
          <volume>182</volume>
          (
          <year>2023</year>
          )
          <article-title>109363</article-title>
          .https://doi.org/10.1016/j.cie.
          <year>2023</year>
          .
          <volume>109363</volume>
          :
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>J.B. Correia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Rodrigues</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Abel</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Becker</surname>
          </string-name>
          ,
          <article-title>Data management in Digital Twins for the Oil and Gas Industry: beyond the OSDU Data Platform</article-title>
          ,
          <source>Journal of Information and Data Management</source>
          <volume>13</volume>
          (
          <issue>3</issue>
          ) (
          <year>2022</year>
          ).https://doi.org/10.5753/jidm.
          <year>2022</year>
          .
          <volume>2506</volume>
          :
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Schena</surname>
          </string-name>
          ,
          <article-title>A methodology for synthetic generation of failure data for data-driven prognostics and health management modeling for digital twins</article-title>
          , PPGC,
          <string-name>
            <surname>UFRGS</surname>
          </string-name>
          ,
          <source>Porto Alegre</source>
          <year>2023</year>
          , p.
          <fpage>108</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.V.L.</given-names>
            <surname>Pivetta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.H.</given-names>
            <surname>Simon</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.M. Costa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Abel</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          <string-name>
            <surname>Carbonera</surname>
          </string-name>
          ,
          <article-title>A Systematic Evaluation of Machine Learning Approaches for Petroleum Production Forecasting</article-title>
          , IEEE 35th International
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>