<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Service-oriented tools for automating digital twin development</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Roman Kostromin</string-name>
          <email>kostromin@icc.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Feoktistov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mikhail Voskoboinikov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Matrosov Institute for System Dynamics and Control Theory of SB RAS</institution>
          ,
          <addr-line>Lermontov St. 134, Irkutsk, 664033</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper represents a prototype of service-oriented tools for developing digital twins. These tools automate most of the stages in preparing and carrying out a computational experiment reducing the possibility of human error. Computational experiments based on simulation modeling are performed using the proposed tools. As an example in applying the represented tools, a digital twin of a heat pump used as environmentally-friendly equipment of an infrastructure object on the Baikal natural territory is considered. Owing to the growing anthropogenic load, special attention is paid to the objects located at the coast of Lake Baikal. During the simulation modeling, both the heat pump performance characteristics and retrospective meteorological data are used to predict climatic conditions and select optimal control parameters. In addition, the computational experiment has obviously shown that heat pump use significantly reduces the harmful effect on the environment.</p>
      </abstract>
      <kwd-group>
        <kwd>1 digital twin</kwd>
        <kwd>simulation modelling</kwd>
        <kwd>microservices</kwd>
        <kwd>environmentally-friendly equipment</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Nowadays, environmental problems, in particular the reduction of  2 emissions, are the subject
of research by the scientific community around the world [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. One of the directions of such research is
the introduction of environmentally-friendly equipment. In the era of the development of Industry 4.0
technology, a digital twin is an effective tool for analyzing the benefits of using the equipment.
Malakuti et. all [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] describe a digital twin as “a formal digital representation of some asset, process or
system that captures attributes and behaviors of that entity suitable for communication, storage,
interpretation or processing within a certain context”.
      </p>
      <p>
        As a rule, the object behavior study using a digital twin is based on the simulation modeling of
object operation processes [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Such a simulation often requires applying the Monte Carlo method to
provide the reliability of stochastic simulation results, often generates big data, and thereby leads to
the need for High-Performance Computing (HPC) and Virtual Machines (VMs) use [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Moreover,
within the simulation modeling, it becomes necessary to vary the values of the structural and
operational parameters of the object in order to determine the optimal ones. Thus, the problem arises
in organizing and carrying out parameter sweep computation [5]. In addition, current and
retrospective data on the object equipment operation are necessary for verification and validation of
simulation models, as well as to object diagnostic and control [6].
      </p>
      <p>In terms of system architecture, there is a tendency to move from monolithic and general-purpose
applications to compact and specialized web services [7].</p>
      <p>To this end, we have developed a prototype of new microservice-oriented tools for constructing
sets of digital twins representing various subsystems of an infrastructure object. In addition, we have
created a system of intelligent agents who are delegated the rights and obligations of subjects related
to ensuring the object operation and consuming the resources and services provided to them.</p>
      <p>Unlike the well-known toolkits considered in [8-10], the developed tools provide comprehensive
support for the following operations:
• Partial automation of the model design process;
• Verification and validation of models;
• Adaptation of models to a specific subject area; support for hybrid modeling;
• Collection and processing of semi-structured data;
• Converting data to the formats required in models;
• Multi-criteria analysis of simulation results;
• Automation of the formation of a web interface for performing the above-listed tools.</p>
      <p>We applied the developed tools in the study of the heat supply system that uses heat pumps for
infrastructure objects of the Baikal natural territory.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Tools for Automating Digital Twin Development</title>
      <p>Within the proposed approach, the software of a digital twin is a distributed applied software
package. Such software includes a set of applied modules (for example, simulation models). Separate
models are represented by services. Planning and executing a workflow-based service composition
are supported.</p>
      <p>A feature of multiple runs of simulation models is the necessity to save different versions of the
experiment and the ability to run a specific version for computations. Therefore, our tools were
developed using the concept of parent and inherited services. Parent services are implemented
applying the Node.js language within the REST approach. They contain the basic operations required
for all stages in preparing and carrying out experiments. Among such operations are forming a web
interface, generating model specifications, parsing generated specifications, creating and queuing
VM-based jobs, configuring and monitoring a computing environment, and analyzing simulation
modeling results.-jobs generation, and configuration, environment monitoring, analyzing simulation
modeling results.</p>
      <p>An inherited service is considered a new computational experiment that inherits the basic
capabilities of the parent service, but it is used in a new configuration. This service configuration is a
separate project that can be published for use in other experiments. A detailed description of model
parameters, experiments, and requirements to the environment is indicated in [11], where the
specification example in the JSON format is presented.</p>
      <p>The creation of a new project includes the processes of importing and parsing the simulation
model specification, as well as preparing its input data. Also, it is possible to generate input data
variants based on the model specification. The VM lifecycle management is implemented using an
add-on over OpenStack, in which VM-based jobs are transferred for execution to the queue system of
the computing cluster [12]. All standard GPSS simulation reports from the experiment and other
output files are collected for processing and analysis.</p>
      <p>The general scheme of the user’s interaction with our tools is shown in Figure 1. The user creates a
new project using the web interface and loads the simulation model and list of its parameters. Next,
the specification generation service forms a VM-based job that contains the model and its
specification. This job is transferred to the queue of the resource management system located on the
computing cluster.</p>
    </sec>
    <sec id="sec-3">
      <title>3. DT of heat pump</title>
      <p>In this section, we demonstrate applying the developed tools in the study of the heat supply system
that uses heat pumps for infrastructure objects of the Baikal natural territory. The heat pump makes it
possible to reduce the cost of electricity and decrease the harmful effect on the environment due to the
use of natural heat sources (soil or water) [14]. The models are developed by problem domain
experts. Within the digital twin, DevOps is supported in relation to the developed models. [15].</p>
      <p>In our approach, the digital twin of the heat pump is essentially a modular application. A module is
a sequential or parallel program. In our example, a module is a model on the GPSS language.
Therefore, we launch VMs that contain instances of GPSS-model and tie them to different processor
cores of computing environment nodes because GPSS-models are run under Windows operation
systems in single-threaded mode only.</p>
      <p>Data for executable modules are transferred from different sources. Accumulated meteorological
data and equipment operation information are stored in the retrospective DB (Figure 2). The
implementation of their collection and primary processing requires the use of additional software and
hardware. We use microcomputers and software agents located on them [16]. The agents perform
monitoring data received from sensors installed on heat pumps, pre-processing the obtained data, and
periodic uploading the pre-processed data to a central database. It is important to note that during the
modeling process, simulation models (package modules) interact with agents and request the
necessary information via the REST-API.</p>
      <p>For each digital twin, the specification is generated based on the simulation models and their input
parameters. It contains the data sources, formats of input and output files, parameters for generating
variables, and additional system information.</p>
      <p>The input data are the results of meteorological observations in the Baikal coastal area over the
past 8 years. This data makes it possible to simulate climate change and control the heat pump
operation to ensure a comfortable indoor temperature.</p>
      <p>We studied the effect of reducing  2 emissions by means of simulation in replacing a coal-fired
boiler with one of six pump types (Corsa 55, Corsa 70, MOTEN-18D 57, MOTEN-18D 70, BROSK
Mark Prom 58, and BROSK Mark Prom 71) from three Russian manufacturers taking into account
capital investment. Within the experiment, 24 variants of input data were prepared to simulate the
heat pump operation. Each variant contains different combinations of values for the pump type, the
need for drilling, and daily object service capacity in the number of persons. The two criteria for
optimization are  2 emission reduction and capital investments with respect to the following
optimality conditions: maximized quantity and minimized size correspondingly.</p>
      <p>In the process of modeling, the object service capacity is 60 and 80 persons. Figure 4 (Figure 5)
demonstrates the CO_2 emission reduction (a) and capital investments (b) for object service capacity
equal to 60 (80) persons.</p>
      <p>a)
b)</p>
      <p>2 emission (a) via capital investments for the variants with 60 person</p>
      <p>For selecting optimal parameters for operating environmentally-friendly equipment of the studied
objects, we apply the following three methods of discrete multi-criteria analysis from [17]:
Lexicographic (1), Majority (2), and Pareto-optimal (3). The Lexicographic method provides selecting
the optimal variant by sequentially comparing the criteria, sorted by their significance. In our
example, we can prefer to increase the  2 emission reductions, or vice versa, save capital
investments. When the criteria are of equivalent importance, the majority method gives us the variant
that includes the largest number of these criteria with the best values. With the means of the
Paretooptimal selection, we find all the variants that are incomparable with each other, and at the same time,
surpass all the remaining ones. The results of applying the aforementioned methods are shown in
Table 1.</p>
      <p>Thus, we can see that the selection of specific equipment depends on both its structural and
operational parameters and methods of multi-criteria analysis.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions</title>
      <p>In the paper, we address the development and applying the prototype of service-oriented tools for
creating and carrying out DTs of infrastructure objects. These tools make it possible to perform
simulation modeling based on parameter sweep computing followed by a multi-criteria selection of
parameters for the operation of environmentally-friendly equipment of the studied objects. The
advantages in applying the developed tools are the automation of software deployment and
configuration, significant reduction of the large-scale experiment makespans, and increase of the
computation reliability.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Acknowledgements</title>
      <p>The study is supported by the Russian Foundation of Basic Research, project no. 19-07-00097.
The study is supported by the Russian Foundation of Basic Research and Government of Irkutsk
Region, project no. 20-47-380002-р_а. The study related to hybrid modeling the computing
environment was funded by the Ministry of Science and Higher Education of the Russian Federation,
project no. FWEW-2021-0005 “Technologies for the development and analysis of subject-oriented
intelligent group control systems in non-deterministic distributed environments”.</p>
    </sec>
    <sec id="sec-6">
      <title>6. References</title>
      <p>[5] G. Henley, A. Card, T. Haup, Using High Performance Computing for Parameter Sweep of S3D
Applications, in: IEEE Electric Ship Technologies Symposium (ESTS), IEEE, Washington,
2019, pp. 273–277. doi: 10.1109/ESTS.2019.8847871.
[6] Y. Lu, C. Liu, K.I. Wang, H. Huang, X. Xu, Digital Twin-driven smart manufacturing:
Connotation, reference model, applications and research issues, Robotics and
ComputerIntegrated Manufacturing 61 (2020) 101837. doi: 10.1016/j.rcim.2019.101837.
[7] A. Kovtunenko, A. Bilyalov, S. Valeev, Distributed streaming data processing in IoT systems
using multi-agent software architecture, Lect. Notes Comput. Sc. 11118 (2018) 572–583. doi:
10.1007/978-3-030-01168-0_51.
[8] M. Marrone, L.M. Kolbe, Impact of IT service management frameworks on the IT organization.</p>
      <p>An empirical study on benefits, challenges and processes, Business &amp; Information Systems
Engineering 3(1) (2011) 5–18. doi: https://doi.org/10.1007/s11576-010-0257-8.
[9] S. Ivanov, K. Nikolskaya, G. Radchenko, L. Sokolinsky, M. Zymbler, Digital Twin of City:
Concept Overview, in: Proceedings of the Global Smart Industry Conference (GloSIC), IEEE,
Chelyabinsk, 2020, pp. 178–186. doi: 10.1109/GloSIC50886.2020.9267879.
[10] W. Kritzinger, M. Karner, G. Traar, J. Henjes, W. Sihn, Digital Twin in manufacturing: A
categorical literature review and classification, IFAC-PapersOnLine 51(11) (2018) 1016–1022.
doi: https://doi.org/10.1016/j.ifacol.2018.08.474.
[11] R. Kostromin, A. Feoktistov, Agent-Based DevOps of Software and Hardware Resources for
Digital Twins of Infrastructural Objects, in: Proceedings of the 4th International Conference on
Future Networks and Distributed Systems (ICFNDS 2020), ACM, Saint Petersburg, 2020,
pp. 1–6.
[12] A. Feoktistov, I. Sidorov, A. Tchernykh, A. Edelev, V. Zorkalzev, S. Gorsky, R. Kostromin R,
I. Bychkov, A. Avetisyan, Multi-Agent Approach for Dynamic Elasticity of Virtual Machines
Provisioning in Heterogeneous Distributed Computing Environment, in: Proceedings of the
International Conference on High Performance Computing and Simulation, IEEE Press, Orleans,
2018, pp. 909–916. doi: 10.1109/HPCS.2018.00145.
[13] R. Kostromin, Survey of software configuration management tools of nodes in heterogeneous
distributed computing environment, in: Proceedings of the the 2nd Intern. Workshop on
Information, Computation, and Control Systems for Distributed Environments (ICCS-DE 2020),
CEUR-WS Proceedings, Aachen, 2020, 2638, pp. 156–165. doi: 10.47350/ICCS-DE.2020.15.
[14] B. Baeten, F. Rogiers, L. Helsen, Reduction of heat pump induced peak electricity use and
required generation capacity through thermal energy storage and demand response, Applied
Energy 195 (2017) 184–195. doi: 10.1016/j.apenergy.2017.03.055.
[15] A. Feoktistov, S. Gorsky, I. Sidorov, A. Tchernykh, Continuous Integration in Distributed
Applied Software Packages, in: Proceedings of the 42th Int. Convention on information and
communication technology, electronics and microelectronics, IEEE, Riejka, 2019,
pp. 1775–1780. doi: 10.23919/MIPRO.2019.8757002.
[16] I.A. Sidorov, R.O. Kostromin, A.G. Feoktistov, System for monitoring parameters of functioning
infrastructure objects and their external environment, in: Proceedings of the the 2nd Intern.
Workshop on Information, Computation, and Control Systems for Distributed Environments
(ICCS-DE 2020), CEUR-WS Proceedings, Aachen, 2020, 2638, pp. 252–264. doi:
10.47350/iccs-de.2020.23.
[17] L.A. Sholomov, Logical research methods of discrete selection models, Nauka, Moscow, 1989,
288 p. (in Russian).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mardani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Streimikiene</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cavallaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Loganathan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Khoshnoudi</surname>
          </string-name>
          ,
          <article-title>Carbon dioxide (CO2) emissions and economic growth: A systematic review of two decades of research from 1995 to 2017, Science of the total environment 649 (</article-title>
          <year>2019</year>
          )
          <fpage>31</fpage>
          -
          <lpage>49</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.scitotenv.
          <year>2018</year>
          .
          <volume>08</volume>
          .229.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Malakuti</surname>
          </string-name>
          , P. van
          <string-name>
            <surname>Schalkwyk</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Boss</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Ram Sastry</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Runkana</surname>
            ,
            <given-names>S.W.</given-names>
          </string-name>
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Rix</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Green</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Baechle</surname>
            ,
            <given-names>N.C.</given-names>
          </string-name>
          <string-name>
            <surname>Nath</surname>
          </string-name>
          ,
          <article-title>Digital Twins for Industrial Applications</article-title>
          . Definition, Business Values, Design Aspects, Standards and Use Cases: White Paper, in: Industrial Internet Consortium: Milford, MA,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.V.</given-names>
            <surname>Petrov</surname>
          </string-name>
          ,
          <article-title>Simulation as the basis of digital twin technology</article-title>
          ,
          <source>Proceedings of Irkutsk State Technical University</source>
          <volume>22</volume>
          (
          <issue>10</issue>
          ) (
          <year>2018</year>
          )
          <fpage>56</fpage>
          -
          <lpage>66</lpage>
          . doi:
          <volume>10</volume>
          .21285/1814-3520-2018-10-56-
          <fpage>66</fpage>
          . (In Russian).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.A.</given-names>
            <surname>Varotsos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.F.</given-names>
            <surname>Krapivin</surname>
          </string-name>
          ,
          <article-title>A new big data approach based on geoecological informationmodeling system</article-title>
          ,
          <source>Big Earth Data</source>
          <volume>1</volume>
          (
          <issue>1</issue>
          -2) (
          <year>2017</year>
          )
          <fpage>47</fpage>
          -
          <lpage>63</lpage>
          . doi:
          <volume>10</volume>
          .1080/20964471.
          <year>2017</year>
          .
          <volume>1397405</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>