<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Structuring and Accessing Knowledge for Historical and Streaming Digital Twins</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>AnSyMo Lab, University of Antwerp</institution>
          ,
          <addr-line>Antwerp 2000</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>DMMS Lab, Katholieke Universiteit Leuven</institution>
          ,
          <addr-line>Leuven 3001</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Flanders Make vzw</institution>
          ,
          <addr-line>Lommel 3920</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2079</year>
      </pub-date>
      <abstract>
        <p>Organisations are intensely developing Digital Twins (DTs) to correctly and e ciently answer questions about the history and behaviour of physical systems. However, it is not clear how to construct these DTs starting from the data, information, knowledge, and wisdom available in the organisation. In this work, we present our approach to DT construction which involves a layered knowledge graph (KG) architecture communicating with the organisation's data repositories. We explain the components and timelines for using the KG to build both historical and streaming DTs, and what kinds of questions can be answered for our drivetrain use case.</p>
      </abstract>
      <kwd-group>
        <kwd>Digital Twins</kwd>
        <kwd>Knowledge Graph</kwd>
        <kwd>Knowledge Representation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The rise of the Internet of Things (IoT) and Industry 4.0 means that today's
industries are faced with a deluge of data. Multitudes of data points (sales, sensor
values, maintenance records, . . . ) are generated each second and poured into
various data repositories, each with implicit connections and relations between
organisational resources (processes, machines, environmental conditions, etc.).
The lack of semantics and explicit relations can hinder actionable insights [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
such that a data scientist investigating the data spends considerable time to
understand the context of the data before it can be used. This slows down the
process of discovering trends, in uencing new designs, using modelling results,
and discovering past experiments and lessons learned.
      </p>
      <p>
        For example, a simple query such as what experiment produced these results?
can often become an laborious endeavour to answer. As a rst step, one must nd
Copyright © 2021 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
the data and then understand the format and units, which may involve countless
emails to nd the one person in the organisation who understands the data. This
ine cient process can consume the time of multiple people on coordination,
which is further delayed if anyone involved has left the organisation.
Report Context This paper presents our current insights and vision developed
as part of a joint academic-industrial project led by Flanders Make, the
strategic research center for the Flemish manufacturing industry. The Framework for
Systematic Design of Digital Twins (DTDesign) project brings together three
universities and ve industrial partners to focus on determining the most e
ective technologies and methods for how to build a Digital Twin (see Section 3)
from a representation of an organisation's past and current data, information,
knowledge, and wisdom (DIKW) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. The insights presented in this report are
gained from our collaborations, discussions, and creation of initial prototypes.
Throughout the lifetime of the project, these results are and will be iteratively
applied and validated on the industrial use cases, beyond the academic drivetrain
use case presented in Section 3.2.
      </p>
      <p>
        Approach In the DTDesign approach, the representation of an organisation's
DIKW is stored as a knowledge graph (KG) which \acquires and integrates
information into an ontology and applies a reasoner to derive new knowledge"[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
It contains relevant DIKW which is consistent and provides value to the
organisation through its connections, and has been successfully applied in many
domains [
        <xref ref-type="bibr" rid="ref1 ref4">1, 4</xref>
        ]. The KG is built, maintained, and queried by the user (possibly
through applications). The results of these queries give insights into the system,
or are used to build applications or code to run on the machines. For example,
anomaly detection and optimisation algorithms are trained and executed using
the input conditions of particular procedures [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>
        While KGs and DTs are each relatively old concepts, only recently has there
been work on their interactions, such as [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. This is due to the increasing
implementation of DTs within industry and academia, the rise of IoT gathering an
enormous amount of data, and the computing resources o ered by cheap servers
locally or in the cloud. However, experience reports on the development of DTs
and connected systems can lack crucial information [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], especially on the DT
components and their developmental timelines. In this paper, we describe the
processes of building up such a KG, building DTs from that KG, and their
relationships to each other and the physical system under study. In particular, we
discuss how the combination of KGs with the collection of industrial data at
scale leads to e ective DTs able to provide actionable answers to questions. An
example timeline is also presented to illustrate the steps for building DTs from
a KG.
      </p>
      <p>Contributions and Structure This paper presents these speci c contributions:
{ Section 2: A high-level architecture relating a KG to DTs.
{ Section 3: De nition of historical and streaming DTs, and a connection to
the answers they provide for an example drivetrain use case.
{ Section 4: A discussion of the structure and purpose of this KG including
how to access, update, and compartmentalise the
data/information/knowledge/wisdom within to serve large industrial organisations.
{ Section 5: A presentation of an example timeline for creating DTs and their
relations from a KG.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Conceptual Architecture</title>
      <p>This section provides a brief overview of our approach to using a knowledge
graph (KG) to assist with the construction of digital twins (DTs). As discussed
in Section 3, these DTs are a virtual representation of a physical system.</p>
      <p>Figure 1 shows the conceptual architecture of our solution. Our approach
relates four main components: 1) the KG containing information, which rests on
the data repositories of the organisation and acts as a historical DT, 2) the users
and applications who access information from the KG and DT, 3) a physical
system under study or control, and 4) the (streaming) DT of that system.</p>
      <p>Four relations are also speci ed: A) the relation between the physical system
and the DT, B) information added to or retrieved from the KG, C) the connection
between the KG, data repositories, and the digital twin, and D) the questions and
answers between the user and the DT. Of course, the users may also manipulate
the physical system, but we omit this interaction from our architecture.</p>
      <p>The remaining sections of this paper describe these components and relations
in detail. Also note that Figure 1 does not represent the time dimension of the
creation of the historical and streaming DTs. This is discussed in Section 5.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Running Example: Digital Twin(s) for a Drivetrain</title>
      <p>This section provides background on digital twins (DTs) and introduces our
notions of historical and streaming DTs. Our drivetrain use case is also brie y
presented through a discussion of how these DTs can provide actionable answers
about the drivetrain.</p>
      <sec id="sec-3-1">
        <title>Digital Twin Background</title>
        <p>
          As sensors and computing power become ever cheaper each year, it becomes
easier to combine real-time data with high- delity models to create a virtual
representation of a system to answer questions about that system. The umbrella
term digital twin captures this relationship between a physical system and its
digital counterpart [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], where \a DT is a virtual instance of a physical system
(twin) that is continually updated with the latter's performance, maintenance,
and health status data throughout the physical system's life-cycle" [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
        <p>
          Kritzinger et al. classify the term digital twin into three categories depending
on the relation between the physical object and the digital object [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. This
relation is labelled A in Figure 1. A digital model is where there is no automatic
transfer of data between the digital and physical objects. A digital shadow is
where data from the physical object is sent to the digital object automatically,
as in a tracking simulator. Finally, a digital twin has an automatic connection
both to and from the physical system. Further details about this separation and
its application to DT experience reports are found in our previous work [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Addressing Drivetrain Questions</title>
        <p>One of the use cases within the DTDesign project (see Section 1) is to
represent and answer questions about an generic drivetrain representing drive-load
interaction, which composes the components that deliver power from a motor
to the wheels. In our conceptual architecture, these questions are answered by
software/hardware components implementing a DT. Three question sets are
presented here, as they showcase how the DTs built are at di erent scales and relate
to the other components in Figure 1 di erently.</p>
        <p>The rst question set is, \what is the correlation between motor current and
the drivetrain torque, and which experiments have been conducted to investigate
this?". The answer is therefore based on historical data, which implies that the
DT is of the history of the drivetrain. We therefore term this a historical digital
twin (Section 3.3).</p>
        <p>The second question set is \What is the best sensor to add to my drivetrain to
measure torque? Should this be a virtual sensor, or one relying on a non-intrusive
sensor?". This question also relies on historical data for sensor validation as well
as for physical modelling of the drivetrain components.</p>
        <p>The third question set asks, \What is an algorithm to detect anomalies in
the torque of the drivetrain, such that a warning signal can be raised?" This
algorithm can be trained on historical data, but will be deployed into a streaming
digital twin (Section 3.4) which is directly connected to the physical system.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Historical Digital Twin</title>
        <p>In the historical DT case, the objective is to utilise the past knowledge of an
organisation to determine which correlations are present in data, or to create
and optimise algorithms to deploy in a streaming DT.</p>
        <p>A traditional process may involve a data scientist mining experimental results
and databases to locate such correlations. This can be a labour-intensive process
if the information is not readily available, such as the data scientist having to
contact a database owner to understand the meaning of the stored data.</p>
        <p>In our approach, the historical DT created is a combination of software and
hardware which is able to access the historical information available in the
knowledge graph, or up-to-date information from the physical drivetrain
system through the data lake. The data scientist can then ask relevant questions to
the DT to obtain the required information in a structured and e ective manner.</p>
        <p>The historical DT is also relevant for questions where the user wishes to model
and simulate the drivetrain itself in high- delity. As discussed in Section 3.2, a
relevant question is whether a sensor can be found that is not too physically
intrusive to be mounted on the drivetrain, or whether a virtual sensor must be
created through combination of readings from other sensors. This relies on an
accurate three-dimensional representation of the drivetrain geometry and of the
considered sensors, such that the models can be combined with expert knowledge
to understand if a particular sensor can be physically inserted into the drivetrain.</p>
        <p>Note that in the historical DT case, the physical system does not need to be
directly connected to the DT. Instead, it is the historical data that is of interest
to the data scientist, and the DT represents the history of the physical system
or the product plus production instead of the current behaviour.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Streaming Digital Twin</title>
        <p>In contrast, the second DT type created for the drivetrain use case is focused
on the actual \live" behaviour of the drivetrain itself. Data ows from the
physical system to the streaming DT in near real-time. That is, the streaming DT
is running alongside the physical system. For example, the drivetrain question
about anomaly detection requires a constantly updated stream of data being fed
as input into a device running alongside the drivetrain. This anomaly detection
model must accurately represent the behaviour of the drivetrain such that all
relevant anomalies are detected with few false positives. A future enhancement
for this use case is also to directly control the motors on the drivetrain using the
streaming DT, such that anomalies are handled appropriately in real-time.</p>
        <p>
          This type of DT focuses more on the relationship between the digital twin
and the physical system. Thus the streaming DT is closer to those DT
envisioned for optimisation and control of a system, where the DT has an automatic
connection to the physical system (see [
          <xref ref-type="bibr" rid="ref10 ref13">10, 13</xref>
          ]). Note that this implies more
technical constraints, such as Internet of Things (IoT) issues (power consumption,
real-time constraints, local computing, etc.) and intrusivity of sensors.
3.5
        </p>
      </sec>
      <sec id="sec-3-5">
        <title>Digital Twin Classi cations</title>
        <p>Note there may not be a hard division between historical and streaming for a
DT, as both relate to information about a physical system. However, we state
that in a historical DT, the \twinning" of the DT focuses on the past data
and organisation's knowledge about a product or process. In other words, the
physical system being represented is the past history of the object in question, and
the organisation's knowledge about it. In the drivetrain example, this historical
DT focuses on the correlation between the current provided to the motors and
the resulting drivetrain torque, o ering answers to questions about properties,
correlations, and experiments. As such, more information from the KG may be
accessed or present within the historical DT. Indeed, a portion of the KG may be
deployed as the historical DT itself. In contrast, a streaming DT is more focused
on the \live" information from the physical system itself in (semi-) real-time.</p>
        <p>Section 5 also discusses how a DT may change classi cation between
historical and streaming through its development, as the relevant questions change and
data from the physical system becomes accessible.</p>
        <p>
          Finally, we wish to emphasise that this classi cation of historical and
streaming DTs is (mostly) orthogonal to the classi cation of Kritzinger et al. [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] of
digital model/shadow/twin. Recall that Kritzinger et al. makes a classi cation
on whether the connection between the physical and virtual systems are manual
or automatic. That is, whether the DT can automatically sense or actuate the
physical system. In contrast, our classi cation is based on whether a DT
answers questions based on data and information from the physical system's past
(including development information), or on the physical system's current state.
        </p>
        <p>Therefore, it is possible to have any combination of historical/streaming and
digital model/shadow/twin, except for a streaming digital model.</p>
        <p>That is, a DT could be created focusing on historical data which
automatically (or not) receives data and automatically (or not) controls the physical
system. For example, a historical DT may be constructed to periodically
examine the experimental data collected and automatically perform new experiments
on the physical system to better understand the behaviour of the system.</p>
        <p>
          As a streaming DT is concerned with the current behaviour of the system, it
must be able to automatically access the state of the physical system. Thus, a
streaming DT must be a digital shadow or digital twin as de ned by Kritzinger
et al. [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], depending on whether there is the possibility to perform automatic
actions on the physical system.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Knowledge Graph</title>
      <p>This section will discuss the knowledge graph in our approach. The concepts
of information hierarchy, heterogeneity, and compartmentalisation will be
discussed. The relation between the user and the knowledge graph as displayed in
Figure 1 is also detailed.
4.1</p>
      <sec id="sec-4-1">
        <title>Knowledge Graph Motivation</title>
        <p>
          As mentioned in Section 1 there is a deluge of heterogeneous data present in
today's large organisations. There has been signi cant work in the past on
understanding data through the use of ontologies and semantic reasoning to provide
context to this data [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. This allows data to be augmented with and transformed
into information, knowledge, and wisdom (DIKW) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], in a (semi-) automated
process [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. This can be visualised as a pyramid, with a wide base of data,
then a layer of information, then knowledge, and a peak of wisdom. This
represents how it becomes more di cult to extract meaning from the lower layers,
and how much more data there will be in an organisation as compared to the
forward-facing wisdom. As a rough mapping of this hierarchy to an industrial
context, data is kept in databases, information is the linking between elements
in a database, knowledge is the meta-data (data about data), and wisdom is any
planning which takes into account the other levels.
        </p>
        <p>One approach to storing this DIKW is with a knowledge graph, which stores
this information as typed and attributed nodes with typed and attributed edges
between them. The regular structure is well-suited to query approaches, and a
great deal of exibility is provided to the organisation to store DIKW their way.</p>
        <p>In particular, the knowledge graph should conceptually support storage of all
relevant DIKW from the organisation, and the connections between. For
example, the knowledge graph may contain formalisation of the assets, experiments,
and conclusions performed by an organisation. This allows for its reuse in current
and future activities.</p>
        <p>Once this knowledge graph is (partially) lled, then it can start to answer
meaningful questions about the people, processes, and things present in the
organisation. An connection to data lakes expands the DIKW and connections
available. New DIKW must also be added over time to support new uses, in an
incremental and consistent manner.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Heterogeneity</title>
        <p>
          One challenge of combining a knowledge graph with industrial digital twins is
that the knowledge graph must handle DIKW which is highly heterogeneous.
For example, the Reference Architectural Model Industrie 4.0 (RAMI 4.0) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]
contains six distinct layers: business, functional, information, communication,
integration, and asset. The knowledge graph could contain DIKW at all six of
these layers and allow for answering questions that span multiple layers.
        </p>
        <p>The DIKW within or referred to by the knowledge graph is also highly
heterogeneous. For example, there may be time series, relational information,
graphbased networks, images, documents, and any other information found in a large
organisation. Another industrially relevant division is whether a piece of DIKW
details the structure of an object, or instead details the behaviour of that
object. For example, an architectural decomposition of a drivetrain refers to the
mechanical components, while the behaviour is given through simulations or
experimental results.
4.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Type Hierarchies</title>
        <p>Conceptually, the knowledge graph contains all DIKW and connections in an
organisation. As this is not feasible, it must be possible for the knowledge graph
to be easily modi ed and evolved, and for connections between concepts to be
very exible. As the knowledge graph grows in richness, the number of concepts
contained must also expand for exibility and increased expressiveness.</p>
        <p>In our knowledge graph approach, we o er the user a typing system that
de nes attributes and relationships for concepts. This allows the user to
specify their own types and instances, and to form typing relationships within the
knowledge graph. An example subset of a knowledge graph is pictured in
Figure 2. The upper box of the gure shows the types, while the lower box displays
the instances. Note the vertical typing relations between these two boxes which
provide the typing information for the instances.</p>
        <p>However, it is also important to ensure consistency within the knowledge
graph and to promote reusability and inter-operation with other sources of
information. Therefore, the use of standardised concepts is suggested wherever
possible. Also present in Figure 2 is the use of standardised and user-created
types. In our approach, we de ne these explicit type hierarchies to provide a
consistent and uni ed way to access and reason about the information contained.</p>
        <p>In our approach, the knowledge graph is divided into built-in/standardised
types and instances, and user-de ned ones. This encourages levels of consensus,
where the upper levels have (potentially) an international consensus, and the
user can de ne organisation-wide types and instances as required.</p>
        <p>
          For example, the current prototype de nes built-in types similar to those
de ned in the Sensor-Observation-Sampling-Actuator ontology (SOSA) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. This
ontology is a W3.org standard suitable to representing sensors, observations, and
experiments in a cyber-physical system. Selecting a subset of this ontology allows
the user to exploit the person-hours taken by others to construct a consistent
and expressive ontology, while retaining the exibility to de ne their own types.
        </p>
        <p>Figure 3 shows an example of type hierarchies implemented within our
prototype knowledge graph. The upper levels are type hierarchies which form the
structural backbone of knowledge graphs. The lower levels are type hierarchies
in the knowledge graph prototype representing useful formalisms. These
hierarchies are not xed, but instead represent that layered approach needed to have
a consensus-based yet exible type system. Of course, the user is able (and
encouraged) to develop their own layers beneath (or above, or alongside...) those
layers shown here to ensure exibility and relevance to the concepts required by
the organisations.</p>
        <p>
          For example, the ValidityFrames type hierarchy in Figure 3 includes types
relating to our previous research in de ning the conditions and degree in which
a model is valid with respect to a real-world system [
          <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
          ]. This could be the
temperature in which a model is valid (such as it is invalid below freezing), or
de ne the set of training data for a model (such that a neural net has only
been trained on daytime images and not night-time images). A type VFModel
is de ned which denotes the model of interest. As this model should also be
contextualised through connection to other modelling and simulation concepts,
the ValidityFrames type hierarchy imports the Modelling and Simulation type
hierarchy (CoreModSim), and the VFModel type is denoted as a sub-type of the
CoreModel type in that type hierarchy.
4.4
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>Data Layer</title>
        <p>For practical reasons, in our approach the knowledge graph cannot contain all
DIKW. This is due to the massive amounts of data incoming from assets in the
organisation, which could quickly overwhelm the storage, reasoning, or querying
capacities of the knowledge graph. Instead, our approach proposes a dedicated
data layer between the knowledge graph and the underlying data stores. Thus
this knowledge graph could be termed as a `virtual' knowledge graph.</p>
        <p>Figure 4 shows our layered knowledge graph approach. At the bottom are the
sources of raw data, such the sensors of machines and organisation databases.
Unique identi ers stored in the knowledge graph point to these sources of
information. For example, the location of a data series may be stored in the knowledge
graph, but not the individual data points themselves. The adapter layer is then
able to resolve both the element type and the unique identi er to determine how
and where to access the underlying data.</p>
        <p>The knowledge graph requires neutral access to these data sources through
a data adapter layer. This layer is intended to shield the knowledge graph from
knowing the particulars of the data storage, which is quite technical and subject
to constant revision. Examples of this include the database API, SQL queries,
or OPC-UA (https://opcfoundation.org) commands necessary to access
information, or the login information for repositories. Separating these more technical
details out from the knowledge graph allows for greater scalability through the
removal of transitory information and better handling of heterogeneous machine
con gurations. Thus, the data layer forms the abstracting barrier between the
virtual knowledge graph (dealing with reasoning concerns) and the technical
data sources (dealing with scalability and storage concerns).</p>
        <p>For example, the knowledge graph may indicate that humidity information
for a particular machine is stored as a HumidityMeasurement instance stored
as a TimeSeries at a particular location. Typing this information allows for
the knowledge graph to indicate constraints on the data, and precisely specify
the type of information stored, while leaving the actual values to be stored
in an appropriate way until explicitly requested by a user or application. This
TimeSeries can be explicitly linked to the Humidity data type, allowing for a user
to directly query, \where is humidity information for this machine?". Further
information and knowledge can also be inferred with further connections, to
indicate the experimental conditions a TimeSeries was produced from or the
Observation procedure used.
4.5</p>
      </sec>
      <sec id="sec-4-5">
        <title>Accessing the Knowledge Graph</title>
        <p>In the overview shown in Figure 1, there is a component which represents the user
accessing the knowledge graph, or an application on their behalf. This application
could be a dashboard, reports, or any other visualisation. Once the information
is placed in the knowledge graph, there must be an easy-to-use method for users
and applications to extract this information. An example could be to ask for
the set of all physical quantities in the knowledge graph, or which data stores
contain temperature information.</p>
        <p>
          As with any API, there are many challenges to provide a scalable,
expressive query interface to the knowledge graph. Currently, we consider the API
speci cation language GraphQL (https://graphql.org) to be a feasible approach.
This query language takes queries de ned in a JSON-like format, and evaluates
the elds requested using de ned resolvers [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. This approach again shields the
user from understanding the underlying technology used to store the knowledge
graph.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Constructing Digital Twins</title>
      <p>This section will discuss the contribution of this paper concerning the
construction of digital twins (DTs) using a knowledge graph (KG). In particular, an
example timeline is shown noting how the historical and streaming DTs are
created from the KG, and in connection with the KG.</p>
      <p>Figure 1 relates the di erent components of our approach. However, the time
dimension is not represented in that gure and will instead be discussed here.
In particular, this section deals with relation C in Figure 1.</p>
      <p>A portion of our current research is to create a work ow for how the KG,
physical system, and DTs could be created either sequentially or in parallel.
However, for brevity, this work will instead present an example timeline, showing
one possible path for creation of all components. Figure 5 demonstrates this
timeline as the construction of a KG, and then di erent types of DTs being
built from it. In this gure, time increases to the right and the arrows between
boxes represent a manual or automatic transfer of DIKW between the KG and
the DTs.</p>
      <p>Fig. 5: Example timeline for Digital Twin creation.
5.1</p>
      <sec id="sec-5-1">
        <title>Building the Knowledge Graph</title>
        <p>The rst step in the construction of the DTs is the building up of the KG itself. In
Figure 5, this happens before any digital twins are built, where the user imports
data, information, knowledge, and wisdom (DIKW) from existing repositories
and creates types and instances. Conceptually, this KG exists throughout the
lifetime of the organisation. As DTs are built, they are built using the knowledge
in the KG, as well as connect to the KG to deposit new DIKW.
5.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Digital Twin Types</title>
        <p>Once the KG is created, a few di erent paths exist for the creation of the DTs.
In the lower-right of Figure 5, a historical DT is used to gather DIKW about
the system, such as designing the most e ective sensor for the drivetrain use
case. Then, a deployment occurs, where the models and sensors are deployed
onto hardware and connected to the physical system.</p>
        <p>On the top-left of Figure 5, a DM is created from the KG to gain insights
about the system. Insights are then relayed back to the KG, with the
communication interleaved with the creation of the DT on the bottom-right. This
represents how the queries to the KG are \snapshots" in time, and how the KG
evolves as information comes back from the DTs.</p>
        <p>Finally, in the top-right of Figure 5, a DS is created which is in connection
with the physical system, and is not reliant on information from the KG. It is
then promoted to a DT which exports DIKW to the KG.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>This paper has presented our insights on the issue of combining a knowledge
graph (KG) layered upon an organisation's data repositories with the process of
constructing Digital Twins (DTs). In particular, the components of this approach
are seen in Figure 1, with an example timeline in Figure 5. These DTs, separated
into historical and streaming, allow answering di erent types of queries such as
about correlations, optimisations, and anomaly detection.</p>
      <p>Our future research involves clarifying the possible work ows for creating
these DTs, their possible architectures, and a classi cation of their features.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This research was supported by Flanders Make, the strategic research center
for the Flemish manufacturing industry, and partially funded by the DTDesign
ICON (Flanders Innovation &amp; Entrepreneurship FM/ICON::HBC.2019.0079)
project.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Abu-Salih</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Domain-speci c knowledge graphs: A survey</article-title>
          .
          <source>Journal of Network and Computer Applications</source>
          <volume>185</volume>
          ,
          <issue>103076</issue>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dalal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , et al.:
          <article-title>Generating digital twin models using knowledge graphs for industrial production lines</article-title>
          .
          <source>In: Workshop on Industrial Knowledge Graphs</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Buchgeher</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabauer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.:
          <article-title>Knowledge graphs in manufacturing and production: A systematic literature review</article-title>
          .
          <source>IEEE Access 9</source>
          ,
          <issue>55537</issue>
          {
          <fpage>55554</fpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Chukkapalli</surname>
            ,
            <given-names>S.S.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mittal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>Ontologies and arti cial intelligence systems for the cooperative smart farming ecosystem</article-title>
          .
          <source>IEEE Access 8</source>
          ,
          <issue>164045</issue>
          {
          <fpage>164064</fpage>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ehrlinger</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , Wo , W.:
          <article-title>Towards a de nition of knowledge graphs</article-title>
          .
          <source>SEMANTiCS (Posters</source>
          , Demos, SuCCESS)
          <volume>48</volume>
          ,
          <issue>1</issue>
          {
          <issue>4</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Grieves</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vickers</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>Digital twin: Mitigating unpredictable, undesirable emergent behavior in complex systems</article-title>
          .
          <source>In: Transdisciplinary perspectives on complex systems</source>
          , pp.
          <volume>85</volume>
          {
          <fpage>113</fpage>
          . Springer (aug
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Hankel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rexroth</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The Reference Architectural Model Industrie 4.0</article-title>
          . ZVEI: Die Elektroindustrie (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perez</surname>
          </string-name>
          , J.:
          <article-title>Semantics and complexity of GraphQL</article-title>
          .
          <source>In: Proceedings of the 2018 World Wide Web Conference</source>
          . pp.
          <volume>1155</volume>
          {
          <issue>1164</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haller</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>SOSA: A lightweight ontology for sensors, observations, samples, and actuators</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>56</volume>
          ,
          <issue>1</issue>
          {
          <fpage>10</fpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kritzinger</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Digital twin in manufacturing: A categorical literature review and classi cation</article-title>
          .
          <source>IFAC-PapersOnLine</source>
          <volume>51</volume>
          (
          <issue>11</issue>
          ),
          <volume>1016</volume>
          {
          <fpage>1022</fpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.ifacol.
          <year>2018</year>
          .
          <volume>08</volume>
          .474
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , et al.:
          <article-title>Framework for manufacturing-tasks semantic modelling and manufacturing-resource recommendation for digital twin shop- oor</article-title>
          .
          <source>Journal of Manufacturing Systems</source>
          <volume>58</volume>
          ,
          <fpage>281</fpage>
          {
          <fpage>292</fpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Madni</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Madni</surname>
            ,
            <given-names>C.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lucero</surname>
          </string-name>
          , S.D.:
          <article-title>Leveraging digital twin technology in model-based systems engineering</article-title>
          .
          <source>Systems</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ),
          <volume>7</volume>
          (jan
          <year>2019</year>
          ). https://doi.org/10.3390/systems7010007
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Oakes</surname>
            ,
            <given-names>B.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsai</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Improving digital twin experience reports</article-title>
          .
          <source>In: Proceedings of the 9th International Conference MODELSWARD</source>
          . pp.
          <volume>179</volume>
          {
          <fpage>190</fpage>
          . INSTICC,
          <string-name>
            <surname>SciTePress</surname>
          </string-name>
          (
          <year>2021</year>
          ). https://doi.org/10.5220/0010236101790190
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Opdahl</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tessem</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Ontologies for nding journalistic angles</article-title>
          .
          <source>Software and Systems Modeling</source>
          <volume>20</volume>
          (
          <issue>1</issue>
          ),
          <volume>71</volume>
          {
          <fpage>87</fpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Rowley</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The wisdom hierarchy: representations of the DIKW hierarchy</article-title>
          .
          <source>Journal of Information Science</source>
          <volume>33</volume>
          (
          <issue>2</issue>
          ),
          <volume>163</volume>
          {
          <fpage>180</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Rozanec</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jinzhi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Towards actionable cognitive digital twins for manufacturing</article-title>
          . In: Second International Workshop on Semantic Digital Twins (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Van Acker</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oakes</surname>
            ,
            <given-names>B.J.</given-names>
          </string-name>
          , et al.:
          <article-title>Validity frame concept as e ort-cutting technique within the veri cation and validation of complex cyber-physical systems</article-title>
          .
          <source>In: Proceedings of the 23rd ACM/IEEE International Conference MODELS-C</source>
          . pp.
          <volume>1</volume>
          {
          <issue>10</issue>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Van Mierlo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oakes</surname>
            ,
            <given-names>B.J.</given-names>
          </string-name>
          , et al.:
          <article-title>Exploring validity frames in practice</article-title>
          .
          <source>In: International Conference on Systems Modelling and Management</source>
          . pp.
          <volume>131</volume>
          {
          <fpage>148</fpage>
          . Springer (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>