<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>D. Bayomie);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Manufacturing Process By Enabling Process Mining on Sensor Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dina Bayomie</string-name>
          <email>dina.sayed.bayomie.sobh@wu.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kate Revoredo</string-name>
          <email>kate.revoredo@hu-berlin.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Bachhofner</string-name>
          <email>stefan.bachhofner@wu.ac.at</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kabul Kurniawan</string-name>
          <email>Kabul.Kurniawan@wu.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elmar Kiesling</string-name>
          <email>elmar.kiesling@ai.wu.ac.at</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Mendling</string-name>
          <email>jan.mendling@hu-berlin.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Austrian Center for Digital Production</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Cairo University</institution>
          ,
          <addr-line>Cairo</addr-line>
          ,
          <country country="EG">Egypt</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Humboldt University</institution>
          ,
          <addr-line>Berlin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Vienna University of Economics and Business (WU)</institution>
          ,
          <addr-line>Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>Typical manufacturing processes involve various machines, each of which may be equipped with a variety of sensors. Digital twins can be used to model how the machines operate and support analysts in issue identification and identifying potential improvements in the process. For a complete view of the status of a machine, however, models need to be enriched to identify patterns over changes in the measurements of sensors and correlations between these sensors. Process mining techniques could be usefully applied in this context, given that they provide descriptive analyses to explain and simulate physical objects based on event logs storing multi-perspective data about the process. However, although sensors generate a vast amount of data about the status of machines on the production floor, they cannot be directly used by process mining techniques. To tackle this issue, we introduce a method that creates a custom event log from sensor data based on the process analysts interests. To this end, we propose diferent encodings for the sensor data. An exploratory experiment using real-life data from an industrial partner shows the efectiveness of our approach.</p>
      </abstract>
      <kwd-group>
        <kwd>Sensor data</kwd>
        <kwd>Event log creation</kwd>
        <kwd>Process mining</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In the manufacturing industry, Digital Twins (DTs) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] play increasingly important roles as
the Industry 4.0 vision becomes reality. DTs facilitate the virtualization of physical objects to
support analysts with an overview of the reality and a visualization of potential improvements
and possible issues to be mitigated. An example of a physical object in a manufacture process is
an injection molding machine used to produce plastic car parts through injection molding. As
Figure 1 depicts, this machine can be equipped with numerous sensors. In this case having a
model that focuses on how this machine operates is not suficient for a full understanding of its
status. In this context, it is beneficial to enrich the
      </p>
      <sec id="sec-1-1">
        <title>DT model with patterns of changes on the</title>
        <p>nEvelop-O
measurements of sensors or correlations between these sensors, which may provide a more
complete view of the machine status.</p>
        <p>
          Process mining techniques [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] are able to provide descriptive analyses to explain and simulate
physical objects. For that event logs storing multi-perspective data about the process are used.
Sensors generate vast amounts of data about the status of machines on the production floor.
However, this data cannot be directly used by process mining techniques, given that sensor
measurements are structured as time serieses whereas event logs store the occurrence of discrete
events of a process over time.
        </p>
        <p>In this paper, we introduce a method that creates a custom event log from sensor data based
on the process analysts interests. To this end, we explore multiple options to encode sensor
measurements as process events and a set of techniques to group these events into cases, i.e.,
sequences of correlated events. We conduct exploratory experiments using real-life use case
form our industrial partner Farplas. The experiments use specific encodings of the sensor data
into an event log, which allow the use of process mining techniques for root-cause analysis and
pattern discovery. The results shows the efectiveness of our approach.</p>
        <p>The remainder of this paper is organized as follows. We discuss prior work in Section 2,
describe our method in 3, and evaluate our method and discuss the findings in Section 4. Finally,
we conclude our work and provide some future research directions in Section 5.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        Various approaches to leverage process mining to support the analysis and modeling of real
machines running on the shopfloor have been introduced in the context of Industry 4.0 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], an approach to integrate Digital Shadows to analyze shopfloor-level manufacturing
processes is proposed. The diferent materials and sub-parts of a product are considered for
the analysis. The integration of process events with structural data allows process mining
techniques to learn an enriched process model. In this work, we consider a single machine with
various sensors and focus on the change in the measurements of these sensors to define process
events. Our goal is to understand the correlation among these process events and how their
interactions afect the output of the machine.
      </p>
      <p>
        [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] addresses the scenario of using Digital Twins of organizations for business process
improvement. To this end, process mining techniques are used to evaluate violations of constraints
and produce required actions. A digital twin interface model is presented to make the current
state of the business processes transparent, allowing the process analyst to visualize potential
improvements to the business process.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], an approach to transform sensor data into an event log is presented. The approach
maps sensor measurements taken when users interact with smart products to human activities
and grouping them into cases. The goal is to use the event log generated to discover models of
human behavior. To this end, sensor data is encoded into an event log by segmenting the sensor
measurements considering a fixed time window. Then the segments or groups of segments
(when common characteristics are identified) are labeled as activities. The labeling process is
performed manually by domain experts. Activities are grouped into cases based on the goal of
the research, which is the relation of the sensor data with the user interaction. Therefore, each
interaction with a user defines a new case. In the current work, we explore diferent ways for
segmenting the sensor measurements and to create a case.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], a method for analyzing time series data to be used in decision points in a process model
is proposed. The approach learns a process model from an event log and if there are decisions
in the model based on numeric attributes, then time series analysis is used to be considered
at the decision point. In our work, we do not focus on decisions made based on the values of
sensor data. Instead, we focus on investigating the changes in the sensor data to support the
understanding of the process outcome.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] a method for finding the interaction between sensor data and process knowledge is
used to construct an event log. The sensor data used was the location of a specific object (e.g.,
the location of a patient in a hospital). In the current work, we focus on encoding diferent
sensor data as an event log.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>
        In this section, we describe our method for creating an event log from sensor data, which we
henceforth call EL-SD. It is inspired by the data science model [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>Figure 2 illustrates the EL-SD process; the main data input is the sensor data, and the output
is an event log. To create the event log, EL-SD relies on two main steps: (i) a pre-processing
step in which the data is cleaned and an appropriate subset is selected, and (ii) an event log
building step based on the process analyst’s interest and the pre-processed data.</p>
      <p>As shown in Figure 1, diferent types of sensors are associated with machines on the
production floor. Each of these sensors provide time series data about the machine’s status at a
given time. Table 1 provides an example of the extracted entries of sensor data over time. The
sensor data are raw data that track the machine status over time; Notice that ”row #” column
is a table pointer and does not denote the entry id. For example, in the second row, the time
of the sensor reading is ”2021-01-04 17:57:00 ”, the temperature reading is 300.5, the pressure
sensor reading is 222, the location sensor reading is L1, the volume sensor reading is 34.1, the
Data preprocessing</p>
      <p>Event log construction
time sensor reading is 35.0, and the speed sensor reading is 100.</p>
      <sec id="sec-3-1">
        <title>3.1. Data Preprocessing</title>
        <p>The first step is a preprocesssing step to prepare the data in terms of data quality and data
selection.</p>
        <p>
          Data cleaning Time series data are essential in industry, where all kinds of sensor devices
capture data from the industrial environment continuously. The time series data collected is
typically large and afected by the limited reliability of sensor devices [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Consequently, data
cleaning is an essential step [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] before subsequent analysis. Key quality issues with the sensor
data are (i) unannotated data, i.e., the extracted data is not properly related to sensor metadata
(e.g., sensor name, type etc.) (ii) Missing (null) values due to incomplete sensor readings. There
are several techniques to handle such issues (cf. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]).
        </p>
        <p>Data Selection Due to the large number of sensors that production machines are typically
instrumented with, the extracted sensor data is typically enormous. To perform meaningful
analyses, process analysts need to select data relevant for their analysis. The selection step
may consist in slicing the data based on a time window, selecting specific sensors to analyze, or
both. For example, suppose the analyst wants to understand the relation between temperature,
pressure, and speed of the product produced on a machine. In that case, they can select the
entries related to the respective sensors.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Event Log Construction</title>
        <p>The second step of the EL-SD process consists in building an event log from the pre-processed
time series sensor data.
3.2.1. Event log
In this section, we define the output of our method in terms with respect to events, cases and
the constructed event log.</p>
        <p>Definition 1 (Event). An event  represents a state in the system execution. An event has a
set of attributes that describe it, mainly the activity attribute that defines what happens in this
state and the timestamp attribute that marks when the state is happening.</p>
        <p>Definition 2 (Case). A case  = ⟨  1, … ,    ⟩ is a finite sequence of length  of events    with 1 ⩽
 ⩽  induced by ≼, i.e., such that    ≼    for every  ⩽  ⩽  . A case groups events to describe
a specific object.</p>
        <p>Definition 3 (Event log). An event log  = { 1, … ,   } is a finite non-empty set of
nonoverlapping cases, i.e., if  ∈   , then  ∉   for all ,  ∈ [1 … ] ,  ≠  .
3.2.2. Event encoding
There are various options for defining what constitutes an event in the context of sensor data
that can help process analysts. As per Definition 1, an event describes the status of the system
execution, therefore the analyst can determine how to represent this status. We propose five
options to define an event and mainly express the event activity.</p>
        <p>The first and naïve option is that every entry is an event representing the system status at this
point. The event represents one activity (”Record status”). This encoding would be interesting
for analyzing the data perspective over sensor data by considering all of them as event data
for the log using root-cause analysis techniques over the data. Table 3 shows an example of
encoding the events in which an event represents the activity ”system status”, and all the sensor
data are attributes that describe this status.</p>
        <p>The second option encodes the events to represent changes in the system over each sensor
between two successive readings of sensor data. Thus, we would have event activities of ”change
in sensor x” for each sensor that changed over the readings. Also, we maintain the status of the
changes over the sensor data, whether it is increasing or decreasing. This encoding helps to
analyze the detailed changing patterns over sensor data readings. Table 4 shows an example
of encoding the events in which an event represents the change over the sensor data, and the
attributes describe this change in terms of the previous value, i.e., ”pValue” attribute, new value,
i.e., ”nValue” attribute and status attribute that indicate whether the sensor reading is increasing
or decreasing. Notice that we do not state any change status for non-numerical sensor such as
location (Loc) sensor.</p>
        <p>The third option encodes the events similar to the previous encoding. However, an event
represents the change over each sensor between two successive readings of sensor data when
the change exceeds a given threshold. Thus, we would have event activities of ”change in sensor
x” for each sensor within the readings such that the change over the two successive readings
exceeds the analyst threshold for this sensor. Also, we maintain the previous and new values of
the exceeding change and the status of the changes over the sensor data, whether increasing
or decreasing. This encoding helps analyze to analyze interesting changing points within the
sensor data readings based on the analyst thresholds. Table 5 shows an example of encoding
the events based on changing threshold. The number of events generated using the threshold is
less than that generated using the second encoding in Table 4 over the same sensor readings.
Therefore, analysts can focus on changing points instead of an excessive number of granular
changes.</p>
        <p>The fourth option encodes the events to represent an aggregation overview of numerical
sensor readings over a given time window specified by the analyst. There are two possibilities
for event activities.</p>
        <p>First, we may conceive event activities as ”changing in sensor behavior on average x” for
each sensor that changed over the readings during the time window. Also, we maintain the
minimum, average, and maximum values of the changes. This encoding can be combined with
the third encoding option by allowing the analyst to provide change thresholds for the sensors.
Table 6 shows an example of encoding the average changes of each numerical sensor over a
one-day time window.</p>
        <p>The second possibility is similar to the first option, setting the event activity to ”Record the
aggregate status”. Then, we take the average of the sensor readings during the time window for
each numerical sensor and consider them as event data attributes. Table 7 shows an example of
encoding the average values of each numerical sensor over a one-day time window.</p>
        <p>The idea of this encoding is to provide an aggregated view of the enormous amount of sensor
data. Moreover, allow the analyst to see the changes over the aggregated values.</p>
        <p>The fith option encodes the events to represent the change in the system over each sensor
between successive readings of sensor data. Unlike the second encoding, however, it has three
diferent event activities that alter based on the changing behavior, such that (i) if the sensor
data is non-numerical, then it is ”change in sensor x”; (ii) if it is numerical and the change is
increasing, then the event activity is ”Increase in sensor x”, and finally, (iii) if it is numerical and
decreasing then it is ”Decrease in sensor x”. Also, we maintain the previous and new values of
the changes over the sensor readings. This encoding can be combined with the third encoding
option by allowing the analyst to provide change thresholds for the sensors.</p>
        <p>As shown in Table 8, the first event indicates an increase in temperature sensor reading from
300 to 300.5.
Sensor data
EL-SD
method
Event log</p>
        <p>Process
Analytics
3.2.3. Case encoding
After defining the event notation, we need to define the case notation to group the events and
generate the logs for further analysis. There are several ways to group the events to formulate
a case. We present two possible case encodings that can be combined with any of the event
encodings presented in the previous section.</p>
        <p>The first option is grouping the events to represent the change over two successive sensor
readings. As shown in Table 2, the events are grouped based on the change over the successive
data entries.</p>
        <p>The second option is grouping the events based on the time window so that all events that
occur within the same time window belong to the same case. For example, all events that occur
on the same day in Table 6 have the same case id, so there will be two cases over these events.</p>
        <p>Generating an event log allows the process analysts to use diferent process analytic
techniques that improve the understanding and explainable of the manufacture process status.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <sec id="sec-4-1">
        <title>4.1. Dataset</title>
        <p>We used one dataset from our industry partner in Teaming AI project, Farplas. Farplas is a full
system solutions partner for the automotive industry.Farplas researches, develops, and
manufactures superior automotive polymer systems, provides innovative solutions, and implements
state-of-the-art technologies.We use their sensor data that describe the status of the production
lfoor machines for polymer systems production, such as temperature, pressure, and volume</p>
        <sec id="sec-4-1-1">
          <title>1Build event log https://github.com/DinaBayomie/GenerateEventLogFromSensorData</title>
          <p>2https://www.fluxicon.com/disco/
3https://github.com/DinaBayomie/EL-RM
(a) Cases per changes over detailed change (b) Cases per day over detailed changes with
events diferent activities events
sensors. The dataset contains 48 sensors with three non-numerical sensors and one quality
detection sensor data.</p>
          <p>We conducted two exploratory experiments with diferent analysis objectives to explore the
usefulness of our method and the efectiveness of creating event logs from sensor data. Each
experiment addresses an analysis scenario over the data to explore the benefits of having a
specific event log that meets the process analyst’s interests.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Scenario 1: Change Pattern Analysis</title>
        <p>The first experiment explores the sensor data using process discovery techniques to understand
the changing patterns over the sensors. We generated two event logs using diferent event and
case encodings. Then, we used Disco to discover the process model over these logs.</p>
        <p>Figure 4 shows the process models discovered from the two generated event logs. Figure 4a
illustrates the most changing patterns given that a case contains the events generated over
changes of two successor entries. Using this encoding, the analyst can easily capture the most
frequent sensors that change almost every reading. That helps to understand the physical state
of the manufacturing machines over every reading by providing a virtual model that represents
real-time changes over the sensors.</p>
        <p>Figure 4b depicts the changing patterns over 22 days, given that each day represented a case.
An event describes the changes over two successor entries with three possible activities that
clarify the changes over the sensor readings (see the fith encoding option in subsubsection 3.2.2).
Using this encoding, the analyst can quickly grasp the most regular sensors that change over
the days. Also, the analyst can see the changing behavior in terms of increasing or decreasing
the sensors’ readings and how they afect each other. That helps to understand the physical
interaction between the diferent environmental parameters, e.g., temperature, pressure, and
speed, of the manufacturing machines over the day by modeling a virtual model that represents
the daily behavior of the sensors.</p>
        <p>Using the process models, analysts can investigate various patterns of the sensors. Being able
to use diferent encodings allows them to explore the sensor day from multiple perspectives.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Scenario 2: Root cause Analysis</title>
        <p>
          The second experiment performs a root-cause analysis to understand the relationship between
the sensor data and the quality detection data. We generated one event log in which the case
contained events that occurred on the same day. We encoded the events following the first
encoding insubsubsection 3.2.2 in which each entry represents an event in order to analyze
the sensor data from a data perspective. Then, we discovered the association rules over these
logs using EL-RM[
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. We focus on the association rules that concentrate on the detection
sensor data. Therefore, we select only the rules that include the not-ok quality detection data
as a consequence of the rule. Following that, EL-RM discovered 20 association rules with a
confidence of 0.9 that show a possible association between the changes over the sensors and the
not-ok quality detection data. Using this encoding allows the analyst to conduct a root-cause
analysis and gaining insights into the influence of the machine status from the sensor data over
the quality detection control.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Discussion</title>
        <p>
          As shown in both exploratory experiments, generating an event log from the sensor data supports
creating a virtual model to understand the manufacturing of physical objects. Moreover, it
allows the analysts to explore the sensor data using process analysis techniques that investigate
it from a new perspective other than the time-series analysis. Also, it enriches the event model
of Teaming.AI [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] which helps understand and support the AI agent.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In this paper, we proposed a method (EL-SD) for creating an event log from sensor data. Our
method consists of two steps: a pre-processing step allows the analysts to prepare the data by
applying data cleaning and selection techniques; the main step then creates the log based on
the event notation and case notation specified by the analyst. We provide five event encoding
options and two case encoding options. The results of our exploratory case scenarios show
the potential of the method to investigate the sensor data from diferent perspectives and
provide new insights into the production floor. Moreover, it constructs a virtual model that can
contribute to accurate digital twins that capture the dynamic behavior manufacturing machines.
As future work, we will investigate diferent ways to define the event activities such that they
reflect the production process.
This work received funding from the Teaming.AI project in the European Union’s Horizon 2020
research and innovation program under grant agreement No 95740. The work of J. Mendling
was supported by the Einstein Foundation Berlin.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Semeraro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lezoche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Panetto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dassisti</surname>
          </string-name>
          ,
          <article-title>Digital twin paradigm: A systematic literature review</article-title>
          ,
          <source>Comput. Ind</source>
          .
          <volume>130</volume>
          (
          <year>2021</year>
          )
          <fpage>103469</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>W. M. P. van der Aalst</surname>
          </string-name>
          ,
          <source>Process Mining - Data Science in Action, Second Edition</source>
          , Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dreher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Reimann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gröger</surname>
          </string-name>
          ,
          <article-title>Application fields and research gaps of process mining in manufacturing companies</article-title>
          , in: R. H.
          <string-name>
            <surname>Reussner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Koziolek</surname>
          </string-name>
          , R. Heinrich (Eds.),
          <source>INFORMATIK</source>
          <year>2020</year>
          ,
          <article-title>Gesellschaft für Informatik</article-title>
          , Bonn,
          <year>2021</year>
          , pp.
          <fpage>621</fpage>
          -
          <lpage>634</lpage>
          . doi:
          <volume>10</volume>
          .18420/inf2020_
          <fpage>55</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Brockhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. U.</given-names>
            <surname>Seran</surname>
          </string-name>
          , W. van der Aalst,
          <article-title>Modeling digital shadows in manufacturing by using process mining</article-title>
          ,
          <source>Modellierung 2022 Satellite Events</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>G.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. van der Aalst</surname>
          </string-name>
          ,
          <article-title>Realizing A digital twin of an organization using actionoriented process mining</article-title>
          , in: ICPM, IEEE,
          <year>2021</year>
          , pp.
          <fpage>104</fpage>
          -
          <lpage>111</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>M. L. van Eck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sidorova</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. van der Aalst</surname>
          </string-name>
          ,
          <article-title>Enabling process mining on sensor data from smart products</article-title>
          ,
          <source>in: 2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Dunkl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rinderle-Ma</surname>
          </string-name>
          , W. Grossmann,
          <string-name>
            <given-names>K.</given-names>
            <surname>Anton Fröschl</surname>
          </string-name>
          ,
          <article-title>A method for analyzing time series data in process mining: Application and extension of decision point analysis</article-title>
          , in: S.
          <string-name>
            <surname>Nurcan</surname>
          </string-name>
          , E. Pimenidis (Eds.),
          <source>Information Systems Engineering in Complex Environments</source>
          , Springer International Publishing, Cham,
          <year>2015</year>
          , pp.
          <fpage>68</fpage>
          -
          <lpage>84</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Senderovich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rogge-Solti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mandelbaum</surname>
          </string-name>
          ,
          <article-title>The ROAD from sensor data to process instances via interaction mining</article-title>
          ,
          <source>in: CAiSE</source>
          , volume
          <volume>9694</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2016</year>
          , pp.
          <fpage>257</fpage>
          -
          <lpage>273</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Souibgui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Atigui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zammali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cherfi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Yahia</surname>
          </string-name>
          ,
          <article-title>Data quality in etl process: A preliminary study</article-title>
          ,
          <source>Procedia Computer Science</source>
          <volume>159</volume>
          (
          <year>2019</year>
          )
          <fpage>676</fpage>
          -
          <lpage>687</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Jefery</surname>
          </string-name>
          , G. Alonso,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          ,
          <article-title>Declarative support for sensor data cleaning</article-title>
          ,
          <source>in: International Conference on Pervasive Computing</source>
          , Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Time series data cleaning: A survey</article-title>
          ,
          <source>Ieee Access</source>
          <volume>8</volume>
          (
          <year>2019</year>
          )
          <fpage>1866</fpage>
          -
          <lpage>1881</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>E. G. L. de Murillas</surname>
            ,
            <given-names>H. A.</given-names>
          </string-name>
          <string-name>
            <surname>Reijers</surname>
            ,
            <given-names>W. M. van der Aalst</given-names>
          </string-name>
          ,
          <source>Case notion discovery and recommendation: automated event log building on databases, Knowledge and Information Systems</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bayomie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Revoredo</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          <article-title>Mendling, Multi-perspective process analysis: Mining the association between control flow and data objects</article-title>
          ,
          <source>in: CAiSE</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>72</fpage>
          -
          <lpage>89</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hoch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Heinzl</surname>
          </string-name>
          , G. Czech,
          <string-name>
            <given-names>M.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Waibel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bachhofner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kiesling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Moser</surname>
          </string-name>
          , Teaming.
          <article-title>ai: enabling human-ai teaming intelligence in manufacturing</article-title>
          ,
          <source>in: 11th International Conference on Interoperability for Enterprise Systems and Applications</source>
          , volume
          <volume>3214</volume>
          ,
          <year>2022</year>
          , p.
          <fpage>0073</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>