<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Value Behaviour Analysis of Event Attributes in Process Mining (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jonas Cremerius</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Hasso Plattner Institute, University of Potsdam</institution>
          ,
          <addr-line>Potsdam</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>4</fpage>
      <lpage>7</lpage>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Discovering and analysing business processes are important tasks for organizations. Process
mining bridges the gap between process management and data science by discovering process
models using event logs derived from real-world data [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Besides mandatory event attributes
like case identifier, activity, and timestamp, additional event attributes can be present, such as
human resources, costs, and laboratory values.
      </p>
      <p>
        Event attributes represent a valuable data-source to understand the process under
investigation. For example, read/write operations of activities can be identified and visualized in a
discovered process model [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Additionally, event attributes can be used in the field of process
outcome prediction as additional features [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>However, we see a lack in the analysis of the actual value behaviour of event attributes. We
argue, that event attributes can be associated to multiple events representing diferent activities.
How event attribute values difer among activities and how they change within the process is
what we define as value behaviour analysis.</p>
      <p>Especially in healthcare processes, a huge amount of data is generated, such as vital signs
or laboratory measurements representing a patient’s wellbeing. When analysing treatment
processes, one might be interested in how these measurements develop depending on which
treatment activities were conducted. This can help to evaluate, for example, diferent treatment
paths in the sense, that one path might result in better patient wellbeing than others.</p>
      <p>This problem leads us to the following research question: How can we analyse the value
behaviour of event attributes?</p>
      <p>
        In this PhD project, I focus on healthcare processes extracted from the Medical Information
Mart for Intensive Care (MIMIC-IV) database, which provides a holistic representation of the
hospital treatment process of patients undergoing diferent treatments [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Nevertheless, the
developed methods can be applied to diferent domains, such as manufacturing, where event
attributes are available.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>The analysis of event attributes has been approached from diferent perspectives in the literature.</p>
      <p>
        A prominent application is the identification of decision rules, such as in data-aware heuristic
mining [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Regarding the exploration of event attributes, the multi-perspective process explorer
allows investigating the distribution of each event attribute at each activity [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the
access to event attributes is described and annotated to the process model, describing the data
object lifecycle of each event attribute. The field of predictive process analytics makes use of
event attributes as well, by incorporating them in diferent machine learning models, such as
process outcome prediction [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Event attributes have also been classified in several ways for diferent analysis purposes.
For instance, [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] classifies event attributes into controllable and non-controllable, where the
ifrst one can be altered during a process execution and the other cannot. Another contribution
classifies event attributes according to a medical standard (openEHR) to find dependencies
within and between these categories [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>Even though it has been identified, that event attributes can be altered during the process, its
analysis is limited to read/write access or distribution visualization at single activities. Therefore,
this PhD project aims to provide diferent methods to understand the value behaviour of event
attributes throughout the process.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Research Roadmap</title>
      <p>The doctoral thesis will provide a framework for the value behaviour analysis of event attributes
following a design science approach. In this section, we present the current state of research
and plans for enhancing the methodology.</p>
      <sec id="sec-3-1">
        <title>3.1. Data-Enhanced Process Models</title>
        <p>The beginning of the value behaviour analysis of event attributes started with the question, if it
is possible to increase the visibility of event attributes in process models. To achieve that, we
designed Data-Enhanced Process Models, which allow calculating and displaying event attribute
aggregations for each activity in the process model. For example, if a patient is registered at the
beginning of treatment, an activity could include the patient’s age as an event attribute. One
can now calculate the mean age as an event attribute aggregation and represent it directly in
the process model.</p>
        <p>
          As there might be lots of event attributes available in an event log, we further developed three
mechanisms to support event attribute selection in Data-Enhanced Process Models. These are
based on the data type, process characteristic, and degree of variability of event attributes [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Change Detection in Dynamic Event Attributes</title>
        <p>
          With Data-Enhanced Process Models, we allow exploring the value behaviour of event attributes
over the process. However, it is not possible to make concrete statements regarding their value
behaviour. In the previous work, we defined the process characteristic dynamic for event
attributes [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. Dynamic event attributes change throughout the process and are associated to
multiple activities in an event log. These are especially interesting, as their changing behaviour
can be analysed. For example, when patients undergo a certain treatment, such as dialysis,
certain laboratory values are expected to change from beginning until end of treatment, such as
the creatinine value [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>
          Therefore, we developed a method based on statistical significance tests, allowing to identify
changes of dynamic event attributes throughout the process [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. It allows identifying changes
in three dimensions: the event attribute, directly and eventually follows relations, and trace
variants. Thus, we are able to provide detailed information about the location of changes in a
process-oriented way, so we can say, that an event attribute is changing between two activities
in certain trace variants.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Patient Discharge Classification based on the Hospital Treatment</title>
      </sec>
      <sec id="sec-3-4">
        <title>Process</title>
        <p>The analysis methods provided so far allow retrieving novel insights about the value behaviour
of event attributes. In addition to that, we wanted to demonstrate how the process behaviour of
the values can be utilized in the field of predictive process analytics in healthcare.</p>
        <p>
          Predicting the discharge location of patients after hospital treatment is a popular research topic.
So far, the status of the patient at the end of treatment is used as input for most models. However,
the treatment process has not been considered yet. Our approach proposes to incorporate the
conducted treatment activities and development of the patient’s state in form of event attributes
to make the decision of the most suitable discharge location, such as home or skilled nursing
facility [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. We could demonstrate, that the development of the patient’s state during the
process and the respective activities in the hospital have a considerable impact on the discharge
location prediction.
        </p>
      </sec>
      <sec id="sec-3-5">
        <title>3.4. Validation Method</title>
        <p>The approaches presented are validated on MIMIC-IV, providing real-world healthcare process
data. Thus, we generated reproducible event logs out of MIMIC-IV and applied the proposed
methods on them. In addition to that, we consulted medical experts to validate the correctness
of the findings. It is planned to validate the results on a second dataset with event attributes
available, such as the Sepsis event log. Furthermore, we want to demonstrate, that the approach
is able to solve concrete problems by conducting case studies.</p>
      </sec>
      <sec id="sec-3-6">
        <title>3.5. Next Steps</title>
        <p>
          Our plan is to extend the methods to analyse the value behaviour of event attributes. One plan is
to identify correlating event attribute changes, so we can say, that two or more event attributes
are changing together at the same location. It would also be interesting to cluster cases by their
changing behaviour, such that it is possible to characterize changing and non-changing cases.
Furthermore, we think that there is potential in the field of process variant analysis, where the
value behaviour can be compared between multiple process variants. As this thesis focusses on
healthcare processes, we also would like to cope with the challenges presented in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], where
we, for example, focus on dealing with reality by looking at real-world data and aiming to
provide white-box approaches which are understandable for medical experts.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>In this doctoral thesis, we want to enable process analysts and domain experts to understand the
value behaviour of event attributes within a process. To achieve that, we propose a framework
with diferent methods to look at the value behaviour from diferent perspectives. This can help
to confirm expected behaviour, but also allows deriving novel insights about the data associated
to process activities. The results are validated based on real-world data and domain expertise.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          , Process Mining, Springer Berlin Heidelberg,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M. de Leoni</surname>
          </string-name>
          , W. van der Aalst,
          <article-title>Data-aware process mining: Discovering decisions in processes using alignments</article-title>
          ,
          <source>in: Proceedings of the 28th Annual ACM Symposium on Applied Computing</source>
          , SAC '13,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2013</year>
          , p.
          <fpage>1454</fpage>
          -
          <lpage>1461</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Leontjeva</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kuzovkin</surname>
          </string-name>
          ,
          <article-title>Combining static and dynamic features for multivariate sequence classification</article-title>
          ,
          <year>2016</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , L. Bulgarelli,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pollard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Horng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Celi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mark</surname>
          </string-name>
          , Mimic-iv,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>F.</given-names>
            <surname>Mannhardt</surname>
          </string-name>
          , M. de Leoni, H. Reijers,
          <article-title>The multi-perspective process explorer</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>1418</volume>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zerbato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Weber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          ,
          <article-title>Enhancing discovered process models with data object lifecycles</article-title>
          ,
          <source>in: 2021 IEEE 25th International Enterprise Distributed Object Computing Conference (EDOC)</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>124</fpage>
          -
          <lpage>133</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Bozorgi</surname>
          </string-name>
          , I. Teinemaa,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. La</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <article-title>Process mining meets causal machine learning: Discovering causal rules from event logs</article-title>
          ,
          <year>2020</year>
          , pp.
          <fpage>129</fpage>
          -
          <lpage>136</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Modeling clinical activities based on multiperspective declarative process mining with openEHR's characteristic</article-title>
          ,
          <source>BMC Medical Informatics and Decision Making</source>
          <volume>20</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cremerius</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          ,
          <article-title>Supporting domain data selection in data-enhanced process models</article-title>
          ,
          <source>in: Wirtschaftsinformatik 2022 Proceedings 3</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Amin</surname>
          </string-name>
          , R. T. Mahmood,
          <string-name>
            <given-names>M.</given-names>
            <surname>Asad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Noorulamin</surname>
          </string-name>
          ,
          <article-title>Evaluating urea and creatinine levels in chronic renal failure pre and post dialysis: A prospective study 2 (</article-title>
          <year>2014</year>
          )
          <fpage>2330</fpage>
          -
          <lpage>4596</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cremerius</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          ,
          <article-title>Change detection in dynamic event attributes</article-title>
          , in: C. Di
          <string-name>
            <surname>Ciccio</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Dijkman</surname>
            ,
            <given-names>A. del Río</given-names>
          </string-name>
          <string-name>
            <surname>Ortega</surname>
          </string-name>
          , S. Rinderle-Ma (Eds.),
          <source>Business Process Management Forum</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>157</fpage>
          -
          <lpage>172</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cremerius</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>König</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Warmuth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          ,
          <article-title>Patient discharge classification based on the hospital treatment process</article-title>
          , in: J.
          <string-name>
            <surname>Munoz-Gama</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          Lu (Eds.),
          <source>Process Mining Workshops</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>314</fpage>
          -
          <lpage>326</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Munoz-Gama</surname>
          </string-name>
          , et al.,
          <article-title>Process mining for healthcare: Characteristics and challenges</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>127</volume>
          (
          <year>2022</year>
          )
          <fpage>103994</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>