<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Responsible Process Prescriptive Analytics (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alessandro Padella</string-name>
          <email>alessandro.padella@phd.unipd.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bolzano 2022, International Conference on Process Mining</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Process Mining</institution>
          ,
          <addr-line>Prescriptive Analytics, Recommender Systems, Explainable AI, Process Improvement</addr-line>
        </aff>
      </contrib-group>
      <fpage>41</fpage>
      <lpage>45</lpage>
      <abstract>
        <p>Within the realm of Process Mining, Process-Aware Recommender systems (PAR systems) are information systems that aim to monitor process executions, predict their future behaviour, and find optimal corrective actions to reduce the risk of failure or to maximize a given reference Key Performance Indicator (KPI). A PAR system comprises a Predictive Analytics part and a Prescriptive Analytics part. The latter provides recommendations for fixing the execution of process instances that are predicted to have undesired KPI values. While research has so far focused on generating recommendations that achieve ever better KPI values, my Ph.D. aims to combine high-scoring recommendations with feasibility and fairness.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>https://github.com/Pado123 (A. Padella)</p>
      <p>© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR</p>
    </sec>
    <sec id="sec-2">
      <title>2. Research Questions</title>
      <p>As mentioned, a PAR system’s starting point is defining the process outcomes. The outcome
of a process is often measured through a customizable KPI function that, given an execution
recorded in a log trace (which describes the life-cycle of a particular process instance, i.e., a
case), looks at the activities executed and attribute values, returning a KPI value (e.g. the total
execution time or the total cost of the procedure).</p>
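      <p>To make the notion of a KPI function concrete, the following Python fragment is a minimal sketch of two such functions computed over a single trace. It is an illustration only: the Event class, its fields, the two function names and the sample trace are assumptions introduced here and are not part of any existing framework.</p>
      <preformat><![CDATA[
```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Event:
    """One recorded step of a case (hypothetical structure)."""
    activity: str
    timestamp: datetime
    cost: float = 0.0


def total_time_hours(trace):
    """KPI: hours elapsed between the earliest and the latest event of the trace."""
    if not trace:
        return 0.0
    start = min(e.timestamp for e in trace)
    end = max(e.timestamp for e in trace)
    return (end - start).total_seconds() / 3600.0


def total_cost(trace):
    """KPI: sum of the cost attribute over all events of the trace."""
    return sum(e.cost for e in trace)


# Illustrative trace of one case; activities, timestamps and costs are made up.
trace = [
    Event("Open Request", datetime(2022, 1, 10, 9, 0), cost=5.0),
    Event("Check Documents", datetime(2022, 1, 10, 15, 0), cost=12.5),
    Event("Send Letter", datetime(2022, 1, 11, 9, 0), cost=2.5),
]

print(total_time_hours(trace), total_cost(trace))
```
]]></preformat>
      <p>Both functions take the same trace and return a single number, so either one can be plugged in as the reference KPI without changing the rest of the pipeline.</p>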
      <p>Several PAR systems are limited to recommending which activity to perform next,
irrespective of whether that activity is possible at that moment in time.
Indeed, human process actors and other types of resources may not be available. Furthermore,
the impact of the recommended activity on the process outcome may depend on the actual
resource that performs the activity. From this, the following research question arises.</p>
      <p>Research question 1: How can we build a prescriptive business process analytics block that
can provide more feasible recommendations?</p>
      <p>We therefore want to design a recommendation framework for Prescriptive Analytics that provides
recommendations on combinations of activities and resources. It should analyze the past
executions of the process and exploit machine (or deep) learning techniques to generate
recommendations.</p>
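      <p>The idea of recommending combinations of activities and resources can be sketched as follows. This is a simplified illustration under stated assumptions, not the framework itself: the function recommend, the dictionary of predicted KPI values, and the resource names are hypothetical, and we assume that a lower KPI value (e.g. total time) is better.</p>
      <preformat><![CDATA[
```python
def recommend(predicted_kpi, available_resources):
    """Return the feasible (activity, resource) pair with the lowest predicted KPI.

    predicted_kpi maps (activity, resource) pairs to a predicted KPI value,
    where lower is assumed to be better (e.g. total time in hours).
    Pairs whose resource is not currently available are skipped.
    Returns None when no pair is feasible.
    """
    feasible = {
        pair: kpi
        for pair, kpi in predicted_kpi.items()
        if pair[1] in available_resources
    }
    if not feasible:
        return None
    return min(feasible, key=feasible.get)


# Hypothetical predictions for one running case.
predicted_kpi = {
    ("Send Letter", "Alice"): 210.0,
    ("Send Letter", "Bob"): 260.0,
    ("Call Customer", "Carol"): 240.0,
}

print(recommend(predicted_kpi, {"Alice", "Bob", "Carol"}))
print(recommend(predicted_kpi, {"Bob", "Carol"}))
```
]]></preformat>
      <p>When Alice is unavailable, the best-scoring pair is no longer feasible and a different activity is recommended, which is the behaviour that an activity-only recommender cannot express.</p>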
      <p>However, from the perspective of business reality, this approach rests on a flawed assumption,
namely that past executions can be repeated at the moment in which we provide recommendations.</p>
      <p>Each company has its own set of rules and protocols and a pool of human resources. Over
time and as the company evolves, these may change (e.g. some employees may resign, or some
machines may be changed).</p>
      <p>The challenge then becomes to make the framework capable of understanding and
incorporating changes in organisational structures, thus avoiding making recommendations that are
not actually executable. This leads to the second research question.</p>
      <p>Research question 2: How can we ensure that the context in which recommendations are
generated and the context in which they are provided are compliant?</p>
      <p>
        Works such as [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] illustrate how some datasets, and thus the models built on them, may be biased,
especially regarding race and gender. The same problem can occur for PAR systems: they may, e.g.,
recommend corrective actions only for cases with given characteristics. Also, PAR systems might
decide to allocate activities only to certain process actors who perform better, thus
ultimately causing them to be overloaded while others are seldom employed. This gives rise to
our third research question.
      </p>
      <p>Research question 3: Are the recommendations we are providing fair?</p>
      <p>By addressing the research questions above, we focus on the quality of recommendations,
answering the question “Who should perform what to improve our KPI?”, with the aim of obtaining an
efficient and responsible framework.</p>
      <p>In addition, it is crucial to accompany recommendations with an explanation of the
rationale that led the system to make that suggestion. This increases the engagement and trust
of the actors in the system, and thus their willingness to follow what is suggested. This leads to
the fourth research question:</p>
      <p>
        Research question 4: How can we increase trust in our recommendations?
      </p>
      <p>
        We want to use Explainable AI techniques to explain not only
what we are suggesting but also why the algorithm took that decision. This can answer the
question “Why should a resource perform the suggested activity?”. Finally, we want to understand
whether the explanations provided help process actors trust our framework more. To this end,
we plan to perform human evaluations with actual process actors, and we aim to
use objective metrics to assess the explanation quality, e.g. those discussed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>Research question 5: Do real users feel helped by our work?</p>
    </sec>
    <sec id="sec-3">
      <title>3. Project Roadmap</title>
      <p>
        We have started by addressing the first and the fourth research questions, integrating
explanations into the prescriptive-analytics framework proposed in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and leveraging
CatBoost [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], a high-performance open-source framework that has been shown to provide more accurate
predictions with limited computation time compared with the literature (see [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]).
      </p>
      <p>
        Explanations are given using the theory of Shapley values [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However, explanations of
recommendations differ from the explanations of predictions discussed in the literature (see,
e.g., [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]). Indeed, explaining a prediction amounts to explaining how much each
variable influences the final KPI value. For example, “The fact that the variable customer_type
assumes the value Gold contributes to decreasing the expected total time of the process instance
by 120 hours”. On the other hand, the explanation of a recommendation is related to the
recommended activity. Indeed, given the suggestion of a certain activity, the explanation
represents how much performing that activity changes the contribution of each variable to
the final KPI value. For example, “Performing the activity SendLetter, the fact that the variable
customer_type assumes the value Gold goes from contributing to decreasing the expected total time
by 120 hours to contributing to decreasing it by 230 hours.”
      </p>
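      <p>The difference between the two kinds of explanation can be illustrated with a small, self-contained sketch. It computes exact Shapley values by enumerating feature coalitions against a baseline instance, once for the current state of a case and once for the state after the recommended activity. Everything in it is an assumption made for illustration: the toy model predict_total_time stands in for a trained predictor, its coefficients were chosen only to reproduce the 120 and 230 hours of the example above, and the feature last_activity and the baseline values are hypothetical.</p>
      <preformat><![CDATA[
```python
from itertools import combinations
from math import factorial


def predict_total_time(x):
    """Toy KPI model (hypothetical): expected total time of a case in hours."""
    gold = 1.0 if x["customer_type"] == "Gold" else 0.0
    sent = 1.0 if x["last_activity"] == "SendLetter" else 0.0
    return 500.0 - 120.0 * gold - 220.0 * gold * sent


def shapley_values(model, x, baseline):
    """Exact Shapley value of each feature of x, relative to a baseline instance."""
    features = list(x)
    n = len(features)

    def value(coalition):
        # Features outside the coalition are replaced by their baseline value.
        mixed = {f: (x[f] if f in coalition else baseline[f]) for f in features}
        return model(mixed)

    phi = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                total += weight * (value(set(subset) | {f}) - value(set(subset)))
        phi[f] = total
    return phi


def explain_recommendation(model, x_now, x_after, baseline):
    """Pair each feature's contribution before and after the recommended activity."""
    before = shapley_values(model, x_now, baseline)
    after = shapley_values(model, x_after, baseline)
    return {f: (before[f], after[f]) for f in x_now}


baseline = {"customer_type": "Standard", "last_activity": "Other"}
x_now = {"customer_type": "Gold", "last_activity": "CheckDocuments"}
x_after = {"customer_type": "Gold", "last_activity": "SendLetter"}

explanation = explain_recommendation(predict_total_time, x_now, x_after, baseline)
print(explanation["customer_type"])
```
]]></preformat>
      <p>Exhaustive enumeration is only practical for a handful of features; with a real predictor one would rely on an approximate or model-specific Shapley estimator instead.</p>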
      <p>Our work will be organized as follows:</p>
      <list list-type="bullet">
        <list-item>
          <p>
            Regarding research question 1, the literature in [
            <xref ref-type="bibr" rid="ref2 ref9">2, 9</xref>
            ] proposes other interesting Prescriptive-Analytics frameworks. However, these frameworks focus only on recommending activities. We plan to test the quality of recommendations using past cases for which compensatory actions are known, and to check whether the system recommends those actions.
          </p>
        </list-item>
        <list-item>
          <p>
            For research question 2, we plan to analyze the work of De Smedt et al. in [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ]. It presents a framework which aims to forecast the entire process model from historical event data, representing event data as multiple time series. We will start from this work and try to exploit similarity techniques for time-series data, such as Grid Representation and Matrix Distances from [11], to detect variations in a company’s structure and resources.
          </p>
        </list-item>
        <list-item>
          <p>Regarding research question 3, we aim to follow the directions indicated by Mannhardt in [12] and by van der Aalst in [13]. Specifically, we plan to specialize and adjust the de-biasing technique of Adversarial debiasing, discussed in [14]. Its goal is to make the model independent of a certain variable. In it, two predictive models are trained: the first predicts the desired value, and the second takes as input the output of the first and infers the value of the variable whose influence we want to remove. The goal of the model-building phase will be for the first model to reach the highest possible accuracy while the accuracy of the second is equivalent to that of a random classifier, making the model capable of providing responsible recommendations.</p>
        </list-item>
        <list-item>
          <p>
            For research question 5, we plan to work on a graphical user interface capable of representing the recommendations and their explanations. The literature in [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ] proposes the approach of appropriate trust: an objective method which may overcome the problem of subjective evaluations of the explanations of black-box models. Furthermore, we plan to combine this type of objective evaluation with a full evaluation through the associated graphical interface. To this end, both real users and domain experts will test the system, and the test will be accompanied by a satisfaction survey.
          </p>
        </list-item>
      </list>
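      <p>The acceptance criterion planned for research question 3 (a first model that is as accurate as possible and a second model that guesses the protected variable no better than at random) can be sketched as a simple check. This is not an implementation of adversarial debiasing itself, only of the final test on its two outputs. The function names, the tolerance of 0.05 and the use of the majority-class frequency as a proxy for random guessing are assumptions introduced here.</p>
      <preformat><![CDATA[
```python
from collections import Counter


def accuracy(predicted, actual):
    """Fraction of positions where the prediction matches the true value."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


def chance_level(actual):
    """Accuracy of always guessing the most frequent value (proxy for random guessing)."""
    return Counter(actual).most_common(1)[0][1] / len(actual)


def is_debiased(adversary_guess, protected, tolerance=0.05):
    """True if the second model recovers the protected variable no better than chance."""
    gap = accuracy(adversary_guess, protected) - chance_level(protected)
    return tolerance >= gap


# Made-up values of a protected variable for eight cases.
protected = ["A", "B", "A", "B", "A", "B", "A", "B"]
# A second model that reads the protected variable perfectly from the first model's output.
leaky_guess = list(protected)
# A second model whose guesses are unrelated to the protected variable.
blind_guess = ["A", "A", "A", "A", "B", "B", "B", "B"]

print(is_debiased(leaky_guess, protected), is_debiased(blind_guess, protected))
```
]]></preformat>
      <p>In a full evaluation this check would be reported together with the accuracy of the first model, since a model that ignores all inputs would pass it trivially.</p>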
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>My Ph.D. scholarship is partly funded by IBM Italy and by the BMCS
Doctoral Program, University of Padua. This research is also supported by the Department of
Mathematics, University of Padua, through the BIRD project “Data-driven Business Process
Improvement” (code BIRD215924/21).</p>
      <p>[11] Y. Ye, J. Jiang, B. Ge, Y. Dou, K. Yang, Similarity measures for time series data classification
using grid representation and matrix distance, Knowledge and Information Systems (2019).</p>
      <p>[12] F. Mannhardt, Responsible Process Mining, Springer International Publishing, 2022.</p>
      <p>[13] W. van der Aalst, Responsible data science: Using event data in a “people friendly” manner, 2017.
doi:10.1007/978-3-319-62386-3_1.</p>
      <p>[14] D. Kenna, Using adversarial debiasing to remove bias from word embeddings, CoRR
(2021).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>de Leoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dees</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Reulink</surname>
          </string-name>
          ,
          <article-title>Design and evaluation of a process-aware recommender system based on prescriptive analytics</article-title>
          ,
          <source>in: 2020 2nd International Conference on Process Mining (ICPM)</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Weinzierl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dunzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zilker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matzner</surname>
          </string-name>
          ,
          <article-title>Prescriptive business process monitoring for recommending next best actions</article-title>
          ,
          <source>in: Business Process Management Forum</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Galanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Coma-Puig</surname>
          </string-name>
          , M. de Leoni,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Navarin</surname>
          </string-name>
          ,
          <article-title>Explainable predictive process monitoring</article-title>
          ,
          <source>in: Proceedings of the 2nd International Conference on Process Mining (ICPM 2020)</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Buolamwini</surname>
          </string-name>
          , T. Gebru,
          <article-title>Gender shades: Intersectional accuracy disparities in commercial gender classification</article-title>
          ,
          <source>in: Proceedings of the 1st Conference on Fairness, Accountability and Transparency</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H.</given-names>
            <surname>Löfström</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hammar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Johansson</surname>
          </string-name>
          ,
          <article-title>A meta survey of quality evaluation criteria in explanation methods</article-title>
          ,
          <source>in: Proceedings of CAiSE Forum 2022</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Dorogush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ershov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gulin</surname>
          </string-name>
          ,
          <article-title>Catboost: gradient boosting with categorical features support</article-title>
          ,
          <source>in: Proceedings of the Workshop on ML Systems at NIPS 2017</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Galanti</surname>
          </string-name>
          , M. de Leoni,
          <string-name>
            <given-names>M.</given-names>
            <surname>Monaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Navarin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marazzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stasi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Maldera</surname>
          </string-name>
          ,
          <article-title>An explainable decision support system for predictive process analytics</article-title>
          ,
          <year>2022</year>
          . doi:
          <pub-id pub-id-type="doi">10.48550/arXiv.2207.12782</pub-id>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Molnar</surname>
          </string-name>
          ,
          <source>Interpretable Machine Learning</source>
          ,
          <year>2022</year>
          . Available online at https://christophm.github.io/interpretable-ml-book/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Z. D.</given-names>
            <surname>Bozorgi</surname>
          </string-name>
          , I. Teinemaa,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <article-title>Prescriptive Process Monitoring for Cost-Aware Cycle Time Reduction</article-title>
          ,
          <source>in: 3rd International Conference on Process Mining (ICPM)</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>De Smedt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Yeshchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weerdt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <source>Process Model Forecasting Using Time Series Analysis of Event Sequence Data</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>47</fpage>
          -
          <lpage>61</lpage>
          .
          doi:
          <pub-id pub-id-type="doi">10.1007/978-3-030-89022-3_5</pub-id>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>