=Paper=
{{Paper
|id=Vol-3299/Paper06
|storemode=property
|title=Prescriptive Analytics for Business Processes Using Causal Inference (Extended Abstract)
|pdfUrl=https://ceur-ws.org/Vol-3299/Paper06.pdf
|volume=Vol-3299
|authors=Zahra Dasht Bozorgi
|dblpUrl=https://dblp.org/rec/conf/icpm/Bozorgi22
}}
==Prescriptive Analytics for Business Processes Using Causal Inference (Extended Abstract)==
<pdf width="1500px">https://ceur-ws.org/Vol-3299/Paper06.pdf</pdf>
<pre>
Prescriptive Analytics for Business Processes
Using Causal Inference (Extended Abstract)
Zahra Dasht Bozorgi1
1
    University of Melbourne, 700 Swanston St, Carlton, VIC 3053, Australia


                                         Abstract
                                         In this Ph.D. project, we aim to address the need for effective recommendations by devising new process
                                         monitoring methods which are prescriptive in nature. These methods will recommend the process
                                         participants, at tactical and operational levels, what action they need to take to achieve a given process
                                         objective. Moreover, we will use causal inference methods to provide recommendations based on causal
                                         relationships between the proposed actions and the outcome of interest.

                                         Keywords
                                         prescriptive process monitoring, causal inference, machine learning


1. Introduction
Recent advances in process mining techniques have allowed companies to manage and improve
their processes more efficiently. Particularly, process mining has benefited from advances in
machine learning techniques to provide accurate predictions of the future state of business
processes in an area known as predictive process monitoring. Recent predictive monitoring
methods can produce highly accurate predictions of the future state of the process. However,
having reliable predictions does not always lead to improvement of the process. A study by
Dees et al. [1], shows that if good predictions are followed by bad recommendations, the desired
improvement is not achieved. This study illustrates the need for new techniques that find the
best interventions based on the context of each case.
   Causal inference is a field in statistics concerned with discovering and quantifying causal
relationships. This field has been widely used in other domains such as medicine, social sciences,
and marketing. In medicine, causal inference techniques are used to determine the effectiveness
of new drugs. In social sciences and marketing, new policies and campaigns are evaluated using
state-of-the-art causal inference methods. The success of causal inference in these fields serves
as a motivation to use these techniques in the area of process mining.
   In this Ph.D. project, we aim to address the need for effective recommendations by devising
new prescriptive process monitoring methods which are based on causal relationships. These
methods will make recommendations, at tactical and operational levels, about what actions
should be taken to achieve a given process objective.

ICPM 2022 Doctoral Consortium and Tool Demonstration Track
Envelope-Open zdashtbozorg@student.unimelb.edu.au (Z. D. Bozorgi)
GLOBE https://www.linkedin.com/in/zahradbozorgi/ (Z. D. Bozorgi)
Orcid 0000-0002-1595-3934 (Z. D. Bozorgi)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)


                                                                                                          27
2. Research Problem
Starting from a set of pre-defined treatments (a.k.a. process interventions) and associated costs,
my PhD project explores the use of causal inference in prescriptive monitoring, to recommend
what treatments are to be used in what context such that a given benefit function is maximised
for the organisation. Specifically, the overall aim of this project is to answer the following
question:

   How to provide cost-aware recommendations based on causal relations between a proposed
                            intervention and a target of interest?

Since the area of prescriptive process monitoring remains largely under-explored, we select
three main dimensions of exploration for scoping purposes. These are:

    • Who to treat, exploring which cases should get the treatment
    • When to treat, exploring when the treatment should be applied
    • How to treat, exploring which treatment to choose

Accordingly, we define the following research questions to explore these dimensions:

   1. RQ1: Can causal inference provide useful recommendations about which business process
      to treat?
   2. RQ2: Can causal inference provide useful recommendations about when to treat a business
      process?
   3. RQ3: Can causal inference provide useful recommendations about multiple parameteriz-
      able treatments of a business process?


3. Proposed Solutions
To address the identified research problems, we propose three studies each addressing one
research question. To the best of our knowledge, this is the first time causal inference has been
used in a prescriptive process monitoring context.

3.1. RQ1: Who to treat
The main purpose of this study is to investigate the use of causal inference for prescriptive
monitoring. We hypothesise that if we can establish a causal relationship between a possible
intervention and a performance metric, the recommendations that result from such causal rela-
tionships will result in more benefit for the company than prediction-based recommendations.
We divided this study into two sub-studies, one addressing outcome improvement and the other
cycle time reduction. We chose these two objectives because one is an example of a binary
outcome, while the other is described as a real-valued attribute.
   Our approach proposed in [2] is a rule-based prescriptive system (Figure 1). First, a set of
interventions which are highly correlated with a positive outcome of interest are identified.
Then using a causal effect estimation method called Uplift Tree [3], we identify contexts in


                                               28
Figure 1: Overview of the approach for outcome improvement.


Figure 2: Overview of the approach for cycle time reduction.


which those interventions causally influence the outcome. We also propose a cost model that
identifies the Return-on-Investment (ROI) of an intervention. The decision to intervene in a
process is then made based on the ROI. We show that using this method, we can automatically
identify improvement actions that are traditionally done by humans.
   In another study [4], we propose a cost-aware prescriptive monitoring method that is designed
to reduce cycle time (Figure 2). The core of this recommendation system is the Orthogonal
Random Forest (ORF) [5], which is a causal estimation method that works with continuous
outcome variables. The approach consists of two phases: an offline phase in which given an
event log, a causal effect estimation model is trained, and the best policy for applying the
intervention is selected to maximise the gain in applying it. The second phase is the online
phase where the causal effect estimator and the selected policy are used to determine which
ongoing cases should receive an intervention. Our results show that selecting an intervention
policy based on causal models leads to a higher net gain than policies based on traditional
machine learning methods for prediction.

3.2. RQ2: When to treat
In the second study, we investigate the question of when to apply the treatment. Another
important aspect of generating good recommendations is to ascertain when to recommend. This
question has been addressed by previous works on prescriptive process monitoring. In the work
by Teinemaa et al [6], the authors propose empirical thresholding to determine whether an
alarm for triggering an intervention should be raised. Metzger et al. [7] propose a method based
on reinforcement learning (RL) and show that it outperforms empirical thresholding. The main


                                                29
limitation of this method is that they make simplistic assumptions about the effectiveness of the
interventions and the focus is more on the prediction of the performance metric being optimised.
To address RQ2, we propose to extend the approach by Metzger et al. by including causal effect
estimates in the reinforcement learning environment. Specifically, we aim to find when-to-treat
policies by guiding a reinforcement learning agent to make decisions based on information from
causal models in addition to predictive models. In a prescriptive monitoring method based on
reinforcement learning, the environment is defined by case prefixes extracted from event logs
and ongoing process executions, as well as the agent’s estimates of the performance metric and
the causal effect of the treatment. The reward is a function based on one or multiple process
performance metrics and can include the cost and benefit of the treatment. Our initial results
indicate that our approach leads to more net gain than the traditional RL-based method relying
purely on predictions.

3.3. RQ3: How to treat
In the previous studies, we assume that there is a binary treatment that can potentially influence
a process performance metric. However, in practice, many actions can be taken during the execu-
tion of the process that influences performance metrics. With the methods proposed in the first
study, many causal models need to be trained separately, and this is computationally inefficient.
Furthermore, using many standalone causal models will not consider how the different actions
influence each other (i.e., the action interplay). Given the success of reinforcement learning in
the study by Metzger et al. and our own findings in the previous RQ, we plan to extend our
RL-based framework to deal with multiple treatments and treatments with continuous values.
We will expand the action space to include more values and continuous values and will explore
different RL algorithms that are suitable for multi-action and continuous action spaces.


4. Methodology
This project will follow a Design Science research method [8]. The rigour of the approaches
will be ensured by conducting an extensive literature review and constructing a comprehensive
evaluation benchmark, using well-defined selection and assessment criteria. The relevance of the
solutions will be ensured via an extensive evaluation of the developed techniques with real-life
and simulated data sets, and where possible, through case studies with relevant organisations.


5. Challenges and Further Developments
The main challenge in this study is that we assume that all the variables that influence inter-
vention and outcome are observed in the event log. This might lead to biased estimations that
cause the prescriptive system to make a sub-optimal recommendation. We try to alleviate this
threat to validity by performing sensitivity analysis on our models to measure their robustness
to unmeasured confounding. Another challenge in this study is the evaluation of the proposed
methods. While we use cost models and other evaluation methods in the causal inference
literature to evaluate our prescriptive methods, the best and most rigorous way of evaluating


                                               30
these methods is by conducting an A/B test. Also, we plan to seek domain expert feedback to
evaluate our methods qualitatively.


References
[1] M. Dees, M. de Leoni, W. M. P. van der Aalst, H. A. Reijers, What if process predictions
    are not followed by good recommendations?, in: Proceedings of the Industry Forum at
    BPM co-located with 17th International Conference on Business Process Management BPM,
    2019.
[2] Z. D. Bozorgi, I. Teinemaa, M. Dumas, M. L. Rosa, A. Polyvyanyy, Process mining meets
    causal machine learning: Discovering causal rules from event logs, in: 2nd International
    Conference on Process Mining, ICPM, 2020.
[3] P. Rzepakowski, S. Jaroszewicz, Decision trees for uplift modeling with single and multiple
    treatments, Knowl. Inf. Syst. (2012).
[4] Z. D. Bozorgi, I. Teinemaa, M. Dumas, M. L. Rosa, A. Polyvyanyy, Prescriptive process
    monitoring for cost-aware cycle time reduction, in: 3rd International Conference on Process
    Mining, ICPM, 2021.
[5] M. Oprescu, V. Syrgkanis, Z. S. Wu, Orthogonal random forest for causal inference, in:
    Proceedings of the 36th International Conference on Machine Learning, ICML, 2019.
[6] I. Teinemaa, N. Tax, M. de Leoni, M. Dumas, F. M. Maggi, Alarm-based prescriptive process
    monitoring, in: International Conference on Business Process Management, Springer, 2018,
    pp. 91–107.
[7] A. Metzger, T. Kley, A. Palm, Triggering proactive business process adaptations via online
    reinforcement learning, in: International Conference on Business Process Management,
    Springer, 2020, pp. 273–290.
[8] R. H. von Alan, S. T. March, J. Park, S. Ram, Design science in information systems research,
    MIS quarterly (2004).


                                               31

</pre>