=Paper=
{{Paper
|id=Vol-2973/paper_192
|storemode=property
|title=A Causal Approach to Prescriptive Process Monitoring
|pdfUrl=https://ceur-ws.org/Vol-2973/paper_192.pdf
|volume=Vol-2973
|authors=Zahra Dasht Bozorgi
|dblpUrl=https://dblp.org/rec/conf/bpm/Bozorgi21
}}
==A Causal Approach to Prescriptive Process Monitoring==
<pdf width="1500px">https://ceur-ws.org/Vol-2973/paper_192.pdf</pdf>
<pre>
A Causal Approach to Prescriptive Process
Monitoring
Zahra Dasht Bozorgi1
1
    University of Melbourne, Level 4, 700 Swanston St, Carlton, 3053, VIC, Australia


1. Introduction
A business process is a collection of events, activities, and decisions that collectively lead to an
outcome that can be of value to a customer [1]. Process mining is a family of techniques that
extract information about processes using historical process execution data, generally known
as event logs [2]. Recent advances in process mining techniques have allowed companies to
manage and improve their processes more efficiently. Particularly, process mining has benefited
from advances in machine learning techniques to provide accurate predictions of the future
state of business processes in an area known as predictive process monitoring. Recent predictive
monitoring methods can produce highly accurate predictions of upcoming events, future event
suffixes, remaining time, and the outcome of the process [3, 4, 5, 6, 7, 8]. However, having reliable
predictions does not always lead to improvement of the process. A study by Dees et al. [9], shows
that if good predictions are followed by bad recommendations, the desired improvement is not
achieved. This study illustrates the need for new techniques that find the best interventions
based on the context of each case.
   Causal inference is a field in statistics that is concerned with estimating causal effect of a
variable when that variable is changed. This field has been widely used in other domains such
as medicine, social sciences, and marketing. In medicine, causal inference techniques are used
to determine the effectiveness of new drugs. In social sciences and marketing, new policies and
campaigns are evaluated using state-of-the-art causal inference methods. The success of causal
inference in these fields serves as a motivation to use these techniques in the area of process
mining.
   In this Ph.D. project, we aim to address the need for effective recommendations by devising
new prescriptive process monitoring methods which are based on causal relationships. These
methods will make recommendations, at tactical and operational levels, about what actions
should be taken to achieve a given process objective.


BPM 2021: Proceedings of the Demonstration & Resources Track, Best BPM Dissertation Award, and Doctoral
Consortium at BPM 2021 co-located with the 19th International Conference on Business Process Management,
September 6-10, 2021, Rome, Italy
" zdashtbozorg@student.unimelb.edu.au (Z. D. Bozorgi)
~ https://www.linkedin.com/in/zahradbozorgi/ (Z. D. Bozorgi)
                                       © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073       CEUR Workshop Proceedings (CEUR-WS.org)
2. Research Problem
Starting from a set of pre-defined treatments (a.k.a. process interventions) and associated costs,
this thesis explores the use of causal inference in prescriptive monitoring, to recommend what
treatments are to be used in what context such that a given benefit function is maximised for
the organisation. Specifically, the overall aim of this project is to answer the following question:

   How to provide cost-aware recommendations based on causal relations between a proposed
                            intervention and a target of interest?

Since the area of prescriptive process monitoring remains largely under-explored, we select
three mains dimensions of exploration for scoping purposes. These are:

    • The objective of the recommendations. In this thesis, we focus on two main goals, namely
      outcome improvement and remaining time reduction.
    • Explainability of the recommendations. We explore the use of both interpretable and
      black-box (hard to explain) methods for producing recommendations and assess relative
      merits and shortcomings.
    • Prescriptive monitoring with multiple treatment options. We explore the use of reinforce-
      ment learning solutions to choose the best course of action during the execution of the
      process.

Accordingly, we define the following research questions to explore these dimensions:

   1. RQ1: How can the use of causal inference in prescriptive process monitoring result
      in recommendations that optimise process outcome and cycle time while providing
      measurable benefits to the organisation?
   2. RQ2: How do black-box approaches for causal inference compare with interpretable
      techniques when applied to prescriptive process monitoring?
   3. RQ3: How can we use reinforcement learning in combination with causal inference to
      devise a prescriptive process monitoring method that can select the best actions that lead
      to the achievement of an objective?


3. Approach
To address the identified research problems, we propose three studies each addressing one
research question. To the best of our knowledge, this is the first time causal inference has been
used in a prescriptive process monitoring context.

3.1. RQ1: Recommendations for varying objectives
The main purpose of this study is to investigate the use of causal inference for prescriptive
monitoring. We hypothesise that if we can establish a causal relationship between a possible
intervention and the outcome of interest, the recommendations that result from such causal
relationships will result in more benefit for the company than correlation-based recommenda-
tions. We divided this study into two sub-studies, one addressing outcome improvement and
the other cycle time reduction. We chose this two objectives because one is an example of a
binary outcome, while the other is described as a real-valued attribute.
   Our approach proposed in [10] is a rule-based prescriptive system. First, a set of interventions
which are highly correlated with a positive outcome of interest are identified. Then using a
causal effect estimation method called Uplift Tree [11], we identify contexts in which those
interventions causally influence the outcome. We also propose a cost model that identifies
the Return-on-Investment (ROI) of an intervention. The decision to intervene in a process is
then made based on the ROI. We show that using this method, we can automatically identify
improvement actions that are traditionally done by humans.
   In another study [12], we propose a cost-aware prescriptive monitoring method that is
designed to reduce cycle time. The core of this recommendation system is the Orthogonal
Random Forest (ORF) [13], which is a causal estimation method that works with continuous
outcome variables. The approach consists of two phases: an offline phase in which given an
event log, a causal effect estimation model is trained, and the best policy for applying the
intervention is selected to maximise the gain in applying it. The second phase is the online
phase where the causal effect estimator and the selected policy are used to determine which
on-going cases should receive an intervention. Our results show that selecting an intervention
policy based on causal models leads to a higher net-gain than policies based on traditional
machine learning methods for prediction.

3.2. RQ2: Explainability of the recommendations
In the second study, we plan to investigate the use of black-box and white-box models. Many
methods for causal estimation have been proposed in the causal inference literature. Most
of these models are black-box, meaning that we cannot understand how different variables
are combined to estimate a causal effect. Conversely, some models such as uplift trees are
interpretable or white-box. These models are constrained to give a better understanding of how
the causal effect estimation is made. Many models, however, are not designed to be interpretable,
but to be accurate estimators of causal effects. Due to an issue known as the fundamental problem
of causal inference, it is very difficult to establish whether black-box models have higher accuracy
in real-world settings and whether the more accurate estimates yield higher benefit. Since
prescriptive monitoring methods are designed to aid humans in decision-making, transparency
of the models they are based on is important. Therefore, in this study, we plan to investigate
whether the use of black-box models is preferable to white-box ones. To do this, we first need to
address the problem of not observing ground truth in the data. So we will first create simulated
event logs by using the data generation approach proposed in [14] based on a flow-based deep
generative model called Sigmoidal Flow. We will train a Sigmoidal Flow on real-life event logs,
and then create simulated data that contain the ground truth causal effect while preserving
the statistical properties of our real-world data. Using this simulated data we will investigate
different black-box and white-box causal inference approaches and develop a cost-benefit model
to determine in which circumstances using a black-box model is warranted.
3.3. RQ3: Multiple treatment options
This study will be conducted in the final year of this Ph.D. In the previous studies, we assume
that there is a binary treatment that can potentially influence a process performance metric.
However, in practice, many actions can be taken during the execution of the process that
influence performance metrics. With the methods proposed in the first study, many causal
models need to be trained separately, and this is computationally inefficient. Furthermore,
using many standalone causal models will not consider how the different actions influence each
other (i.e., the action interplay). In such circumstances, reinforcement learning frameworks
can be employed to design a prescriptive monitoring method that deals with a multitude of
actions/treatments. In particular, contextual bandits algorithms have benefited from the causal
inference literature to make them less prone to problems in estimation bias [15]. Contextual
bandits are an extension of multi-armed bandits. They output an action conditional on the state
of the environment. The aim of this study is to devise a new prescriptive monitoring method
based on contextual bandits that can recommend the best action at each step of the process
execution. In a prescriptive monitoring method based on contextual bandits, the context will
be defined by case prefixes extracted from event logs and ongoing process executions, and the
reward will be a function based on one or multiple process performance metrics.

3.4. Methodology
This project will follow a Design Science research method [16]. The rigour of the approaches
will be ensured by conducting an extensive literature review and constructing a comprehensive
evaluation benchmark, using well defined selection and assessment criteria. The relevance of the
solutions will be ensured via an extensive evaluation of the developed techniques with real-life
and simulated data sets, and where possible, through case studies with relevant organisations.


4. Limitations of the Study
The main limitation of this study is that we assume that all the variables that influence inter-
vention and outcome are observed in the event log. This might lead to biased estimations that
cause the prescriptive system to make a sub-optimal recommendation. We try to alleviate this
threat to validity by performing sensitivity analysis on our models to measure their robustness
to unmeasured confounding. Another limitation of this study is the evaluation of the proposed
methods. While we use cost models and other evaluation methods in the causal inference
literature such as Qini coefficients to evaluate our prescriptive methods, the best and most
rigorous way of evaluating these methods is conducting an A/B test.
References
 [1] M. Dumas, M. L. Rosa, J. Mendling, H. A. Reijers, Fundamentals of Business Process
     Management, Second Edition, Springer, 2018.
 [2] W. M. P. v. d. Aalst, Process Mining: Data Science in Action, second ed., Springer, 2016.
 [3] F. Taymouri, M. La Rosa, S. Erfani, Z. D. Bozorgi, I. Verenich, Predictive business pro-
     cess monitoring via generative adversarial nets: The case of next event prediction, in:
     International Conference on Business Process Management, Springer, 2020.
 [4] F. Taymouri, M. L. Rosa, S. M. Erfani, A deep adversarial model for suffix and remaining
     time prediction of event sequences, CoRR (2021).
 [5] V. Pasquadibisceglie, A. Appice, G. Castellano, D. Malerba, Using convolutional neural
     networks for predictive process analytics, in: International Conference on Process Mining,
     ICPM 2019, Aachen, Germany, June 24-26, 2019, IEEE, 2019.
 [6] N. Tax, I. Verenich, M. L. Rosa, M. Dumas, Predictive business process monitoring with
     LSTM neural networks, in: Advanced Information Systems Engineering - 29th International
     Conference, CAiSE 2017, Proceedings, Springer, 2017.
 [7] M. Camargo, M. Dumas, O. G. Rojas, Learning accurate LSTM models of business processes,
     in: T. T. Hildebrandt, B. F. van Dongen, M. Röglinger, J. Mendling (Eds.), Business Process
     Management - 17th International Conference, BPM 2019, Vienna, Austria, September 1-6,
     2019, Proceedings, Springer, 2019.
 [8] L. Lin, L. Wen, J. Wang, Mm-pred: A deep predictive model for multi-attribute event
     sequence, in: T. Y. Berger-Wolf, N. V. Chawla (Eds.), Proceedings of the 2019 SIAM
     International Conference on Data Mining, SDM 2019, Calgary, Alberta, Canada, May 2-4,
     2019, SIAM, 2019.
 [9] M. Dees, M. de Leoni, W. M. P. van der Aalst, H. A. Reijers, What if process predictions are
     not followed by good recommendations?, in: Proceedings of the Industry Forum at BPM
     2019 co-located with 17th International Conference on Business Process Management
     (BPM 2019), Vienna, Austria, September 1-6, 2019, CEUR-WS.org, 2019.
[10] Z. D. Bozorgi, I. Teinemaa, M. Dumas, M. L. Rosa, A. Polyvyanyy, Process mining meets
     causal machine learning: Discovering causal rules from event logs, in: 2nd International
     Conference on Process Mining, ICPM 2020, Padua, Italy, October 4-9, 2020, IEEE, 2020.
[11] P. Rzepakowski, S. Jaroszewicz, Decision trees for uplift modeling with single and multiple
     treatments, Knowl. Inf. Syst. (2012).
[12] Z. D. Bozorgi, I. Teinemaa, M. Dumas, M. La Rosa, Prescriptive process monitoring for
     cost-aware cycle time reduction, arXiv preprint arXiv:2105.07111 (2021).
[13] M. Oprescu, V. Syrgkanis, Z. S. Wu, Orthogonal random forest for causal inference, in:
     Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15
     June 2019, Long Beach, California, USA, PMLR, 2019.
[14] B. Neal, C.-W. Huang, S. Raghupathi, Realcause: Realistic causal inference benchmarking,
     arXiv preprint arXiv:2011.15007 (2020).
[15] M. Dimakopoulou, Z. Zhou, S. Athey, G. Imbens, Balanced linear contextual bandits, in:
     Proceedings of the AAAI Conference on Artificial Intelligence, 2019.
[16] R. H. von Alan, S. T. March, J. Park, S. Ram, Design science in information systems research,
     MIS quarterly (2004).

</pre>