<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Integration of an Explainable Predictive Process Monitoring System into IBM Process Mining Suite (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Riccardo Galanti y</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Massimiliano de Leoniy</string-name>
          <email>deleoni@math.unipd.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alan Marazzi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giacomo Bottazzi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Massimiliano Delsante</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrea Folli</string-name>
          <email>Andrea.Follig@ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM</institution>
          ,
          <addr-line>Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Riccardo.Galanti</institution>
          ,
          <addr-line>Alan.Marazzi,Giacomo.Bottazzi,Massimiliano.Delsante,Andrea.Folli</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>II. OVERVIEW</title>
      <p>
        The implementation of explainable predictive monitoring
within the IBM Process Mining suite builds on the explanation
framework discussed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which is based on SHAP [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]2.
SHAP is based on the strong theoretical foundation of the
original game theory approach to explain the variables that
contribute to the predictions, and it is independent of the
specific prediction technique by nature, as opposite to
attentionbased mechanisms, which only apply to a neural network [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        1IBM Process Mining – https://www.ibm.com/cloud/
cloud-pak-for-business-automation/process-mining
2Welcome to the SHAP documentation – https://shap.readthedocs.io
To provide further evidence of this, the original prototype
in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] built on training LSTM models, whereas the
implementation within the IBM software uses Catboost, a
highperformance open source framework for gradient boosting on
decision trees [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The choice of replacing LSTM models with
Catboost is motivated by the fact that Catboost reduces the
training time of ca. 20-30 times in all the experiments that we
carried out, while returning models with similar accuracy.
      </p>
      <p>The back-end of the predictive monitoring is based on the
Azure infrastructure. This enables to deploy the technique in
the cloud and develop a whole system around it. The system is
in charge of processing requests coming from the IBM Process
Mining suite, preparing a computing instance to execute our
framework, and deliver the results back.3 In particular, it can
handle and process multiple requests coming from different
users and, in case a customer requests it, multiple compute
instances can be easily provided by allocating new clusters,
enabling to scale on demand. The system has been tested to
work with datasets up to 10 million of events.</p>
      <p>Figure 1 shows a screenshot of the Analytics Dashboard
within the IBM Process Mining suite for prediction of the total
time of cases, namely the time necessary to complete a case.
The use case presented here is based on a process executed
at an Italian Banking Institution. The process deals with the
closure of customer’s accounts, which may be requested by
the customer or by the bank, for several reasons. It uses event
data consisting of 730336 events belonging to 116566 cases.</p>
      <p>The upper-left corner reports on general process statistics,
such as the number of running cases and the average case
total time (here labeled as Completed Time) and cost. The
bottom-right corner lists the running cases, each associated
with the case identifier, and the last performed activity; since
this dashboard refers to the process total time, each case is
also associated with the elapsed time, the expected total time
as forecasted by the predictive monitor, and its difference
wrt. the average completion time, here also named as target.
When one clicks on a specific running case (e.g. with id
3A tutorial and a video showing how to use the tool can be found at
https://github.com/PyRicky/explainable predictive system
20181014067), one can see the the explanations for that case,
named influencers in the tool (see Figure 2). Each explanation
is of form a=x and is associated with a so-called Shapley value
n computed via SHAP. In accordance with the SHAP theory,
the explanation’s interpretation is as follows: since attribute a
takes on value x for this running case, the total-time prediction
deviates n time units from the average total time of cases.</p>
      <p>Let us consider again the all-cases dashboard in Figure 1:
the bar chart in the top-right corner provides an helicopter
view of the explanations. In particular, each row of the bar
chart represents an explanation, and extends towards left or
right, depending whether the average Shapley value for the
explanation is negative or positive. The colour indicates the
frequency of an explanation, with darker colours indicating a
large number of running cases with that explanation.</p>
      <p>As an example, explanation ACTIVITY=Pending Request for
Network Information has a large bar with a light colour: this
means that, for a small number of cases, the fact that the latest
activity has been a Pending Request for Network Information
has contributed to reduce the predicted total case duration by
an average value of 2 days and 4 hours (the average shapley
value). The explanation CLOSURE TYPE=Bank Recess is
conversely associated with a darker colour, namely with a large
number of cases. The average shapley value is equal to -2 days:
when the closure of the bank account is requested directly by
the bank, the total time reduces by 2 days wrt. the average.
This is indeed considered a simpler situation that does not
require much interaction with the bank account holder. On the
other side of the spectrum, explanation ACTIVITY=Pending
request for acquittance of heirs has the largest positive shapley
value: 1 day and 13 hours. This can also be justified: when
the bank account is aimed at closure because of the holder’s
decease, the execution takes longer due to the involvements
of the heirs.</p>
    </sec>
    <sec id="sec-2">
      <title>III. CONCLUSIONS</title>
      <p>In this paper we presented our ready-to-use explainable
predictive module, which can work directly with the data
processed by the process mining engine, without requiring any
additional intervention or technical knowledge and providing
almost immediate insights to the process stakeholder.</p>
      <p>Our framework is fully integrated in the IBM Process
Mining suite, and is ready for evaluation with the users; it can
be leveraged directly by process stakeholders with no need
for customization for each specific project, and is scalable
thanks to a cloud-based infrastructure. The interface is the
result of integrating feedback collected by process analysts and
consultants within IBM. However, we aim at a more extensive
user evaluation to further improve the user experience.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Ma</surname>
          </string-name>
          <article-title>´rquez-</article-title>
          <string-name>
            <surname>Chamorro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Resinas</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <surname>A.</surname>
          </string-name>
          <article-title>Ruiz-Corte´s, “Predictive monitoring of business processes: A survey,”</article-title>
          <source>IEEE Transaction on Services Computing</source>
          , vol.
          <volume>11</volume>
          , no.
          <issue>6</issue>
          , pp.
          <fpage>962</fpage>
          -
          <lpage>977</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>I.</given-names>
            <surname>Nunes</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          , “
          <article-title>A systematic review and taxonomy of explanations in decision support and recommender systems,” User Modeling and User-Adapted Interaction</article-title>
          , vol.
          <volume>27</volume>
          , no.
          <issue>3-5</issue>
          , p.
          <fpage>393</fpage>
          -
          <lpage>444</lpage>
          , Dec.
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>F.</given-names>
            <surname>Doshi-Velez</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Kim</surname>
          </string-name>
          , “
          <article-title>Towards a rigorous science of interpretable machine learning</article-title>
          ,”
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          , “
          <article-title>“why should I trust you”: Explaining the predictions of any classifier</article-title>
          ,”
          <source>in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          , San Francisco,
          <year>2016</year>
          , pp.
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Galanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Coma-Puig</surname>
          </string-name>
          , M. de Leoni,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carmona</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Navarin</surname>
          </string-name>
          , “
          <article-title>Explainable predictive process monitoring</article-title>
          ,” in 2nd International Conference on Process Mining,
          <string-name>
            <surname>ICPM</surname>
          </string-name>
          <year>2020</year>
          . IEEE, pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Shapley</surname>
          </string-name>
          , “
          <article-title>A value for n-person games,” Contributions to the Theory of Games</article-title>
          , vol.
          <volume>2</volume>
          , no.
          <issue>28</issue>
          , pp.
          <fpage>307</fpage>
          -
          <lpage>317</lpage>
          ,
          <year>1953</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bahdanau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Cho</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bengio</surname>
          </string-name>
          , “
          <article-title>Neural machine translation by jointly learning to align and translate</article-title>
          ,
          <source>” in The 3rd International Conference on Learning Representations</source>
          ,
          <string-name>
            <surname>ICLR</surname>
          </string-name>
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Dorogush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ershov</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Gulin</surname>
          </string-name>
          , “
          <article-title>Catboost: gradient boosting with categorical features support,”</article-title>
          <source>in Proceedings of the Workshop on ML Systems at NIPS</source>
          <year>2017</year>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>