<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>How Do Explainability and Interpretability Afect the Use of Process Mining?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Irina Tentina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>The Eindhoven University of Technology</institution>
          ,
          <addr-line>Eindhoven</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <fpage>14</fpage>
      <lpage>18</lpage>
      <abstract>
        <p>Process mining continues to mature from a technological perspective. However, despite the increasing number of process mining solutions and services, some barriers still remain to its full adoption in business practice. Like any other technology, process mining relies on efective usage by users to add value. One of the factors afecting the adoption by users and the use of process mining is that users cannot understand and translate process mining outputs into valuable insights. Our hypothesis is that interpretation and explainability are crucial to give users a sense of understanding and actionability. This doctoral project aims to define explainability and interpretability in process mining, investigate diferent factors afecting them and, furthermore, design solutions for improvement. The project follows a mix of empirical and technical approaches. As a pilot study survey-based interviews with PM business users were conducted confirming the relevance of the topic.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Process Mining</kwd>
        <kwd>Explainability</kwd>
        <kwd>Interpretability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction and Motivation</title>
      <p>
        Process mining (PM) is a technology that uses event logs from IT systems and applications to reconstruct,
visualize, analyze and improve business processes [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. According to Gartner [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] the PM field continues
to grow and the PM software market grew by 39.5% to $871.6 million in 2023. The number of PM
solutions and tools is increasing every year. Most large market players, such as SAP, Microsoft and
IBM, acquire smaller PM companies to include PM in their solution stack. There has also been a growth
of consulting companies ofering diferent types of PM services.
      </p>
      <p>
        Both the increase in solutions and consultancy seem to be indicative of the demand for PM among
companies within the industry. At the same time, adoption by end users still appears to be an issue.
There are more companies that either already had some experience with PM solution(s) and decided not
to continue with it [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] or still use it, but struggle with adoption, value identification and realization [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
There are many factors that can lead to (un-)successful PM use. In this research project we focus on
challenges, as it is called below - pain points (PP), which stem from the author’s consulting experience
and have been highlighted in the recent research[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        PP1: General PM eXplainability (XPM) - I do not always understand what exactly am I
seeing. PM outcomes can be presented in diferent ways and, depending on the PM solution, may
include process-focused visualizations such as process maps and/or set of diferent visual components
as dashboards. However, users who are not experts in PM may experience reading these visuals and
understanding exactly what is displayed as "overwhelming" and non-trivial[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        PP2: General PM iNterpretability (NPM) - It is challenging to understand what should I
do with what I see. Once users understand what they see, e.g. how to read a process map and/or
dashboard, the next step is to interpret what they see and translate it into valuable insights. Depending
on the project setting, diferent approaches and techniques can be used to analyze PM results, for
example exploratory and/or confirmatory approach[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, when non-expert users start using
one of these techniques, they may still find it challenging to interpret PM results properly without help
from PM experts[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>PP3: PM without Process Focus - I miss Process Mining in all this. Because of PP1 and PP2,
PM developers may try to avoid complex output and simplify analyses to improve NPM. In some cases,
this can lead to better operational use, for example, when a non-expert user is given only a short list of
tasks to focus on. However, it can also lead to using PM without a process focus or with a focus only on
local questions within the complex end-to-end view, similar to Business Intelligence (BI) tools. The
question then arises as to whether such simplification helps to improve XPM and NPM, or whether it
risks that actual complex problems remain unsolved.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Research Questions</title>
      <p>To investigate pain points mentioned in the previous section, we have formulated the following research
questions.</p>
      <p>
        RQ1: How can XPM and NPM be defined for certain stages of the PM journey for certain
user groups? PP1 and PP2 are related to how to explain and interpret PM outputs. Interpretability and
explainability are beneficial to instill feelings of understanding within the system’s users. There are
numerous generic explainability and interpretability definitions [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] as it is becoming a popular research
area. Especially in the field of artificial intelligence (AI) and in relation to machine learning, which use
algorithms learned through their own training process. The ‘black box’ efect can lead to questions
such as how the system made a particular decision. However, no such definition appeared to exist for
general PM, e.g. process discovery, except for predictive process monitoring in recent research [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] or
some focused use cases [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ].
      </p>
      <p>RQ2: What are the main factors that can drive XPM and NPM within the certain phases
of PM journey for certain user groups? There are many diferent drivers that can somehow afect
both XPM and NPM. For example, educational aspects and PM experience of individual level during
the initial PM phases (e.g. requirements gathering) or choices of visual components by the PM analyst
during the technical development phases (e.g. dashboard building) of technological level.</p>
      <p>
        RQ3: How can XPM and NPM be improved for certain PM stages and user groups for more
eficient and insightful use of PM outputs? There are some existing methods and tools proposed by
PM vendors or in the recent research [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] that aim (in-)directly to improve XPM and NPM. However,
new solutions can be identified and existing ones can be (re-)evaluated. As mentioned in PP3, enhanced
explainability and interpretability for more complex PM output(s) can contribute to better PM user
adoption and value identification.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Research Methodology</title>
      <p>We adopt a mixed methodology, combining qualitative and quantitative approaches. To address the
research questions, we plan to use the four main phases shown in Figure 1.</p>
      <p>
        To start with definitions (RQ1), we selected the initial scope. We will focus on three main user
groups - business users, analysts, developers. We target planning, requirements and data collection,
data processing and transformation, mining and analysis, evaluation, and process improvement and
support as the most important high-level PM phases [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        Once possible definitions per certain stages and certain user groups are clarified (RQ1), we plan to
investigate drivers (RQ2) that can influence both XPM and NPM. For now, we have grouped these
drivers into four categories. The categorization is preliminary and partially based on
TechnologyOrganization-Environmental (TOE) framework [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] adding the individual perspective.
- Personal / Individual - level of education, personal background, level of experience in process mining,
personal skills, etc.
- Organisational - change culture, learning culture, organizational structure and defined PM roles, etc.
By organisations, we mean not only companies using PM software, but also others, e.g. PM suppliers
and PM consulting firms.
- Environmental - industry and market structure, government regulation, etc.
Developed framework
with phase, user group
and definition of
explainability,
interpretability
Enriched framework with
categorized drivers per
definition (per certain
phase and user group)
Developed assessment
framework with potential
impact and effort to
      </p>
      <p>develop solution
Solution and development</p>
      <p>methods can vary per
specific driver (e.g., new
visual can be proposed or
framework designed)
- Technological - tooling and software (source systems, ETL solutions), visual components within
those systems, programming languages and developed algorithms, etc.</p>
      <p>
        After clarifying the definitions (RQ1) and potential drivers (RQ2), we aim to review, assess and
prioritize them with the aim of identifying the drivers with the greatest business impact based on
potential value. For two-three of the most impact drivers, we plan to review available solutions and
evaluate them. Next, we intent to design improved solutions (RQ3) adopting a Design Science Research
(DSR) approach [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. For example, if the biggest impact (according to the chosen assessment) lies in
the interpretability by the business user during the discovery phase (mining and analysis), a possible
solution may be a new visual component or representation of PM output.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Current State</title>
      <p>As preliminary research to answer RQ1, we conducted survey-based interviews. During the pilot
interviews, we asked several process mining experts and 20 PM business users what do they understand
by explainability and interpretability within process mining. When asking the questions ‘How would
you define explainability in Process Mining?’ and ‘How would you define interpretability in Process
Mining?, we obtained the following results:
• Some respondents answered that they think explainability and interpretability are very important
topics, but that they also find it dificult to answer one or both questions and propose a definition
in a ‘constructive’ way.
• Others shared that explainability and interpretability are the same and that there is no diference
in a definition, meaning they are interchangeable.
• In contrast, some emphasised the diference and even highlighted that one or the other is more
important (e.g. that without explainability, there is no understandability and consequently no
need for interpretability).</p>
      <p>Initial results show that participants find XPM and NPM very important topic. However, some of
them find it challenging to formulate clear definitions.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>W. M. P. van der Aalst</surname>
          </string-name>
          ,
          <source>Process Mining - Data Science in Action, Second Edition</source>
          , Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>V.</given-names>
            <surname>Mehta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kerremans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tornbohm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chibber</surname>
          </string-name>
          ,
          <article-title>Market share analysis: Process mining software, worldwide</article-title>
          ,
          <source>Gartner Report</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>V. S.</given-names>
            <surname>Dani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Leopold</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. M. E. M. van der Werf</surname>
            , I. Beerepoot,
            <given-names>H. A.</given-names>
          </string-name>
          <string-name>
            <surname>Reijers</surname>
          </string-name>
          ,
          <article-title>From loss of interest to denial: A study on the terminators of process mining initiatives</article-title>
          ,
          <source>in: CAiSE</source>
          , volume
          <volume>14663</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2024</year>
          , pp.
          <fpage>371</fpage>
          -
          <lpage>386</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Grisold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Otto</surname>
          </string-name>
          , J. vom Brocke,
          <article-title>Adoption, use and management of process mining in practice, Bus</article-title>
          . Process. Manag. J.
          <volume>27</volume>
          (
          <year>2021</year>
          )
          <fpage>369</fpage>
          -
          <lpage>387</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>F.</given-names>
            <surname>Zerbato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Weber</surname>
          </string-name>
          ,
          <article-title>Process mining practices: Evidence from interviews</article-title>
          ,
          <source>in: BPM</source>
          , volume
          <volume>13420</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>268</fpage>
          -
          <lpage>285</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V. S.</given-names>
            <surname>Dani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Leopold</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. M. E. M. van der Werf</surname>
            ,
            <given-names>H. A.</given-names>
          </string-name>
          <string-name>
            <surname>Reijers</surname>
          </string-name>
          ,
          <article-title>Progressing from process mining insights to process improvement: Challenges and recommendations</article-title>
          , in: EDOC, volume
          <volume>14367</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2023</year>
          , pp.
          <fpage>152</fpage>
          -
          <lpage>168</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Janssenswillen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Depaire</surname>
          </string-name>
          ,
          <article-title>Towards confirmatory process discovery: Making assertions about the underlying system</article-title>
          ,
          <source>Bus. Inf. Syst. Eng</source>
          .
          <volume>61</volume>
          (
          <year>2019</year>
          )
          <fpage>713</fpage>
          -
          <lpage>728</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Droste</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Deters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fuchs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <article-title>Peeking outside the black-box: AI explainability requirements beyond interpretability</article-title>
          ,
          <source>in: REFSQ Workshops</source>
          , volume
          <volume>3672</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Mehdiyev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fettke</surname>
          </string-name>
          ,
          <article-title>Explainable Artificial Intelligence for Process Mining: A General Overview and Application of a Novel Local Explanation Approach for Predictive Process Monitoring</article-title>
          , Springer International Publishing,
          <year>2021</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>28</lpage>
          . URL: http://dx.doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -64949-
          <issue>4</issue>
          _1. doi:
          <volume>10</volume>
          . 1007/978-3-
          <fpage>030</fpage>
          -64949-
          <issue>4</issue>
          _
          <fpage>1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hoogendoorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Arachchige</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Bukhsh</surname>
          </string-name>
          ,
          <article-title>Survey of explainability within process mining: A case study of BPI challenge 2020</article-title>
          , in: FIT, IEEE,
          <year>2023</year>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>48</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L.</given-names>
            <surname>Nannini</surname>
          </string-name>
          ,
          <article-title>Explainability in process mining: A framework for improved decision-making, in: AIES</article-title>
          , ACM,
          <year>2023</year>
          , pp.
          <fpage>975</fpage>
          -
          <lpage>976</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Miksch</surname>
          </string-name>
          ,
          <article-title>Visual analytics meets process mining: Challenges and opportunities</article-title>
          , in: ICPM, IEEE,
          <year>2021</year>
          , p.
          <fpage>xiv</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>M. L. van Eck</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>S. J. J.</given-names>
          </string-name>
          <string-name>
            <surname>Leemans</surname>
            ,
            <given-names>W. M. P. van der Aalst</given-names>
          </string-name>
          , PM ˆ2 :
          <article-title>A process mining project methodology</article-title>
          ,
          <source>in: CAiSE</source>
          , volume
          <volume>9097</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2015</year>
          , pp.
          <fpage>297</fpage>
          -
          <lpage>313</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Baker</surname>
          </string-name>
          , The
          <string-name>
            <surname>Technology-Organization-Environment</surname>
            <given-names>Framework</given-names>
          </string-name>
          , Springer New York, New York, NY,
          <year>2012</year>
          , pp.
          <fpage>231</fpage>
          -
          <lpage>245</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-1-
          <fpage>4419</fpage>
          -6108-2_
          <fpage>12</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.</given-names>
            <surname>Johannesson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Perjons</surname>
          </string-name>
          , An Introduction to Design Science, Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>