<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Journey-to-Process Analytics: Fusing Experience with Operations Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Henrik Leopold</string-name>
          <email>henrik.leopold@klu.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Rochlitzer</string-name>
          <email>alexander.rochlitzer@klu.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Meindl</string-name>
          <email>manuel.meindl@sap.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Raheleh Hadian</string-name>
          <email>raheleh.hadian@sap.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gregory Hakomaki II</string-name>
          <email>gregory.hakomaki.ii@sap.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Download/Demo URL</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Documentation URL</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Process Mining, Business Process Improvement, Process Observability, Large Language Models</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Kühne Logistics University</institution>
          ,
          <addr-line>Großer Grasbrook 17, 20457 Hamburg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Microsoft Windows, GNU/Linux</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Python</institution>
          ,
          <addr-line>Kubernetes, Docker, SAP Data Custodian, ReactJS</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <fpage>14</fpage>
      <lpage>18</lpage>
      <abstract>
        <p>A key business challenge of process mining is to appeal to decision-makers who seek to diferentiate, with the ambition to go beyond operational optimization. One way to position process mining as a diferentiator is to integrate operational process and experience journey perspectives, with the ultimate goal to better align operations with the needs of customers and other external stakeholders. To exemplify this direction, this demonstration presents SAP's journey-to-process analytics capabilities that fuse experience with process data, allowing organizations to generate insights about how operations afect experience.</p>
      </abstract>
      <kwd-group>
        <kwd>Metadata description</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR</p>
      <p>ceur-ws.org
Screencast video
Value
1.0
Proprietary
Journey-to-Process Analytics
https://staging.signavio.com/g/statics/labs/journey2process
https://url.sap/7o0crn
N/A
https://url.sap/vm93si</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        To thrive in today’s fast-moving, interconnected world, organizations must not only maintain
tight control over their processes but also continuously transform them to meet the needs of
their stakeholders. Traditional business process management (BPM), however, focuses on an
organization’s inside-out perspective, thereby missing valuable improvement opportunities that
arise from leveraging the knowledge and experiences of stakeholders [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], who, after all, define
the success of an organization. One of the main challenges of leveraging the stakeholders’
perspective is that stakeholder journey data are often unstructured (e.g., survey comments,
incident tickets, emails), meaning that extracting relevant information and combining it with
operational data may require significant manual efort.
      </p>
      <p>
        Recognizing both the lack of process observability and the recent interest in Large Language
Models (LLM) for BPM [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2, 3, 4</xref>
        ], we propose a natural language-driven tool that analyzes journey
data and maps them to process data and models, enabling a more holistic view on organizational
operations. The tool provides organizations with in-depth insights into how their stakeholders
perceive individual process steps or stages. These insights can help enhance the understanding
of the process dynamics that afect the experience of stakeholders, speed up the search for
necessary changes that address the root causes of experience issues, and facilitate the validation
of the changes’ actual impact on the experience.
      </p>
      <p>The remainder of the paper is structured as follows. Section 2 presents our tool for the
integrated analysis of journey and process data. Section 3 elaborates on the maturity of the
tool. Section 4 outlines directions for future work and concludes the paper.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Architecture and Features</title>
      <p>The tool provides two main functionalities for leveraging the journey perspective for BPM: the
textual analysis of journey data and the mapping of journey data to process data and models.
Below, we elaborate on the details of these functionalities.</p>
      <sec id="sec-3-1">
        <title>2.1. Journey Data Analysis</title>
        <p>The journey-to-process analytics tool starts with the analysis of journey data as the cornerstone
of providing an outside-in perspective. Users first need to create a new dashboard and provide a
CSV file containing the data to be analyzed. The tool then determines the sentiment of the data,
classifies the data into process topics, and identifies groups of similar stakeholder experiences.
Sentiment Analysis. To get insights into how stakeholders perceive the processes of an
organization, the tool classifies each text of the stakeholders as either positive, negative, neutral,
or unknown. Users can then filter by sentiment type and review each text individually.
Alternatively, users can analyze the development of the overall sentiment of the journey data over time,
as shown in Figure 1. The tool provides the overall sentiment both as a net sentiment score and
broken down into the sentiment types. The net sentiment score  net is defined as follows:
 net =</p>
        <p>pos −  neg
 pos +  neg +  neu
where  pos is the number of positive items,  neg is the number of negative items,  neu is the
number of neutral items</p>
        <p>To determine the sentiment of a stakeholder’s text, we use an LLM that receives a prompt
that includes the text and the sentiment types. A text that might violate the LLM’s content
restrictions is classified as unknown.</p>
        <p>Process Topic Analysis. To provide a starting point for a targeted in-depth analysis by the
user, the tool classifies the data according to six expert-defined process topics, as shown in
Figure 1. To help users to identify process issues that require attention, the tool provides, for
each topic, a sentiment score and a brief summary for each sentiment type. Based on this
information, users can drill down into relevant topics. Table 1 provides a description of each
topic and a corresponding example.</p>
        <p>We implemented this analysis as a multi-label classification task, where each text from a
stakeholder can be associated with zero or more process topics. To obtain the topics for a text,
we provide an LLM with a prompt that includes the process topics, their descriptions and the
stakeholder’s text. In addition, we generate a summary by using a prompt that takes in texts
from the specific topic and sentiment type for which the summary is generated.
Cluster Analysis. Similar to the classification of journey data into predefined topics, the tool
classifies the data into ”What goes well” and what ”Needs to Improve”, as shown in Figure 2.
Within these two categories, the tool provides more fine-granular topics that emerge from
the specific experience data. To achieve this, we use the BERTopic library 1 for clustering
the data. For each emerging cluster, our tool also provides a topic label and a summary.
We generate them using an LLM with a prompt that leverages representative keywords
and texts according to BERTopic from the cluster for which the label and summary are generated.
As this functionality does not require process data or models, it enables even
organizations with very low levels of process management maturity to get an outside-in perspective
on their operations.
1https://maartengr.github.io/BERTopic</p>
        <sec id="sec-3-1-1">
          <title>Process Clarity</title>
        </sec>
        <sec id="sec-3-1-2">
          <title>Process Eficiency</title>
        </sec>
        <sec id="sec-3-1-3">
          <title>Process Speed</title>
        </sec>
        <sec id="sec-3-1-4">
          <title>Process Efectiveness</title>
        </sec>
        <sec id="sec-3-1-5">
          <title>System related</title>
        </sec>
        <sec id="sec-3-1-6">
          <title>Human related</title>
        </sec>
        <sec id="sec-3-1-7">
          <title>Description</title>
        </sec>
        <sec id="sec-3-1-8">
          <title>Example</title>
        </sec>
        <sec id="sec-3-1-9">
          <title>Any reference by the subject to the I was not sure how to approve my</title>
          <p>understanding of the process that team’s targets.
feedback is given to.</p>
        </sec>
        <sec id="sec-3-1-10">
          <title>Any reference to the quality of exe- I always have to go back and forth in</title>
          <p>cution of a process, with regards to the system to approve the targets of
costs, efort, time. each team member.</p>
        </sec>
        <sec id="sec-3-1-11">
          <title>Any reference to speed regarding a My team lead approved my targets</title>
          <p>process. Explicit subcategory of pro- quickly.
cess eficiency.</p>
          <p>Any reference to the success of the I joined in the middle of the year and
outcome of a process in one instance I could not set any targets for the
reof execution or the overall success of maining months.
the process and its alignment to the
overall goal.</p>
          <p>Any reference to the systems, tools The target setting system is easy to use.
and integrations or UI and UX
serving as an underlying layer to a
process.</p>
        </sec>
        <sec id="sec-3-1-12">
          <title>Any reference to people serving as stakeholders in a process.</title>
          <p>My team lead was very nice and
provided constructive feedback when we
discussed my targets.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>2.2. Journey-to-Process Mapping</title>
        <p>To combine the outside-in and inside-out perspectives into a holistic view, users can link a
process model (BPMN file) or an event log (XES file) to their provided journey data. The tool
then maps each text of a stakeholder to zero or more activities of the model or nodes of the mined
directly-follows graph (DFG). The mapping dashboards provide an overview of the analyzed
process with sentiment scores attached to each process step. As shown in Figure 2, users can
also select a particular step to get more in-depth insides about how stakeholders perceive this
step, including a breakdown of the sentiment into the diferent types. Similar to the analysis of
process topics, the mapping of journey data to process data and models is implemented as a
multi-label classification task. To obtain the relevant process elements for a stakeholder’s text,
we provide a prompt that includes the activities or nodes and the text to an LLM.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Maturity</title>
      <p>In this section, we report on the evaluation results of the novel process topic analysis and
journey-to-process mapping features. For the topic matching experiment, we used a synthetic
dataset that is inspired by real-life customer and employee journey data of a variety of business
scenarios, including performance review, hiring, and order-to-cash processes. The dataset
consists of 100 feedback comments. For the evaluation of the activity matching feature, we
used a synthetic dataset inspired by the real-life order-to-cash process mentioned above. The
dataset consists of 16 process activities and 57 feedback comments. To evaluate the features’
performance, we used the average dice score. The dice score  dice is defined as follows:
 dice =
2 ⋅ | ∩  |
| | + | |
where  is the set of human-provided labels,  is the set of predicted labels, | ∩  | is the size
of the intersection between sets  and  , | | is the size of set  , | | is the size of set</p>
      <p>
        The process topic analysis feature and the journey-to-process mapping feature experiments
were performed in a one-shot setting and a zero-shot setting, respectively. For both experiments,
we used GPT-4 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] as the LLM. This resulted in average dice scores of 0.71 and 0.5 for the process
topic analysis and journey-to-process mapping, respectively. Furthermore, we conducted case
studies with companies from various industries and found that the tool can help to validate
business assumptions before taking actions, to align key performance indicators with
stakeholders’ experiences, and to perform more targeted actions such as training sessions for a specific
group of employees or process changes for a specific customer type. In addition, we found that
the journey-to-process mapping can be significantly enhanced by human-in-the-loop feedback
about the validity of the mappings, allowing the tool to adjust to user specific terminologies.
The journey-to-process analytics tool enables organizations to perform an integrated analysis of
journey and operational data, providing a more holistic and actionable view on organizational
operations. The tool analyzes the sentiment of journey data, classifies the data into
expertdefined process topics, identifies groups of similar experiences, and provides in-depth insights
into how stakeholders perceive individual process steps or stages. The increased process
observability enables organizations to make informed and timely adjustments. By performing
experiments and elaborating on case studies, we demonstrated the efectiveness of the tool.
      </p>
      <p>In the future, we aim to enable users to adapt the tool to their data by confirming, rejecting,
or modifying the sentiment assessments of their journey data and the associations of the data
with process topics and process elements. Additionally, we aim to provide a more granular
sentiment analysis that determines the sentiment towards each individual element addressed in
a journey data point. Finally, the tool’s mapping functionality can be improved by considering
more element types, by mapping journey data collected in a process-oriented manner to event
logs instead of DFGs, and by providing a two-step mapping process. In the first step, the process
model or event log that is related to the journey data would be identified within a collection. In
the second step, the journey data would be mapped to individual process elements.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments References</title>
      <p>We sincerely thank Timotheus Kampik and Peyman Badakhshan for their continuous support
in this research project and for revising an earlier draft of this manuscript.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rosemann</surname>
          </string-name>
          ,
          <article-title>Proposals for future bpm research directions</article-title>
          , in: C.
          <string-name>
            <surname>Ouyang</surname>
          </string-name>
          , J.-Y. Jung (Eds.),
          <source>Asia Pacific Business Process Management</source>
          , Springer International Publishing, Cham,
          <year>2014</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kampik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Warmuth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rebmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Agam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. N. P.</given-names>
            <surname>Egger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gerber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hofart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kolk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Herzig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Decker</surname>
          </string-name>
          , et al.,
          <article-title>Large process models: A vision for business process management in the age of generative ai</article-title>
          , KI - Künstliche
          <string-name>
            <surname>Intelligenz</surname>
          </string-name>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Busch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rochlitzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Leopold</surname>
          </string-name>
          ,
          <article-title>Just tell me: Prompt engineering in business process management</article-title>
          , in: H.
          <string-name>
            <surname>van der Aa</surname>
            , D. Bork,
            <given-names>H. A.</given-names>
          </string-name>
          <string-name>
            <surname>Proper</surname>
          </string-name>
          , R. Schmidt (Eds.), Enterprise,
          <source>Business-Process and Information Systems Modeling</source>
          , Springer Nature Switzerland, Cham,
          <year>2023</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Vidgof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bachhofner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <article-title>Large language models for business process management: Opportunities and challenges</article-title>
          , in: C. Di
          <string-name>
            <surname>Francescomarino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Burattin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Janiesch</surname>
          </string-name>
          , S. Sadiq (Eds.),
          <source>Business Process Management Forum</source>
          , Springer Nature Switzerland, Cham,
          <year>2023</year>
          , pp.
          <fpage>107</fpage>
          -
          <lpage>123</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>OpenAI</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Achiam</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Adler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Ahmad</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Akkaya</surname>
            ,
            <given-names>F. L.</given-names>
          </string-name>
          <string-name>
            <surname>Aleman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Altenschmidt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Altman</surname>
          </string-name>
          , et al.,
          <source>Gpt-4 technical report</source>
          (
          <year>2023</year>
          ). arXiv:
          <volume>2303</volume>
          .
          <fpage>08774</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>