=Paper=
{{Paper
|id=Vol-3299/Paper07
|storemode=property
|title=Multimodal Process Prediction (Extended Abstract)
|pdfUrl=https://ceur-ws.org/Vol-3299/Paper07.pdf
|volume=Vol-3299
|authors=Johannes Lahann
|dblpUrl=https://dblp.org/rec/conf/icpm/Lahann22
}}
==Multimodal Process Prediction (Extended Abstract)==
<pdf width="1500px">https://ceur-ws.org/Vol-3299/Paper07.pdf</pdf>
<pre>
Multimodal Process Prediction (Extended Abstract)
Johannes Lahann1,2
1
    Saarland University, Saarbrücken, Germany
2
    German Research Center for Artificial Intelligence (DFKI), Saarbrücken, Germany


                                         Abstract
                                         Process prediction enables the forecast of a process instance based on event log data from process-
                                         oriented information systems. Recently, deep learning methods have improved the state-of-the-art in
                                         various process prediction tasks. Currently, most research addresses the control flow or time perspective.
                                         However, critical information is often hidden in context variables and represented in various data types
                                         such as tabular data, image data, and sensor data. This work explores the influence of context variables
                                         on process prediction. The main objective is to develop new deep learning-based methods to increase
                                         the efficiency and applicability of multimodal process prediction.


1. Introduction
Process prediction, also referred to as predictive business process monitoring or predictive
process analytics is a subset of process mining that tries to gain predictive insights into the
future of ongoing process instances. Organizations can leverage these predictions to adjust
ongoing process execution in real time to prevent undesirable outcomes [1, 2].
   In recent years, deep learning-based approaches to process prediction have attained popularity
and led to breakthrough results in various tasks such as next step prediction, remaining time,
and outcome prediction [3, 4, 5]. Accordingly, numerous deep learning architectures that differ
in data processing, modeling, and evaluation have been developed.
   Existing work mainly focuses on common event log attributes, especially activities, resources,
or timestamps. In practice, however, a few event log attributes rarely determine the future of a
process instance alone. Other context variables represented in various data types might also
be relevant. These may be tabular data directly associated with the event log, including event
attributes such as resource or timestamp or case attributes such as customer or product. In
addition, context variables can contain unstructured data such as text, images, and sensor data.
These contextual variables are often critical, as they can directly influence the continuation of
the process and thus significantly improve the prediction quality.


2. Research Goal and Research Questions
While deep learning techniques have been successfully applied to structured and unstructured
data, research on process prediction has mainly involved structured information such as ac-

ICPM 2022 Doctoral Consortium and Tool Demonstration Track
Envelope-Open johannes.lahann@dfki.de (J. Lahann)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)


                                                                                                          32
tivities, resources, or timestamps. To address this research gap, this thesis investigates how
context variables of various data types can be processed and combined for process prediction.
Designing and implementing deep learning-based methods for multimodal process prediction
is the primary goal of this thesis. In particular, the following three research questions shall be
addressed.

RQ1. What are the strengths and weaknesses of existing deep learning-based process prediction
     architectures regarding data processing, modeling, and evaluation?
RQ2. How can different contextual variables be effectively incorporated into multimodal process
     prediction?
RQ3. How can process prediction enable other predictive process analytics such as trace
     clustering or anomaly detection?

   RQ1 is an empirical knowledge question that involves identifying existing approaches for
process prediction and determining their capabilities, limitations, and tradeoffs. In particular,
it examines whether there is a superior method or whether specific methods are preferable
depending on situational circumstances.
   In contrast, RQ2 aims to provide a deeper understanding of the different use cases in which
process prediction is used in practice. For this purpose, several use cases from different applica-
tion domains with different context variables should be examined. Furthermore, the context
variables that have the most significant effect on the continuation of the process should be
identified. Based on these findings, effective multimodal prediction methods should be devel-
oped. These methods should support relevant contextual variables and build on sound design
decisions that have been benchmarked against alternative approaches in RQ1.
   Finally, RQ3 investigates the potential of neural networks for other predictive process moni-
toring tasks. In natural language processing, language models have been exploited for solving
complex downstream tasks like machine translation, summarization, or question answering.
Similarly, with this research question, we want to explore whether neural networks originally
trained for process prediction can be advantageously leveraged for anomaly detection, trace
clustering, etc.


3. Research Method
The research project follows the design science research (DSR) paradigm for Information Systems
(IS) research [6]. Hevner et al. propose seven guidelines (G1-G7) to address a research problem
in the IS domain.

G1. Design as an Artifact. The result of the research project should be one or more novel
    artifacts.
G2. Problem Relevance. The addressed problem should be relevant to the IS research area.
G3. Design Evaluation. The artifacts should be evaluated through an appropriate method to
    prove their efficiency and utility.


                                                33
G4. Research Contribution. The artifacts should either solve an unsolved problem or improve
    on the existing solutions for the problem.
G5. Research Rigor. The artifacts should be defined through formal definitions, pseudo code, or
    source code in order to ensure consistency and reproducibility.
G6. Design as a Search Process. The artifacts should be designed through an iterative search
    process. The process consists of defining problem space and solution criteria, designing
    the solution, and verifying the solution based on the criteria.
G7. Communication of Research. The results and outcomes of the research project should be
    communicated to academics and practitioners.

We plan to implement the above guidelines as follows. To focus on relevant research problems,
we performed a structured literature review (SLR) in order to screen existing deep learning-
based process prediction methods and reveal trade-offs and limitations (G2). Through the
SLR, we identified two major research problems: (i) a lack of a documented and reproducible
benchmark for process prediction methods; (ii) a focus on a selection of event log variables
leading to a neglect of relevant context information. Accordingly, we set to develop the following
novel artifacts (G1, G4): (i) a modular framework for benchmarking novel prediction methods
against existing methods. (ii) three new deep learning-based process prediction methods that
can effectively incorporate tabular data, image data, and sensor data for multimodal process
prediction. We show how these developed predictive methods can be utilized to solve a variety
of predictive process monitoring tasks, including next-step prediction, outcome prediction,
remaining-time prediction, trace clustering, and anomaly detection. Each artifact is designed
formally by justifying all design decisions and reporting pseudo-code or source code (G5).
Additionally, we describe the experimental setup of each artifact in detail and release the
conducted experiments whenever possible. All artifacts are assessed by empirical evaluations
on real-live event logs or synthetic datasets (G3). Finally, all artifacts that we introduce in this
thesis are developed through an iterative design flow (G6), while major milestones are presented
at conferences and published in journals (G7).


4. Completed Research
To address RQ1, we performed a structured literature review concerning the state-of-the-art
of deep learning-based process prediction [7]. The review compares 32 process prediction ap-
proaches that are classified along different carefully selected dimensions such as neural network
architecture, prediction target, input features, and data processing methods. In particular, the
review showed the benefits and drawbacks of the existing approaches. It revealed research gaps
that played a significant role in determining the research direction of this dissertation project.
   Most of the other author’s contributions have addressed RQ2 (e.g. [8, 9, 10]). In [11], we
covered a use case from the steel industry where sensor data are processed through an LSTM
autoencoder to detect the quality of semi-finished products. Furthermore, we used the learned
representations of the autoencoder to predict the next step in the production process. In contrast
to most existing prediction approaches, we consider sensor data as an additional input source
next to typical event log data.


                                                34
   In [12], we proposed a novel prediction method that can process a flexible number of inputs
while supporting categorical and continuous variables by using gramian angular fields and
convolutional neural networks. The approach can be adapted through configuration to handle
different prediction tasks, including next step prediction, next resource prediction, remaining
time prediction, and outcome prediction. Its effectiveness is measured by comparing the
prediction quality with existing approaches on publicly available datasets. The results show
that the proposed method is an effective alternative to the more frequently used LSTM-based
approaches.
   In [13], we investigated how deep learning can be applied to computer-aided designs (CAD)
to support the planning process in a ”one-of-a-kind production” scenario of a manufacturing
firm. To this end, we utilized convolutional neural networks to cluster process instances based
on their CAD images and connected them with event log data to gather predictive insights.
   Only initial research has been conducted concerning RQ3. In [12], we showed that the hidden
representations of process prediction models could also be utilized for process trace clustering.
Additionally, in [9], we examined the potential of anomaly detection through process prediction
for a purchase order handling process. To this end, we investigated the effectiveness of an
unsupervised language model pretraining with a target task classifier finetuning applied to
process prediction and anomaly detection.
   Our other studies examined the strengths and weaknesses of machine learning methods in
various use cases [14, 15].


5. Planned Research
Future work comprises activities to analyze the integration of other context attributes of
additional data types, such as text data. In addition, thorough benchmarking of design decisions
in process prediction research will be conducted, with a focus on comparability and transparency.
Finally, further research is needed to deepen the understanding of how an underlying predictive
model affects the quality of downstream tasks. To this end, we plan to further explore the extent
of a predictive model’s impact on anomaly detection.


References
 [1] W. Van der Aalst, M. Pesic, M. Song, Beyond process mining: from the past to present
     and future, in: International Conference on Advanced Information Systems Engineering,
     Springer, 2010, pp. 38–52.
 [2] F. M. Maggi, C. Di Francescomarino, M. Dumas, C. Ghidini, Predictive monitoring of busi-
     ness processes, in: International conference on advanced information systems engineering,
     Springer, 2014, pp. 457–472.
 [3] J. Evermann, J. R. Rehse, P. Fettke, Predicting process behaviour using deep learning,
     Decis. Support Syst. 100 (2017) 129–140. doi:1 0 . 1 0 1 6 / j . d s s . 2 0 1 7 . 0 4 . 0 0 3 .
 [4] N. Tax, I. Verenich, M. La Rosa, M. Dumas, Predictive business process monitoring with
     lstm neural networks, in: Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif.


                                                 35
     Intell. Lect. Notes Bioinformatics), volume 10253 LNCS, Springer Verlag, 2017, pp. 477–492.
     doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 3 1 9 - 5 9 5 3 6 - 8 _ 3 0 .
 [5] I. Verenich, M. Dumas, M. L. Rosa, F. M. Maggi, I. Teinemaa, Survey and cross-benchmark
     comparison of remaining time prediction methods in business process monitoring, ACM
     Transactions on Intelligent Systems and Technology (TIST) 10 (2019) 1–34.
 [6] A. R. Hevner, S. T. March, J. Park, S. Ram, Design science in information systems research,
     MIS quarterly (2004) 75–105.
 [7] D. A. Neu, J. Lahann, P. Fettke, A systematic literature review on state-of-the-art deep
     learning methods for process prediction, Artificial Intelligence Review (2021) 1–27.
 [8] S. Dadashnia, P. Fettke, P. Hake, J. Lahann, P. Loos, S. Klein, N. Mehdiyev, T. Niesen, J.-R.
     Rehse, M. Zapp, Exploring the potentials of artificial intelligence techniques for business
     process analysis (2017).
 [9] O. Gutermuth, J. Lahann, J.-R. Rehse, M. Scheid, S. Schuhmann, S. Stephan, P. Fettke,
     Efficient and compliant purchase order handling - a contribution to bpi challenge 2019
     (2019).
[10] S. Klein, J. Lahann, L. Mayer, D. Neu, P. Pfeiffer, A. Rebmann, M. Scheid, B. Willems,
     P. Fettke, Business process intelligence challenge 2020: Analysis and evaluation of a travel
     process, in: 10th Business Process Intelligence Challenge at the Int. Conf. on Process
     Mining (ICPM), 2020.
[11] N. Mehdiyev, J. Lahann, A. Emrich, D. Enke, P. Fettke, P. Loos, Time series classification
     using deep learning for process planning: A case from the process industry, Procedia
     Computer Science 114 (2017) 242–249.
[12] P. Pfeiffer, J. Lahann, P. Fettke, Multivariate business process representation learning
     utilizing gramian angular fields and convolutional neural networks, in: International
     Conference on Business Process Management, Springer, Cham, 2021, pp. 327–344.
[13] N. Mehdiyev, L. Mayer, J. Lahann, P. Fettke,                              Deep learning-based clustering
     of processes and their visual exploration: An industry 4.0 use case for small,
     medium-sized enterprises,                               Expert Systems n/a (????)  e13139. URL: https://
     onlinelibrary.wiley.com/doi/abs/10.1111/exsy.13139. doi:h t t p s : / / d o i . o r g / 1 0 . 1 1 1 1 / e x s y .
     13139. arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13139.
[14] J. Lahann, M. Scheid, P. Fettke, Utilizing machine learning techniques to reveal vat
     compliance violations in accounting data, in: 2019 IEEE 21st conference on business
     informatics (CBI), volume 1, IEEE, 2019, pp. 1–10.
[15] J. Lahann, M. Scheid, P. Fettke, Towards optimal free trade agreement utilization through
     deep learning techniques (2020).


                                                         36

</pre>