=Paper=
{{Paper
|id=Vol-3783/paper_162
|storemode=property
|title=Multidimensional Process Model Forecasting (MuDiPMF)
|pdfUrl=https://ceur-ws.org/Vol-3783/paper_162.pdf
|volume=Vol-3783
|authors=Yongbo Yu
|dblpUrl=https://dblp.org/rec/conf/icpm/Yu24
}}
==Multidimensional Process Model Forecasting (MuDiPMF)==
Multidimensional Process Model Forecasting (MuDiPMF)
Yongbo Yu
KU Leuven, Naamsestraat 69, 3000 Leuven, Belgium
Abstract
Process analytics aims to improve processes based on event logs generated by information systems by, among
others, automatically discovering models representing the current system. This discovery, however, is typically
based on static models, ignoring its underlying trends and shifts. Recently, the modeling and prediction of the full
system have been proposed as process model forecasting (PMF). However, the current SOTA lacks the ability to
model intricate control flow constructs while also not incorporating extra information, such as resources tied to
the process. Besides, by using univariate models, the underlying relations between the different elements of the
system are ignored. This proposal addresses these issues by firstly extending PMF to richer control flow models
that are able to capture relationships between activities in workflows. Secondly, the current PMF techniques
will be replaced by a multivariate framework based on state-of-the-art deep learning techniques such as graph
neural networks, which form a natural fit for graph-based models such as workflow diagrams and capture both
temporal, structural, and multiscale patterns. Next, additional perspectives, such as resources, will be added to
obtain fully object-centric process model forecasts, which can incorporate any data related to a process through
the forecasting of event knowledge graphs. Finally, two industry cases in finance and logistics will be used to
validate the findings in a real-life setting.
Keywords
Process Model Forecasting, Time Series Forecasting, Graph Neural Networks
1. Positioning and Motivation
Within the field of Process Mining, Predictive Process Monitoring (PPM) entails forecasting future
elements of ongoing process instances or cases, including the most probable next activities, outcomes,
and remaining runtime. Notably, the integration of machine learning and deep learning solutions
into this domain has been extensively explored in academic literature [1]. While real-time insights at
the individual case level allow process owners to intervene in specific instances, they often lack the
capacity to provide end-users with information regarding the future trajectory of the entire process.
Consequently, a new paradigm known as Process Model Forecasting (PMF) has emerged [2], focusing
on predicting future states of the overall process model over a long-term horizon, drawing information
from historical event data [3]. This Multidimensional Process Model Forecasting (MuDiPMF) project
aims to develop and validate a set of tailored and integrated multi-dimensional forecasting models using
multivariate predictive methods for multi-perspective business process models.
The current state-of-the-art in PMF involves depicting the evolution of process behavior through
time series analysis of individual Directly-Follows relations (DFs), which track the frequency of one
activity following another within cases, over a predefined timeframe [3]. These DFs collectively form a
Directly-Follows Graph (DFG), a widely utilized process visualization tool offering a clear representation
of the flow of the process. The individual DFs are forecasted using univariate time series forecasting
techniques, overlooking correlations between different DFs induced by the underlying relations between
process elements within the information system. Additionally, DFGs lack the capability to model more
nuanced process constructs, such as parallel behavior, in contrast to more semantically rich process
model notations like Petri nets and BPMN models.
Therefore the first objective of MuDiPMF is to forecast more semantically rich process models. This
entails expanding the feature set beyond DFs, such as the constructs utilized in advanced process
discovery techniques. Furthermore, multimodal predictive models can be used to forecast various
ICPM 2024 Doctoral Consortium, October 14–18, 2024, Kongens Lyngby, Denmark
$ yongbo.yu@kuleuven.be (Y. Yu)
0009-0004-2964-6611 (Y. Yu)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
time series of multidimensional feature sets simultaneously, thereby accounting for process related
dependencies and correlations. Next to this, MuDiPMF aims to enhance the forecasted process models
beyond the control-flow aspect, by incorporating additional dimensions such as resource occupation,
execution times, and decision point analyses. This would, among others, allow process owners to
perform timed interventions regarding bottlenecks, and optimize resource allocations. Furthermore,
in recent years Object-Centric Process Mining (OCPM) has emerged as a new family of approaches
tailored to handle event data from processes involving different interconnected objects such as orders,
items, and shipments, garnering widespread attention in both academia and industry [4]. Given the
rapid emergence and relevance of these object-centric process models, a third objective of MuDiPMF is
to develop a framework extending forecasting capabilities to such process models.
Figure 1: Overview of research gaps and objectives
In summary, the Multidimensional Process Model Forecasting (MuDiPMF) project will significantly
contribute to the current state-of-the-art in Business Process Management and Process Mining by
enhancing the recently proposed PMF framework by broadening the feature set to be predicted with
additional dimensions, while exploring the application of more suitable multivariate predictive methods.
Finally, the project aims to demonstrate the practical utility of the different PMF enhancements using
real-life process data from two different domains: financial services and logistics.
2. Current Solutions and Research Objectives
Figure 1 illustrates the research gaps and research objectives (ROs). Given the recent inception of the
PMF paradigm, the state-of-the-art is currently confined to univariate forecasting of distinct DFs [3].
More specifically, current solutions rely on auto-regressive time-series forecasting techniques such as
ARIMA and GARCH. The forecasted DFs can collectively represent a process model (DFG), but are
not sufficient to discover more complex process model structures such as parallelism. In contrast, the
literature on automated process discovery is more developed, with numerous approaches proposed over
the years to discover, e.g., Petri Nets or BPMN models [5]. Other examples include Heuristics Miner and
its extension Fodina, which utilizes various heuristics and formalisms to automatically discover among
others, concurrency, exclusive choices, and loops in a process from event logs [6], [7]. Another approach
involves a top-down strategy, exemplified by methods like Inductive Miner, which partitions larger
event logs into more manageable segments for analysis [8]. RO1 discusses expanding PMF towards
forecasting shifts in processes expressed by more semantically rich process model representations.
Next, the growing literature on PPM remains relevant despite its focus on single objectives such as
suffix prediction [9], or case outcome prediction [10]. Many of the predictive approaches have assumed
deep learning models such as long short-term memory networks and even graph neural networks.
Many operate a multivariate, but not a multitarget approach, as envisioned for RO2. Finally, a growing
interest in the object-centric perspective of process mining has been evident in recent years [11]. The
representation of these object-centric event logs as an event knowledge graph is especially of interest
for this project given the similarity of object-centric process models changing over time according to a
temporal graph-based structure of the data [12]. This will be addressed in RO3.
From an algorithmic perspective underpinning these applications, various data-driven and deep
learning approaches for multivariate time series forecasting have been proposed [13]. Given the graph-
based structure of process models, together with their emergence as powerful predictors in different
tasks, Graph Neural Networks (GNNs) are a natural match to learn both temporal and structural
properties of process models. Particularly, work incorporating the time dimension into GNNs to
investigate both spatial and temporal dependency together could provide PMF with more powerful and
flexible predictors capable of taking into account process-specific dependencies. Different approaches
for spatial-temporal graph neural networks (STGNNs), such as STGCN [14] and StemGNN [15], have
shown potential in domains such as traffic forecasting.
Multidimensional Process Model Forecasting (MuDiPMF)
RO1: Semantically rich RO2: Extend process model RO3: Object-centric process
process model representations forecasting to other dimensions model forecasting algorithm
Multidimensional Engineer data structures for Develop time series data Construct event knowledge
feature set semantically rich control -flow transformation techniques graphs of object-centric
process model forecasting. for bottleneck, resource, and event logs tailored to process
decision points. model forecasting.
WP1.1 WP2.1 WP3.1
Multivariate Develop advanced multivariate Design and implement a Develop heterogeneous
predictive models and multiscale forecasting GNN-based multidimensional graph-based predictive
algorithms for semantically process model forecasting models to forecast object -
rich process models. algorithm. centric process models.
WP1.2 WP2.2 WP3.2
Case study in financial services Case study in logistics industry
WP4.1 WP4.2
Figure 2: Overview of the work packages
3. Planned Research Methodology
Figure 2 presents a schematic overview of the proposed work plan designed based on [16], illustrating
the alignment of different work packages (WPs) with the four research objectives (ROs).
RO1 aims to extend the feature set of the forecasting techniques beyond DFs by incorporating process
representations and dependencies utilized in various widely used process discovery methodologies. For
example, we can forecast full dependency graphs, as this would allow us to discover parallel activities
or forecast the required metrics for the partition creation used by top-down discovery algorithms.
Correspondingly, existing predictive models will be replaced with multivariate models capable of
simultaneously forecasting all time series while accounting for cross-dependencies. One promising
avenue involves the exploration of multi-scale spatial-temporal graph neural networks.
RO2 extends process model forecasting capabilities to incorporate control-flow orthogonal dimen-
sions, including bottlenecks, resource allocation, and decision points. We aim to integrate resource
information into the feature set, considering multiple granularity levels from overall resource occupancy
to allocations at specific activities. Finally, we will incorporate the attention mechanism in GNNs to
extract more reliable and efficient patterns and leverage the multitask learning (MTL) framework to
implement a multidimensional PMF algorithm.
RO3 aims to design and implement a comprehensive object-centric process model forecasting al-
gorithm. To account for the complexity of object-centric event logs, we will develop an appropriate
event knowledge graph (EKG) structure on top of which a new process model forecasting algorithm can
be built. This would entail the construction of time series features for heterogeneous graph elements
within EKGs. Next to it, we aim to explore extra architectures tailored toward heterogeneous graph
forecasting. The initial avenue that will be pursued focuses on the use of heterogeneous temporal graph
neural networks.
RO4 aims to extend the impact of the developed techniques within MuDiPMF by deploying them
in practical applications across diverse industries, specifically targeting the financial services and
logistics sectors. The research group’s network will be leveraged to collaborate with two partnering
companies. Through these case studies, the objective is to demonstrate how the advancements made
can substantially improve the state-of-the-art in Process Model Forecasting (PMF) and highlight their
practical effectiveness. By validating our algorithms using real-world problems and data, we do not
only aim to emphasize their added value, but refinement and adaptation strategies will be developed to
make MuDiPMF algorithms extensible to other application domains.
Acknowledgments
This study was financed by the Research Foundation Flanders under grant number G039923N and
Internal Funds KU Leuven under grant number C14/23/031.
This Ph.D. thesis is supervised by Prof. dr. Johannes De Smedt and Prof. dr. Jochen De Weerdt.
References
[1] F. M. Maggi, C. Di Francescomarino, M. Dumas, C. Ghidini, Predictive monitoring of business
processes, in: Advanced Information Systems Engineering: 26th International Conference, CAiSE
2014, Thessaloniki, Greece, June 16-20, 2014. Proceedings 26, Springer, 2014, pp. 457–472.
[2] R. Poll, A. Polyvyanyy, M. Rosemann, M. Röglinger, L. Rupprecht, Process forecasting: Towards
proactive business process management, in: Business Process Management: 16th International
Conference, BPM 2018, Sydney, NSW, Australia, September 9–14, 2018, Proceedings 16, Springer,
2018, pp. 496–512.
[3] J. De Smedt, A. Yeshchenko, A. Polyvyanyy, J. De Weerdt, J. Mendling, Process model forecasting
and change exploration using time series analysis of event sequence data, Data & Knowledge
Engineering 145 (2023) 102145.
[4] W. M. van der Aalst, Object-centric process mining: Dealing with divergence and convergence in
event data, in: Software Engineering and Formal Methods: 17th International Conference, SEFM
2019, Oslo, Norway, September 18–20, 2019, Proceedings 17, Springer, 2019, pp. 3–25.
[5] A. Augusto, R. Conforti, M. Dumas, M. La Rosa, F. M. Maggi, A. Marrella, M. Mecella, A. Soo,
Automated discovery of process models from event logs: Review and benchmark, IEEE transactions
on knowledge and data engineering 31 (2018) 686–705.
[6] A. J. Weijters, W. M. van Der Aalst, A. A. De Medeiros, Process mining with the heuristicsminer
algorithm (2006).
[7] S. K. vanden Broucke, J. De Weerdt, Fodina: A robust and flexible heuristic process discovery
technique, decision support systems 100 (2017) 109–118.
[8] S. J. Leemans, D. Fahland, W. M. Van Der Aalst, Discovering block-structured process models from
event logs-a constructive approach, in: Application and Theory of Petri Nets and Concurrency:
34th International Conference, PETRI NETS 2013, Milan, Italy, June 24-28, 2013. Proceedings 34,
Springer, 2013, pp. 311–329.
[9] M. Camargo, M. Dumas, O. González-Rojas, Learning accurate lstm models of business processes,
in: Business Process Management: 17th International Conference, BPM 2019, Vienna, Austria,
September 1–6, 2019, Proceedings 17, Springer, 2019, pp. 286–302.
[10] I. Teinemaa, M. Dumas, M. L. Rosa, F. M. Maggi, Outcome-oriented predictive process monitoring:
Review and benchmark, ACM Transactions on Knowledge Discovery from Data (TKDD) 13 (2019)
1–57.
[11] R. Galanti, M. De Leoni, N. Navarin, A. Marazzi, Object-centric process predictive analytics, Expert
Systems with Applications 213 (2023) 119173.
[12] D. Fahland, Process mining over multiple behavioral dimensions with event knowledge graphs,
in: Process mining handbook, Springer, 2022, pp. 274–319.
[13] B. Lim, S. Zohren, Time-series forecasting with deep learning: a survey, Philosophical Transactions
of the Royal Society A 379 (2021) 20200209.
[14] B. Yu, H. Yin, Z. Zhu, Spatio-temporal graph convolutional networks: A deep learning framework
for traffic forecasting, arXiv preprint arXiv:1709.04875 (2017).
[15] D. Cao, Y. Wang, J. Duan, C. Zhang, X. Zhu, C. Huang, Y. Tong, B. Xu, J. Bai, J. Tong, et al., Spectral
temporal graph neural network for multivariate time-series forecasting, Advances in neural
information processing systems 33 (2020) 17766–17778.
[16] J. Mendling, H. Leopold, H. Meyerhenke, B. Depaire, Methodology of algorithm engineering, arXiv
preprint arXiv:2310.18979 (2023).