=Paper= {{Paper |id=Vol-2973/paper_265 |storemode=property |title=Hybrid Process Modeling and Mining |pdfUrl=https://ceur-ws.org/Vol-2973/paper_265.pdf |volume=Vol-2973 |authors=Anti Alman |dblpUrl=https://dblp.org/rec/conf/bpm/Alman21 }} ==Hybrid Process Modeling and Mining== https://ceur-ws.org/Vol-2973/paper_265.pdf
Hybrid Process Modeling and Mining
Anti Alman1
1
    University of Tartu, Tartu, Estonia


                                         Abstract
                                         One of the cornerstones of business process management and process mining is the ability to describe
                                         any business process systematically as a process model. The notations used for these descriptions can, in
                                         general, be divided into two paradigms: procedural and declarative. The procedural paradigm describes
                                         processes in terms of explicit process control flow, whereas the declarative paradigm describes processes
                                         in terms of process rules while leaving the process control flow implicit. This difference has significant
                                         implications for describing knowledge-intensive processes. A knowledge-intensive process may have
                                         thousands of case variants, all of which may correspond to valid process executions. While procedural
                                         models are generally considered more understandable, describing a knowledge-intensive process using
                                         the procedural paradigm often results in an unreadable spaghetti model. The declarative paradigm
                                         avoids this issue by leaving the control flow of the process implicit. However, this has a tendency to
                                         hide the overall structure of the process, and therefore again leading to understandability issues. During
                                         this project we plan to address these issues by first analysing the existing notations suitable for process
                                         modeling, and then using the results of this analysis for developing a novel hybrid process modeling
                                         approach and a set of corresponding process mining techniques.

                                         Keywords
                                         Process modeling, Process mining, Knowledge-intensive processes, Hybrid modeling




1. Introduction and Motivation
In the field of Business Process Management (BPM) there are two types of languages for
describing a process model [1]. The first type is procedural languages (sometimes referred to as
imperative languages), which aim to describe end-to-end processes and allow only for activities
that are explicitly triggered through control-flow. Some examples of procedural languages are
BPMN [2], Petri Nets [3] and YAWL [4]. The second type is declarative languages, which aim
to describe the process as a set of rules that the process should follow, while allowing for any
control flow that does not conflict with these rules. Some examples of declarative languages are
Declare [5], DCR Graphs [6] and OCBC models [7].
   Both procedural and declarative languages have their strengths and weaknesses [8, 9, 10].
Procedural languages are considered to be especially suitable for predictable processes that
have very few deviations from the main flow of the process (sometimes referred to as strict
or structures processes). Conversely, the declarative approach is considered to be especially
suitable for less predictable processes where many deviations from the main flow of the process

Proceedings of the Demonstration & Resources Track, Best BPM Dissertation Award, and Doctoral Consortium at BPM
2021 co-located with the 19th International Conference on Business Process Management, BPM 2021, Rome, Italy,
September 6–10, 2021
" anti.alman@ut.ee (A. Alman)
                                       © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)
are possible (sometimes referred to as flexible or unstructured processes). An example of the
latter can be seen in knowledge-intensive processes [11].
   This doctoral project aims to develop a hybrid approach for process modeling and mining
that would combine the strengths while also addressing the weaknesses of both procedural and
declarative languages. More specifically, the current direction of research is to use aspects of
the procedural languages to represent the overall structure of the process and to use aspects of
the declarative languages to represent parts or rules of the process where the order of activities
is less relevant than the business rules associated with these activities.


2. Related Work
Developing a hybrid process modeling and mining approach has been previously suggested
and there are indications that practitioners would be open to such an approach [10]. There has
also been some research in that direction both in the context of process modeling and process
mining [12]. Some examples are:

    • BPMN-D [13], which is a conservative extension of the standard Business Process Model
      and Notation (BPMN) for declarative process modeling.
    • Pockets of flexibility [14, 15], which is a concept for handling flexibility in workflow
      specifications.
    • Hybrid Miner [16], which discovers a hybrid model where each subprocess is either a
      Declare model or a Petri net.
    • Fusion Miner [17], which first uses Heuristics Miner to discover a procedural process
      model and then Declare Miner to refine the model by replacing some of the flow connec-
      tions of the procedural model with declarative constraints.

Overall, the area of hybrid modeling and mining is still relatively unexplored and there are
ample opportunities for research and development of novel approaches.


3. Example
Consider a possible workflow for the treatment of a patient consisting of the following sequential
steps: admittance, diagnosis, treatment, discharge, follow-up care. Such steps would be best
described with a procedural language. However, each of these steps is likely to contain a high
variability of different activities depending on the conditions being treated, status of the patient,
analysis results, etc. Such high variability would be better described as a set of business rules,
which can be confined to one of the main steps but could also apply to multiple steps.
    For example, during diagnosis there may be some analysis that that will always be performed
together (rule confined to one main step) but may also not be performed at all (irrelevant to this
case). The decision of which analysis to perform can depend on the complaints of the patient
during admittance (rule applying to multiple steps). If there is a specific complaint during
admittance then this could require some events to occur during follow-up care, meaning that
there could possibly be hundreds of events between when a business rule is triggered and when
it is met (rule applying to multiple steps).
Table 1
Envisioned phases and the current status of the doctoral project.
 Phase                           Summary                                               Status
 Systematic literature review    Identification and analysis existing notations that   In Progress
                                 are suitable for process modeling. The focus is
                                 mainly on declarative and hybrid notations.
 Development of process          Development of hybrid process mining approaches       In Progress
 mining approaches               for the main process mining branches.
 Requirements    formaliza-      Setting the scope for the new hybrid notation based   Pending
 tion                            on the systematic literature review.
 Notation development and        Creating a prototype of the new hybrid notion and     Pending
 evaluation                      identifying further improvement opportunities via a
                                 user evaluation.
 Process modeling and            Implementing a process discovery approach based       Pending
 mining case study               on the new hybrid notation and validating it via a
                                 case study based on a real-life data set.


  These are just some examples of the complexities that should be taken into account when
modeling processes which have both strict and flexible aspects. Suitable notations already
exist for both ends of the flexibility spectrum in isolation (e.g. BPMN and Declare). However,
existing notations tend to be more focused on one end of the spectrum while often handling
the other end of the spectrum in an ad hock manner. As a result, when working with processes
having both strict and flexible aspects (such as the example above) one is often forced to prioritize
precise representation of flexible aspects at the expense of strict aspects or vice versa.


4. Research Plan and Current Results
This doctoral project is currently at an early stage and therefore the current research plan is
not fully fixed and some adjustments are likely to occur. We envision this doctoral project
consisting of five phases starting from an extensive review of related works and culminating
with the evaluation of the developed hybrid process modeling and mining approaches. An
overview of the project phases and the current status is presented in Table 1. Furthermore, a
short description of the first two phases together with a brief overview of the current status
and results is given in the following sections.

4.1. Systematic Literature Review
The Systematic Literature Review (SLR) will be conducted based on the guidelines presented
in [18]. The contribution of this SLR will be twofold. First, we will identify existing notations
and languages that are suitable for modeling business processes. Second, we will analyse the
characteristics of the identified languages and create a corresponding classification framework.
Given that the procedural language BPMN is specified as an Object Management Group (OMG)
standard [2], the SLR will focus on declarative and hybrid languages. However, we have not
designed the SLR to specifically exclude other procedural languages.
   The resulting classification framework will be used in the third phase of the project to
formalize the requirements of the new hybrid notation and to select the possible base languages
that we believe can be either successfully combined or that have some characteristics we believe
can be utilized for creating the new hybrid notation. The SLR is currently in progress. The
search for primary studies is completed, and we are in the study selection phase of the SLR.

4.2. Development of Process Mining Approaches
The purpose of this phase is to develop process mining approaches that would support the
adoption of the hybrid process modeling notation developed during this project. Initially the
focus will be on approaches that are not language specific. Later, during phases three and
four, the focus will shift towards supporting the new hybrid notation. Ideally, approaches for
discovery, conformance checking, and monitoring will be developed. However, this list is not
yet final and will depend on the results of the SLR and the requirements formalization.
   Currently we have been considering the scenario in which a single process execution is
governed by multiple process specifications. For example, consider a single patient having
multiple medical conditions, all of which must be treated simultaneously. This would necessitate
following multiple clinical guidelines (procedural models) simultaneously, which additionally
must be overlaid with basic medical knowledge (declarative models). The interplay of these
models may lead to conflicts (situations where violating at least one model becomes inevitable).
For example, different guidelines recommending to administer drugs, which on their own are
safe, but may have serious side-effect when combined.
   Inspired by the above-mentioned scenario, we have developed a prototype of a monitoring
approach for hybrid process specifications. This approach does not focus on a specific language,
but instead uses standard formal semantics for expressing procedural and declarative models.
Petri Nets and 𝐿𝑇 𝐿𝑓 formulas [19] (respectively) were chosen at this stage, because both can
be translated into finite-state automata [20, 21] in a way that allows to compute the cross-
product of these automata. The cross-product will represent a conjunction of the behaviours
specified by the individual models, therefore allowing for monitoring multiple process models
simultaneously. Additionally, the cross-product can be augmented with violation costs such
that recommendations on the next course of action in case of conflicts become possible.


Acknowledgments
The work of A. Alman was supported by the Estonian Research Council (project PRG1226) and
ERDF via the IT Academy Program.


References
 [1] W. M. P. van der Aalst, Process Mining - Discovery, Conformance and Enhancement of
     Business Processes, Springer, 2011.
 [2] B. P. Model, Notation (BPMN) version 2.0, OMG Specification, Object Management Group
     (2011).
 [3] W. M. P. van der Aalst, C. Stahl, Modeling Business Processes - A Petri Net-Oriented
     Approach, Cooperative Information Systems series, MIT Press, 2011.
 [4] A. H. M. ter Hofstede, W. M. P. van der Aalst, M. Adams, N. Russell (Eds.), Modern Business
     Process Automation - YAWL and its Support Environment, Springer, 2010.
 [5] M. Pesic, H. Schonenberg, W. M. P. van der Aalst, DECLARE: full support for loosely-
     structured processes, in: EDOC, IEEE Computer Society, 2007, pp. 287–300.
 [6] T. T. Hildebrandt, R. R. Mukkamala, Declarative event-based workflow as distributed
     dynamic condition response graphs, in: PLACES, volume 69 of EPTCS, 2010, pp. 59–73.
 [7] A. Artale, A. Kovtunova, M. Montali, W. M. P. van der Aalst, Modeling and reasoning
     over declarative data-aware processes with object-centric behavioral constraints, in: BPM,
     volume 11675 of Lecture Notes in Computer Science, Springer, 2019, pp. 139–156.
 [8] S. Zugal, J. Pinggera, B. Weber, The impact of testcases on the maintainability of declarative
     process models, in: BMMDS/EMMSAD, volume 81 of Lecture Notes in Business Information
     Processing, Springer, 2011, pp. 163–177.
 [9] P. Pichler, B. Weber, S. Zugal, J. Pinggera, J. Mendling, H. A. Reijers, Imperative versus
     declarative process modeling languages: An empirical investigation, in: Business Process
     Management Workshops (1), volume 99 of Lecture Notes in Business Information Processing,
     Springer, 2011, pp. 383–394.
[10] H. A. Reijers, T. Slaats, C. Stahl, Declarative modeling-an academic dream or the future
     for bpm?, in: BPM, volume 8094 of Lecture Notes in Computer Science, Springer, 2013, pp.
     307–322.
[11] C. D. Ciccio, A. Marrella, A. Russo, Knowledge-intensive processes: Characteristics,
     requirements and analysis of contemporary approaches, J. Data Semant. 4 (2015) 29–57.
[12] T. Slaats, Declarative and hybrid process discovery: Recent advances and open challenges,
     J. Data Semant. 9 (2020) 3–20.
[13] G. D. Giacomo, M. Dumas, F. M. Maggi, M. Montali, Declarative process modeling in
     BPMN, in: CAiSE, volume 9097 of Lecture Notes in Computer Science, Springer, 2015, pp.
     84–100.
[14] S. W. Sadiq, W. Sadiq, M. E. Orlowska, Pockets of flexibility in workflow specification, in:
     ER, volume 2224 of Lecture Notes in Computer Science, Springer, 2001, pp. 513–526.
[15] S. W. Sadiq, M. E. Orlowska, W. Sadiq, Specification and validation of process constraints
     for flexible workflows, Inf. Syst. 30 (2005) 349–378.
[16] F. M. Maggi, T. Slaats, H. A. Reijers, The automated discovery of hybrid processes, in:
     BPM, volume 8659 of Lecture Notes in Computer Science, Springer, 2014, pp. 392–399.
[17] J. D. Smedt, J. D. Weerdt, J. Vanthienen, Fusion miner: Process discovery for mixed-
     paradigm models, Decis. Support Syst. 77 (2015) 123–136.
[18] B. Kitchenham, S. Charters, Guidelines for performing systematic literature reviews in
     software engineering, 2007.
[19] A. Bauer, M. Leucker, C. Schallhart, Comparing LTL semantics for runtime verification, J.
     Log. Comput. 20 (2010) 651–674.
[20] M. Notomi, T. Murata, Hierarchical reachability graph of bounded petri nets for concurrent-
     software analysis, IEEE Trans. Software Eng. 20 (1994) 325–336.
[21] D. Giannakopoulou, K. Havelund, Automata-based verification of temporal properties on
     running programs, in: ASE, IEEE Computer Society, 2001, pp. 412–416.