Rome, Italy
* Corresponding author.
Email: manal.laghmouch@uhasselt.be (M. Laghmouch); benoit.depaire@uhasselt.be (B. Depaire); Nicola.Gigante@unibz.it (N. Gigante); mieke.jans@uhasselt.be (M. Jans); montali@unibz.it (M. Montali)

Declare MoGeS: Model Generator and Specializer

Manal Laghmouch 1,2, Benoît Depaire, Nicola Gigante, Mieke Jans 1,2 and Marco Montali

0 Free University of Bozen-Bolzano, Piazza Università, 1, 39100 Bolzano BZ, Italy
1 Hasselt University, Martelarenlaan 42, 3500 Hasselt, Belgium
2 Maastricht University, Minderbroedersberg 4-6, 6211 LK Maastricht, Netherlands

2023

This demo introduces Declare MoGeS, an automated approach for generating and specializing Declare process models that can be used as input for log generation. Specializing Declare models is particularly useful for producing event logs that cover a subset of the behavior of other logs. Declare MoGeS integrates with existing log generators, streamlining the log generation process.

Keywords: Declare, Model Generation, Model Specialization, Linear Temporal Logic

1. Introduction

specialization primarily involves adding constraints to an initial model, resulting in limited variations of process models. To address this limitation, the second objective of the demo is to propose an automated approach to generate specializations by adapting constraints from an initial model, thereby enabling controlled variations [3, 4]. For instance, a constraint stating that activity a should be followed by activity b (i.e., Response(a,b)) can be specialized by requiring that b occurs immediately after a (i.e., ChainResponse(a,b)).
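The difference between Response(a,b) and its specialization ChainResponse(a,b) can be made concrete by evaluating both on finite traces. The following is a minimal sketch, not part of Declare MoGeS: the function names and the representation of a trace as a list of activity labels are assumptions made for illustration.

```python
from typing import Sequence


def response(trace: Sequence[str], a: str, b: str) -> bool:
    """Response(a,b): every occurrence of a is eventually followed by b."""
    return all(b in trace[i + 1:] for i, event in enumerate(trace) if event == a)


def chain_response(trace: Sequence[str], a: str, b: str) -> bool:
    """ChainResponse(a,b): every occurrence of a is immediately followed by b."""
    return all(
        i + 1 < len(trace) and trace[i + 1] == b
        for i, event in enumerate(trace)
        if event == a
    )


# b follows a, but not immediately: only the weaker constraint holds.
loose = ["a", "c", "b"]
# b follows a immediately: both constraints hold.
strict = ["a", "b", "c"]

print(response(loose, "a", "b"), chain_response(loose, "a", "b"))
print(response(strict, "a", "b"), chain_response(strict, "a", "b"))
```

Every trace accepted by chain_response is also accepted by response, which is why replacing Response(a,b) with ChainResponse(a,b) restricts the allowed behavior.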

2. Innovations and Main Features

Given that the demo has to be able to (1) generate artificial declarative process models and (2) specialize declarative process models that can serve as input to generate event logs, the developed declare Model Generator and Specializer (Declare MoGeS) adheres to the following requirements.

• Declarative Modeling Language – describe business processes in a flexible declarative language (declare).
• Consistency – the generated and specialized models only consist of non-contradictory constraints.
• Specialization of Process Models – enable refinement and tailoring of model behavior.
• Balance Between Randomness and User Control – allow for variations of process models and, at the same time, enough control over the generated models.
• Compatibility of Output Models with Existing Log Generators – the output model is a declare model saved in a file format 1 suitable as input for existing log generators.

In the following subsections, we describe the algorithms behind Declare MoGeS.

2.1. Generating a Random Declare Model

Algorithm 1 shows the inputs for model generation: the desired number of activities (the size of the alphabet) and of constraints, a set of declare templates that can be selected to generate a model, and the probability that a particular declare template is chosen.

Model generation starts by initializing an empty list of declare constraints. This list will eventually form the generated model. Next, a declare constraint is selected by randomly choosing a template from the set of templates, taking the template probabilities into account. Afterward, activities from the alphabet are chosen to instantiate the template, which yields a candidate constraint.
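The selection step can be sketched as a weighted draw of a template followed by a draw of activities. The snippet below is an illustrative sketch only: the function name, the arity table, the activity labels a0, a1, ... and the choice to draw distinct activities are assumptions, not the tool's actual interface.

```python
import random
from typing import Dict, List, Optional, Tuple

# Hypothetical arity table: template name -> number of activity parameters.
TEMPLATE_ARITY: Dict[str, int] = {
    "Existence": 1,
    "Response": 2,
    "ChainResponse": 2,
}


def sample_constraint(
    alphabet_size: int,
    templates: List[str],
    weights: List[float],
    rng: Optional[random.Random] = None,
) -> Tuple[str, Tuple[str, ...]]:
    """Draw one candidate constraint as (template, activities)."""
    rng = rng or random.Random()
    # Assumed naming scheme for the activities of the alphabet.
    alphabet = [f"a{i}" for i in range(alphabet_size)]
    # Weighted choice of a template according to the given probabilities.
    template = rng.choices(templates, weights=weights, k=1)[0]
    # Assumption: the parameters of one constraint are distinct activities.
    activities = tuple(rng.sample(alphabet, TEMPLATE_ARITY[template]))
    return template, activities


print(sample_constraint(4, ["Response", "ChainResponse"], [0.7, 0.3]))
```

Passing a seeded random.Random instance makes the draw reproducible, which is convenient when the same model has to be regenerated.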

The candidate constraint is added to the model if it complies with the following two key conditions. First, the candidate constraint must be consistent with the constraints already present in the model, i.e., their conjunction must be satisfiable. We refer to the existing set of constraints as the temporary model. For instance, consider a temporary model consisting of the constraints [Response(a,b), ChainResponse(b,c)]. The constraint ChainResponse(b,d) would be inconsistent because it contradicts ChainResponse(b,c): whenever b occurs, the next event cannot be both c and d. Second, the new constraint should not be redundant. For example, ChainResponse(b,d) (i.e., if b occurs, then d should occur in the next position) implies Response(b,d) (i.e., if b occurs, then d should occur eventually after b). In this case, adding Response(b,d) to a model that already includes ChainResponse(b,d) is redundant.

1 *.decl file format

Input:  Size of the alphabet of activities
        Number of declare constraints
        List of declare constraint templates
        Initial probability of choosing templates
        Number of subsequent tries to add a constraint
Output: Set of declare constraints (model)

Initialize: model = [], added ← 0, tries ← 0
while added < number of constraints do
    candidate = random(templates, probabilities, alphabet)
    if candidate is consistent w.r.t. model and candidate is not redundant w.r.t. model then
        model ← model ∪ candidate
        added ← added + 1
        tries ← 0
    else
        tries ← tries + 1
        if tries > maximum number of tries then
            print No model found with the given parameters
            return model
return model

Algorithm 1: Model Generator

Both conditions, i.e., consistency and non-redundancy, are checked with BLACK [5] by using the Linear Temporal Logic over finite traces (LTLf) encoding of the declare constraints. If both conditions are met, the candidate constraint is added to the model; otherwise, it is discarded.
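For the two templates used in the examples, the usual LTLf encodings and the two checks can be written as follows. This is a sketch under assumptions: the symbols M (temporary model) and φ (candidate constraint) are introduced here for illustration, X denotes the strong next operator of LTLf, and the source does not spell out how the redundancy check is posed to BLACK, so the reduction to unsatisfiability shown here is the standard one rather than a confirmed detail of the tool.

```latex
\begin{align*}
\textsf{Response}(a,b) &\equiv \mathsf{G}\,(a \rightarrow \mathsf{F}\, b) \\
\textsf{ChainResponse}(a,b) &\equiv \mathsf{G}\,(a \rightarrow \mathsf{X}\, b) \\
\varphi \text{ is consistent w.r.t. } M &\iff \Bigl(\bigwedge_{\psi \in M} \psi\Bigr) \wedge \varphi \ \text{ is satisfiable} \\
\varphi \text{ is redundant w.r.t. } M &\iff \Bigl(\bigwedge_{\psi \in M} \psi\Bigr) \wedge \neg\varphi \ \text{ is unsatisfiable}
\end{align*}
```

Under these encodings, ChainResponse(b,d) entails Response(b,d) because a position where d holds next is also a position after which d eventually holds.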

The algorithm keeps track of how many consecutive times a constraint has been discarded. This process continues until the desired number of constraints is reached (the model is returned) or until a candidate constraint cannot be added for the maximum number of tries in a row (a message is shown to the user and the model is returned).
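The loop described above can be sketched in Python with the two checks injected as callbacks. This is a hedged illustration, not the tool's implementation: generate_model, its parameter names, and the callback signatures are hypothetical, and the toy checkers in the usage lines merely stand in for the BLACK-based checks.

```python
import random
from typing import Callable, List, Tuple

Constraint = Tuple[str, Tuple[str, ...]]
Check = Callable[[List[Constraint], Constraint], bool]


def generate_model(
    n_constraints: int,
    max_tries: int,
    sample: Callable[[], Constraint],
    is_consistent: Check,
    is_redundant: Check,
) -> List[Constraint]:
    """Add sampled constraints until the target size or the retry limit is hit."""
    model: List[Constraint] = []
    failed_in_a_row = 0
    while len(model) < n_constraints:
        candidate = sample()
        if is_consistent(model, candidate) and not is_redundant(model, candidate):
            model.append(candidate)
            failed_in_a_row = 0  # a success resets the retry counter
        else:
            failed_in_a_row += 1
            if failed_in_a_row > max_tries:
                print("No model found with the given parameters")
                break
    return model


# Usage with toy stand-ins for the real checks (assumptions for illustration).
candidates = [
    ("Response", ("a", "b")),
    ("ChainResponse", ("b", "c")),
    ("Response", ("b", "c")),
]
demo = generate_model(
    n_constraints=2,
    max_tries=10,
    sample=lambda: random.choice(candidates),
    is_consistent=lambda model, c: True,       # toy: everything is consistent
    is_redundant=lambda model, c: c in model,  # toy: only exact duplicates
)
print(demo)
```

Keeping the checks behind callbacks separates the control flow of the generator from the satisfiability checker, so the sketch runs without an external solver.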

2.2. Specializing a Declare Model

Algorithm 2 shows the process for specializing a declare process model. To specialize a model, the user provides an initial model consisting of the constraints that need to be specialized. Optionally, the user can input a set of constraints from the initial model that should be kept in the specialized model. Furthermore, the user sets a specialization percentage, which defines the probability that a constraint will be specialized.

Input:  Initial declare model
        Specialization percentage
        Initial specialized model
Output: A specialization of the initial declare model

for each constraint in the initial declare model do
    if the constraint can be specialized then
        if random() < specialization percentage then
            Generate a specialization of the constraint
            if the specialization ∉ specialized model then
                specialized model ← specialized model ∪ specialization
        else
            specialized model ← specialized model ∪ constraint
    else if the constraint ∉ specialized model then
        specialized model ← specialized model ∪ constraint
return specialized model

Algorithm 2: Model Specializer

The specialization process (Algorithm 2) iterates over the constraints of the initial model. If a constraint can be specialized, a specialization of it is added to the specialized model with a probability given by the specialization percentage; if the constraint is not selected or cannot be specialized, the constraint itself is added to the specialized model. The process ends when all constraints from the initial model have been considered. The resulting model is a specialization of the initial model.
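The specialization loop can be sketched as follows. This is an illustration under assumptions rather than the tool's code: specialize_model, its parameters, and the STRONGER table are hypothetical, the table covers only the Response family of templates, the percentage is given as a fraction between 0 and 1, and duplicates are skipped in every branch as a small simplification.

```python
import random
from typing import Dict, Iterable, List, Optional, Tuple

Constraint = Tuple[str, Tuple[str, ...]]

# Hypothetical specialization table: template -> stricter templates.
STRONGER: Dict[str, List[str]] = {
    "Response": ["AlternateResponse", "ChainResponse"],
    "AlternateResponse": ["ChainResponse"],
}


def specialize_model(
    initial_model: Iterable[Constraint],
    percentage: float,
    keep: Iterable[Constraint] = (),
    rng: Optional[random.Random] = None,
) -> List[Constraint]:
    """Replace specializable constraints by a stricter one with the given probability."""
    rng = rng or random.Random()
    specialized: List[Constraint] = list(keep)

    def add(constraint: Constraint) -> None:
        # Simplification: never store the same constraint twice.
        if constraint not in specialized:
            specialized.append(constraint)

    for template, activities in initial_model:
        options = STRONGER.get(template, [])
        if options and rng.random() < percentage:
            # Keep the activities, swap in a stricter template.
            add((rng.choice(options), activities))
        else:
            # Not specializable or not selected: keep the original constraint.
            add((template, activities))
    return specialized


model = [("Response", ("a", "b")), ("Existence", ("c",))]
print(specialize_model(model, percentage=0.5))
```

With a percentage of 1.0 every specializable constraint is replaced, and with 0.0 the initial model is returned unchanged, which mirrors the role of the specialization percentage described above.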

3. Maturity

Declare MoGeS is implemented in Python and stored in a GitHub repository 2. A video tutorial demonstrating the tool's usage is available in the same repository as a starting point for new users.

We tested Declare MoGeS in a total of 2392 runs, each aimed at artificially generating a declare process model and automatically specializing the generated model at four distinct percentages (30%, 50%, 70%, and 100%). Approximately 75% of the runs generated models containing between 5 and 25 constraints, each within 11 minutes. Models with fewer than 16 constraints were generated almost instantly, with a median time of less than a second. However, for models comprising more than 35 constraints, the execution time could exceed an hour. This prolonged execution was primarily attributed to the computationally intensive consistency and non-redundancy checks performed by BLACK.

2 https://github.com/manallaghmouch/DeclareMoGeS

On the other hand, the Model Specializer was efficient throughout our tests, with running times consistently below one second for all specialization percentages. These results highlight the effectiveness of specialization through adapting constraints.

4. Conclusion and Future Work

This paper presents a novel approach for automatically generating and specializing declare process models to facilitate log generation. The effectiveness of the approach is demonstrated and evaluated, highlighting its ability to swiftly generate and specialize declare models containing 5 to 25 constraints.

Future research can extend the approach in several directions. One potential avenue involves incorporating a data-aware aspect. After integration, studies can evaluate data-aware process discovery algorithms using logs generated from data-aware input models. Additionally, it would be interesting to extend the study beyond the predefined templates offered by the declare language. Future research will explore LTL formulas that go beyond the existing templates.

Acknowledgments

Manal Laghmouch thanks Research Foundation - Flanders for the SB PhD fellowship (1S40622N) granted to support this research. Nicola Gigante acknowledges the support of the PURPLE project, 1st Open Call for Innovators of the AIPlan4EU H2020 project, a project funded by the EU Horizon 2020 research and innovation programme under GA n. 101016442 (since 2021).

References

[1] Jouck, Depaire, Generating artificial data for empirical analysis of control-flow discovery algorithms: a process tree and log generator, Business & Information Systems Engineering 61 (2019) 695-712.

[2] Di Ciccio, M. L. Bernardi, Cimitile, F. M. Maggi, Generating event logs through the simulation of declare models, in: Enterprise and Organizational Modeling and Simulation: 11th International Workshop, EOMAS 2015, Springer, 2015, pp. 20-36.

[3] D. M. Schunselaar, F. M. Maggi, Sidorova, Patterns for a log-based strengthening of declarative compliance models, in: International Conference on Integrated Formal Methods, Springer, 2012, pp. 327-342.

[4] R. De Masellis, C. Di Francescomarino, C. Ghidini, F. M. Maggi, Declarative process models: Different ways to be hierarchical, in: International Conference on Service-Oriented Computing, Springer, 2016, pp. 104-119.

[5] Geatti, Gigante, Montanari, BLACK: A fast, flexible and reliable LTL satisfiability checker, in: Proceedings of the 3rd Workshop on Artificial Intelligence and fOrmal VERification, Logic, Automata, and sYnthesis, volume 2987, CEUR-WS, 2021, pp. 7-12.