=Paper= {{Paper |id=Vol-1592/paper06 |storemode=property |title=Transition Systems Reduction: Balancing between Precision and Simplicity |pdfUrl=https://ceur-ws.org/Vol-1592/paper06.pdf |volume=Vol-1592 |authors=Sergey A. Shershakov,Anna A. Kalenkova,Irina A. Lomazova |dblpUrl=https://dblp.org/rec/conf/apn/ShershakovKL16 }} ==Transition Systems Reduction: Balancing between Precision and Simplicity== https://ceur-ws.org/Vol-1592/paper06.pdf
        Transition Systems Reduction: Balancing
           between Precision and Simplicity

        Sergey A. Shershakov, Anna A. Kalenkova, and Irina A. Lomazova

               National Research University Higher School of Economics,
                     20 Myasnitskaya st., 101000 Moscow, Russia,
                                sshershakov@hse.ru,
                       WWW home page: http://pais.hse.ru



        Abstract Transition systems are a powerful formalism, which is widely
        used for process model representation. A number of approaches were pro-
        posed in the process mining ﬁeld to tackle the problem of constructing
        transition systems from event logs. Existing approaches discover tran-
        sition systems that are either too large or too small. In this paper we
        propose an original approach to discover transition systems that per-
        fectly ﬁt event logs and whose size is adjustable depending on the user’s
        need. The proposed approach allows achieving a required balance be-
        tween simple and precise models.

        Keywords: transition systems, process mining, model reduction, pro-
        cess model quality


1     Introduction

Process mining is a relatively new discipline, whose basic research and practi-
cal purpose is to extract process models from data given in the form of event
logs, checking existing models for conformance to actual processes and improving
them. Transition systems are extensively used to formalize processes extracted
from event logs. A transition system can be constructed from an event log by
using preﬁx-based techniques in a very natural way [2]. We consider several met-
rics that describe the model’s quality [7]. Replay ﬁtness quantiﬁes the extent to
which a process model can reproduce the behavior recorded in a log. Complexity
of the model is estimated by simplicity and precision (the metrics, which shows
how precise the model is in respect to the event log).
    The major weakness of models constructed from real-life event logs is their
size. Despite the fact that there are a number of approaches aimed to reduce
the size of transition systems [2], application of the existing approaches results
in either too large or too small models. In the former case the model size is big
enough for being readable. Furthermore, it becomes diﬃcult or even impossible

    This work is supported by the Basic Research Program at the National Research
    University Higher School of Economics in 2016 and the Russian Foundation for Basic
    Research, project No. 15-37-21103.




                                          78
to apply existing transition system analysis techniques that are sensitive to the
size of input models. For example, the state-based region algorithm [8] has an
exponential complexity dependence on the size of the input model, so its applica-
bility is limited with fairly small models. In the latter case, due to states merging
a rather small model implies considerably much of extra behavior, which makes
the model less precise and thus less applicable.
    The main goal of our study is to develop an approach for reducing the size
of a transition system mined from an event log in a ﬂexible manner. This paper
describes an original 3-step algorithm achieving the goal by using a variable-
size window based on a state frequency characteristic. The approach preserves
(perfect) ﬁtness of a model and balances between its simplicity and precision by
introducing a set of adjustable parameters.
    Thus, the main contributions are as follows: (1) an original method for re-
ducing transition systems and justiﬁcation of its applicability; (2) a set of experi-
mental results which show the advantages of the proposed approach as compared
with existing methods; an openly available proof of the concept through imple-
mentation in a set of ProM plug-ins.
    The remaining part of the paper is organized as follows. Section 2 gives an
overview of related work in the context of inferring transition systems and their
application in the process mining domain. Section 3 introduces basic concepts
used further in this paper. A detailed description of the proposed algorithm is
given in Section 4. A novel precision calculation algorithm, some signiﬁcant im-
plementation details, and experimental results are discussed in Section 5. Finally,
Section 6 concludes the paper and discusses some directions for future work.


2   Related work
A number of works concerning inferring transition systems from event traces
exist. Biermann and Feldman in their work [6] proposed a k-tails algorithm
which merges states of a FSM by basing on the similarity of their behavior. The
algorithm falls into the class of preﬁx tree merging methods. Angluin [4] pro-
posed a method of preﬁx tree states merging based on a notion of k-reversibility.
In [11], Lorenzoli et al. proposed a GK-tail approach, an extension of the k-tail
algorithm, dealing with parametrized ﬁnite state automata.
    Cook and Wolf in their work [9] introduced process discovery, a new data
analysis technique in the context of software engineering processes. They conside-
red automatic generation of a formal model describing an ongoing process from
captured event data. A new Markov method was developed speciﬁcally for this
purpose. Moreover, two existing methods, the k-tail and RNet (based on neural
networks) ones, were adopted for the process discovery technique.
    Process discovery along with conformance checking and process enhancement
form the basis of process mining [3], which deals with various types of process
models including Petri nets, transition systems, fuzzy maps, C-nets, BPMN and
others. In the context of process mining, transition systems are considered both
as a self-independent model and an intermediate model for building another




                                        79
type of model on its basis. In the latter case, one should mention region-based
approaches discussed in [5,10,8,15].
    The leading role of a transition system as an intermediate representation of
a process is discussed in [2]. The paper considers a number of diﬀerent strategies
to construct a transition system that is more suitable to be a base for a resulting
ﬁnal Petri net model with respect to desirable metrics. Nevertheless, all discussed
strategies are based on inferring algorithms with a ﬁxed window.


3    Preliminaries
This section introduces basic concepts related to event logs, transition systems
and some other notations that are needed for explaining the approach.
    P(A) denotes the power set of A — the set of all subsets of A. For a given
set A, A∗ is the set of all ﬁnite sequences over A.

Deﬁnition 1 (Event Trace, Event Log). Let A be a set of activities. The
(event) trace is a sequence σ = a1 , a2 , ..., ai , ..., an  ∈ A∗ . By σ(i) = ai we
denote i-th element (event) of the trace. The [i, k]-subtrace of trace σ ended at
i-th event ai is deﬁned as
                          ⎧
                          ⎪
                          ⎨σ(1), σ(2), ..., σ(i),        if k > i;
                 σ[i, k] = σ(i − k + 1), ..., σ(i), if 1 ≤ k ≤ i;                (1)
                          ⎪
                          ⎩
                            ,                            if k = 0.

    The complete subtrace of the trace σ ended at i-th event ai is σ[i] = σ[i, i].
By |σ| we denote a trace length. For k ≤ i, k denotes the length of the subtrace.
L ∈ P(A∗ )is an event log and |L| is a log size that is equal to a number of all
traces.

    We assume, that event logs do not contain process states explicitly. This
way, we need to deduce the desirable states from an event log based on some
approach.
    In [13], four approaches to determine the state in a log were proposed. They
are past, future, past and future and explicit knowledge. In this paper we consider
only past approach, according to which a state is constructed based on the preﬁx
of a trace. Then, the order of activities is important. Hence, we apply sequence
policy [13] for determining a state.

Deﬁnition 2. A labeled transition system is a tuple TS = (S, E, T, s0 , AS),
where S is a state space, E is a set of labels, T ⊆ S × E × S is set of transitions,
s0 ∈ S is an initial state, and AS ⊆ S is a set of accepting (ﬁnal) states. We
denote the set of output ( input) transitions of a state s ∈ S as s• = {t =
(s, e, s ) ∈ T | e ∈ E, s ∈ S} (•s = {t = (s , e, s) ∈ T | e ∈ E, s ∈ S}).

Deﬁnition 3 (k-window transition system). Let L ∈ P(A∗ ) be a log over set
of activities A and let k ∈ N be a natural number called window size. TS(L, k) =




                                         80
(S, E, T, s0 , AS) is a k-window (labeled) transition system built for log L and
window size k, where S = {s0 } ∪ {σ[i, k] | σ ∈ L, 1 ≤ i ≤ |σ|, k ≤ i} T =
{(s0 , σ(1), σ[1, k]) | σ ∈ L} ∪ {(σ[i − 1, k], σ(i), σ[i, k]) | σ ∈ L, 2 ≤ i ≤ |σ|}, AS =
{s ∈ S | s = σ[|σ|, k], σ ∈ L}, and E = A.
Deﬁnition 4 (Full transition system). Let L ∈ P(A∗ ) be a log over set A of
activities. The full transition system TS(L) = (S, E, T, s0 , AS) for the log L is
a labeled transition system built for log L, where S = {s0 } ∪ {σ[i] | σ ∈ L, 1 ≤
i ≤ |σ|}, T = {(s0 , σ(1), σ[1]) | σ ∈ L} ∪ {(σ[i − 1], σ(i), σ[i]) | σ ∈ L, 2 ≤ i ≤ |σ|},
AS = {s ∈ S | s = σ[|σ|], σ ∈ L}, and E = A.
Deﬁnition 5. Let TS(L) = (S, E, T, s0 , AS) be a transition system over a set
of activities E = A, and let σ = a1 , ..., an  be a trace over A and n = |σ|.
We say that trace σ can be replayed in transition system TS(L) if there is a se-
quence of states s0 , ..., sn  such that ∃t1 = (s0 , a1 , s1 ), t2 = (s1 , a2 , s2 ), ..., tn =
                                                                                               a1
(sn−1 , an , sn ) where s0 , s1 , ..., sn ∈ S, t1 , t2 , ..., tn ∈ T . We denote this as s0 −→
     a2       an
s1 −→ ... −−→ sn . We say that trace σ can be partially replayed by its pre-
                                                                    a1      a2      ak
ﬁx in a transition system TS if ∃k < n : ∃s0 −→                         s1 −→  ... −→    sk and
       ak+1          ak+2           an
sk −−−→ sk+1 −−−→ ... −−→ sn . We denote σ + (TS) = a0 , a1 , ..., ak  and
σ − (TS) = ak+1 , ..., an . Hence, σ(TS) = σ + (TS) + σ − (TS) where + denotes
concatenation of two sequences.
    To measure the quality of resulting models, we consider three quality met-
rics. We based them primarily on the work [7] and adopted them for transition
systems. Fitness quantiﬁes the extent to which a transition system can repro-
duce traces recorded in a log. Simplicity quantiﬁes the complexity of a model.
Simplicity is measured by comparing the size of a given transition system TS(L)
with the simplest possible transition system, which is the ﬂower model (Fig. 1a).
Precision compares transition system TS(L) with the full transition system built
for log L, considering the latter to be the most precise.
Deﬁnition 6 (Metrics). Let L be an event log and let TS(L) = (S, E, T, s0 , AS)
be a transition system built for L. Fitness is deﬁned to be the ratio of the number
of traces from log L that can be fully replayed in transition system TS(L) to the
total number of all traces. Log L perfectly ﬁts transition system TS(L) iﬀ all
traces of L can be fully replayed in TS(L).
    Simplicity of TS(L) is:
                                                      |E| + 1
                                Simpl(TS(L)) =                  .
                                                     |T | + |S|
    Precision of TS(L) is:

                  1                                   |s • | − |s•i |
                                                     N oV (s)
                                             1
   Prec(TS(L)) =     · Prec(s), Prec(s) =          ·                    ,
                 |S|                      N oV (s)     i=1
                                                              |s • |
                            s∈S

where Prec(s) is a partial precision for state s, N oV (s) is a number of all visits
of state s during a replay of the full transition system. s•i is a set of penalized




                                              81
                                                                                   a/8

                                                                                 ‫ۃ‬a‫ۄ‬

                                                                                    b/8

                                                                                ‫ۃ‬a, b‫ۄ‬
                                                                     c/4                    d/4


                                                                ‫ۃ‬a, b, c‫ۄ‬                  ‫ۃ‬a, b, d‫ۄ‬                                                                  a/8
                                                                                   g/1                     e/2
                                                              d/4
                                                                                                                                                                            ‫ۃ‬a‫ۄ‬

           a/8                                       ‫ۃ‬a, b, c, d‫ۄ‬           ‫ۃ‬a, b, d, g‫ۄ‬                ‫ۃ‬a, b, d, e‫ۄ‬                                         ‫ۃ‬с‫ۄ‬              b/8
                                                                                                                                                                      c/4
                   b/8                         e/2                   f/2
                                                                                                  f/1                  g/1
 g/4                                                                                                                                     ‫ۃ‬f‫ۄ‬
                                                                                                                                                              d/4           ‫ۃ‬b‫ۄ‬
                                                                                            ‫ۃ‬a, b, d, e, f‫ۄ‬      ‫ۃ‬a, b, d, e, g‫ۄ‬                       f/2          d/4
                     c/4                   ‫ۃ‬a, b, c, d, e‫ۄ‬     ‫ۃ‬a, b, c, d, f‫ۄ‬
                                                                                                                                               e/1
f/4                                  f/1              g/1           e/1           g/1                                              f/2                       ‫ۃ‬d‫ۄ‬      g/1
                                                                                                                                                     e/4
                             ‫ۃ‬a, b, c,           ‫ۃ‬a, b, c,          ‫ۃ‬a, b, c,            ‫ۃ‬a, b, c,
                   d/8        d, e, f‫ۄ‬            d, e, g‫ۄ‬           d, f, e‫ۄ‬             d, f, g‫ۄ‬
                                                                                                                                         ‫ۃ‬e‫ۄ‬
                                                                                                                                                             g/2
                                                                                                                                                                            ‫ۃ‬g‫ۄ‬
             e/5

           (a)                                                            (b)                                                                                (c)

Figure 1: (a) “Flower” model for log L1 ; (b) TS1 (L1 ) built for log L1 ; (c) tran-
sition system built for log L1 with a ﬁxed window of size 1


output transitions of state s, i.e., transitions that do not have active counterparts
as compared to the precise (full TS) model.


4        Algorithm Description

For the clariﬁcation of the approach, the following motivating example is con-
sidered. Let L1 be an event log that is deﬁned as follows:

      L1 = {a, b, c, d, e, f , a, b, c, d, e, g, a, b, c, d, f, e, a, b, c, d, f, g,
                                                             a, b, d, a, b, d, g, a, b, d, e, f , a, b, d, e, g}                                                          (2)

    In addition to the previously discussed ﬂower model, one can build a number
of other models that perfectly ﬁt log L1 . A model built with unlimited window
size is depicted on Figure 1b. This model is a full transition system by the
deﬁnition. Another model built by an algorithm with a ﬁxed window of size 1
is depicted on Figure 1c. Although all these models perfectly ﬁt log L1 , none of
them is satisfactory in simplicity and precision at the same time. Thus, we are
interested in a trade oﬀ between these metrics.
    The proposed approach incorporates a 3-steps algorithm sequentially building
3 transition systems. The ﬁrst transition system, TS1 , is built from an event log.
The second (TS2 ) and the third (TS3 ) transition systems are built from TS1 and
TS2 , respectively. Finally, TS3 is considered as a desirable result.
    The main point of the proposed approach is dynamic variation of the win-
dow used for deducing states. For this very purpose, two adjustable parameters
are involved into the approach. The ﬁrst one, Threshold, aﬀects the size of the




                                                                                   82
intermediate transition system (TS2 ). The second one, Vwsc, is a linear factor
used for the dynamic calculation of a variable window size while building the
resulting model (TS3 ). Each step of the algorithm along with both parameters
is thoroughly discussed in the following sections.


4.1   Constructing a Full Transition System (Step 1)

The ﬁrst step of the approach is to construct a full transition system and deﬁne
a special labeling function mapping every transition to a natural number that
determines its frequency characteristic.

Deﬁnition 7 (Frequency characteristic). Let L ∈ P(A∗ )be a log over set
of activities A and let TS(L) = (S, E, T, s0 , AS) be a full transition system for
the L. A frequency characteristic of TS(L) is a function f : T → N deﬁned for
t = (σ[j − 1], σ(j), σ[j]) as f (t) = |L |, where L ⊆ L is the maximum subset of
L, and ∀σ  ∈ L ∃l ∈ N, l > 0 : σ  [l − 1] = σ[j − 1], σ  (l) = σ(j), σ  [l] = σ[j].
    Note that for l = 1, σ(0) =  = s0 by (1).

    Frequency characteristic determines for every transition
t a number of traces in log L that start with preﬁxes σ[j].                   a/8

The entire procedure of building a full transition system is               ‫ۃ‬a‫ۄ‬
presented in Algorithm 1.                                                     b/8
    We denote a full transition system for a given log L as               ‫ۃ‬a, b‫ۄ‬
TS1 (L) = TS(L). TS1 (L1 ) built for log L1 with function f        c/4             d/4

is depicted on Figure 1b; it is a tree by the construction.
                                                                ‫ۃ‬a, b, c‫ۄ‬         ‫ۃ‬a, b, d‫ۄ‬
It is easy to see that ﬁtness of the full transition system
                                                               d/4
(TS1 (L)) is perfect (equals 1). This is inherent in the al-
gorithm since it builds for each trace in a log a full chain ‫ۃ‬a, b, c, d‫ۄ‬
of states following one after another that corresponds to a
sequence of events in the trace.                             Figure 2: TS2 (L1 )
                                                             (condensed) built
                                                             from TS1 (L1 ) with
4.2 Constructing a Condensed Transition System Threshold = 0.33
(Step 2)

The second step of our approach involves cutting some branches of the full
transition system with frequency values less than a cutting threshold parameter;
we refer to it as f1 . Having f1 set, we can exclude from a model all the states
and transitions that correspond to behavior in the event log which is rarely
observed. This results in simplifying the tree structure and reduction of the
number of states and transitions.

Deﬁnition 8 (Condensed Transition System). Let TS1 (L) = (S1 , E1 , T1 ,
s0 , AS1 ) be a full transition system constructed for log L and let f be a frequency
characteristic. The Threshold is a real number from [0; 1] determining a cutting
threshold f1 as follows: f1 = round(|L| · Threshold) − 1. The value f1 + 1 =
round(|L| · Threshold) is a minimum preserved frequency for TS1 (L).




                                           83
  Algorithm 1: Building a full tran-              Algorithm 2: Building a con-
  sition system for a given log L                 densed transition system TS2 (L)
      Input : an event log L                       Input : an event log L;
      Output : a full transition system;           a full transition system
      TS1 (L) = (S, E, T, s0 , AS); f is a         TS1 (L) = (S1 , E1 , T1 , s0 , AS1 ); f
      frequency characteristic;                    is a frequency characteristic;
      begin                                        T hreshold is a real number
          S ← {s0 };                               determining a cutting threshold
          for σ ∈ L do                             and a minimum preserved
              s ← s0 ;                             frequency;
              for i ← 1 to |σ| do                  Output : TS2 (L) =
                  s ← σ[i];                                   (S2 , E2 , T2 , s0 , AS2 ) is a
                  t ← (s, σ(i), s );                          condensed transition
                  if t ∈
                       / T then                                system;
                       T ← T ∪ {t};                begin
                       f (t) ← 1;                      f1 = round(|L|·T hreshold)−1;
                 else                                  for t ∈ T1 do
                     f (t) ← f (t) + 1;                     if f (t) > f1 then
                                                                 T2 ← T2 ∪ {t};
                 S ← S ∪ {s };
                 E ← E ∪ {σ(i)};                       S2 ← {s0 };
                 if i = |σ| then                       for t = (s, a, s ) ∈ T2 do
                      AS ← AS ∪ {s};                       S2 ← S2 ∪ {s };
                 s ← s ;                              E2 ← E1 ;



    A condensed transition system TS2 (L) built for TS1 (L) with function f and a
given cutting threshold f1 is a transition system TS2 (L) = (S2 , E2 , T2 , s0 , AS2 ),
where S2 ⊆ S1 , E2 = E1 ,T2 ⊆ T1 , AS2 ⊆ AS1 and T2 = {t | t ∈ T1 & f (t) >
f1 }, S2 = {s0 } ∪ {s | s ∈ S1 & ∃t = (s , a, s) ∈ T2 }, AS2 = AS1 ∩ S2 .

    Note that the frequencies of transitions diminish on the way to the leaves of
TS1 (L) and TS2 (L). Hence, the exclusion of transition tk from a TS1 (L) implies
the exclusion of a total subtree that has state sk as a root. Thus, TS2 (L) obtained
as a result of cutting with a given threshold, cannot be disconnected.
    The entire procedure of construction a condensed transition system is pre-
sented in Algorithm 2. For log L1 with the size |L| = 8 and Threshold = 0.33, we
have f1 = 2. A TS2 (L) built for the log L1 and f1 = 2 is depicted in Figure 2.
It is easy to see that not all the traces from the log can be replayed on TS2 (L)
as its ﬁtness is not perfect. Therefore, we cannot consider this model as a ﬁnal
result.
4.3     Constructing a Reduced Transition System (Step 3)
In this section, we propose an approach to convert TS2 (L) to a model with
perfect ﬁtness and a size that it less than the size of TS1 (L).
   Our proposal is to construct a new transition system TS3 (L) based on TS2 (L)
by adding missing states and transitions in order to fully replay all the traces.




                                             84
                                                                                                                                                                            a/8
                                     a/8                                                          a/8
                                                                                                                                                                      ‫ۃ‬a‫ۄ‬
                               ‫ۃ‬a‫ۄ‬                                                          ‫ۃ‬a‫ۄ‬                                                                             b/8
                                     b/8                                                          b/8                                                                ‫ۃ‬a, b‫ۄ‬
                                                                                                                                                         c/4                      d/4
                              ‫ۃ‬a, b‫ۄ‬                                                      ‫ۃ‬a, b‫ۄ‬
                c/4                        d/4                                 c/4                      d/4
                                                                                                         /                                      ‫ۃ‬a, b, c‫ۄ‬                           ‫ۃ‬a, b, d‫ۄ‬

                                                                                                                                               d/4                            g/1
                                                                       ‫ۃ‬a, b, c‫ۄ‬                            ‫ۃۃ‬a,    d‫ۄۄ‬
                                                                                                              a, b, d                                                                           e/2
        ‫ۃ‬a, b, c‫ۄ‬                             ‫ۃ‬a, b, d‫ۄ‬
                                                          ‫ݐ‬ෝସ         d/4                          g /1
                                                                                                   g/1                                         ‫ۃ‬a, b, c, d‫ۄ‬                   ‫ۃ‬g‫ۄ‬
        d/4                             ‫ݐ‬ෝଷ                                                                               e/2            e/2
                                                                                                                                                              f/2

                                                                      ‫ۃ‬a, b, c, d‫ۄۄ‬                 ‫ۃۃ‬g‫ۄ‬
                                                                                                      g‫ۄۄ‬                                                              g/1
        ‫ۃ‬a, b, c, d‫ۄ‬                                                                  f/2
                                                                                      f/
                                                                                       /2                                                 ‫ۃ‬d, e‫ۄ‬         ‫ۃ‬d, f‫ۄ‬
  ‫ݐ‬ෝଵ                  ‫ݐ‬ෝଶ             ‫ݏ‬ෝଷ            ‫ݏ‬ෝସ       e/2
                                                                 /                                                               f/2               g/2              e/1

         ‫ݏ‬ෝଵ                 ‫ݏ‬ෝଶ                                ‫ۃۃ‬d,
                                                                 ‫ۃ‬d
                                                                  d, e‫ۄۄ‬        ‫ۃۃ‬d, f‫ۄ‬
                                                                                  d f‫ۄ‬                                          ‫ۃ‬e, f‫ۄ‬         ‫ۃ‬e, g‫ۄ‬           ‫ۃ‬e‫ۄ‬



                              (a)                                                         (b)                                                                 (c)

Figure 3: (a) TS3 under a restoring algorithm: temporary states and transitions
after the ﬁrst stage; (b) restored states and transitions after the second stage;
(c) TS3 (L1 ) (reduced) built from TS2 (L1 ) with Threshold = 0.33 for log L1


Unlike building the full transition system, in this case we use partial subtrace
σ[i, k] for representing newly added states of TS3 (L). The important point here
is that parameter k is proportional to the frequency of a corresponding input
transition.
    The algorithm implementing the proposed approach includes a few steps per-
formed iteratively. Its main part is represented in Algorithm 5. In the beginning,
we create a copy of TS2 (L), which is denoted as TS3 (L).Then, we try to replay
all the traces σ from event log L until there is no trace that cannot be fully re-
played in the transition system under construction. Along with replay, we build
on additional elements of TS3 (L) such that an increasing number of traces can
be replayed.
    Coming back to the example with log L1 , we consider transition system
TS3 (L1 ) copied from TS2 (L1 ) depicted in Figure 2. We try to replay log L1 on
it and perform its transformation. Let σ = a, b, c, d, e, f . The only preﬁx of σ
that can be successfully replayed is σ + (TS3 (L1 )) = a, b, c, d. Correspondingly,
the unreplayable suﬃx of σ is σ − (TS3 (L1 )) = e, f .
    Algorithm 3 replays a single trace σ and also gets as its input transition sys-
tem TS3 (L) and a frequency characteristic f (discussed above). Moreover, it uses
a special function ξ, which maps every trace σ onto number j that determines
element σ(j) splitting σ into σ + and σ − , σ(j) ∈ σ − , and set T T of temporary
transitions. The algorithm starts with initial state s0 as a current state s and the
ﬁrst element σ(1) of a trace as a current element σ(i). Then it tries to ﬁnd an
appropriate transition t starting with current state s and marked by symbol σ(i).
    We use an example in Figure 3 to illustrate the proposed approach. Next,
there are three possible cases. In the ﬁrst one (for instance, during the replay
σ = a, b, d), t = (s, σ(i), s ) exists with some s ∈ S2 and it is a regular one
(t ∈/ T T ). In this case current state s is changed to s and the next element




                                                                                          85
σ(i + 1) is processed. In the second case (if σ = a, b, d, g), t does not exist. For
that case a new temporary transition t = (s, σ(i), s) is added to both the set
of transitions and the set of temporary transitions. The end state s is a special
temporary state too. It is not marked by any substring and is unique for any
temporary transition. For newly added transition t frequency characteristic f is
deﬁned to be equal to 1; It means transition t “ﬁred” only once. In the third
case, t exists and it is a temporary one (t ∈ T T ). This is the case when transition
t “ﬁres” one more time; hence, its frequency should be increased by one.
     In both the second and the third cases replaying of the current trace σ is
broken and ξ(σ) is set to the position of the ﬁrst unreplayable element. This is
the reason why the temporary state s cannot still be marked with any subtrace
and, consequently, one cannot guarantee that s will be preserved in TS3 (L).
     This way, at each iteration Algorithm 5 tries to replay as many traces from
the log as possible. For each state the ﬁrst unreplayable element is determined
and a new temporary transition for the element along with a temporary state
are built. TS3 (L1 ) with temporary transitions and states marked by symbols e,
f , g and e are depicted in Figure 3a. Note, the total frequency for all temporary
transitions is equal to a number of traces that were not able to be replayed at
the ﬁrst iteration.
     Trace σ5 = a, b, d from log L1 is an example of a trace that can be replayed
at the very ﬁrst iteration. Once all the traces from the log can be replayed, the
reconstruction of TS3 (L) is successfully ended.
     As long as there is at least one temporary transition/state in TS3 (L), it has
to be converted to a regular one. It is done by Algorithm 4, which enumerates all
uncompleted traces. For each such trace σ, the last regular state s, temporary
transition t and temporary state s are obtained; they correspond to the ﬁrst
unplayable element σ(ξ(σ)). Then state s is converted to a regular state by
being marked with subtrace σ[ξ(σ), m] of trace σ ended by element σ(ξ(σ))
with length m that is proportional to the frequency of temporary transition
t. Furthermore, the dedicated state s0ws is used for the case of zero window
size, which accumulates all rare behavior. This results in the shortest subtrace
marking such a state than its counterpart in the full transition system. Note
that temporary states and transitions are always converted to regular ones.
     The latter results in a probability of coinciding numbers of previously dis-
tinguishable states. In such a case, all matching states are merged and the total
number of states and transitions in the resulting transition system is decreased.
Figure 3b shows how two states marked with subtrace d, e are merged to one
state.
     After the algorithm is ﬁnished, no temporary transitions and states are
present in TS3 (L) anymore. Moreover, all previously unreplayable elements in
the traces can now be replayed in TS3 (L) as new states for them have been
established. At the end of algorithm’s iteration each trace from the log can be
replayed at least by one more element than before the iteration. Since the length
of each trace is ﬁnite, the number of algorithm’s iterations is also ﬁnite. Thus,
the algorithm ﬁnally stops. The resulting TS3 (L1 ) is depicted in Figure 3c.




                                         86
Algorithm       3: Function                    Algorithm 4: Procedure RestateTS
ReplayTrace of the algo-                       of the algorithm of building a reduced
rithm of building a reduced                    transition system TS3 (L)
transition system TS3 (L)                       Input : log L;
 Input : trace σ;                               reduced transition system
 reduced transition system                      TS3 (L) = (S3 , E3 , T3 , s0 , AS3 ); frequency
 TS3 (L) = (S3 , E3 , T3 , s0 , AS3 );          characteristic f ; set of completely
 frequency characteristic f ; set               replayed traces CompleteT races ⊆ L;
 of completely replayed traces                  function ξ mapping each trace to a
 CompleteT races ⊆ L; function                  number of the ﬁrst unreplayable symbol;
 ξ mapping each trace to a                      multiplicative factor for ﬁxed window size
 number of the ﬁrst                             V wsc ∈ R; set of temporary transitions
 unreplayable symbol ; set of                   TT;
 temporary transitions T T ;                    /* Converts temporaries             */
 Output : true, if a trace                      Procedure RestateTS(L, TS3 (L), f ,
           is replayed                          CompleteT races, ξ, V wsc, T T )
 completely, f alse otherwise                      for σ ∈ L do
 Function ReplayTrace(σ,                               i ← ξ(σ);
 TS3 (L), f , CompleteT races,                         /* If the trace is already
 ξ, T T ): Boolean                                     complete                     */
     /* Already completed */                           if i = |σ| + 1 then
     if σ ∈ CompleteT races                                 return;
     then                                               s ← σ[i − 1];
          return true;                                  t ← (s, σ(i), s);
     s ← s0 ;                                           if t ∈
                                                             / T T then
     for i ← 1 to |σ| do                                     return;
         if ∃s : t =                                   maxW ndSize ← max(|σ|);
         (s, σ(i), s ) ∈ T3 then                                             σ

              if s = s then                           wndSize ← round(maxW ndSize ·
                  f (t) ← f (t) + 1;                    f (t) · V wsc ÷ |L|);
                  ξ(σ) = i;                             /* a special ‘trash’ state */
                  return f alse;                        if wndSize = 0 then
                                                             s ← s0ws ;
              s ← s ;
                                                        else
          else                                             s ← σ[i, wndSize];
              S3 ← S 3      {
                             s};
              t ← (s, σ(i),                            t ← (s, σ(i), s );
                          );
                            s
                                                        /* Replace the temporary
              T3 ← T3     {t};
              TT ← TT         {t};                      transition and the state by
              f (t) = 1;                                regular ones                */
              ξ(σ) = i;                                 S3 ← S3 \ {   s} ∪ {s };
              return f alse;                            T3 ← T3 \ {t} ∪ {t };
                                                        T T ← T T \ {t};
     /* Trace’s complete             */                 f (t ) ← f (t);
     ξ(σ) = |σ| + 1;                                    if i = |σ| then
     return true;                                             AS3 ← AS3 ∪ {s };

                                                    return;




                                          87
 Algorithm 5: Building a reduced transition system TS3 (L)
   Input : an event log L;
   a condensed transition system TS2 (L) = (S2 , E2 , T2 , s0 , AS2 ); a frequency
   characteristic f ; a multiplicative factor V wsc ∈ R for ﬁxed window size;
   Output : a reduced transition system TS3 (L) = (S3 , E3 , T3 , s0 , AS3 );
   Data: a set of completely replayed traces CompleteT races ⊆ L;
   a function mapping each trace to a number of ﬁrst unreplayable symbol ξ;
   a set of temporary transitions T T ;
   /* Main part of the algorithm                                                */
   begin
      TS3 (L) ← TS2 (L);
      repeat
          unreplayableTraces ← f alse;
          for σ ∈ L do
              if ReplayTrace(σ, TS3 (L), f , CompleteT races, ξ, T T ) = f alse
              then
                  unreplayableTraces ← true;

           if unreplayableTraces = true then
               RestateTS (L, TS3 (L), f , CompleteT races, ξ, V wsc, T T );
       until unreplayableTraces = f alse;



    In Algorithm 5 Vwsc is an additional parameter used to combine a varying
window size approach with a classical ﬁxed window size approach. It is a real
value from [0, 1] determining a maximum size of a state window during the
reconstruction phase of TS3 (L).
    Finally, considering ﬁtness of reduced transition system TS3 (L) one can pos-
tulate the following proposition.

Theorem 1. Let L be a log and let TS3 (L) be a reduced transition system based
on condensed transition system TS2 (L). Then, TS3 (L) perfectly ﬁts L.

Proof. Let σ = a1 , a2 , ..., an  ∈ A∗ be a trace of log L and σ = σ + (TS2 (L)) +
σ − (TS2 (L)), where σ + (TS2 (L)) is a trace preﬁx that can be “replayed” on
TS2 (L) and σ − (TS2 (L)) is a trace suﬃx that cannot be “replayed” on TS2 (L).
σ + (TS2 (L)) can be “replayed” on TS3 (L) by the construction. Now we need to
prove that the entire sequence σ can be “replayed” on TS3 (L). We will prove that
iteratively for sequences σ(1), ..., σ(i), where i varies from |σ + (TS2 (L))| to |σ|.
Basis of induction: the proposition valid for i = |σ + (TS2 (L))|, since σ + (TS2 (L))
can be “replayed” on TS3 (L). Step of induction: the trace σ(1), ..., σ(i) can
be “replayed”. Now let us prove that the trace σ(1), ..., σ(i + 1) can be “re-
played” as well. According to Algorithms 3 and 4, we “replay” σ(1), ..., σ(i)
and add a new state s (if it has not been added previously) and a new edge
t = (s, σ(i + 1), s) correspondingly. Thus, trace σ(1), ..., σ(i + 1) now can be
replayed, and that proves the step of induction.     




                                            88
                                                                                                                             a/2                  b/2
        a/2
                     b/2                                                                                            ‫ۃ‬a‫ۄ‬                                   ‫ۃ‬b‫ۄ‬

      ‫ۃ‬a‫ۄ‬                                                                                                             b/2                        c/1             d/1

             b/2    ‫ۃ‬b‫ۄ‬
                                                   a                              b2                               ‫ۃ‬a, b‫ۄ‬                     ‫ۃ‬b, c‫ۄ‬            ‫ۃ‬b, d‫ۄ‬
                           d/2
        c/2                                                                                              c/1                  d/1                  d/1               c/1
                   d/1
                                                   b1                   c2              d3
       ‫ۃ‬c‫ۄ‬                       ‫ۃ‬d‫ۄ‬                                                                   ‫ۃ‬a, b, c‫ۄ‬            ‫ۃ‬a, b, d‫ۄ‬         ‫ۃ‬b, c, d‫ۄ‬         ‫ۃ‬b, d, c‫ۄ‬

                   c/1                   c1             d1                   d2              c3


                                              d4             c4                    c5             d5



                   (a)                                            (b)                                                                   (c)

Figure 4: Transition systems built for log L2 : (a) 1-window size (ﬁxed) TSf (L2 );
(b) unfolding graph obtained after TSf (L2 ) simulated TS1 (L2 ) ; (c) full TS1 (L2 )


5       Evaluation and Discussion
In this section we evaluate the proposed approach on real-life event logs. In the
beginning of the section, an algorithm for precision calculation is introduced.

5.1          Metrics Calculation
Calculation of metrics for all three transition systems is performed throughout
their building.
    As we have shown above, ﬁtness of TS1 (L) and TS3 (L) is perfect. Further,
simplicity of a model is easily calculated on the basis of the number of model’s
elements.
    We propose an algorithm calculating precision metrics for a given transition
system TS(L) based on an idea of simulation [12]. The algorithm assumes that
TS(L) perfectly ﬁts log L. Suppose that TS(L) can simulate TS1 (L). All extra
behavior observed for TS(L) is penalized. Finally, a normalized total penalty
forms the basis of a precision value.
    The approach for calculation of precision is described in Algorithm 6. The
algorithm consists of two main steps. First, the algorithm iteratively calculates
so-called partial precisions for every state in TS(L) (Algorithm 9). Second, the
algorithm sums partial precisions (Algorithm 7) and calculates the average over
the number of states.
    To get an illustration of this idea, consider a following log:
                                       L2 = {a, b, c, a, b, d, b, c, d, b, d, c}
    Full transition system TS1 (L2 ) for the log is depicted in Figure 4c. It is
considered to be the most precise reference model. Transition system TSf (L2 )
in Figure 4a is built with a ﬁxed window size (equal to 1). As a more general
model than TS1 (L2 ) it allows more behavior than the reference model. All extra
behavior is counted as penalty and inﬂuence precision.
    Precision calculation invokes simulation routine implemented as a recur-
sive procedure (Algorithm 8). For the example above, TS(L) = TSf (L2 ) and




                                                                             89
 Algorithm 6: Calculating precision               Algorithm     7:   Function
 for transition system TS(L)                      SumPartialPrecisions    cal-
   Input     : general (perfectly ﬁt)             culating total precision of
               transition system                  TS(L)
   TS(L) = (S, E, T, s0 , AS) built for log         Input     : general (perfectly ﬁt)
   L; reference full transition system                          transition system
   TS1 (L) = (S1 , E1 , T1 , s01 , AS1 );           TS(L) = (S, E, T, s0 , AS) built
   Output : value of precision                      for log L; function η mapping
               P rec(TS(L)) for TS(L);              each state of TS(L) to a real
   Data: partial function η mapping each            number determining state’s
   state of TS(L) to a real number                  “partial precision”;
   determining the state’s “partial                 Output : value of precision
   precision”; partial function θ mapping                       P rec(TS(L)) for
   each state s ∈ TS(L) to a natural                            TS(L);
   number determining how many times
                                                    Function
   state s has been visited;
                                                    SumPartialPrecisions(TS(L),
   /* Main part of the algorithm */                 η): Real
   begin                                                sum ← 0;
      CalcStatePrecision (s0 , s01 ,                    for s ∈ S do
      TS(L), TS1 (L), η, θ);                                sum ← sum + η(s);
      P rec(TS(L)) ←SumPartialPrecisions
      (TS(L), η);                                       res ← sum/|S|;
                                                        return res;




TS1 (L) = TS1 (L2 ). Initially s = s0 ∈ TS(L) and s1 = s01 ∈ TS1 (l) are passed
as parameters to the procedure. For each output transition of a current state
s, the algorithm tries to ﬁnd a transition labeled with the same event among
output transitions of a current state s1 . It is possible to have not more than one
transition labeled with the same event, since all transition systems considered
in the paper are deterministic by the construction.
    For example, consider transition (s0 , a, a) in TSf (L2 ). It has a matching
transition (s0 , a, a) in TS1 (L2 ). The procedure is recursively called with s = a
and s1 = a. This step also produces an input edge to vertex a in an unfold-
ing graph depicted on Figure 4b. This step has to be repeated for transitions
(a), b, b) and (a), b, a, b) (obtains b1 in the unfolding graph) and transitions
(b), c, c) and (a, b), c, a, b, c) (obtains c1 in the unfolding graph). Here, c
in TSf (L2 ) is an accepting state; it contains a virtual output transition (depicted
as a gray dashed arrow) to a virtual ﬁnal state. Similarly, a, b, c from TS1 (L2 )
also has a virtual ﬁnal transition, which maintains balance (depicted as a dashed
output edge in the unfolding graph).
    Together with that, c has transition (c), d, d) to state d, which has
no counterpart in TS1 (L2 ) (edge (c1 , d4 ) in the graph). The algorithm penalizes
this extra transition and calculates partial precision for state c as a diﬀer-
ence between a number of output transitions and a number of penalized output
transitions divided by a total number of output transitions.




                                          90
Algorithm 8: Procedure CalcStatePrecision calculating “partial preci-
sion for entire states” as function η
 Input : current state s of TS(L);
 current state s1 of TS1 (L); general (perfectly ﬁt) transition system
 TS(L) = (S, E, T, s0 , AS) built for log L; reference full transition system
 TS1 (L) = (S1 , E1 , T1 , s01 , AS1 ); partial function η mapping each state of TS(L)
 to a real number determining the state’s “partial precision”; partial function θ
 mapping each state s ∈ TS(L) to a natural number determining how many
 times state s has been visited;
 Procedure CalcStatePrecision(s, s1 , TS(L), TS1 (L), η, θ)
    pen ← 0;
    for t = (s, a, s ) ∈ s• do
        /* If no matching trans.                                                    */
        if ∃t1 = (s1 , a, s1 ) ∈ TS1 (L) then
            CalcStatePrecision (s , s1 , TS(L), TS1 (L), η);
         else
             pen ← pen + 1;

     /* Number of output trans-s                                                    */
     otn ← |s • |;
     if s ∈ AS then
         otn ← otn + 1
         if s1 ∈
               / AS1 then
             pen ← pen + 1;

     if otn = 0 then
         partP artP rec = (otn − pen)/otn;
         RecalcStatePrecision (s, partP artP rec);


Algorithm 9: Procedure RecalcStatePrecision reﬁnes the value of a
state’s “partial precision”
 Input : state s ∈ S of TS(L) = (S, E, T, s0 , AS);
 function η mapping each state of TS(L) to a real number determining the
 state’s “partial precision”; partial function θ mapping each state s ∈ S to a
 natural number determining how many times state s has been visited; new
 state’s partial precision pprec for reﬁning;
 Procedure RecalcStatePrecision(s, η, θ, pprec)
    /* If either η or θ is not defined for this state                               */
    if η(s) is not deﬁned then
        η(s) ← 0;
     if θ(s) is not deﬁned then
         θ(s) ← 0;
     stP rec ← η(s) · θ(s);
     θ(s) ← θ(s) + 1;
     η(s) ← (stP rec + pprec)/θ(s);




                                     91
    Since state a, b, c of TS1 (L2 ) does not allow any further moves, the algo-
rithm leaves the current iteration of the procedure and, thereby, returns to a
higher level, to state b of TSf (L2 ) and a, b of TS1 (L2 ). Then, the algorithm
repeats the same steps for all unvisited output transitions.
    During its work, the algorithm can normally visit some states of a more
general model (TSf (L2 ) in the example) more than once. For each such visit a
value of partial precision for the state is recalculated (Algorithm 9).
    Finally, after all states of the reference model have been visited during the
simulation, values of ultimate partial precision for each state of TS(L) are repre-
sented by η. The last step is to calculate an average value (Algorithm 7), which
is the required value of the model’s precision.

5.2   Implementation Details
To evaluate the proposed approach, we have developed a number of routines for
the ProM toolkit [16]. The routines are implemented as plug-ins with several
entry points intended for diﬀerent sets of input parameters.
    Build and reduce transition systems (xi) plug-in combines routines for
building TS1 (L), TS2 (L), TS3 (L) for a given input log L, along with calculation
metrics for each transition system built. In the simplest case, the plug-in obtains
at its input only event log L and provides ability to conﬁgure the settings as
follows. (1) Specify a maximum window size for building TS1 (L). Default size is
unlimited (that is set at a value of -1). By setting the size to a natural number,
one can make algorithm act with a ﬁxed window. We use this option for building
reference models with ﬁxed windows. (2) Specify a value of Threshold parameter
used for building TS2 (L). (3) Specify a value of multiplicative factor Vwsc used
when building TS3 (L).
    Once successfully ﬁnished, the plug-in produces at its output three transition
systems, and, what is the most important for analyzing the results, a hierarchical
report. The report represents a set of characteristics organized in a tree structure.
They include metrics for each built transition system and additional attributes
calculated during the algorithm’s operation. We created a special “view” plug-
in for browsing such reports, which allows to export information as a JSON-
structure or HTML-formatted text.
    The both plug-ins are openly available to download on http://pais.hse.ru.

5.3   Experiments and Discussion
We evaluated our approach on a set of event logs, both artiﬁcial and real-life.
In the following sections we consider “BPI challenge” logs [1] L3 (11 traces, 89
activities) and L4 (251 traces, 247 activities). Full transition systems built for
the logs have the following frequency characteristics:
 – TS1 (L3): Maximum Window Size: 71; 514 states; 513 transitions;
 – TS1 (L4): Maximum Window Size: 83; 8088 states; 8087 transitions.
    Our goal is to compare metrics of models built with an existing ﬁxed window
algorithm and models built with the algorithms proposed in this paper.




                                        92
Table 1: Dependence of model’s metrics on parameters Threshold and Vwsc with
an unlimited window size
  Settings            TS2(L3 ): Condensed                  TS3(L3 ): Reduced                     TS2(L4 ): Condensed                      TS3(L4 ): Reduced
Thres-                    States Transs Act       States Transs                                         States Transs Act        States Transs
       VWSC        TThr                                         Simpl             Prec       TThr                                              Simpl          Prec
hold                      #      #      #         #      #                                              #      #      #          #      #
    0         1       1     514     513     89       514      513    0,0876              1          0    8088      8087 247        8088     8087   0,0153            1
 0,05         1       1     514     513     89       514      513    0,0876              1      13            37    36   20         320     1573     0,131      0,567
  0,1         1       1     514     513     89       514      513    0,0876              1      25            12    11   10         327     1595     0,129     0,5482
 0,25         1       3      11      10       9      470      478    0,0949        0,9911        63            9     8       8      320     1531     0,134     0,5585
 0,33         1       4       9       8       8      464      474    0,0959        0,9905       83            9      8       8      320     1531     0,134     0,5585
  0,5         1       6       7       6       6      470      478    0,0949        0,9911       126            6     5       5      308     1540   0,1342      0,5523
 0,65         1       7       6       6       6      470      478    0,0949        0,9911       163            3     2       2      289     1490   0,1394      0,5595
 0,75         1       8       7       6       6      470      478    0,0949        0,9911       188            3     2       2      289     1490   0,1394      0,5595
 0,85         1       9       7       6       6      470      478    0,0949        0,9911       213            3     2       2      289     1490   0,1394      0,5595
 0,95         1      10       7       6       6      470      478    0,0949        0,9911       238            2     1       1      308     1544   0,1339      0,5581
    1         1      11       1       0       0      470      478    0,0949        0,9911       251            2     1       1      308     1544   0,1339      0,5581

 0,25        0,5      3      11      10       9      397      438    0,1078        0,9505        63            9     8       8      173      992   0,2129      0,5514
 0,33        0,5      4       9       8       8      395      438         0,108    0,9477       83            9      8       8      173      992   0,2129      0,5514
  0,5        0,5      6       7       6       6      397      438    0,1078        0,9505       126            6     5       5      147      869   0,2441       0,556
 0,75        0,5      8       7       6       6      397      438    0,1078        0,9505       188            3     2       2      141      866   0,2463      0,5809

 0,25    0,25         3      11      10       9      312      403    0,1259        0,8764        63            9     8       8       73      645   0,3454      0,5116
 0,33    0,25         4       9       8       8      311      405    0,1257        0,8739       83             9     8       8       73      645   0,3454      0,5116
  0,5    0,25         6       7       6       6      309      400    0,1269        0,8771       126            6     5       5       78      639   0,3459      0,5387
 0,75    0,25         8       7       6       6      309      400    0,1269        0,8771       188            3     2       2       86      709   0,3119      0,4843

 0,33    0,12         4       9       8       8      139      320    0,1961        0,7012       83            9      8       8       59      565   0,3974      0,5333
 0,33    0,05         4       9       8       8       55      201    0,3516          0,676      83            9      8       8       26      404   0,5767      0,5266




                                      Table 2: Metrics of k-window models
                                  k (wnd
                                                             L3                                          L4
                                   size) States Transs Simpl Prec
                                                                     States
                                                                             Transs # Simpl Prec
                                          #      #                   #
                                        1     90   273 0,2479 0,6117     248    1755 0,124 0,3791
                                       2      274      371     0,1395       0,8574       1756       3602 0,046 0,7607
                                       3      372      418     0,1139       0,9457       3603       4838 0,029 0,8892
                                       4      419      437     0,1051       0,9828       4839       5600 0,024       0,946
                                       5      438      451     0,1012       0,9882       5601       6129 0,021 0,9671
                                       7      464      474     0,0959       0,9914       6515       6806 0,019 0,9836
                                      10      492      497        0,091     0,9961       7228       7392 0,017 0,9929
                                      15      508      508     0,0886        0,999       7840       7905 0,016 0,9968
                                      20      513      513     0,0877        0,999       8037       8054 0,015 0,9991




    Table 1 shows metrics of condensed and reduced models constructed with
unlimited window size. Metrics of k-window models for the logs are presented in
Table 2. Comparing the results from both tables, one can consider the follow-
ing outcome. By using a ﬁxed window approach presented in [2] the maximum
reduction is reached for a parameter k = 1. This is the smallest model that can
be built by the algorithm. Moreover, Tables 2 shows that increasing windows
size (k) to a value greater than 7 does not signiﬁcantly impact simplicity and
precision.




                                                                            93
    In our algorithms, there are two parameters aﬀecting a model size: Threshold
and Vwsc. Parameter Threshold has a limited impact on the model size. As it is
shown in Table 1, dependence of the model size from Threshold is nonlinear in
the entire domain of Threshold. For log L4, better simplicity results are in the
vicinity of the points 0.2 and 0.8 and worse in the range ends. This is because
in the case of dismissive small Threshold model TS2 (L) is similar to TS1 (L); so,
application area of a variable size window lies near the tree leaves. In such a
case, noticeable reduction of a model is achievable only for traces with similar
suﬃxes.
    Conversely, the closer a value of Threshold to 1, the lesser TS2 (L) is built.
Consequently, at the stage of building TS3 (L) model most of it is to be recon-
structed. Preliminary experiments show that selecting Threshold from a range
[0.2; 0.8] leads to better results; nevertheless, further elaboration of the param-
eter’s impact is needed.
    By decreasing the value of Vwsc parameter from 1 to 0, signiﬁcant reduction
of a resulting model size is achieved (last two rows of Table 1). For example,
for log L3 and a value of Vwsc = 0.05 we have Simpl(TS3 (L3)) = 0.3516 and
Prec(TS3 (L3)) = 0.676 versus Simpl(TSf (L3)) = 0.2479 and Prec(TSf (L3)) =
0.6117 for the case of one window size model. Generally, by adjusting the value
of Vwsc, one can obtain a resulting model in a wide range of sizes. Unlike pa-
rameter Threshold, parameter Vwsc gives a linear dependence of the model size.
Moreover, by varying values both of Threshold and Vwsc it is possible to enhance
mutual inﬂuence of the parameters on each other. That allows ﬂexible balancing
between precision and simplicity.

6   Conclusion
This paper presented a new approach for reducing transition systems, based on
an inference algorithm with a varied window size. In contrast to the existing
approaches the approach presented in this paper shows advantages of ﬂexible
adjustment of a resulting model size. This way, estimation of achievability of the
main goal is made on the basis of numerical characteristics of resulting models,
both absolute and integral metrics. Future work is aimed at further investigation
of impacts of various algorithm coeﬃcients on time costs of a region-based algo-
rithm applied to resulting transition systems. Moreover, we plan to investigate
the quality metrics of Petri nets obtained as an outcome of the algorithm.

References
 1. http://fluxicon.com/blog/2015/05/bpi-challenge-2015/
 2. van der Aalst, W.M.P., Rubin, V., Verbeek, H.M.W., Dongen, B.F., Kindler,
    E., Günther, C.W.: Process mining: a two-step approach to balance between
    underﬁtting and overﬁtting. Software & Systems Modeling 9(1), 87–111 (2008),
    http://dx.doi.org/10.1007/s10270-008-0106-z
 3. van der Aalst, W.M.P.: Process Mining - Discovery, Conformance and Enhance-
    ment of Business Processes. Springer (2011)




                                       94
 4. Angluin, D.: Inference of reversible languages. J. ACM 29(3), 741–765 (Jul 1982),
    http://doi.acm.org/10.1145/322326.322334
 5. Badouel, E., Darondeau, P.: Lectures on Petri Nets I: Basic Models: Advances
    in Petri Nets, chap. Theory of regions, pp. 529–586. Springer Berlin Heidelberg,
    Berlin, Heidelberg (1998), http://dx.doi.org/10.1007/3-540-65306-6_22
 6. Biermann, A.W., Feldman, J.A.: On the synthesis of ﬁnite-state machines from
    samples of their behavior. IEEE Trans. Comput. 21(6), 592–597 (Jun 1972), http:
    //dx.doi.org/10.1109/TC.1972.5009015
 7. Buijs, J., Dongen, B., Aalst, W.: On the Role of Fitness, Precision, Generaliza-
    tion and Simplicity in Process Discovery. In: Meersman, R., Rinderle, S., Dadam,
    P., Zhou, X. (eds.) OTM Federated Conferences, 20th International Conference
    on Cooperative Information Systems (CoopIS 2012). Lecture Notes in Computer
    Science, vol. 7565, pp. 305–322. Springer-Verlag, Berlin (2012)
 8. Carmona, J., Cortadella, J., Kishinevsky, M.: A region-based algorithm for discov-
    ering petri nets from event logs. In: BPM. pp. 358–373 (2008)
 9. Cook, J.E., Wolf, A.L.: Discovering models of software processes from event-based
    data. ACM Trans. Softw. Eng. Methodol. 7(3), 215–249 (Jul 1998), http://doi.
    acm.org/10.1145/287000.287001
10. Cortadella, J., Kishinevsky, M., Lavagno, L., Yakovlev, A.: Deriving petri nets
    from ﬁnite transition systems. IEEE Trans. Comput. 47(8), 859–882 (Aug 1998),
    http://dx.doi.org/10.1109/12.707587
11. Lorenzoli, D., Mariani, L., Pezzè, M.: Inferring state-based behavior models. In: 4th
    International Workshop on Dynamic Analysis (WODA 2006) co-located with the
    28th International Conference on Software Engineering (ICSE 2006). pp. 25–32.
    ACM Press (2006)
12. Park, D.M.R.: Concurrency and automata on inﬁnite sequences. In: Theoretical
    Computer Science, 5th GI-Conference, Karlsruhe, Germany, March 23-25, 1981,
    Proceedings. pp. 167–183 (1981), http://dx.doi.org/10.1007/BFb0017309
13. Rubin, V., Dongen, B.F.V., Kindler, E., Günther, C.W.: Process mining: A two-
    step approach using transition systems and regions. Tech. rep., BPM Center Report
    BPM-06-30, BPM Center (2006)
14. Shershakov, S.A., Lomazova, I.A., Kalenkova, A.A.: Method for transition sys-
    tems reduction based on state frequencies (preliminary version). Tech. rep., Higher
    School of Economics (2016), http://dx.doi.org/10.13140/RG.2.1.3438.6328
15. Sole, M., Carmona, J.: Region-based foldings in process discovery. IEEE Transac-
    tions on Knowledge and Data Engineering 25(1), 192–205 (2013)
16. Verbeek, H., Buijs, J., Dongen, B., Aalst, W.: ProM 6: The Process Mining Toolkit.
    In: Rosa, M.L. (ed.) Proc. of BPM Demonstration Track 2010. CEUR Workshop
    Proceedings, vol. 615, pp. 34–39 (2010)




                                           95