<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>model inference</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pierre Cry</string-name>
          <email>pierre.cry@centralesupelec.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Approximate Bayesian Computation.</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>MICS Laboratory</institution>
          ,
          <addr-line>CentraleSupélec</addr-line>
          ,
          <institution>Université Paris-Saclay at Gif-sur-Yvette</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Stochastic Process Discovery, Stochastic Workflow Nets, Stochastic Process Trees</institution>
          ,
          <addr-line>Model Inference, Optimization</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>school CentraleSupélec in Paris, France, with special thanks to András Horváth, from the university of Turin</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <fpage>20</fpage>
      <lpage>24</lpage>
      <abstract>
        <p>Business processes often exhibit significant variability in both structure and execution probabilities. While existing process discovery techniques focus primarily on control-flow, they typically overlook the stochastic nature of processes, limiting their ability to accurately simulate and predict real-world behaviour. My research addresses this gap by developing computationally eficient and statistically grounded methods for indirect stochastic process discovery from event logs. I propose a dual approach: (i) optimization-based discovery, where exact stochastic languages are extracted from workflow nets via log-driven unfolding of the net space and aligned to observed behaviour using distance metrics such as Kullback-Leibler divergence and Earth Mover's Distance; and (ii) statistical inference-based discovery, where probabilistic parameters are inferred using Approximate Bayesian Computation Sequential Monte Carlo (ABC-SMC), providing a posterior distribution over model weights. A third contribution focuses on extending process tree semantics to the stochastic dimension, enabling both efective optimization and subsequent exploitation by humans and computational tools. The proposed methods are validated on real-life event logs, demonstrating improved accuracy, scalability, and interpretability compared to state-of-the-art stochastic process discovery techniques. This work aims to establish a unified, reproducible framework for discovering probabilistic behaviours in business processes. This Ph.D. project began in October 2022 under the supervision of Paolo Ballarini and Pascale Le Gall at the MICS Laboratory of the engineering Italy, for his significant contributions to the majority of the work presented in this thesis.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>https://pierrecry98.github.io (P. Cry)</p>
      <p>CEUR</p>
      <p>ceur-ws.org
the observed process behaviour.</p>
      <p>My thesis addresses key challenges in discovering stochastic process models that reproduce observed
behaviours both accurately and eficiently. In particular, it seeks to answer the following research
questions:
1. Given a stochastic Petri net model, can we compute its corresponding stochastic language
eficiently?
2. Given a parametric stochastic Petri net model, can we design an optimisation procedure to identify
the optimal parameters with respect to a chosen stochastic conformance measure?
3. Since the exact computation of the stochastic language issued by a stochastic Petri net can be
computationally intensive and thus limited by scalability issues, can we devise techniques to
alleviate these limitations?
To address these questions, we propose methods that combine formal model analysis, optimisation
techniques, and statistical inference to uncover the probabilistic nature of processes from event data.
Section 2 situates this work within the state of the art in stochastic process discovery. Section 3 presents
the three main contributions of the thesis, and Section 4 discusses the results obtained so far, outlines
current limitations, and highlights directions for future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Relation to state of the art</title>
      <p>
        Only a limited number of methods have been proposed in the literature for discovering stochastic
process models directly from event logs. One of the earliest contributions is by Rogge-Solti et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
who introduced a framework for discovering generalized stochastic Petri nets (GSPNs) enriched with
generally distributed timed transitions, enabling performance analysis of the mined process.
      </p>
      <p>
        Burke et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] addressed the problem of converting a workflow Petri net, discovered using a
conventional algorithm, into a stochastic workflow Petri net through weight estimation. In contrast
to [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], their focus was on a subclass of GSPNs containing only immediate transitions. This means
that time delays are excluded, and the models capture only the probability of traces while ignoring
timestamps. They proposed six lightweight weight estimators that combine summary statistics from the
log (e.g., subsequence frequencies) with statistics from the model that consider structural relationships
between Petri net nodes (e.g., transition causality). Although computationally eficient, since they do
not require enumerating the model language, these methods often yield suboptimal conformance in
practice. In a follow-up study [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the authors introduced a novel framework for directly discovering
untimed Generalized Stochastic Petri Nets (GSPNs) from event logs using trace frequencies. Unlike
previous methods that apply weights to an existing control-flow model, this approach builds both the
structure and the stochastic behaviour simultaneously. The key innovation lies in the application of
reduction and abstraction rules that operate on the log without relying on a pre-mined model. These
rules form the core of a framework known as the Toothpaste Miner, which constructs Probabilistic
Process Trees. These trees are then systematically transformed into corresponding GSPNs.
      </p>
      <p>A diferent line of work approaches stochastic process discovery as an optimization problem. The
main challenge lies in estimating the probability of each trace generated by the model, an algorithmically
non-trivial task due to the potentially large or infinite size of the model’s language. In [ 4], the authors
proposed optimizing the earth mover’s stochastic conformance [5] (EMSC) score of a discovered
stochastic Petri net via subgradient ascent. Their method consists of two steps: (i) computing the
stochastic language of the model by analysing its structure, and (ii) performing subgradient optimization
on the conformance loss, propagating gradients to update the model’s transition weights.</p>
      <p>Leemans et al. [6] examined the stochastic process discovery problem from a broader perspective,
emphasising how the choice of modelling formalism (representational bias) can afect the discovery
of an optimal stochastic model. They showed that a perfectly fitting control-flow model can be less
conformant than a less fitting one when trace probabilities are considered. In the same work, they
proposed two strategies:
1. Direct discovery of an optimal stochastic Petri net from the log (one go).
2. Indirect discovery, where an existing non-stochastic model is enriched with optimal weights (two
stages).</p>
      <p>For the second approach, they introduced a weight optimisation scheme based on analytical probability
expressions. Given  distinct traces in the log, these expressions are obtained from  absorbing-state
probability problems on  discrete-time Markov chains, each derived from the cross-product of the
model’s stochastic reachability graph and a deterministic finite automaton representing a log trace.
The resulting formulas are fed to an optimisation engine to maximise either the unit Earth Mover’s
Stochastic Conformance [5] (uESMC) or the inverted entropic relevance [7] (ER−1). While efective,
this approach sufers from scalability issues: the analytical expressions grow rapidly with trace length,
making the method impractical for even moderately complex models.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Contribution and results</title>
      <sec id="sec-3-1">
        <title>3.1. Stochastic process discovery via optimization</title>
        <p>We address the problem of discovering stochastic process models by formulating them as optimization
problems. Given a parametric stochastic Petri net, we aim to identify parameters (e.g., transition firing
probabilities) that maximize the model’s alignment with the observed behaviour recorded in an event
log.</p>
        <p>A central challenge is that optimizing a stochastic model requires evaluating its stochastic language,
i.e., the probability distribution over all possible traces generated by the model. This computation
is algorithmically non-trivial for workflow nets due to the potentially large or infinite size of the
underlying language.</p>
        <p>To address this, we proposed in [8] a log-driven unfolding-based procedure to compute the probability
of each trace in the log without enumerating the entire model language. This approach avoids the need
to derive and store large symbolic formulas, as required in previous methods, and directly exploits the
structure of the model to identify relevant execution paths. We further improved this procedure by
introducing memoization, preventing redundant recalculations for common prefixes shared across log
traces. These enhancements yield substantial performance gains, particularly in iterative contexts such
as optimization.</p>
        <p>Once the stochastic language can be computed eficiently, we define objective functions to quantify
stochastic conformance between the model and the log. In this thesis, we consider two such measures:
the Kullback–Leibler divergence (KLD) and the restricted Earth Mover’s Distance (rEMD), both of which
compare probability distributions over traces. When the objective function is diferentiable, as with
KLD, we exploit analytical expressions of its derivatives to enable the use of gradient-based optimization
methods such as L-BFGS-B [9] and TNC [10]. We employ derivative-free optimizers such as Powell’s
method [11] or Nelder–Mead [12] for non-diferentiable objectives.</p>
        <p>The resulting framework allows us to iteratively adjust the model’s stochastic parameters to improve
its conformance to the observed behaviour, balancing computational eficiency with conformance
quality. Empirical evaluation on real-life event logs from the BPI Challenge demonstrates that this
optimization-based approach can yield models with improved stochastic accuracy compared to existing
state-of-the-art methods.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Stochastic process discovery via Bayesian inference</title>
        <p>While the optimisation-based approach seeks a single set of parameters that maximises a stochastic
conformance measure, a Bayesian perspective aims to estimate an entire posterior distribution over
the model parameters given the observed event log. This allows for estimating and quantifying the
uncertainty in the inferred parameters, providing deeper insights into how each parameter influences
the model’s stochastic behaviour.</p>
        <p>This work [13] employs the Approximate Bayesian Computation Sequential Monte Carlo (ABC-SMC)
algorithm to perform Bayesian inference on stochastic process models. ABC-SMC is particularly
wellsuited for this setting because it does not require the explicit likelihood function of the observed data
under the model, which is intractable in most cases. Instead, it relies on simulating the model under
candidate parameter values, comparing the resulting stochastic language to the log using a suitable
distance measure, and progressively refining the parameter population across successive generations.</p>
        <p>Our ABC-SMC framework first starts with an initial population of parameter vectors, sampled from
a prior uniform distribution. For each parameter vector, the model is simulated, using a model checker,
COSMOS, relying on the HASL logic to approximate its stochastic language. This simulation approach
helps us deal with larger logs. A distance function (e.g., restricted Earth Mover’s Distance) compares the
simulated stochastic language with the event log. Only parameter vectors producing a distance below a
predefined threshold are retained. The threshold is gradually reduced in subsequent generations, leading
to increasingly accurate parameter estimates. New parameter vectors are generated by perturbing
previously accepted ones, forming the next population generation.</p>
        <p>The procedure’s outcome is an empirical approximation of the posterior distribution over the
parameters. This provides several benefits over pure optimisation as it captures multiple plausible parameter
configurations rather than a single optimum and helps understand each parameter’s impact on the
overall model.</p>
        <p>Experimental results on real-life BPI Challenge logs show that ABC-SMC can discover models with
high stochastic conformance while ofering richer interpretability and robustness than
optimisationonly methods. Parallelisation of the particle search further improves scalability, making the approach
applicable to realistically sized event logs.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. A new formalism for stochastic process discovery: stochastic process trees</title>
        <p>While stochastic Petri nets ofer a powerful and expressive formalism for representing stochastic
behaviours, they often become large and complex when modelling real-life processes, making them
harder to operate and interpret. This complexity can hinder both the optimisation of their parameters
and the communication of results to process analysts and domain experts.</p>
        <p>To address these limitations, we introduce, in stochastic process trees (SPTs), an extension of classical
process trees in which control-flow operators are enriched with stochastic semantics. Process trees
provide a hierarchical and block-structured representation of processes, where each internal node
corresponds to a control-flow operator ( sequence, choice, parallel, loop) and leaves correspond to
activities. Their structured nature ensures soundness by construction and facilitates interpretability.
In our stochastic extension, each operator, but not the sequence, is associated with parameters that
govern the probabilistic behaviour of its execution:
• For choice nodes, probabilities determine the likelihood of selecting each branch.
• For parallel nodes, probabilities influence the ordering of interleaved activities.</p>
        <p>• For loop nodes, probabilities model the likelihood of repeating the loop body versus exiting it.</p>
        <p>We formalise the semantics of SPTs by defining their associated stochastic language, extending the
deterministic process tree semantics to account for execution probabilities. This formalisation enables
a direct computation of the probability of any tree-generated trace, either by simulating the tree or by a
log-driven exploration of it and applying optimization techniques to fit the tree’s parameters to event
log data.</p>
        <p>The SPT formalism is particularly well-suited for parameter discovery, as the block-structured
representation reduces the number of stochastic parameters. Moreover, its interpretability makes it a
practical choice for interactive analysis, allowing analysts to understand not only the structure of the
process but also the likelihood of alternative behaviours.</p>
        <p>We demonstrate the applicability of SPTs in both optimisation-based stochastic discovery settings,
showing that they can achieve competitive conformance while ofering significant advantages in terms
of scalability, interpretability, and ease of use.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion and Future works</title>
      <p>In the context of this Ph.D. project, we have developed diferent approaches aimed at providing answers
to the research questions outlined in Section 1, namely: (1) the eficient computation of the exact
stochastic language issued by a stochastic workflow net, (2) the optimisation of stochastic nets with
respect to a given stochastic conformance criterion, and (3) the alleviation of the computational hardness
associated with stochastic language calculation. Specifically, we introduced a log-driven unfolding
procedure for exact probability computation, extended with memoization to reuse intermediate results
and reduce redundant calculations. We proposed optimisation schemes based on diferentiable and
non-diferentiable conformance measures and a Bayesian inference framework to estimate complete
posterior distributions over model parameters. Finally, we extended the process tree formalism with
stochastic semantics, enabling scalable, interpretable, and structurally guaranteed sound stochastic
models.</p>
      <p>In the short-term future, we plan to refine the optimisation and inference procedures developed
in this thesis and extend our series of experiments to additional datasets. More specifically, we are
improving the unfolding procedure’s eficiency, aiming to further reduce computation time and enhance
scalability without compromising accuracy. In the same spirit, we are working on the traversal algorithm
for stochastic process trees to compute their stochastic language eficiently. We are also developing
mapping rules that would allow us to convert sPTs into other stochastic formalisms, which, so far, have
proven to be non-trivial.</p>
      <p>In the longer term, we envision extending our methods to support richer behavioural features, such
as time distributions. We aim to model activity durations, waiting times, and temporal correlations
between events by exploiting the temporal information in event logs through their timestamps. This
would allow the discovery of timed stochastic process models capable of jointly capturing control-flow,
probabilistic behaviour, and temporal dynamics. This would enable such models to support novel forms
of analysis, such as performance evaluation, bottleneck detection, and predictive monitoring, and to
derive deeper insights from event logs.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author used Grammarly in order to: Grammar and spelling
check, Paraphrase and reword. After using this tool/service, the author reviewed and edited the content
as needed and take full responsibility for the publication’s content.
Proceedings, volume 12734 of Lecture Notes in Computer Science, Springer, 2021, pp. 312–336. URL:
https://doi.org/10.1007/978-3-030-76983-3_16. doi:10.1007/978-3-030-76983-3\_16.
[4] T. Brockhof, M. S. Uysal, W. M. Van Der Aalst, Wasserstein weight estimation for stochastic
petri nets, in: 2024 6th International Conference on Process Mining (ICPM), 2024, pp. 81–88.
doi:10.1109/ICPM63005.2024.10680664.
[5] S. Leemans, A. Syring, W. Aalst, Earth Movers’ Stochastic Conformance Checking, 2019, pp.</p>
      <p>127–143. doi:10.1007/978-3-030-26643-1_8.
[6] S. J. J. Leemans, T. Li, M. Montali, A. Polyvyanyy, Stochastic process discovery: Can it be done
optimally?, in: Advanced Information Systems Engineering: 36th International Conference, CAiSE
2024, Limassol, Cyprus, June 3–7, 2024, Proceedings, Springer-Verlag, Berlin, Heidelberg, 2024, p.
36–52. URL: https://doi.org/10.1007/978-3-031-61057-8_3. doi:10.1007/978-3-031-61057-8_3.
[7] A. Polyvyanyy, A. Mofat, L. García-Bañuelos, An entropic relevance measure for stochastic
conformance checking in process mining, in: 2020 2nd International Conference on Process
Mining (ICPM), IEEE, 2020, pp. 97–104.
[8] P. Cry, A. Horváth, P. Ballarini, P. Le Gall, A framework for optimisation based stochastic process
discovery, in: J. Hillston, S. Soudjani, M. Waga (Eds.), Quantitative Evaluation of Systems and
Formal Modeling and Analysis of Timed Systems, Springer Nature Switzerland, Cham, 2024, pp.
34–51.
[9] R. H. Byrd, P. Lu, J. Nocedal, C. Zhu, A limited memory algorithm for bound constrained
optimization, SIAM Journal on Scientific Computing 16 (1995) 1190–1208. URL: https://doi.org/10.1137/
0916069. doi:10.1137/0916069. arXiv:https://doi.org/10.1137/0916069.
[10] S. G. Nash, Newton-type minimization via the lanczos method, SIAM Journal on
Numerical Analysis 21 (1984) 770–788. URL: https://doi.org/10.1137/0721052. doi:10.1137/0721052.
arXiv:https://doi.org/10.1137/0721052.
[11] M. J. D. Powell, An eficient method for finding the minimum of a function of
several variables without calculating derivatives, The Computer Journal 7 (1964)
155–162. URL: https://doi.org/10.1093/comjnl/7.2.155. doi:10.1093/comjnl/7.2.155.
arXiv:https://academic.oup.com/comjnl/article-pdf/7/2/155/959784/070155.pdf.
[12] F. Gao, L. Han, Implementing the nelder-mead simplex algorithm with adaptive parameters,
Computational Optimization and Applications 51 (2012) 259–277. doi:10.1007/s10589-010-9329-3.
[13] P. Cry, P. Ballarini, A. Horváth, P. Le Gall, Statistical Bayesian Inference for Stochastic Process
Discovery, in: Proceedings of the International Conference on Quantitative Evaluation of SysTems
(QEST), Aarhus (Danemark), Denmark, 2025. URL: https://hal.science/hal-05134848.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rogge-Solti</surname>
          </string-name>
          , W. Aalst, van der,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weske</surname>
          </string-name>
          ,
          <article-title>Discovering stochastic petri nets with arbitrary delay distributions from event logs</article-title>
          , in: N.
          <string-name>
            <surname>Lohmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Song</surname>
          </string-name>
          , P. Wohed (Eds.), Business Process Management Workshops :
          <article-title>BPM 2013 International Workshops</article-title>
          , Beijing, China,
          <year>August 26</year>
          ,
          <year>2013</year>
          ,
          <string-name>
            <given-names>Revised</given-names>
            <surname>Papers</surname>
          </string-name>
          ,
          <source>Lecture Notes in Business Information Processing</source>
          , Springer, Germany,
          <year>2014</year>
          , pp.
          <fpage>15</fpage>
          -
          <lpage>27</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>319</fpage>
          - 06257-
          <issue>0</issue>
          _
          <fpage>2</fpage>
          , 9th International Workshop on Business Process Intelligence (BPI
          <year>2013</year>
          ), BPI 2013 ; Conference date:
          <fpage>26</fpage>
          -
          <lpage>08</lpage>
          -2013 Through 26-
          <fpage>08</fpage>
          -
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Burke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J. J.</given-names>
            <surname>Leemans</surname>
          </string-name>
          , M. T. Wynn,
          <article-title>Stochastic process discovery by weight estimation</article-title>
          , in: S.
          <string-name>
            <surname>J. J. Leemans</surname>
          </string-name>
          , H. Leopold (Eds.),
          <string-name>
            <surname>Process Mining</surname>
          </string-name>
          Workshops - ICPM 2020 International Workshops, Padua, Italy, October 5-
          <issue>8</issue>
          ,
          <year>2020</year>
          , Revised Selected Papers, volume
          <volume>406</volume>
          <source>of Lecture Notes in Business Information Processing</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>272</lpage>
          . URL: https://doi.org/10.1007/ 978-3-
          <fpage>030</fpage>
          -72693-5_
          <fpage>20</fpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>030</fpage>
          - 72693- 5\_
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Burke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J. J.</given-names>
            <surname>Leemans</surname>
          </string-name>
          , M. T. Wynn,
          <article-title>Discovering stochastic process models by reduction and abstraction</article-title>
          , in: D.
          <string-name>
            <surname>Buchs</surname>
          </string-name>
          , J. Carmona (Eds.),
          <source>Application and Theory of Petri Nets and Concurrency - 42nd International Conference, PETRI NETS</source>
          <year>2021</year>
          ,
          <string-name>
            <given-names>Virtual</given-names>
            <surname>Event</surname>
          </string-name>
          , June 23-25,
          <year>2021</year>
          ,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>