<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Federated Stochastic Process Discovery: Definition, Benefits, and Challenges</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hootan Zhian</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>The University of Melbourne</institution>
          ,
          <addr-line>Victoria 3010</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>A process model is a representation that describes the behavior of a system, constructed by applying process mining algorithms to data found in event logs. The primary advantage of a process model is that it provides a clear understanding of how processes function. The concept of process mining initially emerged from research focused on extracting sequences of activities within workflow environments. Today, modern organizations widely use process mining techniques, including process discovery, to analyze their event logs. These techniques help organizations gain insights into the reality of their operational processes and identify opportunities for improvement. However, in practical scenarios, event log data is often distributed and may contain sensitive information. Traditional process discovery methods rely on event logs stored in centralized repositories. This centralization, however, poses challenges in distributed environments, such as concerns about data availability, privacy, and the high communication and bandwidth demands associated with centralization. I explore how organizations can collaboratively create a process model without sharing their local logs, addressing challenges in distributed environments. Additionally, I discuss how to optimize the quality of extracted models.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Stochastic process mining</kwd>
        <kwd>federated process discovery</kwd>
        <kwd>cross-silos process discovery</kwd>
        <kwd>optimization</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Process mining has become an important field that bridges traditional business process management
with data-driven analytics. It enables organizations to uncover, monitor, and enhance their actual
processes by extracting valuable insights from event logs recorded during business operations. As
organizations increasingly digitize their activities, the volume and distribution of process-related data
have expanded significantly, presenting both opportunities and challenges for the implementation of
process mining techniques. A key problem in process mining is process discovery, which focuses on
automatically creating process models from event logs. These models serve as visual representations of
actual business processes, helping organizations understand, analyze, and improve their operations [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
However, traditional process discovery approaches face significant limitations in today’s distributed
business environment, particularly when dealing with cross-organizational processes and sensitive
data [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        The emergence of federated process discovery addresses the limitations of centralized approaches
by enabling organizations to collaborate on process mining initiatives without sharing raw event log
data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This approach is particularly crucial in scenarios where data privacy regulations, competitive
concerns, or technical constraints prevent the centralization of event logs. Moreover, process discovery
is inherently a time-consuming task, which limits the ability to exploit optimization solutions, such as
metaheuristics, to a full extent [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In contrast, federated stochastic process discovery ofers a significant
advantage by allowing organizations to avoid the need to aggregate all event logs. This decentralized
approach facilitates the application of optimization techniques, enabling a more efective exploration of
the solution space to identify optimized solutions.
      </p>
      <p>However, despite these advancements, process discovery still faces challenges related to the evaluation
of process models across diferent and conflicting quality dimensions. Process discovery is a
multiobjective optimization problem. Yet, most existing process discovery techniques neglect this fact.
23rd International Conference on Business Process Management
$ hzhian@student.unimelb.edu.au (H. Zhian)</p>
      <p>© 2025 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>
        Traditionally, process discovery techniques aggregate diferent objectives into a single objective (SO)
function. This approach, however, comes with limitations, such as the need for a priori knowledge
of objectives’ importance and dificulty in evaluating trade-ofs between the objectives. Recently,
metaheuristic optimization techniques have been used in process discovery [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], particularly when it
comes to balancing diferent quality goals. This success is also seen in the works by Buijs et al. [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ],
who use multiple quality dimensions to evaluate the quality of discovered process models. Maintaining
a diverse list of process models is vital in multi-objective optimization to ensure solutions are uniformly
distributed over the search space. Without preventive measures, populations of discovered models can
cluster, leading to traps in local maxima.
      </p>
      <p>My PhD project aims to answer these research questions (RQs):
RQ1: How to perform federated stochastic process discovery efectively and eficiently?
RQ2: Can multi-objective metaheuristics improve solutions to the stochastic process discovery problem?
RQ3: How to perform federated stochastic process discovery in online settings?</p>
    </sec>
    <sec id="sec-2">
      <title>2. Research Problem</title>
      <p>My PhD project addresses three critical challenges in modern process mining. First, organizations
increasingly need to collaborate on process mining initiatives while maintaining data privacy. Traditional
centralized approaches require sharing raw data, which is often impossible due to privacy regulations,
competitive concerns, technical constraints, and high communication and bandwidth costs. Second,
current process discovery methods struggle to balance multiple objectives, including model performance
metrics like fitness, precision, and simplicity. Third, in spite of federated process discovery potentials, it
introduces new challenges, including the complexity of merging distributed process models, maintaining
data privacy, and preserving good quality models that reflect the behavior of the whole organization.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Related Work</title>
      <p>
        In practice, event logs are often distributed and may contain sensitive data, challenging traditional
process mining, which assumes centralized logs. To address this, van der Aalst [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] proposed federated
process mining, mapping organization-specific logs into a unified federated log. Two approaches were
proposed: sharing filtered event data or abstractions like directly-follows graphs (DFGs) to ensure
confidentiality. However, merging these abstractions remains an unresolved challenge. Rojo et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
explored federated mining using distributed devices, such as smartphones, to analyse human actions.
Similarly, Khan et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] developed a federated approach for cross-silo mining, using dependency
graphs and a privacy-preserving protocol based on the Heuristic Miner algorithm, coordinated by a
centralized server. Rafiei and van der Aalst [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposed a privacy-aware abstraction-based method
for federated process discovery. Their approach uses directly-follows relations to create abstractions,
enabling collaboration while protecting sensitive data through mechanisms like handover relations.
      </p>
      <p>
        Finding the best process model that clearly shows all the details from the event log is a complex
problem. It involves trying to meet diferent objectives that can sometimes conflict with each other,
necessitating multi-objective optimization (MOO) techniques [
        <xref ref-type="bibr" rid="ref4 ref9">9, 4</xref>
        ]. Alkhammash et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] recently
demonstrated that process discovery based on the genetic optimization of the ALERGIA grammatical
inference algorithm—called GASPD—can construct interesting DFGs in terms of size and accuracy
of reflecting the likelihood of traces of the system that generated the event log. Addressing multiple
objectives has typically involved consolidating them into a single objective (SO) function. However,
this method has several drawbacks, such as the necessity of having prior knowledge about the relative
importance of each objective, resulting in only one solution, and complicating the evaluation of trade-ofs.
The limitations of this approach include the need for an understanding of each objective’s significance,
the fact that the aggregated function yields only a single solution, the dificulty in assessing trade-ofs
between objectives, and the potential infeasibility of the solution unless the search space is convex. In
contrast, multi-objective optimization (MOO) problems are inherently more complex, as they generate
a set of optimal solutions that represent acceptable trade-ofs among the various objectives rather than
a single solution.
      </p>
      <p>
        Multi-optimization evolutionary solutions in process discovery began with the work of Van der Aalst
et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], utilizing genetic algorithms to extract features from global searches while addressing noise
challenges. This concept was further developed through methods like the Evolutionary Tree [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], a
genetic approach is applied to identify Pareto frontiers. The evolutionary Miner [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Alkhammash et al.
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] have demonstrated the efectiveness of genetic algorithms for optimizing grammatical inference for
the discovery of superior stochastic process models by extracting pareto-optimal models.
      </p>
      <p>
        Maintaining a diverse population in multiobjective optimization (MOO) is crucial to ensure solutions
are well-distributed across the Pareto front and to prevent genetic drift [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Niching techniques, such
as fitness sharing and crowding, help algorithms explore multiple peaks and avoid local optima [
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ].
These methods have proven efective in enhancing evolutionary algorithms’ ability to solve complex,
multimodal problems [
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Approach</title>
      <sec id="sec-4-1">
        <title>4.1. Federated Stochastic Process Discovery</title>
        <p>
          I presented an extension of GASPD capable of federated process discovery that operates over a distributed
event log in two phases [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Unlike existing techniques, which typically rely on projections of complete
traces, our approach works directly with multiple collections of complete traces. Each collection captures
aspects of the overall process based on a specific data subset. In the first phase, GASPD discovers models
with various quality characteristics from each part of the distributed event log, which can be viewed as
a collection of local event logs stored on dedicated devices. These superior models, discovered from
each local event log, are then sent to a central server. The server aggregates them into a model that
describe the overall system.
        </p>
        <p>The standard implementation, referred to as FedGASPD, discovers one model from each input event
log and merges them, providing a basic baseline for federated process discovery. However, when the
discovered models exhibit diverse characteristics, this merging may lead to a loss of distinctiveness and
dilute the strong individual features due to the averaging efect.</p>
        <p>To address this limitation, I proposed a variation called FedCGASPD, which avoids merging all
models into a single output. Instead, it groups similar models into clusters and merges the models
within each cluster. This approach yields multiple models that retain the strong characteristics of the
merged models, thereby preserving their distinctiveness. This method supports scalable, eficient, and
privacy-preserving process discovery across organizational boundaries.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Stochastic Process Discovery as Multi-Objective Optimization</title>
        <p>
          I conducted a comprehensive review of multi-objective metaheuristics to identify those that eficiently
support the discovery of models—based on the ALERGIA grammatical inference algorithm—that are
simple (in terms of size), accurate (in terms of Entropic relevance), and diverse [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Entropic relevance
is chosen as the accuracy measure because it assesses how well a model reflects the likelihood of target
traces, balances precision and recall, and is computationally eficient, that is, is computable in time linear
to the size of the input event log [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. I introduced a classification of multi-objective metaheuristics as
alternatives to the genetic algorithm used in GASPD, based on their strategies for selecting candidate
solutions. Additionally,to ensure the discovered models exhibit a wide range of quality characteristics, I
incorporated a niching technique, which allows metaheuristics to explore multiple promising search
subspaces simultaneously, reducing the risk of premature convergence to local optima [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Finally, I
conducted an empirical evaluation using industrial event logs.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Research Methodology</title>
      <p>
        My PhD leverages the Design Science research methodology [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], focusing on creating innovative
solutions for federated stochastic process discovery and optimization. The approach balances rigor—through
literature review, benchmark construction, and simulation-based validation—with practical relevance
by testing on real-world datasets and exploring organizational case studies. This dual emphasis ensures
both theoretical advancement and actionable outcomes, addressing real implementation challenges.
Ultimately, the research aims to bridge academic innovation with practical impact in federated process
optimization, aligning with the core principles of Design Science [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Achieved Results and Future Work</title>
      <p>
        I presented two algorithms for stochastic process discovery in distributed environments [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. These
algorithms are built upon GASPD that operates over centralized event logs. The new algorithms discover
process models from several local event logs, possibly scattered across diferent organizations or silos,
and aim to preserve the autonomy and privacy of each party and to decrease data communication
requirements and the overall model discovery time. Our experiments demonstrate the efectiveness
of FedGASPD and FedCGASPD, providing scalable alternatives for process discovery in distributed
environments.
      </p>
      <p>While federated approaches have shown promising results, several avenues remain to further enhance
their efectiveness. One key area for improvement lies in systematically exploring the impact of diferent
orders of merging models discovered from local event logs on the quality of the constructed global
model. Such an exploration could provide valuable insights into optimizing the merging process.</p>
      <p>Another promising direction involves optimizing the discovered models for quality criteria beyond
size and Entropic relevance. Additionally, exploring alternative policies for selecting superior local
models represents an intriguing avenue for future research. One critical challenge identified is that the
process of merging models can introduce features into the global model that do not reflect any behavior
present in the local logs. To address this, performing optimization directly on the global model could
help achieve a more accurate representation of the overall behavior of the participating organizations.</p>
      <p>
        To improve the quality and diversity of discovered models, I implemented and evaluated nine
metaheuristics as alternatives to the Genetic Algorithm used in the GASPD stochastic process discovery
algorithms, enhanced by niching techniques to ensure the diversity of the discovered models. For
performance evaluation, I used two metrics: dominance count and Diversity Comparison Indicator
(DCI) [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. Dominance count evaluates how many solutions from one algorithm dominate others
on the global Pareto front, while DCI assesses diversity by examining solution spread in objective
space. Experiments on real event logs reveal that niching techniques enhance model diversity and help
avoid local maxima. Empirical results across twelve logs show Diferential Evolution (DE) consistently
surpasses other metaheuristics, including GASPD. Future work may refine niching methods and improve
DE and other metaheuristics for stronger process discovery algorithms.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>My PhD program has been funded by the Melbourne Research Scholarship from the University of
Melbourne. I want to thank Prof. Artem Polyvyanyy and Prof. Rajkumar Buyya for supporting and
inspiring my PhD research.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <sec id="sec-8-1">
        <title>The author has not employed any Generative AI tools.</title>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>A. Online Resources</title>
      <sec id="sec-9-1">
        <title>Our implemented algorithms are publicly available via the following links.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W. van der</given-names>
            <surname>Aalst</surname>
          </string-name>
          , T. Weijters, L. Maruster,
          <article-title>Workflow mining: Discovering process models from event logs</article-title>
          ,
          <source>Transactions on Knowledge and Data Engineering</source>
          <volume>16</volume>
          (
          <year>2004</year>
          )
          <fpage>1128</fpage>
          -
          <lpage>1142</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ghose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Dam</surname>
          </string-name>
          ,
          <article-title>Cross-silo process mining with federated learning</article-title>
          ,
          <source>Journal</source>
          <volume>13121</volume>
          (
          <year>2021</year>
          )
          <fpage>612</fpage>
          -
          <lpage>626</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          ,
          <article-title>Federated process mining: Exploiting event data across organizational boundaries</article-title>
          ,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>H.</given-names>
            <surname>Alkhammash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mofat</surname>
          </string-name>
          ,
          <article-title>Stochastic directly-follows process discovery using grammatical inference</article-title>
          ,
          <source>in: CAiSE</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>87</fpage>
          -
          <lpage>103</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Buijs</surname>
          </string-name>
          ,
          <string-name>
            <surname>B. van Dongen</surname>
          </string-name>
          , W. van der Aalst,
          <article-title>A genetic algorithm for discovering process trees</article-title>
          ,
          <source>in: Proceedings of the IEEE Congress on Evolutionary Computation</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Buijs</surname>
          </string-name>
          ,
          <string-name>
            <surname>B. van Dongen</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          ,
          <article-title>Quality dimensions in process discovery: The importance of fitness, precision, generalization and simplicity</article-title>
          ,
          <source>International Systems</source>
          <volume>23</volume>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Rojo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Garcia-Alonso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Berrocal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Murillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Canal</surname>
          </string-name>
          ,
          <article-title>Sowcompact: A federated process mining method for social workflows</article-title>
          ,
          <source>Information Sciences 595</source>
          (
          <year>2022</year>
          )
          <fpage>18</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rafiei</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          ,
          <article-title>An abstraction-based approach for privacy-aware federated process mining</article-title>
          ,
          <source>IEEE Access 11</source>
          (
          <year>2023</year>
          )
          <fpage>33697</fpage>
          -
          <lpage>33714</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Augusto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <article-title>Metaheuristic optimization for automated business process discovery</article-title>
          ,
          <source>in: Business Process Management</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>268</fpage>
          -
          <lpage>285</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>W.</given-names>
            <surname>Van der Aalst</surname>
          </string-name>
          , A. de Medeiros, A. Weijters,
          <article-title>Genetic process mining</article-title>
          ,
          <source>in: Applications and Theory of Petri Nets</source>
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>W.</given-names>
            <surname>Van der Aalst</surname>
          </string-name>
          , T. Weijters, L. Maruster,
          <article-title>Workflow mining: Discovering process models from event logs</article-title>
          ,
          <source>Transactions on Knowledge and Data Engineering</source>
          <volume>16</volume>
          (
          <year>2004</year>
          )
          <fpage>1128</fpage>
          -
          <lpage>1142</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>T.</given-names>
            <surname>Molka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Redlich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Gilani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Drobek</surname>
          </string-name>
          ,
          <article-title>Evolutionary computation-based discovery of hierarchical business process models</article-title>
          , Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Konaka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Coit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Smith</surname>
          </string-name>
          <article-title>, Multi-objective optimization using genetic algorithms: A tutorial, Reliability Engineering and System Safety (</article-title>
          <year>2006</year>
          )
          <fpage>992</fpage>
          -
          <lpage>1007</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>B.</given-names>
            <surname>Sareni</surname>
          </string-name>
          , L. Krahenbuhl,
          <article-title>Fitness sharing and niching methods revisited</article-title>
          ,
          <source>IEEE Transactions on Evolutionary Computation</source>
          <volume>2</volume>
          (
          <year>1998</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>K.</given-names>
            <surname>Deb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pratap</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Meyarivan</surname>
          </string-name>
          ,
          <article-title>A fast and elitist multiobjective genetic algorithm: Nsga-ii</article-title>
          ,
          <source>IEEE Transactions on Evolutionary Computation</source>
          <volume>6</volume>
          (
          <year>2002</year>
          )
          <fpage>182</fpage>
          -
          <lpage>197</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Buyya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <article-title>Federated stochastic process discovery using grammatical inference</article-title>
          ,
          <source>in: Proceedings of the CAiSE 2025 Conference</source>
          ,
          <year>2025</year>
          .
          <article-title>Accepted for publication</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Buyya</surname>
          </string-name>
          ,
          <article-title>Artem, Multi-objective metaheuristics for efective and eficient stochastic process discovery</article-title>
          ,
          <source>in: Proceedings of the BPM 2025 Conference</source>
          ,
          <year>2025</year>
          .
          <article-title>Accepted for publication</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>H.</given-names>
            <surname>Alkhammash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mofat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>García-Bañuelos</surname>
          </string-name>
          ,
          <article-title>Entropic relevance: A mechanism for measuring stochastic process models discovered from event data</article-title>
          ,
          <source>Information Systems</source>
          <volume>107</volume>
          (
          <year>2022</year>
          )
          <fpage>101922</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hevner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>March</surname>
          </string-name>
          , J. Park,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sudha</surname>
          </string-name>
          , Design science in
          <source>information systems research, Management Information Systems Quarterly</source>
          <volume>28</volume>
          (
          <year>2008</year>
          )
          <article-title>6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J.</given-names>
            <surname>Doe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <article-title>Diversity comparison of pareto front approximations in many-objective optimization</article-title>
          ,
          <source>Journal of Optimization Research</source>
          (
          <year>2023</year>
          )
          <fpage>123</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>