<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Object-Centric Process Mining for Process Analysis and Operational Support⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giulia Rufini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Turin</institution>
          ,
          <addr-line>Turin</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>Object-centric process mining is a novel paradigm which has been gaining an increasing attention thanks to its potential to overcome limitations of traditional process mining techniques. In this PhD project, we aim at developing novel techniques able to leverage an object-centric paradigm to provide data-driven decision support on running process executions. In particular, we plan to develop novel techniques to a) identify process behaviors which impact the process outcome; b) generate diagnostics for deviant behaviors; and, c) deliver predictions on a KPI of interest at a given moment of a process execution. The manuscript will delve into the research questions and the methodology envisioned for each of these goals.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Object-Centric Process Mining</kwd>
        <kwd>Predictive Process Monitoring</kwd>
        <kwd>Conformance Checking</kwd>
        <kwd>Pattern Extraction</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        An increasing attention in the field of automated process analysis concerns the Object-Centric
Process Mining (OC-PM) perspective, which can better represent the complexity and
interconnections existing in business operations compared to classical Process Mining (PM) techniques
[
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. First, OC-PM handles multi-dimensional event logs better: by considering diferent objects
rather than a single object as case ID and the other objects as attributes, the convergence and
divergence problem can be avoided [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. OC-PM also considers the interrelationships between
diferent objects, allowing one to grasp information that would otherwise be lost.
      </p>
      <p>
        Related Works. With the development of object-centric process mining and object-centric
event-log (OCEL) formats [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ], existing process mining techniques must be adapted as well.
Several eforts have already been made in this direction. Contributions made by previous
work include: the introduction of object-centric process models (e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]); techniques to
explore process models and features in the OC setting (e.g. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], or [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]); OC-based approaches to
performance analysis [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and constraint monitoring [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However, to the best of our knowledge,
only a few studies have addressed important process analysis tasks such as process pattern
extraction [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], conformance checking techniques [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], and predictive process monitoring [
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ]
within the OC paradigm. Several open challenges exist for each of these tasks.
      </p>
      <p>This project aims to contribute to this research area by introducing OC techniques to support
the previously mentioned process analysis tasks. More precisely, the project aims at answering
the following research questions:
RQ1 How to extract OC process patterns which impact the process outcome?
RQ2 How to asses the conformance of process executions considering OC relations?
RQ3 How to leverage the OC paradigm to enhance the quality of process predictions?</p>
      <p>The rest of this manuscript delves into each of these research problems, elaborating upon the
corresponding challenges and the envisioned methodology.</p>
    </sec>
    <sec id="sec-2">
      <title>2. OC-Pattern Extraction</title>
      <p>
        Research Problem. Process discovery techniques allow one to automatically extract a process
model from a given event log tracking past process executions. A well-known limitation of
these techniques is that many real-world processes are characterised by complex and variable
behaviours, that lead to the extraction of overly complex models which ofer little or no support
to the analysis. A possible strategy to mitigate this issue consists in inferring interesting process
behaviours, rather than start-to-end process model. A process pattern, in classical PM, is a
sequence of activities that repeats over multiple cases. Traditional process pattern discovery
techniques return process patterns only focused on ordering relations existing between activities
occurring in the same process execution. In this way, potential interactions with process objects
and with activities occurring in another process instance cannot be captured. Few papers
leverage object-centric domain in pattern extraction, e.g. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], with certain limitations since
authors considered flattened event-logs, leading to information loss, and focus on the frequency
to measure the importance of a pattern, which often leads to generating a multitude of
notinteresting patterns.
      </p>
      <p>
        Research Goal. We aim at developing novel pattern mining techniques to extract OC process
patterns. Following recent approaches for process pattern extraction [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], we aim at extracting
patterns that are predictive of process outcome rather than focusing solely on frequent patterns.
To this end, we plan to extend previous pattern extraction approaches leveraging graph mining
and data abstraction techniques to: a) extend the set of pattern relations considered during the
mining to include object-dependent relations; b) develop tailored analysis techniques and tools
to allow human users to navigate the set of mined patterns, exploring both intrinsic pattern
characteristics and possible relations with other patterns.
      </p>
      <p>
        Test and Validation. The proposed procedures will be validated using optimisation methods
to find particular patterns, e.g. with the highest correlation with the outcome. These techniques
will be tested on publicly available datasets after any necessary data pre-processing. Another
validation can be performed by considering a dataset of Italian public tenders: after exploring
the relational tables and extracting an OCEL from them [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], patterns can be extracted using
the investigated techniques.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. OC-Conformance Checking</title>
      <p>
        Research problem. Identifying anomalies present in an event-log requires a process model
and a case notion to compare with it. Classic conformance-checking techniques generate
diagnostics which consider process executions in isolation, neglecting possible OC relations
and thus potentially missing some deviant behaviours. Adapting conformance checking to the
OC paradigm would allow us to overcome this limitation, enabling more accurate diagnostics.
Some previous work (e.g., [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) proposed techniques to calculate conformance metrics
in an OC domain. However, authors worked with "flattened" event-logs.
      </p>
      <p>
        Research Goal. We aim at developing novel OC conformance checking techniques able to
handle the graph-structure of OCEL. In this setting, comparing an individual case with the
model entails comparing a graph extracted from OCEL with a graph representing admissible
behaviors within the considered model. We plan to investigate and leverage previous work
introducing OC case notions [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and OC process models (e.g., the OC-PN model [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], or the
OC-DFG model [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]) to develop techniques able to find a particular morphism to map a graph
onto another graph, to identify conforming behaviours and anomalies in OCEL. We also aim
at investigating the use of heuristics to mitigate possible computational challenges usually
associated with graph analysis techniques.
      </p>
      <p>Test and Validation. To validate the results, we first plan to use a synthetic dataset to test
the capability of the proposed method to detect injected anomalies. We then plan to explore
publicly available datasets.</p>
    </sec>
    <sec id="sec-4">
      <title>4. OC-Predictive Process Monitoring</title>
      <p>
        Research problem. Predictive process monitoring techniques aim at predicting the unfolding
of an ongoing process execution to determine, e.g. the next activity to be executed or the
total duration of the execution. Previous studies have pointed out the potential benefits of
leveraging the OC paradigm for prediction tasks, showing that taking OC relations into account
often results in improved prediction performance. As an example, [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] presented a predictive
approach using gradient boosting, while [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] proposed another predictive approach using LSTM
and neural networks. Despite considering a multi-dimensional event log, both techniques
operate with an enriched flattened event log and a multiset of sequences. This approach can
lead to a loss of information stemming from the relationships between objects.
      </p>
      <p>Research Goal. We aim at developing novel predictive process monitoring techniques to
leverage the graph structure of OC event logs for prediction. To this end, we intend to investigate
the use of graph neural networks (GNN), given their ability to deal with graph structures natively.
Furthermore, we intend to investigate how to represent additional relations among process
activities (e.g., relations between event data, or ordering relations diferent from the directly
following ones) and to develop tailored XAI techniques to analyse the impact of the diferent
relations on the predicted outcome.</p>
      <p>Test and Validation. We plan to test and validate the developed techniques on publicly available
datasets using traditional classification performance metrics, such as AUC and f1.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          , Process Mining - Data science in action,
          <source>Second Edition</source>
          , Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Object-centric process mining: Dealing with divergence and convergence in event data</article-title>
          ,
          <source>in: Software Engineering and Formal Methods: 17th International Conference, SEFM 2019</source>
          , Oslo, Norway,
          <source>September 18-20</source>
          ,
          <year>2019</year>
          , Proceedings 17, Springer,
          <year>2019</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Ghahfarokhi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Berti</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Ocel: A standard for objectcentric event logs</article-title>
          ,
          <source>in: European Conference on Advances in Databases and Information Systems</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>175</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Berti</surname>
          </string-name>
          ,
          <article-title>Discovering object-centric petri nets</article-title>
          ,
          <source>Fundamenta informaticae 175</source>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Berti</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Oc-pm: analyzing object-centric event logs and process models</article-title>
          ,
          <source>International Journal on Software Tools for Technology Transfer</source>
          <volume>25</volume>
          (
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Adams</surname>
          </string-name>
          , G. Park,
          <string-name>
            <given-names>S.</given-names>
            <surname>Levich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schuster</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>A framework for extracting and encoding features from object-centric event data</article-title>
          ,
          <source>in: International Conference on Service-Oriented Computing</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>36</fpage>
          -
          <lpage>53</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Adams</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          , Opera:
          <article-title>Object-centric performance analysis</article-title>
          ,
          <source>in: International Conference on Conceptual Modeling</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>281</fpage>
          -
          <lpage>292</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Monitoring constraints in business processes using object-centric constraint graphs</article-title>
          ,
          <source>in: International Conference on Process Mining</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>479</fpage>
          -
          <lpage>492</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>E. R. Faria</given-names>
            <surname>Junior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. R.</given-names>
            <surname>Neubauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fantinato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Peres</surname>
          </string-name>
          ,
          <article-title>Clustering analysis and frequent pattern mining for process profile analysis: An exploratory study for object-centric event logs</article-title>
          ,
          <source>in: International Conference on Process Mining</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>269</fpage>
          -
          <lpage>281</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Adams</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Precision and fitness in object-centric process mining</article-title>
          ,
          <source>in: 2021 3rd International Conference on Process Mining (ICPM)</source>
          , IEEE,
          <year>2021</year>
          , pp.
          <fpage>128</fpage>
          -
          <lpage>135</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Galanti</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. De Leoni</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Navarin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Marazzi</surname>
          </string-name>
          ,
          <article-title>Object-centric process predictive analytics</article-title>
          ,
          <source>Expert Systems with Applications</source>
          <volume>213</volume>
          (
          <year>2023</year>
          )
          <fpage>119173</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>W.</given-names>
            <surname>Gherissi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. El</given-names>
            <surname>Haddad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Grigori</surname>
          </string-name>
          ,
          <article-title>Object-centric predictive process monitoring</article-title>
          ,
          <source>in: International Conference on Service-Oriented Computing</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>27</fpage>
          -
          <lpage>39</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>V.</given-names>
            <surname>Mozhgan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Genga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verhoeven</surname>
          </string-name>
          ,
          <string-name>
            <surname>H. van Laarhoven</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dijkman</surname>
          </string-name>
          ,
          <article-title>Interactive multi-interest process pattern discovery (</article-title>
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Berti</surname>
          </string-name>
          , G. Park,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rafiei</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>A generic approach to extract object-centric event data from relational databases (</article-title>
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Adams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schuster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schmitz</surname>
          </string-name>
          , G. Schuh,
          <string-name>
            <surname>W. M. P. Van Der Aalst</surname>
          </string-name>
          ,
          <article-title>Defining cases and variants for object-centric event data</article-title>
          ,
          <source>in: 2022 4th International Conference on Process Mining (ICPM)</source>
          , IEEE,
          <year>2022</year>
          , pp.
          <fpage>128</fpage>
          -
          <lpage>135</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>