<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards the Discovery of Ob ject-Aware Processes</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Databases and Information Systems, Ulm University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>There has been a huge body of research in order to reduce manual efforts in creating executable process models through the automated discovery of process models from the event logs created by information systems. Regarding activity-centric processes, such event logs comprise case ids and events related to the execution of process activities. However, there exist alternative process management paradigms, such as object-aware processes, for which existing algorithms fail to discover a sound model. These algorithms do not treat data as first-class citizens, but solely rely on the information from event logs. In consequence, existing discovery algorithms are insufficient for discovering object-aware processes. To address this issue, discovery algorithms need to consider additional data sources (e.g., existing forms). This paper discusses the need for dedicated discovery techniques in object-aware processes.</p>
      </abstract>
      <kwd-group>
        <kwd>object-aware processes</kwd>
        <kwd>process mining</kwd>
        <kwd>process discovery</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Despite the many mining approaches that exist for activity-centric processes,
adequate support for discovering data-centric process models, e.g., in the context
of artifact-centric processes [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], case handling [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], or object-aware processes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ],
is still lacking. While an activity-centric process model consists of a sequence of
activities that need to be executed in a defined order, a data-driven and -centric
process allows for greater flexibility through the use of declarative process rules
and generated forms [
        <xref ref-type="bibr" rid="ref15 ref6">15,6</xref>
        ]. Current process discovery algorithms are able to
discover the schema of an activity-centric process from an event log, whereas
information about the internal logic of the activities (e.g., user forms or data
required for an activity) is often neglected. As data is treated as first-class
citizen in data-centric (e.g., object-aware) process management, the discovery of
corresponding models should consider this issue as well.
      </p>
      <p>
        To understand the nature of the problem at hand, a short introduction into
data-centric and object-aware process management becomes necessary.
PHILharmonicFlows, our approach to data-centric processes, introduces the concepts of
objects, object behavior, and object interaction. For each business object present
in a real-world business process, one such object exists. The latter comprises
data, represented by attributes, and a state-based process model describing the
object behavior in terms of an object lifecycle model. When data becomes
available during runtime, this enables transitions between the various states of the
lifecycle process, i.e., execution is data-driven. In the e-learning system
PHoodle, a practical application of the PHILharmonicFlows system [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], examples of
business objects include Submission, Exercise, and Lecture (see Fig. 1 for the
respective data model). In turn, when the values of certain attributes, such as
Points or Feedback, become available at runtime, this enables the transition
between the states of a lifecycle process (see Fig. 2). Finally, the interactions
between object lifecycles are managed by coordination processes [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>Lecture 1:n Download
1
:
n
1
:
n</p>
    </sec>
    <sec id="sec-2">
      <title>Exercise :1n Tutorial</title>
      <p>1
:
n
1
:
n</p>
    </sec>
    <sec id="sec-3">
      <title>Submission</title>
    </sec>
    <sec id="sec-4">
      <title>Attendance</title>
      <p>Exercise #1</p>
      <sec id="sec-4-1">
        <title>DescriptionEditExFeirlecsise PSuubbmlisishieodn PaDsDtautDeeue</title>
        <sec id="sec-4-1-1">
          <title>ALtifreiAbcysuctilegesnDmesecnrtip:tSiuopne:rSvtirsionrg ExerciseFilesA:sFiilgenment:SSutbumdeisntion:File ADsueignDmateen:tD:aStuepervisor</title>
          <p>Exercise #2</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>DescriptionEditExFeirlecsise PSuubbmlisishieodn PaDsDtautDeeue</title>
        <sec id="sec-4-2-1">
          <title>ALtifreiAbcysuctilegesnDmesecnrtip:tSiuonpe:rSvtirsionrg ExerciseFilesA:sFiilgenment:SSutbumdeisntion:File ADsueignDmateen:tD:aStuepervisor</title>
          <p>
            AELtifxreiecbyructcleeAissseiEgExnedmrcieitnset::SFSttiurlidenegsnt SuFilebPs:omFiilnetissSAssuiigbonmmPnoeiiFnFntFetet#:ste:TeddIeub1bndtaatdoeccrkgbke==arcftkarlusee Fe dback:PBFoaaoslilseedd
AELtifxreiecbyructcleeAissseiEgExnedmrcieitnset::SFSttiurlidenegsnt SuFilebPs:omFiilnetissSAssuiigbonmmPnoeiiFnFntFetet#:ste:TeddIeub2bndtaatdoeccrkgbke==arcftkarlusee Fe dback:PBFoaaoslilseedd
AELtifxreiecbyructcleeAissseiEgExnedmrcieitnset::SFSttiurlidenegsnt SuFilebPs:omFiilnetissSAssuiigbonmmPnoeiiFnFntFetet#:ste:TeddIeub3bndtaatdoeccrkgbke==arcftkarlusee Fe dback:PBFoaaoslilseedd
Process discovery summarizes techniques that leverage information from event
logs to discover process models [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ]. For activity-centric processes, there exist
a variety of approaches (see [
            <xref ref-type="bibr" rid="ref1 ref17">1,17</xref>
            ] for an overview). Various algorithms use
event logs as input to discover an activity-centric process model. Regarding
datacentric processes [
            <xref ref-type="bibr" rid="ref16">16</xref>
            ], however, there only exist few approaches for discovering
process models. [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] describes an approach for discovering artifacts and their
lifecycles from structured datasets as opposed to lifecycle-enabled objects in our
approach. In turn, [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ] deals with methods for discovery of artifacts and the
interactions between them; additionally, an evaluation based on real-life datasets
from ERP systems is provided. In turn, [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ] decomposes the problem of artifact
lifecycle discovery such that existing process mining algorithms can be applied.
The construction of data and object models from different data structures (e.g.,
databases, legacy systems) has been investigated in reverse engineering [
            <xref ref-type="bibr" rid="ref10 ref4">4,10</xref>
            ].
While database reverse engineering reconstructs logical or conceptual models,
other aspects of data-driven process management are neglected (e.g., lifecycles
or the interactions between object lifecycles). An approach to automatically
generate event logs from databases is described in [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ]. Since data is treated as a
first-class citizen in object-oriented process management, additional information
(i.e., data sources) need to be considered to discover an object-aware process.
          </p>
          <p>Towards the Discovery of Object-Aware Processes</p>
          <p>
            Research direction
In our PHILharmonicFlows framework, an object-aware process consists of a data
model, one lifecycle model for each object, and a coordination process enforcing
constraints regarding object interactions [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ]. In order to discover an executable
object-aware process, all three aspects need to be considered. For the discovery
of various aspects of object-aware processes (e.g., relations between objects or
states of a lifecycle), solely considering event logs is not sufficient and, hence,
additional data sources need to be taken into account. For example, the data model
underlying an object-aware process provides the foundation for both lifecycles
and object coordination [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ].
          </p>
          <p>The first step during process discovery is to identify objects, including
their attributes and relations. Note that the structure of a normalized relational
database, to a certain degree, is comparable to a data model, which offers the
opportunity to discover the data model from the structure (i.e., the create table
statements) of a database. Each table in the database may, but does not have to
correspond to an object in the data model, whereas columns of a table represent
the attributes of an object. One-to-many relations between tables can be used
to identify relations between the objects of a data model. Additionally, relations
can be used as an indicator if a table corresponds to a correct object.</p>
          <p>After discovering the data model, the object lifecycles need to be discovered
in the second step. Based on the attributes from the data model, lifecycle
discovery shall deliver object states as well as the transitions between them. In
general, a lifecycle process may enter another state, if all necessary data (i.e.,
attribute values) are available. In particular, lifecycle states cannot be discovered
from event logs, whose entries solely refer to activities due to the mismatch
between states (i.e., defined by attributes) and cases (i.e., a collection of activities).
To tackle this mismatch, discovery algorithms for object lifecycles, suitable event
log preprocessing (e.g., splitting an event log into event logs for each object),
and additional data sources (e.g., forms of existing information systems) need
to be considered as well during the discovery process.</p>
          <p>The third step in discovering an object-aware process is to unravel the
coordination of interactions between objects (e.g., a submission may only be
created if the corresponding exercise is in state published). As object interaction
can only be discovered with the data model and the lifecycles present, their
discovery is a secondary problem for now.
4</p>
          <p>Conclusion
This paper discusses the need for spending research efforts on the discovery of
object-aware process models. As major advantage, the discovery of an
objectaware processes allows to identify the underlying logic of a process. Finally, due
to the strong linkage between process and data in object-aware processes, it is
possible that not every aspect of each element (i.e., data model, lifecycles, and
coordination) may be discovered from the presented data sources and, therefore
further research is of utmost importance.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          : Process Mining: Data Science in Action. Springer,
          <volume>2</volume>
          <fpage>edn</fpage>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grünbauer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Case handling: a new paradigm for business process support</article-title>
          .
          <source>DKE</source>
          <volume>53</volume>
          (
          <issue>2</issue>
          ),
          <fpage>129</fpage>
          -
          <lpage>162</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W.M.P.</given-names>
          </string-name>
          , et al.:
          <article-title>Process mining manifesto</article-title>
          .
          <source>In: Int'l Conf on BPM'11</source>
          . pp.
          <fpage>169</fpage>
          -
          <lpage>194</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Alhajj</surname>
          </string-name>
          , R.:
          <article-title>Extracting the extended entity-relationship model from a legacy relational database</article-title>
          .
          <source>Information Systems</source>
          <volume>28</volume>
          (
          <issue>6</issue>
          ),
          <fpage>597</fpage>
          -
          <lpage>618</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinau</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Engineering a highly scalable objectaware process management engine using distributed microservices</article-title>
          .
          <source>In: Int'l Conf on CoopIS'18</source>
          . pp.
          <fpage>80</fpage>
          -
          <lpage>97</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinau</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Enabling runtime flexibility in data-centric and data-driven process execution engines</article-title>
          .
          <source>Information Systems</source>
          p.
          <volume>101447</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cohn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hull</surname>
          </string-name>
          , R.:
          <article-title>Business artifacts: A data-centric approach to modeling business operations and processes</article-title>
          .
          <source>IEEE Data Eng. Bull</source>
          .
          <volume>32</volume>
          (
          <issue>3</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>9</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Künzle</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>PHILharmonicFlows: towards a framework for objectaware process management</article-title>
          .
          <source>J of Soft Maint &amp; Evo</source>
          <volume>23</volume>
          (
          <issue>4</issue>
          ),
          <fpage>205</fpage>
          -
          <lpage>244</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagelkerke</surname>
            , M., van de Wiel,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fahland</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Discovering interacting artifacts from ERP systems</article-title>
          .
          <source>IEEE Trans Serv Com</source>
          <volume>8</volume>
          (
          <issue>6</issue>
          ),
          <fpage>861</fpage>
          -
          <lpage>873</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mfourga</surname>
          </string-name>
          , N.:
          <article-title>Extracting entity-relationship schemas from relational databases: a form-driven approach</article-title>
          .
          <source>In: WCRE'97</source>
          . pp.
          <fpage>184</fpage>
          -
          <lpage>193</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>de Murillas</surname>
          </string-name>
          , et al.:
          <article-title>Case notion discovery and recommendation: automated event log building on databases</article-title>
          . Know &amp; Inf
          <string-name>
            <surname>Sys</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Nooijen</surname>
            ,
            <given-names>E.H.J.</given-names>
          </string-name>
          , van Dongen,
          <string-name>
            <given-names>B.F.</given-names>
            ,
            <surname>Fahland</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>Automatic discovery of datacentric and artifact-centric processes</article-title>
          .
          <source>In: Int'l Conf on BPM'12</source>
          . pp.
          <fpage>316</fpage>
          -
          <lpage>327</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Popova</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fahland</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Artifact lifecycle discovery</article-title>
          .
          <source>Int'l J of Coop Inf Sys</source>
          <volume>24</volume>
          (
          <issue>01</issue>
          ),
          <volume>1550001</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Steinau</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Modeling process interactions with coordination processes</article-title>
          .
          <source>In: CoopIS'18</source>
          . pp.
          <fpage>21</fpage>
          -
          <lpage>39</lpage>
          . LNCS, Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Steinau</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Executing lifecycle processes in objectaware process management</article-title>
          .
          <source>In: Data-Driven Process Discovery and Analysis</source>
          . pp.
          <fpage>25</fpage>
          -
          <lpage>44</lpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Steinau</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marrella</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leotta</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mecella</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>DALEC: A framework for the systematic evaluation of data-centric approaches to process management software</article-title>
          .
          <source>Softw &amp; Sys Modeling</source>
          <volume>18</volume>
          (
          <issue>4</issue>
          ),
          <fpage>2679</fpage>
          -
          <lpage>2716</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Weerdt</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Backer</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vanthienen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baesens</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs</article-title>
          .
          <source>Inf Sys</source>
          <volume>37</volume>
          (
          <issue>7</issue>
          ),
          <fpage>654</fpage>
          -
          <lpage>676</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>