<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Online Conformance Checking to Support Human Behavior Study (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gemma Di Federico</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Applied Mathematics and Computer Science Technical University of Denmark Kgs. Lyngby</institution>
          ,
          <country country="DK">Denmark</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <abstract>
        <p>-Human behavior could be represented in form of a process model. Internet of Things systems allow the collection of the behavior in the reality, and the data are then processed by Process Mining algorithms. To achieve this integration, several challenges must be faced. The objective of this PhD project is to adapt online conformance checking to be able to deal with human-related processes.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. PROBLEM CONTEXT AND MOTIVATION</title>
      <p>
        The project aims to contribute into the integration between
Internet of Things (IoT) and Business Process Management,
combining the potentiality of sensor data toward the
implementation of dynamic and enriched models with the objective
to support conformance checking techniques. In particular,
the project focuses on the modeling of human behavior [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
by means of process mining techniques, obtaining data from
IoT devices. There is a desire to dwell on aspects relating to
the use of conformance checking techniques in the context of
behavioral study. Observing human behavior for a long time,
can lead to the collection of various information that can be
used in a plenty of application fields [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The use case of
my PhD project is the dementia disease, under the healthcare
field. Patients effected by dementia usually fit into a very tight
routine. The repetition of routines helps people to control their
world in a predictable manner, adding a sense of order to the
days. Repetition of activities is a comfort zone. Establishing
a routine contributes to a sense of security and peace, which
helps to reduce agitation and troublesome behavior [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In the
middle to later stages of most types of dementia, a person may
start to behave differently. Given that, the breaking of a routine
could be symptom of a worsening of the disease [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. The idea
is to observe the behavior of people living with dementia to
identify significant variations in the behavior. To do so, an
habits model is derived for each patient and it is continuously
compared with the actual behavior to find discrepancies. IoT
systems are used to capture the behavior in the reality, while
online conformance checking techniques to compare the actual
behavior with the modeled (and expected) one.
      </p>
      <p>The work in this PhD project also aims to be generalized,
so that it can be reused in other contexts. My plan is to
extend Process Mining techniques to be applicable in IoT,
with the aim of contribute into verifying the compliance of
the actual IoT system execution with the expected process
schema. In order to succeed in such goal, in the following
section I elaborate the research questions.</p>
    </sec>
    <sec id="sec-2">
      <title>II. RESEARCH QUESTIONS</title>
      <p>The overall process could be divided into three main topics
which are the data collection, the process modeling and the
mining. Firstly, there is a simulated environment, equipped
with IoT sensors, that captures the behavior of a patient in
form of sensors measurements. Secondly, data collected must
be organized in an event log that is suitable for discovery
algorithms which are used to derive a reference model. This
model represents the typical behavior of the patient. The
last is the conformance, in which the behavior in the reality
is compared with the reference models. The derivation and
replication of processes is what process mining is intended to
do. Complications emerge as the nature of human processes
differs from the structured one that are typically dealt with.</p>
      <p>
        RQ1 - What is the most appropriate way to abstract
activities from a stream of sensor measurements?
Are semi-supervised approaches the proper manner
to organize sensor data into an event log?
Data produced by IoT systems are unstructured and of a
different level of granularity compared to the event log
expected by Process Mining algorithms. There is a need to
bring sensors measurements and model to the same level
of abstraction. Actually, there is not a standard technique
to organize sensors data into a structured event log. In the
literature there are several approaches of event abstraction[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ],
both supervised or unsupervised. The unsupervised techniques
group fine-granular events on the basis of strong re-occurrence
of patterns [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In the supervised event abstraction techniques,
authors propose a set of dimensions based on their
understanding of the field and classifying them accordingly [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. After a
first investigation of the available techniques, we would like
to focus in the between, i.e., on semi-supervised approaches.
A fully supervised abstraction could be influenced by human
errors, since it is an expert who must identify every single
activity starting from a series of proposed configurations. On
the other hand, an unsupervised technique does not allow
to verify the way in which the labeling was carried out, or
how the sequence of actions for each activity was identified.
Therefore, a semi-supervised approach could provide both
the right level of automation and allow us to oversee and
contribute with knowledge to the activity recognition. Once
the data has been structured in an event log, we need to focus
on the process modeling language that is able to represent
human-related processes.
      </p>
    </sec>
    <sec id="sec-3">
      <title>RQ2 - Are current modeling languages able to</title>
      <p>
        represent human behavior and their context? It may
be valid to consider stochastic models?
Human related process differs from business one. Human
behavior evolves over the time [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and it is influenced by
the participant and by the context. The process is completely
guided by the participants, who decide how and when to
perform activities [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The concept means that there are a series
of variables, external or internal to the process execution,
which influence the process. These variables could also change
over the time [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. For example, the factors that influence
the behavior could be the seasonality, the weather, the day of
the week, but also a dependency between executed activities.
Based on each of them, the behavior adapts [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Often the
cause of change is hidden, not known a priori, making the
learning task more complicated. Hidden contexts may be
expected to recur, generating cycling phenomena. In this domain,
is a key element to keep track of these changes in order to
use them to adapt models. Another important fact to consider
is the non-determinism of human behaviors. Human behavior
is usually characterized by the repetition of activities [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
and, each time an activity is executed it follows a different
execution pattern. In addition, human beings could perform
multiple actions at the same time, mixing the execution of
activities, i.e. concurrency. Since the objective of the use case
is to identify discrepancies in the behaviors, also the frequency
concept must be taken into account [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Human behavior is
governed by uncertainty and variability, therefore it is not
possible to establish ex ante which is a frequent or a not
frequent behavior. As a starting point I identified a list of
requirements that the model must fulfill [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], hence now the
objective is to position the work toward one direction. Once
the modeling language has been identified (or extended) I can
focus on the process conformance.
      </p>
    </sec>
    <sec id="sec-4">
      <title>RQ3 - Are the actual mining algorithms able to deal</title>
      <p>
        with unstructured scenario? Which one is the most
appropriate? Is there the need of extending these
algorithms?
In conformance checking, the behavior of a process model and
the behavior recorded in an event log are compared to find
commonalities and discrepancies. This model is the reference
model, on which all the replay work is based, so it is very
important that it is accurate and representative of the behavior
recorded in the log. The discovery of a process related to
human behavior, unstructured by nature, is an aspect not to
be underestimated. Considering the above, there is a close
dependence between mining algorithms and modeling
languages. Most of the current process mining algorithms, assume
the derivation of Petri nets and consequently conformance
checking techniques have been developed along this line. But
there is no certainty that a Petri net has enough expressiveness
to represent these processes. For this reason, the conformance
algorithm must be able to deal with the process modeling
language selected, perhaps, also in this case, the algorithm
should be adapted to respond to the process requirements. To
conclude, given the large amount of data produced by IoT
systems, it is desirable that the conformance is carried out
online [
        <xref ref-type="bibr" rid="ref16 ref2">2, 16</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Banovic</surname>
          </string-name>
          et al. “
          <article-title>Modeling and understanding human routine behavior”</article-title>
          .
          <source>In: Proc. of CHI</source>
          .
          <year>2016</year>
          , pp.
          <fpage>248</fpage>
          -
          <lpage>260</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Burattin</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Carmona</surname>
          </string-name>
          .
          <article-title>“A framework for online conformance checking”</article-title>
          .
          <source>In: BPM</source>
          . Springer.
          <year>2017</year>
          , pp.
          <fpage>165</fpage>
          -
          <lpage>177</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Di Ciccio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marrella</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Russo</surname>
          </string-name>
          . “
          <article-title>Knowledgeintensive processes: characteristics, requirements and analysis of contemporary approaches”</article-title>
          .
          <source>In: J. Data Semant. 4</source>
          .
          <issue>1</issue>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Di Federico</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Burattin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Montali</surname>
          </string-name>
          . “
          <article-title>Human Behavior as a Process Model: Which Language to Use?” In: ITBPM (</article-title>
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Feldman</surname>
          </string-name>
          and
          <string-name>
            <given-names>B. T.</given-names>
            <surname>Pentland</surname>
          </string-name>
          . “
          <article-title>Reconceptualizing organizational routines as a source of flexibility and change”</article-title>
          .
          <source>In: Adm. Sci. Q</source>
          .
          <volume>48</volume>
          .1 (
          <issue>2003</issue>
          ), pp.
          <fpage>94</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H. C.</given-names>
            <surname>Kales</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Gitlin</surname>
          </string-name>
          , and
          <string-name>
            <surname>C. G. Lyketsos.</surname>
          </string-name>
          “
          <article-title>Assessment and management of behavioral and psychological symptoms of dementia”</article-title>
          .
          <source>In: BMJ</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>M. de Leoni</surname>
          </string-name>
          and S. Du¨ndar. “
          <article-title>Event-log abstraction using batch session identification and clustering”</article-title>
          .
          <source>In: Proc. SAC20</source>
          .
          <year>2020</year>
          , pp.
          <fpage>36</fpage>
          -
          <lpage>44</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Leotta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mecella</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          . “
          <article-title>Applying process mining to smart spaces: Perspectives and research challenges”</article-title>
          .
          <source>In: Lect. Notes Bus. Inf. Process</source>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Leotta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mecella</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Sora</surname>
          </string-name>
          . “
          <article-title>Visual process maps: A visualization tool for discovering habits in smart homes”</article-title>
          .
          <source>In: J. Ambient Intell. Humaniz</source>
          . (
          <year>2019</year>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mandal</surname>
          </string-name>
          et al. “
          <article-title>A classification framework for IoT scenarios”</article-title>
          .
          <source>In: BPM</source>
          . Springer.
          <year>2018</year>
          , pp.
          <fpage>458</fpage>
          -
          <lpage>469</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rebuge</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Ferreira</surname>
          </string-name>
          . “
          <article-title>Business process analysis in healthcare environments: A methodology based on process mining”</article-title>
          .
          <source>In: Inf. Syst</source>
          . (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>K.</surname>
          </string-name>
          <article-title>SantaCruz</article-title>
          and Jr. Daniel L. Swagerty.
          <source>Early Diagnosis of Dementia. Tech. rep</source>
          .
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Stefanini</surname>
          </string-name>
          et al. “
          <article-title>A process mining methodology for modeling unstructured processes”</article-title>
          .
          <source>In: Knowl. Process. Manag. 27.4</source>
          (
          <issue>2020</issue>
          ), pp.
          <fpage>294</fpage>
          -
          <lpage>310</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Tsymbal</surname>
          </string-name>
          .
          <article-title>The problem of concept drift: definitions and related work</article-title>
          .
          <source>Tech. rep</source>
          .
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>S. J. van Zelst</surname>
          </string-name>
          et al. “
          <article-title>Event abstraction in process mining: literature review and taxonomy”</article-title>
          . In: Granul. (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>S. J. van Zelst</surname>
          </string-name>
          et al. “
          <article-title>Online conformance checking: relating event streams to process models using prefixalignments”</article-title>
          .
          <source>In: Int. J. Data Sci. Anal</source>
          . (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>