=Paper=
{{Paper
|id=Vol-2973/paper_281
|storemode=property
|title=VPM: Analyzing Human Daily Habits through Process Discovery
|pdfUrl=https://ceur-ws.org/Vol-2973/paper_281.pdf
|volume=Vol-2973
|authors=Francesco Leotta,Silvestro Veneruso
|dblpUrl=https://dblp.org/rec/conf/bpm/LeottaV21
}}
==VPM: Analyzing Human Daily Habits through Process Discovery==
VPM: Analyzing Human Daily Habits through Process Discovery Francesco Leottaa , Silvestro V. Venerusoa a Dipartimento di Ingegneria Informatica Automatica e Gestionale “A. Ruberti”, Sapienza Università di Roma, via Ariosto 25, Roma, Italy Abstract Models usually employed for Ambient Intelligence (AmI) in smart homes are usually obtained directly from sensor logs composed by timestamped sequences of sensor measurements. Such approaches, still effective at different tasks, have the drawback of producing representations difficult to read and validate. In this paper we propose a tool, called Visual Process Maps (VPM), intended to allow the analysis of human routines at the human action level thanks to log preprocessing and the application of process discovery. 1. Introduction The term smart space refers to an environment enriched with a set of devices (e.g., sensors and actuators) which aims at providing intelligent services to the human user, realizing the paradigm known as ambient intelligence (AmI) [1]. In order to provide people with these automatic or semi-automatic utilities, an AmI system acquires data from the environment in the form of a sensor log, i.e., a sequence of measurement values acquired from sensors. The literature in the area proposes several solutions to analyze sensor logs and to automatically perform actions on the basis of the current context, user preferences and habits. All of these approaches are based on models which describe the relationship between (i) the behavior of a smart space’s inhabitant(s), (ii) the specific contextual state of the environment, and (iii) a portion of sensor measurements from the log. Most of these techniques anyway, and especially those based on machine learning techniques (the so called learning-based methods), only rely on raw sensor measurements, whereas human actions are usually neglected, making it difficult to visually inspect daily routines and habits [2]. Process Mining [3] combines data mining methods with techniques from the Business Process Management (BPM) area [4]. In particular, it aims at extracting meaningful information from event logs, i.e., sequences of readable human actions performed by user(s). Therefore, applying techniques from the BPM area can be a good compromise to overcome the gap between raw sensor measurements and human actions. Due to the BPM’s input limitation, a log-preprocessing step is required to infer human actions from the sensor log. Proceedings of the Demonstration & Resources Track, Best BPM Dissertation Award, and Doctoral Consortium at BPM 2021 co-located with the 19th International Conference on Business Process Management, BPM 2021, Rome, Italy, September 6-10, 2021 Envelope-Open leotta@diag.uniroma1.it (F. Leotta); veneruso@diag.uniroma1.it (S. V. Veneruso) © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) In this paper we propose the Visual Process Maps (VPM) system, consisting of a complete pipeline formed by (i) a tool for the visual analysis of sensor logs, (ii) a method to transform raw movement measurements into actions, and (iii) a method to identify and visually analyze prece- dence relationships between human actions. The tool and related resource can be downloaded from https://www.diag.uniroma1.it/leotta/demos/vpm_bpm2021.html. 2. Relationship with the State of the Art VPM is a tool that allows to graphically represent log activations and then to extract habit models out of them. The intended user of such a tool is a domain expert, who analyses the activities in the environment (e.g., an house) and takes decisions about the design of the space itself. The literature about representing models of human habits is wide. In this section, we will highlight the differences between our work and those representation tools available in literature, which produce a human readable output that is easy to validate (once the representation is known). In [5], the Context Modeling Language (CML) is presented. CML represents an initial approach to provide a graphical representation to a formalism for situations and contexts. Derived from software and databases design techniques, it introduces constructs to model typical smart spaces issues, as missing and/or conflicting information, events dependencies and constraints. A survey about context modeling and reasoning techniques over contexts and situations is provided in [6]. However, these works on context-awareness are focused on the representational and reasoning issues; conversely they do not address the issue of learning the model from the sensor logs. This is indeed the main contribution of our work, in which we revert to visual process maps as representation for human habits. Recently, the BPM research community has applied process mining techniques to smart spaces [4]. In [7], authors propose both supervised and unsupervised techniques to fill the gap between raw sensor measurements and human readable models (as discussed in Section 1), to then apply an inductive miner and obtain a Petri-net, i.e., a graphical representation of an obtained model characterized by nodes and arcs. These first attempts to apply techniques taken from the business process management were focused on workflow specifications to anticipate user actions. A workflow is composed by a set of tasks related by qualitative and/or quantitative time relationships. In [8, 9], authors implemented a system called “Sequential Patterns of User Behavior System” (SPUBS) to automatically retrieve these workflows from sensor data. This category of workflow-based modeling techniques (like SPUBS) can, in the case of highly variable processes, produce unreadable models; furthermore this approach requires the sensor log to be already segmented before mining. With VPM, indeed, we decided to apply a specific technique named fuzzy mining that extracts models, which are more suitable than workflows employed in SPUBS, to describe human habits that are flexible by nature. As an advantage, this representation naturally supports visual analytics. Finally, we want to mention Sitivus, a tool adopting visual analytics in smart spaces [10]. In Expert Processing Pipeline Visual Analysis Smart Environment Sensor Log Trajectory Visualization Tool Habit Extraction Extracted Models Figure 1: The proposed approach. their work, they remark the importance of providing domain experts with a tool allowing to graphically inspect models of smart spaces. However, with respect to our work, sitivus is focused on modeling and representing situations more than habits. A situation is an interpretation of the context at a higher level and can comprehend the state of multiple users and consider the activities that are executed by each of them (e.g., “users are having lunch”), while a habit is focused on the routine of the user on specific time intervals (e.g., “what the user usually does after work?”). 3. Features Walkthrough Figure 1 shows the data processing pipeline implemented through the tool. The sensor log generated by the smart home is converted to an event log (containing human actions instead of raw sensor measurements) by running a modified version of the TRACLUS algorithm (see Section 3.1). At this point, human habit extraction is performed thanks to fuzzy mining [11]. The proposed technique has been applied to the Aruba dataset, part of the CASAS project1 freely available datasets [12]. The Aruba dataset contains sensor data that were collected for two years in the home of a volunteer adult. 3.1. Sensor Log conversion and Trajectory Visualization Tool As discussed in Section 1, in order to apply process mining techniques, the sensor log 𝒮 must be translated into a suitable event log ℰ. In this work, we adapted a portion of the TRACLUS algorithm by [13] to perform this task. It considers the entire sensor data log as a single complex trajectory in which it can identify several sub-trajectories, each one describing a portion (a sub-log) of the original sensor log, which are related to the same action. This segmentation is based on finding the so called “characteristic points”: points in which the trajectory’s behaviour changes rapidly. Finally, it classifies each sub-trajectory by considering information about its movement and also which are the sensors involved and for how long, and labels them with an action (MOVEMENT, AREA or STAY), and their relative location inside the smart environment (e.g.,). Now the log can be visually analysed by a human domain expert, who could be interested in replaying the log (or a portion of it) by employing the trajectory visualization tool (see Figure 2). Furthermore, it can now be processed through process mining techniques. 1 http://casas.wsu.edu/datasets/ Figure 2: The trajectory visualization tool. 3.2. Habit Extraction The event log, obtained from the previous step (Section 3.1), is fed into the fuzzy miner. The fuzzy miner produces a directed graph where each node represents a task (or in our case a human action) and each arc represents a precedence constraint between an action and its successor. Nodes and arcs are weighted, respectively with the number of times an action appears in the log and the number of times that an action follows another in the log. These information are used to extract frequent sequences of human actions (i.e., paths inside the graph), by using an entire day as time reference. Finally, we merge all these sequences into single activity models. As this approach was evaluated on the partially labeled Aruba dataset, we had at our disposal a ground truth consisting of sequences labeled with the name of the activity (e.g., Relax, Eating, Sleeping, and others). All frequent sequences extracted were compared with these labeled “ground truth” sequences, by computing the Jaccard coefficient: each sequence is then assigned to the activity label of the ground truth sequence obtaining the maximum score. As a final step, the sequences belonging to the same activity, are combined together: a graph for each activity class, that is also a subgraph of the original graph. The results obtained are very interesting, as the segments extracted are representative of the activity performed. 4. Discussion This paper introduces VPM, a tool for the automated analysis of smart home sensor logs through process discovery. At the current stage, the proposed graphical representation and, as consequence, models produced by the tool are limited by the type of sensors considered during the analysis. At the current stage, only PIR sensors have been considered; this does not allow to understand precisely the action performed by the user, but only the usage of a device. Supporting other types of sensors, would led to better classify actions and to more readable and accurate models, thus allowing a greater precision of the whole approach. Extending the analysis to other types of sensors will be an important starting point for our future work. In addition, the employment of the tool in a real scenario would require also automatic segmentation techniques in order to describe human routines at a level of granularity finer than an entire day. References [1] M.-R. Tazari, F. Furfari, A. Fides-Valero, S. Hanke, O. Höftberger, D. D. Kehagias, M. Mosmondor, R. Wichert, P. Wolf, The universaal reference model for aal., Hand- book of Ambient Assisted Living 11 (2012) 610–625. [2] F. Leotta, M. Mecella, J. Mendling, Applying process mining to smart spaces: Perspectives and research challenges, in: International conference on advanced information systems engineering, Springer, 2015, pp. 298–304. [3] W. M. P. van der Aalst, Process Mining: Data Science in Action, 2 ed., Springer, 2016. [4] M. Dumas, M. La Rosa, J. Mendling, H. A. Reijers, Fundamentals of Business Process Management (2nd ed.), Springer, 2018. [5] K. Henricksen, J. Indulska, Developing context-aware pervasive computing applications: Models and approach, Pervasive and mobile computing 2 (2006) 37–64. [6] C. Bettini, O. Brdiczka, K. Henricksen, J. Indulska, D. Nicklas, A. Ranganathan, D. Riboni, A survey of context modelling and reasoning techniques, Pervasive and Mobile Computing 6 (2010) 161–180. [7] N. Tax, N. Sidorova, R. Haakma, W. M. van der Aalst, Event abstraction for process mining using supervised learning techniques, in: Proceedings of SAI Intelligent Systems Conference, Springer, 2016, pp. 251–269. [8] A. Aztiria, A. Izaguirre, R. Basagoiti, J. C. Augusto, D. J. Cook, Discovering frequent sets of actions in intelligent environments., in: Intelligent Environments, iOS Press, 2009, pp. 153–160. [9] A. Aztiria, A. Izaguirre, R. Basagoiti, J. C. Augusto, D. Cook, Automatic modeling of frequent user behaviours in intelligent environments, in: Intelligent Environments (IE), 2010 Sixth International Conference on, IEEE, 2010, pp. 7–12. [10] A. K. Clear, T. Holland, S. Dobson, A. Quigley, R. Shannon, P. Nixon, Situvis: A sensor data analysis and abstraction tool for pervasive computing systems, Pervasive and Mobile Computing 6 (2010) 575–589. [11] C. W. Günther, W. M. Van Der Aalst, Fuzzy mining–adaptive process simplification based on multi-perspective metrics, in: International conference on business process management, Springer, 2007, pp. 328–343. [12] D. J. Cook, Learning setting-generalized activity models for smart spaces, IEEE intelligent systems 27 (2012) 32–38. [13] J.-G. Lee, J. Han, K.-Y. Whang, Trajectory clustering: a partition-and-group framework, in: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, 2007, pp. 593–604.