<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Composite State Machine Miner: Discovering and Exploring Multi-perspective Processes</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Maikel</forename><forename type="middle">L</forename><surname>Van Eck</surname></persName>
						</author>
						<author role="corresp">
							<persName><forename type="first">Natalia</forename><surname>Sidorova</surname></persName>
							<email>n.sidorova@tue.nl</email>
						</author>
						<author>
							<persName><forename type="first">Wil</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="institution">Eindhoven University of Technology</orgName>
								<address>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="institution">Eindhoven University of Technology</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Composite State Machine Miner: Discovering and Exploring Multi-perspective Processes</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">C0E4B65E1179F5BFF7D10D6D38A2C376</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T23:24+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>process discovery</term>
					<term>state-based models</term>
					<term>multi-perspective processes</term>
					<term>interactive process exploration</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Process mining provides fact-based insights into processes based on behaviour captured in event data. An important aspect of this is the discovery of process models from such data. Traditionally, the focus of process discovery is on learning the ordering of activities. We deviate from this dominating activity view on processes to focus on states and state changes. Specifically, we aim to discover state-based models for processes where different facets, or perspectives, of the process can be identified. In this paper we describe an interactive process discovery tool that can be used to discover and explore state-based models for such multi-perspective processes: the Composite State Machine Miner. It quantifies and visualises the interactions between perspectives to provide additional process insights. This tool has been used to analyse the BPI Challenge 2012 data of a loan application process and product user behaviour data gathered by Philips during the development of a smart baby bottle equipped with various sensors.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>We assume that the reader is familiar with the basic concepts of process mining and process discovery, and we refer to <ref type="bibr" target="#b0">[1]</ref> for an in-depth overview. The goal of most process discovery approaches is to obtain models that describe the ordering of activities within a process. These approaches usually have an implicit notion of the state of a process.</p><p>In this work we deviate from the dominating activity view on processes in order to focus explicitly on process states and state changes. This state view is more intuitive than an activity view for processes for which state information is explicitly available. Examples of such explicit state information are the diagnosis of a patient in a healthcare process or the status of an order in a purchasing process.</p><p>When studying the states of a process, that single process can have different facets, or perspectives, each with their own state space. For example, consider the homeostatic process in a person, parts of which regulate sleep and nutrition. From the perspective of sleep the state of a person can be e.g. awake or asleep, while the state of the nutrition perspective can be e.g. hungry, eating or sated. The state of a person at any point in time is the composition of the state of both perspectives. These perspectives have individual process cycles, but there are interdependencies between states from different perspectives, e.g. people are awake while eating. Our goal is to analyse these interdependencies between perspectives for multi-perspective state-based processes.</p><p>An approach to achieve this is described in detail in <ref type="bibr" target="#b2">[3]</ref>. In this work we discuss the implementation of this approach: the Composite State Machine (CSM) Miner.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Implementation</head><p>The CSM Miner has been implemented as a plug-in of the ProM framework <ref type="bibr" target="#b3">[4]</ref>. It is obtained by installing ProM 6.6 or later from http://www.promtools.org/ and then using the ProM package manager to instal the CSMMiner package. Example logs that can be used with the CSM Miner can be obtained from https://svn.win.tue.nl/repos/prom/Packages/CSMMiner/Logs. There is a screencast providing a demonstration of the main features of the miner at: https://svn.win.tue.nl/repos/prom/Packages/CSMMiner/Documentation.</p><p>The main assumption behind the CSM Miner is that each state of the multiperspective composite process under study is a vector of the states of its perspectives. That is, for a process with n perspectives each state of the composite process is a state of the form: s = (s 1 × . . . × s n ). The CSM Miner discovers a state machine describing the states and state changes of the composite process, as well as a state machine for each perspective. Transitions (s → s ) in the discovered composite state machine represent state changes in one or more perspectives, while transitions (s i → s i ) in the discovered state machine of perspective i represent a change in that specific perspective.</p><p>The input data for the CSM Miner is assumed to be an XES event log <ref type="bibr" target="#b3">[4]</ref> where each event represents a state change in a specific perspective. In Fig. <ref type="figure">1</ref> an example of the desired input is shown. Each trace in the log is required to contain attributes of the form process:initialstate:[perspectiveName] specifying the initial state of each perspective at the start of the trace. Each event is required to contain an attribute of the form process:name specifying the perspective for which it is a state change.</p><p>An example of a composite state machine and its perspective state machines discovered by the CSM Miner is shown in Fig. <ref type="figure">3</ref>. Each state machine is displayed separately in an interactive visualisation where states and transitions can be dragged to move them around. Fig. <ref type="figure">1</ref>: A partial XES event log that can be used as input for the CSM Miner. Each trace in the log is required to be annotated with attributes specifying the initial state of each perspective and each event is required to be annotated with the perspective to which they belong.</p><p>States and transitions are annotated with statistics and additional statistics are shown at the bottom of the visualisation for the state or transition that is currently selected. For each state the statistics show the total number of times this state was observed to occur in the log, as well as the total time that was spent in this state. For each transition the statistics also show the total number of times this transition was observed. The percentage for each transition shows the observed frequency with which this transition was taken from the source state of the transition, i.e. if a state has only one outgoing transition then it is annotated with 100% and if there are two outgoing transitions that have been observed equally many times then they are each annotated with 50%.</p><p>By clicking on a state or transition the states and transitions in the other models that co-occur with the selected element are highlighted. The highlighted states also show two different statistics: confidence and lift. Confidence expresses what percentage of time the highlighted state was observed together with a selected state compared to the total time spent in that selected state. Lift expresses how much the confidence differs from the percentage of time that a highlighted state is expected to occur independent of the selected state. For a more detailed discussion of confidence and lift we refer to <ref type="bibr" target="#b2">[3]</ref>.</p><p>The visualisation of the discovered models also contains functionality that enables the user to transform the model. This functionality can be accessed from an expandable menu, as shown in Fig. <ref type="figure">2</ref>, and allows the user to remove selected states from the model and to aggregate selected states. During the model transformation the statistics are recalculated.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Case Studies</head><p>The Composite State Machine Miner has been used during the analysis of two case studies. The first case study concerned the BPI Challenge 2012 data of Fig. <ref type="figure">2</ref>: The green states are selected for aggregation by the user because they are always executed together, but in arbitrary order. a loan application process <ref type="bibr" target="#b1">[2]</ref>. The second case study involved the analysis of product user behaviour data gathered by Philips during the development of a smart baby bottle equipped with various sensors. These case studies and the insights provided by the CSM Miner are discussed in detail in <ref type="bibr" target="#b2">[3]</ref>.</p><p>The models shown in Fig. <ref type="figure">3</ref> have been discovered on the BPI Challenge 2012 data. These models are much more structured than models discovered by traditional process discovery algorithms on the same data. Analysing them provided useful insights into the difference between automatically and manually processed applications and the effectiveness of fraud investigation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusion</head><p>In this paper we have presented the Composite State Machine Miner, an interactive tool for the discovery and exploration of state-based multi-perspective processes. The CSM Miner can be used to study the interactions between perspectives in such processes. This has been used successfully in two case studies.</p><p>Future work that we plan to do is focussed at improving the practical usability of the tool. For example, it would be useful to suggest states or relations between perspectives that are likely to be of interest to the user, to avoid having to click on every element of the discovered models to explore the results. Also, additional support for different types of relations between perspectives would enable the CSM Miner to generate additional insights. Fig. <ref type="figure">3</ref>: The interactive visualisation of a discovered composite state machine. The selected state is denoted with a red box and its co-occurring states and transitions are highlighted in the other perspectives and the overall view based on their confidence. The statistics below the visualisation refer to the selected state or transition.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="5,144.98,179.43,515.28,210.68" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Process Mining: Discovery, Conformance and Enhancement of Business Processes</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Van Dongen</surname></persName>
		</author>
		<idno type="DOI">10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f</idno>
		<ptr target="http://dx.doi.org/10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f" />
	</analytic>
	<monogr>
		<title level="j">BPI Challenge</title>
		<imprint>
			<date type="published" when="2012">2012. 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Discovering and Exploring State-based Models for Multi-perspective Processes</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Van Eck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Sidorova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">14th International Conference on Business Process Management (BPM)</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>In press</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">XES, XESame, and ProM 6</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M W</forename><surname>Verbeek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C A M</forename><surname>Buijs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Van Dongen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Information Systems Evolution</title>
				<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="60" to="75" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
