<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Interest-Driven Recommendations To Support Time Performance Analyses</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Carlos</forename><surname>Capitán-Agudo</surname></persName>
							<email>ccagudo@us.es</email>
							<affiliation key="aff0">
								<orgName type="laboratory">SCORE Lab</orgName>
								<orgName type="institution">University of Seville</orgName>
								<address>
									<settlement>Seville</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Interest-Driven Recommendations To Support Time Performance Analyses</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">0E591C123B08ADE29026650BA394FCA7</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T20:03+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Time performance analysis</term>
					<term>BPI challenge</term>
					<term>Business questions</term>
					<term>Interestingness</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper outlines a doctoral research work that investigates how to support analyses that address time performance questions. To do that, it is investigated how process analysts behave to answer them. Aligned with insights of their behaviors, a system will be developed to recommend the most interesting insights for a certain analyst, with the objective of automating the analysis process.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In recent years, there has been an increasing development of process mining techniques with the objective of discovering, monitoring, and improving business processes <ref type="bibr" target="#b0">[1]</ref>. These techniques usually focus on specific aspects of the processes, such as the time performance of process executions or the distribution of activities. However, these techniques have the problem that they require extensive previous knowledge to be applied. This problem has inspired the creation of various types of support to facilitate the use of these techniques.</p><p>On the one hand, methodologies and case studies have been developed to understand how these techniques are applied. For example, in <ref type="bibr" target="#b1">[2]</ref> the authors evaluated the maturity of the field from a practical perspective, taking into account the adoption of tools and the use of process mining methodologies throughout the years. In <ref type="bibr" target="#b2">[3]</ref> the authors evaluated qualitatively reports submitted to an annual competition called the Business Process Intelligence Chanllenge (BPIC). In that competition, participants receive an event log provided by a real organization along with specific business questions, and they submit a report with the analyses performed to answer them. Specifically, the objective in <ref type="bibr" target="#b2">[3]</ref> was to understand the use of visual techniques in process mining. Furthermore, in <ref type="bibr" target="#b3">[4]</ref> the authors performed an empirical study to observe how analysts tackle the task of initial exploration. In their study, they conducted interviews with 12 participants where they asked a general business question. To answer it, the participants examined and analyzed an event log.</p><p>On the other hand, other authors have focused on developing tool support to assist exploratory analyses. For instance, in <ref type="bibr" target="#b4">[5]</ref> the authors developed a system that receives an event log and recommends subsets that could contain interesting insights. To achieve this, they applied trace clustering techniques to split the log into trace subsets. Subsequently, they rank these subsets based on their level of deviation in Key Performance Indicators (KPI) or Process Performance Indicators (PPI), as well as trace diversity, from the most to the least diverse and deviating. Additionally, in <ref type="bibr" target="#b5">[6]</ref> the authors proposed a solution to support the awareness and the track of steps in exploratory analyses. To accomplish that, they integrate the system into a process mining tool, where it is built a replayable history of the user interactions with the tool. They also provide views where the goals of the user and the applied data operations of the tool are represented as graphs.</p><p>The previous research approaches have facilitated the application of process mining techniques from a global or exploratory perspective. However, it is unknown how analysts use these techniques to address specific business questions, which usually play an important role during analysis <ref type="bibr" target="#b1">[2]</ref>. Furthermore, no tool has been developed to provide recommendations to address specific business questions. The research goal of this doctoral research work is to extend existing tools by providing a recommendation tool to address time performance questions, with the objective of automating the analysis to answer time performance questions. The recommendations will be aligned with the behavior of the analysts. In particular, the tool support is focused on time performance questions referred to aspects such as calculating cycle times of the process, finding bottlenecks, calculating waiting times of resources, or determining time performance differences. Time performance is one of the most frequent problems in process mining <ref type="bibr" target="#b6">[7]</ref>. From this main goal, the following research questions should be answered to achieve it: RQ1: How do process analysts behave when answering to time performance questions? RQ2: How can we support time performance analyses through recommendations aligned with their behaviors to automate the analysis process?</p><p>By addressing the first question, we can derive comprehensive knowledge that represents the proper approaches for conducting these analyses. To answer the second question, a tool support will be developed to make recommendations about the analysis, with the objective of automating part of the analyst process. These recommendations will be aligned with the knowledge discovered in the first question. The following objectives have been defined to answer them: OBJ1: Understand how process analysts behave when dealing with time performance questions OBJ2: Provide support through recommendations aligned with the analyst behaviors to automate the process</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Methodology</head><p>This doctoral research work will follow the principles of Design Science Research (DSR) to develop the needed support <ref type="bibr" target="#b7">[8]</ref>. This research methodology focuses on the creation of artifacts that improve the quality of life of our society through technological progress. As artifact is understood any element containing knowledge such as models, methods, or theories. The first step of DSR is to identify a specific problem and collect data related to it. Secondly, the objectives and requirements to solve that problem must be defined. Thirdly, the data collected in the first step is analyzed, and the artifact is designed. Fourthly, the usefulness of the artifact to address the problem is evaluated. Finally, the process is documented and the results are communicated to other researchers. Taking into account the previous methodology, the following steps will be performed to answer the questions. By thoroughly reviewing related literature, we will collect analyses that address time performance questions. Specifically, we will collect these analyses from different sources such as reports or screen and audio recordings. The objectives have already been defined and highlighted in the Introduction. To achieve these objectives, we will firstly conduct a mixed quantitative-qualitative study of these analyses to understand how they are performed. Secondly, we will qualitetively develop a set of codes related to the actions that were performed in the analyses inspired by Grounded Theory <ref type="bibr" target="#b8">[9]</ref>. Afterwards , we will apply both quantitative and qualitative methods to obtain conclusions of the codes (e.g., clustering techniques, similarity measures). These conclusions will be a set of insights describing the behavior of analysts to address time performance questions. Once, these insights are obtained, a recommender system will be developed to help analysts answer them. This system will support users throughout the process, offering automatic recommendations for doing time performance analyses to answer business questions. These recommendations will be aligned with the insights of the analyst behaviors. A study involving experts in process mining will be conducted to evaluate the tool</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Research state</head><p>The first step of this research has been completed, and the specific problem has been highlighted in the Introduction.</p><p>The data collection step of the investigation is close to completion, and a significant portion of the required data has been gathered. To gather the reports, we have analyzed the questions related to all the BPICs identifying those that were related to time performance. Subsequently, we collected reports submitted to the BPIC that had a clear section to answer time performance questions. In total, four BPI challenges had time performance questions, and 110 answers were selected. With respect to the recordings, a dataset composed of 12 videos of analysts answering time performance questions has been gathered, where the conduct of the analysts was captured in real time during their analysis process.</p><p>The study of the previous data is in advanced state. With respect to the answers of the BPICs, they were independently coded by two researchers who shared their codings, and they have already been analyzed. These results results were published in the BPM 2022 <ref type="bibr" target="#b9">[10]</ref>, and are described next. A set of codes of 55 different operations was obtained (e.g., Filter traces) and a set of 137 different ways to implement these operations (e.g. Filter traces by activities), which were called variants. These operations had different purposes such as analyze time (e.g., Calculate cycle time) or manipulate the data (e.g., Filter traces). Moreover, we applied the Kmeans algorithm to classify the answers in different groups. The best results were obtained for four clusters using the Silhouette index as evaluation measure. Each group represented an answer type that were characterized using depending on the operations used along with their frequencies. We also measured the similarities between the answers depending on different contexts(e.g., the author profile, the objective of the question) to see their influence. These insights have recently been expanded with the dataset of recordings. This dataset could enrich and generalize them by adding the conduct of analysts. This extension of the results is under revision in a scientific journal.</p><p>Finally, the development of the recommender system is in an early stage <ref type="bibr" target="#b10">[11]</ref>. We are currently designing the recommendation process and establishing its alignment with the previous insights. One way to perform the recommendations could be to assess the operations related to process metrics and manipulation methods of the discovered catalog that are interesting for a time performance analysis of a specific user. This evaluation could be performed using measures that assess the interestingness of association rules <ref type="bibr" target="#b11">[12]</ref>, which have already been used to recommend filtering methods that provide the most interesting data subsets or summaries in data mining <ref type="bibr" target="#b12">[13]</ref>. These measures assess the interestingness of the data based on various aspects, including conciseness (few attribute-value pairs in the result) and generality (a result that encompasses a significant subset of the dataset). Thus, the challenge is to make them suitable for process mining and the mentioned operations.</p></div>		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has been funded by projects RTI2018-100763-J-I00 (CONFLEX), and PID2021-126227NB-C21 (PERSEO), and TED2021-131023B-C22 (ORCHID) granted by MCIN/ AEI/ 10.13039/501100011033/ and ERDF A way of making Europe. C. Capitán-Agudo is supported by the Spanish Ministry of Education, under the national FPU plan (FPU21/03631 grant).</p><p>This PhD is supervised by Manuel Resinas Arias de Reyna and Cristina Cabanillas Macías from the University of Seville.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Process Mining -Data Science in Action</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-662-49851-4</idno>
		<imprint>
			<date type="published" when="2016">2016</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
	<note>Second Edition</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A Case Study Lens on Process Mining in Practice</title>
		<author>
			<persName><forename type="first">F</forename><surname>Emamjome</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Andrews</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">H</forename><surname>Hofstede</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">OTM Conferences</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="127" to="145" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Mining Process Mining Practices: An Exploratory Characterization of Information Needs in Process Analytics</title>
		<author>
			<persName><forename type="first">C</forename><surname>Klinkmüller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Müller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Weber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Int. Conf. on Business Process Management (BPM)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="322" to="337" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Initial insights into exploratory process mining practices</title>
		<author>
			<persName><forename type="first">F</forename><surname>Zerbato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Soffer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Weber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BPM Forum</title>
		<imprint>
			<biblScope unit="page" from="145" to="161" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">ProcessExplorer: Intelligent Process Mining Guidance</title>
		<author>
			<persName><forename type="first">A</forename><surname>Seeliger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sánchez Guinea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Nolle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mühlhäuser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Business Process Management (BPM)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="216" to="231" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Supporting provenance and data awareness in exploratory process mining</title>
		<author>
			<persName><forename type="first">F</forename><surname>Zerbato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Burattin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Völzer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">N</forename><surname>Becker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Boscaini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Weber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Int. Conf. on Advanced Information Systems Engineering (CAiSE)</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="454" to="470" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">PM 2 : A Process Mining Project Methodology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Van Eck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">J J</forename><surname>Leemans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M</forename><surname>Van Der Aalst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Int. Conf. on Advanced Information Systems Engineering (CAiSE)</title>
				<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="297" to="313" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">A Design Science Research Methodology for Information Systems Research</title>
		<author>
			<persName><forename type="first">K</forename><surname>Peffers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Tuunanen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Rothenberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chatterjee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Manag. Inf. Sys</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="45" to="77" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Grounded theory in software engineering research: a critical review and guidelines</title>
		<author>
			<persName><forename type="first">K</forename><surname>Stol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Ralph</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Fitzgerald</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ICSE</title>
		<imprint>
			<biblScope unit="page" from="120" to="131" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Analyzing how process mining reports answer time performance questions</title>
		<author>
			<persName><forename type="first">C</forename><surname>Capitán-Agudo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Salas-Urbano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cabanillas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Resinas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Business Process Management (BPM)</title>
				<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="234" to="250" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Recommending interesting results in process mining analysis</title>
		<author>
			<persName><forename type="first">C</forename><surname>Capitán-Agudo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Salas-Urbano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cabanillas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Resinas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">JCIS</title>
		<imprint>
			<biblScope unit="volume">2022</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Hilderman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">J</forename><surname>Hamilton</surname></persName>
		</author>
		<title level="m">Knowledge Discovery and Measures of Interest</title>
				<imprint>
			<publisher>Kluwer Academic Publishers</publisher>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Predicting &quot;What is interesting&quot; by mining interactive-dataanalysis session logs</title>
		<author>
			<persName><forename type="first">T</forename><surname>Milo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Ozeri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Somech</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Int. Conf. on Extending Database Technology (EDBT)</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="456" to="467" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
