<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Tool for Incorporating Eye Tracking Data in RPA: Enhancing User Behavior Logs</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">M</forename><surname>García-Romero</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Languages and Systems. E.T.S</orgName>
								<orgName type="institution">University of Seville. Ingeniería Informática. Avd. Reina Mercedes</orgName>
								<address>
									<addrLine>s/n</addrLine>
									<postCode>41012</postCode>
									<settlement>Seville</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Antonio</forename><surname>Martínez-Rojas</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Languages and Systems. E.T.S</orgName>
								<orgName type="institution">University of Seville. Ingeniería Informática. Avd. Reina Mercedes</orgName>
								<address>
									<addrLine>s/n</addrLine>
									<postCode>41012</postCode>
									<settlement>Seville</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Enríquez</surname></persName>
							<email>jgenriquez@us.es</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Languages and Systems. E.T.S</orgName>
								<orgName type="institution">University of Seville. Ingeniería Informática. Avd. Reina Mercedes</orgName>
								<address>
									<addrLine>s/n</addrLine>
									<postCode>41012</postCode>
									<settlement>Seville</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">A</forename><surname>Jiménez-Ramírez</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer Languages and Systems. E.T.S</orgName>
								<orgName type="institution">University of Seville. Ingeniería Informática. Avd. Reina Mercedes</orgName>
								<address>
									<addrLine>s/n</addrLine>
									<postCode>41012</postCode>
									<settlement>Seville</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Tool for Incorporating Eye Tracking Data in RPA: Enhancing User Behavior Logs</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">28DEF7A59A6E02C7DD3662A1787530FE</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:02+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Robotic Process Automation</term>
					<term>Robotic Process Mining</term>
					<term>User Behaviour Mining</term>
					<term>User Behaviour Log</term>
					<term>Eye Tracking</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper presents UBGI, an innovative tool designed to enhance Robotic Process Automation (RPA) by integrating eye tracking data with user interface (UI) logs. UBGI processes and combines gaze logs with UI logs to create enriched User Behaviour (UB) logs, enabling more precise identification of user focus areas. By applying filtering masks to screenshots, UBGI highlights relevant data, facilitating analysis of user interactions. This tool enables further analysis of user behaviour through an external source, specifically eye tracking data.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In the current era of Hyperautomation <ref type="bibr" target="#b0">[1]</ref>, companies are driven to quickly identify and automate every possible business process. This trend coincides with the growing popularity of RPA. Unlike traditional automation methods, such as those based on APIs, RPA is based on graphical user interfaces to automate and integrate systems <ref type="bibr" target="#b1">[2]</ref>.</p><p>As the first condition for automating a process, it is necessary to understand how it is performed and even discover what process needs to be automated. To address this, the so-called mining techniques emerged. Mining techniques like Robotic Process Mining (RPM) <ref type="bibr" target="#b1">[2]</ref> and User Behaviour Mining (UBM) <ref type="bibr" target="#b2">[3]</ref> are essential to identify process models that represent human behaviour, facilitating automation of processes.</p><p>These techniques rely on the UI log <ref type="bibr" target="#b3">[4]</ref>, which records user actions as clicks or keystrokes within a graphical user interface. In addition, the information on the screen can be incorporated as features to provide context for user actions. This can be useful for analysing the routines or decisions recorded in the UI log <ref type="bibr" target="#b4">[5]</ref>. Nevertheless, a challenge arises because of the extensive amount of data contained in the screenshots. These screenshots capture relevant and irrelevant information, encompassing all UI elements displayed on the screen <ref type="bibr" target="#b5">[6]</ref>. Consequently, extracting meaningful insights from UI logs becomes challenging, especially when dealing with complex and information-dense graphical user interfaces. This mixture of data requires careful filtering to ensure optimal performance. To address this problem, eye tracking technology can be applied to capture the points of the screen where the user focuses his gaze while interacting with the user interface <ref type="bibr" target="#b6">[7]</ref>. Additional gaze information such as the Point of Gaze (POG), gaze fixations and dispersion <ref type="bibr" target="#b7">[8]</ref> are obtained by eye tracking software based on webcams or infrared eye tracking tools. This additional gaze information is essential to define attention areas and enables us to distinguish between relevant and irrelevant data in the screenshots <ref type="bibr" target="#b5">[6]</ref>, as shown in Figure <ref type="figure" target="#fig_0">1</ref>. These partially-masked screenshots can help by revealing the UI elements users need to interact with, connecting clicks or keystrokes to specific on-screen content.</p><p>This article introduces User Behaviour and Gaze Integrator (UBGI) <ref type="foot" target="#foot_0">1</ref> , a tool capable of performing the following functions:</p><p>• Proccess and combine the eye tracking data extracted from the eye tracking software as a gaze log with the UI log defined in. The gaze log is any CSV file that includes gaze events as defined in <ref type="bibr" target="#b5">[6]</ref>. The UI log supports formats such as XES/CSV/MHT that include user actions, as reflected in <ref type="bibr" target="#b8">[9]</ref>. As a result of this combination, the User Behaviour (UB) log is obtained <ref type="bibr" target="#b6">[7]</ref> . • Generate screenshots from those obtained in the UI log with an overlay mask. This mask covers all regions of the screenshot where fixations have not been detected, leaving visible only the regions where fixations have been already detected. These visible regions in the screenshots represent the attention areas <ref type="bibr" target="#b5">[6]</ref>.</p><p>The UB log and the generated screenshots can be used as seen in <ref type="bibr" target="#b5">[6]</ref> to filter information extracted from the screen to improve performance, which is useful for RPM approaches such as routine discovery proposals <ref type="bibr" target="#b3">[4]</ref> or decision models <ref type="bibr" target="#b4">[5]</ref>.</p><p>The remainder of this work is structured as follows. Section 2 describes the main features of the tool. In Section 3, the maturity of the tool is discussed. Section 4 provides links to the demonstration of the tool in the form of a video. Finally, the conclusions of this work are presented in Section 5</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">UBGI Tool Features</head><p>The proposed tool has been developed as a web application using Django REST framework for Python. JavaScript is used for client-side rendering, and PostgreSQL is used as the database.</p><p>The innovative features of the tool reside in two phases: the UB Log Gaze Enricher phase and the Filtering Mask phase. Before these two phases, the user must record the execution of the process, i.e., the case study. The case study consists of one or more UI logs and gaze logs for one or more scenarios. To obtain the UI log for each scenario with the screenshots associated with each action, loggers are used to monitor the user's activity and save it. To obtain the gaze log, it is necessary to have an eye tracking tool, such as webcam-based eye tracking software that can capture the POGs on the screen during the case study recording. WebgaWzer.js is an open source webcam-based eye tracking web application<ref type="foot" target="#foot_1">2</ref>  <ref type="bibr" target="#b9">[10]</ref>. It is already included in the UBGI tool.</p><p>Once the case study is available and loaded into the UBGI tool, it is executed through the following phases:</p><p>• UB Log Gaze Enricher phase: In this phase, we integrate the UI log with the gaze log to form the UB log. The eye tracking software data is analyzed to identify gaze events such as fixations, saccades, and dispersion values <ref type="bibr" target="#b7">[8]</ref>. The integration of information from two different logs into a single log is made possible by timestamps (in UTC format), which allow us to match user actions, such as clicks and keystrokes, with the corresponding gaze events occurring within the time interval of a specific action and the next one. In this way, we obtain the UB Log enriched with gaze data. • Filtering Mask phase: From the screenshots extracted from the UI log, a copy of each is generated. For each of these new screenshot copies, a gaze-based mask is created. This gaze-based mask consists of a black filter mask that covers the entire screenshot, except for the regions identified as attention areas <ref type="bibr" target="#b5">[6]</ref>. The attention areas, which are the regions that are not overlapped by the filtering mask, are circles drawn from the centroid of each fixation with a radius determined by the value of the dispersion. The user can increase or decrease this parameter within the UBGI tool <ref type="bibr" target="#b5">[6]</ref>. In this way, it is possible to observe which regions of the screen have been viewed by the user and which have not, to determine the relevant and irrelevant information between the user actions. Figure <ref type="figure" target="#fig_0">1</ref> shows an example of a screenshot copy generated in this phase.</p><p>The UB log generated in the UB Log Gaze Enricher phase and the screenshots that include the filtering mask can be downloaded as a.zip file once the case study is executed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Tool Maturity</head><p>The UBGI tool is currently in a stable version that meets its purposes. Nevertheless, there are several lines of research and future work that can contribute to a higher level of maturity.</p><p>First, the UBGI tool demonstrates flexibility in adjusting different parameters and formats of UI logs. However, it does not have the same level of adaptability for gaze logs. In future work, it is necessary to deepen the understanding of the various available eye tracking software and apply a similar treatment to that used for UI logs. This will allow UBGI to process any gaze log from any eye tracking software.</p><p>Second, determining the attention areas and gaze masks in the Filtering mask phase only considers fixation and dispersion gaze events. Other parameters can be configured to determine an optimal drawing of the attention areas and, conversely, the filtering mask (e.g., thresholds, saccade eye movements, etc.).</p><p>Third, the artefacts generated in the phases of the UBGI tool contain important information only if they are going to be processed. Therefore, it is important to integrate the artefacts generated in the UBGI phases into new phases or modules that can be created within the tool for the discovery of automatable processes. Even include the UBGI tool in other tools that perform the function of extracting features from the UI for process discovery.</p><p>UBGI has been subjected to pilot tests in simulated environments that replicate real-use conditions. These tests have provided valuable data to identify possible improvements and confirm the effectiveness of UBGI in data integration and analysis. However, implementation in real productive environments will be crucial to evaluate its performance in intensive use situations and its capacity to handle massive amounts of data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Tool Demonstration</head><p>In this section, we present a demonstration of the artefacts generated by the UBGI tool. To do this, we will record and execute a complete case study <ref type="foot" target="#foot_2">3</ref> . The case study simulates a process within the HR department of a company. Specifically, the case study involves an administrative worker who must attach the COVID 19 certificates of company employees to the database from a folder. If an employee is not found in the database, the administrator must report the missing employee with his COVID 19 certificate.</p><p>For recording the UI log, we use Steprecorders, a native application of the Windows operating system. To record the gaze log, we use the Webgazer.js eye tracking software.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>The User Behaviour Gaze Integrator (UBGI) tool contributes to the BPM community with the ability to integrate and analyse eye tracking data and user interface logs, enabling the consideration of broader contextual information when studying user behaviour. Therefore, this opens up new lines of work in fields such as RPA, UBM, and RPM.</p><p>The proposed tool processes user activities by first integrating UI logs with gaze logs to form an enriched UB log. This is achieved through the UB Log Gaze Enricher phase, which matches user actions with corresponding gaze events using timestamps. In the subsequent Filtering Mask phase, the tool generates modified screenshots highlighting attention areas by overlapping a gaze-based mask, revealing relevant and irrelevant information found in the screenshots, whether the user is performing an action or not. These capabilities allow for a more detailed analysis of user interactions, providing valuable insights to further analysis for process automation strategies.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: (A). A screenshot extracted from a UI Log. All UI element in screenshot must be processed later. (B). A screenshot obtained after executing UBGI. A black gaze mask hides all the regions where the user has not focused his gaze.</figDesc><graphic coords="2,89.29,55.84,416.69,111.12" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Source Code: https://github.com/RPA-US/ubgi</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">Webgazer.js source code: https://github.com/brownhci/WebGazer.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">Video: https://youtu.be/hW4-oQRNBC8</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research was supported by the EQUAVEL project PID2022-137646OB-C31, funded by MICIU/AEI/10.13039/501100011033 and by FEDER, UE; the DISCOVERY project (2021/C005/00148631), funded by Unión Europea NextGeneration EU and "Plan de Recuperación, Transformación y Resiliencia" of the Ministry of Economic and Digital Transformation; and the grant FPU20/05984 funded by MICIU/AEI/10.13039/501100011033 and by FSE+.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Humans, processes and robots: a journey to hyperautomation</title>
		<author>
			<persName><forename type="first">A</forename><surname>Jiménez-Ramírez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Business Process Management</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="3" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M</forename><surname>Van Der Aalst</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bichler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Heinzl</surname></persName>
		</author>
		<title level="m">Robotic process automation</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">User behavior mining: A research agenda</title>
		<author>
			<persName><forename type="first">J.-R</forename><surname>Rehse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Abb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Berg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bormann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kampik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Warmuth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Business &amp; Information Systems Engineering</title>
		<imprint>
			<biblScope unit="page" from="1" to="18" />
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Reactive synthesis of software robots in rpa from user interface logs</title>
		<author>
			<persName><forename type="first">S</forename><surname>Agostinelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lupia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Marrella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mecella</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computers in Industry</title>
		<imprint>
			<biblScope unit="volume">142</biblScope>
			<biblScope unit="page">103721</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A screenshotbased task mining framework for disclosing the drivers behind variable human actions</title>
		<author>
			<persName><forename type="first">A</forename><surname>Martínez-Rojas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jiménez-Ramírez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Enríquez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">A</forename><surname>Reijers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Systems</title>
		<imprint>
			<biblScope unit="volume">121</biblScope>
			<biblScope unit="page">102340</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">What are you gazing at? an approach to use eye-tracking for robotic process automation</title>
		<author>
			<persName><forename type="first">A</forename><surname>Martínez-Rojas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">A</forename><surname>Reijers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jiménez-Ramírez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Enríquez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Business Process Management</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="120" to="134" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Incorporating the user attention in user interface logs</title>
		<author>
			<persName><forename type="first">A</forename><surname>Martínez-Rojas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jiménez-Ramírez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Enríquez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lizcano-Casas</surname></persName>
		</author>
		<idno type="DOI">10.5220/0011568000003318</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 18th International Conference on Web Information Systems and Technologies -WEBIST</title>
				<meeting>the 18th International Conference on Web Information Systems and Technologies -WEBIST<address><addrLine>SciTePress</addrLine></address></meeting>
		<imprint>
			<publisher>INSTICC</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="415" to="421" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Identifying fixations and saccades in eye-tracking protocols</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">D</forename><surname>Salvucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Goldberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2000 symposium on Eye tracking research &amp; applications</title>
				<meeting>the 2000 symposium on Eye tracking research &amp; applications</meeting>
		<imprint>
			<date type="published" when="2000">2000</date>
			<biblScope unit="page" from="71" to="78" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">A toolsupported method to generate user interface logs</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Martínez</forename><surname>Rojas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jiménez Ramírez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Gonzalez</forename><surname>Enriquez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">A</forename><surname>Reijers</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">HICSS: Hawaii International Conference of System Sciences</title>
				<imprint>
			<date type="published" when="2023">2023. 2023</date>
			<biblScope unit="page" from="5472" to="5481" />
		</imprint>
	</monogr>
	<note>56th Hawaii International Conference on System Sciences</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Webgazer: scalable webcam eye tracking using user interactions</title>
		<author>
			<persName><forename type="first">A</forename><surname>Papoutsaki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Sangkloy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Laskey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Daskalova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hays</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI&apos;16</title>
				<meeting>the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI&apos;16</meeting>
		<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="3839" to="3845" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
