<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Mining for RPA Implementation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tom Hohenadl</string-name>
          <email>tom.hohenadl@stud.ku.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robotic Process Automation (RPA)</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Task mining</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Noise Filtering</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Business Administration and Business Informatics, Catholic University of Eichstätt-Ingolstadt</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Robotic Process Automation (RPA) has gained significant attention in recent years as a technology enabling organizations to automate repetitive tasks and improve operational eficiency. Task mining, a technique used to capture users interactions with software systems, plays a crucial role in understanding user behavior and identifying automation opportunities in RPA implementations. However, task mining data often contains noise, which is erroneous or irrelevant action data, that can afect the accuracy and reliability of analysis results and synthesized bots. This extended abstract presents a research proposal focused on noise filtering techniques specifically tailored for user interaction data recorded for RPA implementation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        Task mining is a growing research area and technology domain. It is a part of process mining
and Business Process Management (BPM). Both academia and industry have shown growing
interest in understanding user actions in business processes [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Currently, process mining
and BPM techniques are mainly focused on information systems [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Most techniques in the
process mining domain focus on static information provided from system logs [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Yet, capturing
ad-hoc user actions and user processes is a BPM problem to be solved in future research [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
Task mining delivers process insights on a user interaction level through recording of clicks or
keystrokes and adds significant benefits for process enhancement [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In addition, the analysis
of task mining logs facilitates the implementation of Robotic Process Automation (RPA) scripts,
so called bots. Where currently a lot of manual initiative is necessary for identifying RPA bot
opportunities and their implementation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], better bots can be created utilizing task mining
methods [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Leno et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] combined the interconnection of task mining and RPA by creating
the Robotic Process Mining (RPM) framework.
      </p>
      <p>
        While task mining focuses on capturing user actions in user interaction logs, discovering user
processes, checking behavior conformance, and enhancing manual processes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], RPM uses task
mining techniques to generate bots without the need for manual analysis or implementation
CEUR
Workshop
Proceedings
      </p>
      <p>
        © 2023 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
efort, thereby automating user behavior across diferent information systems [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Although
bots created using the available methods can work, they are still susceptible to erroneous or
irrelevant behavior recorded in the logs [10]. This behavior is defined as noise, e.g. attribute
noise [11], and is not relevant for analysis of manual processes or implementation [10]. When
implementing this behavior into bots automatically, the benefits of RPA, such as eficiency and
quality [12], are marginal due to erroneous RPA bot executions, which lead to failures and
increased maintenance efort [ 13]. These factors diminish the advantages of RPA compared to
manual work.
      </p>
      <p>The consequences of faulty bot implementation, driven by noisy user interaction logs,
highlight two core areas of focus for this research proposal: the identification of noise-indicating
properties within user actions recorded in interaction logs, and the development of methods
for cleaning such noise from these logs. The primary objective of the research is to create an
artifact capable of identifying such noisy actions so that subsequently automated RPA bots are
not afected by erroneous or irrelevant user behavior. In addition, the purpose of this research
project is to evaluate the transformation of actual user actions into automated RPA routines
during bot implementation and to determine the actions that are important for automation.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Research Proposal and Research Questions</title>
      <p>
        As outlined by Mayr et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] there are three particular challenges to overcome in task mining
research at the moment: segmentation of logs, privacy preservation of user related
information, and noise filtering. This proposal focuses on the noise filtering aspects of the described
challenges.
      </p>
      <p>
        So far, the preprocessing of user interaction logs is performed within the developed RPM
recorders, e.g. actionLogger [14] or smartRPA [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. These developed Design Science Research
(DSR) artifacts use hard-coded static rules to remove unwanted or incorrect user behavior. On
the one hand, Leno et al. mention exemplary rules, such as repeated navigation or copy actions
without pasting [14]. On the other hand, Agostinelli et al. identify the quality of the recorded
ifne-grained data as a weakness of the RPM artifact [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], so that not all contingencies may be
mapped to static rules. Thus, this proposal entails research targeting the detection and removal
of noise in user interaction logs and is grounded on the following research questions:
• What user actions are relevant for RPA?
• How can irrelevant actions be identified and removed after recording users and before
      </p>
      <p>RPA bot creation?
• Do existing algorithms for noise detection from the process mining domain exceed the
capabilities of hard-coded static rule based filters from existing RPM DSR artifacts?
• Is it possible to create a better filtering method based on the results of the previous
research questions?</p>
    </sec>
    <sec id="sec-4">
      <title>3. Research Methodology</title>
      <p>Initially, a structured literature review based on guidelines by Kitchenham et al. [15] was
conducted. This literature review was used to identify user action properties described in the
existing literature. Based on the acquired data a taxonomy of user actions in manual processes
was created following the directive of Nickerson et al. [16].</p>
      <p>Following the literature review and the identification of action categories, a semi-structured
interview study is used to identify translation patterns. A translation is the mapping of user
actions to bot functionality. The interview study is based on the methods described in Saldana
[17]. The guiding questions for the interview is ”Which user actions, intrinsic to users daily
routines, are being incorporated into the operational framework of an RPA bot?”. By answering
this question and encoding the interviewees’ answers, an overview of actions that are valuable
in RPA bots, as well as actions that are nonfunctional, can be created.</p>
      <p>Based on the theoretical findings of the first two research questions, a comparative analysis is
conducted to examine existing process mining noise detection methods and their applicability
for task mining. Initially, students at the researchers institute are instructed to perform a range
of manual processes on their digital devices and record the corresponding actions to create a
reference data set. This data set serves as a baseline for the subsequent comparison of algorithms.
By applying Leno’s et al. [18] and Agostinelli’s et al. [19] hard-coded static noise filtering rules
on the data set, an initial base line to remove noise or irrelevant events from the log can be
established. The disparity between the original logs and the filtered logs will subsequently be
employed to identify appropriate noise detection algorithms. A set of outlier and noise detection
algorithms is reviewed by Koschmieder et al. [11] . The set of publicly available algorithms,
i.e. source code or DSR artifacts, will be used as an initial set of comparable algorithms to the
baseline data. Furthermore, a literature review and forward backward search will be used to
identify further relevant noise filtering algorithms.</p>
      <p>The final objective of this study is to develop a DSR artifact aimed at eliminating the reliance
on hard-coded filtering. The findings from the comparative study of available algorithms as well
as the interview study will be used to elaborate noise filtering techniques for user interaction
logs. Based on these results, a DSR noise filtering artifact will be developed. The reference data
set will be labeled in collaboration with the students and will be used as a basis for evaluation
using scores such as precision, recall or the F-score. The artifact aims to establish a robust data
preprocessing foundation for the automated creation of RPA bots by leveraging filtered user
interaction logs.</p>
    </sec>
    <sec id="sec-5">
      <title>4. Current Research Status</title>
      <p>The initial literature review to identify user actions currently processed in task mining and
RPM has been conducted. The result is a taxonomy containing six categories of value adding
user actions and a category for not relevant actions. These value adding actions categories are
opening, navigating, transforming, transferring, concluding and closing actions. Furthermore,
the category of empty actions, i.e. doing nothing at all, was identified. However, a validation of
these categories through practical use cases or empirical evaluation is still necessary.</p>
      <p>Following the literature review, the initial two stages of the qualitative interview study have
been accomplished. The first stage involved the formulation of interview questions and the
identification of the target group of participants, specifically focusing on RPA developers and
researchers in the fields of RPM or task mining. In the second stage, interviews were conducted
with a total of five RPA developers, four task mining researchers and four RPA business experts.
Following the interviews, an initial round of process coding is conducted. This will be followed
by the categorization of the codes and a second cycle coding approach.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>In the evolving research and industry landscape, the automation of user processes is achieved
using RPA, which has opened up new avenues for improving process eficiency and efectiveness.
RPA implementation, in turn, is further enhanced by leveraging task mining and the RPM
framework. While research on task mining is increasing and industry interest shifts towards
automated automation, the task mining, RPA and RPM domain still has challenges. The described
research proposal aims at improving the automatic creation of RPA bots by removing noise from
user interaction logs. Consequently, this work addresses relevant aspects in the convergence of
task mining, RPA and RPM, bridging gaps in the pursuit of eficient and efective automation
solutions.
Proceedings, volume 393 of Lecture Notes in Business Information Processing, Springer
International Publishing, Cham, 2020, pp. 116–131. URL: doi.org/10.1007/978-3-030-58779-6_8.
[10] S. Agostinelli, M. Lupia, A. Marrella, M. Mecella, Smartrpa: A tool to reactively synthesize
software robots from user interface logs, in: S. Nurcan, A. Korthaus (Eds.), Intelligent
Information Systems, volume 424 of Lecture Notes in Business Information Processing,
Springer International Publishing and Imprint Springer, Cham, 2021, pp. 137–145. URL:
doi.org/10.1007/978-3-030-79108-7_16.
[11] A. Koschmider, K. Kaczmarek, M. Krause, S. J. van Zelst, Demystifying noise and
outliers in event logs: Review and future directions, in: A. Marrella, B. Weber (Eds.),
Business Process Management Workshops, volume 436 of Springer eBook Collection,
Springer International Publishing and Imprint Springer, Cham, 2022, pp. 123–135. URL:
doi.org/10.1007/978-3-030-94343-1_10.
[12] A. Meironke, S. Kühnel, How to measure rpa’s benefits? a review on metrics, indicators, and
evaluation methods of rpa benefit assessment, in: Wirtschaftsinformatik 2022 Proceedings,
2022, pp. 1–19. URL: https://aisel.aisnet.org/wi2022/bpm/bpm/5.
[13] P. Noppen, I. Beerepoot, I. van de Weerd, M. Jonker, H. A. Reijers, How to keep rpa
maintainable?, in: D. Fahland, C. Ghidini, J. Becker, M. Dumas (Eds.), Business Process
Management, volume 12168 of Springer eBook Collection, Springer International Publishing
and Imprint Springer, Cham, 2020, pp. 453–470. URL: doi.org/10.1007/978-3-030-58666-9_
26.
[14] V. Leno, A. Polyvyanny, M. Dumas, F. Maggi, Action logger: Enabling
process mining for robotic process automation, in: Proceedings of the
Dissertation Award, Doctoral Consortium, and Demonstration Track at BPM 2019,
2019, pp. 124–128. URL: https://bia.unibz.it/esploro/outputs/conferenceProceeding/
Action-logger-Enabling-process-mining-for/991006186495001241.
[15] B. Kitchenham, Procedures for performing systematic reviews, Keele, UK, Keele Univ 33
(2004). URL: http://www.inf.ufsc.br/~aldo.vw/kitchenham.pdf.
[16] R. C. Nickerson, U. Varshney, J. Muntermann, A method for taxonomy development and
its application in information systems, European Journal of Information Systems 22 (2013)
336–359. doi:10.1057/ejis.2012.26.
[17] J. Saldaña, The coding manual for qualitative researchers, 4e ed., SAGE, Los Angeles and</p>
      <p>London and New Delhi and Singapore and Washington DC and Melbourne, 2021.
[18] V. Leno, A. Polyvyanyy, M. La Rosa, M. Dumas, F. M. Maggi, Action logger: Enabling
process mining for robotic process automation, in: Proceedings of the Dissertation Award,
Doctoral Consortium, and Demonstration Track at BPM 2019, volume 2420 of CEUR
Workshop Proceedings, CEUR-WS, 2019, p. 5. URL: https://hdl.handle.net/10863/19700.
[19] S. Agostinelli, A. Marrella, M. Mecella, Exploring the challenge of automated segmentation
in robotic process automation, in: Research Challenges in Information Science 2021, 2021,
pp. 38–54. URL: doi.org/10.1007/978-3-030-75018-3_3.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>I. Beerepoot</surname>
          </string-name>
          et al,
          <article-title>The biggest business process management problems to solve before we die</article-title>
          ,
          <source>Computers in Industry</source>
          <volume>146</volume>
          (
          <year>2023</year>
          )
          <article-title>103837</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.compind.
          <year>2022</year>
          .
          <volume>103837</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. La</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Reijers</surname>
          </string-name>
          ,
          <article-title>Fundamentals of business process management, softcover re-print of the hardcover 2nd edition 2018 ed</article-title>
          ., Springer, Berlin,
          <year>2018</year>
          . URL: https://link.springer.com/book/10.1007/978-3-
          <fpage>662</fpage>
          -56509-4.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          et al,
          <article-title>Process mining manifesto</article-title>
          , in: F. Daniel,
          <string-name>
            <given-names>K.</given-names>
            <surname>Barkaoui</surname>
          </string-name>
          , S. Dustdar (Eds.),
          <source>Business Process Management Workshops, Lecture Notes in Business Information Processing</source>
          , Springer, Berlin,
          <year>2012</year>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          , Process mining - Data science in action, second edition ed., Springer, Berlin and Heidelberg,
          <year>2016</year>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>662</fpage>
          - 49851- 4.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>W. van der Aalst</surname>
          </string-name>
          ,
          <article-title>On the pareto principle in process mining, task mining, and robotic process automation</article-title>
          ,
          <source>DATA 2020 - Proceedings of the 9th International Conference on Data Science, Technology and Applications</source>
          (
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .5220/0009979200050012.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Agostinelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marrella</surname>
          </string-name>
          , M. Mecella,
          <article-title>Research challenges for intelligent robotic process automation, Business Process Management Workshops</article-title>
          ,
          <string-name>
            <surname>BPM</surname>
          </string-name>
          <year>2019</year>
          (
          <year>2019</year>
          )
          <fpage>12</fpage>
          -
          <lpage>18</lpage>
          . URL: doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -37453-
          <issue>2</issue>
          _
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. La</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Leno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Maggi</surname>
          </string-name>
          ,
          <article-title>Robotic process mining</article-title>
          , in: W.
          <string-name>
            <surname>M. P. van der Aalst</surname>
          </string-name>
          , J. Carmona (Eds.),
          <source>Process Mining Handbook</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>468</fpage>
          -
          <lpage>491</lpage>
          . URL: doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -08848-3_
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.-V.</given-names>
            <surname>Herm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wanner</surname>
          </string-name>
          , Christian Janiesch,
          <article-title>Applications and challenges of task mining: A literature review</article-title>
          ,
          <source>ECIS 2022 Research-in-Progress Papers</source>
          (
          <year>2022</year>
          ). URL: https://aisel.aisnet.org/ecis2022_rip/55.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Agostinelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marrella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mecella</surname>
          </string-name>
          ,
          <article-title>Automated generation of executable rpa scripts from user interface logs</article-title>
          , in: A.
          <string-name>
            <surname>Asatiani</surname>
          </string-name>
          (Ed.),
          <article-title>BPM 2020 Blockchain and</article-title>
          RPA Forum
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>