=Paper=
{{Paper
|id=Vol-3299/Paper08
|storemode=property
|title=Process Mining on Blockchain Data (Extended Abstract)
|pdfUrl=https://ceur-ws.org/Vol-3299/Paper08.pdf
|volume=Vol-3299
|authors=Richard Hobeck
|dblpUrl=https://dblp.org/rec/conf/icpm/Hobeck22
}}
==Process Mining on Blockchain Data (Extended Abstract)==
Process Mining on Blockchain Data (Extended Abstract) Richard Hobeck Technische Universitaet Berlin, Software and Business Engineering Abstract Process Mining has been applied to several domains and IT systems in previous years. This extended abstract describes an approach to process mining on blockchain data. Keywords Process Mining, blockchain, monitoring, software logging 1. Motivation Business processes are at the heart of every enterprise and are decisive for their success [1]. With the rise of information technology, processes were increasingly managed digitally in specialized information systems for business environments. The execution of the processes in information systems generates data. To improve the processes, the execution data can be computationally analyzed with a set of techniques called process mining [2]. Process mining experienced increasing popularity in research as well as in the industry across domains. Vital remained the access to process execution data. Similar to traditional enterprise information systems, blockchains can also serve as process execution platforms and are already used as such for various applications. As a cross-organizational platform, blockchains have the potential to substitute trustworthy mediator entities in an otherwise trustless environment thanks to their technological properties [3]. These properties include blockchains to be an append-only data-storage [4, p. 5] in which process execution data can be stored. Here process mining techniques might be a valuable addition to generate insights from the execution data. That includes benefits seen in other technological domains, but encompasses blockchain-specific benefits, e.g., security analysis. The opportunities for research are outlined in the following chapters. 2. Research Contribution and Plan The dissertation will aim at providing an approach to process mining on blockchain data. Initially, the usefulness and suitability of process mining on blockchain application data obtained with existing extraction tools will be examined in case studies [5] to establish two base assumptions: ICPM 2022 Doctoral Consortium and Tool Demonstration Track Envelope-Open richard.hobeck@tu-berlin.de (R. Hobeck) Orcid 0000-0001-5242-2510 (R. Hobeck) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) 37 1) Smart contract code can represent a meaningful process that can be executed, and: 2) Process execution data of smart contract code can be extracted from a blockchain and its analysis with process mining techniques can be valuable. Would these assumptions not hold, process mining on blockchain data would not be possible. That results in the research question: RQ1) Is applying process mining techniques to blockchain application data feasible and useful? To achieve that, we will make event data available by extracting it from the blockchain, for which queries and third party solutions already exist. In a second step, we intend to answer: RQ2) How can existing tools for extracting event data from blockchains be extended benefiting event log quality? Foremost, existing solutions currently are limited to smart contracts with known addresses, while it is an open challenge to identify temporarily invoked contracts and extract data they logged. For logging, these temporary nested contracts go unnoticed. Primary suspect for the identification of invoked contracts is the transaction receipt that is stored on-chain to log called contract’s addresses. Open to research remains extracting the contract code and specifying which event data shall be logged automatically - a task that so far has been manual. Tied to nested logging, is the logging format. In a nested logging scenario, a clear separation of traces, vital for logging in XES format, may not be suitable. Hence, we plan to explore shifting to OCEL or event knowledge graphs to better accommodate documenting blockchain application event logs. Based on RQ1 and RQ2, we will move from a post-mortem (historic) analysis of event logs into an online-setting in RQ3) What does a process monitoring solution for blockchain applications look like that benefits application designers and users? We will conceptualize a process monitoring approach tailored to the blockchain environment. The approach will be designed with respect to the following design challenges: event streaming in a blockchain environment, blockchain-specific process KPIs, and technical blockchain peculiarities, e.g., block inclusion for ledgers with different consensus protocols. Generally, this dissertation will be written in the spirit of Design Science Research, taking into account the design guidelines [6]. Our research endeavors will mostly be viewed as problems that can be solved by designing and developing solutions which need to be demonstrated and evaluated. The approaches will be implemented and evaluated using existing technologies such as [7, 8]. The evaluations will take into account previously defined requirements against which the implementations are checked. 3. Related Work Several prior contributions with related content have been published in the past and will be contextualized briefly. Data Extraction. Process execution data of blockchain-based applica- tions that is stored on-chain can generally be accessed through native APIs or language-specific API wrappers. The extraction of the data and its provision in an event log, however, is not an easy task [9] and was subject to prior research [10]. Most notably the tool Ethereum Logging Framework (ELF) offers a configurable logging solution for the Ethereum blockchain [11, 8]. Utilizing blockchain application data. Extracted blockchain data has been put to use for validating smart contracts on Hyperledger Fabric [12], auditing blockchain applications on the Ethereum [13], analyzing transactions on Ethereum without focusing on specific DApps [14], and using process mining on blockchain-wide data [15] but no single process in particular. 38 Case studies. The suitability of process mining for data from classic data sources (ERP, CRM, etc.) has been covered in various case studies across industries [16]. In these industries process mining was applied to analyze processes from different perspectives including control flow, con- formance, drifts, and performance [17]. A study showing value in process mining on blockchain data has been added only recently as part of the dissertation [18]. Monitoring solutions. Monitoring of processes is based on the analysis of process execution data. Process monitoring is widely considered to be based on process mining techniques and divided in offline and online monitoring [1, p. 413]. Online monitoring requires event streaming [19] which has been subject of research [20]. An example of monitoring a choreography of process participants based on message-flow was provided by [21]. Furthermore, potential and challenges for monitoring business processes have been sketched previously [22]. References [1] M. Dumas, M. La Rosa, J. Mendling, H. A. Reijers, Fundamentals of business process management, Springer, 2018. [2] W. M. P. van der Aalst, Process mining: Data science in action, Springer, 2016. [3] J. Mendling, et al., Blockchains for business process management – Challenges and opportunities, ACM Transactions on Management Information Systems (TMIS) 9 (2018) 4:1–4:16. [4] X. Xu, I. Weber, M. Staples, Architecture for Blockchain Applications, Springer, 2019. [5] J. Recker, Scientific Research in Information Systems: A Beginner’s Guide, Springer Inter- national Publishing, 2021. [6] A. Hevner, S. March, J. Park, S. Ram, Design Science in Information Systems Research, MIS Quarterly 28 (2004) 75–105. [7] O. López-Pintado, M. Dumas, L. García-Bañuelos, I. Weber, Interpreted execution of business process models on blockchain, in: 2019 IEEE 23rd International Enterprise Distributed Object Computing Conference (EDOC), IEEE, 2019, pp. 206–215. [8] C. Klinkmüller, I. Weber, A. Ponomarev, A. B. Tran, W. van der Aalst, Efficient logging for blockchain applications, CoRR abs/2001.10281 (2020). URL: https://arxiv.org/abs/2001. 10281. a r X i v : 2 0 0 1 . 1 0 2 8 1 , accessed 2021-03-21. [9] C. Di Ciccio, et al., Blockchain-based traceability of inter-organisational business processes, in: Business Modeling and Software Design, 2018, pp. 56–68. [10] L. Moctar M’Baba, M. Sellami, W. Gaaloul, M. F. Nanne, Blockchain logging for process mining: a systematic review, HICSS (2022). [11] C. Klinkmüller, A. Ponomarev, A. B. Tran, I. Weber, W. M. P. van der Aalst, Mining blockchain processes: Extracting process mining data from blockchain applications, in: BPM Blockchain Forum, 2019, pp. 71–86. [12] F. Duchmann, A. Koschmider, Validation of smart contracts using process mining, in: Central Europ. Workshop on Services and their Composition, 2019, pp. 13–16. [13] F. Corradini, F. Marcantoni, A. Morichetta, A. Polini, B. Re, M. Sampaolo, Enabling auditing of smart contracts through process mining, in: From Software Engineering to Formal Methods and Tools, and Back, 2019, pp. 467–480. 39 [14] R. Mühlberger, S. Bachhofner, C. Di Ciccio, L. García-Bañuelos, O. López-Pintado, Extract- ing event logs for process mining from data stored on the blockchain, in: Business Process Management Workshops, 2019, pp. 690–703. [15] M. Müller, P. Ruppel, Process mining for decentralized applications, in: IEEE Int. Conf. on Decentralized Applications and Infrastructures, 2019, pp. 164–169. [16] R. Andrews, S. Suriadi, M. Wynn, A. H. M. ter Hofstede, S. Rothwell, Improving pa- tient flows at St. Andrew’s War Memorial Hospital’s emergency department through process mining, in: Business Process Management Cases: Digital Innovation and Business Transformation in Practice, 2018, pp. 311–333. [17] L. Reinkemeyer, Process mining in action: Principles, use cases and outlook, Springer- Verlag, Berlin, 2020. [18] R. Hobeck, C. Klinkmüller, H. D. Bandara, I. Weber, W. van der Aalst, Process mining on blockchain data: A case study of Augur, in: BPM’21: International Conference on Business Process Management, Rome, Italy, 2021, pp. 306–323. [19] J. Gama, Knowledge discovery from data streams, CRC Press, 2010. [20] A. Burattin, Streaming process mining, Process Mining Handbook (2022) 349. [21] I. Weber, X. Xu, R. Riveret, G. Governatori, A. Ponomarev, J. Mendling, Untrusted business process monitoring and execution using blockchain, in: Int. Conf. on Business Process Management, Rio de Janeiro, Brazil, 2016. [22] C. Di Ciccio, G. Meroni, P. Plebani, On the adoption of blockchain for business process monitoring, Software and Systems Modeling 21 (2022) 915–937. URL: https://doi.org/10. 1007/s10270-021-00959-x. doi:1 0 . 1 0 0 7 / s 1 0 2 7 0 - 0 2 1 - 0 0 9 5 9 - x . 40