ProcessGold: Enterprise Process Mining 1st Roeland Scheepens 2nd Erik-Jan van der Linden Head of Research Non-Executive Director ProcessGold ProcessGold Eindhoven, The Netherlands Eindhoven, The Netherlands roeland.scheepens@processgold.com erik-jan.vanderlinden@processgold.com Abstract—The ProcessGold Enterprise Platform offers innovative or event sliders, apply different process mining algorithms, features that are required for mature, scalable, and secure use filter, or select different case ID’s for multidimensional process of Process Mining throughout large enterprises. Processes are mining, they are making different selections from the full event displayed using process graphs, which require stable layouts to ensure continued use by business users. The platform uses a novel data. Round trips to databases then hamper performance, and graph layout algorithm, called Tracy, that ensures this stability hence in-memory architectures have become a standard feature over user interaction. of enterprise process mining. Data selection, visualization, user Relevance of enterprise features in general is evident from the navigation, as well as process mining algorithms all run in- dispersion of process mining in enterprises, from use cases, and memory. from broad adoption of the Platform. Index Terms—Process mining, Enterprise process mining, sta- All features mentioned are part of the ProcessGold En- ble graph layout terprise Platform. For a full overview of features see the ProcessGold website [Pro] and sources below. I. I NTRODUCTION III. N OVEL FEATURE : T RACY PROCESS GRAPH LAYOUT Since its inception around the turn of the century [AGL98], the first decade of Process Mining [Aal16] saw the de- Besides these, by now common, features that meet require- velopment from academic use and academic tools, such as ments for enterprise use, graph layout has become a topic ProM [vDdMV∗ 05], to stand-alone tooling for use by individ- which requires attention to ensure process mining is contin- ual analysts in businesses. The second decade, since around uously usable by business users. The ProcessGold Platform 2010, has been characterized by increasing use of and interest includes a novel algorithm for process graph layout, Tracy1 . in process mining of non-technical business users in large A full technical paper about Tracy is available [MSW19]. enterprises. A. Process layout II. C OMMON FEATURES Since its inception, Process Mining has mainly relied on exist- The above development has led to a number of properties that ing algorithms for graph layout [GKN15]. However, these only are by now common in Process Mining platforms. use the graph topology to compute a layout and do not make Firstly, Process Mining tools have become part of the use of the particular properties of processes. For example, system landscape of enterprises, which requires server- and most processes have some sort of a main path [RBRB06], web-based architectures, and full compliance with standard [AEHK10] that is often the most frequent behavior. To fit technical enterprise requirements such as data preparation on with intuitive semantics of process graphs, this path needs to board, collaborative development, controlled deployment, and be centralized in the layout and depicted as a straight path security. through the graph. While existing techniques fail to properly Secondly, dispersion to non-technical business users re- do this [GKN15], the Tracy approach implements this. quires easy point-and-click applications for everyday use, and hence Process Mining has naturally integrated with Data Ana- B. Stable process graphs lytics. This way, users have full access to required functionality Interaction with process graphs, required by business users, from the field of Data Analytics. This includes any kind of by means of sliders, filtering, drill-down and alike leads to business graphic besides process graphs, and full navigation changes in the layout of process graphs. Users build mental options for selection, filtering, drill-down, etc. [Kei02]. maps of these graphs. When the graph changes, users need Thirdly, for non-technical users, Process Mining is not their to fit their mental model with the altered graph. Conventional primary task, and they require an optimal balance between algorithms [GKN15] lead to smaller or larger changes, and time they spend working with the application on the one hand require users to at least put in significant cognitive effort and business value on the other hand. In practice, users expect to adapt their mental map, but in the worst case even to response times they experience in other web-based applica- rebuild their mental map altogether. Graph layout stability tions. This performance requirement needs to be aligned with the need for flexibility: when users apply, for example, edge 1 Patent pending. A B C D Fig. 1. Graphs A and B show the same process as graphs C and D, respectively. The layouts in A and B are computed by the industry standard [GKN15], where the actual process is poorly represented, while graphs C and D are computed by our novel graph layout algorithm where the process is easy to follow. Graphs B and D are obtained after removing the edge highlighted in red from A and C. As we can see, B differs significantly from A; especially note how nodes 1 and 2 swap vertically. Consequently, the mental map of the user is lost. On the other hand, C and D barely differ, preserving the mental map of the user. helps preserve the mental map of the user [PHG06], [ZKS11], Stability vs quality: Besides stability, we also require lay- significantly reducing cognitive load. outs to be of high quality. Stability and quality are two Therefore, we need to preserve the mental map when a conflicting requirements: graph layout stability helps preserve new graph layout is displayed. The three models of Misue the mental map of the user, but also restricts the graph layout [MELS95] to represent the mental map are our starting point algorithm in optimizing layout quality. A way of dealing with to accomplish this. These models state that a layout adjustment this conflict is to allow somewhat larger changes to the layout should preserve the direction of node n to node m for each pair and to make use of animation and transitioning as a secondary of nodes n and m, that nodes that are close together should approach to mental map preservation. The combination of remain close together, and that graphical objects in a region layout stability and transitioning provides the best approach should stay in that region. to preserving the mental map. In online dynamic graph drawing [BBDW17], graph layouts C. Graph layout: Tracy need to be computed for a sequence of graphs without knowing Tracy is a novel stable layout algorithm for process graphs the full sequence from the beginning. In contrast, in offline that computes layouts that intuitively represent the semantics dynamic graph drawing [BBDW17], the whole sequence of of the process. Our algorithm is based on the Sugiyama graphs is known up front. Interactive Process Mining is offline framework [STT81] but includes: in the sense that we know which graphs we can potentially encounter, since any filtered graph will be a subset of the data • A novel ranking algorithm; found in the event log, but it is also online because we do not • A novel order constraint computation algorithm; know the exact sequence of graphs beforehand. The Tracy- • A novel crossing minimization algorithm. approach presents a novel way to take this into account, and In Figure 1, we present a comparison of Tracy and conven- results in layouts that are stable for the user. tional layout. Tracy is part of the production version of ProcessGold [AGL98] AGRAWAL R., G UNOPULOS D., L EYMANN F.: Mining pro- starting from version 16, and will be publicly announced at cess models from workflow logs. In International Conference on Extending Database Technology (1998), Springer, pp. 467– ICPM 2019. 483. [BBDW17] B ECK F., B URCH M., D IEHL S., W EISKOPF D.: A taxon- IV. M ATURITY omy and survey of dynamic graph visualization. Computer Graphics Forum 36, 1 (Jan. 2017), 133–159. The ProcessGold Platform is a mature enterprise Process [GKN15] G ANSNER E. R., KOUTSOFIOS E., N ORTH S.: Drawing Mining Platform. Customers of the platform include global graphs with dot. Tech. rep., Graphviz, Jan. 2015. consulting firms, and global enterprises with over 1 billion [Kei02] K EIM D. A.: Information visualization and visual data mining. IEEE transactions on Visualization and Computer euro in annual turnover. Graphics 8, 1 (2002), 1–8. A further indication of maturity is the business value that [MELS95] M ISUE K., E ADES P., L AI W., S UGIYAMA K.: Layout customers get from implementing and using applications on adjustment and the mental map. Journal of Visual Languages & Computing 6, 2 (June 1995), 183–210. the platform. We present two use cases which illustrate this. [MSW19] M ENNENS R. J., S CHEEPENS R., W ESTENBERG M. A.: A stable graph layout algorithm for processes. Eurographics A. Use cases Conference on Visualization (EuroVis) (June 2019). Telecom provider: A telecom provider is faced by increas- [PHG06] P URCHASE H. C., H OGGAN E., G ÖRG C.: How important is the ”mental map”? – an empirical investigation of a ing competition on price, requiring the need to reduce cost. dynamic graph layout algorithm. In Graph Drawing (Sept. A process mining effort identifies and resolves a number 2006), Lecture Notes in Computer Science, Springer, Berlin, of process issues, in particular related to manual exception Heidelberg, pp. 184–195. [Pro] P ROCESS G OLD: ProcessGold - business intelligence and pro- handling, communication with strategic suppliers. This leads cess mining platform. https://processgold.com/en/. Accessed: to 20 percent personnel costs savings, and more broadly to 1+ 2019-23-02. M cost savings in general, which equals 300 percent return on [RBRB06] R INDERLE S. B., B OBRIK R., R EICHERT M. U., BAUER T.: Business process visualization – use cases, challenges, solu- investment of the effort within 3 months. tions. In Proceedings of the Eighth International Conference Insurance company: An insurance company is faced with on Enterprise Information Systems (ICEIS’06): Information customers that are dissatisfied by untimely customer contacts. System Analysis and Specification (May 2006), INSTICC PRESS. Firstly, 71 percent of regular customer questions are not [STT81] S UGIYAMA K., TAGAWA S., T ODA M.: Methods for visual responded to within the agreed service level, two days. A understanding of hierarchical system structures. IEEE Trans- process mining effort identifies incorrect routing of questions, actions on Systems, Man, and Cybernetics 11, 2 (1981), 109– 125. leading to delays. Once resolved, this dramatically improves [vDdMV∗ 05] VAN D ONGEN B. F., DE M EDEIROS A. K. A., V ERBEEK the timeliness. Secondly, in case of disease of a person having H. M. W., W EIJTERS A. J. M. M., VAN DER A ALST W. a pension insurance, the company witnesses highly negative M. P.: The prom framework: A new era in process mining tool support. In Applications and Theory of Petri Nets 2005 comments from relatives on social media. These need to be (Berlin, Heidelberg, 2005), Ciardo G., Darondeau P., (Eds.), prevented for their negative impact on the reputation of the Springer Berlin Heidelberg, pp. 444–454. company. In the process mining effort, the 15 percent of cases [ZKS11] Z AMAN L., K ALRA A., S TUERZLINGER W.: The effect of animation, dual view, difference layers, and relative re- where communication is not timely handled are identified layout in hierarchical diagram differencing. In Proceedings as well as the root causes, which are found in hand-over of Graphics Interface 2011 (GI’11) (2011), Canadian Human- over cases and consulting secondary systems. By-catch is Computer Communications Society, pp. 183–190. identification of 11 percent cost reduction. B. Scale Since 2016, customers and partners have acquired licenses for use of the ProcessGold Platform for 10.000+ potential users. ACKNOWLEDGMENT Research and development presented in this demo paper was carried out at ProcessGold. The authors acknowledge all colleagues at ProcessGold that contributed to this work, as well as academic partners at the Universities of Eindhoven (the Netherlands) and RWTH Aachen (Germany) for productive and long-lasting cooperation. R EFERENCES [Aal16] A ALST W. M. P. V. D .: Process Mining: Data Science in Action. Springer, Apr. 2016. [AEHK10] A LBRECHT B., E FFINGER P., H ELD M., K AUFMANN M.: An automatic layout algorithm for BPEL processes. In Proceedings of the 5th International Symposium on Software Visualization (New York, NY, USA, 2010), SOFTVIS ’10, ACM, pp. 173–182.