=Paper=
{{Paper
|id=Vol-1920/BPM_2017_paper_172
|storemode=property
|title=CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics
|pdfUrl=https://ceur-ws.org/Vol-1920/BPM_2017_paper_172.pdf
|volume=Vol-1920
|authors=Gaël Bernard,Periklis Andritsos
|dblpUrl=https://dblp.org/rec/conf/bpm/BernardA17
}}
==CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics==
CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics Gaël Bernard1 and Periklis Andritsos2 1 University of Lausanne, Faculty of Business and Economics (HEC), Switzerland gael.bernard@unil.ch 2 University of Toronto, Faculty of Information, Canada periklis.andritsos@utoronto.ca Abstract. Customer Journey Mapping (CJM), is an emerging area of research tackling issues related to customer behavior and user trajectories when consuming a service. The increasing complexity of the service in- dustry makes this type of tools popular amongst practitioners. However, to date, it is not clear how a CJM can be used to depict hundreds or thousands of customer journeys. Inspired by process discovery techniques – borrowed from Process Mining – we present CJM-explorer (CJM-ex). CJM-ex is a web interface that uses hierarchical clustering and statistical indexes to allow interactive navigation, with or without a-priori informa- tion, through numerous journeys stored in standard event log formats. The exploration of the underlying journeys can be done in the whole set of data available or driven by user goals in order to examine events and patterns in specific areas of interest. Keywords: customer journey mapping, process mining, customer jour- ney analytics, hierarchical clustering, sequence mining 1 Motivations In order to deliver great service, companies need to have an understanding of the quality of customer experience at an end-to-end level, [5,7]. The ever growing amount of services offered to users for consumption has made the ability to understand their behavior very important, [8]. Similarly important is the knowledge extracted by the increasing number of ways organizations interact with their customers; e.g., a customer might visit a physical store, purchase a product online, and provide feedback on social media. As a response, new customer-centric approaches have surfaced. The customer journey map (CJM) is an example of such new techniques. CJMs allow for better understanding of a customer’s end-to-end experience when using a service by mapping any interactions with the company (called touchpoints) on a map that ultimately contributes to better understanding and serving customer needs. In our previous work, [4], we discussed the benefit of bringing customer journey mapping and process mining together by proposing a CJM model that can be used with process mining techniques. More specifically, CJM-ex aims at providing a solution to explore numerous customer journeys at the same time. Similar to discovery techniques used in process mining, our algorithm takes event logs as input, without using any a-priori information, [1]. However, instead of outputting a business process model (BPM), we display the journeys onto a CJM. For a complete definition of process mining and event logs, the reader is referred to the well known Process Mining Manifesto, [2]. Visualizing event logs on CJMs, instead of BPMs, exhibits two interesting features. First, CJMs focus more on personal customer activities (e.g., by incorporating customer emotions), rather than the “internally-focus problem-solving approach” of BPMs, [3]. Second, contrary to BPMs, CJMs can incorporate customer journeys that are deemed exceptional behaviors, rather than removing them to increase model readability. Despite these interesting features, representing many customer journeys onto a CJM in an intelligible manner remains a challenge. Current research typically limits the number of journeys to be compared to less than ten, making the overall process relatively straightforward. However, we argue that companies in the service industries tend to deal with hundreds or thousands of journeys. To overcome this challenge and identify different areas of interest, a hierarchical clustering algorithm is employed to segment the original data. The hierarchical nature allows for a top-down navigation of automatically generated groups of similar journeys. Once the clusters are formed, CJM-ex is able to leverage the contextual information that comes along a typical customer journey such as the customers’ characteristics, or the emotions, [4]. It does so in two different ways. First, we employ statistical indexes in order to explain why the different clusters were generated. Second, we let the users define their own exploration goals, making CJM-ex the first goal-oriented tool that allows analysts to set a-priori goals to guide their journey exploration. We implemented CJM-ex, which aims to: 1) show how numerous event logs can be displayed onto CJMs; and 2) let users navigate into these journeys. The next section introduces our tool, while the third section highlights the key parts of its technical implementation. We conclude by providing an outlook. 2 CJM-ex Representative Journey … abz … The main objective of CJM-ex is to let users upload acbz … and explore their own dataset using a customer journey aecbz acgbz map layout. To limit the number of journeys displayed Actual Journey on the same CJM and allow their intuitive exploration, we took a hierarchical clustering approach, as illus- Fig. 1. Hierarchical clus- tering of journeys trated in Fig. 1. Each letter represents an activity and the tree is built bottom up by merging the activities that are most similar at each iteration. By default, our hierarchical algorithm uses a proximity measure that takes into account the order of activities and is a variant of the Jaccard similarity based on shingles, [6]. In our clustering tree (or dendrogram), the journeys seen in the event logs are at the leaf level and as they get merged they form “representative” journeys. A representative journey is a single pattern of activities whose purpose is to summarize the patterns contained in a cluster. Because the representative journeys at the top of the tree summarize many – potentially distant – journeys, it will tend to show only few activities shared by many. In contrast, the representative journeys closer to the leafs will show more details. Borrowing a cartographic metaphor from [1], the first layers show general patterns and hide less important activities – like a world map would omit small cities. However, our application allows a drill-down into ‘countries’ of interest (i.e., pattern of activities), which would redirect to new CJMs where the previously hidden ‘cities’ (i.e., omitted activities) will be shown. CJM-ex is accessible at http:// customer-journey.unil.ch, where we 1 2 also provide a screencast explain- ing its usage. The screenshot visi- ble in Fig. 2 points out three views that are available to navigate the clusters. Each of these views full- fills specific objectives. First, the 3 CJM view Ê shows journeys that are in the same cluster. This repre- sentation allows to easily compare the pattern of activities. Second, the tree Ë displays the hierarchi- Fig. 2. Interface pointing to three views: Ê CJM, cal structure of the journey clus- Ë tree and, Ì textual boxes. ters – useful in providing a holistic view of the clusters and where we currently are. Third, a box per cluster Ì provides a convenient means to display statistical indexes that we named “salient characteristics”. The salient characteristics is the top 5 results of a chi-square test applied on all the contextual information. For instance, if at a global level (the entire dataset) the number of women is equal to the number of men, it might be surprising to find a cluster with large majority of women. Therefore, this information might come up as one of the top 5 salient characteristics. Moreover, the user might be interested in specific characteristics occurring during the journey. For this reason, we allow user-defined goals. For instance, one might be interested in journeys that started by the activity “attending class” experienced by young people. The top part À of Fig. 3 displays the settings, while the bottom part Á shows that some branches of the tree are interesting with regards to the goal (red “hot” area at the top). Hence, our application allows navigation without using any a-priori information, but also setting Fig. 3. À goal settings, Á navigation goals, and guidance by the resulting colors. impact on the tree. Finally, when moving from one view to the others, the three views are updated synchronously, allowing a smooth exploration amongst journeys. 1,2,3,4,5,6,7,8,9 ab 1. Hierarchical 2. Recursive 1,2,3,4 5,6 7,8,9 3. Finding abc abxy abz clustering 1 2 3 4 5 6 7 8 9 cuts 1,2 3,4 Representative abcd abce Fig. 5. Data preprocessing phase 3 Implementation CJM-ex is build around four main elements: 1) a web interface; 2) the XES-parser; 3) Hcluster; and 4) a data warehouse. We will describe the main parameters and choices we made for each of them. Web interface. The web interface leverages bootstrap3 , jquery4 and d3js5 to provide a user-friendly interface to upload and navigate journeys. Both the CJM view and the tree view are implemented in d3js. The CJM view is our own implementation, while the tree is an adaptation of existing code6 . XES Parser. CJM-ex works with event logs. More specifically, we leverage the XES (eXtensible Event Stream) standard born within the process mining taskforce. The XES Parser is a Java implementation that encapsulates the OpenXES library7 to parse XES file. Use of this open source software ensures that our application is strictly compatible with the XES standard. Hcluster. Hcluster is a python implemen- tation containing the three steps illustrated in Fig. 5. The first one is the hierarchical cluster- ing implemented using Scipy8 . Two parameters should be provided as inputs: the distance mea- sure between event sequences and the methods for calculating the distance between clusters. They can both be chosen by the user when up- Fig. 4. Preview of the parameters loading a dataset (see Fig. 4). While further when uploading a dataset research is needed to understand why these dif- ferences exist, it seems that using shingles, [6], as a distance metric provides a more intuitive way to navigate through journeys. Once the hierarchical cluster is formed, the next step consists of cutting the clustering to form layers. A layer is a set of predetermined number of journeys that will be grouped together and will ultimately appear on the same CJM. To achieve this, we developed an algorithm that recursively cuts the dendrogram returned by Scipy. A small number of journeys will lead to a simple CJM that is easy to visualize, but in more complex tree structures (i.e., trees with larger height). Finally, the last step consists of finding the representative journey using a frequent sequences mining algorithm9 . Data Warehouse. Each dataset is saved in its own database schema designed as a star schema. Due to lack of space, we do not include its full schema, which 3 http://getbootstrap.com/ 4 https://jquery.com/ 5 https://d3js.org/ 6 http://bl.ocks.org/robschmuecker/7880033 7 http://www.XES-standard.org/openXES/start 8 https://docs.scipy.org/doc/scipy/reference/generated/scipy.cluster.hierarchy.linkage.html 9 https://github.com/bartdag/pymining/blob/master/pymining/seqmining.py stores all the information required to use the application (e.g., clusters, journeys, events) as well as some precomputations. For instance, we count the number of occurences for each characteristic at each cluster, so the goals and the salient characteristics can be retrieved quickly. Altogether, the parameters visible in Fig. 4 allow users to explore the exact same dataset from different perspectives. 4 Discussion and outlook With CJM-ex, we have demonstrated that many journeys can be displayed onto CJM in an intelligible and efficient manner, offering an alternative to a BPM representation. As Gartner highlighted, CJMs should be used to complement, but not to replace, BPMs, [9]. By leveraging a standard born within the process mining taskforce (i.e., XES) and by mimicking a typical process mining activity (i.e., discovery), we bring these techniques closer together. However, further research is required to fully understand how they can complement each other. This is a call to the process mining and business process management communities to consider CJMs as an integral part of the process management toolkit. References 1. van der Aalst, W.: Process Mining: Data Science in Action. Springer (2016) 2. van der Aalst, W., Adriansyah, A., De Medeiros, A.K.A., Arcieri, F., Baier, T., Blickle, T., Bose, J.C., van den Brand, P., Brandtjen, R., Buijs, J., et al.: Process mining manifesto. In: International Conference on Business Process Management. pp. 169–194. Springer (2011) 3. van den Bergh, J., Işik, O., Viaene, S., Helsen, E.: Re-positioning business process management: Exploring key capabilities for successful business transformation (2016) 4. Bernard, G., Andritsos, P.: A process mining based model for customer journey mapping. In: Proceedings of the Forum and Doctoral Consortium Papers Presented at the 29th International Conference on Advanced Information Systems Engineering (CAiSE 2017) (2017) 5. Bernard, G., Boillat, T., Legner, C., Andritsos, P.: When sales meet process mining: A scientific approach to sales process and performance management. In: International Conference on Information Systems (2016) 6. Broder, A.Z., Glassman, S.C., Manasse, M.S., Zweig, G.: Syntactic clustering of the web. Computer Networks and ISDN Systems 29(8-13), 1157–1166 (1997) 7. Haugstveit, I.M., Halvorsrud, R., Karahasanovic, A.: Supporting redesign of c2c services through customer journey mapping. In: Service Design Geographies. Pro- ceedings of the ServDes. 2016 Conference. pp. 215–227. No. 125, Linköping University Electronic Press (2016) 8. Lemon, K.N., Verhoef, P.C.: Understanding customer experience throughout the customer journey. Journal of Marketing pp. jm–15 (2016) 9. Olding, E., Cantara, M., Robertson, B., Dunie, R., Huang, O., Searle, S.: Pre- dicts 2016: Business transformation and process management bridge the strategy- toexecution gap. Tech. rep., Gartner (November 2015), https://www.gartner.com/ doc/3173020/predicts--business-transformation-process