<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Professional use of Process Mining for analyzing Business Processes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Capgemini</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cluster TRAILS</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Reykjavikplein</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>KA Utrecht</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>A professional application of Process Mining has been established in the context of a methodology as defined by a consultancy firm. The results of the research show where in the context of consultancy Process Mining is used and how clients can benefit from expertise and standardized work.</p>
      </abstract>
      <kwd-group>
        <kwd>Process Mining</kwd>
        <kwd>Consultancy</kwd>
        <kwd>Business Analysis</kwd>
        <kwd>SEMBA</kwd>
        <kwd>Business Process Management</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>Reason for research</p>
      <p>
        Luftman et al. analyze IT management issues and multiple topics have been
identified and known to shift in importance over time [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ][
        <xref ref-type="bibr" rid="ref2">2</xref>
        ][
        <xref ref-type="bibr" rid="ref3">3</xref>
        ][
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Results by
Luftman and Ben-Zvi [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] shows that the topic of Business productivity and cost
reduction is the most important issue for C-level management in 2010. From the top 5
topics from 2010, three issues (Business productivity and cost reduction, IT and
Business alignment, IT reliability and efficiency) are presented in some form in the
BPI challenge 2013 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Process Mining allows for analysis of raw data sets to
discover process flows and analyze the important elements related to such flows [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
1.1.1
      </p>
      <p>Business productivity and cost reduction</p>
      <p>
        Business productivity is measured by Key Performance Indicators (KPI) which are
common in mature businesses, aligned to strategic and tactical goals and drive the
decision making processes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Common KPIs are constructed by evaluating data
against a benchmark value about input, output and throughput of business processes
and their related waste and outage.
      </p>
      <p>Having an excellent productivity performance with maximized effectiveness of
expenses on the operation of assets and employees allow for margin translates to a
maximization of profit in the case of for profit organizations. To achieve this
optimum state, it is essential to know what the operational performance is of the
complete- and sub-system of the organizational processes.
1.1.2 IT and Business Alignment</p>
      <p>
        Bridging the gap between Business and IT is one of the most challenging activities
for IT and Business professionals, it has been “…a top concern of IT managers for
almost 30 years” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Blum et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] researched the position of the information
manager, a role which is concerned with many of the issues as described by Luftman
et al. and concluded that the organizational maturity defines the position and
importance acceptance of higher management to solve such issues.
      </p>
      <p>
        The Business Analyst (BA) is the role that inhibits a set of competences that allow
BA professionals to close the gap [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Therefore, Business Analysts of Capgemini
present you these research results for the BPI Challenge 2013.
1.1.3 IT reliability and efficiency
      </p>
      <p>The case for 2013 in the BPI Challenge is based on information of an IT system
which is part of the IT department of Volvo [11] responsible for problem and incident
management in combination with a call center. Call centers are the de facto standard
for efficiency studies [12] and their performance is highly reliant on supporting IT
systems for providing applicable knowledge. Because the Volvo call center as subject
of analysis is the problem solving unit for incidents, reliability and efficiency are
applicable topics of research for this analysis.
1.2</p>
      <p>Aim
There are multiple aims for this BPIC ’13 research:
1. To position Process Mining in the collection of competences of Business</p>
      <p>Analysts in relation to Business Process Management
2. To position the research method characteristics in the context of Business</p>
      <p>Analysis
3. To provide proof that Process Mining is beneficial in a methodological approach
of analysis in context of Business and IT gaps and the SEMBA method
4. To provide insight in how Business Analysis is applied and what next steps are
with Process Mining outcomes.</p>
      <p>The aims as presented are positioned by answering the questions as stated by Volvo
[11], where we assume that the author is the requestor.
1.3</p>
      <p>Added value</p>
      <p>There are two sides for the added value of this paper. For professionals, this paper
shows how Process Mining and facilitating tools can be applied and how the rationale
is defined when handling complex customer cases. For Science, this paper allows to
relate the insight of the requirements of consulting professionals for non-standard
expertise and how Process Mining is validated as an important method for Business
Analysts and their profession.
1.3.1</p>
      <p>Professionals – positioning in practice</p>
      <p>The consulting profession is a field of expertise that is highly reliant on academic
work and evidence driven. Customers grow in their insights and requirements and
demand factual decision making solutions, from one-off decisions to continuous
business management tooling.</p>
      <p>Capgemini established the Structured Expert Method for Business Analysis
(SEMBA) as method for Business and IT analysis. SEMBA consists of four phases
(Focus &amp; Direction, As-Is Analysis, To-Be Design and Migration Design) and
multiple streams (Business Context, Business Process, Information, Application
Landscape and Requirements Engineering), depicted in figure 1. SEMBA is
established with predefined deliverables, which allow for a consistent, predictable
outcome of complex analysis. The method is a standard, however, the content and
interpretations are customer tailored. The combination of evidence driven tools with a
standardized methodology of analysis resulting in predictable delivery is developed
for client satisfaction [13].
1.3.2</p>
      <p>Science – positioning in literature</p>
      <p>The BPI Challenge is a great way to present and combine science with application
in practical settings. Where the BPI Challenge is a challenge for academics and
professionals, this paper is a presentation of analysis and professional positioning for
anyone interested in the field of process mining and business process management. As
Business Analysts bridge the gap between Business and IT, this paper bridges the gap
between science and business by applying research findings.
2
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Research Design</title>
      <p>The case positioned in SEMBA</p>
      <p>For this analysis and the deliverable (this paper) the limitations are based on time
restrictions and client contact. The time restriction is 18 hours and there cannot be
client interaction because of the design of the BPI challenge.
2.1.1 Focus and Direction</p>
      <p>In the focus and direction phase, there are seven steps followed. As SEMBA is a
Capgemini proprietary approach to analysis, not all details are presented in this case.
The common result is that the problem is defined as the combination of questions as
stated in the Volvo case description in the context of the Volvo IT department related
to incident and problem management with the use of a call center in multiple
countries. The client expects answers to the stated questions with the use of the
provided input: datasets and descriptions of the dataset and the system where the
dataset is obtained from.</p>
      <p>The approach is described from paragraph 2.2, normally this phase describes the
approach and scope of the activities. The outcome of this phase is a formal and
exhaustive overview of what is to be done, who does it, how activities are done, when
activities take place, where and why. The scope for this research has no objective to
capture requirements, therefore the stream Requirements Engineering will be left out
of this paper.
2.1.2</p>
      <p>As-Is Analysis</p>
      <sec id="sec-3-1">
        <title>Business Context</title>
        <p>For the BPI Challenge 2013, the As-Is situation is established for some of the
streams.</p>
        <p>The business context is related to the IT department of Volvo. The unit of analysis
is the functioning of the VINST system, its users and the registrations in the system
across a limited timeframe. The VINST system is used globally by multiple support
organizations.</p>
        <p>Due to limitations in report size this item is kept condensed and refer to the VINST
context description [11] and VINST user guide [XX] for more detail.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Business Processes</title>
        <p>The business processes for analysis are related purely to the registration of
activities within the VINST system. There are no satellite systems or procedures in
scope. The higher hierarchy process could be captured under “Incident and Problem
Management”. The classes of activities can be defined as Incident solving and
Problem solving activities. Activities can be handled by first, second and third line
support employees. Support employees have each a specific area of expertise related
to technology.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Information</title>
        <p>Information is stored and transformed in the VINST System. Information is related
to the employees working for Volvo on a global scale, their position in the
organizational hierarchy, their expertise, products and geographical position.
Furthermore, information is assumed to be present in the VINST system which
enables knowledge transfer for storing, retrieving and adding solutions to problems
related to products and services.</p>
        <p>There is no formal description available other than the VINST system. No
peripheral system, interface or other element is mentioned in the context other than an
e-mail facility.
2.1.3</p>
        <p>To-Be Design</p>
        <p>The phase To-Be design is not applicable for this research. For applicable cases,
the outcomes and decisions of the As-Is phase are used to create at least one To-Be
design. The design elements can be prescreened to design only one most feasible
solution, or multiple scenarios are considered. In case of multiple scenario’s, the
individual scenarios are scored using multiple optional activities such as
MultiCriteria Decision Analysis (MCDA) techniques covering related aspects or
Simulation for business performance, for example. The outcome is a so called Gap
Analysis which covers the difference between the As-Is situation and the To-Be
design(s). The Gap Analysis covers each of the aforementioned streams: Business
context, Business Processes, Information and Application Landscape.</p>
        <p>
          Creation of To-Be designs can be accelerated by usage of reference models such as
the Supply Chain Operations Reference (SCOR®) [
          <xref ref-type="bibr" rid="ref11">17</xref>
          ], Process Classification
Framework by APQC [
          <xref ref-type="bibr" rid="ref12">18</xref>
          ], Frameworx [19] and the Banking Industry Architecture
Network (BIAN) [20].
2.1.4
        </p>
        <p>Migration Design</p>
        <p>The phase “Migration Design” is not applicable for this research. For applicable
cases, the outcome of the To-Be design phase is used to review the methods of how
the As-Is situation can be migrated to the To-Be design. Common scenarios are i.e.
Big-Bang, Pilot location, Linear migration and Exponential migration amongst others.
2.2</p>
        <p>Research method</p>
        <p>As discussed in chapter 2, the basic steps for Process Mining are followed as
described by van der Aalst et al., to cover the exploratory element of this research.
Then professional insights on what to analyze or ask the problem owner in a next
activity to proceed towards a To-Be phase or suggestions for improvement.
2.2.1</p>
        <p>Research design</p>
        <p>Because of the characteristics of Process Mining mainly consisting of exploratory
research, the limited interaction for research by the researchers, the data type being
Quantitative and a setting which resembles a Laboratory, the research method is
determined as Non-reactive research, as presented in table 1.
Method
Action Research
Case Study
Experiment
Non-reactive
Survey</p>
        <p>Setting
Field
Field
Laboratory
Laboratory
Field</p>
        <p>Data Type
Qualitative
Qualitative
Quantitative
Quantitative
Quantitative</p>
        <p>
          Researcher Role
Active
Passive
Active
Passive
Passive
Process mining for processes is mainly exploratory research [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. First, the researcher
needs to get a feeling for what the data represents. Second, assumptions and
statements about the dataset need to be stated to test which part of the data is relevant
for the desired answer. The two aforementioned elements are attained iteratively by
doing small experiments and testing.
        </p>
        <p>For the research on the stated (sub)question, three basic topics will be stated:
Research scope, Filters used and the results of the research with optional elaboration
for each of the topics.</p>
      </sec>
      <sec id="sec-3-4">
        <title>Research scope</title>
        <p>The research scope limits the unit of analysis to least possible number of attributes
to consider with the relevant subset of the data. The research scope is limited through
some elements: the dataset, the assumption, the method and a threshold. The dataset
element shows which dataset is used. The assumption is the description of which
assumption(s) would lead to the right subset of the data. The method element
describes how the assumption is translated into the subset. The threshold is set for
limiting the results as presented for this research.</p>
      </sec>
      <sec id="sec-3-5">
        <title>Filters used</title>
        <p>The filters used give a description on how the tool was set and which settings were set
to obtain the subset results.</p>
      </sec>
      <sec id="sec-3-6">
        <title>Results</title>
        <p>The results show in either figure or table form the results using the aforementioned
limitations, settings and scope.
2.2.3</p>
        <p>Explanatory research</p>
        <p>The explanatory element in the research is highly limited, due to no client
interaction, no strategic and tactical information about the company and no baseline
information about performance or access to operational teams and systems.</p>
        <p>Possible explanations will be provided as suggested research topics based on
previous commercial engagements of the researcher. These explanatory contents are
2.4</p>
        <p>Tooling
2.4.1 ProM 5
2.4.2 ProM 6
not tested to be applicable in the Volvo situation and should be tested with a proper
hypothesis which is refined by client interaction.
2.3</p>
        <p>Data usage</p>
        <p>Provided data sets Incidents [14], Open Problems [15] and Closed Problems [16].
Sets are provided in the XES data format, however for this research the prepared
combined dataset by Fluxicon is used for the Disco tool [22]. Because a non-primary
source is used for the data, a comparison has been run between exports of the
Fluxicon dataset and the provided XES dataset. There have not been found any
inconsistencies.
2.3.1</p>
        <p>Assumptions</p>
        <p>On the topic of data, there are many possible issues resulting in an incomplete or
sometimes unusable dataset. Because of the nature of the BPI Challenge and the
available prepared data, the assumption is that the dataset is fit for research purposes.</p>
        <p>In this paragraph multiple toolsets for analysis of the process mining category are
discussed. Four software candidates are discussed about features and applicability for
use in this research case.</p>
        <p>ProM [23] is the acronym which stands for Process Mining. The tool is open
source and mainly aimed at researchers and scientific application. It is a collection of
custom written plugins for various insights that can be obtained from datasets.
Version 5 is the last version that has a certain interface which is more complex but
powerful for the experienced user.</p>
        <p>ProM 6 [23] is a continuation of the ProM application which has been overhauled
on the UI and activity design so analysis is more straightforward and entry-user
friendly. The package is a platform which can be upgraded with multiple plug-ins for
several types of analysis depending on the requirements of the user.
2.4.3 Fluxicon Disco</p>
        <p>Fluxicon is the company which creates the commercial tool Disco for process
mining analysis of datasets [22]. Disco is capable of delivering quick analysis results
on desktop computers and is optimized for the areas of process discovery and a set of
statistical overviews. It has multiple options to filter data into subsets and quickly trim
sets for specific analysis.
2.4.4 Perceptive Process Mining</p>
        <p>Perceptive is the company which creates the product Perceptive Process Mining
(PPM) for analysis of datasets [24]. PPM is capable to analyze datasets in both social
network and process flow methods, using cloud technology to provide performance
beyond desktop computers. The tool is powerful and feature rich but requires more
experienced researchers to use the tool to its maximum effectiveness.
2.4.5</p>
        <p>Tool selection</p>
        <p>The tool(s) will be selected based on the availability of the tool, the user
friendliness and timely analysis results whilst working with the tool.</p>
        <p>Based on the access to the tool, Perceptive Process Mining is not used, it would
require a license or accredited access to the tool for analysis. Due to time restrictions
the researcher did not contact Perceptive to consider this opportunity.</p>
        <p>Based on previous experiences with ProM 5 and 6, the applications are not used for
this research.</p>
        <p>The research tool for this paper is Disco by Fluxicon, the demo product with the
prepared dataset made available.
2.5
2.5.1</p>
        <p>Process discovery methodology</p>
        <p>Social Network Analysis</p>
        <p>Social network analysis is a representation of the dataset which uses the people or
departments as the unit of analysis instead of the events. This allows for another
dimension of outlier and deviant activity analysis.
2.5.2</p>
        <p>Process Network Analysis after Process Discovery</p>
        <p>Process Network analysis is the analysis of sequential events that form some sort
of network based on the number of similar cases and flows of events. The flow of
events is constructed using Process Discovery, in this research based the fuzzy mining
technique. Some tools allow for automated generated models to be derived from
datasets for further use. There is a limitation on the discovered processes in such
forms, as events are the result of a process, not the process itself.
2.5.3</p>
        <p>Methodology selection</p>
        <p>Due to the restricted timeframe as discussed in paragraph 2.1 and the dismissal of
ProM, Social Network analysis will not be applied for this research. Process Network
analysis will be applied with the notion that the discovered processes might not be the
processes but the sequences of end-states per process step.
3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>Questions
In this paragraph, the questions are answered in the described methodology from
chapter 2.
3.1.1 Q1.1 Push to front: For what Products is the push to front mechanism
most used and where not?</p>
      <sec id="sec-4-1">
        <title>Research scope: For what Products Push to Front is used</title>
      </sec>
      <sec id="sec-4-2">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name
Attribute</p>
        <p>Filter by:</p>
        <p>Org:group
Endpoints</p>
        <p>Activity – Mode discard cases
The resulting set has N=1854 cases
Event values:
NOT ({A..Z}{1..99} 2nd OR
{A..Z}{1..99} 3rd )
Start event values:
All
End event values:
Completed / In Call</p>
      </sec>
      <sec id="sec-4-3">
        <title>Results</title>
        <p>PROD453 3,16% 59 43,34%
PROD321 2,76% 51 46,10%
PROD544 2,06% 38 48,16%
PROD832 1,96% 36 50,12%
PROD253 1,80% 33 51,92%
PROD369 1,46% 27 53,38%
PROD104 1,39% 26 54,77%
PROD434 1,37% 25 56,14%
PROD363 1,27% 24 57,41%
PROD328 1,24% 23 58,65%
PROD423 1,20% 22 59,85%
PROD815 1,17% 22 61,02%
PROD698 1,00% 19 62,02%
* Absolute frequency is calculated using the N=X number and multiplied by the
relative frequency. The resulting set (with disregard of the set threshold) can add up to
another number than N=X.</p>
        <p>** Cumulative percentage is obtained by adding up the various (rounded) results
from the relative frequency. The resulting set (with disregard of the set threshold) can
add up to another number than 100%.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Research scope: For what Products Push to Front is NOT used</title>
        <p>The standard assumption would be that you would select the product list from the
previous sub-question and would subtract it from the total list of products. However,
when doing that, you will never discover whether there are duality issues with the
same product(s). Therefore, the following research approach is followed:
Element Description
Dataset Incidents
Assumption Events have a specific sequence and the scope is limited to these
events.</p>
        <p>Method Analyze the distribution of products
Threshold All results with a relative percentage of &lt;1% of N cases is not
represented</p>
      </sec>
      <sec id="sec-4-5">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>{A..Z}{1..99} 2nd AND {A..Z}{1..99} 3rd
Filter name
Attribute
Endpoints</p>
        <p>Filter by:
Org:group
Activity –
Mode discard
cases</p>
        <p>Start event values:
All
End event values:
Closed / {Cancelled, Closed, Resolved}
The resulting set has N=2198 cases</p>
      </sec>
      <sec id="sec-4-6">
        <title>Results</title>
        <p>Product Relative Absolute Cumulative</p>
        <p>Frequency Frequency* Percentage**
PROD424 5,56% 122 5,56%
PROD542 4,86% 107 10,42%
PROD698 3,24% 71 13,66%
PROD607 3,00% 66 16,66%
PROD802 2,73% 60 19,39%
PROD805 2,14% 47 21,53%
PROD660 1,95% 43 23,48%
PROD604 1,84% 40 25,32%
PROD617 1,62% 36 26,94%
PROD243 1,37% 30 28,31%
PROD253 1,33% 29 29,64%
PROD544 1,23% 27 30,87%
PROD325 1,19% 26 32,06%
PROD631 1,18% 26 33,24%
PROD267 1,08% 24 34,32%
PROD337 1,06% 23 35,38%
* Absolute frequency is calculated using the N=X number and multiplied by the
relative frequency. The resulting set (with disregard of the set threshold) can add up to
another number than N=X.</p>
        <p>** Cumulative percentage is obtained by adding up the various (rounded) results
from the relative frequency. The resulting set (with disregard of the set threshold) can
add up to another number than 100%.</p>
        <p>Q1.2 Where in the organization is the push to front process most implemented?</p>
      </sec>
      <sec id="sec-4-7">
        <title>Research scope: differences between organizations A2 and C</title>
        <p>Element
Dataset
Assumption
Method</p>
        <p>Description
Incidents
Events have a specific sequence and the scope is limited to these
events.</p>
        <p>Compare input and output by case numbers.</p>
        <p>Filter
name</p>
      </sec>
      <sec id="sec-4-8">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter by:</p>
        <p>Event values:
Attribute</p>
        <p>Endpoints
Organization
involved
Activity –
discard cases</p>
        <p>Mode</p>
        <p>Org line A2 [set 1]
Org line C [set 2]
Start event values:
Accepted / In Progress OR Queued / Awaiting
Assignment
End event values:</p>
        <p>Completed / Closed OR Completed / In Call</p>
        <p>The most important organization elements are C and A2, as stated in the VINST
data set description are confirmed by the data set with a distribution by organization
of 67% of the cases handled by organization C and 17% of the cases handled by
organization A2. The resulting breakdown of how many cases are receiving the status
“Completed / In Call” show a difference (not calculated for significance) of 35,5%
cases solved by organization C and 2,4% by organization A2.</p>
        <p>Organization
Input
Accepted / In Progress
Queued /
Assignment
Total
Output
Completed / Closed
Completed / In Call
Total</p>
        <p>A2 (17%)
[set 1]
Cases (N=)
N=595
N= 1289
Cases (N=)
N=1258
N= 31</p>
        <p>N=1289
Awaiting</p>
        <p>N=694
%
%
46,20%
63,80%
97,60%
2,40%
100,00%</p>
        <p>C (67%)
[set 2]
Cases (N=)
N=4445
N=619
Cases (N=)
N=3264
N= 1800</p>
        <p>N=5064
100,00%</p>
        <p>N=5064
%
%
87,80%
12,20%
100,00%
64,50%
35,50%
100,00%
3.2</p>
        <p>Q1.3 What functions are most in line with the push to front process?</p>
      </sec>
      <sec id="sec-4-9">
        <title>Research scope: the functions which solve first line support calls</title>
        <p>Element
Dataset
Assumptions
Method
Threshold</p>
        <p>Description
Incidents
All support desks are investigated (not only A2 and C)
Status Completed / In Call is the correct end state
Research on attribute org:role
All results with a relative percentage of &lt;1% of the cases is not
represented</p>
      </sec>
      <sec id="sec-4-10">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name
Attribute
Endpoints
* Absolute frequency is calculated using the N=X number and multiplied by the
relative frequency. The resulting set (with disregard of the set threshold) can add up to
another number than N=X.</p>
      </sec>
      <sec id="sec-4-11">
        <title>2.1: What are the …</title>
      </sec>
      <sec id="sec-4-12">
        <title>Research scope: responsibles for ping-pong behavior in incidents?</title>
        <p>Element
Dataset
Assumptions
Method
Threshold</p>
        <p>Description
Incidents
Cases with &gt;8 events are ping-pong cases (based on sampling
ping pong case flows
1-627819166 / 1-621825480 / 1-650013051)
Cases cannot have the end-state Completed / In-Call
Research on attributes functions / organizations / support teams /
products
All results with a relative percentage of &lt;4% of the cases is not
represented</p>
      </sec>
      <sec id="sec-4-13">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name
Performance</p>
        <p>Absolute Frequency
Percentage
43,01%
18,35%
10,61%
9,47%
4,67%
Absolute
Frequency
1119
477
276
246
121
1599
535
213
415
151</p>
      </sec>
      <sec id="sec-4-14">
        <title>Results: functions</title>
        <p>The resulting set has N=2601 cases.
Organizational Role
(Function)
V3_2
A2_1</p>
      </sec>
      <sec id="sec-4-15">
        <title>Results: support teams</title>
        <p>G97
G96</p>
      </sec>
      <sec id="sec-4-16">
        <title>Results: products</title>
        <p>Product
61,46%
20,57%
8,18%
15,96%
5,82%</p>
      </sec>
      <sec id="sec-4-17">
        <title>Results: organizations</title>
        <p>Organization involved</p>
        <p>Percentage</p>
        <p>Absolute Frequency
Support teams</p>
        <p>Percentage</p>
        <p>Absolute Frequency
Research: responsibles for ping-pong behavior in closed problems?
15,36%
4,23%
4,01%
400
110
104
Description
Closed Problems
Cases with &gt;8 events are ping-pong cases (based on sampling
ping pong case flows
1-736351127 / 1-653989471 / 1-563477371)
Mean time is &gt;=23d (1st block is &lt;23 days and averaging 7d
which is non-exceptional)
Research on attributes functions / organizations / support teams
/ products
All results with a relative percentage of &lt;4% of the cases is not
represented
PROD424
PROD660
PROD542
Element</p>
      </sec>
      <sec id="sec-4-18">
        <title>Results: functions</title>
        <p>Organizational
(Function)</p>
      </sec>
      <sec id="sec-4-19">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name</p>
        <p>Filter by:</p>
        <p>Event values:
Performance
Performance</p>
        <p>Number of events Minimum number of events = 9</p>
        <p>Maximum number of events = 36
Case duration Minimum duration = 23 d</p>
        <p>Maximum duration = 6 years, 87 days
The resulting dataset has N=114 cases</p>
        <p>Role</p>
        <p>Percentage</p>
        <p>Absolute Frequency
20,48%
14,07%
13,09%
12,10%
6,90%
23
16
15
14
8</p>
        <p>Applying another filter to the dataset of N=114 cases where the Function
(unknown) has been selected results in a subset of N=43 cases.</p>
        <p>The characteristics of these cases are (with the &lt;4% threshold still applied)
Group Org. Country Org. Involved Product
Value % Value % Value % Value
Org line G3</p>
      </sec>
      <sec id="sec-4-20">
        <title>Results: support teams</title>
        <p>Support team
G199 3rd</p>
        <p>G21 2nd</p>
      </sec>
      <sec id="sec-4-21">
        <title>Results: products</title>
        <p>Products
PROD97
PROD98
PROD802
PROD96
49,19%
25,26%
16,33%
5,21%
16,33%
7,18%
Percentage
11,26%
8,02%
5,91%
4,15%
Percentage</p>
        <p>Absolute Frequency</p>
        <p>Open problems are not reviewed due to their incomplete state. However, the
dataset can be handled in the same manner as presented in the previous two
exhibitions to obtain the results.
3.4</p>
        <p>Q3 Wait User
3.4.1</p>
        <p>Q3.1: Who is making most use of the state Wait / User?</p>
      </sec>
      <sec id="sec-4-22">
        <title>Research: Most use of state Wait / User</title>
        <p>Element
Dataset
Assumptions
Incidents
A subsequence is present which is Accepted / In Progress followed
in time by Accepted / Wait User
Research on attributes impact / support teams / products
All results with a relative percentage of &lt;4% of the cases is not
represented</p>
      </sec>
      <sec id="sec-4-23">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name</p>
        <p>Filter by:</p>
        <p>Event values:
Follower
Reference event value: Accepted / In Progress
Follower event values: Accepted / Wait User</p>
      </sec>
      <sec id="sec-4-24">
        <title>Results: Who is making most use of this status?</title>
        <p>The answer from the selection and breakdown to the resource that uses this
function the most provided only because we assume that the name is related to a
system name: Siebel. This is furthermore assumed as the status changes produced by
this user are mainly at a specific time (01:19 – 01:22) each day, which is assumed to
be an automated script.</p>
        <p>Because of limitations of sharing personal details based on legal protection of
employees and their performance assessment [21] we will not publicize the names of
individual employees making use of the Accepted / Wait User status.</p>
      </sec>
      <sec id="sec-4-25">
        <title>Results: Impact</title>
        <p>Impact
Cases</p>
        <p>Major
0</p>
        <p>High
83</p>
        <p>Medium
1216</p>
        <p>Low
1186</p>
        <p>Q3.2: What is the behavior per</p>
        <p>A: Support team
B: Function</p>
        <p>C: Organization</p>
        <p>No results have been produced because this part is best researched using Social
Network analysis. As described in paragraph 2.5.3, Social Network analysis is not
included in this research.
3.4.3</p>
        <p>Q3.3: Is there overuse of the Wait / User state by location?</p>
      </sec>
      <sec id="sec-4-26">
        <title>Research: overuse of the Wait / User state</title>
        <p>Element
Dataset
Assumptions
Method</p>
        <p>Description
Incidents
A subsequence is present which is Accepted / In Progress followed
in time by Accepted / Wait User
Compare the wait / user resulting country breakdown to the general
country breakdown of cases</p>
      </sec>
      <sec id="sec-4-27">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name</p>
        <p>Filter by:</p>
        <p>Event values:
br
nl
cn
kr
gb
ca
SE
ru
jp
de
au
my
0
th
0,0316
0,0221
0,0178
0,0098
0,0029
0,0029
0,0028
0,0024
0,0023
0,0016
0,0014
0,0009
0,0009
0,0003
0,0198
0,0181
0,0088
0,0041
0,0061
0,0083
0,0053
0,0023
0,0008
0,0029
0,0019
0,0037
0,0004
0,7783
1,1162
0,9834
1,1136
0,7073
0,4754
0,3373
0,4528
1,0000
2,0000
0,4828
0,4737
0,2432
0,7500</p>
        <p>No
No
No
No
No
No
No
No
No
Yes
No
No
No
No</p>
        <p>There is a rudimentary evaluation applied with respect of the ‘distance’ between
the percentage of the handled incidents by country versus the usage of the Wait / User
state. Distance is calculated by dividing the Total incident percentage by the Use
Wait/User percentage. A threshold of 1,26 (126%) is used to distinguish if further
analysis into the Wait / User state use for that particular country would be useful. The
threshold is based on the average of the Distance (0,8548) added with one Standard
Deviation (0,4099).</p>
        <p>The resulting countries for further analysis are “in” and “de”.
3.5</p>
        <p>Q4 Process Conformity per Organization</p>
        <p>Research: Do organization A2 and C work in the same way?
Element Description
Datasets
Assumptions
Method
Threshold</p>
        <p>Incidents &amp; Closed Problems
Test this for the top product with issues, as this will be the
largest impact if optimized.</p>
        <p>Only successfully closed cases are compared. This allows for
standard flow to be expected.</p>
        <p>Compare the process flows of A2 and C. Research on attributes
organization AND product AND endpoint</p>
        <p>No threshold is applicable in process flow analysis</p>
      </sec>
      <sec id="sec-4-28">
        <title>Filters used on attributes of the dataset (in sequence)</title>
        <p>Filter name
These resulting sets did not give enough information to be conclusive if both
organizations have similar operations, because the spread in the case variants was
huge, as well as a too small subset for Organization A2. The attribute Product was
removed from the filterset, resulting in other datasets: N=1258 for Organization A
and N=3274 for Organization C.</p>
        <p>The short answer to the question is no; the organizations do not work in the same
way. When comparing the process flows derived from the resulting datasets, settings
for Activities at 100% and Paths for 0%, a direct difference is visible:</p>
        <p>Organization A2 (figure 2) starts recording cases using the status Queued /
Awaiting assignment, whereas Organization C (figure 3) starts recording cases with
the status Accepted / In Progress. Furthermore, the event Wait / User is used more
frequently in Organization C3. This is a different way of working.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions &amp; Recommendations</title>
      <p>This chapter discusses the conclusions and recommendations, about the research,
the aims for this research paper and the limitations about the results.</p>
      <p>The data provided in this challenge was prepared well, but not yet cleaned, as some
result sets show empty fields. The conclusion is that in such cases a choice should be
made such as rework should be done to augment the dataset to become most effective,
or strip the inconclusive cases for example. However, the researcher will best use the
most pure dataset and remark deviations found.</p>
      <p>The questions as asked by the problem owner are answered, except for the social
network analysis question, behavior in various ways. From the results of the various
questions, there can be concluded that there is not ‘one’ way of working around the
globe whilst using the VINST system. Furthermore, within a product line there are
differences as concluded from the most and least use of the push to front mechanism.</p>
      <p>There are manu loops found in the dataset, of unknown cause. Analysis on the
timespans shows that there are short periods of time between looping steps in most
cases.</p>
      <p>The SEMBA approach can benefit from Process Mining in creating an unbiased
insight in how processes are used within a company. The Process Mining technique is
therefore a valuable competence for Business Analysts, to complement basic
modeling based on anecdotal or more formal process registrations.</p>
      <p>Process Mining can be used to do exploratory and in lesser fashion explanatory
research. In the situation that there is no interaction, it is not possible to do
explanatory research. Process Mining is highly related to practical use, based on the
source of the data. However, the interpretation of the data owner and data
manipulating operators is required to make founded statements and conclusive
hypothesis testing feasible.
4.2</p>
      <p>Recommendations
These recommendations stem from the insights and musings coming from the
research. They show some insight in where consultants would go next and where
further research is recommended.</p>
      <p>The first recommendation is to get access to the problem owner and system operators
in order to provide meaningful answers. The feeling that arose during this research is
that there might be questions behind these questions of higher importance. For
example, issues with outsourcing could be driving the questions, or Service Level
Agreement performance.</p>
      <p>The second recommendation is to inquire about the procedural approach between the
different organizations. The loops as presented in the data are a typical symptom of a
missing event or misuse by employees of a status. The other approach would be to
strip the dataset of the first iteration of “Accepted / In Progress” to make better sense
of the volumes of cases and events.</p>
      <p>The third recommendation is about the data itself. In some cases fields did not contain
content or double entries or otherwise inconsistent content. This could be user errors,
however a choice should be made upon including or excluding such cases.
The fourth recommendation is about the system automated actions. During the
research multiple mutations were found which were executed by the user Siebel
which is assumed to be the system. There are many events where it is unclear if the
system ever should be able to set a case to such a state, i.e. “Waiting / User”.
4.2.1 Practical Implications and Limitations</p>
      <sec id="sec-5-1">
        <title>The question behind the question.</title>
        <p>Improving outsourcing? Contract issues? Performance improvement? As mentioned
at the first recommendation, the question behind the question is something more
valueable to be answered. Using the Process Mining technique researchers and
consultants get the opportunity to spot ‘the elephant in the room’ and work towards a
situation which has high client benefit.</p>
      </sec>
      <sec id="sec-5-2">
        <title>What research can do.</title>
        <p>Research in the field of Process Mining is still in high flux. The role of such a
developing field is that it gives guidelines on how to proceed and discover what
works and what not. The author would like to thank and encourage anyone pushing
forward to set the boundaries, however (in)feasible the results.</p>
      </sec>
      <sec id="sec-5-3">
        <title>What research cannot do.</title>
        <p>Provide specific answers, sometimes. This is exactly why Business Analysis is up and
coming as a profession. Business Analysis allows for business improvement and
setting up the requirements for what the desired state would be. But moving towards a
To-Be design is always limited by time, money or quality, so there will never be a
best fitting solution. Specific and a high fit between Business demand and IT delivery
are only possible when the requirements are identified, assessed and put into context.
Then the result will be realistic on the elements of time, budget and required quality.
This realism allows for high customer satisfaction with limited resources.
About the author:</p>
        <p>Jef Martens is a Business Analyst for Capgemini and Business Process
Management consultant. He is the curator of reference material for Capgemini on an
international scale.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Luftman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ben-Zvi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Key Issues for IT executives 2010: Judicious IT Investments Continue Post-Recession</article-title>
          .
          <source>MIS Quarterly Executive</source>
          , Vol.
          <volume>9</volume>
          , No.
          <issue>4</issue>
          , pp.
          <fpage>263</fpage>
          -
          <lpage>273</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Luftman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ben-Zvi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Key Issues for IT executives 2009, MIS Quarterly Executive</article-title>
          , Vol.
          <volume>9</volume>
          , No.
          <issue>1</issue>
          , pp.
          <fpage>49</fpage>
          -
          <lpage>59</lpage>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Luftman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kempaiah</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rigoni</surname>
            ,
            <given-names>E.H.</given-names>
          </string-name>
          :
          <article-title>Key Issues for IT Executives 2008, MIS Quarterly Executive</article-title>
          , Vol.
          <volume>8</volume>
          , No.
          <fpage>3</fpage>
          . (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Luftman</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          , Kempaiah, R.:
          <article-title>Key issues for IT executives 2007</article-title>
          .
          <source>MIS Quarterly</source>
          Executive Vol.
          <volume>7</volume>
          , No. 2 pp.
          <fpage>151</fpage>
          -
          <lpage>159</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Luftman</surname>
          </string-name>
          , J.:
          <source>Key Issues for IT Executives</source>
          <year>2005</year>
          , MIS Quarterly Executive Vol.
          <volume>4</volume>
          No.
          <issue>2</issue>
          , pp.
          <fpage>269</fpage>
          -
          <lpage>286</lpage>
          , (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Van</given-names>
            <surname>Dongen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            et al.:
            <surname>Business Processing Intelligence Challenge (BPIC) - Third International</surname>
          </string-name>
          Business Process Intelligence Challenge http://www.win.tue.nl/bpi2013/doku.php?id=challenge (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>van der Aalst</surname>
            ,
            <given-names>W. M. P.</given-names>
          </string-name>
          : Process Mining: Discovery, Conformance and Enhancement of Business Processes. Springer-Verlag, Berlin (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Bonakdar</surname>
          </string-name>
          ,
          <string-name>
            <surname>Amir</surname>
          </string-name>
          , et al.:
          <article-title>Transformative Influence of Business Processes on the Business Model: Classifying the State of the Practice in the Software Industry</article-title>
          ,
          <source>System Sciences (HICSS)</source>
          ,
          <year>2013</year>
          46th Hawaii International Conference on.
          <source>IEEE</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Blum</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Landkroon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Tjon Tjauw Liem, J.:
          <source>Informatiemanagersurvey</source>
          <year>2010</year>
          -2011 -
          <article-title>Are you in the driver's seat of your information vehicle? Whitepaper by Capgemini Nederland B</article-title>
          .V. (
          <year>2012</year>
          ) http://www.nl.capgemini.com/sites/default/files/resource/pdf/Informatie_Managers_Survey_ 0.pdf
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Schreiner</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>The Bridge and Beyond: Business Analysis Extends Its Role and Reach</article-title>
          . IT Professional vol.
          <volume>9</volume>
          ,
          <issue>iss</issue>
          .6, pp.
          <fpage>50</fpage>
          -
          <lpage>54</lpage>
          . IEEE Computer Society (
          <year>2007</year>
          ) http://dx.doi.org/10.1109/MITP.
          <year>2007</year>
          .
          <volume>122</volume>
          11.Steeman,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Volvo</surname>
          </string-name>
          <string-name>
            <surname>IT</surname>
          </string-name>
          :
          <article-title>VINST Data Set - VINST information needed to understand the dataset</article-title>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2012</year>
          )
          <fpage>12</fpage>
          .
          <string-name>
            <surname>Batt</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moynihan</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>The viability of alternative call centre production models</article-title>
          .
          <source>Human Resource Management Journal</source>
          , Volume
          <volume>12</volume>
          ,
          <string-name>
            <surname>Issue</surname>
            <given-names>4</given-names>
          </string-name>
          , pages
          <fpage>14</fpage>
          -
          <lpage>34</lpage>
          (
          <year>2002</year>
          ) DOI:
          <fpage>10</fpage>
          .1111/j.1748-
          <lpage>8583</lpage>
          .
          <year>2002</year>
          .tb00075.x 13.
          <string-name>
            <surname>Capgemini Nederland</surname>
            <given-names>B.V.</given-names>
          </string-name>
          :
          <article-title>SEMBA Structured Expert Method for Business Analysis, Whitepaper by Capgemini Nederland B</article-title>
          .V. pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          , (
          <year>2011</year>
          ) http://www.capgemini.com/sites/default/files/resource/pdf/SEMBA_Structured_
          <article-title>Expert_Met hod_for_</article-title>
          <source>Business_Analysis.pdf 14</source>
          .
          <string-name>
            <surname>Steeman</surname>
          </string-name>
          , W.:
          <source>BPI Challenge</source>
          <year>2013</year>
          ,
          <article-title>incidents</article-title>
          . Ghent University. Dataset. http://dx.doi.org/10.4121/uuid:500573e6-accc
          <string-name>
            <surname>-</surname>
          </string-name>
          4b0c-
          <fpage>9576</fpage>
          -
          <lpage>aa5468b10cee</lpage>
          (
          <year>2013</year>
          )
          <fpage>15</fpage>
          .
          <string-name>
            <surname>Steeman</surname>
          </string-name>
          , W.:
          <source>BPI Challenge</source>
          <year>2013</year>
          ,
          <article-title>open problems</article-title>
          . Ghent University. Dataset. http://dx.doi.org/10.4121/uuid:
          <fpage>3537c19d</fpage>
          -6c64
          <string-name>
            <surname>-</surname>
          </string-name>
          4b1d
          <string-name>
            <surname>-</surname>
          </string-name>
          815d-
          <fpage>915ab0e479da</fpage>
          (
          <year>2013</year>
          )
          <fpage>16</fpage>
          .
          <string-name>
            <surname>Steeman</surname>
          </string-name>
          , W.:
          <source>BPI Challenge</source>
          <year>2013</year>
          ,
          <article-title>closed problems</article-title>
          . Ghent University. Dataset. http://dx.doi.org/10.4121/uuid:
          <fpage>c2c3b154</fpage>
          -ab26
          <string-name>
            <surname>-</surname>
          </string-name>
          4b31
          <string-name>
            <surname>-</surname>
          </string-name>
          a0e8-
          <fpage>8f2350ddac11</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          17.
          <string-name>
            <surname>Supply Chain</surname>
            <given-names>Council</given-names>
          </string-name>
          ,
          <source>What is SCOR? Retrieved</source>
          <volume>10</volume>
          -
          <fpage>07</fpage>
          -2013 https://supply-chain.org/scor
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          18. APQC, Process Classification Framework,
          <source>Accessed</source>
          <volume>10</volume>
          -
          <fpage>07</fpage>
          -2013, http://www.apqc.
          <article-title>org/process-classification-framework 19</article-title>
          .TM Forum, Frameworx, eTOM and other Frameworks,
          <source>Accesed</source>
          <volume>10</volume>
          -
          <fpage>07</fpage>
          -2013, http://www.tmforum.org/TMForumFrameworx/1911/home.html 20.
          <article-title>Banking Industry Architecture Network</article-title>
          ,
          <string-name>
            <surname>About</surname>
            <given-names>BIAN</given-names>
          </string-name>
          , Accessed
          <volume>10</volume>
          -
          <fpage>07</fpage>
          -2013 http://bian.org/about-bian/ 21.van den Brand, P.:
          <article-title>Training Perceptive Process Mining, inhouse for Capgemini and Capgemini Consulting</article-title>
          ,
          <source>Training date 26-04-2012</source>
          , Utrecht 22.
          <string-name>
            <surname>Fluxicon</surname>
          </string-name>
          , Fluxicon Blog Entry,
          <source>BPI Challenge</source>
          <year>2013</year>
          , Accessed 02-
          <fpage>06</fpage>
          -2013 https://fluxicon.com/blog/2013/06/bpi-challenge-2013/ 23.Process Mining group,
          <source>Accessed</source>
          <volume>10</volume>
          -
          <fpage>06</fpage>
          -2013 http://processmining.org 24.
          <source>Perceptive Software: Perceptive Process Mining, Accessed</source>
          <volume>10</volume>
          -
          <fpage>06</fpage>
          -2013 http://www.perceptivesoftware.com/products/perceptive-process/process-mining
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>