=Paper=
{{Paper
|id=Vol-2114/paper4
|storemode=property
|title=Situational Reference Model Mining
|pdfUrl=https://ceur-ws.org/Vol-2114/paper4.pdf
|volume=Vol-2114
|authors=Jana-Rebecca Rehse
}}
==Situational Reference Model Mining==
Situational Reference Model Mining
Jana-Rebecca Rehse
Institute for Information Systems (IWi) at the German Center for Artificial
Intelligence (DFKI GmbH) and Saarland University
Campus D3 2, Saarbruecken, Germany
Jana-Rebecca.Rehse@iwi.dfki.de
Abstract. Reference models can be considered as special conceptual
models that serve to be reused for the design of other conceptual mod-
els. Due to an ongoing need for high-quality reference models, reference
model mining, i.e. the (semi-)automatic derivation of reference models
from a set of existing process models, has recently gained the attention
of researchers. The presented dissertation project addresses the concept
of Situational Reference Model Mining, i.e. the idea that mined reference
models, although based on the same input data, are intended for different
use cases and thus have to meet different requirements. These require-
ments determine the reference model character and thus the technique
that is best suited for mining it. The dissertation’s major objective is to
design, elaborate, and validate a method for Situational Reference Model
Mining, which provides reference modelers with a clear guideline on how
to use automated reference model mining techniques to their advantage.
Keywords: Inductive reference modeling, Reference model mining, Ref-
erence model design principles, Context-aware process design
1 Motivation and Research Problem1
Reference models can be considered as special conceptual models that serve to be
reused for the design of other conceptual models [4]. They provide a template for
process models in a certain industry and thus facilitate a resource-efficient im-
plementation of the respective process and its adaption to the individual needs
of an organization. This way, companies may benefit from best practices and
industry-specific experience. The use of reference models is associated with a
higher quality of processes and process models, as it simplifies internal commu-
nications by introducing a common terminology and considerably reduces the
resources required for Business Process Management (BPM) [5].
Leveraging the multiple benefits of reference modeling in both research and
practice depends on the widespread availability of high-quality reference models.
Generally accepted and widely used reference models exist only for a few indus-
tries, such as the IT Infrastructure Library (ITIL) for IT service management or
1
This section is based on previous work presented in [1–3] and has been adapted.
the Supply Chain Operations Reference Model (SCOR) for supply chain man-
agement, eliciting an ongoing need for reference model construction methods.
Reference models may be constructed both deductively and inductively. De-
ductive methods or “top-down” approaches employ generally accepted theories
and principles. Models are constructed on a theoretical base and gradually sub-
stantiated along the way. In contrast, inductive or “bottom-up” development of
reference models makes use of real-world data such as existing process models
or execution logs. It focuses on similarities and commonalities within the input
data and abstracts from a model’s individual features. As is it possible to auto-
mate the inductive derivation of a potential reference model from a collection of
existing process models (Reference Model Mining), inductive reference modeling
may contribute to the trend of combining BPM with artificial intelligence (AI)2 .
Until a few years ago, although inductive development was generally acknowl-
edged, the literature on methods for reference model development was dominated
by deductive approaches. This is interesting, as many existing reference models
were at least partially developed inductively [3]. Recently, however, there have
been some research activities regarding inductive reference modeling. Methods
have been proposed to (semi-)automatically derive reference models from both
instance level data in the form of process logs [6] and type-level data, i.e. process
models [7]. They make use of a multitude of existing computational techniques,
such as subgraph isomorphism [3], genetic algorithms [8], or heuristic approxi-
mations of the graph edit distance [9].
When applying reference model mining (RMM), i.e. (semi-)automatic induc-
tive reference model development, it becomes evident that the reference model
content and character are significantly influenced by the choice of mining tech-
nique and its parametrization. Different mining techniques yield different models,
even when applied to the same set of input models. Meanwhile, given a reuse-
oriented understanding of reference models, the requirements to the reference
model are determined by the situational context it will be used in, similar to the
adaptation of software developments methods in Situational Method Engineering
(SME) [10]. It will hence increase an inductively developed reference model’s use-
fulness and situational suitability, if the mining technique is determined by the
circumstances of its intended reuse. We call this concept “Situational Reference
Model Mining” (S-RMM), i.e. extending RMM towards consciously considering
the situational context when designing and using a reference model (cf. Fig. 1).
Designing a method for S-RMM and understanding the influencing factors
behind it is the main objective of the presented dissertation project titled “Sit-
uational Reference Model Mining — Conceptual Design and Selected Applica-
tions”, which is conducted at the Institute of Information Systems at Saarland
University, under the supervision of Prof. Dr. Peter Loos. It started in October
2015 and is planned to be completed in 2020. Some preliminary results and cited
publications were already achieved during the author’s time as research assistant
at the same institute (April 2011 until October 2015).
2
Cf. the keynote speech “Intelligent Continuous Improvement, When BPM Meets
AI” by Miguel Valdés at BPM 2017
29
Reference
Model
Design
Require-
Choice of Design
ments
Technique Principle
Individual Target
Models Models
Fig. 1. Main idea behind Situational Reference Model Mining
2 Research Method and Research Questions
As the main objective of the project is the design of an artifact (a method
for S-RMM), it follows the design science research paradigm (DSR), originally
coined by Hevner et al. [11]. More specifically, it applies the design science
methodology by Wieringa [12], which describes design science as the “design
and investigation of artifacts in context”. According to its template for design
questions [12, p.16], the overall goal of the dissertation is the following:
Improve availability and suitability of reference models
by introducing a method for reference model mining
that considers the situational context
in order to facilitate reference model development and usage.
This goal shall be reached by answering the following four research questions.
The first two questions are knowledge questions, determined to gather knowl-
edge on the reference modeling process. The second two questions are design
questions, asking for new artifacts to improve the reference modeling process.
1. Which contextual factors influence reference model design?
2. Which techniques exist for reference model mining and which contextual
factors influence their choice and applicability?
3. How can reference model mining techniques be matched with situational
contexts to produce applicable reference models?
4. How can reference model mining be applied in a given application case,
considering situational contextual factors?
The first question addresses the contextual factors that influence reference
modeling. It is an open descriptive empirical research question, meant to ob-
serve reference modeling in real-world scenarios, gathering knowledge about the
context of the artifact to be designed without the need for explanations. In con-
trast, the second question is an open explanatory empirical knowledge question
with the goal to identify existing RMM techniques and to determine their poten-
tials and limitations. We want to know under which situational circumstances a
certain technique may and may not produce an applicable and well-suited ref-
erence model. Because these circumstances may apply to multiple techniques,
they should be conclusively explained by the technique’s properties.
30
Treatment Implementation Problem Investigation
What must be improved? Why?
Treatment Validation Treatment Design
How does this artifact contri- Which artifact could solve the
bute to stakeholder goals? problem?
Fig. 2. The design cycle to be followed [12]
The third question joins the existing artifacts (i.e. the RMM techniques)
with the context by designing a method to match an existing technique with a
situational context, such that the produced reference models are applicable and
useful for their intended purpose. Therefore, the results from the previous two
questions need to be related to each other through some intermediary concept.
This concept and the matching method are then used in the fourth question,
which finally addresses the overall goal of the dissertation project, as sketched
by the template above. It is set out to design a comprehensive method for S-
RMM, ranging from the identification of a need for a new reference model to its
application and validation in a concrete use case.
Knowledge questions and design questions are answered by means of different
research methods, following Wieringa’s design cycle [12, p.27ff.] in Fig. 2 and
Frank’s pluralistic conception of research methods in IS research [13]. Prob-
lem investigation concerns learning more about the problem context, including
stakeholder goals, conceptual problem frameworks, and related phenomena. In
our case, this means an in-depth analysis of the context factors that determine
a reference model’s applicability and usefulness. Therefore, we will perform lit-
erature reviews and conduct interviews with expert modelers. Treatment design
consists of defining requirements for the artifact and designing it accordingly.
Therefore, we will translate the knowledge from the first stage into concrete re-
quirements, before the main artifact (the method) is designed. In the treatment
validation stage, the artifact is tested in its context. In our case, this validation
will be conducted by applying the method for inductively developing a reference
model for a to-be-defined use case, using observational case studies as well as
single-case mechanism experiments along with qualitative research methods.
3 State of the Art and Related Solutions3
The concept of situational reference model construction based on design princi-
ples is not new, but has yet only been elaborated for deductive reference model
development [4]. Inductive reference model development has only recently gained
attention in research, so there is little methodological seminal work. We presented
first ideas towards S-RMM in [2], where the choice of an appropriate mining tech-
nique is discussed. In [14], we illustrate concrete challenges of inductive refer-
3
This section is based on previous work presented in [1] and has been extended.
31
ence modeling according to a seven-stage framework. A number of contributions
describe concrete RMM techniques, but do not take on a methodological per-
spective, reflecting on the ways of model construction and the requirements of
specific use cases. A first overview on existing techniques is given in [14]. Since
then, we have identified a few additional the contributions, such as [15].
The term “context” can be defined as “any information that can be used to
characterize the situation of an entity” [16]. The idea of adapting an artifact, such
as a method or a model, to fit the specific context in which it is used, originated
in the software engineering domain, where a rigid one-size-fits-all methodology
for software development is not only unattainable but also inefficient [10, p.5].
Their explanations and formalizations of the terms “situation” and “context”
can be adapted to the BPM domain [17]. Regarding business processes, the con-
text describes the “environment in which a business process artefact is used”
[18], formalized in terms of a “minimum set of variables containing all relevant
information that impact the design and execution of a business process” [19].
Similar to reference modeling, context is also considered in process mining re-
search to achieve better and more specific results (e.g. [20]).
S-RMM is innovative in the way that it combines the domain of reference
modeling with the idea to adapt an artifact to the context in which it is used, as
coined by SME, and techniques to derive a process model from collected real-life
data, as in process mining. Giving a manual procedure model for inductive refer-
ence modeling based on RMM techniques, S-RMM goes beyond SME, because it
does not directly influence the artifact itself (the model), but instead adapts the
method used for deriving it. It differs from process mining, as both process mod-
els and execution logs are considered potential data sources. To our knowledge,
no comparable approaches or solutions exist in state-of-the-art literature.
4 Contributions4
Whereas most of the author’s contributions have so far addressed research ques-
tion 2 (e.g [2, 21, 14]), the main contribution to S-RMM is made in [1], where
a first answer is given to questions 3 and 4. With a reuse-oriented conceptual-
ization of reference models, their main purpose is to serve as an orientation in
the design of new business process models. In this context, we decipher two gen-
eral design processes [4]. Deriving an individual model from a reference model
is known as “Design With Reuse” (DWR), i.e. an existing model is used as a
blueprint offering guidance to the process model designer by giving suggestions
for both content and design of the individual model. On the other hand, “Design
For Reuse” (DFR) describes the process of constructing a (reference) model for
the purpose of being reused, i.e. composing model parts and domain knowledge,
such that they achieve a certain degree of universality.
Considering a model construction process, there exist several different tech-
niques for deriving a conceptual model from another one. These so-called design
4
This section is based on previous work presented in [1] and has been adapted.
32
Determine Target Determine Determine
Determine
Model Design Reference Model Reference Model
Situational Context
Principle Requirements Design Principle
Choose Mining Choose Mine
Technique Input Models Reference Model
Design For Reuse
Design With Reuse
Adapt Design Evaluate
Reference Model Target Models Target Models
Fig. 3. Procedure model for Situational Reference Model Mining [1]
principles describe how the content of the original model is adopted, adapted,
and extended in order to create a new model. Five design principles are described
in the literature [4]. Each configuration, instantiation, specialization, aggrega-
tion, and analogy may be used in the context of reference modeling and applied
to both DFR and DWR. However, not every design principle may be applied to
every reference model, nor may every intended target model be derived by any
design principle. Instead, the choice of model design principle depends on the
situational circumstances, i.e. the requirements posed to the target model and
the construction process itself. These factors also determine the character and
the choice of an appropriate reference model for a certain application context.
The main idea behind our method for S-RMM is to match RMM techniques
with applicable contexts by means of selecting appropriate design principles for
the construction of both target and reference models. Depending on the situa-
tional circumstances and the target model requirements, a certain design prin-
ciple is applied to derive the target model from the reference model. This design
principle poses certain restrictions and requirements to the reference model de-
sign, which is mainly influenced by the choice of technique that was used to
mine the reference model. Hence, the choice of mining technique is ultimately
determined by the situational context of the reference model application.
Research question 4 is addressed by the first conceptual design for a pro-
cedure model for S-RMM, shown in Fig. 3. It constitutes the main artifact to
be delivered in the dissertation. It is designed around the conceptualization of
S-RMM in Fig. 1 and describes a generic execution process of an S-RMM appli-
cation. Of the ten steps, seven belong to DFR and focus on the reference model
construction (i.e. the actual mining), whereas three belong to DWR and are con-
cerned with the target model construction (i.e. the reference model application).
The procedure model starts with defining the situational context, which de-
termines the target model design principle in the second step. As stated below,
this step is not yet conclusively elaborated and requires a lot of additional re-
search. In the third step, the reference model requirements are defined, which
are partly determined by the used principle and partly by the context itself. The
reference model design principle is derived from these requirements, with the
help of additional methods and artifacts, as explained below. For certain combi-
33
nations of target model and reference model principle, several mining techniques
are available; their choice may depend on additional factors. The input mod-
els can only be finally selected after the mining technique, as some techniques
pose additional requirements to their input data. Afterwards, the model can be
mined, concluding the DFR process. In the DWR process, the model is manually
or automatically adapted, before the target models can be derived and evaluated
regarding their purpose.
To provide a guideline for applying the procedure model and to answer re-
search question 3, we analyzed existing mining techniques regarding their un-
derlying principles and requirements (cf. Table 1). For each target model design
principle, we suggest corresponding reference model design principles and, for
each pair, a example of a suitable mining technique. A more detailed version of
this table, classifying many state-of-the-art techniques can be found in [1].
Table 1. Analysis of matching principles and according mining techniques
Target Model Reference Model RMM Technique
Design Principle Design Principle (Example)
Aggregation [2]
Configuration
Analogy [7]
Instantiation Aggregation [21]
Aggregation [2]
Specialization
Analogy [7]
Aggregation – –
Configuration [6]
Analogy Aggregation [2]
Analogy [8]
5 Intended Future Work
Although the to-be-designed artifacts are roughly sketched in the previous sec-
tions, there are numerous steps to be conducted in order to complete the research
activities that are required to satisfactorily answer the posed research questions.
Readers will notice that the above procedure model was designed without per-
forming an in-depth requirements analysis first. This analysis will be built on
an identification of relevant context factors, which will be identified by means
of a literature analysis of inductive reference modeling case studies and seminal
works on reference modeling. The goal is an integrated and conclusive model of
influential context factors for reference model development. This model will then
be validated and enhanced with expert interviews and provide a comprehensive
answer to research question 1. From that, we can derive requirements, which can
be used to adapt and refine the procedure model.
To conclusively answer research question 2, it is necessary to update the
literature analysis from [14], in order to capture the newest developments in the
34
field of RMM techniques. In addition, there needs to be an extensive literature
review for the identification of techniques that are not explicitly set out for
RMM, but could be used for that purpose, e.g. from the field of process mining.
Our analysis of matching existing RMM techniques with accepted design
principles for reference models in Tab. 1 is a preliminary sketch of a conclu-
sive answer to research question 3. It needs to be complemented by additional
principles such as modification, elimination, or union, potentially by means of a
structuring framework. The RMM techniques identified in the previous research
question need to be matched the the individual principles. Most importantly,
there needs to be some matching procedure between the context factors from
the first question to the design principles, and then, the RMM techniques.
The procedure model in Fig. 3 is the framework for an answer to research
question 4. Not only does it require an in-depth analysis of posed requirements
and an according adaptation and refinement, it also has to be substantiated by
concrete operationalizations of the individual stages. At this point, it is unclear
how the stages should be executed. To complete the design cycle, the procedure
model is supposed to be evaluated in practical case studies, where it is used to
inductively develop a reference model for a certain context. These case studies
have yet to be determined, planned, and executed.
References
1. Rehse, J.R., Fettke, P.: Towards Situational Reference Model Mining - Main Idea,
Procedure Model & Case Study. In: Leimeister, J.M., Brenner, W. (eds.) Proceed-
ings der 13. Internationalen Tagung Wirtschaftsinformatik. Internationale Tagung
Wirtschaftsinformatik (WI-2017), February 12-15, St. Gallen, Switzerland. AIS
(2017)
2. Rehse, J.R., Fettke, P., Loos, P.: An execution-semantic approach to inductive
reference models development. In: 24th European Conference for Information Sys-
tems (ECIS). European Conference on Information Systems (ECIS-16), June 12-15,
Istanbul, Turkey. Association for Information Systems (AIS) (2016)
3. Rehse, J.R., Fettke, P., Loos, P.: A graph-theoretic method for the inductive devel-
opment of reference process models. Software & Systems Modeling 16(3), 833–873
(2017)
4. vom Brocke, J.: Design principles for reference modeling: Reusing information mod-
els by means of aggregation, specialisation, instantiation, and analogy. In: Fettke,
P., Loos, P. (eds.) Reference Modeling for Business Systems Analysis, pp. 47–75.
Idea Group Publishing, Hershey (2007)
5. Fettke, P., Loos, P.: Perspectives on reference modeling. In: Fettke, P., Loos, P.
(eds.) Reference modeling for business systems analysis, pp. 1–20. Idea Group
Publishing, Hershey, PA (2007)
6. Gottschalk, F., van der Aalst, W.M., Jansen-Vullers, M.H.: Mining reference pro-
cess models and their configurations. In: Meersman, R., Tari, Z., Herrero, P. (eds.)
On the Move to Meaningful Internet Systems: OTM 2008 Workshops, OTM Con-
federated International Workshops and Posters, Monterrey, Mexico, November 9-
14, 2008. Lecture Notes in Computer Science, vol. 5333, pp. 263–272. Springer
Berlin Heidelberg (2008)
35
7. Li, C., Reichert, M., Wombacher, A.: Mining business process variants: Challenges,
scenarios, algorithms. Data & Knowledge Engineering 70(5), 409–434 (2011)
8. Yahya, B., Bae, H., Bae, J., Kim, D.: Generating valid reference business process
model using genetic algorithm. International Journal of Innovative Computing,
Information and Control 8(2), 1463–1477 (2012)
9. Ardalani, P., Houy, C., Fettke, P., Loos, P.: Towards a minimal cost of change
approach for inductive reference model development. In: Proceedings of the 21st
European Conference on Information Systems. European Conference on Informa-
tion Systems (ECIS-13), 21st, June 5-8, Utrecht, Netherlands. vol. Paper 127. AIS
(2013)
10. Henderson-Sellers, B., Ralyté, J., Ågerfalk, P.J., Rossi, M.: Situational method
engineering. Springer (2014)
11. Hevner, A., March, S., Park, J., Ram, S.: Design science in information systems
research. MIS quarterly 28(1), 75–105 (2004)
12. Wieringa, R.J.: Design Science Methodology for Information Systems and Software
Engineering. Springer Berlin Heidelberg (2014)
13. Frank, U.: Towards a pluralistic conception of research methods in informa-
tion systems research. ICB-Research Report No. 7, Institut für Informatik und
Wirtschaftsinformatik (ICB) der Universität Duisburg-Essen (2006)
14. Rehse, J.R., Hake, P., Fettke, P., Loos, P.: Inductive reference model development:
Recent results and current challenges. In: Mayr, H.C., Pinzger, M. (eds.) INFOR-
MATIK 2016 (LNI Volume P-259). pp. 739–752. Lecture Notes in Informatics,
Gesellschaft für Informatik (GI), Bonn (2016)
15. Leng, J., Jiang, P.: Granular computing–based development of service process ref-
erence models in social manufacturing contexts. Concurrent Engineering 25(2),
95–107 (2017)
16. Dey, A.: Understanding and using context. Personal and ubiquitous computing
5(1), 4–7 (2001)
17. Kornyshova, E., Deneckère, R., Claudepierre, B.: Contextualization of method
components. In: Fourth International Conference on Research Challenges in In-
formation Science (RCIS). pp. 235–246. IEEE (2010)
18. Born, M., Kirchner, J., Müller, J.P.: Context-driven business process modeling. In:
The 1st International Workshop on Managing Data with Mobile Devices (MDMD
2009), Milan, Italy. pp. 6–10 (2009)
19. Rosemann, M., Recker, J.: Context-aware process design: Exploring the extrinsic
drivers for process flexibility. In: The 18th International Conference on Advanced
Information Systems Engineering. Proceedings of Workshops and Doctoral Con-
sortium. pp. 149–158. Namur University Press (2006)
20. Li, J., Bose, R.P.J.C., van der Aalst, W.M.P.: Mining context-dependent and inter-
active business process maps using execution patterns. In: zur Muehlen, M., Su, J.
(eds.) Business Process Management Workshops: BPM 2010 International Work-
shops and Education Track, Hoboken, NJ, USA, September 13-15, 2010, Revised
Selected Papers. pp. 109–121. Springer Berlin Heidelberg (2011)
21. Rehse, J.R., Fettke, P., Loos, P.: Eine Untersuchung der Potentiale automatisierter
Abstraktionsansätze für Geschäftsprozessmodelle im Hinblick auf die induktive En-
twicklung von Referenzprozessmodellen. In: Alt, R., Franczyk, B. (eds.) Proceed-
ings of the 11th International Conference on Wirtschaftsinformatik. Internationale
Tagung Wirtschaftsinformatik. Internationale Tagung Wirtschaftsinformatik (WI-
2013), February 27 - March 1, Leipzig, Germany (2013), (In German)
36