<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Knowledge-based Approach to the Configuration of Business Process Model Abstractions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Shamila Mafazi</string-name>
          <email>1shamila.mafazi@mymail.unisa.edu.au</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wolfgang Mayer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Georg Grossmann</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Markus Stumptner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of South Australia</institution>
          ,
          <addr-line>Adelaide, SA, 5095</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <fpage>60</fpage>
      <lpage>74</lpage>
      <abstract>
        <p>ion mechanisms have focused predominantly on structural aggregation, projection, and ad-hoc transformations. We propose an approach for configuration of process abstractions tailored to a specific abstraction goal expressed as constraints on the abstraction relation and process transformation operators. Our framework goes beyond simple structural aggregation and leverages domain-specific properties, taxonomies, meronymy, and flow criteria. In this paper we outline the constraint-based framework and its underlying inference procedure. We show that our approach can handle most of the common process analysis use cases.</p>
      </abstract>
      <kwd-group>
        <kwd>business process abstraction</kwd>
        <kwd>business process management</kwd>
        <kwd>process configuration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Models of business processes and operational procedures are increasingly being used
in modern organizations, and the size and complexity of processes and their models
can often be large. Development processes in large technology-focused organizations
can easily span more than one thousand process steps [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. As a result, process models
have become difficult to understand and manage, as they may not be specified in full
in order to enable flexible executions. However, such flexibility comes at a price: it is
no longer easily possible to reason about executions based on a single process model.
Although learning methods have been developed to reconstruct process models from
execution logs [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], the resulting processes are often very specific and can be difficult
to comprehend in full. Therefore, methods for business process abstraction are desired
that enable process analysts to tailor large models to their specific analysis task at hand.
      </p>
      <p>
        Methods for abstraction have been proposed to ease comprehension, monitoring,
and validation of large processes and their running instances. To date, abstraction
mechanisms have focused predominantly on structural aggregation and projection.
Collapsing “similar” entities in a process model into one abstract element and projecting away
irrelevant entities are among the most common forms of simplification employed for
abstraction. Similarity and relevancy of process entities is often defined ad-hoc using
process structure, clustering techniques, and user-specified selection criteria [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Clustering techniques, statistical methods, and ad-hoc criteria are commonly used to devise
a concise summary representation that reflects certain aspects of the larger process.
      </p>
      <p>Although structural aggregation can lead to considerable simplification of large
process models, the resulting model may not show all required elements or aggregate
elements together that would be better kept separate. However, these measures fail to take
into consideration the purpose of the abstraction for the user.</p>
      <p>We propose an approach to computing abstractions of business process models
tailored to conducting selected common business process analysis tasks. We address this
problem by imposing constraints on the abstraction relations that relate concrete and
abstract process models such that the abstract process model induced by the abstraction
relation is guaranteed to include the information needed to assess selected properties of
the process. Rather than relying on cumbersome explicit specification of relevant
process elements, we combine a questionnaire-driven approach to eliciting constraints for
common analysis tasks with explicit specification of additional constraints a user may
have.</p>
      <p>As a result, significance and granularity of an abstract model can be explicitly
controlled and adjusted to suit a given task. Furthermore, the granularity need not be
uniform across the entire model; different abstraction operators can be applied to different
regions of the process model.</p>
      <p>
        Although techniques for parameterizing the granularity of the resulting abstractions
have been introduced in order to compensate for current techniques’ inability to devise
representations that are fit for the user’s objective [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], to the best of our knowledge,
no explicit means to control abstractions is available to non-experts in formal process
analysis.
      </p>
      <p>Our method can be seen as configuration of process models, where configuration
applies to the abstraction operators used in devising process rather than the process model
itself. In contrast to classic configuration where one chooses between alternative
instantiations of given variation points within a parametric process model, our approach takes
a detailed process model without explicit variation points and derives simplified
variations thereof. Hence, our configuration method controls the operators applied within
the abstraction process rather than the underlying process model.</p>
      <p>In this paper we make the following contributions:
– a knowledge-based framework for configuring purposeful abstractions;
– a framework for specifying constraints on the abstraction;
– a method to infer the process elements (nodes, data, labels) that need to be retained
in a conforming abstraction;
– a method to compute abstractions conforming to the abstraction goal.</p>
      <p>The subsequent sections are structured as follows. Our process model and
abstraction framework are introduced in section 2, our constraint-based abstraction framework
and configuration mechanism are described in sections 3 and 4, respectively.
Abstraction operators are modeled in section 5 and our method of synthesizing conforming
abstractions is summarized in section 6, followed by discussion of related work in
section 7.</p>
    </sec>
    <sec id="sec-2">
      <title>Process Model Abstractions</title>
      <p>Different users of a process model are usually interested in observing a process model
at different levels of details. This requires creation of different abstract process models
from one model. However, not all abstract views of a process are equally desirable, as
useful abstractions should be tailored to the user’s needs. In this work, we pursue this
aspect of process abstraction by constraining abstractions such that certain user-selected
properties of the underlying concrete process are maintained in its abstract view.</p>
      <p>
        We adapt the process model of Smirnov et al.[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] for our purposes and furnish the
model with explicit representations of data- and domain-specific properties attached to
tasks:
Definition 1 (Process Model). A tuple (N; F; P; ; DP ) is a process model, where N
is a finite set of nodes partitioned into tasks Nt and gateways Ng, F N N is the
flow relation such that (N; F ) is a connected graph, P is a finite set of properties of
tasks, DP is a finite set of property values of tasks, and : N P 7! DP is a function
that maps each property of a node to its value. For brevity, we write n:p for (n; p). Let
M denote the set of all process models.
      </p>
      <p>The set of properties P comprises common domain-specific properties, predicate
valuations, and information derived from executions of process instances. Common
properties include roles, resources, timing information, and used and modified data flow
information. Domain-specific predicates are boolean properties expressing facts such
as “is on a critical path”. Information derived from executions indicate aggregate
information, for example execution frequencies or number of running instances of a task.</p>
      <p>Given a concrete model m of a business process, an abstract view of m is a process
model m^ that retains “significant” entities of m and omits insignificant ones. In our
framework, entities comprise the nodes, flows, and properties associated with nodes in
a given model. We write m to denote the set of entities in m where m N [ F [
fn:pjn 2 N; p 2 P g.</p>
      <p>Which entities are considered significant is largely determined by the purpose of the
abstract model and hence should be defined flexibly based on the goals of the analyst.
We will therefore use an abstract predicate sign m [ m^ to capture the significant
entities.</p>
      <p>Whereas insignificant entities can be either eliminated from in the abstraction or
absorbed into an abstract entity, the significant elements are to be retained. The
correspondence between significant entities of m and their abstract counterpart in m^ is given
by an abstraction relation R m m^ .</p>
      <p>Definition 2 (Process model abstraction). A business process model abstraction is
a function : M 7! M that transforms a model m into a model m^ = (m) with
correspondence relation R such that
– 8!^ 2 m^ sign(!^) is true,
– 8!^ 2 m^ 9! 2 m (!; !^) 2 R ,
– 8! 2 m sign(!) ! 9!^ 2 m^ : (!; !^) 2 R , and
– preserves local composition of m in m^ .</p>
      <p>Receptionist
5 min
Receptionist
5 min</p>
      <p>Receptionist
10 min
Concrete Process Model m</p>
      <p>Receptionist
10 min</p>
      <p>Admin
7 min
Admin
2 min</p>
      <p>Staff
10 min
Customer
7 min
Receptionist
3 min
Accountant
1 min</p>
      <p>Receptionist
7 min</p>
      <p>Receptionist
7 min
The first three conditions ensure that all retained entities in the abstraction are
significant, are justified by the existence of at least one entity in the concrete process, and that
all significant concrete entities have a corresponding element in the abstract model. The
fourth condition restricts correspondences to meaningful maps that preserve the local
structural composition of m in m^ . We require that each concrete entity maps to at most
one abstract counterpart. Where each abstract property attaches to the abstraction of
the concrete nodes belonging to the concrete properties. Also the abstract flow relation
reflects the flow in the concrete process model:
– 8! 2
m 8!^; !^0 2
m^ (!; !^) 2 R
^ (!; !^0) 2 R
! !^ = !^0,
– 8n:p 2 m8n^:p^ 2 m^ (n:p; n^:p^) 2 R</p>
      <p>^
– ( m^; n^) 2 F ! 9m; n 2 N (m; n) 2 F
! (n; n^) 2 R ,
^ (m; m^ ) 2 R
^ (n; n^) 2 R .</p>
      <p>
        Consider the example process models in Figure 1, where the model in the lower half
depicts the concrete process and the upper half shows the abstract model. The
correspondence relation for tasks is indicated by dashed lines; the correspondences for flows
are left implicit. Assuming that all elements performed by role Receptionist in m are
significant, the abstraction satisfies the condition of Definition 2 as well as the three
constraints stated above. For illustration, assume that tasks Cancel Late and Send
Cancellation Confirmation each have a property Duration, then the constraints on R
ensure that property Duration of abstract task Cancel is an abstraction of only the concrete
tasks’ property.
According to Smirnov et al.[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] business process abstraction consists of three aspects:
the why, the when and the how aspect. The why aspect captures the reasons for building
an abstraction of a process model (fragment), the when aspect describes the conditions
under which an element of a process model needs to be abstracted, and the how aspect
relates to the concrete transformation mechanism to devise an abstraction. Whereas an
extensive body of work covers the how aspect, comparatively little work is available to
address the remaining aspects.
      </p>
      <p>Our work aims to address the why and when aspects. We assume that a specification
of the information, its granularity, and predicates whose truth values shall be preserved
by the abstraction can be elicited, represented formally, and exploited to guide a search
procedure to infer suitable abstractions. Let be such a specification, formulated over
the entities in a given process model m. Specifically, we are interested in abstract
models m^ = (m) satisfying .</p>
      <p>By making the abstraction criterion explicit, the why aspect of process abstraction
is captured, which can be translated into conditions for when it is admissible to abstract
different entities. We define the significance predicate such that the entities are
preserved which are required to ensure that criterion is fulfilled on the abstract model.
Building on prevalent structural rewriting mechanisms, we provide generic operators
on properties and their values in order to automatically eliminate or aggregate entities,
and furnish the abstract model with a suitable representation of aggregated information.
The application of operators is restricted such that the resulting abstract model retains
the significant entities and predicates.</p>
      <p>An abstraction criterion may be composed of the following specification primitives:
– sign(!) for ! 2 m;
– ! = !0 for !; !0 2 m [ m^ ;
– (n; n0) 2 F [ F^ , n; n0 2 Nt [ N^t, n; n0 2 Ng [ N^g;
– n:p c, where n, p, and c are a node, a property and a constant drawn from DP ,
respectively, and is a relational operator (e.g. , , =, 6=, . . . );
– (!; !^) 2 R ;
– negation, conjunction, disjunction, universal and existential quantification.
This language is expressive enough to capture many interesting properties, including
domain-specific predicates and some aggregate instance information. The starred F
and F^ denote the transitive closure of the flow relation.</p>
      <p>For example, one could be interested only in the expensive tasks in the process
model in Figure 1, where the value of Fee exceeds some threshold $$: = x:Fee
$$ ! 9x (x; x^) 2 R ^ x:p = x^:p ^ (x:p; x^:p) 2 R for p 2 fFee; Label g. Capturing
this explicitly in , significance predicate and aggregation operators can be found. The
example formula implies that all “expensive” tasks will retain their precise labels and
fee information, whereas all other tasks and properties can potentially be abstracted
away (subject to maintaining the generic abstraction constraints and well-formedness
of the resulting abstract process).</p>
      <p>While this example may seem trivial, our approach generalizes to more involved
situations. For example, if execution times shall be retained, but labels of some tasks need
Cancel</p>
      <p>Cancel
Cancel Early Cancel Invoice Cancel Late Send Cancelation</p>
      <p>Confirmation
Fig. 3. Task Meronymy
not be, our approach allows us to absorb otherwise insignificant tasks into other tasks,
but prevents us from eliminating the task entirely, which would result in its contribution
to execution time being lost. Similarly, the model abstractions that may be applied in
devising an abstraction would be restricted to aggregate the property of sequence of
nodes using the sum function but not, for example, max function. Furthermore, data
flow in the model may impose restrictions on significance of non-local process entities.</p>
      <p>To facilitate the abstraction of data properties and other non-structural aspects of
the business process, we assume that the value domain Dp of each property (including
the label of nodes) p 2 P forms a (finite-height) (semi-)lattice with partial order p,
where x p y denotes that x is more precise (or has more information) than y. We use
&gt;p to denote the least element of Dp, which provides no information. In this case, the
property can be omitted.</p>
      <p>For example, let us revisit the model in Figure 1. An example of the (semi-)lattice
for the Role properties is shown in Figure 2. The lattice for roles indicates that roles
Receptionist and Admin are specializations of role Staff and are therefore candidates
for role abstraction.</p>
      <p>For example, one could be interested in distinguishing Customers from Staff but not
the precise staff roles. This could be captured in as a constraint on the Role property
of nodes. As a result, any value r of property Role that satisfies r Sta would be
abstracted to value Staff.</p>
      <p>
        We impose one more constraint on R : any admissible R must satisfy that no
information can be gained in the abstract model. That is, (!; !^) 2 R ! ! !^ must
hold for all property entities !; !^.
4
Although the method of constraining valid abstractions is powerful, direct exposure
of the formal framework to business analysts is rarely feasible in practice. Therefore,
we employ knowledge-based configuration mechanisms to elicit appropriate partial
abstraction specifications. We use a variant of the questionnaire method of process
configuration [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which interacts with the user in terms of simple domain-specific questions
in order to construct the formal domain representation from the user’s answers.
Different from previous work, our configuration model does not rely on established variation
points within the process model, but rather aims to construct a formula that constrains
the admissible abstraction relations and operators that can be used to construct it. No
explicit library of processes and variation points specific to the process under
consideration is needed.
      </p>
      <p>We envision our process abstraction configurator to provide a wizard-like
interaction where process analysts may select the information and predicates they wish to
retain in the abstraction, and define domain-specific value lattices, aggregation- and
structural transformation operators. Underlying our configurator is a catalog of
abstraction constraint templates, which can be selected and its parameters instantiated by the
user.</p>
      <p>Definition 3 (Configuration Model). A configuration model is a triple (C; O; G), where
C is a catalog of abstraction aspects, O is a library of abstraction operators (defined in
section 5), and G is a finite set of boolean propositions.</p>
      <p>The catalog contains configuration options and associated abstraction constraints, the
library of abstraction operators defines the transformations that can potentially be
applied to the process model, and the set of propositions allows one to restrict the set of
applicable operators based on choices made for aspects in the catalog. We first describe
the catalog and defer discussion of the operators until the next section.
Definition 4 (Abstraction Aspect Catalog). An abstraction aspect catalog is a set of
templates (Q; X; C[X; G]) where Q is a configuration option, X is a set of parameter
variables, and C[X; G] is a formula template parametric in X specifying the
abstraction constraints associated with Q in terms of the process model, and abstraction
operator constraint in terms of assignments to G. Each placeholder variable x 2 X can be
assigned a predicate or domain value from the process model (subject to resulting in a
well-formed formula C[X; G]). The configuration criterion is simply the conjunction
of all constraints Ci[xi; Gi] of selected Qi with binding Xi = xi.</p>
      <p>As an example, let the configuration option Q1 be ’Get a process view from all
the interactions between two specific roles’. By selecting this configuration option, the
parameter variables are set as: X = fRole1; Role2g. The values for the roles are
requested and assigned as Role1 = Admin and Role2 = Accountant. The
configuration imposes constraints on the abstraction relation: a task n must be retained in the
abstraction if its Role property valuation matches either Role1 or Role2, and there is a
flow from n to another task n0 that has property Role set to the remaining given role.
Formally, the abstraction criterion can be expressed as
8n1; n2 2 Nt : (n1; n2) 2 F
^ ((n1:Role = Role1 ^ n2:Role = Role2)
_ (n1:Role = Role1 ^ n2:Role = Role2))
! (n1; n1) 2 R
^ (n2; n2) 2 R :</p>
      <p>
        The catalog allows for convenient elicitation of user’s requirements based on
common abstraction goal patterns. Table 1 shows how 11 of the 14 common use cases for
process abstraction presented by Smirnov et al[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] can be expressed in our framework.
Most constraints restrict which tasks and properties may be abstracted, and whether
insignificant tasks shall be eliminated or aggregated. In the first group of uses cases
(1–4), a process view respecting one or more properties of a task, such as resources
and data objects, is required. For this purpose the properties of all tasks are compared
with the user specified property P. Tasks satisfying property P over property A are
retained in the abstraction, whereas others are eliminated. In the second use case, tracing
a task, the effect of a task in the process model needs to be assessed. For this purpose
a process view containing the tasks which are reachable from the interesting task is
produced. The constraint ensures that all tasks x0 reachable from a given task x are
retained in the abstraction. For instance-related use cases (5–7), we currently require a
pre-processing stage, where the tasks in the process model are furnished with aggregate
property information derived from the instances. For example, an property representing
execution frequencies or cumulative case costs could be added. For use case 9, adapt
process model for an external partner, the tasks which need to be presented to the
external partner are selected. The selected tasks are considered as significant, hence they
need to be retained while the rest of the tasks are aggregated. The first constraint ensures
that selected tasks are retained in the abstraction, whereas the second constraint ensures
that no insignificant tasks are eliminated from the model (although such tasks may be
aggregated with other insignificant tasks). In use case 10, a process view respecting the
data dependencies of the tasks is required. For this purpose those tasks which make
use of the data objects of interest are considered as significant and must be retained
in the abstraction while the rest of the tasks are considered as insignificant and can be
eliminated from the abstract model. For use case 13, a process view respecting user
specified property(s) is required. Different from use cases 1–4, in this process view the
insignificant tasks (tasks without interesting property(s)) are aggregated and presented
as a composite task in the process view. Hence the constraint prohibiting the elimination
of insignificant tasks must be imposed in addition to the constraint capturing use cases
1–4.
      </p>
      <p>Three use cases cannot directly be expressed in our framework. In use case 14,
Retrieve coarse grained activities, a view over the coarse-grained tasks are required but
not a view over the process model. This requires inferring the coarse-grained activities,
i.e, abstraction hierarchies and meronymy, from the detailed process model. In contrast,
our approach relies on given abstraction hierarchies and meronymy to compute
abstractions. In use case 12, the user needs to control the abstraction level gradually while in
our approach the process model is abstracted until all the user specified criteria are met.
Finally, use case 8 requires to infer possible executions of the process model given a
specification of a case instance. Extensions to our framework would be required in
order to infer transitions that are potentially enabled or blocked based on guard conditions
and values in the given case instance.
5
Once abstraction constraints have been set, the concrete process model m can be
transformed into a customized process view m^ . In our framework, this amounts to
constructing an abstraction function and its induced R such that all abstraction constraints
are satisfied when applying on m. We employ generic search techniques to compose
from individual model transformation operators selected from a library of abstraction
operators.</p>
      <p>Preserving Relevant Tasks (Use cases 1–4)
Q1 : Retain a task if property [A] satisfies [P]
C1[A; P ] = 8x 2 Nt [P ](x:[A]) ! (x; x) 2 R
Tracing a Task (Use case 11)
Q2 : Retain a task if it is reachable from the node [x]
C2[x] = 8x0 2 N (x; x0) 2 F ! (x0; x0) 2 R
Preserving Relevant Process Instances (Use cases 5–7)
Q1 and Q2, based on pre-processed model
Adapt Process Model for an External Partner (Use case 9)
Q3 : Retain selected tasks in set T
8x 2 T (x; x) 2 R
Q03 : Aggregate insignificant tasks:
8x 2 N sign(x)
Trace Data Dependencies (Use case 10)
Q4 : Retain a task if it uses data property [P]
C4[P ] = 8x 2 Nt8p 2 [P ] HasP roperty(x; p) ! (x; x) 2 R
Get Process Quick View Respecting a Property (Use case 13)
Q1 and Q03</p>
      <p>Abstraction operators are model transformations that rewrite the concrete model’s
entities into their abstract counterparts. Traditionally, work on business process
abstraction focuses predominantly on structural transformations, where rules specify how
fragments in a model shall be transformed into an abstract (smaller) fragments in the
abstract model. Our work extends this approach to data properties.</p>
      <p>Similar to constraints on the abstraction relation, which limit the information
retained in the abstraction, the selection of abstraction operators is subject to constraints
imposed by the configuration model that ensure abstract data values are given
meaningful values consistent with the purpose of the abstraction.</p>
      <p>Definition 5 (Abstraction Operator). An abstraction operator is a tuple (R; S; V; W )
where R, S are fragments of a process model (“patterns”) with common variables V ,
and W is a boolean expression over propositions G (in the configuration model) and
V , governing the application of the operator. If R matches with binding in a model
m, and W is satisfiable, a model m0 = m[R 7! S ] is obtained by replacing the
matched part R in m with the replacement fragment S . Substitute S may contain
data transformation functions that compute the aggregate value for properties in the
abstract model. Operators include sum, min, max, avg, for numeric properties, and
least upper bound and greatest lower bound operators (if defined) on properties’ value
lattices.</p>
      <p>Our library of abstraction operators currently comprises:
– Projection operators that eliminate tasks/flows;
– Entity abstraction rules that transform labels and properties of individual tasks.</p>
      <p>These operators abstract property values according to the corresponding lattices of
domain values;
– Structural rewrite rules that transform the process structure and re-arrange tasks
and flows;
– Aggregation rules that aggregate values of properties of multiple tasks. Separate
rules exist for properties of different type, and different aggregation functions may
need to be used for sequence, choice, parallel, and loop constructs.</p>
      <p>For space reasons we cannot present the entire collection in detail. Figure 4 contains
examples of property-related aggregation for properties of different types (numeric,
set-valued, boolean). The bottom part shows the concrete fragments and the top part the
abstract counterparts; X and Y represent variables to be matched and a,b,c represent
placeholders for numeric, set-valued, and boolean properties, respectively.</p>
      <p>Figure 4a indicates 2 tasks in a block. To aggregate the numeric properties of the
two tasks, the operators such as Max, Min, Avg, Sum can be employed. Selecting an
operator is completely case based. For example, assume a user is interested in tasks
with high hand-off times. In this case, the operator Max needs to be selected to
assign the maximum hand-off time to the composite task XY. Likewise, for the set-valued
properties, an operator such as union, aggregate meronymy, abstract label, based on the
configuration option in hand, can be selected. The operators for boolean properties of
the tasks, include, Or, And, Xor. As an example, assume a user is interested in observing
the tasks which are in a critical path, the operator Or can be employed which indicates
the composite task is whether on a critical path or not. Figure 4b shows an abstraction
operator for two tasks in a loop. For the numeric properties of these tasks, based on
how many times the loop is executed, the result from the abstracting operator needs to
be multiplied or widened to infinity, if an upper bound is not known. Figure 4c shows
an abstraction operator for sequential tasks. In this case, where numeric properties
typically are aggregated, set-valued properties are merged, and boolean properties are either
merged or combined using logic operators to infer the property value associated with
the abstract task.</p>
      <p>
        Table 2 gives a list of operators currently defined in our library. The table on the
right-hand side of the figure shows examples for formalization of three operators in our
framework. Our formalization relies on a set G of propositions defined in the
configuration model that is used to govern the application of certain abstraction operators.
The elements of this set are determined by the selected configuration options and
domain model and consists of propositions of the form Enable(o; op; p), where o is the
name of an abstraction operator, op is an aggregation operation, and p is a property.
Together with a hierarchy of properties (with specialization ordering v), the propositions
are used to control which operators can be applied to certain operations. For example,
abstraction operator SumNumPropSeq is only applicable if none of the configuration
options prohibits its application. Whereas most operators are generic and can be
applied to process models from any domain, domain-specific operators can be introduced
to account for specific abstractions, such as the meronymy approach presented in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
6
Conceptually, our abstraction method proceeds as follows. Starting with a given
concrete process model m and configuration constraints , we employ a search procedure
      </p>
      <p>
        Fig. 4. Structural and Property Aggregation Operators
SumNumPropSeq(x,y,p):
x; y 2 Nt ^ (x; y) 2 F ^
(x; xcy) 2 R ^ (y; xcy) 2 R
^p v N umeric
^ :Enable(SumN umP ropSeq; +; p) 2= G
! xcy:p = x:p + y:p
to incrementally build an abstraction. An applicable abstraction operator r is selected
and applied to m, yielding a transformed model m0. If structural aggregation was
performed, additional rules to determine the property values of new task(s) are applied.
Concurrently, the abstraction function and its correspondence relation are extended to
account for the effects of r. This process repeats until an abstraction satisfying all
constraints in has been created and no further rule applications are possible. As a result,
we obtain an abstraction function that transforms the given model m in a maximally
abstract process model reflecting the relevant tasks and properties. If the intermediate
results are recorded, this yields a hierarchy of abstractions of varying granularity.
Although not all models in this hierarchy necessarily satisfy all abstraction constraints,
navigating the abstraction hierarchy could be useful to “drill-down” in specific areas if
needed (comparable to the approach in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]). Incremental specification and adjustment
of abstraction constraints based on initial abstract views remains a direction for future
research.
      </p>
      <p>If multiple operators are applicable, this approach may result in multiple possible
abstractions. To steer our algorithm towards desirable abstractions, we employ a
simple optimization method that aims to minimize both constraint violations and model
complexity. When selecting an abstraction operator, we choose the operator that
minimizes the sum viol( ; ; m) + size( (m)), where viol( ; ; m) denotes the number
of constraints in that are violated by the current abstraction when applied to m,
and size( (m)) measures the number of elements (jN j + jF j) in the abstract model
(m). In addition, we maintain a worklist of the current best k abstraction functions.
Currently, k is a user-set parameter.</p>
      <p>For example, let us revisit the process in Figure 1. Assume that only tasks that are
involving role “Receptionist” with Duration&gt; 3min are required to be shown in the
abstraction.</p>
      <p>Based on the given the abstraction constraints, the abstraction criterion can be
expressed as:
8n 2 Nt ^ n:Role = Receptionist ^ n:Duration &gt; 3 ! (n; n) 2 R</p>
      <p>Considering the criteria, tasks Use, Cancel Early and Cancel Invoice are
insignificant, as for example U se:Role 6= Receptionist. Aggregating the two tasks Cancel
Early and Cancel Invoice does not result in a significant task either. Hence among
others, operator “Remove Task” can be applied to these tasks to eliminate them from the
process model. Tasks Cancel Late and Send Cancellation Confirmation are also
insignificant but unlike Cancel Early and Cancel Invoice, aggregating these two tasks
results in a significant task. Hence, operator Abstract Property Value can be applied to
their role properties to lift the property value to the abstract value Staff. Now, operator
Aggregate Meronymy can be applied (based on the meronymy in Figure 3),
combining Cancel Late and Send Cancellation Confirmation into Cancel. The operator
SumNumPropSeq is applied on the duration properties of the two tasks to add up these
properties. Since the abstract task was formed by sequential composition, Aggregate
Value (Seq) must be applied twice to infer the value for properties Role and Duration of
the abstract task.</p>
      <p>At this point, no operators are applicable that satisfy the abstraction constraints.
Further simplification of properties and removal of tasks or flows would yield either an
ill-formed process model or violate an abstraction constraint.
7</p>
    </sec>
    <sec id="sec-3">
      <title>Related Work</title>
      <p>The research presented in this paper complements the areas of business process model
abstraction and process model configuration. Due to emerging various needs, several
approaches have been proposed by which the size of a process model can be reduced.
However, no single approach provides the same level of configuration ability as ours.</p>
      <p>
        Many approaches for simplifying a given process model based on rewrite rules have
been developed [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Rewriting approaches based on process fragments, process
regions and patterns aim to simplify the structure of large processes by hierarchical
aggregation. Various process visualization techniques rely on users selecting interesting
tasks and eliminating the remaining tasks from the process model [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Pankratius et al.
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposed Petri Net based reduction patterns, including place and transition
elimination and place and transition join, for abstraction. Liu et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] cluster tasks in a
process model, preserving ordering constraints, roles, execution frequencies, and
process view for external partners. Since their main abstraction operation is aggregation,
the clusters are aggregated into composite nodes. In both of these approaches [
        <xref ref-type="bibr" rid="ref11 ref9">11, 9</xref>
        ],
the authors address the how component of the business process abstraction. Since the
papers ignore the execution semantic of the process model and treat only tasks, but
not the reachability criterion, as the abstraction objects, the process views related to the
process instances (use cases [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5-7</xref>
        ]) cannot be captured by their techniques. Additionally,
compared to our approach, their approach is not user interactive. Cardoso et al.[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
proposed reduction rules to synthesize process views respecting ordering constraints and
roles. The paper concentrates on the how component of the process abstraction while
only non-functional property values have been considered. Furthermore, their reduction
technique is pattern based. Once a region matches one of their predefined patterns, the
region is aggregated into a composite node. Hence, it is not always possible to
aggregate an insignificant task, as forming a region for the task that matches the patterns, can
be impossible.
      </p>
      <p>
        Bobrik et al.[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] aggregate information derived from running instances into a
summary process model, including completion status, data properties, and timing
information. In this paper only the how component is discussed. Also the paper does not discuss
the property aggregation operations for different types of properties. Polyvyanyy et al.
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] defined abstraction criteria based on slider approach which separate significant
from insignificant tasks, which are subsequently aggregated based on structural process
patterns. Although the abstraction criteria can be extended to cover more abstraction
scenarios, they are limited to those properties which have a quantitative measurement
such as cost and execution duration.
      </p>
      <p>
        Fahland et al.[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] proposed a simplification strategy for Petri nets that is based on
unfolding and subsequent transformation and folding regardless of abstraction purposes.
Overall most of the process model abstraction approaches focus on only the how
component, reduce a process model based on predefined patterns, consider only a limited
number of properties, and are not user interactive. In contrast, we take other process
abstraction components into account, we do not restrict the preservation or aggregation
of a task based on its region and the corresponding patterns, we provide an
aggregation solution for properties with different types. Finally using a questionnaire, different
needs of a user from abstracting a process model are taken into account.
      </p>
      <p>
        In process model configuration literature, La Rosa et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] introduce a
questionnaire approach for system configuration. The questionnaire elicits facts about the
desired process variant. Facts are associated with actions that adapt a given generic
reference process to suit the users requirements. Gottschalk et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] summarizes similar
approaches for EPCs and YAWL, where tasks in the process are either blocked or
hidden. In contrast, our approach does not rely on a reference process with variation points.
Instead, we constrain the resulting abstraction relation and employ search techniques to
compute suitable abstractions for tasks and data entities in the process model.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>We presented a configuration method for generating tailored business process
abstractions that satisfy user-selected abstraction criteria. Our method is based on imposing
constraints on the abstraction relation, which is computed using a generic search
procedure using a library of generic and domain-specific abstraction operators. Elicitation
of relevant abstraction constraints is simplified by a questionnaire-based approach that
hides much of the formal underpinnings of our method. Our abstraction approach goes
beyond simple structural transformation and also considers data properties and flow
aspects within the process model in the abstraction.</p>
      <p>In this paper we focused on conceptual elaboration of our method. Immediate future
work will focus on empirical evaluation of the approach on large business processes,
and on incorporating preference orderings into our search and operator selection
algorithms. Other avenues for research are incremental elicitation of abstraction constraints
in the context of incremental process exploration and integration of process
instancebased properties and further reachability-based criteria.
9</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgement</title>
      <p>We would like to acknowledge that this research was supported by the Australian
Research Council (ARC) under grant DP0988961.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bobrik</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Parameterizable views for process visualization</article-title>
          .
          <source>Tech. rep., Centre for Telematics and Information Technology</source>
          , University of Twente (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bobrik</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>View-based process visualization</article-title>
          .
          <source>In: Proc. BPM</source>
          . pp.
          <fpage>88</fpage>
          -
          <lpage>95</lpage>
          . Springer (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cardoso</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arnold</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kochut</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Quality of service for workflows and web service processes</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>1</volume>
          (
          <issue>3</issue>
          ),
          <fpage>281</fpage>
          -
          <lpage>308</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ehrig</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koschmider</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oberweis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Measuring similarity between semantic business process models</article-title>
          .
          <source>In: Proc. APCCM</source>
          . pp.
          <fpage>71</fpage>
          -
          <lpage>80</lpage>
          . Australian Computer Society (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Fahland</surname>
          </string-name>
          , D., van der Aalst, W.:
          <article-title>Simplifying mined process models: An approach based on unfoldings</article-title>
          . In: Rinderle-Ma,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Toumani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Wolf</surname>
          </string-name>
          ,
          <string-name>
            <surname>K</surname>
          </string-name>
          . (eds.) Business Process Management,
          <string-name>
            <surname>LNCS</surname>
          </string-name>
          , vol.
          <volume>6896</volume>
          , pp.
          <fpage>362</fpage>
          -
          <lpage>378</lpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gottschalk</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>La</surname>
            <given-names>Rosa</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Process configuration in yawl</article-title>
          . In: Hofstede,
          <string-name>
            <given-names>A.H.M.</given-names>
            ,
            <surname>Aalst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.M.P.</given-names>
            ,
            <surname>Adams</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Russell</surname>
          </string-name>
          , N. (eds.) Modern Business Process Automation, pp.
          <fpage>313</fpage>
          -
          <lpage>382</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Gottschalk</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wagemakers</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jansen-Vullers</surname>
          </string-name>
          , M.,
          <string-name>
            <surname>van der Aalst</surname>
          </string-name>
          , W.,
          <string-name>
            <surname>La</surname>
            <given-names>Rosa</given-names>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Configurable process models: Experiences from a municipality case study</article-title>
          .
          <source>In: Advanced Information Systems Engineering, Lecture Notes in Computer Science</source>
          , vol.
          <volume>5565</volume>
          , pp.
          <fpage>486</fpage>
          -
          <lpage>500</lpage>
          . Springer Berlin / Heidelberg (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>La</given-names>
            <surname>Rosa</surname>
          </string-name>
          , M.,
          <string-name>
            <surname>van der Aalst</surname>
          </string-name>
          , W.,
          <string-name>
            <surname>Dumas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>ter Hofstede</surname>
          </string-name>
          , A.:
          <article-title>Questionnaire-based variability modeling for system configuration</article-title>
          .
          <source>Software and Systems Modeling</source>
          <volume>8</volume>
          ,
          <fpage>251</fpage>
          -
          <lpage>274</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Workflow modeling for virtual processes: an order-preserving processview approach</article-title>
          .
          <source>Information Systems</source>
          <volume>28</volume>
          ,
          <fpage>505</fpage>
          -
          <lpage>532</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mayer</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Killisperger</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumptner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grossmann</surname>
          </string-name>
          , G.:
          <article-title>A declarative framework for work process configuration</article-title>
          .
          <source>AI</source>
          EDAM
          <volume>25</volume>
          (
          <issue>2</issue>
          ),
          <fpage>145</fpage>
          -
          <lpage>165</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Pankratius</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stucky</surname>
            ,
            <given-names>W.:</given-names>
          </string-name>
          <article-title>A formal foundation for workflow composition, workflow view definition, and workflow normalization based on petri nets</article-title>
          .
          <source>In: APCCM</source>
          . pp.
          <fpage>79</fpage>
          -
          <lpage>88</lpage>
          . APCCM '
          <volume>05</volume>
          ,
          <string-name>
            <surname>Australian</surname>
            <given-names>Computer Society</given-names>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Polyvyanyy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smirnov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Process model abstraction: A slider approach</article-title>
          . In: EDOC. pp.
          <fpage>325</fpage>
          -
          <lpage>331</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Polyvyanyy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smirnov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Business process model abstraction</article-title>
          .
          <source>In: Handbook on Business Process Management 1</source>
          , pp.
          <fpage>149</fpage>
          -
          <lpage>166</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Smirnov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dijkman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendling</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Meronymy-based aggregation of activities in business process models</article-title>
          . In: Parsons,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Saeki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Shoval</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Woo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Wand</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y</surname>
          </string-name>
          . (eds.)
          <source>Conceptual Modeling ER</source>
          <year>2010</year>
          ,
          <article-title>LNCS</article-title>
          , vol.
          <volume>6412</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Smirnov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reijers</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nugteren</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Business process model abstraction: a definition, catalog, and survey</article-title>
          .
          <source>Distributed and Parallel Databases</source>
          <volume>30</volume>
          ,
          <fpage>63</fpage>
          -
          <lpage>99</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>