<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Learning from Quality Issues of BPMN Models from Industry (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Henrik Leopold</string-name>
          <email>h.leopold@vu.nl</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Mendling</string-name>
          <email>jan.mendling@wu.ac.at</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oliver Gu¨ nther</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <abstract>
        <p>Many organizations use business process models for documenting their business operations. In recent years, the Business Process Model and Notation (BPMN) evolved into the leading standard for process modeling. However, BPMN is complex: the specification offers a huge variety of different elements and often several representational choices for the same semantics. This raises the question of how well modelers can deal with these choices. Empirical insights into BPMN usage from the perspective of practitioners are still missing. We close this gap by analyzing a large set of BPMN 2.0 process models from practice. We found that particularly representational choices for splits and joins, the correct use of message flow, the proper decomposition of models, and the consistent labeling appear to be connected with quality issues. Based on our findings we give five recommendations how these issues can be avoided in the future. The work summarized in this extended abstract has been published in [LMG16].</p>
      </abstract>
      <kwd-group>
        <kwd>Process Model Quality</kwd>
        <kwd>BPMN Modeling Guidelines</kwd>
        <kwd>Modeling Recommendations</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Business process models play an important role for documenting business operations and
for formalizing business requirements in software engineering. In recent years, the
Business Process Model and Notation (BPMN) has become de-facto standard for process
modeling. A major challenge of BPMN in practice concerns its complexity resulting from the
considerable number of elements it offers, its sophisticated semantics, and its
representational choices.</p>
      <p>While the uptake of BPMN in practice has triggered the research community to study its
usage, empirical studies on how BPMN is actually used are scarce [zMR08, MH08, Re10].
The scope of most contributions is restricted to language properties, e.g. [Re11, FMS13],
instead of actual usage.</p>
      <p>With this paper, we aim to shed light on the actual usage of BPMN, and conducted a study
with six companies from industry. This way, we wanted to understand if quality issues
arise and how they can be prevented. The participating companies provided us access to a
total of 585 BPMN process models. We implemented an automatic guideline checker that
covers rules described in BPMN textbooks [Al09, Si11, WM08]. The results helped us to
learn about the frequency of different classes of modeling problems and to suggest a set of
measures to overcome them.</p>
    </sec>
    <sec id="sec-2">
      <title>A Study on BPMN Use in Industry</title>
      <p>For our study, we collected a total of 585 BPMN 2.0 process model from six companies.
The companies came from different industries and varied in size as well as in their degree
of modeling experience. Using a variety of automated techniques, we developed a tool for
checking a set of 35 well-known BPMN guidelines and correctness rules. This set covers
in particular the guidelines proposed by Silver [Si11] and Allweyer [Al09] as well as the
recommendations by White and Miers [WM08].
The structure category refers to the consistent and correct use of modeling elements such
as activities, gateways, events, pools, and flow connectors. For this category, our study
yielded mixed results. On the positive side, we found that about 99% of the investigated
models are free from any syntactical errors. On the negative side, we observe that 22%
of the models contain deadlocks and 42% contain multi merges. The biggest problem,
however, is caused by the inconsistent association of main and sub processes. In 86%
of all models containing sub processes, the roles of the sub process do not match the
corresponding role of the main process.</p>
      <p>The layout category is concerned with the proper positioning of the process model
elements in terms of cognitive effectiveness. Hence, it is the goal of the rules and guidelines
of this category to guarantee that a model can be easily read and understood. Our study
shows that only a few models suffer from issues such as inappropriate spacing, arcs
flow3 Note that there are no overlaps among the quality issues.
ing into the wrong direction, or inconsistent incoming and outgoing behavior.
Nevertheless, not all aspects are respected and implemented to the same degree. The biggest layout
issue concerns models of extensive size. About 47% of all models exceed the maximum
diagram size, i.e., they do not fit on a DIN A3 page.</p>
      <p>The labeling category refers to the proper use of natural language in the process model.
Our empirical results show that between 40% and 47% of the labels follow syntactic
patterns that are potentially ambiguous and hence may negatively affect the understanding of
the model [MRR10]. Moreover, glossaries are used by a fraction of modelers only. About
72% of all roles and 77% of all data objects are not linked to a glossary. As a result, an
inconsistent usage of roles and data objects can be expected.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Five Measures for Improved BPMN Modeling</title>
      <p>The results show that many quality and correctness criteria are well respected in the
investigated organizations. However, we also observed that many advanced structural concepts,
such as consistency among process models, process model size as well as the labeling of
process model elements, appear to be connected with quality issues. Apparently, the
available modeling recommendations and guidelines are not sufficiently clear. In the following,
we discuss the five major problem areas we identified and give specific recommendations
on how to avoid them.
1.</p>
      <p>Avoid implicit splits and joins: Implicit splits and joins via multiple outgoing and
incoming arcs are the major cause for deadlocks and multi merges. This problem is
caused by BPMN offering several options to represent such semantics. We therefore
recommend prohibiting the use of multiple arcs. The semantics of splits and joins
can be clearly and unambiguously defined using gateways.</p>
      <p>Provide tool support for proper model decomposition: Our empirical results show
that modelers may struggle with the proper decomposition of their models. Either
the models are too big, or they are not fully consistent. Since both problems can be
effectively enforced by a modeling tool, we recommend implementing respective
mechanisms.</p>
      <p>Omit the throwing message event: Our study suggests that message flow arcs may
cause several problems. Particularly the throwing message event appeared to cause
confusion. We hence recommend removing the throwing message event from the
symbol set. It is easy to use activities for throwing events instead.</p>
      <p>Establish centrally maintained glossary: The consistent reuse of central concepts
such as roles and data objects is an important requirement for a sound process
architecture. Thus, we propose to introduce a centrally defined glossary that either
automatically monitors and imports new terms or is regularly updated by a
dedicated glossary manager.</p>
      <p>Provide tool support for applying linguistic checks during the modeling process:
Achieving consistency with respect to the structural use of natural language seems
to be difficult. The most effective measure seems to communicate such
inconsistencies already during the modeling process. Modeling tools could use techniques such
as refactoring to automatically suggest a correct version of a non-compliant label
[LSM12].</p>
      <p>A closer look at our recommendation list reveals that particularly recommendations 1 and
3 can be traced back to the representational choices of BPMN. The modelers from the
investigated organizations struggled with correctly dealing with these choices and
incorporated errors that should be avoided. Our recommendations have the advantage that they
do not restrict the expressive power of BPMN. Instead, they help the modeler to select a
preferable representation when a specific pattern of behavior needs to be expressed.
Recommendations 2, 4, and 5 refer to quality issues that may also occur in other process
notations such as Event-driven Process Chains or UML activity diagrams. Still, our study
demonstrates that also BPMN models may suffer from these problems. Hence, also these
recommendations contribute to a consistent process architecture.
[Al09]
[MH08]
[Re10]
[Re11]
[Si11]
[WM08]</p>
      <p>Allweyer, Thomas: BPMN 2.0 - Business Process Model and Notation. Books on
Demand GMBH, Norderstedt, 2nd edition, 2009.</p>
      <p>Muehlen, Michael zur; Ho, Danny T: Service process innovation: a case study of BPMN
in practice. In: Hawaii international conference on system sciences, proceedings of the
41st annual. IEEE, pp. 372–372, 2008.
Recker, Jan: Opportunities and constraints: the current struggle with BPMN. Business
Proc. Manag. Journal, 16(1):181–201, 2010.</p>
      <p>Silver, Bruce: BPMN Method and Style, with BPMN Implementer’s Guide.
CodyCassidy Press, 2nd edition, January 2011.</p>
      <p>White, Stephen A.; Miers, Derek: BPMN Modeling and Reference Guide: Understanding
and Using BPMN. Future Strategies Incorporated, 2008.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[zMR08] zur Muehlen, Michael; Recker, Jan: How Much Language Is Enough? Theoretical and Practical Use of the Business Process Modeling Notation</article-title>
          .
          <source>In: Proceedings of CAiSE</source>
          . Springer-Verlag, pp.
          <fpage>465</fpage>
          -
          <lpage>479</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>