<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Language-centric approaches for improving business process model acceptance</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lars Ackermann</string-name>
          <email>lars.ackermann@uni-bayreuth.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bayreuth</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Business process modeling is a crucial task in business process management. The plethora of process modeling languages available o ers exibility but often leads to language barriers, too. This pertains to, e.g., the comprehensibility of a model in case that a stakeholder is unfamiliar with the modeling language and to its compatibility with \foreign-language" process execution systems. Among others literature and several research projects suggest (i) a meticulously compiled set of process execution traces, (ii) a translation into a suitable language and (iii) a model description in natural language as means that cover the potential to solve these issues. This abstract and the associated dissertation describe approaches that automatically generate these artifacts. Comparable trace generators solely operate on imperative or plain controlow based process models. Translation techniques and natural-languagetext generators for process models are rare and mostly limited to a speci c, imperative language. The approaches are evaluated by their application to various exemplary models and through a qualitative survey.</p>
      </abstract>
      <kwd-group>
        <kwd>business processes model acceptance</kwd>
        <kwd>trace generation</kwd>
        <kwd>process model translation</kwd>
        <kwd>natural language generation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Modeling an organization's business processes has proved to be a crucial task in
business process management (BPM). Process models often play a dual role, i.e.
they are used as means of communication between human process stakeholders
and as a speci cation for system-aided process enactment [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        As a means of communication a model has to be comprehensible [
        <xref ref-type="bibr" rid="ref12 ref2">2</xref>
        ], requiring
at least that the stakeholders are familiar with the modeling language used. Due
to the plethora of highly diverse process modeling languages it is hard to get
familiar with all of them. Especially the notions of imperative and declarative
modeling often cause confusions [
        <xref ref-type="bibr" rid="ref13 ref3">3</xref>
        ]. Imperative languages specify all permitted
process instances explicitly. In contrast declarative process modeling languages
are used to describe process instances implicitly based on rules. However,
comprehensibility for human stakeholders does not imply that a process model ships
with a formal speci cation of its execution semantics (e.g. swimlanes in BPMN),
that it is compatible with the desired business process execution system and not
even that it is a formal model at all [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Thus, if a process model is meant to play
this dual role an adverse language selection or tradeo between the two
requirement dimensions might cause language barriers and, thus, acceptance issues.
      </p>
      <p>
        The PhD thesis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] associated with this abstract identi es three artifacts
which were likewise considered to be useful in literature and in several research
projects1 in order to improve a process model's acceptance:
{ A carefully selected set of intentionally valid/invalid process execution traces,
{ a model translation into a desired target language and
{ a natural-language process description.
      </p>
      <p>The following discussions focus on approaches that derive the artifacts
automatically from process models o ering a scalable user-side control over the results.</p>
      <p>
        Process execution traces are usually records of real-world process instances.
A selected set of traces with certain properties (e.g. a particular activity has to
be executed) in combination with the source model might help to understand the
model's behavioral semantics. Additionally, several cognitive studies showed that
learning formal speci cations can be boosted by positive and negative examples.
However, traces of real-world process instances have several drawbacks, including
uncontrolled contents and noise and sometimes they are simply not available
(e.g. due to privacy policies [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]). Generating them arti cially (Sec. 2.1) shifts
the control over contents and noise to the generation tool and, thus, to its user.
      </p>
      <p>
        Translating a process model manually is a cumbersome and error-prone
task [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Consequently translation systems for several process modeling languages
have been developed. However, the overall coverage of a selection of 15 common
process modeling languages is rather low (10% direct, 20% including transitive
closure). Therefore, Sec. 2.2 describes a complementing translation method.
      </p>
      <p>
        Natural language process descriptions can solve language barriers since, in
contrast to formal languages, natural languages are a common means of
communication in daily life. Compiling natural language process descriptions manually
and keeping them consistent can be time-consuming and requires ongoing
effort [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Natural-language-text generation (NLG) techniques can overcome this
issue but there are none for declarative process models. Thus, the NLG approach
for declarative process models sketched in Sec. 2.3 is a necessary complement.
2
      </p>
      <p>Improving process model acceptance: Three approaches
This section summarizes the basic ideas and innovative facets of three approaches
(Fig. 1) for generating the artifacts discussed in the introduction.</p>
      <p>In a nutshell the main advantage of the trace generator is its ability to
create traces with controllable contents for declarative process models that cover
also process aspects beyond control ow (e.g. organizational or data-related
dependencies). The translation approach for process models avoids a cumbersome,
language-speci c de nition of mapping rules between source and target
metamodel elements and is generic. This increases the overall coverage of supported
1 e.g. the C2P2 project (http://kppq.de/) that focused on tools for all BPM tasks
process modeling language pairs from 10 to 25%. Finally, the major innovation
of the NLG approach is, that it operates on declarative process models.
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Trace generation (MuDePS)</title>
      <p>
        Trace generation means to simulate process executions in discrete time steps
and to log each execution state as an event. An event is characterized by several
attributes, e.g. a timestamp, an activity name and a snapshot of the current
values of all process variables [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. A collection of traces is called event log.
      </p>
      <p>
        Most existing trace generators operate on imperative process models making
them not applicable to declarative process models [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However, some approaches
o er features that are also relevant for the approach discussed below (MuDePS),
e.g. (i) support for di erent process perspectives, (ii) con gurable contents,
intentional noise and (iii) a con gurable initial state. Business process modeling
languages di er widely in terms of their expressiveness which can be analyzed
based on so called process perspectives [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Single-perspective languages, such
as Petri nets, are limited to descriptions of the control ow. Multi-perspective
notations (e.g. BPMN) also involve organizational and data-related information.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] one of the few trace generators for declarative process models is
described. It is based on regular expressions which is applicable to
single-perspective, declarative models but cannot be applied to multi-perspective models. It
would cause an alphabet of in nite size if a process model contains variables with
continuous value ranges. A few other approaches operate on multi-perspective,
declarative models but are unable to generate arti cial noise because their
simulation capabilities are limited to an exploration of the positive solution space.
      </p>
      <p>
        MuDePS overcomes these drawbacks by translating the process model into
the logic language Alloy [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] which means transforming the trace generation
problem into a satis ability (SAT) problem. An included analysis engine
transforms the speci cation into a set of boolean equations that can be evaluated
by deterministic SAT solvers. By formulating the process rules as constraints
the SAT solvers determine valid solutions for a con gurable number of events
per trace. Alternatively the rules can be formulated as assumptions causing the
analysis engine to search for counter examples. Constraints can further be used
to con gure the initial state for the generation and desired trace contents.
2.2
      </p>
    </sec>
    <sec id="sec-3">
      <title>Process model translation (SiMiTra)</title>
      <p>The coverage of existing translation approaches for process models is rather
low (Sec. 1). This issue is intensi ed considering that for n languages (n2-n)
transformation rule sets are needed to enable translations between all languages.</p>
      <p>
        SiMiTra is a translation approach that discards the traditional principle that
is based on meta-model transformation rules [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and instead relies on
combinations of existing trace generation and process mining techniques. It uses event
logs as a transfer medium which decreases the overall number of required
translation components to 2n. Process mining approaches are algorithms that discover
process models from event logs, usually based on co-occurrence thresholds for
event attributes. These thresholds are used to compensate undesired noise in
event logs of real-world process executions. However, logs in the SiMiTra
translation system are noise-free as they are created by trace generators { assuming
they work as expected. Consequently, all thresholds should be set as tolerantly as
possible since most trace generators are based on local, random decisions which
might cause infrequently occurring trace contents that would be ltered as noise
otherwise. However, log completeness still cannot be guaranteed since process
variables can have a continuous value range causing an in nite solution space.
Additionally, the thesis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] discusses limitations regarding the expressiveness of
event logs, which are for instance, their inability to explicitly encode decision
points or to di erentiate between static and dynamic relations of process entities.
2.3
      </p>
    </sec>
    <sec id="sec-4">
      <title>Natural language generation (NL4DP)</title>
      <p>
        NL4DP a NLG approach for multi-perspective process models combines
guaranteed model-text consistency with customization options based on a given user
model and purpose. The backbone of the technique is a well-established pipeline
model [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] consisting of three transformation phases. NLG techniques for
imperative process models (e.g. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]) make use of the native structuring properties of
the directed-graph-based models themselves in order to compose a coherent text.
However, this cannot be applied to declarative models and dedicated techniques
for the latter are not available. Hence, the main contribution in the rst phase
(Document Planning) is an algorithm that groups related process rules based
on rule patterns and common parameters and a transformation of the relevant
information from the process model into formal sentence speci cations. In the
remaining two phases the results of the rst phase are re ned (Microplanning)
by using di erent formulations depending on whether the generated texts are
explanations of single rules during process execution or a complete description
of the model for documentation purposes. In the former case the texts are also
customized in order to address a user explicitly if applicable. Finally, standard
Surface Realization techniques derive one natural language sentences from each
sentence speci cation which are then compiled to a text document according to
the speci ed macrostructure.
3
      </p>
      <p>Implementation and Evaluation
MuDePS and SiMiTra are implemented as an operator2 and a set of process
templates for the RapidMiner data mining platform. NL4DP is based on Eclipse
and the Acceleo text generation plugin3.</p>
      <p>Since the trace generator relies on matured SAT solvers the translation of
process rules into Alloy is the only error source which could be eliminated by
means of Alloy-based test cases. SiMiTra has been evaluated based on
conformance checking between the resulting model and an unseen log generated from
the source model. The results are encouraging but di er depending on the
complexity of the source model. NL4DP has been evaluated through its application
to selected process models and comparing the generated with manually created
texts. Finally, a qualitative survey presages that the three artifacts can be
valuable means for understanding and analyzing declarative process models.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. W. van der Aalst, Process Mining: Data Science in Action. Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Reijers</surname>
          </string-name>
          , and W. van der Aalst, \
          <article-title>Seven process modeling guidelines (7PMG),"</article-title>
          <source>Information and Software Technology</source>
          , vol.
          <volume>52</volume>
          , no.
          <issue>2</issue>
          , pp.
          <volume>127</volume>
          {
          <issue>136</issue>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>D.</given-names>
            <surname>Fahland</surname>
          </string-name>
          , D. Lubke, J. Mendling,
          <string-name>
            <given-names>H.</given-names>
            <surname>Reijers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Weber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weidlich</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Zugal</surname>
          </string-name>
          , \
          <article-title>Declarative versus imperative process modeling languages: The issue of understandability," in Ent., BP and Inf</article-title>
          . Sys. Mod., pp.
          <volume>353</volume>
          {
          <issue>366</issue>
          , Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>L.</given-names>
            <surname>Ackermann</surname>
          </string-name>
          , Sprachzentrierte Ans
          <article-title>atze zur Steigerung der Akzeptanz von Geschaftsprozessmodellen</article-title>
          .
          <source>PhD thesis</source>
          , University of Bayreuth, Marz
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>A.</given-names>
            <surname>Burattin</surname>
          </string-name>
          , \
          <article-title>PLG2: Multiperspective Process Randomization with Online and O ine Simulations," in Onl. Proc. of the BPM Demo Track, CEUR-WS</article-title>
          .org,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>K.</given-names>
            <surname>Czarnecki</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Helsen</surname>
          </string-name>
          , \
          <article-title>Feature-based survey of model transformation approaches," IBM Systems Journal</article-title>
          , vol.
          <volume>45</volume>
          , no.
          <issue>3</issue>
          , pp.
          <volume>621</volume>
          {
          <issue>645</issue>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>E.</given-names>
            <surname>Reiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dale</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <article-title>Building natural language generation systems</article-title>
          , vol.
          <volume>33</volume>
          . MIT Press,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Ciccio</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Bernardi</surname>
          </string-name>
          , \
          <article-title>Generating Event Logs through the Simulation of Declare Models,"</article-title>
          <source>in EOMAS workshop</source>
          , pp.
          <volume>20</volume>
          {
          <issue>36</issue>
          , Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>S.</given-names>
            <surname>Jablonski</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Bussler</surname>
          </string-name>
          ,
          <article-title>Work ow management: modeling concepts, architecture and implementation</article-title>
          . International Thomson Computer Press,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>D. Jackson</surname>
          </string-name>
          ,
          <article-title>Software Abstractions: logic, language, and analysis</article-title>
          . MIT press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. H.
          <string-name>
            <surname>Leopold</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Mendling</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Polyvyanyy</surname>
          </string-name>
          , \
          <article-title>Supporting process model validation through natural language generation,"</article-title>
          <source>IEEE TSE</source>
          , vol.
          <volume>40</volume>
          , no.
          <issue>8</issue>
          , pp.
          <volume>818</volume>
          {
          <issue>840</issue>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <article-title>2 There is also a standalone version of MuDePS: https</article-title>
          ://youtu.be/JhqSiAxChKQ
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <article-title>3 All tools and some test data are available</article-title>
          at http://mps.kppq.de.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>