<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards Understandability Evaluation Models using Activity Textual Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrii Kopp</string-name>
          <email>kopp93@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dmytro Orlovskyi</string-name>
          <email>orlovskyi.dm@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergey Orekhov</string-name>
          <email>sergey.v.orekhov@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Technical University “Kharkiv Polytechnic Institute”</institution>
          ,
          <addr-line>Kyrpychova str. 2, Kharkiv, 61002</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>There are two purposes of business process modeling. Business process models are created by business analysts for understanding, analysis, and improvement of process scenarios, search, and elimination of weak spots and bottlenecks in organizational activities. Another purpose of business process models is the requirements engineering in software development projects. In both cases, the quality of created business process models is the core issue. Poor models are similar to text documents written with mistakes - they are not understandable, which may negatively impact the real processes they represent and the software workflows they describe. However, existing studies in the field of business process model quality mostly focus on the structural analysis of models using size, complexity, and other metrics with thresholds, while the textual analysis of activity labels is omitted. Therefore, in this paper, we propose an approach to the analysis of business process model understandability taking into account best practices of activity labeling. The proposed approach includes the use of natural language processing techniques, so the respective software tool was developed to perform experiments with a set of business process models. According to obtained results, we suggest considering both textual and structural qualities to achieve the understandability of business process models due to the bad correlation between these metrics (0.0171) - well-structured models can have unclear activity labels and vice versa.</p>
      </abstract>
      <kwd-group>
        <kwd>1 Business Process Model</kwd>
        <kwd>Model Quality</kwd>
        <kwd>Model Understandability</kwd>
        <kwd>Textual Analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction: Related Work and Problem Statement</title>
      <p>
        Business processes are organized sequences of activities that take different kinds of input and
produce value for customers, e.g. goods or services. Nowadays Business Process Management (BPM)
is the widely used management approach. This approach is based on the business process modeling
technique – a visual representation of organizational activities, events, and decisions using graphical
diagrams. Business process models are the most valuable assets of the BPM lifecycle. They help to
design, analyze, improve, and automate organizational workflows [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Business process modeling
helps stakeholders to understand, capture (i.e. document using graphical models), analyze, and
improve the enterprise workflows. The analysis stage includes performance measurement and errors
detection activities, which help to improve captured business processes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
1.1.
      </p>
      <p>
        According to the analysis of the latest survey, there are various business process modeling
notations used to document business operations in companies that practice the BPM approach [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]:
 64% of respondents use BPMN (Business Process Model and Notation);
 18% of survey participants use EPC (Event-driven Process Chain);
 4% of organizations use IDEF-based notations, e.g. IDEF0 and DFD (Data Flow Diagram).
      </p>
      <p>
        Other survey participants use less popular business process modeling notations, however, the
BPMN notation is a leader and currently the de-facto standard for business process modeling [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        According to [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], BPMN models describe workflows as sequences of tasks and events connected
using control flows (Fig. 1). Moreover, business processes described using the BPMN notation
contain start events and end events to signalize their beginning and finishing (Fig. 1). Hence, the
simplest BPMN business process consists of events and activities [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]:
 things that happen in an instant are represented by events;
 activities are work units that have a set duration.
      </p>
      <p>
        Also, events and activities are logically related in a business process workflow using sequences. A
sequence means that one event or activity is followed by another event or activity [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Fig. 1 shows
the most basic business process structure, described using BPMN graphical notation, that consists of
events (start and end) and activities connected using sequences (also referred to as arcs).
      </p>
      <p>According to Fig. 1, when describing a business process using BPMN graphical notation, the
modeler should answer the following questions:
 “when a new instance of the business process starts?” – for the start event;
 “when the instance completes?” – for the end event;
 “what to do on the particular process step?” – for activities.</p>
      <p>
        Thus, if events are usually named as combinations of nouns followed by verbs in past participle
form (i.e. “order received”, “order fulfilled”), which is quite intuitive, empirical studies have shown
that real-world business process models created by many practitioners do not always follow naming
conventions for activities [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The verb-object labeling style (i.e. a verb in infinitive form followed by
the noun: “submit order”, “confirm order”, etc.) is recommended for activity labels [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This rule is
even included in the Seven Process Modeling Guidelines (7PMG) by Mendling et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Fig. 1 demonstrates all the essential elements of BPMN graphical notation [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Advanced business process models created using BPMN graphical notation may contain particular
elements to demonstrate the branching and merging workflow scenarios, business process boundaries,
and participants. Gateways (Fig. 2) are particular elements that define parallel (AND), inclusive (OR),
or exclusive (XOR) branching within workflow scenarios. Pools describe the boundaries of business
processes, while lanes define different roles of business process participants [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>According to [8], there are various metrics and thresholds exist to evaluate BPMN models:
 size (i.e. the number of tasks, events, gateways, and control flows).
 gateway mismatch (the sum of gateway pairs of different types).
 connectivity coefficient (the number of arcs divided by the number of nodes).
 control flow complexity (the sum of gateways weighted by their possible combinations of
states after the split).</p>
      <p>Other studies are also focused mostly on size metrics for the evaluation of business process model
efficiency from understandability and maintainability views:
 authors of [9] have analyzed a large collection of BPMN models created by practitioners and
found that improper usage of splits and joins, message flows, decomposition, and labeling lead to
the poor quality of business process models;
 in [10] authors propose control-flow complexity metrics and corresponding threshold values
they have obtained using data mining techniques to help designers evaluate the quality of business
process models;
 authors of [11] formulate the importance of having high-quality business process models as
inputs for requirements engineering since the quality of BPMN models influences the software
quality; however, this study proposes quality checklists for model reviewers instead of metric and
formal approaches to verify the business process model quality.</p>
      <p>We have discovered within the context of BPMN and quality assurance two more interesting
studies [12] and [13] that consider the quality of the business process itself and do not analyze the
quality of a business process model reflecting a particular process.
1.2.</p>
    </sec>
    <sec id="sec-2">
      <title>Problem Statement</title>
      <p>Thus, poorly designed business process models are hard for understanding and maintenance, and
they cannot be efficiently used to document business operations, measure business performance, or
find workflow errors that may reduce organizational performance. However, existing studies mostly
focus on structural analysis of BPMN model flow using the size and control-flow metrics, and
thresholds, while relatively smaller attention is paid to the textual analysis of activity labels used in
business process models.</p>
      <p>Hence, in this study, we propose to pay more attention to labeling styles used for business process
model activities (i.e. tasks and collapsed sub-processes) when analyzing the understandability of
BPMN models. The soundness of the business process model structure is extremely important for the
proper understanding of process scenarios, decisions, occurring events, and other important workflow
elements by readers. However, improper naming of activities may mislead the essential understanding
of which particular tasks should be completed on each step of the business process scenario or which
exactly sub-processes should be initialized. This misunderstanding caused by invalid activity labels
can negatively impact business processes and software guided by business process models with these
poorly-described activities.</p>
      <p>Let us formally describe a business process model as a coherent directed labeled graph [14]:
BPGraph  N, F, L, ,
(1)
where:
 N is the set of business process elements, which includes subsets of activities A , events E ,
and gateways G ;
 A is the set of activities;
 E is the set of events, which includes subsets of start events E s , intermediate events E i , and
end events Ee ;
</p>
      <p>G is the set of gateways, which includes subsets of XOR gateways G xor , AND gateways
G and , and OR gateways Gor ;
 F is the set of sequence flows between business process elements, F  N  N ;
 L is the set of labels defined for business process elements and sequence flows;
  is the mapping that assigns labels to business process elements and sequence flows,
 : N  F  L .</p>
      <p>Thus, the formal statement of a high-quality business process modeling to achieve understandable
diagrams may be given as the following:
(2)</p>
      <sec id="sec-2-1">
        <title>QStructural BPGraph  max,</title>
      </sec>
      <sec id="sec-2-2">
        <title>QTextual BPGraph  max,</title>
        <p>models, QStructural : BPGraph  0,1;
models, QTextual : BPGraph  0,1.</p>
        <p>QStructural is the mapping that assigns respective structural quality values to business process
</p>
        <p>
          QTextual is the mapping that assigns respective textual quality values to business process
Equation (2) formally describes the problem of business process modeling, according to which
created BPMN diagram should be of maximum structural and textual quality [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>The demonstrated graph (1) can be built automatically, as the result of a BPMN file processing,
which is the XML (eXtensible Markup Language) document created according to the specific schema
of the BPMN 2.0 format [15].</p>
        <p>Hence, we suggest the following workflow of the approach to understandability evaluation of
BPMN 2.0 business process descriptions (Fig. 3).</p>
        <p>The proposed approach (Fig. 3) may not only allow evaluation of the understandability of BPMN
models based on the textual analysis of business process activities but also answer the following
question – “does the structural quality of business process models affects their textual quality?”. This
may help to formulate recommendations for business process modelers to pay attention not only to the
structural soundness of created diagrams but also to the textual quality of described business process
steps to achieve better understandability of models and make sure they serve their purpose.</p>
        <p>Therefore, in this study, we need an approach to the textual analysis of business process model
activity labels to elaborate the techniques of understandability evaluation of BPMN diagrams. We
assume that our approach may include the use of Natural Language Processing (NLP) techniques and
work with collections of BPMN 2.0 files, so the particular software tool should be developed to
perform experiments with a set of business process models. In general, this study considers the
process of business process modeling using BPMN graphical notation and aims at the improvement of
created models’ quality to assure their understandability by stakeholders for organizational activity
analysis and software engineering.</p>
        <p>The rest of this paper is organized as follows. Section 2 outlines the textual analysis approach for
the evaluation of business process model understandability. Section 3 proposes the structural analysis
of business process models based on metrics and thresholds. Section 4 includes experiments, analysis,
and discussion of the obtained results.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>2. Textual Analysis of Business Process Model Activity Labels</title>
    </sec>
    <sec id="sec-4">
      <title>2.1. Activity Labels Extraction from BPMN Models</title>
      <p>Before the proposed approach outline, let us demonstrate the sample BPMN 2.0 business process
model and its file representation (Fig. 4). According to the example below (Fig. 4), the “process” tag
includes all core business process items such as events (i.e. “startEvent” and “endEvent”), activities
(i.e. “task”), and sequence flows (i.e. “sequenceFlow”) [16]. Thus, it is quite easy to read such an
XML document and represent it formally using the coherent directed labeled graph (1).</p>
      <p>Described graph (Fig. 4) consists of the following sets of business process items:
 start events E s  e1s ;</p>
      <p>end events E e  e1e ;

 activities A  a1, a2;
 sequence flows F  f1, f2 , f3.</p>
      <p>In addition, the mapping  assigns labels to business process elements and sequence flows, which
can be extracted using the “name” attribute of respective tags (Fig. 4):
  e1s  "Order received" – using the “name” attribute of the “startEvent” tag;
  a1  "Confirm order" – using the “name” attribute of the first “task” tag;
  a1  "Send goods" – using the “name” attribute of the second “task” tag;
</p>
      <p> e1e  "Order fulfilled" – using the “name” attribute of the “endEvent” tag.</p>
      <p>Therefore, it is possible to obtain the set of activity labels Lactivity  L :</p>
      <p>Lactivity  liactivity , i  1, A ,
where liactivity is the label assigned to the i -th activity ai  A , i  1, A .</p>
      <p>Tokenize each activity label liactivity  Lactivity , i  1, A to get bags of words that correspond to
(3)
(4)
(5)
 is the mapping that assigns a bag of words wactivity W activity to each activity label
i
liactivity  Lactivity , i  1, A ;
</p>
      <p>W activity is the collection of bags of words wactivity W activity formulated for each activity label
i
liactivity  Lactivity , i  1, A .
2. For each word of tokenized activity labels (4) define one or several parts of speech to which it
belongs:</p>
      <p> : wiactivity  PoS,
 is the mapping that assigns one or several parts of speech PoSi  PoS to each word that
belongs to the bag of words wactivity W activity created for each activity label liactivity  Lactivity ,
i
 PoS is the set or all parts of speech that can be assigned to each of words in tokenized
activity labels, PoS  Noun,Verb, Adjective, Adverb.
3. For each activity label check its length (i.e. the number of words it contains) and if the label
consists of at least two words, check if the first and second words are verbs and nouns
correspondingly (5):
0, wiactivity  liactivity   2,

i1, A liactivity  Lactivity : qiactivity liactivity   1,Verb wiactivity 0  Noun wiactivity 1,

(6)
0, else,


where qiactivity is the mathematical logic predicate that returns 1 for activity labels that match the
verbobject labeling style and 0 for activity labels that do not match the verb-object labeling style,
qiactivity 0,1.</p>
      <p>A i1
Fig. 5 demonstrates the algorithm of the proposed activity labels analysis method.
4. Calculate the textual quality as the ratio between the number of activities, which labels match
the verb-object labeling style (6), and the total number of business process activities:</p>
      <p>QTextual BPGraph   1 A qiactivity liactivity . (7)</p>
      <p>Activity labels tokenization and part of speech assignment to extracted words can be achieved
using particular NLP software components, which will be used for experiments in Section 4.</p>
    </sec>
    <sec id="sec-5">
      <title>3. Structural Analysis of Business Process Models based on</title>
    </sec>
    <sec id="sec-6">
      <title>Thresholds</title>
    </sec>
    <sec id="sec-7">
      <title>Metrics and</title>
      <p>Let us also describe the method for structural analysis of business process models to then answer
the question of how the structural quality of business process models affects their textual quality.</p>
      <p>N is the number of nodes;
E s is the number of start events;
E e is the number of end events;</p>
      <p>
        G or is the number of OR gateways.
2. Therefore, using business process modeling guidelines defined in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the following
threshold values can be defined for the respective structural metrics (8):
      </p>
      <p>TStructural  31,2,2,0. (9)</p>
      <p>
        Given threshold values (9) reflect the business process modeling guidelines suggested by authors
of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which say:
 do not use more than 31 nodes;
 do not use more than 2 start and end events;
 do not use OR gateways.
      </p>
      <p>These threshold values (9) were also confirmed in the latest paper by Mendling et al. [17].
3. Then, using values of the basic structural metrics (8) and corresponding threshold values (9),
calculate the structural quality as the average of inverse sigmoid function results:
(8)
(10)
(11)</p>
      <p>
        M Structural   N , E s , E e , G or ,
1. Calculate values of the basic structural metrics proposed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] to manage the business
process model’s structural quality:
      </p>
      <sec id="sec-7-1">
        <title>QStructural BPGraph  </title>
        <p>1
M Structural</p>
        <p>M Structural
 vm j , t j ,
j1
m j is the value of j -th structural metric (8);
t j is the threshold value for j -th structural metric (9);
vm j , t j  is the function that returns values in the range 0,1 :
1, m j  t j ,

vm j , t j    1</p>
        <p>1  em j t j 1 , m j  t j .</p>
        <p>In (11) obtained vm j , t j   1 values signalize that the value of j -th structural metric m j
while smaller values vm j , t j   1
completely corresponds to the respective threshold value t j
signalize violations of thresholds (9) by the metric values (8).</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>4. Results and Discussion</title>
      <p>Let us use the collection of BPMN diagrams created during business process modeling training
sessions by Camunda company. This collection of BPMN 2.0 diagrams includes four subsets that
describe four business processes: goods dispatch, insurance recourse, credit-scoring, and self-service
restaurant flows. It is freely available in Camunda’s GitHub repository for research purposes [18].</p>
      <p>In general, this dataset includes 197 models in English:
 67 models are alternative versions that describe the goods dispatch business process;
 47 models are alternative versions that describe the insurance recourse business process;
 34 models are alternative versions that describe credit-scoring business processes;
 49 models are alternative versions that describe self-service restaurant business processes.</p>
      <p>Hence, to perform experiments with such a collection of BPMN 2.0 files, the software tool was
created. It was built using the Python programming language, which has a great tool NLTK (Natural
Language Toolkit) for working with computational linguistics [19].</p>
      <p>Fig. 6 below demonstrates the workflow and dependencies of the developed software tool, which
will be used to perform experiments in this study.</p>
      <p>According to Fig. 6, the developed software tool uses the following external packages:
 the “os” and “xml” packages for working with the file system and processing BPMN 2.0
models that are stored as XML files;
 the “nltk” package for tokenization of activity labels (the “word_tokenize” utility) and words
tagging (the “wordnet” lexical database);
 the “math” package for calculations, e.g. exponentiation;
 the “pandas” package for the correlation analysis to study the relationship between business
process models’ textual and structural quality.</p>
      <p>Table 1 below shows correlation analysis results obtained using the Pandas package that allows the
computation of the Pearson standard correlation coefficient [20].</p>
      <p>Calculated correlation analysis results (Table 1) demonstrate bad correlation (0.0171) which means
there is no relationship between textual (7) and structural (10) quality coefficients calculated for each
of the experimental BPMN business process models [18].</p>
      <p>All of these business process models were designed by different persons that were using textual
descriptions of business processes they are supposed to create as part of BPMN training sessions.
Thus, we may conclude that textual and structural quality dimensions of business process modeling
using BPMN graphical notation are not connected. For example, among the obtained calculation
results we can discover perfect BPMN models from the textual quality point of view, but poor BPMN
models from the structural quality point of view and vice versa.</p>
      <p>Table 2 demonstrate such cases:
 the business process model of high textual quality (1.00) has structural issues (0.88) – the OR
gateway is used (Fig. 7);
 the business process model of high structural quality (1.00) has poor textual quality (0.43) – 4
of 7 activities has labelling style that does not match the recommended verb-object style.</p>
      <p>Indeed, the model can be perfectly structured but have uninformative activity labels (see 2nd row in
Table 2), while there could be desired labeling style used (e.g. verb-object style as the recommended
best practice) but the process scenario can be poorly structured so there will be barely understandable
in which way activities and events follow each other (see 1st row in Table 2).</p>
    </sec>
    <sec id="sec-9">
      <title>5. Conclusion and Future Work</title>
      <p>In this paper, we addressed the problem of the understandability evaluation of business process
models using the textual analysis of activity labels. We focused on the BPMN diagramming notation
since it is the de-facto standard for business process modeling nowadays, which allows the creation of
not only visual models but also machine-readable XML-alike files for interexchange between BPM
suites and workflow automation. As it was discovered in the related work in the domain of business
process model quality analysis, the structural-based approaches that use metrics and thresholds are
much more elaborated than approaches based on textual analysis of BPMN activity labels. We
identified this situation as a serious limitation – a business process model can have a perfect structure
but can have poorly labeled activities making such a model hard to understand by involved
stakeholders. Poor models that are not understandable can lead to errors in organizational
improvement and software development projects, cause extra resource allocation to fix arising errors,
and, therefore, more costs.</p>
      <p>Therefore, in this paper, we proposed an approach to the analysis of business process models’
understandability taking into account best practices of activity labeling. The proposed approach and
the software tool created for experimental processing of the sample BPMN 2.0 files collection are
based on particular NLP techniques such as tokenization and part of speech tagging.</p>
      <p>Obtained results confirm that the structural quality of a business process model does not mean its
understandability since there is a bad correlation between these metrics (0.0171). Provided examples
(Fig. 7 and 8, Table 2) show how the models of high textual quality (1.00) can be of moderate
structural quality (0.88) and vice versa – how the models of poor textual quality (0.42) can be of high
structural quality (1.00). Therefore, understandable business process models, which are valuable for
the stakeholders, should demonstrate high textual and structural quality.</p>
      <p>Thus, we can recommend business process modelers pay for the textual quality and proper activity
labeling as much attention as they pay to the structural quality of business process scenarios. Having a
business process model both structurally and textually sound will make it serve its initial purpose to
communicate knowledge about ongoing or planned business processes.</p>
      <p>Future work in this field may include the use of advanced NLP and machine learning methods and
techniques to allow the automatic correction of poorly named activity labels to ensure the
understandability of business process models. Also, more advanced metrics of structural analysis can
be applied to continue the study of the relationship between the textual and structural quality of
business process models.</p>
    </sec>
    <sec id="sec-10">
      <title>6. References</title>
      <p>[8] F. Corradini, F. Fornari, S. Gnesi, A. Polini, B. Re, Quality assessment strategy: Applying
business process modelling understandability guidelines, University of Camerino, Italy, 2015.</p>
      <p>URL: https://openportal.isti.cnr.it/data/2017/380283/2017_380283.pdf
[9] L. Henrik, J. Mendling, O. Günther, Learning from quality issues of BPMN models from
industry, IEEE software 4(33) (2015) 26–33. doi:10.1109/MS.2015.81
[10] W. Kbaier, S. A. Ghannouchi, Determining the threshold values of quality metrics in BPMN
process models using data mining techniques, Procedia Computer Science 164 (2019) 113–119.
doi:10.1016/j.procs.2019.12.161
[11] W. M. C. da Silva, A. P. F. Araújo, M. T. Holanda, R. T. de Sousa Jr., A Method for Quality
Assurance for Business Process Modeling with BPMN, in: Developments and Advances in
Intelligent Systems and Applications, Springer, Cham, 2018, pp. 169–179.
doi:10.1007/978-3319-58965-7_12
[12] A. L. da Costa, S. A. F. Salles, R. L. Carvalho, A. S. C Morais, S. V. and Silva, BPMN and
quality tools for process improvement: a case study. Gepros: Gestão da Produção, Operações e
Sistemas 14(4) (2019) 156–175. doi:10.15675/gepros.v14i4.2308
[13] P. Peggy, H. Schlieter, Process-based quality management in care: adding a quality perspective
to pathway modelling, in: OTM Confederated International Conferences “On the Move to
Meaningful Internet Systems”, Springer, Cham, 2019, pp. 385–403.
doi:10.1007/978-3-03033246-4_25
[14] M. T. Gómez-López, J. M. Pérez-Álvarez, A. J. Varela-Vaca, R. M. Gasca, Guiding the creation
of choreographed processes with multiple instances based on data models, in: International
Conference on Business Process Management, Springer, Cham, 2016, pp. 239–251.
doi:10.1007/978-3-319-58457-7_18
[15] M. Kurz, F. Menge, Z. Misiak, Diagram Interchangeability in BPMN 2, 2014. URL:
https://www.omg.org/oceb-2/documents/BPMN_Interchange.pdf
[16] Business Process Model and Notation (BPMN), Version 2.0, 2011. URL:
https://www.omg.org/spec/BPMN/2.0/PDF/changebar
[17] J. Mendling, L. Sanchez-Gonzalez, F. Garcia, M. La Rosa, Thresholds for error probability
measures of business process models, Journal of Systems and Software 85(5) (2012) 1188–1197.
doi:10.1016/j.jss.2012.01.017
[18] BPMN for research. URL: https://github.com/camunda/bpmn-for-research
[19] Natural Language Toolkit. URL: https://www.nltk.org/
[20] pandas.DataFrame.corr – pandas 1.5.0 documentation URL:
https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.corr.html</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hammer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Champy</surname>
          </string-name>
          ,
          <article-title>Reengineering the Corporation: A Manifesto for Business Revolution</article-title>
          , Zondervan,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>W. M. P. van der Aalst</surname>
          </string-name>
          ,
          <article-title>Business process management: a comprehensive survey</article-title>
          , in: International Scholarly Research Notices, volume
          <volume>2013</volume>
          , Hindawi,
          <year>2013</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>37</lpage>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2013</year>
          /507984
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Harmon</surname>
          </string-name>
          ,
          <article-title>The State of Business Process Management, in: The State of the BPM Market</article-title>
          , volume
          <year>2016</year>
          , BPTrends,
          <year>2016</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>50</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. La</given-names>
            <surname>Rosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Reijers</surname>
          </string-name>
          , Fundamentals of business process management, Springer, Heidelberg,
          <year>2013</year>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -33143-5
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <article-title>Managing structural and textual quality of business process models</article-title>
          ,
          <source>International Symposium on Data-Driven Process Discovery and Analysis</source>
          , Springer, Berlin, Heidelberg,
          <year>2012</year>
          , pp.
          <fpage>100</fpage>
          -
          <lpage>111</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -40919-
          <issue>6</issue>
          _
          <fpage>6</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Mendling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Reijers</surname>
          </string-name>
          , W. M. van der Aalst,
          <article-title>Seven process modeling guidelines (7PMG)</article-title>
          ,
          <source>Information and software technology 52(2)</source>
          (
          <year>2010</year>
          )
          <fpage>127</fpage>
          -
          <lpage>136</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.infsof.
          <year>2009</year>
          .
          <volume>08</volume>
          .004
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>H.G.</given-names>
            <surname>Ceballos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Flores-Solorio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Probabilistic</given-names>
            <surname>BPMN</surname>
          </string-name>
          <article-title>Normal Form to Model and Advise Human Activities</article-title>
          , in: International Workshop on Engineering Multi-Agent Systems, Springer, Cham,
          <year>2015</year>
          , pp.
          <fpage>51</fpage>
          -
          <lpage>69</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>319</fpage>
          -26184-
          <issue>3</issue>
          _
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>