=Paper= {{Paper |id=Vol-2542/MOD-KI4 |storemode=property |title=Conceptual Modelling and Artificial Intelligence: Overview and research challenges from the perspective of predictive business process management |pdfUrl=https://ceur-ws.org/Vol-2542/MOD-KI4.pdf |volume=Vol-2542 |authors=Peter Fettke |dblpUrl=https://dblp.org/rec/conf/modellierung/Fettke20 }} ==Conceptual Modelling and Artificial Intelligence: Overview and research challenges from the perspective of predictive business process management== https://ceur-ws.org/Vol-2542/MOD-KI4.pdf
           Joint Proceedings of Modellierung 2020 Short, Workshop and Tools & Demo Papers
                                                           Workshop on Models in AI 157

Conceptual Modelling and Artificial Intelligence
Overview and research challenges from the perspective of
predictive business process management

Peter Fettke1,2



Abstract. Currently, the visibility of Artificial Intelligence (AI) and society’s expectations of
AI are very high, particularly compared to other research topics, namely Modelling. However,
between Conceptual Modelling (short: Modelling) and AI exist many interesting and important
interrelationships. This position paper overviews possible applications of AI for Modelling and
Modelling for AI. After this general discussion, the field of predictive business process man-
agement is focused as a particular application case of AI and Modelling. Predictive process
management uses machine learning for predicting the future state of a running process instance.
The paper closes with some general remarks and research challenges.

Keywords: Artificial Intelligence, Modelling, Business Process Modelling, Deep Learning,
Explainability


1       Motivation

The field of Artificial Intelligence (AI) receives tremendous public visibility and ex-
pectations of the society regarding the transformational potential of AI are extremely
high. Although it is not the first time that AI receives so much attention in society, it
is safe to say that the field has made some important and remarkable progress, e.g.
machine translation, speech recognition, image classification, or playing board games
archives results and quality levels which were not foreseen a decade before.
    On the other hand, the field of Conceptual Modelling (short: Modelling) does not
receive similarly high attention from the general audience. Moreover, from the tre-
mendous success of using data for machine learning often the conclusion is drawn
that the explicitly, hand-crafted making of a model which represents a domain is not
necessary or useful during system development anymore. Such a negative conclusion
about the importance of Modelling is false and dangerous because it is well-known
that AI in general and machine learning in particular has important application pre-
requisites and severe limitations under particular application characteristics [1, 2].
    Hence, it is much more fruitful to explore and to elaborate the various and rich in-
tellectual interrelationships between AI and Modelling. At the moment, no clear un-
derstanding exists in how Modelling and AI fit together. Against this background, the
main objective of this position paper is to elaborate on interrelationships between AI


1 German Research Center for Artificial Intelligence (DFKI), Saarbrücken, Germany
2 Saarland University, Saarbrücken, Germany, peter.fettke@dfki.de



Copyright © 2020 for this paper by its authors.
Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
158   Fettke


and Modelling. As such, this short position paper does not aim to make a final state-
ment on this topic, but it stimulates further discourse.
   The paper unfolds as follows: After this introduction, Section 2 frames and posi-
tion the fields of AI and Modelling. General application potentials of AI and Model-
ling are overviewed by Section 3. Section 4 focusses on the case of predictive process
management. The paper closes with some remarks and research challenges.


2      Background

2.1    Artificial Intelligence
The field of AI has a long history and its original foundation is typically dated back to
the 1950s [3]. Since then, numerous research projects were undertaken and several
well-established subfields of AI emerged, e.g. knowledge representation, natural lan-
guage processing, automated planning, data mining, pattern recognition, machine
learning, robotics, or computer vision. Note, that for each subfield mentioned, well-
established textbooks are available. Furthermore, the progress of these subfields is
documented by well-established conference tracks, e.g. the renowned International
Conference on Artificial Intelligence (IJCAI). Several of these subfields are not well
integrated [4].
   Typically, three different approaches to AI can be distinguished:

 Narrow AI: Just a precisely defined task should be automated, e.g. playing chess,
  finding the shortest path between two cities, or steering a car. The accomplishment
  of the task typically involves some level of natural intelligence.
 General AI: The objective of general AI is to build a machine that has all the phys-
  ical and intellectual capabilities of a human person.
 Super AI: The objective of super AI is to build a machine that is much more intel-
  ligent than a human.

Note, the level of super AI is not yet reached when a machine is superior concerning
one particularly defined task. Such superiority of a machine is already achieved in
numerous tasks, e.g. machines are now better than humans in many board games.
Instead, super AI implies that the machine is in principle more intelligent than a hu-
man being. How super AI can exactly be defined and whether or when super AI can
be realized is not clear, but under discussion [5].
   One further important distinction between different approaches to AI is the distinc-
tion between symbolic and sub-symbolic techniques. Symbolic AI uses explicit sym-
bols to capture the domain knowledge; sub-symbolic AI applies ideas from interacting
components of large systems; knowledge is typically represented by artificial net-
works. Symbolic AI is often referenced as “Good Old-Fashioned Artificial Intelli-
gence”, short: GOFAI.
                                Conceptual Modelling and Artificial Intelligence 159

2.2    Conceptual Modelling
Modelling is typically understood as an interdisciplinary field that is used in many
different disciplines as a method or instrument to capture knowledge or to assist other
(research) actions [6]. One possible distinction of this heterogeneous research field is
the kind of understanding of what a model is and what approach is used to represent a
model. In the following just explicitly stated models are understood as models. From
that perspective a continuum of different modelling approaches can be distinguished,
ranging from completely informal to strict, formal modelling understanding:

 Informal modelling: Just natural text or graphical symbols are used to represent a
  model.
 Formal modelling: A formal modelling language has a precise syntax, clear seman-
  tics, and well-understood pragmatics. These aspects include notation, semantic
  domain, modelling procedure and others [7]. Typical examples are Petri Nets or
  State Charts.

Between the mentioned continuum many more approaches exist, e.g. Business Pro-
cess Modelling Notation, entity-relationship modelling, Unified Modeling Language,
and others. From the perspective of informatics, modelling is used in different sub-
domains, e.g. model-driven software development, theoretical analysis of organiza-
tions or hardware systems, specifying database systems, workflow specification, and
many other application domains.


3      Applications of Artificial Intelligence and Modelling

Models and modelling are used in several fields of AI (Modelling4AI, not exhaustive):

 Natural Language Processing: Models of language are used for natural language
  processing, e.g. syntax, semantics, and pragmatics of natural languages are repre-
  sented by models, e.g. [8].
 Automated Planning: In the domain of automated planning, a planning domain is
  represented by a model. In such a model relevant planning states and possible ac-
  tions for manipulating the planning states are specified, e.g. [9].
 Machine Learning: In machine learning, models are used to describe the experi-
  ences which are made by solving the learning tasks. For instance, a classifier au-
  tomatically learned is typically understood as a model which represents the charac-
  teristics of the learned classes, e.g. [10].
 Computer Vision: In computer vision models are used to describe graphical scener-
  ies, e.g. which graphical objects exist [11].
 Robotics: The possible behavior of a robot is specified by a model, e.g. [12].

AI can be used to solve different modelling problems (AI4Modelling, not exhaustive):

 Pattern mining in models: Mining typical modelling patterns can be done to identi-
  fy similar modelling components that can be reused, e.g. [13].
160   Fettke


 Finding matches between modelling constructs: Similar modelling constructs in
  different models can be automatically identified, e.g. [14].
 Modelling assistance: During the modelling process, the modeler can by assisted
  by syntactic, semantic, or pragmatic guidance, e.g. [15].
 Model-to-Text, Text-to-Model or Picture-to-Model: Natural texts / pictures can be
  transformed into models and vice versa, e.g. [16, 17].
 Automatic modelling and model correction: The construction of a model is auto-
  mated by automated planning, or models can be automatically corrected [18].


4      The case of predictive business process management

To elaborate more deeply on particular challenges, the case of predictive business
process management is presented. Business process monitoring is a phase of the busi-
ness process management life cycle [19]. Typical examples of business processes are
order-to-cash, purchase-to-pay, and complaint-to-resolution. Running process in-
stances, also known as cases, are monitored and managed during the process execu-
tion, also known as process run-time. Typically monitored parameters and process
characteristics are its current status, the executed process steps, the time taken to exe-
cute particular steps, or the throughput time (see Fig. 1). The objective of predictive
process management is to gain insights about the future of a case. Based on the cur-
rent case status, the future of the case is predicted. Typical questions are: What will be
the next action to be taken for this case? When will the next event occur? When will
this case terminate? Will the case be completed on time?




               Fig. 1. Input data and predictands of process prediction (source: [20])

The conventional approach for this scenario is to develop a theoretical model of the
process reality, comprised of the identified process steps and the possible state transi-
tions with their transition probabilities. For model building, the system’s boundaries
must be defined and the causal structure between possible process events and state
transitions must be identified from the given process reality and transformed into an
adequate model. However, building these models of organizations and their processes
is challenging, as the context and rules for process execution cannot be easily identi-
fied, or are too complex to be easily comprehensible.
   A different approach in this example is the use of deep learning, which is based on
data that represent prior observed process instances. These process instances are used
                                Conceptual Modelling and Artificial Intelligence 161

to train a deep artificial neural network (ANN). In an example application, historical
process traces generated by a workflow management system are used as the basis for
deep learning [21]. These process traces consist of process steps, process execution
times, organizational units responsible for the execution of different tasks, and other
information. The approach described by Evermann et al. [21] is but one example of
using AI in process modelling, other examples of deep learning in the business pro-
cess management domain are discussed by Di Francescomarino et al [22].
   Using deep learning has not only advantages but one important drawback, too. In
classical model building, the model can be intuitively and immediately grasped and
understood by humans as it is represented explicitly. In contrast, lacking an explicit
representation of theory, an ANN cannot be easily understood in terms of traditional
theory elements, such as constructs, causes, etc. This is unsatisfying from both a
pragmatic as well as a scientific perspective.
   Although the overall problem of interpreting an ANN remains unsolved, several
approaches have made significant progress in overcoming this drawback. Because
humans think in terms of features, we want to be able to reason back from the net-
work architecture description to a feature description and demonstrate that various
network components “encode” or recognize different features. For example, image
recognition research shows features that are encoded in convolutional filters. In the
context of predictive process management, we have identified hidden-state activations
for each process activity, process hallucinations (how an ANN represents a domain
without seeing real data), and other explanation techniques for explaining the predic-
tion results [21, 23].


5      Research challenges and outlook

The particular case of integrating AI and Modelling, namely, using deep learning for
predictive process management, exemplifies several challenges:

 Understanding of the term modelling and model: The idea of modelling is inten-
  sively used in the field of AI. However, the precise understanding of a model and
  usage of modelling is different. Hence, a comprehensive conceptual discussion of
  important characteristics and application possibilities of Modelling in AI would be
  necessary.
 Network architecture: Although the general idea of ANNs is quite simple, there
  exists a wide variety of different network architectures, e.g. recurrent neural net-
  work (RNN) and convolutional neural network (CNN). The network architecture
  determines the capabilities of integration of deep learning into process modelling.
 Data: The presented case for predictive business process management relies on
  data being available. Data is used for training the ANN, for testing, and for the val-
  idation of its performance.
 Representation of data: Data is not just “given” to the ANN in any form, it must be
  represented appropriately. Different theoretical approaches for such a representa-
  tion are known. Past research has shown that problem representation has a major
  influence on problem performance. In particular, the word2vec approach has led to
162   Fettke


  significant progress in the area of text processing. Similarly, different theoretical
  approaches for representing process instances are available, e.g. “process2vec”.
 Changing process behavior: Predictive business process management currently
  relies on historic process behavior. As such, the data does not reflect future process
  changes. The detection and prediction of such process drifts would be useful.
 Hybrid Modelling: In AI it is common to combine symbolic and sub-symbolic
  approaches. Such idea of hybrid modelling is more or less unknown in the domain
  of predictive business process management. However, it would be very useful, if a
  priori knowledge of process behaviour can be encoded in an ANN before training.
 Training algorithm: While the backpropagation training algorithm has been used
  since the 1970s, important theoretical advances and pragmatic improvements have
  been made in the last two decades, leading to novel variants of backpropagation.
 Trained ANN: The ANN is trained using training data. Training adjusts the con-
  nections between the thousands or millions of artificial neurons. This demonstrates
  that the trained ANN is an important theoretical building block. After testing and
  validating the network, it can be used for transfer learning and feedback-learning.
  A trained ANN is not specific to its training data: It is able to answer questions or
  make predictions about cases not contained in its training data set.
 Transfer learning: Known approaches to predictive business process management
  start learning from scratch. It would increase the productivity if it would be possi-
  ble to use pretrained ANN for different process types.
 Explainability: Although some first approaches to explain process predictions are
  known, more research on this topic is needed.
 Particular machine learning challenges: The evaluation shows promising results.
  Nevertheless, particular machine learning challenges occur, e.g. overfitting, ro-
  bustness against new training data, the influence of data manipulation, insufficient
  distinction learning, or the integration of predefined process models with machine
  learning approaches.

   To sum up: The particular case of predictive business process management demon-
strates interesting interrelationships between traditional business process modelling
and machine learning. Although promising results are achieved, challenging further
integration possibilities are open. This particular case demonstrates just one particular
example, how Modelling and AI can be used together. As sketched in this paper,
many more interesting challenges for the integration of AI and Modelling exists
which have to be explored more deeply in the future.


References
 1. Brynjolfsson, E., Mitchel, T.: What can machine learning do? Workforce implications.
    Science 358, 1530-1534 (2017)
 2. Ng, A.: What Artificial Intelligence Can and Can’t Do Right Now. Harvard Business
    Review Digital Articles, pp. 2-4. Havard Business Review (2016)
 3. Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach. Prentice Hall Press
    (2009)
                                   Conceptual Modelling and Artificial Intelligence 163

 4. Geffner, H.: Model-free, Model-based, and General Intelligence. In: Twenty-Seventh
    International Joint Conference on Artificial Intelligence, pp. 10-17. (2018)
 5. Bostrom, N.: Superintelligence: Paths, Dangers, Strategies. Oxford University Press
    (2016)
 6. Embley, D.W., Thalheim, B. (eds.): Handbook of Conceptual Modeling Theory, Practice,
    and Research Challenges. Springer, Berlin (2011)
 7. Bork, D., Fill, H.-G.: Formal Aspects of Enterprise Modeling Methods: A Comparison
    Framework (2014)
 8. Indurkhya, N., Damerau, F.J. (eds.): Handbook of Natural Language Processing. Chapman
    & Hall, Boca Raton, FL (USA) (2010)
 9. Ghallab, M., Nau, D., Traverso, P.: Automated Planning: Theory and Practice. Morgan
    Kaufmann (2004)
10. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer Science + Business
    Media, New York (2009)
11. Davies, E.R.: Computer Vision, Fifth Edition: Principles, Algorithms, Applications,
    Learning. Academic Press, Inc. (2017)
12. Siciliano, B., Khatib, O. (eds.): Springer Handbook of Robotics. Springer (2008)
13. Hake, P., Fettke, P., Loos, P.: Automatic Pattern Mining in Repositories of Graph-based
    Process Models. In: Nissen, V., Stelzer, D., Straßburger, S., Fischer, D. (eds.)
    Multikonferenz Wirtschaftsinformatik 2016 (MKWI 2016), pp. 1143-1154.
    Universitätsverlag Ilmenau, Ilmenau, Germany (2016)
14. Antunes, G., Bakhshandeh, M., Borbinha, J., Cardoso, J., Dadashnia, S., Di
    Francescomarino, C., Dragoni, M., Fettke, P., Gal, A., Ghidini, C., Hake, P., Khiat, A.,
    Klinkmüller, C., Kuss, E., Leopold, H., Loos, P., Meilicke, C., Niesen, T., Pesquita, C.,
    Péus, T., Schoknecht, A., Sheetrit, E., Sonntag, A., Stuckenschmidt, H., Thaler, T., Weber,
    I., Weidlich, M.: The Process Model Matching Contest 2015. In: Kolb, J., Leopold, H.,
    Mendling, J. (eds.) Enterprise Modelling and Information Systems Architectures, pp. 127-
    155. Gesellschaft für Informatik, Bonn (2015)
15. Morana, S., Schacht, S., Mädche, A.: Exploring the Design, Use, and Outcomes of Process
    Guidance Systems: A Qualitative Field Study. In: Parsons, J., Tuunanen, T., Venable, J.,
    Donnellan, B., Helfert, M., Kenneally, J. (eds.) Tackling Society's Grand Challenges with
    Design Science: 11th International Conference, DESRIST 2016, St. John’s, NL, Canada,
    May 23-25, 2016. Springer (2016)
16. Riefer, M., Ternis, S.F., Thaler, T.: Mining Process Models from Natural Language Text:
    A State-of-the-Art Analysis. In: Nissen, V., Stelzer, D., Straßburger, S., Fischer, D. (eds.)
    Multikonferenz Wirtschaftsinformatik (MKWI) 2016. Technische Universität Ilmenau, 09.
    - 11. März 2016, Research-in-Progress- und Poster-Beiträge, pp. 1-12. Universitätsverlag
    Ilmenau, Ilmenau, Germany (2016)
17. Zapp, M., Fettke, P., Loos, P.: Towards a Software Prototype Supporting Automatic
    Recognition of Sketched Business Process Models. In: Leimeister, J.M., Brenner, W.
    (eds.) Proceedings der 13. Internationalen Tagung Wirtschaftsinformatik (WI 2017), St.
    Gallen, pp. 1283-1286 (2017)
18. Heinrich, B., Klier, M., Zimmermann, S.: Automated Planning of Process Models: Design
    of a Novel Approach to Construct Exclusive Choices. Decision Support Systems 78, 1-14
    (2015)
19. Dumas, M., La Rosa, M., Mendling, J., Reijers, H.A.: Fundamentals of Business Process
    Management. Springer, Berlin (2018)
164   Fettke


20. Rehse, J.-R., Dadashnia, S., Fettke, P.: Business process management for Industry 4.0 –
    Three application cases in the DFKI-Smart-Lego-Factory. it – Information Technology 60,
    133-141 (2018)
21. Evermann, J., Rehse, J.-R., Fettke, P.: Predicting process behaviour using deep learning.
    Decision Support Systems 100, 129-140 (2017)
22. Di Francescomarino, C., Ghidini, C., Maggi, F.M., Milani, F.: Predictive Process
    Monitoring Methods: Which One Suits Me Best? In: Weske, M., Montali, M., Weber, I.,
    vom Brocke, J. (eds.) BPM 2018, LNCS 11080, pp. 462-479. Springer Nature, Cham
    (2018)
23. Rehse, J.-R., Mehdiyev, N., Fettke, P.: Towards Explainable Process Predictions for
    Industry 4.0 in the DFKI-Smart-Lego-Factory. KI - Künstliche Intelligenz 33, 181-187
    (2019)