=Paper=
{{Paper
|id=Vol-2600/paper2
|storemode=property
|title=Towards AI-based Solutions in the System Development Lifecycle
|pdfUrl=https://ceur-ws.org/Vol-2600/paper2.pdf
|volume=Vol-2600
|authors=Stephan Jüngling,Martin Peraic,Andreas Martin
|dblpUrl=https://dblp.org/rec/conf/aaaiss/JunglingPM20
}}
==Towards AI-based Solutions in the System Development Lifecycle==
Towards AI-based Solutions in the System Development Lifecycle
Stephan Jüngling, Martin Peraic, Andreas Martin
FHNW University of Applied Sciences and Arts Northwestern Switzerland, School of Business
Peter Merian-Strasse 86, 4052 Basel, Switzerland
stephan.juengling@fhnw.ch, martin.peraic@bvb.ch, andreas.martin@fhnw.ch
Abstract trained models can be re-used as modules, and with the help
Many teams across different industries and organizations of transfer learning be adapted to a variety of similar
explicitly apply agile methodologies such as Scrum in their applications, AI components will slowly be deployed in
system development lifecycle (SDLC). The choice of the many more business cases. In the case of semiconductor
technology stack, the programming language, or the decision
manufacturing, it was demonstrated that a portable image
whether AI solutions could be incorporated into the system
design either is given by corporate guidelines or is chosen by classifier could be embedded in offline edge devices to
the project team based on their individual skill set. The paper detect defects on laser chips with an accuracy of 97% (Hou,
describes the business case of implementing an AI-based Liu, Pan, & Hou, 2019). However, what would be the impact
automatic passenger counting system for public on the software development process within companies,
transportation, shows preliminary results of the prototype
given the fact that AI will be increasingly used as part of
using anonymous passenger recognition on the edge with the
help of Google Coral devices. It shows how different their IT systems, products, and solutions?
solutions could be integrated with the help of rule base
systems and how AI-based solutions could be established in
the SDLC as valid and cost-saving alternatives to Application Domains for AI-Based solutions
traditionally programmed software components.
Smith and Eckroth (2017) provide a comprehensive insight
into lessons learned from building AI applications during
Introduction the last three decades. In one of their key insights, they
mention that the ease of use delivered by the human
AI in general and deep learning, in particular, are amongst
interface is the “license to operate”. This statement, which
the current hot topics in computer science research, and
has been made focusing on the client and user perspective,
many universities create new bachelor and master programs
most probably will hold true during the entire software
in data science. The hype is also visualized by the Gartner
development lifecycle (SDLC). The “new” discipline of
hype cycle for AI (2019), where edge AI, deep neuronal
data scientists needs to be incorporated into the SDLC.
networks, and machine learning are at the peak of inflected
Traditional skills of people that are involved in software
expectations. Nevertheless, AI technologies are already
design need to be extended with AI topics such as machine
used and well established in a wide area of different business
learning and knowledge engineering.
domains. However, most of the time, these are isolated and
In many practical situations, the choice of the best suitable
specialized applications, where they are clearly the sole
system development methodology is actively discussed and
possible IT solution, e.g., machine translation or image
explicitly decided at the project start. Arguments for
processing for cancer detection. In such cases, ML
different methodologies such as Scrum, Kanban, SAFE, or
capabilities are mostly predominantly compared to human
even traditional water methods are sought, and different
skills. There are not yet many situations where AI-based
suitable tools are evaluated and selected. However, all these
components are compared to traditional software or
methods still take the predominant traditional design and
hardware components. But with the current state of the art
implementation processes of software components into
neuronal network models and new emerging infrastructure
account. Also, with agile methods, some sort of
around TensorFlow (Tensorflow Hub, n.d.), where pre-
Copyright © 2020 held by the author(s). In A. Martin, K. Hinkelmann, H.- Stanford University, Palo Alto, California, USA, March 23-25, 2020. Use
G. Fill, A. Gerber, D. Lenat, R. Stolle, F. van Harmelen (Eds.), Pro- permitted under Creative Commons License Attribution 4.0 International
ceedings of the AAAI 2020 Spring Symposium on Combining Machine (CC BY 4.0).
Learning and Knowledge Engineering in Practice (AAAI-MAKE 2020).
requirements engineering, be it in the form of use cases or As such, existing systems do not provide real-time data, and
user stories, is conducted. Maybe it is followed by test- the measurements must first be collected and processed for
driven feature implementation using continuous integration subsequent use. In addition to the high acquisition costs, this
as a core idea, and customers can, at any time, verify the also results in ongoing costs for storing and processing the
current implementation state of the application and provide recorded data, which is costly, inflexible, and no longer
immediate feedback. Given the availability of state-of-the- appropriate. New methods for passenger counting must be
art AI-based video analysis algorithms and edge computing studied. In particular, it would be a good idea if existing
capabilities, the idea of building an Al-based passenger systems in the vehicles, such as the cameras installed for
counting system for public transit was reasonable and could security purposes, could be re-used. It is therefore suitable
be conducted during a bachelor thesis in cooperation with to apply the latest achievements in the field of AI-based
Basler Verkehrs-Betriebe, a leading public transport object recognition. Such an APC method could re-use
provider in the area of Basel (Peraic, 2019). existing systems, collect data in real-time, and directly
process the data without further manual operations. In order
to demonstrate the feasibility of an AI-based counting
Business Case of an AI-based Passenger system and to describe first benefits and limitations, the
Counting System for Public Transit following research questions have to be answered:
With almost 29000 kilometers, Switzerland has one of the • RQ1: Is AI-based object detection competitive against
densest public transport networks in Europe (VÖV, 2017). traditional infrared-based measurement systems under
realistic conditions?
This public service is offered by numerous private transport
companies throughout Switzerland. In order to make it • RQ2: Is it possible to perform offline object recognition
easier for customers to use these services, a variety of on edge devices from a legal perspective?
regional and national fare associations exist in different • RQ3: Can existing vehicle systems, such as surveillance
geographical areas. They ensure that public transport cameras, be re-used?
services can be accessed with a single subscription across • RQ4: What are the benefits of an AI-based automated
Switzerland. Consequently, all public transport operators passenger counting system?
are obliged to record passenger data of onboarding and
alighting passengers. Based on these figures, the percentage
Implemented Prototype
of the subscription income of the respective fare association
is calculated for the participating companies. The Basler Verkehrs-Betriebe strives to continuously
Automatic Passenger Counting Systems (APC), improve its core business, the transport of passengers, in
especially developed for public transport, are applied to order to offer the best possible service to its 350,000 daily
collect such data. Since the early 1970s, various customers. More and more processes are being enhanced or
manufacturers have been offering solutions for passenger even replaced by modern software solutions. The need for
counting (Siebert & Ellenberger, 2019), and many different tailor-made and in-house developed applications is
approaches and measuring methods are applied. Some increasing and consequently the demand for optimized
operators rely on the measurement of boarding and alighting conditions for the development of software. Currently, this
motion with the help of light barrier sensors in the infrared new corporate focus is taking place, thus no company-wide
spectrum, and some systems measure weight changes in the SDLC for development projects could be applied for the
boarding area based on associated spring movements of the prototype. This led to challenges before and during the
vehicle suspension. All of these “classic” methods are project. Development and test environments first had to be
complex systems that must be installed at every door of the created from scratch. Alongside development, the Hermes
vehicle. This requires additional hardware components, project methodology practiced by public companies in
which are expensive to purchase or to upgrade. This poses Switzerland could not be applied, as this methodology
financial problems, especially for smaller public transport follows little to no agile approaches. However, an agile
operators. Such companies use manual counting methods approach was crucial for the development of an AI-based
instead, which are frequently based on insufficient customer APC. Without a flexible delimitation of the objectives, the
surveys regarding driving behavior and do not provide continuous testing of the prototype versions under real
reliable figures (Siebert & Ellenberger, 2019). conditions and the subsequent optimizations, a runnable
Previous classic APC models are rarely deployed fleet- prototype could not have been implemented in such a short
wide due to high costs. Instead, the systems are distributed timeframe.
across all existing routes and randomly measure the number The AI-based APC was realized with Google Coral devices
of passengers, which are later extrapolated to estimate the (Google Coral, n.d.), which are optimized for TensorFlow
total number of passengers in the entire transport network. Lite (TFL, n.d.) AI models such as the mobile Single Shot
Detector (SSD). The TFL mobile SSD object recognition is source for the AI-based prototype. Subsequently, the
supported by the correlation tracking provided by DLIB measurements could be compared and evaluated using the
(Rosebrock, n.d.). With this combination, the SSD model timestamps of both measurement systems. The AI-based
recognizes all objects (position coordinates) and their object prototype correctly determined all passengers in 54% of the
class (human, bicycle, dog, etc.) in predefined frame cases. The traditional infrared APC solution achieved a
intervals. Subsequently, position and class type are stored result of 72% correct measurements.
temporarily and are further tracked during the consecutive
frames by the less computationally intensive correlation
tracker. Thereby the Coral Edge device can be relieved by
the reduced utilization of the object recognition model to
keep resources free for additional operations. That capacity
is used for the ID-based tracking of captured and stored
objects. For this purpose, Centroid Tracking is being used.
A primitive yet efficient tracking algorithm. This feature
provides the ability to observe objects across multiple
frames and measure motion into or out of the vehicle.
The setup is specially designed for offline operations to Figure 2 - Arithmetic Mean of Success in Relation to People
enable onboard computation. It allows bypassing possible involved (AI System)
future data protection obstacles of object recognition in
public space by avoiding the transmission of sensitive data RQ1: First results measured under real conditions
to a remotely located datacenter. demonstrate convincingly that with a small number of
passengers, the AI-based APC can compete with the
infrared system in terms of accuracy. Figure 2 reveals how
only a few counting errors could be observed during
experiments with one or two passengers. As complexity
increases, so does the error rate. The object tracking mostly
causes this increase. With a growing number of passengers
or increasing complexity of movements, the basic Centroid
Tracker can hardly distinguish between objects. They are
either swapped or mistakenly re-detected, causing erroneous
measurements.
Conversely, the infrared system proved to be acceptable for
multiple passengers, as shown in Figure 3.
Figure 1 - Displayed Detection and Tracking Information
After the development of the prototype, a first comparison
of the counting accuracy between the already deployed
APC, the infrared measurement, and the newly developed
AI-based system was conducted. In this respect, real test
scenarios from day-to-day life were executed under
laboratory conditions. The number of passengers and their
measurement complexity was continuously increased.
Extreme cases, such as simultaneous and dense boarding of
a larger crowd, were considered. These test scenarios were Figure 3 - Arithmetic Mean of Success in Relation to People
created by observations from past experiments and real-life involved (IR System)
situations. This combination of knowledge engineering for
the optimization of the measurement scenarios proved to be Further erroneous measurements are attributed to the AI-
a great advantage for the development of realistic and based object recognition. In realistic circumstances, object
meaningful test cases. recognition can be misled by background noise or
In order to obtain reliable results and to exclude any disturbance of passing objects. Since for performance
phenomena, each test case was performed three times. All reasons the AI-based recognition is not applied at every
measurements were recorded with a webcam directed at the frame, objects can pass the image without being detected.
door of the vehicle. This video feed was as well as the data Possibilities for improving the AI-based APC have already
been explored. Improved tracking algorithms like Deep performed passenger counting. Significantly fewer
SORT (Wojke, Bewley, & Paulus, 2017) or the utilization hardware components must be installed for the AI-based
of recent RGB-video or Depth-video approaches could APC. This results in further cost savings in the procurement
provide the required performance improvement. In and maintenance of the system. Ultimately, edge devices
particular, the use of depth sensors, as applied by Sun et al. can be continuously updated with new and enhanced AI
(2019), theoretically outperform infrared sensors regarding models. Other applications such as the automatic detection
accuracy. However, when facing problems with ML of dangers in the vehicle such as violence or emergencies
methods, this could be the point to improve the results by are just a few examples that support a switch to this new
combination with rule-based systems as described in the technology.
section about combining machine learning with knowledge
engineering.
SDLC with Embedded AI-Based Solutions
RQ2: The legal use of high-resolution video cameras as a Given the assumption of continuous improvements in the
sensor has already been discussed for autonomous driving. area of object detection with low-cost edge devices, the
In particular, the storage of video data was defined as a initial prototype could be improved within the next couple
breach of data protection (Kunnert, 2017) if the image data of months, but there will be additional issues, which must be
is stored on the device or an external database (e.g., cloud considered for the organization.
storage). The AI-based prototype is therefore designed to According to a recent survey in Japan (Ishikawa &
process the image data in real-time – storage-free. Yoshioka, 2019) software engineering professionals report
Anonymous counting results could subsequently be sent to difficulties in system engineering when ML-based
service providers for further processing. Such an components are incorporated in the engineering process.
architecture requires further security considerations. It is Many of the existing principles and best practices need to be
therefore essential to prevent third parties from gaining enlarged with additional domain knowledge on how
access to the edge devices. These could collect sensitive machine learning and knowledge engineering can be
image data or manipulate counting results, which would incorporated into the software development lifecycle of
cause manipulation of the entire service (Zhang, Chen, companies.
Zhao, Cheng, & Hu, 2018). Worth mentioning is also the performance difference when
RQ3: Existing security cameras can theoretically be SDLC methodologies known from software development
leveraged for AI-based passenger counting. Both the are applied in data science projects, where little standardized
resolution and the quality of the images are sufficient for process methodologies exist. By comparing the efficiency
object recognition. Only the position and the associated of different data science student teams working with
viewing angle of the cameras are decisive for re-use. New different SDLC methodologies, Saltz, Shamshurin, &
vehicle procurements, though, could take this into account Crowston (2017) reported an improved performance and
without additional efforts and costs. For already procured efficiency of CRISP-DM and agile Kanban, while agile
vehicles, the camera would have to be repositioned in the Scrum was even less efficient than using no methodology at
event of misplacement. all.
RQ4: The edge device architecture offers the ability to Further differences need to be considered when existing
transmit real-time count data. Furthermore, the data could code or models will be re-used. While object-oriented
be automatically merged and analyzed, allowing real-time programming existing functionality can be extended using
measurement of passenger flows using the fleet-wide well-known inheritance mechanisms, new concepts such as
deployment of the AI-based APCs. This information transfer learning need to be researched in more depth-first.
provides the control center or emergency services, with Adding additional functionality would not only mean
valuable information in a variety of situations. For example, adding additional code but also retraining some of the
in the case of a traffic accident, the number of people in the models and redeploy it to the edge devices.
vehicle can be evaluated immediately and required Besides the additional skills required for the design and
emergency resources can be notified accordingly. On the implementation of hybrid solutions, characteristics of AI-
other hand, conventional APC systems are usually entirely based solutions will most probably also have an impact on
isolated and require complex and sometimes labor-intensive the skill sets in the field of requirements engineering as well
processes to evaluate the data. Sensor data must be as testing. As stated by Jüngling & Hofer (2019), AI
converted into human-readable datasets and merged by components could even become active parts of business
skilled professionals. By eliminating these time-consuming scenarios and represented as actors in UML use case
and resource-intensive tasks, major financial savings can be diagrams.
achieved. Re-using already installed hardware such as
surveillance cameras is an additional advantage of AI-
Overall, companies that have internal software development bases about the most successful transfer learning
skills will need either additional staff with appropriate modifications of re-usable pre-trained modules. Such
experience in implementing AI technologies or educate knowledge bases could be established not only for the given
internal employees and enable them to gain experience in case of passenger counting. Such a concept could generally
order to establish new best practices for SDLC with hybrid be applied to different business application scenarios.
solutions. With more experience from practical situations,
all different disciplines, from requirements engineering to
testing and deployment will have to develop new insights Conclusion and Outlook
into how these two different kinds of systems engineering Although the given potential of AI in general and image
can be combined. It could well be, that it is best to split up recognition and object tracking in particular, practical AI-
the entire development into two parallel or serial sub- based business applications are still rare. Openness for new
projects, one for the development of the AI components, the solutions and gaining practical experience within companies
other for the traditional software components, where the will be key. Given the different nature of both ML and KE
SDLC of both components can be decoupled, having solutions compared to traditional SDLC, it seems important
different life cycles, independent regression tests, automated for companies to develop and deploy prototypes of AI-based
build pipelines and integration-tests in the end. solutions into production. It will be necessary to manage
Alternatively, both disciplines could be tighter integrated, these components over the entire lifecycle and optimize the
such as we have seen from certain database engineering current SDLC accordingly. It will be important to gain
frameworks, where the persistence layer is automatically experience with low-risk type of applications such as a
generated based on the given design of the business layer. passenger counting system first, before thinking about more
advanced application scenarios such as self-driving trams or
trains, which, compared to self-driving cars seem to be
Combining Machine Learning with Knowledge
feasible earlier due to a lesser degree of freedom.
Engineering
Since different ML-based AI components are used as
References
sensors on edge devices, rule-based systems could be handy
to orchestrate their results. Given that multiple ML-based Gartner Hype Cycle for AI, 2019. (n.d.). Retrieved from
sensors are installed, their results could be combined with https://www.gartner.com/smarterwithgartner/top-trends-on-the-
gartner-hype-cycle-for-artificial-intelligence-2019/
results from traditional sensors. Rule-based systems could
Google Coral. (n.d.). Retrieved November 12, 2019, from
be used to aggregate multiple inputs and decide about the
https://coral.withgoogle.com/
most suitable estimates of a number of passengers. Similar
Hou, D., Liu, T., Pan, Y. T., & Hou, J. (2019). AI on edge device
to Grangström, Baum, & Reuter (2017), where they compile for laser chip defect detection. 2019 IEEE 9th Annual Computing
and overview of different approaches combining various and Communication Workshop and Conference, CCWC 2019,
sensor types for object tracking, or improvements reported 247–251. https://doi.org/10.1109/CCWC.2019.8666503
by Tian et. al (Tian, Luo, Wang, & Tang, 2015), where they Ishikawa, F., & Yoshioka, N. (2019). How Do Engineers Perceive
improved pedestrian detection by adding semantics based Difficulties in Engineering of Machine-Learning Systems? -
on additional pedestrians and scene attributes, sensors Questionnaire Survey. 2–9. https://doi.org/10.1109/cesser-
ip.2019.00009
within the tram could be combined with sensors outside to
determine the most appropriate result. Jüngling, S., & Hofer, A. (2019). Leverage White-Collar Workers
with AI. Proceedings of the AAAI 2019 Spring Symposium on
Considering the management of existing models that are Combining Machine Learning with Knowledge Engineering
trained for particular purposes, knowledge bases (AAAI-MAKE 2019). Retrieved from http://ceur-ws.org/Vol-
incorporating an ontology could be established describing 2350/paper2.pdf
the context in which the models can be applied. Facts could Kunnert, G. (2017). “Autonomes Fahren” aus
be collected from the results of different AI-based modules datenschutzrechtlicher Sicht (I. Eisenberger, K. Lachmayer, & G.
in combination with the run-time-environment they are Eisenberger, Eds.). MANZ’sche Verlags- und
Universitätsbuchhandlung.
deployed, and rules can be learned, in which cases, which
modules work best. Typical situations of runtime Peraic,M. (2019). “Feasibility Study: Al-based Passenger
Counting System for Public Transit” Assistant. Bachelor
configurations could be tracked, and the models could even
thesis. Bachelor of Science in Business Information Technology,
be updated or deployed to the edge devices based on
School of Business, University of Applied Sciences and Arts
decisions from a knowledge-based system. Northwestern Switzerland, Basel, Switzerland
Also, for the general management of the SDLC and the Rosebrock, A. (n.d.). pyimagesearch. Retrieved November 14,
design-time of ML-based components, best practice rules 2019, from https://www.pyimagesearch.com/2018/07/23/simple-
could be established and stored in appropriate knowledge object-tracking-with-opencv/
Saltz, J., Shamshurin, I., & Crowston, K. (2017). Comparing Data
Science Project Management Methodologies via a Controlled
Experiment. Proceedings of the 50th Hawaii International
Conference on System Sciences (2017), 1013–1022.
https://doi.org/10.24251/hicss.2017.120
Siebert, M., & Ellenberger, D. (2019). Validation of automatic
passenger counting: introducing the t-test-induced equivalence
test. Transportation, (0123456789).
https://doi.org/10.1007/s11116-019-09991-9
Smith, R. G., & Eckroth, J. (2017). Building AI applications:
Yesterday, today, and tomorrow. AI Magazine, 38(1), 6–22.
https://doi.org/10.1609/aimag.v38i1.2709
Sun, S., Akhtar, N., Song, H., Zhang, C., Li, J., & Mian, A. (2019).
Benchmark Data and Method for Real-Time People Counting in
Cluttered Scenes Using Depth Sensors. IEEE Transactions on
Intelligent Transportation Systems, 20(10), 3599–3612.
https://doi.org/10.1109/tits.2019.2911128
TensorFlow Hub. (n.d.). Retrieved from
https://www.tensorflow.org/hub
TensorFlow Lite. (n.d.). Retrieved from
https://www.tensorflow.org/lite
Tian, Y., Luo, P., Wang, X., & Tang, X. (2015). Pedestrian
detection aided by deep learning semantic tasks. Proceedings of the
IEEE Computer Society Conference on Computer Vision and
Pattern Recognition, 07-12-June-2015, 5079–5087.
https://doi.org/10.1109/CVPR.2015.7299143
VÖV. (2017). Facts & Figures Swiss Public Transport 2016/2017
(Vol. 41).
Wojke, N., Bewley, A., & Paulus, D. (2018). Simple online and
realtime tracking with a deep association metric. Proceedings -
International Conference on Image Processing, ICIP, 2017-
September, 3645–3649.
https://doi.org/10.1109/ICIP.2017.8296962
Zhang, J., Chen, B., Zhao, Y., Cheng, X., & Hu, F. (2018). Data
Security and Privacy-Preserving in Edge Computing Paradigm:
Survey and Open Issues. IEEE Access, 6(Idc), 18209–18237.
https://doi.org/10.1109/ACCESS.2018.2820162