<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Characteristics⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Till Carlo Schelhorn</string-name>
          <email>till.schelhorn@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jonas Gunklach</string-name>
          <email>jonas.gunklach@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexander Maedche</string-name>
          <email>alexander.maedche@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Quality Management, Statistical Process Control, Control Charts, Machine Learning</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Human-centered Systems Lab (h-lab), Karlsruhe Institute of Technology (KIT)</institution>
          ,
          <addr-line>Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>M. Leyer</institution>
          ,
          <addr-line>Wichmann, J. (Eds.): Proceedings of the LWDA 2023 Workshops: BIA, DB, IR, KDML and WM. Marburg</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <fpage>3</fpage>
      <lpage>16</lpage>
      <abstract>
        <p>Quality management plays a vital role in manufacturing organizations to ensure efective and eficient production processes. To achieve this, organizations implement various data-driven techniques and tools to monitor and manage the quality of their production processes. One essential tool is the control chart, which tracks the performance of a specific quality characteristic over time by taking samples. However, manual sample-taking for a large number of quality characteristics can be time-consuming and costly. To address this challenge, organizations seek to enhance the eficiency of the sample-taking process while accurately detecting production process performance. Recently, machine learning (ML) models have been proposed to predict various quality characteristics, thereby reducing the need for manual measurements. However, existing control chart system designs have been found to be inadequate for integrating ML-predicted quality characteristics. To address this gap, this research aims to design an analytical control chart system with quality characteristics predicted by ML models. Our technical evaluation indicates significant improvements in the eficiency of the quality management process while feedback from a focus group demonstrates the efectiveness of our proposed solution.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>CEUR
Workshop
Proceedings</p>
      <p>
        Research in the field of control charts has a long history. For example, Reynolds et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
proposed an adaptive control chart with variable sampling intervals based on the current state
of the process. Alwan and Roberts [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] relied on the correlation of features to increase sampling
intervals. With the proliferation of machine learning (ML), recent literature proposed leveraging
these techniques to extend existing control chart designs. For instance, Zan et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] rely on
convolutional neural networks for automated recognition of unnatural patterns, Ferrer [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]
uses principal component analysis to reduce the dimensions of data in multivariate production
settings, and Tong et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] make use of clustering algorithms for adaptive control charts.
However, these approaches do not improve the eficiency of manual sample-taking. Hryniewicz
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] aimed to solve this by replacing manual samples with predicted quality characteristics,
but found that the current design of control charts is not suited for this approach. A central
challenge relates to the performance of the underlying ML models and incorporating the
inherent inaccuracies in these models’ predictions into the control chart design. We therefore
aim to investigate the development of a control chart system that relies on ML-predicted quality
characteristics to reduce the necessity for manual sampling. To solve this, we articulate the
following research question:
      </p>
      <p>RQ: How to design an analytical control chart system based on predicted quality characteristics?
To answer this question, we conducted expert interviews with employees of a leading German
manufacturing company to derive design principles for our system. Based on these principles,
we implemented an analytical control chart system prototype. We used the quality data from
an exemplary manufacturing process to design the control charts for the respective quality
characteristics. Then we identified the characteristics that could be replaced by ML predictions.
Finally, we evaluated (1) the ML model by comparing control charts based on ML-predicted
quality characteristics with standard control charts and (2) the analytical control chart system
using a focus group with manufacturing employees.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Foundations</title>
      <p>
        SPC is a collection of methods and techniques based on statistics to establish process control and
on this basis continuously improve process performance [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. A process is out of control if the
variability present in a process is produced by an assignable cause and not by natural variation
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Identifying the out-of-control process as quickly as possible is one of the major objectives
of SPC [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Control charts are the main tools for detecting out-of-control processes. Over time
samples are taken from the running process and all relevant quality characteristics are measured.
These characteristics are all features of the manufactured product relevant to its quality and
functioning. To detect if a process is out of control, the location (mean) and variability (range
or variance) of the samples are tracked. For each quality characteristic, two control limits are
defined: the upper control limit and the lower control limit setting the borders of the in-control
process. A value exceeding one of the control limits signals the process being out of control
and requires countermeasures to be taken [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Important for a well-functioning control chart
are meaningful control limits. Therefore a control chart is divided into two phases [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Phase I
consists of a retrospective analysis of past process data and setting control limits to determine
if and when the process was in control. In Phase II these control limits are used to monitor
the process [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. It normally requires an iterative process of setting control limits, revising
and adjusting them until appropriate control limits are found [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Once process control is
established various process capability indices (Cpk) are used to measure the actual performance
of the in-control process and its capability to produce parts inside given specification limits [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Figure 1 illustrates the underlying components of Cpk: process width and design width.
      </p>
      <p>
        The design width establishes the range within which the specific quality characteristic
must fall to meet the criteria for satisfactory quality. It is restricted by the upper and lower
specification limits. The process width defines the actual performance of the process. With the
probability distribution of quality characteristics derived from historical data, the process width
is determined by the 3-sigma (3 ) limits of this distribution, representing the range within which
99.7% of produced parts fall (for simplification we chose to display a normal distribution). The
Cpk is determined by the ratio between the process width and the design width, reflecting
the process’s capacity to meet specified quality criteria. The greater the distance between the
3 limits and the specification limits, the higher the Cpk value, indicating superior process
performance. In practice, a Cpk higher than 1.33, indicating the process produces approximately
96 defective parts per million, is assumed to be suficient [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Design Principles</title>
      <p>
        To derive design principles for our system we conducted a series of semi-structured expert
interviews with process experts responsible for the production lines and quality management
experts [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. We started with two expert interviews and identified the other participants using
snowball sampling [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In total, we interviewed 15 people, twelve process experts, and three
from the domain of quality management. The interviews took between 25 and 60 minutes. Each
interview was recorded and later transcribed. Based on the transcription we identified several
key issues relevant to our design process.
      </p>
      <p>To track the relevant quality characteristics like length, diameter, or the surface finish of the
products samples are taken in regular intervals, so-called interval inspections. According to
the experts, these are required for assessing the current state of the process. However, they
represent a time-consuming and therefore costly activity. More eficient planning with fewer
inspections or a smaller sample size was requested by the experts. On the other hand, these
interval inspections are necessary to ensure process control by identifying outliers. The experts
feared that reducing the number of interval inspections may lead to a loss of control over the
process. They conclude that a decent trade-of between an increased eficiency of the interval
inspections versus a decrease in the efectiveness of control charts is necessary to guarantee
suficient process control. We formulate our first design principle:</p>
      <p>DP1 (Eficiency vs. Efectiveness trade-of): The system should increase the eficiency of
interval inspections, however, needs to ensure suficient process control</p>
      <p>According to the experts, they often feel uncomfortable adjusting intervals or sample sizes
of interval inspections because they lack objective metrics for evaluating possible risks. They
stated that only the process capability index is used to determine the current state of the process.
The experts conclude that metrics for assessing the potential risks of not detecting an
out-ofcontrol process would be useful when adjusting interval inspections. Therefore, we formulate
our second design principle:</p>
      <p>DP2 (Risk evaluation): The system should provide metrics for risk assessment
The experts stated that most decisions regarding the processes are made subjectively based on
their know-how and process knowledge. This expert knowledge includes past experiences with
the process, quality incidents, corrective actions, and general process performance. According
to the experts, this knowledge should be incorporated into our solution and we formulate the
following design principle:</p>
      <p>DP3 (Expert knowledge): The system should include the experts domain knowledge
To reduce the workload of the expert they asked for automation of a possible solution.
Analyzing data manually and making changes in the software for inspection planning are tedious
tasks. This contradicts the expert’s desire to control all decisions made in a possible solution.
Also, the incorporation of expert knowledge (DP3) implies some manual tasks. Therefore a
trade-of between automation of the solution and the expert controlling the whole process is
required. Our last design principle reads:</p>
      <p>DP4 (Automation): The system should be automated as much as possible while still giving the
user control over all decisions made</p>
    </sec>
    <sec id="sec-5">
      <title>4. System Design</title>
      <p>Based on these design principles we implemented our prototype. The system consisted of two
main components: A front-end application served as a tool for designing control charts and
supporting the decision of choosing the quality characteristics to replace with ML prediction
and a back-end algorithm for computing all possible combinations of characteristics that could
be replaced by these predictions.</p>
      <p>
        The front-end application was implemented using the open-source framework Dash [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. It
consists of four diferent views (see Figures 2-5). The first two views are meant to assist in the
design of control charts for a given process. The purpose of views three and four is to support
the decision to choose the characteristics to replace by ML predictions.
      </p>
      <p>The design of a control chart is conducted in a retrospective analysis (Phase I) of process
data to define the in-control process (see section 2). Based on the historic data of the in-control
process the control limits for the control charts are defined. Therefore our application first
enables the user to identify the in-control process by setting suitable limits (see Figure 2). The
user is presented with historical data of interval inspections to support this process. The
dropdown menu at the top left (1) allows the user to select the quality characteristic to inspect. Next
to the drop-down menu (2) he can switch between showing the mean, range, or all values of
the individual samples of the inspections. Also, the period to consider can be changed (3). With
simple input fields (4) the user is able to adjust the limits for the in-control process. All changes
are shown in real time in the central graph (5). Specification limits of the quality characteristics
are shown in red, and limits of the in-control process are in orange. In-control data points are
colored blue, and points indicating an out-of-control process are red.</p>
      <p>
        When satisfied by the definition of the in-control process the user continues to the detailed
view for configuring the control chart for that specific quality characteristic (see Figure 3).
The graphs on the left (1) show the distribution of the mean and range values grouped by the
in-control (green) and out-of-control (red) classification from the previous view. For each group,
a combination of a probability density function (PDF) and a histogram is shown. We used the
formula proposed by Doane [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] to determine the number of bins of the histogram and used
the open source package SciPy [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] to fit the best matching distribution function to the data.
The respective control limits for the control chart are displayed as orange vertical lines in the
graphs. The user can adjust them via the input fields at the top right (2). Below (3), the  - and
 -errors of the control charts are shown. These give the user an estimation of the performance
of the control chart. The graphs and the error metrics are updated in real time when adjusting
one of the control limits. In addition, the Cpk value for diferent time frames (last month, last 3
months, year to date, and all time) is calculated based on the in-control data and displayed at
the bottom right (4). The user can use these values to determine if the in-control process was
well defined.
      </p>
      <p>The user can switch back and forth between the two views we just described (Figure 2 and
3). This supports the iterative process of Phase I of designing a control chart where control
limits are evaluated and the in-control definition of the process can be revised (see Section 2).
The whole process requires extensive knowledge about the process from the user to identify
the in-control process correctly and find suitable control limits for the resulting control charts.
Using our application the process expert can incorporate his know-how (DP3).</p>
      <p>
        After finalizing this process the available data and parameters of the newly configured
control charts were fed into our back-end algorithm. Its purpose was to identify all possible
combinations of quality characteristics that are valid candidates to be replaced by ML predictions.
For each characteristic, a new ML model was constructed and trained. We used the Open
Source framework LightGBM as a regression model (see [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] for documentation). Gradient
Boosting Decision Trees (GBDT) became more and more popular over the last years and
proved to be accurate models in many data science competitions [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. LightGBM provides a
GBDT framework with high eficiency and fast training without compromising the prediction
accuracy [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. For hyper-parameter tuning of the models, we used the Optuna framework,
a state-of-the-art optimizer for hyper-parameters also proven to be one of the most eficient
available [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. Our train and validation data set contained historical data for 7 months (Sep
2021 - Mar 2022). A hold-out test set contained almost 2.5 months (Apr 2022 - mid-June
2022), 25% of the total data available. The hold-out set was reserved for testing our results
in our technical evaluation. After preprocessing the train and validation data consisted of
359 individual inspections each containing the measurements of the 15 quality characteristics.
The inspections were taken in almost consistent intervals of 8 hours with only a few gaps
in between (e.g. due to public holidays). Each combination of quality characteristics was
evaluated if it was a reasonable candidate to be replaced by the machine learning predictions.
The models were trained individually. The target values were all quality characteristics that
should be predicted. As features, we used the data of all quality characteristics not included in
the prediction. We validated the prediction results with a 10-fold cross-validation approach [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
Due to the limitation of our hardware, only 30 trials of the Optuna algorithm were performed
for parameter optimization of each of the models. Each run was evaluated with the root mean
squared error (RMSE) metric with the aim of minimizing it [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>Based on the predicted values we tested the previously designed control charts and compared
the results with the control charts using the real interval inspections. We used classification
error metrics for our evaluation. Of interest were the false negative rate (false alarms) and the
true negative rate (correctly identified out-of-control instances). Also, the Cpk of the process
could be calculated using the data labeled as in-control by our ML model. This was done for
each validation set of the cross-validation. The overall mean of the Cpk was used to identify if
the ML-based control charts could guarantee suficient process control. A Cpk lower than 1.33
disqualified a candidate.</p>
      <p>All valid combinations of quality characteristics that could be replaced by ML predictions
are presented to the user in view three of our front-end application (Figure 4). The error
metrics, overall out-of-control instances recognized (1- -error), and the number of false alarms
( -error) are displayed. These give the user an estimation of the overall performance of the
prediction models (DP2). For evaluating the eficiency increase the percentage savings of
physical inspections when using the ML-predicted quality characteristics is presented. Based
on this number the average monthly saved inspections are calculated and displayed.</p>
      <p>Clicking on one of the combinations opens a detailed view showing information on the
individual quality characteristics (Figure 5). For each characteristic, the average Cpk and the
standard deviation from our cross-validation runs are displayed. Also, the percentage of
out-ofcontrol instances recognized as well as the false alarms and the saved inspections are shown
for each quality characteristic individually. Note that the characteristics discussed here are
sensitive proprietary information of our industry partner and therefore concealed. They relate
to various features of the manufactured product, such as length, diameter, and surface finish.</p>
      <p>With this data at hand, the user can select a combination of quality characteristics to be
replaced by ML predictions. He can choose according to his personal risk afinity and weigh the
saved inspections and the possible risks of missed out-of-control instances against each other.
By only including combinations with a Cpk higher than 1.33 suficient process control is still
guaranteed (DP1). The user can incorporate his knowledge about the process into his decision
(DP3). The process of identifying the in-control process, designing the control charts, and finally
choosing the characteristics to replace by ML predictions is semi-automated. While all decisions
are made by the user the calculations regarding the process data and the displayed metrics are
automatically performed in the background (DP4). This being only a prototype there is no
integration in any productive systems. The automated replacement of interval inspections by
ML-predicted characteristics highly depends on the productive implementation of our solution
and is not part of the scope of this work.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Evaluation</title>
      <p>
        To evaluate our prototype from a technical performance point of view, we used the hold-out
test set containing 2.5 months of historical process data. In addition, we also conducted a
confirmatory focus group [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] consisting of diferent process and quality management experts
to evaluate our proposed solution.
      </p>
      <sec id="sec-6-1">
        <title>5.1. Technical Evaluation</title>
        <p>Together with the responsible process expert of our exemplary production line (that also supplied
the data for our application) we chose one of the combinations of quality characteristics (see
Figure 4) for our technical evaluation. According to the expert, the decision was mainly made
by comparing the number of saved inspections and out-of-control instances recognized. The
Cpk was higher than 1.33 for each quality characteristic. Therefore, it played only a minor role
in the decision process. However, the expert stressed that a Cpk lower than 1.33 would have
been an absolute obstacle. Therefore, he commented on the Cpk to be a good fit as a restriction
when choosing the combinations. According to the expert, it is a metric he is quite familiar
with due to his position and can be used to justify his decision to implement the control chart
based on ML predictions.</p>
        <p>We then evaluated the underlying machine learning models of the chosen combination of
quality characteristics with our hold-out data set. The hold-out data contained all data not
used to train and evaluate our models (see section 4) The goal was to test the efectiveness and
eficiency of the control charts based on ML predictions. We evaluated the results with similar
metrics as displayed in our application (see Figure 4). In addition, we compared the Cpk to the
Cpk when using a standard control chart. The results of our evaluation are displayed in table 1.</p>
        <p>The second and third columns show the Cpk calculated based on the data labeled as in-control
by the baseline control charts and the control charts based on ML predictions. The amount
of miss-classified instances is displayed in the fourth and fith columns of the table as  and
 -error. In the last column, we can see the potentially saved inspections when replacing all
physical inspections of the given quality characteristics with ML predictions.</p>
        <p>When comparing these results to the results of our cross-validation displayed in figure 5 we
can see that the overall prediction accuracy was similar. Only characteristic 7 shows a significant
diference in its classification error of out-of-control instances. This indicates overfitting of the
underlying ML model which could be solved by providing more training data.</p>
        <p>Regarding the Cpk of the individual quality characteristic, all values lie above the limit of
1.33. This means suficient process control is provided by the control charts. It should be
highlighted that there were no out-of-control instances present for characteristic 15 and only
two for characteristic 5 which were both not recognized. The absence of out-of-control instances
makes the evaluation of these characteristics dificult but the high Cpk indicates very stable
processes with little risk of ever exceeding specification limits. Therefore, these characteristics
are promising candidates to be replaced by ML predictions.</p>
        <p>A large decrease of the Cpk for characteristic 2 can be detected. It is significantly worse
than the baseline control chart and the results from our validation set. This was mainly due to
the high  -error. While inspecting the prediction results we found some extreme outliers that
were not recognized and led to a wider spread of the distribution of the characteristics values,
hence a lower Cpk. Even though the Cpk dramatically decreased when using ML predictions it
still did not fall below the limit of 1.33. We argue that increasing the amount of training data
would probably lead to better prediction results by including instances of extreme outliers in
the future.</p>
        <p>Regarding the eficiency of the control charts based on ML-predicted quality characteristics,
we can see that they require between 62.5% and 100% less interval inspections. Even when
not all but every second interval inspection is replaced by the predicted quality characteristic
this could have an enormous impact. With an inspection conducted on average every 8 hours,
this leads to around 90 inspections per month per characteristic. With every inspection taking
around 5 minutes, based on our observations, this would result in around 18 hours of work
saved each month for one production line.</p>
        <p>With our technical evaluation, we validate the results of our implemented ML models. A
slight decrease in performance can be seen which indicates some overfitting to the test and
validation set. We also showed the potential impact on interval inspections of our solution. To
assess the long-term impact of our solution some field testing over a longer period is required.</p>
      </sec>
      <sec id="sec-6-2">
        <title>5.2. Focus Group</title>
        <p>
          To evaluate the efectiveness of our solution we conducted a confirmatory focus group [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. The
focus group consisted of six diferent process and quality management experts. We presented
the experts with our prototype and guided them through the diferent views.
        </p>
        <p>While adjusting the limits for defining the in-control process the experts highlighted the
simplicity of the solution. With only three limits the process instances could be classified as
in- or out-of-control. Easily switching between graphs of mean, range, and all values was
acknowledged to support defining the limits for the in-control process. In the next view (see
Figure 3) the visual representation of the probability density function of in-control and
out-ofcontrol data appeared to be a useful aid for the user when defining the control limits of the
control chart. In addition, the experts highlighted that the two error metrics ( and  error)
would help to assess the risk potential of the control limits.</p>
        <p>We also presented the results of our technical evaluation to the experts. The potential
reduction of interval inspections was a great inducement for using our solution. Multiple focus
group participants ofered their participation in a future field study. The features implemented
were confirmed to be useful and an overall high utility of the prototype was attested for. None
of the members of the focus group felt overwhelmed by the application but highlighted the
simplicity of adjusting the in-control process and the control chart limits.</p>
        <p>To better understand the underlying ML models the experts requested some insights into their
functioning. They suggested that this could generate profound knowledge about the process
which could prove useful. We conclude that the efectiveness of our solution is confirmed
by the focus group. Furthermore, the results of our technical evaluation were well received.
In the future, some methods for explaining the underlying machine learning and its feature
importance could be implemented to further support the process experts.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6. Discussion</title>
      <p>
        The optimization of interval inspections with adaptive control charts has been a well-established
topic in the literature [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ]. There has also been significant research into economic
approaches for designing control charts, considering the costs associated with quality management
measurements and quality-related issues [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. However, the utilization of predicted quality
characteristics as a substitute for manual inspections has received limited attention. Existing
approaches had limited success in achieving a satisfactory outcome in the design of control
charts that rely on predicted quality characteristics [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Our research fills this gap by providing
valuable insights for researchers and practitioners interested in designing a control chart system
that leverages ML-predicted quality characteristics.
      </p>
      <p>We contribute to the literature on the utilization of ML techniques in the field of SPC and
control charts by providing four design principles for constructing such a system. These principles
can be applied across diferent problem scenarios, demonstrating the broad applicability of our
approach. Our implementation ofers practical guidance on how to address these principles in
various real-world scenarios. While ML-predictions decrease the necessity for manual interval
inspections we integrated the Cpk metric to ensure suficient process control (DP1). Given
the widespread utilization of Cpk as a performance indicator in SPC, this design aspect holds
relevance across diverse manufacturing settings. To assess the potential risks associated with
ML-predicted quality characteristics, we presented classification error metrics (  - and  -errors)
of the underlying control charts (DP2). These were validated using our dedicated test and
validation dataset. Through the manual identification of the out-of-control process, configuration
of control charts, and selection of quality characteristics to replace by predictions, the process
expert could leverage their process-specific knowledge in guiding their actions and decisions
(DP3). It is important to note that full automation of the process is not feasible due to certain
manual steps involved (DP4). While automated calculations were executed in the background,
the seamless integration of ML-predicted quality attributes into existing SPC solutions, which
is beyond the scope of this work, remains an unresolved aspect.</p>
    </sec>
    <sec id="sec-8">
      <title>7. Conclusion</title>
      <p>The goal of this work was to design an analytical control chart system for increasing the
eficiency of interval inspections. We therefore dealt with the question of how to design a
system that enables the use of predicted quality characteristics in control charts. To solve
this, we conducted a series of semi-structured interviews to derive design principles for such
a solution. Based on these principles we implemented a prototype to support the design of
control charts based on quality characteristics predicted by machine learning algorithms. We
then technically evaluated the system to measure the possible impact on the eficiency of
interval inspections and conducted a confirmatory focus group to evaluate the usefulness of
our solution. Our results show that substantial time savings could be achieved by replacing
interval inspections with ML-predicted quality characteristics. Furthermore, the focus group
confirmed the efectiveness of our solution. In the future, we aim to evaluate the system over a
longer period in a field experiment. Additionally, we plan to incorporate features for explaining
the underlying machine learning models. This could further support the user in his decision
and generate useful process insights.</p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgments References</title>
      <p>We thank Max Schemmer and Ulrich Gnewuch for their useful feedback, and Stefan Heitz, for
supporting our work at Robert Bosch GmbH.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rungtusanatham</surname>
          </string-name>
          ,
          <article-title>The quality and motivational efects of statistical process control</article-title>
          ,
          <source>Journal of Quality Management</source>
          <volume>4</volume>
          (
          <year>1999</year>
          )
          <fpage>243</fpage>
          -
          <lpage>264</lpage>
          . URL: https://www.sciencedirect.com/ science/article/pii/S1084856899000152. doi:
          <volume>10</volume>
          .1016/S1084-
          <volume>8568</volume>
          (
          <issue>99</issue>
          )
          <fpage>00015</fpage>
          -
          <lpage>2</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. H. L.</given-names>
            <surname>Chiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Storey</surname>
          </string-name>
          ,
          <article-title>Business intelligence and analytics: From big data to big impact</article-title>
          ,
          <source>MIS Quaterly 36</source>
          (
          <year>2012</year>
          )
          <fpage>1165</fpage>
          -
          <lpage>1188</lpage>
          . URL: https://www.jstor.org/stable/41703503. doi:
          <volume>10</volume>
          .2307/41703503, publisher: Management Information Systems Research Center, University of Minnesota.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Owen</surname>
          </string-name>
          , SPC and Continuous Improvement, Springer Berlin / Heidelberg, Berlin, Heidelberg,
          <year>1989</year>
          . URL: https://ebookcentral.proquest.com/lib/kxp/detail.action?docID=
          <fpage>6557055</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Montgomery</surname>
          </string-name>
          ,
          <article-title>Introduction to statistical quality control, 5</article-title>
          . ed. ed., Wiley, Hoboken, NJ,
          <year>2005</year>
          . URL: http://www.loc.gov/catdir/enhancements/fy0621/2004556782-b.html.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Oakland</surname>
          </string-name>
          ,
          <article-title>Statistical process control, 6</article-title>
          . ed. ed.,
          <source>Routledge</source>
          , London and New York,
          <year>2011</year>
          . URL: http://www.sciencedirect.com/science/book/9780750669627.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Reynolds</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. W.</given-names>
            <surname>Amin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Arnold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Nachlas</surname>
          </string-name>
          ,
          <article-title>charts with variable sampling intervals</article-title>
          ,
          <source>Technometrics</source>
          <volume>30</volume>
          (
          <year>1988</year>
          )
          <article-title>181</article-title>
          . doi:
          <volume>10</volume>
          .2307/1270164.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Alwan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. V.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <article-title>Time-series modeling for statistical process control</article-title>
          ,
          <source>Journal of Business &amp; Economic Statistics</source>
          <volume>6</volume>
          (
          <year>1988</year>
          )
          <article-title>87</article-title>
          . doi:
          <volume>10</volume>
          .2307/1391421.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Zan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <article-title>Control chart pattern recognition using the convolutional neural network</article-title>
          ,
          <source>Journal of Intelligent Manufacturing</source>
          <volume>31</volume>
          (
          <year>2020</year>
          )
          <fpage>703</fpage>
          -
          <lpage>716</lpage>
          . doi:
          <volume>10</volume>
          .1007/s10845- 019- 01473- 0.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ferrer</surname>
          </string-name>
          ,
          <article-title>Multivariate statistical process control based on principal component analysis (MSPC-PCA): Some reflections and a case study in an autobody assembly process</article-title>
          ,
          <source>Quality Engineering</source>
          <volume>19</volume>
          (
          <year>2007</year>
          )
          <fpage>311</fpage>
          -
          <lpage>325</lpage>
          . URL: https://doi.org/10. 1080/08982110701621304. doi:
          <volume>10</volume>
          .1080/08982110701621304, publisher: Taylor &amp; Francis_eprint: https://doi.org/10.1080/08982110701621304.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>L.</given-names>
            <surname>Tong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>Constructing control process for wafer defects using data mining technique</article-title>
          ,
          <source>ICEB 2004 Proceedings (Beijing, China)</source>
          (
          <year>2004</year>
          ). URL: https://aisel.aisnet.org/iceb2004/208.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>O.</given-names>
            <surname>Hryniewicz</surname>
          </string-name>
          ,
          <article-title>Spc of processes with predicted data: Application of the data mining methodology</article-title>
          ,
          <source>in: Frontiers in Statistical Quality Control 11</source>
          , Springer, Cham,
          <year>2015</year>
          , pp.
          <fpage>219</fpage>
          -
          <lpage>235</lpage>
          . URL: https://link.springer.com/chapter/10.1007/978-3-
          <fpage>319</fpage>
          -12355-4_
          <fpage>14</fpage>
          . doi:
          <volume>10</volume>
          . 1007/978- 3-
          <fpage>319</fpage>
          - 12355- 4_
          <fpage>14</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chakraborti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. W.</given-names>
            <surname>Human</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Graham</surname>
          </string-name>
          ,
          <article-title>Phase i statistical process control charts: An overview and some results</article-title>
          ,
          <source>Quality Engineering</source>
          <volume>21</volume>
          (
          <year>2008</year>
          )
          <fpage>52</fpage>
          -
          <lpage>62</lpage>
          . doi:
          <volume>10</volume>
          .1080/ 08982110802445561.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Woodall</surname>
          </string-name>
          ,
          <article-title>Controversies and contradictions in statistical process control</article-title>
          ,
          <source>Journal of Quality Technology</source>
          <volume>32</volume>
          (
          <year>2000</year>
          )
          <fpage>341</fpage>
          -
          <lpage>350</lpage>
          . doi:
          <volume>10</volume>
          .1080/00224065.
          <year>2000</year>
          .
          <volume>11980013</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Whiting</surname>
          </string-name>
          ,
          <article-title>Semi-structured interviews: guidance for novice researchers, Nursing standard (Royal College of Nursing (Great Britain) :</article-title>
          <year>1987</year>
          )
          <volume>22</volume>
          (
          <year>2008</year>
          )
          <fpage>35</fpage>
          -
          <lpage>40</lpage>
          . URL: https: //pubmed.ncbi.nlm.nih.gov/18323051/.
          <source>doi:10.7748/ns2008.02.22.23.35.c6420.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Palinkas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Horwitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Green</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Wisdom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Duan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hoagwood</surname>
          </string-name>
          ,
          <article-title>Purposeful sampling for qualitative data collection and analysis in mixed method implementation research, Administration and policy in mental health 42 (</article-title>
          <year>2015</year>
          )
          <fpage>533</fpage>
          -
          <lpage>544</lpage>
          . doi:
          <volume>10</volume>
          .1007/ s10488- 013- 0528- y.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Plotly</surname>
          </string-name>
          , Dash,
          <year>2022</year>
          . URL: https://dash.plotly.com/.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Doane</surname>
          </string-name>
          , Aesthetic frequency classifications,
          <source>The American Statistician</source>
          <volume>30</volume>
          (
          <year>1976</year>
          )
          <article-title>181</article-title>
          . doi:
          <volume>10</volume>
          .2307/2683757.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>SciPy</surname>
          </string-name>
          , Scipy,
          <year>2022</year>
          . URL: https://docs.scipy.org/doc/.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Microsoft</surname>
            <given-names>Corporation</given-names>
          </string-name>
          ,
          <source>Lightgbm 3.3.2.99 documentation</source>
          ,
          <volume>08</volume>
          .
          <fpage>07</fpage>
          .
          <year>2022</year>
          . URL: https://lightgbm. readthedocs.io/en/latest/.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Natekin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Knoll</surname>
          </string-name>
          ,
          <article-title>Gradient boosting machines, a tutorial, Frontiers in Neurorobotics 7 (</article-title>
          <year>2013</year>
          )
          <article-title>21</article-title>
          . URL: https://www.frontiersin.org/articles/10.3389/fnbot.
          <year>2013</year>
          .00021/full. doi:
          <volume>10</volume>
          . 3389/fnbot.
          <year>2013</year>
          .
          <volume>00021</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>G.</given-names>
            <surname>Ke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Finley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          , W. Ma,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ye</surname>
          </string-name>
          , T.-Y. Liu,
          <article-title>Lightgbm: A highly eficient gradient boosting decision tree</article-title>
          , in: I. Guyon,
          <string-name>
            <given-names>U.</given-names>
            <surname>Von Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>30</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc,
          <year>2017</year>
          . URL: https://proceedings. neurips.cc/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>T.</given-names>
            <surname>Akiba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yanase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ohta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Koyama</surname>
          </string-name>
          ,
          <article-title>Optuna: A next-generation hyperparameter optimization framework</article-title>
          , in: A.
          <string-name>
            <surname>Teredesai</surname>
          </string-name>
          (Ed.),
          <source>Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining</source>
          , ACM Digital Library, Association for Computing Machinery, New York,NY,
          <string-name>
            <surname>United</surname>
            <given-names>States</given-names>
          </string-name>
          ,
          <year>2019</year>
          , pp.
          <fpage>2623</fpage>
          -
          <lpage>2631</lpage>
          . doi:
          <volume>10</volume>
          .1145/3292500.3330701.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>P.</given-names>
            <surname>Refaeilzadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tang</surname>
          </string-name>
          , H. Liu, Cross-validation,
          <source>in: Encyclopedia of Database Systems</source>
          , Springer US, Boston, MA,
          <year>2009</year>
          , pp.
          <fpage>532</fpage>
          -
          <lpage>538</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-0-
          <fpage>387</fpage>
          -39940-9_
          <fpage>565</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>T.</given-names>
            <surname>Chai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Draxler</surname>
          </string-name>
          ,
          <article-title>Root mean square error (rmse) or mean absolute error (mae)? - arguments against avoiding rmse in the literature</article-title>
          ,
          <source>Geoscientific Model Development</source>
          <volume>7</volume>
          (
          <year>2014</year>
          )
          <fpage>1247</fpage>
          -
          <lpage>1250</lpage>
          . URL: https://gmd.copernicus.org/articles/7/1247/2014/. doi:
          <volume>10</volume>
          .5194/ gmd-7-
          <fpage>1247</fpage>
          -
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>M. C. Tremblay</surname>
            ,
            <given-names>A. R.</given-names>
          </string-name>
          <string-name>
            <surname>Hevner</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          <string-name>
            <surname>Berndt</surname>
          </string-name>
          ,
          <article-title>The use of focus groups in design science research</article-title>
          , in: A.
          <string-name>
            <surname>R. Hevner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Chatterjee</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Gray</surname>
          </string-name>
          , C. Y. Baldwin (Eds.),
          <source>Design research in information systems</source>
          , volume
          <volume>22</volume>
          <source>of Integrated Series in Information Systems</source>
          , Springer, New York, NY and Dordrecht and Heidelberg and London,
          <year>2010</year>
          , pp.
          <fpage>121</fpage>
          -
          <lpage>143</lpage>
          . doi:
          <volume>10</volume>
          .1007/ 978-1-
          <fpage>4419</fpage>
          -5653-8_
          <fpage>10</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>T.</given-names>
            <surname>Perdikis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Psarakis</surname>
          </string-name>
          ,
          <article-title>A survey on multivariate adaptive control charts: Recent developments and extensions</article-title>
          ,
          <source>Quality and Reliability Engineering International</source>
          <volume>35</volume>
          (
          <year>2019</year>
          )
          <fpage>1342</fpage>
          -
          <lpage>1362</lpage>
          . doi:
          <volume>10</volume>
          .1002/qre.2521.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Psarakis</surname>
          </string-name>
          ,
          <article-title>Adaptive control charts: Recent developments and extensions</article-title>
          ,
          <source>Quality and Reliability Engineering International</source>
          <volume>31</volume>
          (
          <year>2015</year>
          )
          <fpage>1265</fpage>
          -
          <lpage>1280</lpage>
          . doi:
          <volume>10</volume>
          .1002/qre.
          <year>1850</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>G.</given-names>
            <surname>Celano</surname>
          </string-name>
          ,
          <article-title>On the constrained economic design of control charts: a literature review</article-title>
          ,
          <source>Production</source>
          <volume>21</volume>
          (
          <year>2011</year>
          )
          <fpage>223</fpage>
          -
          <lpage>234</lpage>
          . doi:
          <volume>10</volume>
          .1590/s0103-
          <fpage>65132011005000014</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>