<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic ML for Manufacturing Monitoring at Bosch</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Baifan Zhou</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yulia Svetashova</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tim Pychynski</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Evgeny Kharlamov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bosch Center for AI</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Karlsruhe Institute of Technology</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Oslo</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Motivation. Technological advances that come with Industry 4.0, in e.g. sensoring and communication, unlock unprecedented large volumes of manufacturing data. This opens new horizons for data-driven methods like Machine Learning (ML) in analysis of manufacturing processes for a wide range of industries. An important scenario here is monitoring of manufacturing processes, including e.g. analysing the quality of the manufactured products and predicting the health state of machines and equipment. Consider an example of welding quality monitoring at Bosch, where welding is performed with automated machines that connect pieces of metal together by pressing them and passing high current electricity through them. Development of ML approaches for welding quality monitoring used in Bosch follows an iterative workflow that includes data collection (Step 1), task negotiation (Step 2), data preparation (Step 3), ML model development (Step 4), result interpretation and model selection (Step 5), model deployment (Step 6). Challenges. Development of such ML approaches is complex and costly and at Bosch it is mostly affected by the following 3 challenges: - Transparency: Steps 2 and 5 of welding quality monitoring require collaborative work of experts from different areas. The asymmetric knowledge backgrounds, including complexity of engineering practices in manufacturing and sophistication of ML algorithms that constrain the transparency of ML results and models, make the communication time consuming and error-prone. - Data preparation: Step 3 requires to integrate data from dozens of sources and this is a labor-intensive effort that requires necessary understanding of multi-faceted domain knowledge and plentiful data complications. - Generalisability of ML quality models: each ML model developed in Step 4 is typically tailored to a specific dataset and one process and thus its reuse for other data or processes, which is often needed, requires a significant effort.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        By reasoning over ontologies we automate ML modelling. In particular, when
welding experts annotate raw welding data with features from the Domain (see Data
annotation box in Fig. 1), the reasoner derives from these features the relevant feature groups,
and then with the help of the ML Pipeline it derives relevant feature processing
algorithms, and relevant ML algorithms. Moreover, we use domain ontologies to annotate
resulting ML models and their selected features, thus enhancing their explainablility.
Evaluation at Bosch. We implemented and evaluated our approach at Bosch for
welding quality monitoring and prediction. The purpose was to enable predictive measures
such as equipment (re-)configuration or maintenance and thus to prevent welding
quality failures. The evaluation was done in the offline mode on two car manufacturing lines.
These lines generated large volumes of heterogeneous data such as historic data with
sensor measurements, welding machine configurations, manufacturing specifications,
and the quality estimates of finished welding operations. The raw data came in SQL,
Excel, plain text files, RUI-files and resulted in 263 fields with 53.2 million records, and
1.4 billion items after data integration. The accuracy of welding quality prediction by
ML models constructed with the help of SemML is good: the mean absolute
percentage error is within 1.61%–2.50% comparing to 3.19%–7.74% for simple baselines [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Moreover, we evaluated the usability of our system [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] with very promising results:
Bosch engineers were able to construct domain and ML-pipeline ontologies relatively
fast with minor training and gave high overall usability scores to our system.
Outlook. Our current results on semanticfication and thus simplification and
automation of ML model development for monitoring of manufacturing processes are
promising. At the same time there is still a number of important steps to be done to bring them
the necessary degree of maturity and deploy them in production at Bosch by integrating
them into a new welding control system and automation platform. We plan to further
develop our ML pipelines with more methods, e.g. Feature Learning, ARIMA, and
conduct more extensive study specifically focusing on analysis of input feature importance.
We also plan to evaluate all our ML pipelines on scenarios different than welding.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Svetashova</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pychynski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sure-Vetter</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikut</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
          </string-name>
          , E.:
          <article-title>Ontology-Enhanced Machine Learning Pipeline: A Bosch Use Case of Welding Quality Monitoring</article-title>
          . In: ISWC (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svetashova</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Byeon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pychynski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikut</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
          </string-name>
          , E.:
          <source>Predicting Quality of Automated Welding with Machine Learning and Semantics: a Bosch Case Study. In: CIKM</source>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svetashova</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pychynski</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
          </string-name>
          , E.:
          <article-title>SemFE: Facilitating ML Pipeline Development with Semantics</article-title>
          .
          <source>In: CIKM</source>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>