=Paper= {{Paper |id=Vol-3759/workshop6 |storemode=property |title=Scalable processing of Bosch Welding Data with Graph Massivizer |pdfUrl=https://ceur-ws.org/Vol-3759/workshop6.pdf |volume=Vol-3759 |authors=Mikel Mendibe,Gad-Elrab Mohamed,Evgeny Kharlamov |dblpUrl=https://dblp.org/rec/conf/i-semantics/MendibeGK24 }} ==Scalable processing of Bosch Welding Data with Graph Massivizer== https://ceur-ws.org/Vol-3759/workshop6.pdf
                         Scalable processing of Bosch Welding Data with Graph
                         Massivizer
                         Mikel Mendibe1 , Gad-Elrab Mohamed1 and Evgeny Kharlamov1,2
                         1
                             Bosch Center for Artificial Intelligence, Germany
                         2
                             University of Oslo, Norway


                                        Abstract
                                        The transition to Industry 4.0 is driving the adoption of IoT technologies in manufacturing, transforming processes
                                        like Resistance Spot Welding (RSW) at companies such as Bosch. RSW is essential in automotive production but
                                        presents significant challenges in quality assurance due to its complexity and the limitations of traditional offline
                                        inspection methods. While data-driven techniques using dynamic electrical features offer some improvements,
                                        they often lack the ability to fully capture the intricacies of the RSW process or require expensive hardware.
                                            This paper introduces a hybrid approach from the Graph Massivizer project, combining sensor data with
                                        domain-specific knowledge in a knowledge graph. The proposed method leverages existing ontologies as inductive
                                        biases to enhance prediction accuracy and adaptability across various welding scenarios. By integrating time-
                                        series data with knowledge graph representations, the approach offers a more holistic understanding of the
                                        welding process. This paper discusses the methodology, the challenges of merging these data types, and the
                                        potential to improve RSW quality monitoring by bridging the gap between conventional data-driven methods
                                        and expert insights, resulting in a more robust and efficient monitoring framework.

                                        Keywords
                                        Industry 4.0, resistance spot welding, knowledge graphs, Graph Massivizer




                         1. Introduction
                         With the advent of the fourth industrial revolution, the manufacturing environments are evolving
                         towards digital ecosystems. Propelled by the improvement in sensing and communication technologies,
                         the Internet of Things (IoT) has taken root in traditional industries, and this has opened the world of Big
                         Data, where the volume, velocity and variety of data is rapidly rising [1]. This has been no different for
                         the case of Bosch, currently undergoing a transformation from a traditional manufacturing company to
                         an IoT one[2].
                            One of the most critical manufacturing processes at Bosch is Resistance Spot Welding (RSW), primarily
                         employed for joining car body components. This process involves passing a high-intensity electrical
                         current through two electrodes and the metal sheets positioned between them, generating localized
                         heat that fuses the sheets together at specific points, creating a weld spot [3]. Even if it is considered a
                         conventional method, this is still a very complex yet critical procedure [4], as the failure of a single spot
                         can halt an entire production line [3]. In the industry, quality assurance for RSW is typically achieved via
                         offline destructive inspection methods on samples selected by predefined empirical rules, but due to the
                         heavy class imbalance on the sets, detecting faulty spots is usually a very rare occurrence. Consequently,
                         data-driven approaches have been proposed in recent years, with the objective of detecting defective
                         samples more efficiently and reducing scrap and downtime [4]. Some of these approaches focus on
                         image processing, and they are oriented towards assessing the quality of the spots in an online fashion
                         with the help of cameras mounted on the welding devices Prediction of the Weld Qualities Using Surface
                         Appearance Image in Resistance Spot Welding, Monitoring of Resistance Spot Welding Process). Others,
                         mainly directed by the work of Chen et al., developed systems for weld quality estimation based on
                         infrared thermography [5] [6]. However, implementing these methods is often challenging due to the

                          First International Workshop on Scaling Knowledge Graphs for Industry, co-located with 20th International Conference on Semantic
                          Systems (SEMANTICS) - Amsterdam, Sept. 17–19, 2024
                          $ mikel.mendibeabarrategi@de.bosch.com (M. Mendibe); mohamed.gad-elrab@de.bosch.com (G. Mohamed);
                          evgeny.kharlamov@de.bosch.com (E. Kharlamov)
                                        © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).


CEUR
                  ceur-ws.org
Workshop      ISSN 1613-0073
Proceedings
need to acquire, install, and integrate specialized hardware, which can be costly and time-consuming.
Therefore, the research focused on using intrinsic process variables for monitoring is especially valuable.
From the early works of Cho et al. [7], many different data-driven methods have been developed, either
defined as classification or prediction tasks, depending on whether the intention is to estimate the
welding nugget diameter, or to identify defective welds. In this paper, developed under the Graph
Massivizer project, we introduce the need for a hybrid approach for welding quality estimation, one
that leverages on existing ontologies to act as an inductive bias on top of the available sensor data, with
the objective of obtaining a more grounded and efficient algorithm.


2. Use case
Resistance Spot Welding (RSW) is widely used in the automotive industry, as over 90% of assembly
work in a car body is completed by RSW [8]. These spots are critical for the structural integrity of
the vehicle, and the amount of spot welds in a car are counted in the thousands [9]. Resistance Spot
Welding is a process used to join two or more metal sheets together by applying pressure and passing an
electric current through the materials. The process generates heat due to the resistance to the electrical
current in the contact area between the metal sheets, causing the metals to melt and fuse together. The
welding electrodes also provide the necessary force to hold the materials in place during welding [10].
This is a very complicated process involving electromagnetic, thermal, mechanical and metallurgical
variables [8]. Even in very controlled environments, defects have been shown to be very difficult to
detect, as the welds are hidden between two sheets of metal. As such, in resistance spot welding, quality
assurance typically involves the use of offline destructive inspection techniques on selected samples.
These methods involve directly measuring the mechanical strength, fatigue resistance, and failure modes
of the weld joints. By physically testing these samples to the point of failure, critical information about
the integrity and durability of the weld can be obtained. This approach, while effective in assessing joint
quality, is limited by its destructive nature and the fact that it only provides insight into the specific
samples tested, rather than the entire production run [4].
   Most modern RSW operations are very efficient, and defective welds are a rare yet critical occurrence,
as such, defining data-driven methods to target these defects is a priority for the automotive industry,
and it has been a very rich subject of research. The main difficulties explored in the literature for the
implementation of efficient quality prediction algorithms are the following: (i) limit on labeled data
amount, (ii) limit on features, (iii) limit on coverage of relevant situations [11]. Increasing the amount of
labeled data is a resource-intensive task, therefore, it is often best avoided. Regarding (ii), some authors
have made use of additional sensors (infrared sensors [5], cameras...) for reliable quality prediction,
however, this is often difficult to implement and over-complicates the process, which leads to more
failures and downtime. Lastly, (iii) is the most pressing challenge, because as quality is good under
normal conditions, the datasets are often times heavily imbalanced, and creating more anomalous labels
requires expensive laboratory testing.
   Therefore, it is specifically interesting to detect RSW defects using the highly available dynamic
electrical features captured during the process, namely the current and the voltage, which are used to
conform to the dynamic resistance. These methods treat the dynamic curves as time series and aim
at identifying the underlying patterns that lead to anomalies in the welds. In [12], a random forest
classifier was trained with features derived from this signal and achieved up to 98% accuracy. Similarly,
[13] used a radial basis function neural network to predict nugget diameter and weld strength, achieving
98% accuracy by classifying features into different quality levels. In [14], a kernel extreme machine
model optimized with a particle swarm algorithm also reached 98% accuracy. The performance of these
models is very solid in their specific scenarios, and it leads to the incorrect idea that quality assurance in
RSW is a solved topic. However, as mentioned by Stavropoulos et al. [4], RSW is a very heterogeneous
process, and the variation of the elements that comprise the RSW ecosystem and process (machines,
materials, thickness, position...) heavily affect the final quality of the joint.
   Towards this direction, a need for a more holistic approach has been defined, one that takes into
account a bigger portion of the welding ecosystem and is able to adapt to the different flavors that RSW
can have. It is this approach that Tan et al. followed [15] by framing the quality prediction problem as
one of link prediction in a literal-aware RSW welding knowledge graph. By integrating background
knowledge on the prediction, the model is able to get a higher-level view of the operation and tackle the
heterogeneity of RSW in a more holistic manner. Even if the performance of this approach is notable,
in an attempt to frame it under the more traditional embedding methods, it deals with the time-series
nature of the curves by aggregating the values of the dynamical electrical features, leading to a great
loss of expressivity and decrease in performance, as Dickinson et al. stated in [16].


3. Proposed methodology
It has been shown that while data-driven methods lack the capacity to model the complexities of the
RSW ecosystem, knowledge-graph-driven approaches lack the expressivity and granularity afforded
by time-series data. Thus, motivated by the work of Jain et al. [17], the objective of Bosch and the
consortium partners under the Graph Massivizer project is to merge both approaches and design the
next-generation quality monitoring methods that go beyond the more traditional data-driven methods
by combining knowledge and sensor measurements.
   On the one hand, we refer to sensor measurements as the time series that are produced during the
welding operation, which can have the form of currents, voltages, temperatures. . . These are variables
that evolve during each weld and are fundamental to the estimation of the quality (e.g., if an abnormality
has been detected in the current that flows through the cathode of the welding machine, the presence
of an anomaly will be likely). On the other hand, we refer to knowledge that encompasses diverse and
rich prior information about the process, derived from expert knowledge, such as the ontology for a
specific RSW machine. This ontology is used as the backbone to create knowledge graphs for each
specific operation, similar to the approach that Tan et al. followed in [15], but instead of pointing at the
aggregated values of the dynamic features, it points at the entire representation of the time-series, thus
maintaining the expressivity in the analysis. This ontology is then used as a structural inductive bias
that inherently captures the nature of RSW. Subsequently, a Graph Neural Network (GNN) is trained
for prediction, leveraging the ontology-based knowledge graph to enhance its learning capability [18].
   This method is not only expected to improve the model’s accuracy in predicting outcomes but also
enable it to generalize better across different RSW operations by integrating domain-specific knowledge.
Additionally, the use of GNNs allows for the incorporation of complex relationships and dependencies
between various process parameters, providing a more holistic and detailed understanding of the RSW
process dynamics. This approach demonstrates the potential to bridge the gap between data-driven
methods and expert knowledge, resulting in a more robust and comprehensive modeling framework.


4. Drawbacks
Despite the high expectations for the project and the computational soundness of the method, we are
currently encountering some challenges.

    • A significant challenge lies in effectively representing time-series data within a graph-based
      framework. While time-series data provide rich, dynamic information about the welding process,
      converting this data into a graph structure that preserves its temporal properties and contextual
      relevance is complex. One possible solution is to use visibility graphs, which transform time-series
      data into networks where each data point (or "node") is connected to other nodes based on specific
      visibility criteria. Although visibility graphs can capture the intrinsic patterns and relationships
      within the time series, they may struggle with scalability when dealing with large datasets or
      high-dimensional data. Furthermore, it remains unclear how to best integrate these graph-based
      representations with existing domain knowledge in a cohesive manner that enhances the model’s
      predictive capability.
    • Combining knowledge graph data with time-series data in a meaningful way remains an open
      question. For example, a potential approach could involve appending embeddings of events
      generated by a BERT-like model [19] to embeddings derived from a feature tokenizer of the
      time-series data, as proposed by Jain et al. [17]. However, this strategy may not be effective in
      our use case due to the scarcity of events, which limits the potential information gain from such
      embeddings.
    • Using ontologies as structural inductive biases, while theoretically sound, raises questions about
      scalability. For instance, one could use a BERT-like model to create embeddings for materials,
      static conditions, and other domain-specific knowledge, and then train a Graph Neural Network
      (GNN) with this data. However, it is uncertain how well this approach would scale to larger
      datasets or more complex welding scenarios. Moreover, techniques like node2vec [20], which
      are often used for node embeddings in graphs, may not be suitable for capturing the nuanced,
      multi-relational nature of the knowledge graph in this context.


5. Conclusion
In conclusion, this paper highlights the need for a hybrid approach to quality assurance in Resistance
Spot Welding (RSW) by combining knowledge graphs and time-series data to address the limitations of
traditional data-driven and knowledge-graph-based methods. The proposed methodology, leveraging
Graph Neural Networks (GNNs) and domain-specific ontologies, offers a more comprehensive and
adaptive modeling framework that captures the complex dependencies and dynamics of the RSW
process, leading to improved prediction accuracy and robustness. However, challenges remain in
effectively integrating time-series data within a graph-based framework and ensuring scalability across
diverse welding scenarios.


Acknowledgments
The Graph Massiviser (GA 101093202) EU project supported this work.


References
 [1] D. Mourtzis, E. Vlachou, N. Milas, Industrial Big Data as a Result of IoT Adoption in Manufactur-
     ing, Procedia CIRP 55 (2016) 290–295. URL: https://www.sciencedirect.com/science/article/pii/
     S2212827116307880. doi:10.1016/j.procir.2016.07.038.
 [2] A.-K. Leiting, L. De Cuyper, C. Kauffmann, The Internet of Things and the case of Bosch: Changing
     business models while staying true to yourself, Technovation 118 (2022) 102497. URL: https://
     www.sciencedirect.com/science/article/pii/S016649722200044X. doi:10.1016/j.technovation.
     2022.102497.
 [3] B. Zhou, Y. Svetashova, S. Byeon, T. Pychynski, R. Mikut, E. Kharlamov, Predicting Quality of
     Automated Welding with Machine Learning and Semantics: A Bosch Case Study, in: Proceedings
     of the 29th ACM International Conference on Information & Knowledge Management, CIKM
     ’20, Association for Computing Machinery, New York, NY, USA, 2020, pp. 2933–2940. URL: https:
     //doi.org/10.1145/3340531.3412737. doi:10.1145/3340531.3412737.
 [4] P. Stavropoulos, K. Sabatakakis, Quality Assurance in Resistance Spot Welding: State of Practice,
     State of the Art, and Prospects, Metals 14 (2024) 185. URL: https://www.mdpi.com/2075-4701/14/
     2/185. doi:10.3390/met14020185, number: 2 Publisher: Multidisciplinary Digital Publishing
     Institute.
 [5] J. Chen, Z. Feng, Online resistance spot weld NDE using infrared thermography 10169
     (2017) 101690K. URL: https://ui.adsabs.harvard.edu/abs/2017SPIE10169E..0KC. doi:10.1117/12.
     2260896, conference Name: Society of Photo-Optical Instrumentation Engineers (SPIE) Confer-
     ence Series ADS Bibcode: 2017SPIE10169E..0KC.
 [6] J. Chen, Z. Feng,               IR-based spot weld NDT in automotive applications,               in:
     Thermosense: Thermal Infrared Applications XXXVII, volume 9485, SPIE, 2015, pp.
     283–288. URL: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/9485/948513/
     IR-based-spot-weld-NDT-in-automotive-applications/10.1117/12.2177124.full. doi:10.1117/12.
     2177124.
 [7] Y. Cho, S. Rhee, Quality Estimation of Resistance Spot Welding by Using Pattern Recognition With
     Neural Networks, IEEE Transactions on Instrumentation and Measurement 53 (2004) 330–334.
     URL: http://ieeexplore.ieee.org/document/1284862/. doi:10.1109/TIM.2003.822713.
 [8] Y. Li, Z. Lin, L. Li, G. Chen, Numerical Analysis of Transport Phenomena in Resistance Spot Welding
     Process, volume 1, 2009. doi:10.1115/FEDSM2009-78210, journal Abbreviation: Proceedings of
     the ASME Fluids Engineering Division Summer Conference 2009, FEDSM2009 Publication Title:
     Proceedings of the ASME Fluids Engineering Division Summer Conference 2009, FEDSM2009.
 [9] K. Zhou, P. Yao, Overview of recent advances of process analysis and quality control in re-
     sistance spot welding, Mechanical Systems and Signal Processing 124 (2019) 170–198. URL:
     https://www.sciencedirect.com/science/article/pii/S0888327019300573. doi:10.1016/j.ymssp.
     2019.01.041.
[10] R. L. O’Brien, Welding handbook. Volume 2, Welding processes, 8th ed ed., American Welding
     Society, Miami, Fla., 1991. OCLC: 664579333.
[11] B. Zhou, T. Pychynski, M. Reischl, R. Mikut, Comparison of Machine Learning Approaches for
     Time-series-based Quality Monitoring of Resistance Spot Welding (RSW) (2018). doi:10.5445/
     KSP/1000087327/13.
[12] B. Xing, Y. Xiao, Q. H. Qin, H. Cui, Quality assessment of resistance spot welding process based
     on dynamic resistance signal and random forest based, The International Journal of Advanced
     Manufacturing Technology 94 (2018) 327–339. URL: https://doi.org/10.1007/s00170-017-0889-6.
     doi:10.1007/s00170-017-0889-6.
[13] H. Zhang, Y. Hou, T. Yang, Q. Zhang, J. Zhao, Welding quality evaluation of resistance spot welding
     using the time-varying inductive reactance signal, Measurement Science and Technology 29 (2018)
     055601. URL: https://dx.doi.org/10.1088/1361-6501/aaa830. doi:10.1088/1361-6501/aaa830,
     publisher: IOP Publishing.
[14] H. Sun, J. Yang, L. Wang, Resistance spot welding quality identification with particle swarm
     optimization and a kernel extreme learning machine model, The International Journal of Advanced
     Manufacturing Technology 91 (2017) 1879–1887. URL: https://doi.org/10.1007/s00170-016-9944-y.
     doi:10.1007/s00170-016-9944-y.
[15] Z. Tan, B. Zhou, Z. Zheng, O. Savkovic, Z. Huang, I.-G. Gonzalez, A. Soylu, E. Kharlamov, Literal-
     Aware Knowledge Graph Embedding for Welding Quality Monitoring: A Bosch Case, in: T. R.
     Payne, V. Presutti, G. Qi, M. Poveda-Villalón, G. Stoilos, L. Hollink, Z. Kaoudi, G. Cheng, J. Li
     (Eds.), The Semantic Web – ISWC 2023, Springer Nature Switzerland, Cham, 2023, pp. 453–471.
     doi:10.1007/978-3-031-47243-5_25.
[16] D. W. Dickinson, Characterization of Spot Welding Behavior by Dynamic Electrical Parameter
     Monitoring (????).
[17] S. Jain, M. Burger, G. Rätsch, R. Kuznetsova, Knowledge Graph Representations to enhance Inten-
     sive Care Time-Series Predictions, 2023. URL: http://arxiv.org/abs/2311.07180, arXiv:2311.07180
     [cs].
[18] M. Trębacz, Z. Shams, M. Jamnik, P. Scherer, N. Simidjievski, H. A. Terre, P. Liò, Using ontology
     embeddings for structural inductive bias in gene expression data analysis, 2020. URL: http://arxiv.
     org/abs/2011.10998. doi:10.48550/arXiv.2011.10998, arXiv:2011.10998 [cs, q-bio].
[19] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of Deep Bidirectional Transform-
     ers for Language Understanding, in: J. Burstein, C. Doran, T. Solorio (Eds.), Proceedings of the
     2019 Conference of the North American Chapter of the Association for Computational Linguistics:
     Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational
     Linguistics, Minneapolis, Minnesota, 2019, pp. 4171–4186. URL: https://aclanthology.org/N19-1423.
     doi:10.18653/v1/N19-1423.
[20] A. Grover, J. Leskovec, node2vec: Scalable Feature Learning for Networks, in: Proceedings of
     the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,
     KDD ’16, Association for Computing Machinery, New York, NY, USA, 2016, pp. 855–864. URL:
     https://doi.org/10.1145/2939672.2939754. doi:10.1145/2939672.2939754.