=Paper=
{{Paper
|id=Vol-3830/paper1sim
|storemode=property
|title=Knowledge Representation and Engineering for Smart Diagnosis of Cyber Physical Systems
|pdfUrl=https://ceur-ws.org/Vol-3830/paper1sim.pdf
|volume=Vol-3830
|authors=Ameneh Naghdi Pour,Benno Kruit,Jieying Chen,Peter Kruizinga,Godfried Webers,Stefan Schlobach
|dblpUrl=https://dblp.org/rec/conf/semiim/NaghdipourKCKWS24
}}
==Knowledge Representation and Engineering for Smart Diagnosis of Cyber Physical Systems==
Knowledge Representation and Engineering for Smart
Diagnosis of Cyber Physical Systems
Ameneh Naghdi Pour1 , Benno Kruit1 , Jieying Chen1 , Godfried Webers2 , Peter Kruizinga3
and Stefan Schlobach1
1
Vrije Universiteit Amsterdam, De Boelelaan 1111, 1081 HV Amsterdam, NL
3
Canon Production Printing Netherlands, Van der Grintenstraat 1, 5914 HH Venlo, NL
2
Philips Medical Systems International, Veenpluis 6, 5684 PC Best, NL
Abstract
The traditional maintenance approach used by manufactures such as Canon and Philips, which relies on service
engineers’ expertise to diagnose the failure causes, poses significant challenges and costs. To overcome challenges
and minimize expenses, we aim to explore the construction and application of a fault knowledge graph that
integrate different maintenance data and knowledge sources. This involves developing an upper-level ontology
based on the requirements for the fault diagnosis of Cyber Physical Systems. This ontology draws inspiration
from the Industrial Domain Ontology (IDO) and Industrial Ontology Foundry-Maintenance Reference ontology
(IOF-MRO). By leveraging these two ontologies as foundation, we aim to construct a comprehensive framework
that captures and represents fault-related knowledge in a structured manner. Additionally, this paper envisions
the integration of different fault diagnosis methods and knowledge modeling techniques. Combining these
approaches is expected to enhance the accuracy and effectiveness of fault diagnosis, leading to more efficient and
reliable solutions.
Keywords
Fault diagnosis, cyber-physical systems, knowledge representation, knowledge graph
1. Introduction
Machine breakdowns result in a substantial financial burden for manufacturers and their customers,
primarily due to expenses related to training service engineers for fault diagnosis, their salaries, and
the provision of spare parts [1]. Additionally, the capacity of machines is adversely affected, as they
remain out of service during downtime, further increasing costs and impacting productivity. Therefore,
efficient fault diagnosis is of utmost importance for minimizing both downtime and costs. At Canon
and Philips manufactures, service engineers are trained using valuable documentation and occasionally
videos. However, significant challenges raised with this approach. These challenges include heavy
reliance on the engineers’ expertise, the complexity of navigating extensive documentation, and the
high costs associated with producing and providing instructional videos in terms of time and resources.
To address these challenges, the Zorro project has been initiated, focusing on achieving zero downtime
in Cyber-Physical Systems (CPS). Our goal in this project is to support service engineers in diagnosis
the failure cause. Figure 1 shows the proposed framework designed to achieve this goal. The input
comprises various sources that have been identified through interviews conducted with authorities in
the manufacturing field. In the first phase, a Knowledge Graph (KG) is constructed based on the input.
This involves manually developing an upper-level ontology, inspired by IDO and IOF-MRO. In the next
phase, service engineers observe a symptom that need to be converted into a query for the knowledge
SemIIM’24: Third International Workshop on Semantic Industrial Information Modelling, co-located with ISWC’24, 12th November
2024, Baltimore, USA
Envelope-Open a.naghdipour@vu.nl (A. Naghdi Pour); b.b.kruit@vu.nl (B. Kruit); j.y.chen@vu.nl (J. Chen); godfried.webers@philips.com
(G. Webers); peter.kruizinga@cpp.canon (P. Kruizinga); k.s.schlobach@vu.nl (S. Schlobach)
GLOBE https://https://github.com/Ameneh71 (A. Naghdi Pour); https://github.com/bennokr/ (B. Kruit);
https://github.com/JieyingChenChen (J. Chen)
Orcid 0000-0002-7357-7906 (A. Naghdi Pour); 0000-0002-3282-1597 (B. Kruit); 0000-0002-3282-1597 (J. Chen);
0000-0002-3282-1597 (P. Kruizinga); 0000-0002-3282-1597 (S. Schlobach)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
graph reasoning. The KG then provides a root cause analysis and the corresponding repair procedure
as the output.
In addition to the proposed framework, we have envisioned several methods for fault diagnosis
that can offer significant advantages. Firstly, the automatic generation of Bayesian Networks (BNs)
from KG. The KG excels in data integration, providing a unified view of system faults, while BNs
enable probabilistic reasoning for more accurate diagnosis. This integration allows for a more precise
understanding of fault scenarios and their likelihood, leading to improved diagnostic outcomes. Secondly,
integrating KG with fault tree provides a means to capture complex causal mechanisms. This integration
also facilitates the generation of BNs. Moreover, we envision that KG can be constructed with minimal
effort using Language Models (LLMs). These LLMs can facilitate the integration of different fault
diagnosis representations, such as BNs and Fault Trees, thereby enhancing the robustness of CPS by
minimizing downtime. By leveraging the power of this method, we can work towards achieving zero
downtime and ensuring the continuous operation of CPS in the future.
2. Related work
There are various sources of knowledge and data that can be utilized to support service engineer in
diagnosing failure. These sources include machine log data from sensors, documented knowledge
provided by authorities, logbook data written by service engineers, and also tacit knowledge in the brain
of experts. Researchers have extensively explored different categories of these sources and proposed
various methods for supporting maintenance tasks. These methods can be broadly categorized into
four groups: (1) Model-based approach, which is based on constructing an accurate physical model
that captures the dynamic changes within the system. It diagnoses faults by comparing the actual
measurement output of the system with the predicted output of the model, relying on consistency
monitoring. While this method offers high accuracy, it necessitates the establishment of precise and
quantitative physical models [2]. (2) Signal-based approach, which diagnoses faults by analyzing and
processing monitored signals. However, it can only be used to identify fault types in key components
that are equipped with sensors [3]. (3) Quantitative-knowledge-based method utilize historical data to
train a model and treat fault diagnosis as a pattern recognition problem. However, it does rely on a
significant amount of historical fault data to train the model effectively[4, 5]. (4) Qualitative knowledge-
based method which adopts a different approach by constructing a qualitative model that describes
prior knowledge related to faults. It then utilizes techniques such as searching, matching, and reasoning
to diagnose faults within the system. Unlike the model-based and quantitative-based approaches, this
method does not require the construction of accurate physical models or rely heavily on extensive
historical fault data. Instead, it needs to build a fault knowledge base [6]. Therefore, considering the
requirements of system-level analysis and the need for searching, matching, and reasoning capabilities,
the qualitative-knowledge-based fault diagnosis approach appears to be a suitable choice for this project.
A crucial aspect of qualitative methods is the requirement for a well-defined and reasonable model
that accurately represents knowledge related to faults. The most popular models in the literature
include Rule-Based [7], Causal Models [8], Fault Trees [9], and Petri Nets [10]. Rule-based models use
if-then rules to represent expert knowledge about system behavior and fault conditions, allowing for
automated diagnosis and decision-making under predefined conditions. Causal models diagnose faults
in CPS by mapping cause-and-effect relationships between system components and events. Fault trees
represent different events that can lead to a specific system failure, helping to identify potential causes
and their relationships. Nevertheless, these models have certain limitations. They typically require
prior analysis of potential equipment fault modes and involve manual editing, which makes them
inflexible and challenging to update dynamically. Consequently, the systematic and timely sharing of
maintenance engineers’ experience and expertise in fault diagnosis becomes difficult [6]. Therefore,
this paper proposes to use knowledge graph technology to mine fault knowledge from vast and diverse
fault documents and then construct a structured and interconnected fault knowledge base.
Input Phase 1: Phase 2: Output
Knowledge Graph Diagnosis
Documented
Logbook Data Construction
Knowledge
FMEA
Root Cause
Upper-level Ontology Symptom
Construction Observation
Bill of
Materials
Tacit Knowledge
Repair
Troubleshooting
Querying & Procedure
Manual Information
Reasoning
Extraction
Figure 1: Construction and application framework of domain fault knowledge graph
3. Methodology
Figure 1 shows our proposed framework for fault diagnosis of CPS. The following subsections provide
a detailed illustration of the framework.
3.1. Input Source
Following regular interviews with authorities at Canon and Philips, we have identified various mainte-
nance sources and categorized them into three groups including data, documented knowledge, and tacit
knowledge. Each of these sources offers unique insights and information about the system, but they also
have limitations. Data, which is logbook written by service engineers, contains real-life information
regarding specific problems and actions taken to resolve them. However, the information provided
in the logbook may sometimes be incomplete. For instance, engineers may only write “done” as the
action taken without providing detailed explanations. Documented knowledge encompasses three
resources including (1) Bill Of Material, which provides information about the physical structure of
the system, including the location of each part within assemblies or components, but lacks details on
problems that may happen for each part of the system. (2) Failure Mode Effect Analysis, offers insights
into challenging failure modes and their potential effects, but may not guarantee an accurate and
comprehensive solution for failures. (3) Troubleshooting Manuals, provide insights into the causes of
failures and offer remedies to resolve them. Lastly, tacit knowledge refers to undocumented knowledge
residing in the minds of experts. These different sources are complementary, and relying on just one
of them would not result in an efficient fault diagnosis. By combining them, we can leverage their
strengths and compensate for their limitations.
3.2. Knowledge Graph Construction
The first phase involves two steps to construct the knowledge graph: Upper-level ontology construction
and Information Extraction. The upper-level ontology has been manually created to serve as the
foundational structure for organizing the schema of the knowledge graph. This involved extensive
analysis of various input sources to identify the most valuable knowledge. Additionally, the concepts
and terminology have been carefully selected to ensure clarity and consistency throughout the ontology.
We also drew inspiration from the IDO and IOF-MRO in developing our fault diagnosis ontology. IDO is
a recent work item, approved by the ISO TC 184/SC4 Industrial Data committee in July 2023, and holds
significance in the realm of industrial standardization. IOF-MRO is to support semantic interoperability
through the use of modular ontologies in the maintenance domain. Due to limited space, more detailed
information on the IDO and IOF-MRO models can be found in [11] and [12] respectively. Figure 2
shows the design of the Zorro ontology, where there are classes (nodes) and relationships (edges)
that represent the domain of interest. There is a class named “Component ” that can represent a part,
depensOn subFunction
Function
subComponent hasFunction define hasCause
failVia resultIn
Component Problem Effect
solve address
involve
Solution Workaround
subClassOf
consistOf
Step Procedure
Figure 2: Construction and application framework of domain fault knowledge graph
assembly, or subsystem, each serving a specific “Function ”. When a part encounter malfunction,
it leads to the occurrence of a “Problem ”. The problem has cause(s), which can be considered as
problem, result in a certain “Effect ”. To address these problems, there are two types of “Procedure ”
that can be implemented. The first is a “Solution ”, which directly solves the problem at hand. The
second is a “Workaround ”, which aims to address the effects caused by the problem. Both procedures
consist of multiple “Step ”, with each step involving a specific part of the system. For a thorough
understanding, Table 1 provides a depiction of the classes, their definitions and the sources containing
relevant information about each class, along with an example of the classes in real field. We also
compared our defined classes with the IDO and IOF-MRO. On the other hand, Table 2 illustrates the
object properties, including their definitions and comparison. For the information extraction step,
different techniques such as Regular Expression, Named Entity Recognition, and LLMs, have been
utilized to extracted required knowledge and populate it based on the ontology.
3.3. Diagnosis
The next phase, diagnosis, shows the application of the proposed method in which service engineers
observe symptoms of the failure, which should be converted to a query for KG-based reasoning. As a
result, a diagnosis which find the root cause of the issue along with a procedure should be suggested to
solve the problem. To this end, we are planning to develop querying and reasoning systems for diagnosis,
with the aim of integrating or supporting different fault diagnosis reasoning techniques, such as BNs
and Fault Trees. Crucially, such systems will need to take into account the maintenance engineers’
expert knowledge in order to augment their ability. However, integrating their tacit knowledge into the
diagnosis process is an open research question. We urge the community to engage in this challenge,
leveraging semantic technologies to enhance fault diagnosis and ensure operational resilience.
4. Discussion
The related work section has extensively analyzed the advantages and disadvantages associated with
various fault diagnosis methods and knowledge modeling techniques. Additionally, the rationale for
constructing a knowledge graph as the underlying model to capture and represent prior knowledge has
been explained. However, we believe that the combination of fault diagnosis methods and knowledge
modeling techniques holds great potential for enhancing overall capabilities by leveraging the strengths
Table 1
Classes in the Zorro Ontology
Zorro Definition and Source Close Match Example
Class
Compo- A ’physical object’ of industrial IDO: InAnimatePhysicalObject DC servo mo-
nent equipment that could be a part, a IOF-MRO: MaintainableMateri- tor
component or assembly alItem
Source: BOM, Logbook, FMEA, TM
Function A capability that meets a design IDO: Function Feeding paper
requirement IOF-MRO: MaintainableMateri- to printer
Source: FMEA, TM, Logbook alItemRole
Problem An issue that happened for the IDO: - Overheating
system IOF-MRO: FailedState
Source: Logbook, FMEA, TM
Effect An observable result of the problem IDO: - Wear and tear
on the system IOF-MRO: FailureEffect
Source:FMEA
Procedure Sequence of steps that can be taken IDO: - Check and
to solve the problem IOF-MRO: PlannedProcess clean the
Source: Logbook, FMEA, TM cooling fan
Solution Procedure to directly solve the root IDO: - Check and
cause of the failure IOF-MRO: MaintenanceActivity clean the
Source: Logbook, FMEA, TM cooling fan
Workaround Procedure that addresses the effect IDO: - Replace
of the failure, not the root cause IOF-MRO: SupportingMainte- damaged
Source: Logbook, FMEA, TM nanceActivity components
closer to
motor
Step Single action that need to be in the IDO: - Replace
procedure IOF-MRO: MaintenanceWorkO-
Source: Logbook, FMEA, TM rderRecord
of each approach. We have identified three specific combinations that hold promise in enhancing fault
diagnosis:
1. Automatically generating a Bayesian Network from our knowledge graph: data integration from
various sources is a crucial task for efficient fault diagnosis. While BNs face challenges in data
integration [13], KG excel in this aspect, allowing for a unified and comprehensive view of the
system’s faults. On the other hand, BNs offer a robust framework for probabilistic reasoning,
enabling more accurate diagnosis in the presence of uncertainty. By leveraging the probabilistic
capabilities of BNs, we can enhance the fault diagnosis process, considering the likelihood of
different fault scenarios and their potential impact on the system. This combination harnesses
the strengths of both approaches, enabling a more comprehensive and accurate fault diagnosis
methodology.
2. Integrating our knowledge graph representation with fault tree: The ontology now contains
only simple causal links, but often there is a more complex causal mechanism. Sometimes, faults
only occur when multiple events occur simultaneously, or under specific conditions (such as
humidity). Other times, faults have separate, orthogonal causes. These structures are often
modeled in fault trees. We envision that these structures can also be modeled in a KG, at various
levels of granularity, and that Fault Trees can be generated from them. Additionally, if fault trees
exist that model certain mechanisms, they could be useful sources of information to be linked
and integrated into the KG. Then, their content can be used to generate Bayesian Networks as
described above.
3. Combining model-based with quantitative fault diagnosis methods: In section 2, we have
Table 2
Properties in the Zorro Ontology
Property Definition IDO IOF-MRO
hasFunc- Every part, component, and subsystem of the ma- hasFunc- hasRole
tion chine has a function tion
define If the system does not function properly it defines a create describe
problem
failsVia System fails and initiate a problem - -
hasCause A problem has a cause which the cause itself is a - -
problem
resultIn Problem results in an effect - precedes
address Workaround just addresses the effect of the problem, - -
not the root cause
solve the Solution directly solves the root cause of the prob- - -
lem
subPart Each part may has some subparts contain hasRole
involve Each step of the procedure involves a part of the - -
system
consistsOf Each procedure consist of some steps - Step ”isInputOf”
Procedure
described the drawbacks of these two methods. Both exist on either ends of knowledge-to-data
spectrum: on one end, the model-based diagnosis requires detailed (physical) knowledge of the
system, and on the other end, quantitative fault diagnosis requires comprehensive observation
data to learn from. However, in most cases only a little of either is available, and it is difficult
to integrate the physical models with the observed data. We envision that the extraction of
fault diagnosis knowledge from diverse sources will aid the construction of models that are
both physically informed and data-driven. To this end, the Knowledge Graph facilitates the
integration of various views of historically collected (sensor) data within unified physical, logical
and functional views of the CPS. We envision that this will significantly reduce the efforts needed
to construct accurate models, by re-using modular model components, simplifying the data
analysis pipeline, and allowing efficient validation on historical maintenance data.
5. Conclusion and Future work
In this study, various fault diagnosis approaches were reviewed, and a framework consisting of two
primary phases, Knowledge Graph Construction and Diagnosis, was developed. By constructing a
knowledge graph using diverse input sources, accurate diagnosis and efficient repair procedures can
be achieved. In future work, the focus will be on refining and automating the construction of the
knowledge graph using advanced techniques like LLMs. Additionally, the transformation of KG into
Bayesian Network will be explored to have a probabilistic reasoning. Furthermore, the construction of
model-based and quantitative-based fault diagnosis methods using the knowledge graph will also be
investigated.
Acknowledgments
This publication is part of the project ZORRO with project number KICH1.ST02.21.003 of the research
programme Key Enabling Technologies (KIC) which is (partly) financed by the Dutch Research Council
(NWO).
References
[1] D. S. Thomas, The costs and benefits of advanced maintenance in manufacturing, US Department
of Commerce, National Institute of Standards and Technology, 2018. doi:https://doi.org/10.
6028/NIST.AMS.100- 18 .
[2] S.-H. You, Y. M. Cho, J.-O. Hahn, Model-based fault detection and isolation in automotive yaw
moment control system, International Journal of Automotive Technology 18 (2017) 405–416.
doi:https://doi.org/10.1007/s12239- 017- 0041- 5 .
[3] K. Zhang, Y. Xu, Z. Liao, L. Song, P. Chen, A novel fast entrogram and its applications in
rolling bearing fault diagnosis, Mechanical Systems and Signal Processing 154 (2021) 107582.
doi:https://doi.org/10.1016/j.ymssp.2020.107582 .
[4] J. Dong, D. Su, Y. Gao, X. Wu, H. Jiang, T. Chen, Fine-grained transfer learning based on deep feature
decomposition for rotating equipment fault diagnosis, Measurement Science and Technology 34
(2023) 065902. doi:https://doi.org/10.1088/1361- 6501/acc04a .
[5] J. Long, Y. Chen, Z. Yang, Y. Huang, C. Li, A novel self-training semi-supervised deep learning
approach for machinery fault diagnosis, International Journal of Production Research 61 (2023)
8238–8251. doi:https://doi.org/10.1080/00207543.2022.2032860 .
[6] X. Tang, G. Chi, L. Cui, A. W. Ip, K. L. Yung, X. Xie, Exploring research on the construction and
application of knowledge graphs for aircraft fault diagnosis, Sensors 23 (2023) 5295. doi:https:
//doi.org/10.3390/s23115295 .
[7] Z. Wang, S. Li, W. He, R. Yang, Z. Feng, G. Sun, A new topology-switching strategy for fault
diagnosis of multi-agent systems based on belief rule base, Entropy 24 (2022) 1591. doi:https:
//doi.org/10.3390/e24111591 .
[8] X. Bu, H. Nie, Z. Zhang, Q. Zhang, An industrial fault diagnostic system based on a cubic dynamic
uncertain causality graph, Sensors 22 (2022) 4118. doi:https://doi.org/10.3390/s22114118 .
[9] K. Pan, H. Liu, X. Gou, R. Huang, D. Ye, H. Wang, A. Glowacz, J. Kong, Towards a systematic
description of fault tree analysis studies using informetric mapping, Sustainability 14 (2022) 11430.
doi:https://doi.org/10.3390/su141811430 .
[10] C. Xu, J. Li, X. Cheng, Comprehensive learning particle swarm optimized fuzzy petri net
for motor-bearing fault diagnosis, Machines 10 (2022) 1022. doi:ttps://doi.org/10.3390/
machines10111022 .
[11] ISO, ISO 15926-14:2020(E), ISO 15926 Part 14: Industrial top-level ontology, Technical Report, ISO,
Geneva, CH, 2020.
[12] https://github.com/iofoundry/ontology/tree/master/maintenance, Technical Report, ????
[13] B. Cai, L. Huang, M. Xie, Bayesian networks in fault diagnosis, IEEE Transactions on industrial
informatics 13 (2017) 2227–2240. doi:https://doi.org/10.1109/TII.2017.2695583 .