<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Intelligent Fault Diagnosis of Cyber Physical Systems using Knowledge Graphs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ameneh Naghdi Pour</string-name>
          <email>a.naghdipour@vu.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benno Kruit</string-name>
          <email>b.b.kruit@vu.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jieying Chen</string-name>
          <email>j.y.chen@vu.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Kruizinga</string-name>
          <email>peter.kruizinga@cpp.canon</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Godfried Webers</string-name>
          <email>godfried.webers@philips.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Schlobach</string-name>
          <email>k.s.schlobach@vu.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Canon Production Printing Netherlands</institution>
          ,
          <addr-line>Van der Grintenstraat 1, 5914 HH Venlo, NL</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Philips Medical Systems International</institution>
          ,
          <addr-line>Veenpluis 6, 5684 PC Best, NL</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Vrije Universiteit Amsterdam</institution>
          ,
          <addr-line>De Boelelaan 1111, 1081 HV Amsterdam, NL</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Machine maintenance poses significant challenges and costs in equipment manufacturing. A considerable portion of the budget is dedicated to provide training materials (documentations) for service engineers to diagnose failure causes, as well as to cover their salaries and provide spare parts. Additionally, breakdowns adversely impact machine capacity, preventing customers from utilizing their equipment during downtime. The main reason for all these challenges is the lack of eficiently utilizing training documentations. To address this and reduce costs while mitigating the negative implications of machine breakdowns, we propose a two-phase framework consisting of knowledge graph construction and diagnosis. In the first phase, an upper-level ontology based on the requirements for fault diagnosis of Cyber Physical Systems is developed, by drawing inspiration from the Industrial Domain Ontology (IDO) and the Industrial Ontology Foundry-Maintenance Reference Ontology (IOF-MRO). In the second phase, SPARQL queries are executed on the knowledge stored in GraphDB, providing valuable insights for diagnosing machine failures.</p>
      </abstract>
      <kwd-group>
        <kwd>Fault diagnosis</kwd>
        <kwd>knowledge graphs</kwd>
        <kwd>ontology engineering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Maintenance tasks in equipment manufacturing encounter several challenges. There is a considerable
ifnancial burden on both manufacturers and customers, stemming from expenses related to training
service engineers, their salaries, and spare parts. Additionally, downtime negatively impacts machine
capacity, preventing customers from utilizing their equipment during these intervals. Canon and Philips,
our industrial partners for this project, assist with maintenance tasks by training their service engineers
and providing them with documentation and occasional video resources. However, navigating extensive
documentation can be challenging, and the costs associated with producing training videos present
additional dificulties.</p>
      <p>To address these challenges and enhance support for service engineers in fault diagnosis of
CyberPhysical Systems (CPS), we are developing a framework based on qualitative knowledge-driven fault
diagnosis. It is important to highlight that our industrial partners are engaged in the production of
large-scale CPS, such as printers and MRI machines that comprise thousands of parts, sensors, actuators,
and computer interfaces, leading to a high level of complexity.</p>
      <p>Figure 1 shows our proposed framework in which there are two main phases along with the input
sources and output results. The input consists of three categories of data and knowledge identified
through interviews conducted with authorities in the manufacturing field. In the first phase, we will
LGOBE</p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <sec id="sec-1-1">
        <title>Documented</title>
      </sec>
      <sec id="sec-1-2">
        <title>Knowledge</title>
        <p>FMEA
Bill of
Materials
Troubleshooting
Manual</p>
      </sec>
      <sec id="sec-1-3">
        <title>Logbook Data</title>
      </sec>
      <sec id="sec-1-4">
        <title>Tacit Knowledge</title>
        <sec id="sec-1-4-1">
          <title>Phase 1:</title>
        </sec>
      </sec>
      <sec id="sec-1-5">
        <title>Knowledge Graph</title>
      </sec>
      <sec id="sec-1-6">
        <title>Construction</title>
        <p>Ontology
Engineering
Information
Extraction</p>
        <sec id="sec-1-6-1">
          <title>Phase 2:</title>
        </sec>
      </sec>
      <sec id="sec-1-7">
        <title>Diagnosis</title>
        <p>Symptom
Observation
Querying &amp;
Reasoning</p>
        <sec id="sec-1-7-1">
          <title>Output</title>
          <p>Root Cause</p>
          <p>
            Repair
Procedure
construct a Knowledge Graph (KG) based on the gathered input. This involves manually creating an
upper-level ontology, inspired by IDO [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ] and IOF-MRO [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ], which will serve as the foundation for the
KG. In the subsequent phase, service engineers will identify a symptom that needs to be translated
into a query for the KG reasoning. The KG will then support a root cause analysis and provide the
corresponding repair procedure, which will be presented to the service engineer as the output.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        Various knowledge sources support service engineers in diagnosing failures, including machine log
data, documented knowledge, logbooks from engineers, and expert tacit knowledge. Researchers have
explored these sources and proposed methods for maintenance tasks, grouped into four categories:
(1) Model-based approach: Constructs a physical model to compare actual system output with
predicted values, using consistency monitoring. This method is accurate but requires precise physical
models [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. (2) Signal-based approach: Analyzes monitored signals to diagnose faults but is limited to
key sensor-equipped components [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. (3) Quantitative knowledge-based method: Treats diagnosis as a
pattern recognition problem using historical data but requires substantial fault data [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. (4) Qualitative
knowledge-based method: Uses a qualitative model and techniques like searching, matching, and
reasoning without needing physical models or extensive data, relying instead on a fault knowledge
base [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Given the need for system-level analysis and reasoning, the qualitative knowledge-based method is
ideal for this project. A key aspect of this method is the need for a model to represent fault knowledge,
with commonly utilized models including Rule-Based [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Fault Trees [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], and Petri Nets [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. However,
these models have limitations, such as requiring prior fault analysis and manual editing, making updates
dificult [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This paper proposes using knowledge graph technology to mine fault knowledge and
create a structured fault knowledge base.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>3.1. Input Source</p>
      <p>Following regular interviews with our partners, we have identified various maintenance sources and
categorized them into three groups: Data, Documented Knowledge, and Tacit Knowledge. Each of these
sources ofers unique insights and information about the system, but they also have limitations.
• Data: This includes logbooks that provide insights into problems and the actions taken to resolve
them. However, this information can be incomplete, as service engineers might only record
”done” without detailing the action taken.
• Documented Knowledge: This category encompasses three resources including (1) Bill Of Material,
which provides information about the physical structure of the system, but lacks details on
problems that may happen for each part of the system. (2) Failure Mode Efect Analysis, provides
insights into challenging failure modes and their potential efects, but it focuses solely on complex
issues. (3) Troubleshooting Manuals, ofers information on the causes of failures and remedies
for them.
• Tacit Knowledge: This refers to undocumented knowledge that resides in the minds of experts,
making it challenging to acquire.</p>
      <p>These sources are complementary; by integrating them, we can leverage their strengths and compensate
their limitations.
3.2. Knowledge graph construction
The first phase involves knowledge graph construction, which encompasses two primary steps:
upperlevel ontology construction and information extraction. The following subsections provide a detailed
explanation of these two steps.
3.2.1. Upper-level Ontology Construction
We manually developed an upper-level ontology as the foundational structure for organizing the schema
of the knowledge graph. This process involved thorough analysis of various input sources to pinpoint
the most valuable knowledge. We also formulated competency questions to highlight key queries for
the knowledge graph and conducted interviews with industrial partners to align their expectations
with the ontology. Our fault diagnosis ontology was also compared with the IDO and IOF-MRO. The
design of the ontology is depicted in Figure 2 , which comprises classes and relationships representing
the domain of interest. There is a class named “Component” that can represent a part, assembly, or
subsystem, each serving a specific “ Function”. When a part encounter malfunction, it leads to the
occurrence of a “Problem”. The problem has cause(s), which can be considered as problem, result
in a certain “Effect”. To address these problems, there are two types of “Procedure” that can be
implemented. The first is a “ Solution”, which directly solves the problem at hand. The second is a
“Workaround”, which aims to address the efects caused by the problem. Both procedures consist of
multiple “Step”, with each step involving a specific part of the system. Due to space constraints, we are
unable to provide additional information on the definitions of these entities and their comparison with
the IDO and IOF-MRO ontologies.
3.2.2. Information Extraction
In this step, we employed various techniques including Regular Expressions, Named Entity Recognition,
and Large Language Models (LLMs), to extract the necessary entities from the identified sources. These
techniques allowed us to systematically identify relevant information, which we then organized into
structured tables. A key challenge in our current data sources was the lack of information regarding
functions, dependencies, and causes associated with the entities. To bridge this gap, we leveraged the
capabilities of LLMs by treating them as expert engineers. We provided the model with information
about the system’s structure and posed specific questions related to its functions, dependencies, and
causes. This approach enabled us to gather insights that were previously unavailable. Once we acquired
the necessary information, we transformed the structured tables into RDF (Resource Description
Framework) triples. This conversion facilitated the integration of our data into a graph database,
allowing for enhanced querying and analysis of the relationships between various entities within the
system.</p>
      <p>depensOn</p>
      <p>subFunction
subComponent
hasFunction
define</p>
      <p>hasCause
Component
involve</p>
      <sec id="sec-3-1">
        <title>Function</title>
        <p>failVia</p>
      </sec>
      <sec id="sec-3-2">
        <title>Step</title>
      </sec>
      <sec id="sec-3-3">
        <title>Problem</title>
        <p>solve</p>
      </sec>
      <sec id="sec-3-4">
        <title>Solution</title>
        <p>consistOf
resultIn</p>
      </sec>
      <sec id="sec-3-5">
        <title>Procedure</title>
      </sec>
      <sec id="sec-3-6">
        <title>Effect</title>
        <p>address</p>
        <p>
          Workaround
subClassOf
3.3. Diagnosis
The next phase, diagnosis, illustrates the application of our proposed method, wherein service engineers
observe symptoms of a failure that need to be transformed into queries for knowledge graph-based
reasoning. This process will yield a diagnosis identifying the root cause of the issue, along with a
suggested procedure for repair. To achieve this, the knowledge extracted in the previous phase must be
organized and stored in a specific structure. In this paper, we utilize graphDB [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], one of the leading
graph database software, which is designed to store data in a graph structure based on the RDF model.
Once the data is stored in graphDB, we execute SPARQL queries over the knowledge graph. SPARQL, a
powerful query language for RDF data, enables us to retrieve and manipulate the data efectively. In
this setup, RDFS (RDF Schema) serves as the reasoning layer, providing semantic inference capabilities,
while SPARQL acts as the querying interface for accessing the knowledge graph.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion and Future work</title>
      <p>In this study, we developed a framework for fault diagnosis based on Knowledge Graph. First, we
manually created an upper-level ontology to serve as the schema for the knowledge graph. We
then employed various techniques to extract knowledge and populate the graph according to this
ontology. Subsequently, we implemented a straightforward approach utilizing RDFS reasoning and
SPARQL queries to perform diagnosis. This allowed us to analyze the data within the knowledge graph
efectively. For the future work, we will focus on enhancing the reasoning capabilities of the framework
by integrating additional techniques, such as Bayesian Networks. This refinement aims to improve the
accuracy and depth of the diagnostic process.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This publication is part of the project ZORRO with project number KICH1.ST02.21.003 of the research
programme Key Enabling Technologies (KIC) which is (partly) financed by the Dutch Research Council
(NWO).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>ISO</surname>
          </string-name>
          , ISO
          <volume>15926</volume>
          -
          <fpage>14</fpage>
          :
          <year>2020</year>
          (E),
          <source>ISO 15926 Part</source>
          <volume>14</volume>
          :
          <article-title>Industrial top-level ontology</article-title>
          ,
          <source>Technical Report</source>
          , ISO, Geneva,
          <string-name>
            <surname>CH</surname>
          </string-name>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hodkiewicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Woods</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Selway</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stumptner</surname>
          </string-name>
          ,
          <article-title>Industrial ontology foundry (iof) maintenance reference ontology</article-title>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .26182/chzp-vs60.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.-H.</given-names>
            <surname>You</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. M.</given-names>
            <surname>Cho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-O.</given-names>
            <surname>Hahn</surname>
          </string-name>
          ,
          <article-title>Model-based fault detection and isolation in automotive yaw moment control system</article-title>
          ,
          <source>International Journal of Automotive Technology</source>
          <volume>18</volume>
          (
          <year>2017</year>
          )
          <fpage>405</fpage>
          -
          <lpage>416</lpage>
          . doi:https://doi.org/10.1007/s12239-017-0041-5.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>A novel fast entrogram and its applications in rolling bearing fault diagnosis</article-title>
          ,
          <source>Mechanical Systems and Signal Processing</source>
          <volume>154</volume>
          (
          <year>2021</year>
          )
          <article-title>107582</article-title>
          . doi:https://doi.org/10.1016/j.ymssp.
          <year>2020</year>
          .
          <volume>107582</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jiang</surname>
          </string-name>
          , T. Chen,
          <article-title>Fine-grained transfer learning based on deep feature decomposition for rotating equipment fault diagnosis</article-title>
          ,
          <source>Measurement Science and Technology</source>
          <volume>34</volume>
          (
          <year>2023</year>
          )
          <article-title>065902</article-title>
          . doi:https://doi.org/10.1088/
          <fpage>1361</fpage>
          -6501/acc04a.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>X.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Chi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Ip</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. L.</given-names>
            <surname>Yung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <article-title>Exploring research on the construction and application of knowledge graphs for aircraft fault diagnosis</article-title>
          ,
          <source>Sensors</source>
          <volume>23</volume>
          (
          <year>2023</year>
          )
          <article-title>5295</article-title>
          . doi: https: //doi.org/10.3390/s23115295.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <article-title>A new topology-switching strategy for fault diagnosis of multi-agent systems based on belief rule base</article-title>
          ,
          <source>Entropy</source>
          <volume>24</volume>
          (
          <year>2022</year>
          )
          <article-title>1591</article-title>
          . doi:https: //doi.org/10.3390/e24111591.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Glowacz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kong</surname>
          </string-name>
          ,
          <article-title>Towards a systematic description of fault tree analysis studies using informetric mapping</article-title>
          ,
          <source>Sustainability</source>
          <volume>14</volume>
          (
          <year>2022</year>
          )
          <article-title>11430</article-title>
          . doi:https://doi.org/10.3390/su141811430.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          , X. Cheng,
          <article-title>Comprehensive learning particle swarm optimized fuzzy petri net for motorbearing fault diagnosis</article-title>
          ,
          <source>Machines</source>
          <volume>10</volume>
          (
          <year>2022</year>
          ). doi:ttps://doi.org/10.3390/machines10111022.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Ontotext</surname>
          </string-name>
          ,
          <article-title>Graphdb - rdf triplestore semantic graph database</article-title>
          , https://www.ontotext.com/products/ graphdb/,
          <year>2023</year>
          . Accessed:
          <fpage>2024</fpage>
          -09-13.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>