<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Towards a Knowledge Graph-based Data Mesh for Smart Manufacturing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Irlán Grangel-González</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marc Rickart</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oliver Rudolph</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rui Dias</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bosch Car Multimedia Portugal</institution>
          ,
          <addr-line>S.A., Braga</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Corporate Research</institution>
          ,
          <addr-line>Robert Bosch GmbH, Renningen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Robert Bosch GmbH</institution>
          ,
          <addr-line>Automotive Electronics, Reutlingen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Workshop Proceedings e.g., Enterprise Resource Planning (ERP), Manufacturing Execution Systems (MES), and Master Data (MD). These silos comprise no explicit semantics. They also contain diferences in the way real-world concepts are modeled, i.e., Semantic Interoperability Conflicts (SIC) [ 1], thus hindering data re-usability. To tackle these problems Knowledge Graph (KG)-based applications have emerged. For instance, the Line Information System (LIS) [2] for manufacturing enables semantic harmonization, i.e., the resolution of SICs of data on production lines. However, despite this and other previous eforts at Bosch using KGs [ 3, 4, 5, 6, 7] for handling semantic harmonization many more data is being generated and consumed (cf. Figure 1). In addition, there are still no mechanisms to fulfill the FAIR principles [ 8] in manufacturing scenarios at Bosch. Of key relevance here is to have the FAIR principles in action, i.e., the data consumers should be capable of finding, accessing, and reusing data whenever required. Moreover, these data should be interoperable which remains as a huge challenge. To accelerate the data exchange and to meet the expectations of data consumers, it is required to move from an application mindset to a data centric one, where KG-based data products present concrete solutions to the manufacturing domain. Despite having just one data product in place, i.e. LIS, many more data from other domains than manufacturing are required by consumers.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Motivation</title>
      <p>Manufacturing business competition is driven by eficiency in order to ofer the best price for
products. Of paramount importance to achieve this eficiency is to get the right information at
the right time. Manufacturing is a very complex process. To manage such a complex process
a lot of data are required. These data are diversely spread out in diferent IT systems or silos,</p>
    </sec>
    <sec id="sec-3">
      <title>2. KG-based Data Mesh for Manufacturing</title>
      <p>To tackle the data reusability problems in manufacturing, we propose a KG-based data mesh [9,
10, 11]. Our approach has the KG-based products at its core resolving SICs and exposing
CEUR
Workshop
Proceedings
semantically clean data to consumers (cf. Figure 2). Furthermore, being a source-oriented
solution where users can discover who are the data owners, source origin, sample data sets or
quality metrics. As being understandable means leveraging semantics to explain the syntax
of the datasets, the format on how the data is presented is also important, e.g., serialization,
queries to execute, proper ontologies. Then domain ownership is defined based on the data
products. Furthermore, support functionalities need to be implemented, e.g., policies to be
defined and governed, training, and consultancy of the organization, and provisioning the
data as a self-service. The platform for data as a self-service provides all domains with their
data products as host for all other consumers to integrate them in their applications. This as
a starting point, the decentralized approach accelerates the deployment of the data mesh and
thus faster data sharing and reusability via KGs and described ontologies.
2.1. Line Information System LIS as a data product
serves for instance other domains like product engineering or technology development for reuse
in their respective data products driven by KGs. Common concepts in manufacturing as used
in LIS are defined by the set of ontologies of the Core Information Model for Manufacturing
(CIMM) [12]. The case of the Internal Defect Costs (IDC) as a KG-based solution has to deal
with cost avoidance in a process failure for electronic products in the Surface-Mount Technology
(SMT) area at Bosch. The typical approach for the IDC project would have been to reinvent the
wheel by trying to semantically integrate manufacturing data that is already covered in the LIS
data product. With our approach, several months of man-hours are saved due to the fact that the
IDC project was able to reuse data out of the LIS KG. Having LIS as a data product enables IDC
to reuse related relevant plant manufacturing data, e.g., materials, processes, machines, lines and
even aggregated data by plant. This gives experts across all domains a deeper insight in defect
costs along with reducing the time of decision making with more precise and accurate data for
defining the product cost. Like this getting the edge on manufacturing business competition
improves eficiency and best product price can be ofered. Without LIS, there would be a danger
of reverting to siloed data with continuous requests for data in file formats with costly human
interaction, increasing data loss and time taken in decision making. Therefore, main driver is
the focus on semantic integration of the data that are still not part of the LIS data product for
further improvement work. For manufacturing domain this means higher performance with
less cost and additionally data available which can be reused by other domains and applications.
2.2. Insights and feedback of the organization
In established market enterprises the competition is tough and use of data will provide an
advantage. The organization in these enterprises is usually more hardware centric than data
oriented and thus data receives a diferent prioritization as if it was the only source of income.
Roles and responsibilities are equally diferent in that they are focused on the hardware product
development and assembly, while information technology remains a support function only.
In that setup, a strong lead on the tech stack and its application is missing. This creates the
opportunity for the individual domains to establish their own tech stacks, thus resulting in
a plethora of data storage technologies. As metadata has to be applicable to any and every
source of at least the structured data, a decoupling of the metadata layer from the storage
layer is advisable. Furthermore, implementing the FAIR principles nevertheless allows the
organization to adapt faster to use of the data and metadata ofered. At Bosch the data product
LIS is ofered to many other domains for reuse by APIs with defined data contracts. The data
product is described by KG-based technologies semantically and provides links between fields
not necessarily on same data source system. In order to avoid SICs link prediction methods
are embedded and by active use of the metadata system the instances themselves ofered. Any
data quality concern of mismatch in semantics will be spotted by all of the data consumers and
can be fed back, while the other metadata system can simply be ignored and data consumers
may start having their own description tables put in place. That is exactly what we observe in
our organization. As for future work, we envision to enable further KG-based data products,
e.g., for engineering, logistics, and sales to be able to cover a wider range for manufacturing
applications.
[9] V. K. Butte, S. Butte, Enterprise Data Strategy: A Decentralized Data Mesh Approach, in:</p>
      <p>Int. Conf. on Data Analytics for Business and Industry (ICDABI), 2022, pp. 62–66.
[10] L. V. Jochen Christ, S. Harrer, Data Mesh Architecture, https://www.datamesh-architecture.</p>
      <p>com, 2022. Accessed: 10-03-2023.
[11] I. A. Machado, C. Costa, M. Y. Santos, Data Mesh: Concepts and Principles of a Paradigm
Shift in Data Architectures, in: M. M. Cruz-Cunha, R. Martinho, R. Rijo, D. Domingos,
E. Peres (Eds.), CENTERIS 2021 - Int. Conf. on ENTERprise Information Systems
Information Systems and Technologies, Braga, Portugal, volume 196 of Procedia Computer Science,
Elsevier, 2021, pp. 263–271.
[12] I. Grangel-González, F. Lösch, A. ul Mehdi, Knowledge Graphs for eficient integration
and access of manufacturing data, in: 25th IEEE Int. Conf. on Emerging Technologies and
Factory Automation, ETFA, Vienna, Austria, September 8-11, IEEE, 2020, pp. 93–100.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>I.</given-names>
            <surname>Grangel-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vidal</surname>
          </string-name>
          ,
          <article-title>Analyzing a Knowledge Graph of Industry 4.0 Standards</article-title>
          , in: J.
          <string-name>
            <surname>Leskovec</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Grobelnik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Najork</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Tang</surname>
          </string-name>
          , L. Zia (Eds.),
          <source>Companion of The Web Conference</source>
          , Virtual Event / Ljubljana, Slovenia, April
          <volume>19</volume>
          -23, ACM / IW3C2,
          <year>2021</year>
          , pp.
          <fpage>16</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>I.</given-names>
            <surname>Grangel-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rickart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Rudolph</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <article-title>LIS: A knowledge graph-based line information system</article-title>
          , in: C.
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Jiménez-Ruiz</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Faria</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dragoni</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Dimou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Troncy</surname>
          </string-name>
          , S. Hertling (Eds.),
          <source>The Semantic Web - 20th Int. Conf., ESWC</source>
          <year>2023</year>
          , Hersonissos, Crete, Greece, May 28 - June 1, Proceedings, volume
          <volume>13870</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2023</year>
          , pp.
          <fpage>591</fpage>
          -
          <lpage>608</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>I.</given-names>
            <surname>Grangel-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <article-title>Link Prediction with Supervised Learning on an Industry 4.0 related Knowledge Graph</article-title>
          ,
          <source>in: 26th IEEE Int. Conf. on Emerging Technologies and Factory Automation</source>
          ,
          <string-name>
            <surname>ETFA</surname>
          </string-name>
          , Vasteras, Sweden, September 7-10, IEEE,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E. G.</given-names>
            <surname>Kalayci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Grangel-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Lösch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao</surname>
          </string-name>
          , A. ul
          <string-name>
            <surname>Mehdi</surname>
            , E. Kharlamov,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <article-title>Semantic integration of Bosch manufacturing data using virtual knowledge graphs</article-title>
          , in: J.
          <string-name>
            <surname>Z. P.</surname>
          </string-name>
          et al. (Ed.),
          <source>19th Int. Semantic Web Conf</source>
          ., Athens, Greece, November 2-
          <issue>6</issue>
          , Proceedings,
          <string-name>
            <surname>Part</surname>
            <given-names>II</given-names>
          </string-name>
          , volume
          <volume>12507</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>464</fpage>
          -
          <lpage>481</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Mami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Grangel-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Graux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Elezi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Lösch</surname>
          </string-name>
          ,
          <article-title>Semantic data integration for the SMT manufacturing process using SANSA stack, in: A. H</article-title>
          . et al. (Ed.), The Semantic Web:
          <article-title>ESWC 2020 Satellite Events</article-title>
          , Heraklion, Crete, Greece, May 31 - June 4, volume
          <volume>12124</volume>
          <source>of LNCS</source>
          , Springer,
          <year>2020</year>
          , pp.
          <fpage>307</fpage>
          -
          <lpage>311</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mehdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kharlamov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Stepanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Loesch</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Grangel-Gonzalez, Towards Semantic Integration of Bosch Manufacturing Data</article-title>
          ,
          <source>in: Proc. of ISWC</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>303</fpage>
          -
          <lpage>304</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , G. Cheng, E. Jiménez-Ruiz,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Stepanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Gad-Elrab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Soylu</surname>
          </string-name>
          ,
          <string-name>
            <surname>E. Kharlamov,</surname>
          </string-name>
          <article-title>The data value quest: A holistic semantic approach at Bosch</article-title>
          , in: P. G. et al. (Ed.),
          <source>The Semantic Web: ESWC Satellite Events - Hersonissos</source>
          , Crete, Greece, May 29 - June 2, Proceedings, volume
          <volume>13384</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2022</year>
          , pp.
          <fpage>287</fpage>
          -
          <lpage>290</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Gleim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pennekamp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Liebenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Buchsbaum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Niemietz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Knape</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Epple</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Storms</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Trauth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Bergs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Brecher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          , G. Lakemeyer,
          <string-name>
            <given-names>K.</given-names>
            <surname>Wehrle</surname>
          </string-name>
          , Factdag:
          <article-title>Formalizing data interoperability in an internet of production</article-title>
          ,
          <source>IEEE Internet of Things Journal</source>
          <volume>7</volume>
          (
          <year>2020</year>
          )
          <fpage>3243</fpage>
          -
          <lpage>3253</lpage>
          . doi:
          <volume>10</volume>
          .1109/JIOT.
          <year>2020</year>
          .
          <volume>2966402</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>