Directive Driven System of Systems Approach to Visualise Data Chasms Gash Bhullar1 and Rachel Davies2 1 Control 2K Limited, Waterton technology Centre, Bridgend, CF313WT, UK 2 Digital Manufacturing Innovation Hub Wales, Waterton Technology Centre, Bridgend, CF31 3WT, UK Abstract With the continuous growth of automated data systems to control major tasks, it is clear that a new approach is needed to manage and decipher information across a range of interconnected, proprietary systems. Increasingly, process and production companies are progressing toward low carbon Net Zero and decarbonisation models to be deployed across the entire enterprise. This necessitates a system of systems approach, and this paper discusses how a “Directive Driven” model incorporating a vendor neutral approach can create interoperable systems that can visualise entire ecosystems beyond the standard Digital Twin concepts and bridge the chasms between operating domains of supply chains that provide products and services from raw materials to consumer goods. Keywords 1 Industry 4.0, system of systems, interoperability, circular system development, directive- driven, vendor-neutrality, digital twin 1. Introduction Building upon the concept of ‘Big Data,’ where the information extracted and shared between platforms and systems is often ‘too large or too complex’ to be handled by traditional data processing methods, it stands to reason that a new approach to visualising information will be required to meet the ever-increasing demands of business and society, both in terms of interoperability and interconnectivity between data services. The Variety and Velocity of information - two of the ‘five V’s’ of Big Data [1] is increasing with each evolution of technology at each stage of development; from the processing of raw materials, to the development of embedded devices for use in low energy systems, and the end users of those systems who will demand a minimum impact on the environment. The data created at each stage of a product’s lifecycle is typically vendor-specific and inaccessible to other systems, creating vast ‘Knowledge Chasms’ between proprietary systems, as described in Figure 1. Proceedings of the Workshop of I-ESA’22, March 23–24, 2022, Valencia, Spain EMAIL: gbhullar@control2k.co.uk (G. Bhullar); rae@dmiw.co.uk (R. Davies) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Wor Pr ks hop oceedi ngs ht I tp: // ceur - SSN1613- ws .or 0073 g CEUR Workshop Proceedings (CEUR-WS.org) Figure 1: Complexity of data exchange throughout the supply chain Fully understanding the impact of a product requires system data held within these knowledge chasms to be extracted and presented in a meaningful way so that the user can make sense of the data provided and take appropriate action. 1.1. Key challenges New modeling tools are needed to support the visualization of the data, in particular the exchange of information in the chasms between domains. The challenge is to adopt a ‘neutral format’ for all stakeholders and create a vendor neutral platform to allow the flow of data across multiple platforms, using multiple protocols, using a System of Systems (SoS) approach [2]. The paradigm of system data has changed rapidly over the last 20 years, from manual distributed systems (local) to virtual centralisation (Cloud) and with it the need for advanced security and authentication methods. System developers now need to ensure data integrity across a very wide range of data sources as the risks of data breaches increase by 50% each year as volumes of data continue to increase [3]. Traditional ‘Relational Database Management Systems’ (RDBMS) are unable to guarantee data security and integrity across multiple domains [4], even when enhanced authorisation architecture such as ‘Single Sign On’ (SSO) is deployed [5]. Advanced Industrial Digital Technology (IDTs) such as Digital Twins were originally developed to help support the visualisation of system data by simulating a physical and functional representation of that system, to support design tasks or validate system properties [6]. However, in most cases these Digital Twins are based on a single data system or closed vendor ecosystem and are unable to interpret data from external data sources. Providing access to multiple systems or linking data from a variety of sources poses a serious challenge for industry and there is a clear requirement to pull common data from these systems and extract the data held between systems in knowledge chasms, to develop an approach to deliver directives for key issues, such as decarburization, reduction in the use of rare minerals and materials and the circular economy as a whole, rather than continually trying to develop new systems to determine the overall impact of disparate components. 2. Bridging knowledge chasms Whilst the majority of platform development begins with the definition of specifications for the target application domain, platforms within a technology genre are usually developed in parallel with other vendors rather than jointly and most only represent a partial solution to a problem rather than an end-to-end solution. It is up to the end user to connect the domains and glean meaningful insights from the information presented. Research into this area has highlighted a number of trial industrial platforms and projects, aimed at linking together sources of data using either an Application Programming Interface (API) between proprietary services; or have focused on the use of Open Development Platforms. In most cases the Volume and Variability of the data has proven too complex for visualisation of the end-to-end lifecycle in one place [7], platforms are hampered by software engineering challenges such as the availability or security of data; and most fail to address the issue of why the data is needed in the first place. The concept of ‘blueprinting’ or defining a directive or platform purpose has been explored with huge successes [8] where platforms are designed for a specific purpose, for example a product marketplace or a federated platform, such as a Data Spine model [9]. However, there is little evidence at the present time to demonstrate how this concept has been transferred to delivering on key industrial directives such as the circular economy, decarburization, or NetZero production across multiple vendor systems. 2.1. Managing multi-purpose platforms Based upon our understanding of information architecture, the key to unlocking the data between these domains is to create and deploy software ‘Agents’ [10] that are “capable of autonomous, reactive and proactive operation in response to changes in their local environment”. Agents, operating at the network edge, enable leveraging cloud resources into the proximity of the user devices. to interact with the core data in these systems and ‘pull’ information in a vendor-neutral format that can be accessed via most standard communication protocols, such as MQTT. These Agents are essentially a code-driven protocol link between the edge devices of a system and the Cloud services that support that ecosystem, using the primary directives of the platform to interrogate the knowledge held in each system. Using this model, the market or consumer demand provides the directive for a platform, much like a user journey template which forms the basis of Agile Software Development, i.e. “As a (WHO), I want to (WHAT), so that (WHY)” such as is described in Figure 2. Figure 2: Creating Directives to fulfil Data Transfer The agents are deployed downstream to stipulate the required data to meet the specific Directives and the corresponding knowledge is gained and pulled upstream to fulfill the Directives and presented via a visualisation system such as a Digital Twin or dashboard. Linking, passing through or sharing data minimises the traffic, optimises and rationalises the data (Smart Data); these event-driven Agents can provide a level of control over the flow of this data. For example: In a data model that seeks to understand the NetZero impact of a water treatment facility, the user needs to understand data from a range of different systems. A common data value would be current consumption for a given motorised pump. An agent needs to span across all Industrial Control System (ICS) ecosystems to pull the current consumption parameters from multiple vendors that will be in a variety of different formats. So, the agent is given the directive: “Please provide energy consumption of all pumps operating in your water treatment area stipulating a KWh value” The Directive needs to be specific enough to minimise the amount of data returned so that the ‘search’ algorithm can be specifically channeled to the most appropriate system. The sheer volume of data that could end up being interrogated could potentially run into Petabytes and this warrants the use of AI Technologies incorporating XAI (explainable AI) in order to have a chance of understanding the required data, its format and its specific properties. In addition, audit trails need to be maintained to ensure track and trace functionality to prove the validity of the data. XAI is more suited to handling unpredictable events and provide diagnostic data to trace the potential breakpoints within the knowledge retrieving process. 2.2. Modeling the Interface of data Economically, the advantages of crossing the chasms between knowledge ecosystems are held within a shared value proposition, or mutual complementarity where shared value can offer competitive advantage [9]. We need to understand the key issues and possible future solutions in order to fully exploit the potential value of digital twins, whether that be towards improved product development, more efficient production systems. As an example of the types of interfaces to data systems, Figure 3 shows an Agent-based management of support systems for distributed brainstorming [11] where Usability-aware Service Orchestration System (USOS) and a Flexible Support System (FSS) for distributed brainstorming can have multiple connections to edge or cloud services. Agent-Based Computing (ABC) is suitable for implementing robust scalable systems and interoperable and virtualisable ‘things’. ABC is suitable for supporting the design and implementation of autonomous IoT systems [12]. Figure 3: Agent-based and resource-oriented FUSSDB architecture 2.3. Visualisation systems Graphical representation of data has always been the easiest way to present data to the users / decision makers as a majority of people respond to graphical representation rather than steams of digits or lists but the challenge is finding the right representation. Modern methods to present data now include holographic imaging [13] and the more recent developments to present data via the Metaverse [14] which incorporates an substantive range of characteristics as shown in Figure 4. Figure 4: Metaverse – Seven Layers Whilst these visualisation systems are clearly going to dominate the marketplace in the coming years, most people still prefer a more conventional 3D image of the world without additional layers of complexity. This is the normal cultural shift delay whilst systems become normalised and accepted way to visualise the real world in the same way as digital twins became accepted a way to represent the real world. Figure 5: Pulling data into the digital twin and drilling down The methods to visualise the knowledge coming from our Directive driven approach needs to concisely represent the original Directive and a funnel approach pulls complex and raw data from multiple ecosystems and initially presents it at a dashboard level to condition the data before finally presenting to the Directive owner in a manor that suits the knowledge being presented. Figure 5 shows the way that data from a typical manufacturing environment can be conditioned via the agents to pull smart data that in this case be presented through the Industreweb Vactory™ environment [16] to visualise data from multiple shopfloor systems and presenting in a way that engineers and managers can understand. 2.4. Vendor neutrality and neutral formats Vendor neutrality is the result of creating an open dialogue within an ecosystem allowing multiple systems to intercommunicate using agreed protocols. This compatibility and interchangeability / interoperability works at a systems level. Vendor-neutral specifications encourage the development of competing yet compatible implementations, freeing the purchaser to choose from a multitude of vendors without suffering a loss of functionality” (Agnostic solutions) [15]. The adoption of cloud technologies and digital platforms has been restricted in the SME sector due to potential ‘lock-in’ issues with vendors, resulting in limited interoperability and fixed formats for cloud data. Neutral formats (agreed formats to exchange data and protocols) have been a subject of many EU projects such as STASIS [17] and conceptually appear to be a good way to semantically map information from one system to another but with the volume of data now present globally, the storage of such a repository would cause the same problems as relational databases being stored globally. 3. Summary A Directive Driven based Approach to pull data from complex systems including data from multiple digital twin environments serves multiple purposes to streamline operations, reduce waste, improve efficiency but most importantly provided the opportunity to maximise returns on investments through evidence provided by Visualise KPIs which are essentially what are behind the Directives. Data Chasms are reduced by all systems working towards the same directives and thereby the types of data and information is normalised across several sectors when considering common goals for key drivers such as carbon reduction and recycling of goods and materials. The approach outlined in this paper also acts as an Enabler of Circular Economy by ensuring a closed loop approach to the data requested from the ecosystem and the alignment of that data to the key Directives required by stakeholders and decision makers. By being very specific with the Directives, it is the most effective way of interrogating multiple data systems. 4. References [1] A. Jain, The 5 V’s of big data, 2016. URL: https://www.ibm.com/blogs/watson-health/the-5-vs- of-big-data/ [2] J. Boardman, B. Sauser, System of Systems - the meaning of of, in: 2006 IEEE/SMC International Conference on System of Systems Engineering, Los Angeles, 2006, 6. doi: 10.1109/SYSOSE.2006.1652284. [3] S. Wheatley, T. Maillart, D. Sornette, The extreme risk of personal data breaches and the erosion of privacy, The European Physical Journal B: Condensed Matter and Complex Systems 89 (2016) 1-12. doi: 10.1140/epjb/e2015-60754-4. [4] Codeacademy, What is a Relational Database Management System, 2022. URL: https://www.codecademy.com/article/what-is-rdbms-sql. [5] J. De Clercq, Single Sign-On Architectures, in: G. Davida, Y. Frankel, O. Rees (Eds.), Infrastructure Security, InfraSec 2002. Lecture Notes in Computer Science, vol 2437. Springer, Berlin, 2002. doi: 10.1007/3-540-45831-X_4. [6] S. Boschert, R. Rosen, Digital Twin - The Simulation Aspect, in: P. Hehenberger, D. Bradley (Eds.), Mechatronic Futures, Springer, Cham, 2016, pp. 59-74. doi: 10.1007/978-3-319-32156- 1_5 [7] A. B. M. Moniruzzaman, S. A. Hossain, NoSQL Database: New Era of Databases for Big data Analytics - Classification, Characteristics and Comparison, International Journal of Database Theory and Application 6 (2013) 1-13. [8] S. Garcia-Gomez, M. Jimenez-Ganan, Y. Taher, C. Momm, F. Junker, J. Biro, A. Menychtas, V. Andrikopoulos, S. Strauch, Challenges for the Comprehensive Management of Cloud Services in a PaaS Framework, Scalable Computing: Practice and Experience 13 (2012) 201-213. [9] R. A. Deshmukh, D. Jayakody, A. Schneider, V. Damjanovic-Behrendt, Data Spine: A Federated Interoperability Enabler for Heterogeneous IoT Platform Ecosystems, Sensors 21 (2021) 4010. doi: 10.3390/s21124010. [10] T. Leppänen, C. Savaglio, L. Lovén, W. Russo, G. Di Fatta, J. Riekki, G. Fortino, Developing Agent-Based Smart Objects for IoT Edge Computing: Mobile Crowdsensing Use Case, in: 11th International Conference on Internet and Distributed Computing Systems, Tokyo, 2018, pp. 235- 247. doi: 10.1007/978-3-030-02738-4_20. [11] Y. Kaeri, K. Sugawara, C. Moulin, T. Gidel, Agent-based management of support systems for distributed brainstorming, Advanced Engineering Informatics 44 (2020) 101050. doi: 10.1016/j.aei.2020.101050. [12] G. Fortino, R. Gravina, W. Russo, C. Savaglio, Modeling and Simulating Internet-of-Things Systems: A Hybrid Agent-Oriented Approach, Computing in Science & Engineering 19 (2017) 68-76. doi: 10.1109/MCSE.2017.3421541. [13] H. Jung, Y. Jung, M. Fulham, J. Kim, Mixed reality hologram slicer (mxdR-HS): a marker-less tangible user interface for interactive holographic volume visualization, arXiv – Human- Computer Interaction 2022. doi: arxiv-2201.10704 [14] T. Huynh-The, Q.-V. Pham, X.-Q. Pham, T. T. Nguyen, Z. Han, D.-S. Kim, Artificial Intelligence for the Metaverse: A Survey, 2022. URL: https://arxiv.org/pdf/2202.10336.pdf. [15] J. Opara-Martins, R. Sahandi, F. Tian, Critical review of vendor lock-in and its impact on adoption of cloud computing, in: Proceedings of the International Conference on Information Society, IEEE, New York, 2014, pp. 92-97, doi: 10.1109/i-Society.2014.7009018. [16] G. Bhullar, S. Osborne, M. J. Núñez Ariño, J. Del Agua Navarro, F. Gigante Valencia, Vision System Experimentation in Furniture Industrial Environment, Future Internet 13 (2021) 189. doi: 10.3390/fi13080189. [17] European Commission, Software for ambient semantic interoperable services, 2009. URL: https://cordis.europa.eu/project/id/034980.