=Paper= {{Paper |id=Vol-2475/short5 |storemode=property |title=Methodology for Development of Industrial Analytical Systems for Data Collection and Processing |pdfUrl=https://ceur-ws.org/Vol-2475/short5.pdf |volume=Vol-2475 |authors=Anton Misnik,Sergey Krutolevich,Siarhei Prakapenka,Eugene Lukjanov }} ==Methodology for Development of Industrial Analytical Systems for Data Collection and Processing== https://ceur-ws.org/Vol-2475/short5.pdf
       Methodology for the Development of Industrial
   Analytical Systems for Data Collection and Processing

                    A Misnik1, S Krutalevich1, S Prakapenka1, E Lukjanov2

      1
       Inter-state educational institution of higher education “Belarusian-Russian university”,
                                         Mogilev, Belarus
                     2
                       ZAO "GIAPDistCentr", Moscow, Russian Federation



          Abstract. This article is devoted to methods of the development of industrial
          analytical data collection and processing systems. It is shown how to obtain a
          synergistic effect by combining well-known approaches and ensuring
          interaction between them using analytical and neural network modules.




1 Introduction

    Modern enterprises, as an organizational and technical system, are becoming
increasingly complex both in structure and in internal connections between structural
components. The prerequisites for these processes are: expansion to foreign markets,
high competition, tendency to reduce costs, ensuring the safety of production, the
need for a dynamic response to market needs. Also, there are growing demands on
product quality, environmental protection, as well as occupational safety and health.
    Due to the development of technologies in the field of instrumentation, the amount
of data available for analysis on the state of technical devices has grown significantly,
besides, it is obvious, that automation of processes reduces the number of errors
caused by the human factor. These aspects should be controlled using appropriate
analytical data processing information systems, that ensure achievement of goals in
conditions of significant uncertainties, especially in the context of long time horizons.
    Currently, the growth rate of requirements for information systems and the need
for their modification during operation is very high. Very often there is a need to
modify the existing data structure, the way of data displaying, new data processing
scenarios appear. Involving developers to solve such problems is usually costly both
financially and in terms of time.
    In our opinion, it is optimal to develop a system tools that allows a specialist in the
subject area with the skills of an advanced Excel user (Architect) to create and modify
data structure, develop applications for displaying and interacting with data, and
implement data processing scenarios.




___________________________
Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License
Attribution 4.0 International (CC BY 4.0).
In: P. Sosnin, V. Maklaev, E. Sosnina (eds.): Proceedings of the IS-2019 Conference, Ulyanovsk, Russia,
24-27 September 2019, published at http://ceur-ws.org
224


2 Traditional approach to the development of analytical systems
for data collection and processing

    In the framework of the traditional approach to organizing industrial analytical
systems for data collection and processing, each part of such a system is, in fact, a
separate software module, with its own static data structure and internal logic that is
rigidly defined at the stage of system design and implementation.
    This approach has the following disadvantages:
     duplication of data in the system;
     the complexity, and, often, the inability to organize relationships between data
        in different modules;
     the need to involve developers to create new system applications, make
        changes to the logic and displaying data in the existing applications.


3 Methodology of the development of analytical systems for data
collection and processing

    It is proposed to use an object-oriented approach as the basis for organizing data
structure - the subsystem “Type Tree”.
    Type, in full accordance with the classical approach, is a description of the objects
through their common attributes. An attribute of a type is a named property or
characteristic of an object of that type. Each attribute of a type is characterized by a
name unique within this type and the datatype that this attribute will store. All
instances (objects) of the same type have the same set of attributes. Type instances
differ in attribute values. Attributes by the values of which each object can be
identified from other objects of this type are called key attributes. If there are several
key attributes, a composite key is generated. It is proposed to use the following main
types of attributes: string, number, counter, date, date and time, text, file. For
attributes of the form string, number, date, date and time, the file should be available
the ability to set the property "array". Also, an attribute may be a link to another type
of system’s data structure [1].
    Types should be in a hierarchical relationship and can be arbitrarily connected
both with each other and with themselves if required by the data structure or business
process. [3]
    Such an approach allows, without the involvement of developers and database
engineers, to form hierarchical data structures of arbitrary nesting by Architect with
the necessary connections between hierarchy levels.
    In practice, business processes rarely perfectly reflect the data structure. To create
a way of data displaying for the end-user of the system, a subsystem is required that
can interconnect data located at different levels of the hierarchy, and in different
branches of the "Type Tree". [3]
    As such a subsystem, "Applications" is proposed.
    Within the framework of this subsystem, the capabilities to configure the
displaying of data both in the form of modal forms and in the form of tables for
                                                                                    225


viewing and editing information should be realized. The range of settings should
allow the formation of a wide variety of business processes.
    User interfaces should have a hierarchical structure that provides consistent output
of related data.
    This approach allows to abandon the pre-configuration of all possible options for
using the business process.
    In the industrial analytical systems for data collection and processing it is not
enough just to input and output information. A key feature of data processing systems
oriented to use by engineering and technical personnel is the need to provide the user
with analytical data based on the information available in the system.
    To process data and generate analytical information, it is proposed to use the
subsystem "Analytics", which should have an intrasystem meta-programming
language that allows, first of all, to seamlessly operate data in the system in the best
possible user-friendly form, mainly using system dialogs - "masters".
    In order, to reduce the user entry threshold for interacting with this subsystem and
increase the level of security, it is not recommended to use traditional programming
languages, but to develop unique internal meta-programming language. This meta-
language should implement the principles of linear programming, including basic
algorithmic constructs, have access to the data structure of the subsystem "Type Tree"
in a user-friendly form.
    Practice shows that the main need of users when working with industrial analytical
systems for data collection and processing is to obtain the necessary data, simply
process it and record the results. Any algorithmically complex operations for the user
can be implemented by programmers within the framework of functions connected to
the “Analytics” subsystem. [2]
    The application of this approach will allow Architect to form business logic for
data processing within the system without involving developers.
    The big problem of analytical systems for data collection and processing is the
quality of the data, both system inputs and results of analysis. For input data, the
problem is exacerbated if they are fully or partially entered by operators. As for the
analytical data obtained by calculations, in addition to the risks associated with the
source data, the risks of imperfection of the calculation method affect their accuracy.
    Existing approaches, such as, for example, double-entry of information or constant
monitoring of its entry, are quite expensive both financially and in terms of time.
Such solutions are not always reasonable and effective.
    One of the possible solutions of the quality problem of source and analytical data
in industrial systems is to use the supervisor neural network module, which able in
real time to check the data changing in the system. The implementation of such a
“Supervisor” in each case should be individual, therefore it is most convenient to
implement it in the form of a certain designer tool, within which the Architect can
choose the network architecture and topology, as well as input and output parameters.
If there is a possibility of incorrect data entry, the supervisor informs the operator
about the need to verify the entered data, if the probability of error is recognized in
the analytical data, a message is sent to the expert or system Architect. For additional
training, the neural network "Supervisor" monitors the response of users to messages
sent to them about probable errors. Also, training can take place under the supervision
of an Architect. [4]
226


   Using the neural network "Supervisor" allows to improve the quality of the input
data and strengthen control over the results of analytical calculations. [5,6]
   To summarize, we can present the following simplified diagram of the interaction
of subsystems within the industrial analytical systems for data collection and
processing (Figure 1).




                           Figure 1. Subsystems interaction

   In addition to the described above, the scheme contains the subsystems
“Authentication and authorization”, “Data import” and “Reports”, the purpose of
which is obvious.


4 Example of methodology implementation

    ZAO “GIAP-DISTcenter” was established in 1995, the main activity is to ensure
industrial safety of hazardous production facilities.
    When carrying out large projects, such as the creation of control systems for
technical devices, including as part of unique work to increase the intervals between
overhauls of oil and gas and chemical complexes, the forces of relevant leading
institutes and expert organizations of the country are consolidated to fulfill the
assigned tasks.
                                                                                       227


    To ensure the reliability and safety of industrial plants, it is necessary to take into
account the accumulated scientific knowledge and some existing approaches,
supported by good engineering practice, which are included in international standards
and guidelines.
    The developed methodological documents, such as ICTE 1-002-14, ICTE 3-002-
14, ICTE 3-003-14, as well as software created by own IT department, allow to
optimize the process of analyzing the actual condition of the equipment, timely
identify critically dangerous objects and provide industrial safety.
    Based on many years of experience and analysis of existing systems, it was
decided to develop own industrial analytical systems for data collection and
processing, which allows to solve specific problems in the field of ensuring industrial
safety of the enterprise, as well as in the field of information support of various
business processes in the enterprise. The modularity and flexibility of the system
allows to quickly adapt it for implementation at a particular enterprise.
    GIAP-DIST CENTER, an industrial information system for data collection and
processing, is a client-server application that can be accessed both from stationary
computers and mobile devices, both from a local computer network and via the
Internet, which significantly increases the flexibility of working with it and also
increases the efficiency of access to the necessary data.
    This system includes the following main subsystems:
     a subsystem for managing users and user groups, responsible for providing
       users with access to the system and setting their privileges;
     a subsystem for constructing a type tree responsible for the formation of a data
       structure;
     a subsystem for constructing a tree of objects that allows managing the data
       entered into the system;
     a subsystem for configuring and displaying user applications, which is
       responsible for setting up and ensuring the functioning of the business
       processes of the system;
     a subsystem of units of measurement, providing the conversion of data from
       one dimension to another;
     a data conversion subsystem that allows bringing complex-structured data, for
       example, received from diagnostic devices, into the desired form;
     reporting subsystem responsible for generating documents in the system;
     a subsystem for constructing two-dimensional and three-dimensional schemes,
       providing visualization of system objects;
     a data import subsystem that allows loading data into the system from various
       sources;
     subsystem of analytical calculations;
     neural network supervisor.
    A general view of the user management subsystem is shown in Figure 2. In
addition to standard user operations, this subsystem allows configuring user rights for
any system objects, export and import users and their settings, and view user
interaction logs with the system. [7]
228




                         Figure 2. User management subsystem

   The general view of the subsystem “Type tree” is shown in Figure 3.
   This subsystem allows building a universal data storage system, without the
involvement of developers.




                             Figure 3. Type Tree subsystem




                             Figure 4. Type Tree subsystem

    To view all created objects in the system, the subsystem “Object Tree” has been
developed. This subsystem allows to track the status of all objects in the system, view
the interconnections of the object, and edit it if necessary.
    Objects are presented in a tree structure, which makes it easy to analyze the
relationship of the object. A general view of the subsystem “Object tree” is shown in
Figure 4.
                                                                                  229


   A general view of the Applications configuration subsystem is shown in Figure 5,
examples of applications are shown in Figures 6 and 7.
   The tabular display of the application has the functionality of sorting, filtering,
coloring data, and also allows for multiple operations with data.




                            Figure 5. Application subsystem




                            Figure 6. Application table form
230

                            Figure 7. Application combined form

   Figure 8 represents a general view of the editor of the subsystem "Analytics".
Within the framework of this subsystem, an internal linear programming language is
implemented, as well as the following sets of functions:
    mathematical functions;
    logical functions;
    functions of interactions with dates (getting today's date, getting the difference
      between dates, setting the date, getting the maximum/minimum date);
    statistical functions;
    functions for working with strings;
    conversion functions (converting a string to a number, converting Arabic
      numbers to Roman, and vice versa)).




                               Figure 8. Analytics subsystem




5 Conclusion

   In this paper, we describe the issues of developing industrial analytical systems for
data collection and processing, the internal structure of such systems. Also, an
example of the implementation of the methodology is considered.

References:
    1. Bertrand Meyer. Object-oriented software construction, volume 2. Prentice hall New
York, 1988.
    2. Kim Wald´en and Jean Marc Nerson. Seamless object-oriented software
architecture.Prentice-Hall, 1995.
    3. Misnik Anton, Krutalevich Siarhei, Intelligent decision support in an industrial data
collection and processing system. XVI All-Russian Scientific Conference "Neurocomputers
and their application." Abstracts. 2018 p. 204-205
    4. Prakapenka Siarhei, Misnik Anton, Decision support system for analysis risks of failure
of technical systems. XVI All-Russian Scientific Conference "Neurocomputers and their
application." Abstracts. 2018 p. 205-206
                                                                                 231

    5. Gorbunov-Possadov M.M. Extensible programs. Moscow, Poliptih, 1999, 336 p. (in
Russian).
    6. Andrew S. Tanenbaum, Maarten van Steen. "Distributed Systems. Principles and
paradigms". Prentice Hall, Inc., 2002
    7. Nancy G Leveson, Mats Per Erik Heimdahl, Holly Hildreth, and Jon Damon Reese.
Requirements specification for process-control systems. Software Engineering, IEEE
Transactions on, 20(9):684–707, 1994.
    8. Gustavo Alonso, Fabio Casati, Harumi Kuno, Vijay Machiraju. Web Services.
Concepts,Architectures and Applications. Springer-Verlag, 2004