<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Methodology for the Construction of Predictive Analysis Systems as Exemplified by the Mining Equipment in the Conditions of Big Data and Simulation Methods</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Automation Computer Science and Technologies</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Systems and Networks Kryvyi Rih National University 11</institution>
          ,
          <addr-line>Vitaliya Matusevycha str., Kryvyi Rih</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>Currently, almost all elements of economic activity somehow exist according to the laws of macroeconomics. This is facilitated by the rapid development of international relations, the acceleration of logistics operations, political, religious, cultural integration and unification at the level of interstate relations and interactions (as a result of the evolution of state-formed market systems). At the same time, every year more and more significant influence on macroeconomics is played by minor global factors that were previously practically not taken into account, such as climate change, growth of the world population, etc. Thus, today, every company seeking to be efficient and profitable needs to focus not only on the laws of the domestic market, but also on global trends when building it's strategies and implementing tactical tasks [1]. All the above prerequisites gave rise to a new concept of production development, called Industry 4.0. Previous scientific and technological revolution led to the automation of individual processes and devices, while Industry 4.0 provides for the end-to-end digitization of all physical assets and their integration into the digital ecosystem along with the assets of partners involved in the value chain. The creation of the concept Industry 4.0 in the framework of solving the problems of managing modern technological processes and production had several basic prerequisites. One of them is related with the fact that the complication of the socalled material part of production, of course, also leads to the complication of the organizational component. It is becoming more difficult for a modern manager to make the right management decisions. In the progression, the variability of the applied goals, conditions, restrictions, and with them the scale of possible consequences, increases. Another important reason is the fact that in modern management conditions it is necessary not only to obtain statistics and analytics of production, but also to be able to predict using the obtained data. High-performance methods should be applied to isolate the most important and relevant information at the time of the decision, with the possibility of a predictive analysis of possible options for events.</p>
      </abstract>
      <kwd-group>
        <kwd>Industry 4</kwd>
        <kwd>0</kwd>
        <kwd>Big Data</kwd>
        <kwd>Predictive analysis</kwd>
        <kwd>Simulation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>U
u1
u2
u3
.
.
.
un
y1
y2</p>
      <p>y3
z1
z2
z3
. уr
.</p>
      <p>zk
x1
x2
x3
.
.
.
xm</p>
      <p>
        Object Research
In general, the mathematical model of each stage of the hierarchy can be viewed as a
complex object of the mineral dressing technology and presented as a function of
variables. Here, three types of actions serve as the input of the object (Fig.1) [
        <xref ref-type="bibr" rid="ref3">2</xref>
        ]:
1. Uncontrolled (but monitored) input variables Y = {у1,...уr} constitute a disturbance
vector and, as a rule, characterize, as far as concentrating production is concerned,
quality indicators of the source material to be processed and those of its intermediate
products obtained during the concentrating process;
2. Controlled input variables U = {u1,...,un} constitute a control vector and
characterize, as a rule, quantitative indicators (expenditure) of material and energy flows;
3. The uncontrolled factors Z = {z1,...,zk} constitute an interference vector. Basically,
this is a disturbance vector, about which the developer of the control system knows
very little or nothing at all. Most often that vector is not taken into account at all.
The idea of the method is as follows. Instead of describing the process with the help of
an analytical apparatus (differential or algebraic equations), a random rally is performed
using a specially organized procedure that includes randomness and gives a random
result.
      </p>
      <p>The essence of the Monte Carlo method is as follows: it is required to find the value
A of a certain quantity under study. To do this, choose a random variable X, the
mathematical expectation of which is A, that is, M (X) = A. The probability of events in this
case is estimated using the frequency of outcomes in numerous sessions of simulation.</p>
      <p>Practically do the following: produce N tests, in which they receive N possible
values of X, calculate their arithmetic average and take it as an estimate (approximate
value) of the desired number A.
In terms of value, this criterion (1) can be expressed in terms of the amount of losses
that a combine may incur as a result of a possible failure of an individual equipment
segment(s).</p>
      <p>
        S у  (1 K1 ) A  KGК (1 K2 )QК T  min
(1)
where K1, K2 are the coefficients characterizing the share of costs due to a decrease
in the quality of products and their quantity, respectively (due to failures or failures)
KGК - the coefficient of readiness of operation of the plant; A is the unit cost (UAH /
t); QК - planned capacity of the plant (tons / hour); T - time of the plant for the year
[
        <xref ref-type="bibr" rid="ref4">3</xref>
        ].
      </p>
      <p>It should be noted that the exact calculation of the coefficients K1, K2 is quite
difficult, due to the multi-factorial nature of the latter. Therefore, we investigated the
possibilities for the approximate calculation of these quantities, by predicting the
probability of equipment failures, as well as their possible consequences using simulation and
statistical methods.</p>
      <p>For this, as applied to a specific scheme, it is a set of a finite number N of servicing
devices. Those any technical device is, in turn, a device for maintenance in the system.
The structural model of such a network is presented (see Fig. 12). Here, the designations
P1, P2, P3, ..., PN correspond to the numbers of instruments for maintenance.</p>
      <p>P1
P2</p>
      <p>P2
P3</p>
      <p>Pn</p>
      <p>It should be noted that absolutely all technical components may not be included in this
scheme. It is enough to enter only the most important (key) devices, the failure of which
can directly affect the work of the main divisions of the plant.</p>
      <p>As the main indicators characterizing the reliability of a device, we will use the time
between failures t ( t - expectation of time between adjacent failures of the system
being restored) and the average time the device can recover after a failure tВ . These
characteristics are usually determined by statistical tests and must be indicated by the
manufacturer of the equipment in the passport of the specific equipment.</p>
      <p>Thus, for a given structural scheme of the designed IC, we have two vectors in the
form:
(2)
where t ,t ,t ,...,t
1 2 3</p>
      <p>N</p>
      <p>- the values of MTBF for the corresponding devices in the
diagram of Fig.1; tв1, tв2 , tв3 ,..., tвN - average recovery time for each of the devices.</p>
      <p>Next, a simulation model is built in which the failures of the service devices are
reproduced (simulated) and their possible consequences are predicted. The block
diagram of the algorithm that implements such a model is shown in Fig. 3.
Imitation using tabular processors is a separate area with its own characteristics. Its
proponents argue that using these systems improves the understanding of the processes
taking place much better than using specialized software that has a high cost and takes
time to study, and also hides the mechanisms used (although such environments are
quite widely used because they provide more features and allows simulating complex
systems).</p>
      <p>One of the advantages of modeling by means of spreadsheets is that the tabular
model allows you to visually reflect (see Fig. 4) the behavior of the system under study
with the help of charts and graphs.</p>
      <p>Using spreadsheets, you can perform simulation modeling of discrete and
continuous non-deterministic dynamic systems. The basis for constructing a dynamic model of
a system is a description of the system using a system of differential equations or a
system of recurrent equations with given initial conditions.</p>
      <p>When constructing system-dynamic models using spreadsheets, the time delay is
modeled using links to cell ranges corresponding to previous points in model time.</p>
      <p>
        The technology of simulation using spreadsheets includes [
        <xref ref-type="bibr" rid="ref5 ref6">4,5</xref>
        ]:
 setting discrete points in time;
 description of the state of the system in a certain range of the table;
 setting the rules for the transition of the system to the state corresponding to the next
point in model time.
      </p>
      <p>To build non-deterministic models using MS Excel, the functions
RANDOMBETWEEN, RANDOMBETWEEN are used.</p>
      <p>When creating a dynamic model using a spreadsheet, the modeling horizon is
determined by building the required number of ranges (replicating ranges) corresponding to
discrete points in time.</p>
      <p>If the capabilities of the tabular processor do not allow to cover the entire modeling
horizon, then the behavior of the system on a shorter modeling horizon should be
investigated to put forward a hypothesis about its behavior throughout the modeling
horizon.</p>
      <p>A multiple start of a simulation session in a tabular processor is performed by
recalculating formulas in all cells of the table, which is started by pressing the F9 key.</p>
      <p>Months of operation (warranty)
Simulated time (months)
Price</p>
      <p>FROM</p>
      <p>Detail 1
Detail 2
Detail 3
Detail 4
Detail 5
Detail 6
Detail 7
Detail 8
Detail 9
Detail 10
Detail 11
Detail 12</p>
      <p>Device 2
6
900</p>
    </sec>
    <sec id="sec-2">
      <title>Device 1</title>
    </sec>
    <sec id="sec-3">
      <title>Device 2</title>
    </sec>
    <sec id="sec-4">
      <title>Device 3</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>Detail Detail Detail Detail Detail Detail Detail Detail Detail Detail Detail Detail 1 2</source>
          <volume>3 4 5 6 7 8 9 10 11 12</volume>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          1.
          <string-name>
            <surname>Magazine</surname>
          </string-name>
          <article-title>"neftegaz". “Industry 4.0: Key Tasks of Modern Enterprises”</article-title>
          . [Electronic resource] - Available: https://neftegaz.ru/science/view/1414-Industriya-
          <volume>4</volume>
          .0
          <article-title>-klyuchevyezadachi-sovremennyh-predpriyatiy-i-resheniya-kompanii-Yokogawa</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          2.
          <string-name>
            <surname>A.M. Mariuta</surname>
            ,
            <given-names>Yu. G.</given-names>
          </string-name>
          <string-name>
            <surname>Kachan</surname>
            ,
            <given-names>V. A.</given-names>
          </string-name>
          <string-name>
            <surname>Bunko</surname>
          </string-name>
          <article-title>: “Automatic Control of Technological Processes of Concentrating Plants”</article-title>
          . Moskow «Nedra». p
          <year>1983</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          3.
          <string-name>
            <surname>Molchanov</surname>
          </string-name>
          <article-title>A.A. Modeling and design of complex systems</article-title>
          .- K .: Vyshcha school,
          <year>1988</year>
          .-
          <fpage>359p</fpage>
          .
          <fpage>5</fpage>
          .
          <string-name>
            <surname>Denisov</surname>
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolesnikov</surname>
            <given-names>D.N.</given-names>
          </string-name>
          <article-title>Theory of large control systems</article-title>
          . - L .:
          <string-name>
            <surname>Energoizdat</surname>
          </string-name>
          . Leningr. separation,
          <year>1982</year>
          .-
          <fpage>287p</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          4. “
          <article-title>Simulation of dynamic systems”</article-title>
          . [Electronic resource] - Available: edu.tltsu.ru/sites/sites_content/site216/html/media67140/lec7_ts.docx
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          5.
          <string-name>
            <given-names>A.</given-names>
            <surname>Kupin</surname>
          </string-name>
          , I. Muzyka, “
          <article-title>Information Technologies of Processing Big Industrial Data and Decision-Making Methods”</article-title>
          ,
          <source>IEEE “Problems of infocommunications science and technology</source>
          <year>2018</year>
          ”, p.
          <fpage>4</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>