Method of Detection of Audit Data Anomalies on the Basis of a Restricted Cauchy Machine

Tetiana Neskorodieva1, Eugene Fedorov1, Oleksii Smirnov2, Kostiantyn Rudakov3, and Anastasiia Neskorodieva1

1 Vasyl' Stus Donetsk National University, 21 600-Richcha str., 21021, Vinnytsia, Ukraine
2 Central Ukrainian National Technical University, 8 Universytetskyi ave., 25006, Kropyvnytskyi, Ukraine
3 Cherkasy State Technological University, 460 Shevchenko ave., 18006, Cherkasy, Ukraine

Abstract
The paper presents a method for detecting anomalies in waste-free production audit data based on the neural network model of a Gauss-Bernoulli bidirectional restricted Cauchy machine (BRCM). The purpose of the work is to increase the efficiency of analysis of waste-free production audit data on the basis of a neural network model of anomaly detection that does not require labeled data, which simplifies the audit. To achieve this goal, the following tasks were set and solved: a model of generalized multiple transformations of audit data was proposed in the form of a two-layer neural network. The proposed neural network model of the Gauss-Bernoulli bidirectional restricted Cauchy machine possesses a heteroassociative memory; works with real-valued data; has no restrictions on memory capacity; provides high accuracy of anomaly detection; and uses the Cauchy distribution, which increases the convergence speed of the method of parametric identification. To increase the speed of parametric identification of the Gauss-Bernoulli bidirectional restricted Cauchy machine, a parametric identification method was developed for implementation on a GPU using CUDA technology. The proposed method increases training speed by a factor approximately proportional to the product of the number of neurons in the hidden layer and the power of the training set.
The experiments confirmed the operability of the developed software and allow it to be recommended for practical use in a subsystem of automated analysis of an audit DSS for anomaly detection.

Keywords
Audit, mapping by neural network, Gauss-Bernoulli bidirectional restricted Cauchy machine, anomaly detection.

1. Introduction
Nowadays, a key scientific and technical issue of modern information technologies in the financial and economic sphere is the creation of a methodology for forming decision support systems (DSS) for enterprise audit under the conditions of IT application at enterprises. Modern automated audit DSS are based on the automated analysis of large volumes of data about the financial and economic activity and states of enterprises with a multilevel hierarchical structure of heterogeneous, multivariable, multifunctional connections, interrelations, and interactions of audit objects. The tasks of automated audit DSS are the expansion of functional capabilities and the increase of efficiency and universality of IT audit [1].

CPITS-II-2021: Cybersecurity Providing in Information and Telecommunication Systems, October 26, 2021, Kyiv, Ukraine
EMAIL: t.neskorodieva@donnu.edu.ua (T. Neskorodieva); y.fedorov@chdtu.edu.ua (E. Fedorov); dr.smirnovoa@gmail.com (O. Smirnov); k.rudakov@chdtu.edu.ua (K. Rudakov); t.neskorodieva@donnu.edu.ua (A. Neskorodieva)
ORCID: 0000-0003-2474-7697 (T. Neskorodieva); 0000-0003-3841-7373 (E. Fedorov); 0000-0001-9543-874X (O. Smirnov); 0000-0003-0000-6077 (K. Rudakov); 0000-0002-8591-085X (A. Neskorodieva)
© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org)

2.
Problem Statement
Let the training set for the anomaly detection model be $S = \{(\mathbf{x}_{in}^m, \mathbf{x}_{out}^m, \mathbf{d}_{in}^m, \mathbf{d}_{out}^m)\}$, $m = \overline{1,M}$, where $\mathbf{x}_{in}^m$ is the $m$-th raw materials vector, $\mathbf{x}_{out}^m$ is the $m$-th finished goods vector, $\mathbf{d}_{in}^m$ is the $m$-th expected reference vector of raw materials, and $\mathbf{d}_{out}^m$ is the $m$-th expected reference vector of finished goods. Then the problem of increasing the accuracy of anomaly detection with the Gauss-Bernoulli bidirectional restricted Cauchy machine (BRCM) model $g(\mathbf{x}_{in}, \mathbf{x}_{out}, \mathbf{w})$, where $\mathbf{x}_{in}$ is the raw materials vector, $\mathbf{x}_{out}$ is the finished goods vector, and $\mathbf{w}$ is the parameter vector, is represented as the problem of finding for this model a parameter vector $\mathbf{w}^*$ that satisfies the criterion
$$F = \frac{1}{M}\sum_{m=1}^{M}\left(g(\mathbf{x}_{in}^m, \mathbf{x}_{out}^m, \mathbf{w}) - (\mathbf{d}_{in}^m, \mathbf{d}_{out}^m)\right)^2 \to \min_{\mathbf{w}}.$$

3. Literature Review
Currently, the analytical procedures used during an audit are based on data mining techniques [2, 3]. Automated audit DSS means the automatic forming of recommended decisions based on the results of automated data analysis, which improves the quality of the audit process. Unlike the traditional approach, computer technologies for data analysis in the audit system accelerate the audit process and improve its accuracy, which is extremely critical given the large number of related tasks at the lower and middle levels and the number of indicators and observations in each task. The development of methods of estimation and prediction [4, 5] and the formation of generalized associative relationships [6] are described in the works of the authors of this article.
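As a sketch only, the criterion $F$ from the problem statement above can be evaluated in NumPy. The model `g`, the identity parameter matrix, and the toy data below are hypothetical placeholders for illustration, not the authors' implementation.

```python
import numpy as np

def criterion_F(g, w, X_in, X_out, D_in, D_out):
    """F = (1/M) * sum_m || g(x_in^m, x_out^m, w) - (d_in^m, d_out^m) ||^2."""
    M = X_in.shape[0]
    total = 0.0
    for m in range(M):
        y = g(X_in[m], X_out[m], w)               # model estimate
        d = np.concatenate([D_in[m], D_out[m]])   # reference pair (d_in, d_out)
        total += np.sum((y - d) ** 2)
    return total / M

# Toy placeholder model: g maps the concatenated input by a matrix w.
g = lambda x_in, x_out, w: w @ np.concatenate([x_in, x_out])

rng = np.random.default_rng(0)
X_in, X_out = rng.random((5, 3)), rng.random((5, 2))  # M = 5 samples
D_in, D_out = X_in.copy(), X_out.copy()               # references equal inputs here
w_identity = np.eye(5)                                # perfect parameters for this toy g
print(criterion_F(g, w_identity, X_in, X_out, D_in, D_out))  # → 0.0
```

A parametric identification method then searches for the $\mathbf{w}^*$ that minimizes this value.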
The goals of creating these methods are: reducing the computational complexity for simple tasks (a single mapping of elements or sub-elements of the audit subject area), automatic structural identification, increasing the accuracy for complex tasks (compositions of mappings of elements or sub-elements of the audit subject area), and the possibility of applying these methods for the generalized analysis of elements and sub-elements of the audit subject area (Table 1). The choice of model in the audit DSS depends on:
1. Characteristics of the audit data type (time series data, spatial data as mappings).
2. Audit level (upper, middle, lower).
3. Audit tasks (internal, external).
4. The type of analysis tasks (detection of anomalies, structural analysis, assessment of indicators).
5. The characteristics of the enterprise (large, medium, small) and the type of activity (industry) at the top level.
6. Characteristics of sets and subsets of operations at lower levels (numerological, quantitative, semantic, logical).
This choice is schematically formalized in the form of a binary decision tree for choosing a neural network data audit model (see Fig. 1). The proposed logical-neural network method makes it possible to automate the process of data analysis in the audit DSS and to optimize it depending on the characteristics of the audit process and the audit object.
One of the main tasks of data analysis of the audit subject area is the identification of anomalies. Let us consider the existing types of anomalies and the methods of their detection.
Types of anomalies [7–9]:
● Point (represented by points in feature space).
● Contextual (usually a point of a time series or of sparse data that depends on its environment).
● Collective (a section of a time series or of sparse data).
Methods of detection of anomalies [7–9]:
1. Approach on the basis of rules (logical approach):
I.
methods on the basis of associative rules, with and without classification (for example, the Apriori method);
II. methods on the basis of a decision tree, with classification (for example, the isolation forest method).

Table 1
Comparative analysis of intelligent analysis methods in audit tasks

Row 1.
Economic content of the mapping: payment - delivery of raw materials.
Model and its features: modified Liquid State Machine with a one-dimensional hidden layer; parameter identification based on matrix pseudoinversion [1].
Purpose of processing: evaluation and prediction of indicators of raw material supplies (by type) based on the values of payment indicators in a direct check of the mapping.
Advantages/disadvantages: reduces computational complexity, improves forecast accuracy.

Row 2.
Economic content of the mapping: settlements with suppliers - settlements with customers.
Model and its features: a neural network model based on a gated recurrent unit; for parametric identification of this model, adaptive cross entropy (a combination of random and directional search) is used.
Purpose of processing: evaluation of indicators of settlements with customers on the basis of the values of indicators of settlements with suppliers in a direct verification of the mapping.
Advantages/disadvantages: faster to learn but less accurate than in [1], because the pseudoinversion is not parallelized.

Row 3.
Economic content of the mapping: settlements with suppliers - settlements with customers (a composition of mappings between a set of input and output data).
Model and its features: forward-only counterpropagation neural network, a nonrecurrent static two-layer ANN [2]; it is assumed that the audit indicators are noisy with Gaussian noise; the number of pairs (neurons in the hidden layer, N1) is set manually.
Purpose of processing: construction of generalized associative relationships for generalized analysis tasks (in the forward direction).
Advantages/disadvantages: automates the formation of generalized features of audit sets and their mapping by means of a forward-only counterpropagation neural network.

Row 4.
Economic content of the mapping: release of raw materials - posting of finished products (a composition of mappings between a set of input and output data).
Model and its features: bidirectional counterpropagation neural network (BCPNN), a nonrecurrent static two-layer ANN; the number of pairs (neurons in the hidden layer) is set manually.
Purpose of processing: construction of generalized associative relationships for generalized analysis tasks (in the forward and backward directions).
Advantages/disadvantages: automates the formation of generalized features of audit sets and their mapping by means of a bidirectional counterpropagation neural network.

2. Approach on the basis of ANN:
I.
ANN without classification (for example, the one-class SVM (support vector machine); ANN with associative memory (for example, the autoencoder, SOFM (self-organizing feature map), Hopfield neural network, Boltzmann machine); ANN for time series forecasting (for example, NARNN (nonlinear autoregressive neural network), NARMANN (nonlinear autoregressive moving average neural network), SRN (simple recurrent network), BRNN (bidirectional recurrent neural network), LSTM (long short-term memory), BiLSTM, GRU (gated recurrent unit), BiGRU));
II. ANN with classification (for example, MLP (multilayer perceptron), RBFNN (radial basis function neural network)).
3. Approach on the basis of Bayesian networks, with classification.
4. Approach on the basis of clustering:
I. clustering on the basis of centroids (for example, the k-means method) or distributions (for example, the EM (expectation-maximization) method);
II. clustering on the basis of medoids (for example, the PAM (partitioning around medoids) method, subtractive clustering);
III. density clustering (for example, the DBSCAN (density-based spatial clustering of applications with noise) and OPTICS (ordering points to identify the clustering structure) methods).

Figure 1: Binary decision tree of neural network model selection for data analysis. (The tree branches on the data type (time series vs. mapping data), on whether the layers correspond to the production of semi-finished products, on whether production is waste-free, and on accuracy vs. speed requirements, leading to leaves such as Deep/Shallow CPNN, Shallow FOCPNN, Shallow BCPNN, Gauss-Bernoulli FORCM, and Gauss-Bernoulli BRCM.)

5. Approach on the basis of the neighborhood (metric approach) (for example, the k-nearest neighbors and LOF (local outlier factor) methods).
6.
Approaches on the basis of distributions:
a. Parametric approach on the basis of:
I. Gaussian distributions (for example, the MCD (minimum covariance determinant) method);
II. mixtures of distributions (for example, HMM (hidden Markov models), GMM (Gaussian mixture models)).
b. Nonparametric approach on the basis of:
I. histograms;
II. kernel functions (for example, the Parzen window method).
7. Approach on the basis of regression models (for example, the Box-Jenkins method).
8. Approach on the basis of spectral theory (matrix decomposition) (for example, the PCA (principal component analysis) method).
9. Approach on the basis of information theory (entropy).
Currently the most popular approach to anomaly detection is based on neural networks. A disadvantage of the one-class SVM is the restriction on the number of support vectors. A disadvantage of time series forecasting ANN is that they require the existence of a time series. A disadvantage of ANN with classification is the requirement to classify anomalies, which is not always possible owing to the labor intensity of obtaining labeled data for each type of anomaly. Therefore, in this work, we chose ANN with associative memory.
Traditional neural networks with associative memory are:
1. Neural networks with only a heteroassociative memory (for example, FOCPNN (forward-only counterpropagation neural network) [10], PCANN (principal component analysis neural network) [11], ICANN (independent component analysis neural network) [12], CMAC (cerebellar model articulation controller) [13]).
2. Neural networks with only an autoassociative memory (for example, the autoencoder [14], SBN (sigmoid belief network) [15], Helmholtz machine [16], SOFM [17], LVQNN (learning vector quantization neural network) [18], RCAM (recurrent correlation associative memory) [19], Hopfield neural network [20], Gauss machine [21], BSB (brain-state-in-a-box) [22], Hamming neural network [23], ART (adaptive resonance theory) [24]).
3.
ANN with both a heteroassociative and an autoassociative memory (for example, BCPNN (bidirectional counterpropagation neural network) [25], BAM (bidirectional associative memory) [26], Boltzmann machine [27]).
The majority of neural networks with associative memory have one or more of the following shortcomings:
1. They do not possess autoassociative and heteroassociative memory at the same time.
2. They do not work with real-valued data.
3. They do not have a high capacity of associative memory.
4. They do not have high accuracy.
5. They have high computational complexity.
In this regard, the creation of a neural network that eliminates the specified shortcomings is relevant. The purpose of this work is to increase the efficiency of audit data analysis of waste-free production on the basis of a neural network model of anomaly detection without the use of labeled data, which simplifies the audit [28–30]. To achieve this goal, it is necessary to solve the following problems:
● Propose a neural network model of anomaly detection.
● Select a criterion for evaluating the efficiency of the neural network model of anomaly detection.
● Propose a method of parametric identification of the neural network model of anomaly detection.
● Perform numerical research.

4. Block Diagram of the Neural Network Model of Detection of Anomalies
In this paper, the structure of the data transformation model is determined based on the production structure. It is assumed that raw materials are transformed into finished products in one step, without waste and without intermediate products. Each type of raw material is used in the production of one or more types of finished products. The production structure for each planning period (month, quarter, year) is determined on the basis of long-term contracts and short-term (in particular, urgent) orders. The production plan is decomposed into quantization periods of the planning period, taking into account the production capacity for different types of products.
In this case, the transformation of raw materials into finished products for the planning period can be represented in the form of a two-layer neural network. The number of neurons in the input layer is equal to the number of raw materials used in production. The number of neurons in the output layer is equal to the number of types of finished products. The input values are the amounts of raw materials by type; the output of the network is the finished product values for the planning period or the quantization period. When taking into account the supply of raw materials and the release of finished products for the quantization period, the amounts of recorded released raw materials and released finished products are made up of the actual values of indicators and the difference in residuals at the beginning of the quantization period. These balances are not calculated at the end (beginning) of each quantization period and are not reflected in accounting systems; balances are determined only at the beginning and end of the planning period.
The problem of detecting anomalies during the release (write-off) of raw materials for the production of finished products over the quantization periods of the verification period (in particular, the quantization periods can be chosen equal to the production cycle: a shift, a day, several days) is formulated as follows. Determine the quantization periods for which the structure of consumption of raw materials is significantly (the materiality level is set by the decision maker) higher or lower than the average values for the period, according to the data on the release of raw materials into production and the posting of products. This data transformation model is used to create an anomaly detection model that can be represented as a two-layer neural network. The number of neurons in the visible layer is equal to the sum of the number of raw materials used in production and the number of types of finished products.
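For illustration, the reconstruction-based detection scheme described above can be sketched as follows. The layer sizes, the random (untrained) weights, and the threshold value are assumptions made for the sketch; a real model would first obtain its parameters by training, and the hidden-unit probability uses the Cauchy CDF 1/2 + arctan(·)/π of the BRCM.

```python
import numpy as np

rng = np.random.default_rng(1)

N_IN, N_OUT, N_H = 4, 3, 6          # raw-material, finished-goods, hidden neurons (illustrative)
N_V = N_IN + N_OUT                   # visible layer = raw materials + finished goods

# Untrained toy parameters; a real model would obtain these via training.
W = rng.normal(0, 0.1, (N_V, N_H))   # visible-to-hidden connection weights
b_v = np.zeros(N_V)                  # visible biases
b_h = np.zeros(N_H)                  # hidden biases
sigma = np.ones(N_V)                 # mean square deviations (normalized data)

def reconstruct(v):
    """One positive/negative pass: sample Bernoulli hidden states with the
    Cauchy CDF 1/2 + arctan(.)/pi, then a Gaussian visible reconstruction."""
    p_h = 0.5 + np.arctan(b_h + (v / sigma) @ W) / np.pi
    h = (p_h >= rng.random(N_H)).astype(float)   # stochastic hidden states
    mu_v = b_v + sigma * (W @ h)                 # visible means
    return mu_v + sigma * rng.normal(0, 1, N_V)  # Gaussian visible sample

def anomalous_periods(periods, threshold):
    """Flag quantization periods whose mean absolute reconstruction error
    exceeds a materiality threshold set by the decision maker."""
    return [t for t, v in enumerate(periods)
            if np.mean(np.abs(v - reconstruct(v))) > threshold]

periods = rng.random((10, N_V))      # toy visible vectors, one per quantization period
print(anomalous_periods(periods, threshold=0.5))
```

Periods whose visible vectors (raw-material releases plus posted products) are poorly reconstructed by the model trained on verified data are flagged as anomalous.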
Thus, the amounts of raw materials and finished products by type are supplied to the visible layer for the planning period or the quantization period of the verification period. To train the neural network, "correct" data (whose formation has been verified) are used. Data subject to verification are used as control data.
The block diagram of the model of the Gauss-Bernoulli BRCM (bidirectional restricted Cauchy machine) [6], which is a recurrent ANN containing one visible layer and one hidden layer, is shown in Fig. 2.

Figure 2: Block diagram of the model of the Gauss-Bernoulli BRCM (bidirectional restricted Cauchy machine). The visible layer receives the raw materials vector and the finished goods vector; the visible neurons are connected to the hidden neurons.

The components of the Gauss-Bernoulli BRCM are:
● Stochastic visible neurons, whose state is described on the basis of the Gaussian distribution in the form $x_j = \mu_j + \sigma_j N(0,1)$, where $\mu_j$ is the mathematical expectation, which characterizes the average value of indicators of supplied raw materials or capitalized products; $\sigma_j$ is the mean square deviation (if the training vectors are normalized and centered, then $\sigma_j = 1$), which characterizes the variance of the difference in residuals at the beginning of the quantization period; and $N(0,1)$ is a function returning a standard normally distributed random number. The transition probability of the $j$-th stochastic neuron into state $\varepsilon$ is defined as
$$P_j = \frac{1}{\sigma_j\sqrt{2\pi}}\exp\left(-\frac{1}{2}\left(\frac{\varepsilon - \mu_j}{\sigma_j}\right)^2\right).$$
● Stochastic hidden neurons, whose state is described on the basis of the Bernoulli distribution in the form
$$x_j = \begin{cases}1, & \text{with probability } P_j,\\ 0, & \text{with probability } 1 - P_j.\end{cases}$$
The transition probability of the $j$-th stochastic neuron into state 1 is defined as
$$P_j = \frac{1}{2} + \frac{1}{\pi}\arctan\left(\Delta E_j\right),$$
where $\Delta E_j$ is the increment of the ANN energy when the state of the $j$-th stochastic neuron changes from 0 to 1.
Advantages of the Gauss-Bernoulli BRCM:
1. Unlike the majority of ANN, it possesses autoassociative and heteroassociative memory at the same time.
2.
Unlike the bidirectional associative memory and the Boltzmann machine, it works with real-valued data.
3. Unlike the bidirectional associative memory and the Boltzmann machine, it has no restrictions on memory capacity.
4. Unlike the bidirectional associative memory, it provides higher accuracy.
5. Unlike the Boltzmann machine, it has lower computational complexity.

5. Neural Network Model of Detection of Anomalies
Positive phase (Steps 1–3).
1. Initialization of the state of the visible neurons corresponding to raw materials: $\mathbf{x1}^{in} = \mathbf{x}^{in}$.
2. Initialization of the state of the visible neurons corresponding to finished goods: $\mathbf{x1}^{out} = \mathbf{x}^{out}$.
3. Calculation of the state of the hidden neurons, $j = \overline{1, N_h}$:
$$P_j = \frac{1}{2} + \frac{1}{\pi}\arctan\left(b_j^h + \sum_{i=1}^{N_{in}} w_{ij}^{in\text{-}h}\frac{x1_i^{in}}{\sigma_i^{in}} + \sum_{i=1}^{N_{out}} w_{ij}^{out\text{-}h}\frac{x1_i^{out}}{\sigma_i^{out}}\right),$$
$$x1_j^h = \begin{cases}1, & P_j \ge U(0,1),\\ 0, & P_j < U(0,1),\end{cases}$$
where $U(0,1)$ is a function returning a uniformly distributed random number in the range $[0,1]$.
Negative phase (Steps 4 and 5).
4. Calculation of the state of the visible neurons corresponding to raw materials, $j = \overline{1, N_{in}}$:
$$\mu_j^{in} = b_j^{in} + \sigma_j^{in}\sum_{i=1}^{N_h} w_{ji}^{in\text{-}h}\, x1_i^h, \qquad x2_j^{in} = \mu_j^{in} + \sigma_j^{in} N(0,1).$$
5.
Calculation of the state of the visible neurons corresponding to finished goods, $j = \overline{1, N_{out}}$:
$$\mu_j^{out} = b_j^{out} + \sigma_j^{out}\sum_{i=1}^{N_h} w_{ji}^{out\text{-}h}\, x1_i^h, \qquad x2_j^{out} = \mu_j^{out} + \sigma_j^{out} N(0,1),$$
where $b_j^h$ is the bias of the $j$-th neuron of the hidden layer, $b_j^{in}$ is the bias of the $j$-th neuron of the visible layer corresponding to raw materials, $b_j^{out}$ is the bias of the $j$-th neuron of the visible layer corresponding to finished goods, $w_{ij}^{in\text{-}h}$ is the connection weight from the $i$-th neuron of the visible layer corresponding to raw materials to the $j$-th neuron of the hidden layer, $w_{ij}^{out\text{-}h}$ is the connection weight from the $i$-th neuron of the visible layer corresponding to finished goods to the $j$-th neuron of the hidden layer, $N_h$ is the number of neurons in the hidden layer, $N_{in}$ is the number of neurons in the visible layer corresponding to raw materials, and $N_{out}$ is the number of neurons in the visible layer corresponding to finished goods.

6. Choice of Criterion for Evaluation of Efficiency of the Neural Network Model of Detection of Anomalies
In this work, for training the BRCM model, the objective function is chosen that selects the values of the parameter vector $\mathbf{w} = (w_{11}^{in\text{-}h}, \ldots, w_{N_{in}N_h}^{in\text{-}h}, w_{11}^{out\text{-}h}, \ldots, w_{N_{out}N_h}^{out\text{-}h})$ delivering the minimum of the root mean square error (the difference between the sample on the model and a test sample):
$$F = \frac{1}{M(N_{in}+N_{out})}\sum_{m=1}^{M}\left(\left\|\mathbf{x2}_m^{in} - \mathbf{d}_m^{in}\right\|^2 + \left\|\mathbf{x2}_m^{out} - \mathbf{d}_m^{out}\right\|^2\right) \to \min_{\mathbf{w}},$$
where $\mathbf{x2}_m^{in}$ is the $m$-th evaluation vector of raw materials on the model, $\mathbf{d}_m^{in}$ is the $m$-th raw materials vector, $\mathbf{x2}_m^{out}$ is the $m$-th evaluation vector of finished goods on the model, and $\mathbf{d}_m^{out}$ is the $m$-th vector of finished goods.

7. Method of Parametric Identification of the Neural Network Model of Detection of Anomalies on the Basis of Algorithm CD-1 (One-Step Contrastive Divergence)
The method of parametric identification of the neural network model of anomaly detection on the basis of the CD-1 algorithm consists of the following blocks (Fig. 3):
1. Initialization.
2.
Initialization of the state of visible neurons corresponding to raw materials (positive phase).
3. Initialization of the state of visible neurons corresponding to the finished product (positive phase).
4. Calculation of the state of hidden neurons (positive phase).
5. Calculation of the state of visible neurons corresponding to raw materials (negative phase).
6. Calculation of the state of visible neurons corresponding to the finished product (negative phase).
7. Calculation of the state of hidden neurons (negative phase).
8. Setting synaptic weights based on the stochastic rule.
9. Continue? If yes, go to block 2; if no, stop.

Figure 3: The sequence of procedures of the method of parametric identification of the neural network model of anomaly detection on the basis of CD-1.

1. Initialization
Set the training iteration number $n = 1$; initialize by means of a uniform distribution on the interval $(0,1)$ or $[-0.5, 0.5]$ the biases $b_i^{in}(n)$, $i = \overline{1, N_{in}}$, $b_i^{out}(n)$, $i = \overline{1, N_{out}}$, $b_j^h(n)$, $j = \overline{1, N_h}$, and the weights $w_{ij}^{in\text{-}h}(n)$, $i = \overline{1, N_{in}}$, $j = \overline{1, N_h}$, $w_{ij}^{out\text{-}h}(n)$, $i = \overline{1, N_{out}}$, $j = \overline{1, N_h}$, with $w_{ii}^{in\text{-}h}(n) = 0$, $w_{ii}^{out\text{-}h}(n) = 0$, $w_{ij}^{in\text{-}h}(n) = w_{ji}^{in\text{-}h}(n)$, $w_{ij}^{out\text{-}h}(n) = w_{ji}^{out\text{-}h}(n)$.
The training set is $\{(\mathbf{x}_m^{in}, \mathbf{x}_m^{out}) \mid \mathbf{x}_m^{in} \in (0,1)^{N_{in}}, \mathbf{x}_m^{out} \in (0,1)^{N_{out}}\}$, $m = \overline{1, M}$, where $\mathbf{x}_m^{in}$ is the $m$-th raw materials vector, $\mathbf{x}_m^{out}$ is the $m$-th vector of finished goods, and $M$ is the power of the training set. The vector of mean square deviations for the raw materials vector is $\boldsymbol{\sigma}^{in} = (\sigma_1^{in}, \ldots, \sigma_{N_{in}}^{in})$; the vector of mean square deviations for the vector of finished goods is $\boldsymbol{\sigma}^{out} = (\sigma_1^{out}, \ldots, \sigma_{N_{out}}^{out})$.
Positive phase (Steps 2–4).
2. Initialization of the state of the visible neurons corresponding to raw materials: $\mathbf{x1}_m^{in} = \mathbf{x}_m^{in}$, $m = \overline{1, M}$.
3. Initialization of the state of the visible neurons corresponding to finished goods: $\mathbf{x1}_m^{out} = \mathbf{x}_m^{out}$, $m = \overline{1, M}$.
4.
Calculation of the state of the hidden neurons, $j = \overline{1, N_h}$:
$$P_{mj} = \frac{1}{2} + \frac{1}{\pi}\arctan\left(b_j^h(n) + \sum_{i=1}^{N_{in}} w_{ij}^{in\text{-}h}(n)\frac{x1_{mi}^{in}}{\sigma_i^{in}} + \sum_{i=1}^{N_{out}} w_{ij}^{out\text{-}h}(n)\frac{x1_{mi}^{out}}{\sigma_i^{out}}\right), \quad m = \overline{1, M},$$
$$x1_{mj}^h = \begin{cases}1, & P_{mj} \ge U(0,1),\\ 0, & P_{mj} < U(0,1),\end{cases} \quad m = \overline{1, M}.$$
Negative phase (Steps 5–7).
5. Calculation of the state of the visible neurons corresponding to raw materials, $j = \overline{1, N_{in}}$:
$$\mu_{mj}^{in} = b_j^{in}(n) + \sigma_j^{in}\sum_{i=1}^{N_h} w_{ji}^{in\text{-}h}(n)\, x1_{mi}^h, \qquad x2_{mj}^{in} = \mu_{mj}^{in} + \sigma_j^{in} N(0,1), \quad m = \overline{1, M}.$$
6. Calculation of the state of the visible neurons corresponding to finished goods, $j = \overline{1, N_{out}}$:
$$\mu_{mj}^{out} = b_j^{out}(n) + \sigma_j^{out}\sum_{i=1}^{N_h} w_{ji}^{out\text{-}h}(n)\, x1_{mi}^h, \qquad x2_{mj}^{out} = \mu_{mj}^{out} + \sigma_j^{out} N(0,1), \quad m = \overline{1, M}.$$
7. Calculation of the state of the hidden neurons, $j = \overline{1, N_h}$:
$$P_{mj} = \frac{1}{2} + \frac{1}{\pi}\arctan\left(b_j^h(n) + \sum_{i=1}^{N_{in}} w_{ij}^{in\text{-}h}(n)\frac{x2_{mi}^{in}}{\sigma_i^{in}} + \sum_{i=1}^{N_{out}} w_{ij}^{out\text{-}h}(n)\frac{x2_{mi}^{out}}{\sigma_i^{out}}\right), \quad m = \overline{1, M},$$
$$x2_{mj}^h = \begin{cases}1, & P_{mj} \ge U(0,1),\\ 0, & P_{mj} < U(0,1),\end{cases} \quad m = \overline{1, M}.$$
8. Setting of the biases and synaptic weights on the basis of the stochastic rule:
$$b_i^{in}(n+1) = b_i^{in}(n) + \eta\left(\frac{1}{M}\sum_{m=1}^{M}\frac{x1_{mi}^{in}}{(\sigma_i^{in})^2} - \frac{1}{M}\sum_{m=1}^{M}\frac{x2_{mi}^{in}}{(\sigma_i^{in})^2}\right), \quad i = \overline{1, N_{in}},$$
$$b_i^{out}(n+1) = b_i^{out}(n) + \eta\left(\frac{1}{M}\sum_{m=1}^{M}\frac{x1_{mi}^{out}}{(\sigma_i^{out})^2} - \frac{1}{M}\sum_{m=1}^{M}\frac{x2_{mi}^{out}}{(\sigma_i^{out})^2}\right), \quad i = \overline{1, N_{out}},$$
$$b_i^{h}(n+1) = b_i^{h}(n) + \eta\left(\frac{1}{M}\sum_{m=1}^{M}x1_{mi}^{h} - \frac{1}{M}\sum_{m=1}^{M}x2_{mi}^{h}\right), \quad i = \overline{1, N_h},$$
$$\rho_{ij}^{in+} = \frac{1}{M}\sum_{m=1}^{M}\frac{x1_{mi}^{in}\, x1_{mj}^{h}}{\sigma_i^{in}}, \qquad \rho_{ij}^{in-} = \frac{1}{M}\sum_{m=1}^{M}\frac{x2_{mi}^{in}\, x2_{mj}^{h}}{\sigma_i^{in}}, \quad i = \overline{1, N_{in}}, \; j = \overline{1, N_h},$$
$$w_{ij}^{in\text{-}h}(n+1) = w_{ij}^{in\text{-}h}(n) + \eta\left(\rho_{ij}^{in+} - \rho_{ij}^{in-}\right), \quad i = \overline{1, N_{in}}, \; j = \overline{1, N_h},$$
$$\rho_{ij}^{out+} = \frac{1}{M}\sum_{m=1}^{M}\frac{x1_{mi}^{out}\, x1_{mj}^{h}}{\sigma_i^{out}}, \qquad \rho_{ij}^{out-} = \frac{1}{M}\sum_{m=1}^{M}\frac{x2_{mi}^{out}\, x2_{mj}^{h}}{\sigma_i^{out}}, \quad i = \overline{1, N_{out}}, \; j = \overline{1, N_h},$$
$$w_{ij}^{out\text{-}h}(n+1) = w_{ij}^{out\text{-}h}(n) + \eta\left(\rho_{ij}^{out+} - \rho_{ij}^{out-}\right), \quad i = \overline{1, N_{out}}, \; j = \overline{1, N_h},$$
where $\eta$ is the learning rate.
9.
Check of the termination condition.
If
$$\frac{1}{M(N_{in}+N_{out})}\sum_{m=1}^{M}\left(\sum_{i=1}^{N_{in}}\left|x1_{mi}^{in} - x2_{mi}^{in}\right| + \sum_{i=1}^{N_{out}}\left|x1_{mi}^{out} - x2_{mi}^{out}\right|\right) > \varepsilon,$$
then $n \leftarrow n + 1$ and go to Step 2.

8. Experiments and Results
The proposed method was investigated on indicators of delivery and payment of stocks of a manufacturing enterprise with a two-year depth of sample and daily time intervals. The results of comparing the proposed neural network model (BRCM) with the bidirectional counterpropagation neural network (BCPNN) model on the basis of the root mean square error criterion are presented in Table 2.

Table 2
Comparison of the proposed BRCM neural network model with the traditional BCPNN on the basis of the root mean square error criterion

Root mean square error of the neural network model: BRCM 0.03; BCPNN 0.06.

According to Table 2, the use of BRCM reduces the root mean square error and thereby increases the accuracy of anomaly detection.
The results of comparing the proposed method of parametric identification with and without the use of a GPU and the CUDA parallel information processing technology are provided in Table 3.

Table 3
Comparison of the computational complexity of the method of parametric identification with and without the use of a GPU

Computational complexity: with GPU $O(2\log_2(N_{in} N_{out} N_h M))$; without GPU $O(4(N_{in}+N_{out}) N_h M)$.

According to Table 3, the use of a GPU reduces the computational complexity by a factor of approximately $2 N_h M / \log_2(N_h M)$ and thereby increases the speed of parametric identification.

9. Conclusions
1. The relevant problem of increasing the efficiency of anomaly detection in waste-free production audit data was solved by means of the neural network model of the Gauss-Bernoulli bidirectional restricted Cauchy machine.
2.
The proposed neural network model of the Gauss-Bernoulli bidirectional restricted Cauchy machine possesses autoassociative and heteroassociative memory at the same time; works with real-valued data; has no restrictions on memory capacity; provides high accuracy of anomaly detection; and uses the Cauchy distribution, which increases the convergence speed of the method of parametric identification.
3. To increase the speed of parametric identification of the Gauss-Bernoulli bidirectional restricted Cauchy machine, a method of parametric identification intended for implementation on a GPU by means of CUDA technology was developed. The proposed method increases training speed by a factor of approximately $2 N_h M / \log_2(N_h M)$, where $N_h$ is the number of neurons in the hidden layer and $M$ is the power of the training set.
4. The experiments confirmed the operability of the developed software and allow it to be recommended for practical use in a subsystem of automated analysis of an audit DSS for anomaly detection.
Prospects for further research consist in checking the proposed methods on a broader set of test databases.

10. References
[1] The World Bank, World Development Report 2016: Digital Dividends. URL: https://www.worldbank.org/en/publication/wdr2016.
[2] M. Schultz, M. Tropmann-Frick, Autoencoder neural networks versus external auditors: detecting unusual journal entries in financial statement audits, in: Proceedings of the 53rd Hawaii Intern. Conf. on System Sciences, HICSS, Maui, Hawaii, USA, 2021, pp. 5421–5430. doi: 10.24251/HICSS.2020.666.
[3] J. Nonnenmacher, et al., Using autoencoders for data-driven analysis in internal auditing, in: Proceedings of the 54th Hawaii Intern. Conf. on System Sciences, Maui, Hawaii, USA, 2021, pp. 5748–5757.
[4] T. Neskorodieva, E. Fedorov, I. Izonin, Forecast method for audit data analysis by modified liquid state machine, in: Intelligent Information Technologies & Systems of Information Security, Proceedings of the 1st Intern. Workshop, Khmelnytskyi, Jun.
10–12, 2020.
[5] T. Neskorodieva, E. Fedorov, Automatic analysis method of audit data based on neural networks mapping, in: Information Technology and Interaction, Proceedings of the 7th Intern. Conf., IT&I, Kyiv, Dec. 2–3, 2020.
[6] T. Neskorodieva, E. Fedorov, Method for automatic analysis of compliance of expenses data and the enterprise income by neural network model of forecast, in: Modern Machine Learning Technologies and Data Science, Proceedings of the 2nd Intern. Workshop on MoMLeT&DS, Lviv-Shatsk, Jun. 2–3, 2020.
[7] V. Chandola, A. Banerjee, V. Kumar, Anomaly detection: A survey, ACM Computing Surveys 41(3) (2009). doi: 10.1145/1541880.1541882.
[8] V. Hodge, J. Austin, A survey of outlier detection methodologies, Artificial Intelligence Review 22(2) (2004) 85–126.
[9] M. Agyemang, K. Barker, R. Alhajj, A comprehensive survey of numeric and symbolic outlier mining techniques, Intelligent Data Analysis 10(6) (2006) 521–538.
[10] U. P. Singh, S. Jain, A. Tiwari, R. K. Singh, Gradient evolution-based counter propagation network for approximation of noncanonical system, Soft Computing 23 (2019) 4955–4967. doi: 10.1007/s00500-018-3160-7.
[11] J.-W. Cho, H.-M. Park, Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition, Sig. Process 120 (2016) 200–208.
[12] R. Chai, et al., Driver fatigue classification with independent component by entropy rate bound minimization analysis in an EEG-based system, IEEE J. Biomed. Health Inf. 21(3) (2017) 715–724.
[13] M. I. Achmad, H. Adinugroho, A. Susanto, Cerebellar model articulation controller (CMAC) for sequential images coding, in: 2014 The 1st International Conference on Information Technology, Computer, and Electrical Engineering. doi: 10.1109/ICITACEE.2014.7065734.
[14] S. Haykin, Neural Networks and Learning Machines, Pearson Education, Upper Saddle River, New Jersey, 2009.
[15] P. M.
Baggenstoss, Applications of projected belief networks (PBN), in: 2019 27th European Signal Processing Conference (EUSIPCO). doi: 10.23919/EUSIPCO.2019.8902708.
[16] P. Sountsov, P. Miller, Spiking neuron network Helmholtz machine, Front. Comput. Neurosci., 2015. doi: 10.3389/fncom.2015.00046.
[17] T. Kohonen, Self-Organization and Associative Memory, 3rd ed., Springer, New York, 2012.
[18] H. de Vries, R. Memisevic, A. Courville, Deep learning vector quantization, in: ESANN 2016 Proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Bruges, Belgium, Apr. 27–29, 2016.
[19] R. A. Lobo, M. E. Valle, Ensemble of binary classifiers combined using recurrent correlation associative memories, in: Brazilian Conference on Intelligent Systems, BRACIS, 2020, pp. 442–455.
[20] M. Kobayashi, Quaternionic Hopfield neural networks with twin-multistate activation function, Neurocomputing 267 (2017) 304–310. doi: 10.1016/j.neucom.2017.06.013.
[21] K. L. Du, M. N. S. Swamy, Neural Networks and Statistical Learning, Springer-Verlag, London, 2014. doi: 10.1007/978-1-4471-5571-3.
[22] Y. Park, Optimal and robust design of brain-state-in-a-box neural associative memories, Neural Networks 23(2) (2010) 210–218. doi: 10.1016/j.neunet.2009.10.008.
[23] O. I. Khristodulo, A. A. Makhmutova, T. V. Sazonova, Use algorithm based at Hamming neural network method for natural objects classification, Procedia Computer Science 103 (2017) 388–395. doi: 10.1016/j.procs.2017.01.126.
[24] T. Barszcz, A. Bielecki, M. Wójcik, M. Bielecka, ART-2 artificial neural networks applications for classification of vibration signals and operational states of wind turbines for intelligent monitoring, in: G. Dalpiaz et al. (Eds.), Advances in Condition Monitoring of Machinery in Non-Stationary Operations, Lecture Notes in Mechanical Engineering, Springer, Berlin, Heidelberg, 2014. doi: 10.1007/978-3-642-39348-8_58.
[25] A. P. Sonika, M. Chauhan, A.
Dixit, New technique for detecting fraudulent transactions using hybrid network consisting of full-counter propagation network and probabilistic network, in: 2016 International Conference on Computing, Communication and Automation (ICCCA), 2016, pp. 177–182. doi: 10.1109/CCAA.2016.7813713.
[26] E. Javidmanesh, Global stability and bifurcation in delayed bidirectional associative memory neural networks with an arbitrary number of neurons, J. Dyn. Sys., Meas., Control 139(8) (2017). doi: 10.1115/1.4036229.
[27] Q. Wang, et al., A novel restricted Boltzmann machine training algorithm with fast Gibbs sampling policy, Mathematical Problems in Engineering, 2020. doi: 10.1155/2020/4206457.
[28] V. Buriachok, et al., Invasion detection model using two-stage criterion of detection of network anomalies, in: Cybersecurity Providing in Information and Telecommunication Systems (CPITS), Jul. 2020, pp. 23–32.
[29] A. Carlsson, et al., Sustainability research of the secure wireless communication system with channel reservation, in: 2020 IEEE 15th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET), 2020. doi: 10.1109/tcset49122.2020.235583.
[30] V. Lakhno, et al., Funding model for port information system cyber security facilities with incomplete hacker information available, Journal of Theoretical and Applied Information Technology 96(13) (2018) 4215–4225.