<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Electoral neural network technologies in the information- analytical system of the security operation center</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Andrey Mikryukov associate Professor, PhD, Institute of Digital Economics and Information Technology of the Plekhanov Russian University of Еconomics</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mikhail Mazurov professor, doctor of Physical and Mathematical Sciences Institute of Digital Economics and Information Technology of the Plekhanov Russian University of Еconomics</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>347</fpage>
      <lpage>355</lpage>
      <abstract>
        <p>The article describes an approach to improving methods for solving security event classification problems in information and analytical systems of security operation centers based on an ensemble of electoral neural networks, as well as using the adaptive reduction method of a neural network ensemble to improve the classification reliability. Electoral neurons differ from classic neurons in a more efficient way of processing input information. The approach is based on the application of procedures for adaptive reduction of the results of classifiers of the ensemble and the choice of the method of their aggregation. It is shown that the use of the considered approach provides an increase in the efficiency of solving the set task.</p>
      </abstract>
      <kwd-group>
        <kwd>collective classifier</kwd>
        <kwd>neural network ensemble</kwd>
        <kwd>electoral neuron</kwd>
        <kwd>electoral neural network</kwd>
        <kwd>information and analytical systems</kwd>
        <kwd>security operation center</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Nowadays, neural network approaches, in particular, the use of neural network ensembles, which are an example of
collective problem solving, are widely used for solving problems of modeling various systems that are hard to formalize
and weakly structured.</p>
      <p>The quality of solving a specific task (data mining, forecasting, pattern recognition, classification, etc.) can be
significantly improved using neural network ensembles, which assume the formation and training of a finite set of neural
networks, the results of which are taken into account in the overall solution. At the same time, individual decisions are
coordinated in such a way that the overall final decision is the best.</p>
      <p>
        As you know, one of the fundamental problems of improving the functioning of an ensemble of neural networks in terms
of increasing their accuracy and reliability is the generation of the diversity of the ensemble (the difference between
individual models) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Aggregation of similar models in an ensemble cannot lead to a significant improvement in the
quality of the solution of the problem.
      </p>
      <p>
        To resolve this contradiction, it was proposed to use approaches based on a collective (ensemble) of neural networks, for
the construction of which a new class of artificial neurons, the so-called selective neurons, differing from classical neurons
using a more efficient way of processing input information close to a biological neuron are used [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The considered
approaches can be used as part of the decision support subsystems of information and analytical systems (IAS) of security
operational centers. The next section presents the features of the construction and operation of electoral neural networks.
2 Features of the construction and operation of electoral neural networks
Electoral networks are built on the basis of selective neurons. Teaching neural networks is not carried out by changing the
weights of synaptic connections, but by changing the quantity and quality of inhibitory and exciting dendrites (signal
transmitters) on the basis of which selective neurons are built [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        A mathematically artificial neuron usually has a non-linear function of a linear combination of signals with a limited
number of inputs. Exit the neuron from the next layer. The communication channels through which signals are received are
determined by weights. The task of learning is to minimize the total error value of the output value. Figure 1 shows the
McCulloch-Pitts neuron scheme, which is the basic element of the most classic neural networks of practical importance [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <sec id="sec-1-1">
        <title>Legend:</title>
        <p>x1, x2, …, x8 are input signals;
КС is communication channels,
∑ is adder;
W is the weight of synaptic connections;
P is function of the product xi * wi;
F is a nonlinear threshold function;
Y is a set of output signals.</p>
        <p>Figure 1 – McCulloch-Pitts neuron diagram
A mathematical model of an artificial neuron is presented below. The output value y can be represented by the formula (1).</p>
        <p>N
y  F (S )  F ( wi xi  )  F (w, x  ) ,</p>
        <p>i1
where F (S) is the data transformation function after adder ∑, x  (x1,..., xN ),
w  (w1,..., wN ), (w, x) is the scalar
(1)
product of vectors w, x ; N is the number of inputs of the neuron;  - threshold of arousal.</p>
        <p>However, the classical types of neural networks have a significant drawback - the instability to retraining. If the number
of weights in the network is sufficiently large, then the operation of summing their linear combination becomes extremely
computationally laborious, and a retraining effect is likely to occur when the network recognizes images on the training set
with extremely high accuracy, but at the same time shows a fairly large percentage of error on the test set input data.</p>
        <p>
          Thus, the drawbacks of classic neural networks are a consequence of the use of weights of synaptic connections. In
biological neurons there is no weight gradation of synaptic connections. They solve the main tasks of processing input
signals by changing the number of dendrites receiving an input signal at the input of a biological neuron and, accordingly,
changing the configuration of their clustering [
          <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
          ].
        </p>
        <p>The analysis showed that, by analogy with biological mechanisms, it is advisable to use a neural network that includes
so-called selective neurons, with controlled clustering of input channels (synapses) in quantity, as well as clustering of input
channels in quality (exciting or inhibiting). As an effective classifier of images, it was proposed to use a neural network
based on selective neurons. The selective neuron does not have weights of synaptic connections and is close in properties to
its biological analog (Figure 2).</p>
        <p>
          Cluster formation includes blocking non-informative communication channels that do not conduct excitatory or
inhibitory signals [
          <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7">4-7</xref>
          ]. Thus, after learning, each neuron has a cluster of input communication channels with an individual
transfer characteristic, where some of the inputs are blocked for signals (the blocked inputs are marked with triangles),
which are certainly different from the training code combination.
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Legend:</title>
        <p>x1, x2, …, x9 are input signals in the form of a binary code;
K is a cluster of input channels of communication, formed</p>
        <p>in accordance with the binary code at the entrance;
∑ is adder;
F is a nonlinear threshold function;
Y is output signal.</p>
        <p>A distinctive feature of the electoral neuron is the presence of a cluster formed from a combination of internal
communication channels. The cluster is formed in accordance with the specific properties of the input signal; the cluster is
connected to an adder, after which a nonlinear threshold signal transformation is performed, which represents the output
signal.</p>
        <p>
          Taking into account the drawbacks of the known single-layer perceptron, a selective single-layer perceptron was
proposed, based on electoral neurons [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. An effective method of obtaining selectivity in the system of n neurons is
implemented practically. The block diagram of the electoral perceptron is shown in Figure 3.
        </p>
      </sec>
      <sec id="sec-1-3">
        <title>Legend:</title>
        <p>Кi is formed clusters of communication channels;
 is adder;
F is a non-linear function;</p>
        <p>Yi. is output signal.</p>
        <p>In a single-layer electoral perceptron, clusters of specialized communication channels, dendrites, are configured in each
electoral neuron and tuned to the corresponding characteristic vectors of the input signals. Triangles denote blocked
communication channels from among the input, which are not essential characteristic vectors for the object at the input of
the perceptron.
Mathematically, a selective neuron can be described as follows. The output value of y can be represented by the formula
y  F ( xi  ) , (2)</p>
        <p>iK
where K is a cluster of permissible values that are part of a cluster of an electoral neuron. Unlike formula (1) in formula (2)
there are no weight coefficients wi.</p>
        <p>Possible characteristic code combinations of objects at the input of a neuron in the form of vectors are represented by
expressions (3).</p>
        <p>x1 = ( x11 , ..., x1n ) ; … ; x m = ( xm1 , ..., xmn ) ,
where n is the number of elements of the code combination; m is the number of objects.</p>
        <p>All possible code combinations of input objects form a matrix A.</p>
        <p> x11, x12..., x1n 
 
A =  ... ... .. 
 
 xm1, xm2..., xmn </p>
        <p>B  A* AT ,
n
Sii   x x</p>
        <p>ij ji  ( xi , xi )  Ni  N ,
j1</p>
        <p>Let the i -th neuron contain a cluster of connections characterized by a code combination xi = ( xi1, ..., xin ) . Upon
receipt at the input of the i - th neuron of the code combination of the input object we get</p>
        <p>n
Sij   xik xkj  ( xi , x j )</p>
        <p>k 1</p>
        <p>The values of the sums S ij are equal to the elements of the matrix B (4).
where AT is the transposed matrix A. In total we get m x m sums Sij . The largest will be the sum (5).
(3)
(4)
(5)
where N i is the number of units in the code combination xi = ( xi1, ..., xin ) . The ratio Sij  N used to recognize input
features.</p>
        <p>Select the threshold valueUп   Ni , where 0   1. Then the output is</p>
        <p> 1
y= 
 0</p>
        <p>Sik  Uп</p>
        <p>Sik  Uп</p>
        <p>This property characterizes the selective properties of the considered perceptron in the recognition of various input
objects.</p>
        <p>The graphical interpretation of the selective properties of a single-layer perceptron is represented by the matrix values
B  A AT as a graph in three-dTimensional space. From a physical point of view, the values Sij  f (i, j) of the expression
are the matrix values B  A A equal to the sum of the products of the input signal x j after it passes through the cluster of
communication prechannels, combining the working (unlocked) inputs of the i - th neuron, to the corresponding values of
the threshold nonlinear function F at the perceptron output.</p>
        <p>To build the graph in three-dimensional space, the tools of the Matlab-7 application software package are used. The axes
Ох and Оу are the values of i and j. As an example, selective recognition was implemented for 20 works of art in the field of
150x100. A graphical interpretation of the selective properties of a single-layer perceptron based on selective neurons is
illustrated in Figure 4, which presents the results of processing (filtering) input signals (objects).</p>
        <p>The results of processing the input signals shown in Figure 4 confirm the selective properties of a single-layer
perceptron based on selective neurons, which are represented by a rather pronounced diagonal part.</p>
        <p>Due to the peculiarities of its construction and operation, the selective neural network provides:
- selective recognition of input signals without using the weighting of their synaptic connections;
- the ability to encode an input signal of a certain type by the channel number or the number of the recording neuron;
- compression of input information, due to the preservation of only that information about objects that falls into a given
channel or recording neuron;
- increase the speed of operation;
- improving the reliability of recognition of objects with a large number of them;
- a significantly greater degree of adequacy of the selective neuron to a biological single-layer perceptron.</p>
        <p>
          Taking into account the considered features of electoral neurons, proposals were developed for improving methods for
classifying security events in information and analytical systems of situational information security operational control
centers based on an ensemble of electoral neural networks.
3 Features of the composition of artificial neural networks in ensembles
A promising direction for the improvement of artificial neural networks (ANN) is the union (composition) of many
individual ANNs in an ensemble [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. In this case, the errors of individual classification algorithms are mutually
compensated. Ensemble organization is considered in a number of works [
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref9">9-14</xref>
          ]. In [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], the efficiency of using an
ensemble organization for image recognition was experimentally proved.
        </p>
        <p>When constructing an ensemble of neural networks, a finite set of previously trained neural networks is simultaneously
used, the output signals of which are combined into a joint estimate, superior in quality to the results obtained using the
local networks included in the ensembles.</p>
        <p>
          The ensemble H ( x̅) of models hi( x̄) (i=1,2,..., N) is a composition of algorithmic operators hi: Rd→R and the corrective
operation F: RN→R, in which the set of estimates h1( x̅),h2( x̅),...,hN( x̅) corresponds to the final grade H( x̅) [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]:
        </p>
        <p>H( x̅) = F(h1( x̅),h2( x̅),…,hN( x̅)) (6)</p>
        <p>
          As you know, the fundamental task in the construction of ensembles is the generation of the diversity of the ensemble
(or differences in individual models) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>
          Obviously, the aggregation of similar models in an ensemble cannot lead to a significant improvement in the quality of
the solution of the problem. The ensemble of models can be better than the individual models included in the ensemble for
the following reasons [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]:
        </p>
        <p>- The ensemble reduces the rms error. The use of an ensemble of models averages the error of each individual model
and reduces the influence of instabilities and randomness in the formation of hypotheses. Solving the problems of
classification and regression is a search for hypotheses about the properties of the system or the next state of the system. If
you use a sufficiently large number of models, trained in approximately the same set of examples, then you can reduce the
instability and randomness of the result obtained by combining the results. Averaging over the set of models built on the
basis of independent training sets, always reduces the expected value of the root-mean-square error.</p>
        <p>- Ensembles of models trained on different subsets of the source data have a greater chance of finding the global
optimum, since they are looking for it from different starting points.</p>
        <p>- The combined hypothesis may not be in the set of possible hypotheses for basic classifiers, that is, when building a
combined hypothesis, the set of possible hypotheses is expanded.</p>
        <p>Several approaches are applied to the construction of model ensembles. Most often, an ensemble consists of basic
models of the same type, which are trained in different sets of training samples (Figure 5).</p>
        <p>
          For the formation of the output value of the ensemble under certain states of the outputs of the models, the most
common are the following algorithms [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]:
1. Voting. Used in classification tasks. The class chosen by a simple majority of ensemble models is chosen.
2. Weighted voting. It differs from simple voting by assigning weights (points) for the results of different models. The
points take into account the accuracy of the work of different classifiers.
3. Averaging (weighted or unweighted). It is used in solving with the help of the ensemble for regression, when the outputs
of the models will be numeric. The output of the entire ensemble can be defined as a simple average value of the outputs of
all models. If weighted averaging is performed, the outputs of the models are multiplied by the corresponding weights.
        </p>
        <p>The simplest example of a vote is simple voting:</p>
        <p>R = f(r1,r2,…,rn), (10)
where R is the total solution, ri is the individual solution of the i-th ANN, n is the total number of ANN in the ensemble.
The function f determines the method of generalization of individual decisions.</p>
        <p>The solution to the problem of classification consists in choosing the number of one of the classes Aj, j = 1,2,..., J, or
choosing the empty set in the case of non-classification. Each partial solution ri may take the value of the image or be an
empty set if the pattern does not belong to the field of competence of the private classifier. The area of competence is
understood as a subset of objects of a feature space within which the scope of a private classifier with a given subset of
recognizable images is defined. Synthesis of the function f is the central task of using neural network ensembles.</p>
        <p>Each of the solutions ri can be assigned a weight, as well as a competence area. The collective decision is determined by
the set of individual decisions ri, which belong to the field of competence K(ri).</p>
        <p>Thus, the decision of the team is determined by a set of individual decisions corresponding to the areas of their
competence. This approach contains the ANN, whose solutions correspond to the field of competence. At the next stage, the
ensemble output value is calculated by one of the above mentioned algorithms.</p>
        <p>
          Currently, the most developed methods for building ensembles of neural networks are: equal or unequal voting for
classification problems and simple or weighted averaging for regression problems [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>Analysis of existing approaches has shown that they do not always provide the necessary quality of making the final
decision (accuracy and validity).
The most significant of their disadvantages include:
- the dependence of the final result on the reliability of determining the coefficients of competence, which can lead to an
incorrect result;</p>
        <p>- often the training sample contains noise emissions, leading to an increase in the probability of erroneous decisions by
private classifiers, since an attempt to tune the noise to a training algorithm impairs the approximating capabilities of the
network;
- the need to use a large number of examples of training samples for the successful implementation of the algorithm;
- The architecture of a neural network ensemble is often redundant, which does not contribute to an increase in the
accuracy of solving classification problems and leads to a significant increase in the required computing resource.</p>
        <p>
          The effect of these disadvantages can be significantly reduced by using a neural network ensemble based on selective
neural networks.
4 An integrated approach to solving the problem of classifying security events based on neural
network ensembles
It is known that neural network models are used to solve classification problems in the decision support subsystems of
information analysis systems (IAS). They have a number of features and advantages: they are adaptive self-learning
systems, their use allows solving problems that are difficult or impossible to solve by traditional methods due to the absence
of formalized mathematical descriptions of the processes of the object of study. Neural network models have an associative
memory and in the process of work they accumulate and summarize information, from which their effectiveness increases
over time. Their use is based on training the neural network to extract information from experimental data, which ensures
the objectivity of the results and increases their reliability and reliability [
          <xref ref-type="bibr" rid="ref18 ref19">18,19</xref>
          ].
        </p>
        <p>One of the drawbacks of the existing classification models used in the information-analytical system of the security
operational center is the need to improve accuracy and reliability when solving problems of classification of security events
due to their significant increase and diversity.</p>
        <p>In order to improve the quality of solving the problem of classification of security events, the use of an ensemble based
on electoral neural networks has been proposed. This approach allows, using the advantages of selective neurosites, to
improve the accuracy and reliability of recognition of the input signal, to reduce the computational costs of implementing
the functioning of the neural network ensemble. Improving the accuracy and reliability of solving classification problems is
provided by applying the method of adaptive reduction of a neural network ensemble, as well as choosing and justifying the
method of aggregation of results (composition of outputs of private classifiers).</p>
        <p>The approach includes a set of steps:
1. Formation of the initial set (pool) of neural networks included in the ensemble (determination of the number of
hidden neuron layers, activation functions, size of the training sample).</p>
        <p>2. Reduction of the neural network ensemble (selection of the best classifiers based on the calculation of the
classification reliability coefficients, comparison of the obtained values of the coefficients with the specified threshold
values and evaluating the convergence of the results obtained by private classifiers).</p>
        <p>3. Selection and justification of the method of aggregation of results (voting method).</p>
        <p>The task of classification is to determine by its quantitative characteristics of an unknown object that it belongs to a
particular image.</p>
        <p>At the first stage, neural networks that meet the specified requirements are selected (generated). The architecture and
parameters of the artificial neural network - the classifier, the size of the training sample are determined. In this case, neural
networks are formed in the form of simpler structures, in contrast to the traditional approach to the synthesis of a classifier
using a single neural network. Simple networks are fairly easy to learn and less prone to retraining.</p>
        <p>At the second stage, the selection of the best classifiers is performed. To assess the competence of the classifier, a
special algorithm (referee) is used. Under the competence of the classifier in this area of the space of representation of
objects of classification is understood its accuracy probability of correct classification of objects whose description belongs
to this area.</p>
        <p>To formalize the method of aggregation of results (voting scheme), the coefficient µij≤1 of classification reliability by a
private classifier is used. The coefficient µij is the fraction of objects with a given image value j that fall within the
competence area of the i-th classifier
where F (j) is the cumulative frequency of solutions in the source database, , Fi(j) is the cumulative frequency of solutions
of the image j for the i-th private classifier in its own area of competence. The voting function qj of the j-th class is
represented by the expression:
µij =</p>
        <p>,
µ ,
Суммирование производится по всем оставшимся классификаторам. Решение о принадлежности паттерна Х к
одному из классов Аj принимается в соответствии с правилом:</p>
        <p>If the pattern X does not belong to the area of competence of the private i -th classifier, then the value µij=0. In this
case, the results of the work of the private classifier are not subsequently taken into account, i.e. The procedure for reducing
the neural network ensemble is performed.</p>
        <p>Summation is performed for all remaining classifiers. The decision on the belonging of the pattern X to one of the
classes Аj is made in accordance with the rule:</p>
        <p>If ∗
max
, then
∈
∗ .</p>
        <p>13</p>
        <p>
          Such an approach is not always justified in conditions of noisy initial data, in connection with which the refusal of
classification can be recorded. The choice of a strategy for combining the decisions of private classifiers usually does not
require large computational resources, but at the same time provides a higher quality collective decision. In this case, one of
the strategies can be used: selection and merging [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. In the first case, each subspace of solutions corresponds to a separate
classifier, in the second case, private classifiers are used throughout the solution space.
        </p>
        <p>
          Of the technologies that ensure the effective projection of solutions of private classifiers to the target space, the most
acceptable are [
          <xref ref-type="bibr" rid="ref20 ref9">9,20</xref>
          ]:
- the solution template method (the simplest method);
- weighted averaging;
- the method of multi-tiered generalization (uses a two-step procedure for forming decisions of classifiers with a
nonlinear combination of individual solutions), which has various modifications.
        </p>
        <p>The choice of the preferred aggregation method is based on the rule of minimum classification error.</p>
        <p>
          On the basis of the developed model of the selective neural network ensemble, representing a collection of neural
networks consisting of selective single-layer perceptrons, the modeling of the classification of network attacks on a
corporation's information system was carried out. Of the technologies that ensure the effective projection of the solutions of
private classifiers onto the target space, the method of multi-tiered generalization, having various modifications, was
applied. A computational experiment to test the proposed approach was carried out using test data from the repository [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
        </p>
        <p>The following network traffic parameters were used as source data:
1. Network recipient service (http, ssh, rdp, https);
2. The duration of the session in seconds;
3. The number of packets in the session;
4. The number of requests per session;
5. The number of responses in the session;
6. The number of requests with suspicious sequences of characters;
7. The number of authorization attempts per minute;
8. The number of erroneous packets in a session;
9. Connection status (active, established, closed);
10. The identity of the ports of the sender and receiver (identical / different).</p>
        <p>The result of the work of a trained neural network ensemble is the classification of network traffic into 2 classes
(legitimate traffic or suspicious traffic).</p>
        <p>A computational experiment showed an increase in the reliability of solving the classification problem on the basis of
neural network ensembles by an average of 8-12%, and the quality of diversity (by diversity is understood as the degree of
uncorrelated errors of members of the neural network ensemble, the significant effect of which is confirmed, including
experimentally) of models of selective neural networks calculated as the difference between the quadratic error of the
ensemble and the average error of the individual models of electoral neural networks improved by 10-14%. At the same
time, the volume (complexity) of calculations is significantly reduced when training the classifier electoral neural networks.</p>
        <p>A promising direction is the development of collective methods for classifying security events, taking into account their
considerable diversity and correlation. The most interesting results can be obtained when applying in the architecture of
neural network ensembles of neural networks based on new neural elements using memristor technology.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>5 Conclusion</title>
      <p>Studies have shown that the use of a new class of neural networks based on selective neurons to build neural network
ensembles in the decision support subsystems of information and analytical systems can significantly improve the quality
(accuracy and reliability) of decision making by an ensemble of neural networks. The key difference between a selective
neuron and a classical McCulloch-Pitts neuron is that it does not have weights of synaptic connections and is close in its
properties to its biological analogue.</p>
      <p>The functioning of electoral neural networks is based on controlled clustering of input channels, which is individual for
each neuron of the network, which, in turn, reduces the computational resources for its training and functioning. Taking into
account the peculiarities of the electoral neurons considered, proposals were developed for improving the classification
methods based on an ensemble of electoral neural networks.</p>
      <p>An approach is proposed to solve the problems of classification of security events in the information-analytical system
of the situational information security operational center based on an ensemble of electoral neural networks, which allows
to obtain more accurate and reliable results.</p>
      <p>An improved integrated approach has been developed that provides more efficient classification results. The approach is
based on the application of the procedure for adaptive reduction of the results of private classifiers and the procedure for
choosing the method of aggregation of the results of private classifiers.</p>
      <p>The results of the computational experiment showed an increase in the reliability of the solution to the classification
problem (reduction of the classification error) based on neural network ensembles by an average of 8-12%, the quality of
the diversity of models of electoral neural networks improved by 10-14%. At the same time, the volume (complexity) of
calculations is significantly reduced when training the classifier electoral neural networks.</p>
      <p>The main directions of development of approaches to improving the efficiency of neural network classification systems
include the following: implementation of neural network classifier architectures based on hybrid models of classical and
selective neural networks using multi-agent and cognitive technologies.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Kuncheva</surname>
            <given-names>L.I.</given-names>
          </string-name>
          <article-title>Combining Pattern Classifiers: Methods and algorithms</article-title>
          . John Wiley &amp; Sons, Hoboken, NJ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Mazurov</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ye</surname>
          </string-name>
          .
          <article-title>Pulsed neuron, close to real. The patent for the invention number 2598298</article-title>
          . 08/30/
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Aleksandrov</given-names>
            <surname>Yu</surname>
          </string-name>
          .I.,
          <string-name>
            <surname>Anokhin</surname>
            <given-names>K.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sokolov</surname>
            <given-names>E.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grechenko</surname>
            <given-names>T.N.</given-names>
          </string-name>
          <article-title>and others</article-title>
          .
          <source>Neuron. Signal processing. Plastic. Modeling. Fundamental guide. Publishing house of Tyumen</source>
          State University.
          <year>2008</year>
          .
          <volume>548</volume>
          s.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Mazurov</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          <string-name>
            <surname>Neuron</surname>
          </string-name>
          <article-title>, modeling the properties of a real neuron. The patent for the invention</article-title>
          №
          <volume>2597495</volume>
          . 08/22/
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Mazurov</surname>
            <given-names>M.E.</given-names>
          </string-name>
          <article-title>Election neural networks for the recognition of complex objects. Q: Mathematical biology and bioinformatics: proceedings of the VI International Conference</article-title>
          . M .: MAX Press.
          <year>2016</year>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Mazurov</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          <article-title>One-layer perceptron based on selective neurons. The patent for the invention</article-title>
          №
          <volume>2597497</volume>
          . 08/22/
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Mazurov</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          <article-title>One-layer perceptron, which simulates the properties of a real perceptron. The patent for the invention</article-title>
          №
          <volume>2597496</volume>
          . 08/22/
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Bishop</surname>
            <given-names>C.M.</given-names>
          </string-name>
          <article-title>Neural Networks for Pattern Recognition</article-title>
          . - Oxford University Press,
          <year>1995</year>
          . - 496 p.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Zhou</given-names>
            <surname>Z.-H. Ensemble</surname>
          </string-name>
          <article-title>Methods: Foundations and algorithms</article-title>
          .
          <source>Chapman &amp; Hall / Crc Machine Learning &amp; Pattern Recognition</source>
          .
          <year>2012</year>
          . 236 p.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>KunchevaL.I.</surname>
          </string-name>
          <article-title>Combining Pattern Classifiers: Methods and algorithms</article-title>
          . JohnWiley &amp; Sons, Hoboken, NJ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Terekhov</surname>
            <given-names>S. A.</given-names>
          </string-name>
          <article-title>The genial committees of smart machines // Scientific session MEPhI-2007</article-title>
          .
          <article-title>IX All-Russian Scientific</article-title>
          and
          <source>Technical Conference "Neuroinformatics-2007": Lectures on neuroinformatics. Part</source>
          <volume>2</volume>
          .
          <string-name>
            <surname>- M .: MEPI</surname>
          </string-name>
          ,
          <year>2007</year>
          . - P.
          <fpage>11</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Vorontsov</surname>
            <given-names>K. V.</given-names>
          </string-name>
          <article-title>Lectures on algorithmic compositions. [Electronic resource]</article-title>
          . URL: http://www.ccas.ru/voron/download/Composition.pdf (appeal
          <source>date: 1</source>
          .
          <fpage>07</fpage>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Goncharov</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Ensembles models</article-title>
          . [Electronic resource] .URL: http://www.business data analytics.ru/download/ModelEnsembles.pdf (appeal
          <source>date: 1</source>
          .
          <fpage>07</fpage>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Borovikov</surname>
            <given-names>V.P.</given-names>
          </string-name>
          <article-title>Neural networks</article-title>
          .
          <source>Statistica Neural Networks. Methodology and technology of modern data analysis. 2nd ed., Pererab. and add</source>
          . - M .: Hotline - Telecom,
          <year>2008</year>
          . - 392 p.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Yu</surname>
            .
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Voevodin</surname>
            ,
            <given-names>L.G. Komartsova</given-names>
          </string-name>
          <article-title>The use of a genetic algorithm to optimize the parameters of the neural network in the tasks</article-title>
          of classification // Informatics: problems, methodology, technology. - M .: Publishing House of Moscow State Technical University named after Bauman,
          <year>2005</year>
          . - From 42-46.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Plumton</surname>
            <given-names>C.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuncheva</surname>
            <given-names>L.I.</given-names>
          </string-name>
          <article-title>Choosing parameters for Random Subspase Assignments for fMRI classification</article-title>
          ,
          <source>Proceedings of Multiple Classifier Systems (MCS 10)</source>
          , Cairo, Edgipt, LNCS
          <volume>5997</volume>
          ,
          <year>2010</year>
          , pp.
          <fpage>54</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Paklin</surname>
            <given-names>N. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oreshkov</surname>
            <given-names>V. I.</given-names>
          </string-name>
          <article-title>Business analyst: from data to knowledge</article-title>
          . - SPb .: Peter,
          <year>2013</year>
          . - 704 p.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Bova</surname>
            <given-names>V.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dukkart</surname>
            <given-names>A.N.</given-names>
          </string-name>
          <article-title>The use of artificial neural networks for the collective solution of intellectual problems. Problems of knowledge representation in integrated support systems for management decisions</article-title>
          .
          <source>Izvestia SFU. Technical science. - 2010</source>
          . - №
          <volume>7</volume>
          (
          <issue>108</issue>
          ). - P.
          <fpage>131</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.A.</given-names>
            <surname>Morozov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.P.</given-names>
            <surname>Klimenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.L.</given-names>
            <surname>Lyakhov</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.P. Aleshin</surname>
          </string-name>
          <article-title>The State and Prospects of Neural Network Modeling of DSS in Complex Socio-</article-title>
          <source>Technical Systems // Mathematical Machines and Systems. - 2010</source>
          . -
          <fpage>№</fpage>
          1.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Wolpert</surname>
            <given-names>D.H.</given-names>
          </string-name>
          <article-title>Stacked generalization</article-title>
          .
          <source>Neural Networks</source>
          ,
          <year>1992</year>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>241</fpage>
          -
          <lpage>259</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Frank</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Asuncion</surname>
            <given-names>A.</given-names>
          </string-name>
          <article-title>UCL Machine Learning Repository</article-title>
          . Uniwersity of California, School of Information and Computer Science, Irvine, CA,
          <year>2010</year>
          . [Electronic resource] URL: http://arhive.ics.uci.
          <source>edu/ml (circulation date 1</source>
          .
          <fpage>07</fpage>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>