=Paper= {{Paper |id=Vol-3027/paper105 |storemode=property |title=The Timeliness of the Reserved Service in the Cluster with the Regulation of the Time of Destruction of Overdue Requests in the Node Queues |pdfUrl=https://ceur-ws.org/Vol-3027/paper105.pdf |volume=Vol-3027 |authors=Vladimir Bogatyrev,Stanislav Bogatyrev,Anatoly Bogatyrev }} ==The Timeliness of the Reserved Service in the Cluster with the Regulation of the Time of Destruction of Overdue Requests in the Node Queues== https://ceur-ws.org/Vol-3027/paper105.pdf
The Timeliness of the Reserved Service in the Cluster with the
Regulation of the Time of Destruction of Overdue Requests in
the Node Queues
Vladimir Bogatyrev 1,2, Stanislav Bogatyrev 1,3 and Anatoly Bogatyrev 3
    1
     ITMO University, Kronverksky Pr. 49, bldg. A, Saint-Petersburg, 197101, Russia
    2
     Saint-Petersburg State University of Aerospace Instrumentation, 67, Bolshaya Morskaia str. St Petersburg,
Russia
   3
     JSC NEO Saint Petersburg Competence Center, 1-Ya Sovetskaya, house 6 str. St. Petersburg, Russia

                Abstract
                With the increasing complexity of distributed control tasks based on their intellectualization,
                there are problems of insufficient time and computing resources for functioning in real time.
                In this regard, there is a need to develop methods for organizing distributed real-time computer
                systems, based on the consolidation of distributed computing resources with their integration
                into clusters. The possibilities of increasing the probability of timely servicing of waiting-
                critical requests in the cluster as a result of query replication and controlling the time of
                destruction of potentially expired replicas in node queues are investigated. The cluster is
                represented as a group of queuing systems with infinite queues with a limited average waiting
                time. The effectiveness of the reserved service of a real-time request is determined by the
                probability of executing at least one of the generated copies of the request in the maximum
                allowable time without losing it due to errors and waiting time limits in the queues of cluster
                nodes. It is shown that there is an optimal multiplicity of query replication with a significant
                influence of the choice of restrictions on the waiting time for requests in queues before they
                are destroyed.

                Keywords 1
                Replication, redundant service, timeliness, real-time, cluster

1. Introduction
    One of the key trends in the development of modern automated control systems is their
intellectualization and construction on the basis of distributed computer systems for storing,
transmitting and processing data. Ensuring the high quality of distributed management is based on the
concept of multi-agent systems. In multi-agent systems, decision-making in changing operating
conditions involves the interaction of agents characterized by autonomy, intelligence, purposefulness
and activity of behavior. The strategy of interaction of agents for making coordinated decisions,
especially when solving difficult-to-formalize tasks, is currently increasingly based on neural network
technology [1-3].
    With the increasing complexity of automated control tasks, there are problems of insufficient time
and computing resources for the functioning of a neural network in real time. In this regard, there is a
need to develop methods for organizing distributed computer systems in real time, based on the
consolidation of distributed computing resources with their integration into clusters [4-6].
    For distributed real-time computer systems, it is fundamental to support the reliability and timeliness
of the computing process, which can be considered as a key condition for their operability. [7- 10].
    The reliability and timeliness of executing requests that are critical to delays in computer systems
and networks can be increased as a result of increasing the computing resources of their consolidation

GraphiCon 2021: 31st International Conference on Computer Graphics and Vision, September 27-30, 2021, Nizhny Novgorod, Russia
EMAIL: vabogatyrev@corp.ifmo.ru (V. Bogatyrev); stanislav@nspcc.ru (S. Bogatyrev); anatoly@nspcc.ru (A. Bogatyrev)
ORCID: 0000-0003-0213-0223 (V. Bogatyrev); 0000-0003-0836-8515 (S. Bogatyrev); 0000-0001-5447-7275 (A. Bogatyrev)
             ©️ 2021 Copyright for this paper by its authors.
             Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
             CEUR Workshop Proceedings (CEUR-WS.org)
and load balancing and priority distribution. Additional opportunities to increase the reliability and
probability of timely execution of requests in real time are provided by their replication with redundant
service [11-13]. Real-time redundant service is successful when at least one replica (copy) of the request
is completed in a time less than the established limits [11-14]. The organization of redundant service in
the cluster, of course, allows you to increase the reliability of calculations, but it is associated with the
need to resolve a technical contradiction. Indeed, replication (copying) of requests executed in different
nodes of the cluster can lead to both a decrease in the time until the first result is received, and to its
increase due to an increase in the total load of the cluster.
    The efficiency and expediency of redundant query execution in a cluster represented by a group of
queuing systems with infinite queues is shown in [11-15]. The possibilities of combining the reserved
service of waiting-critical requests with the regulation of the number of places in the queues of cluster
system nodes are analyzed in [16]. The effect analyzed in [16] is achieved as a result of reducing the
load achieved by destroying requests received when the queue length exceeds the established limits. In
this case, the queue increases above the set limit when the system load increases. The boundary length
of the queue at which requests are not accepted for service at the node is set based on maximizing the
probability of timely servicing at least one of the copies of requests. At the same time, [16] shows the
existence of an optimal multiplicity of reservation requests and an adjustable queue length, at which the
maximum probability of timely error-free service of requests is achieved. Potentially, the effect of
increasing the probability of timely service can also be achieved by combining the reservation of
requests with the regulation of the average waiting time for requests before they are destroyed in queues.
Moreover, the specified average time before the destruction of requests in queues may in principle not
coincide with the maximum allowable time t0 of waiting for requests in queues, set based on the
requirements of application tasks performed in real time.
    The purpose of the work is to study the possibilities of increasing the probability of timely servicing
of waiting-critical requests in the cluster as a result of query replication and regulating the time of
destruction of potentially expired replicas in node queues.
    The effectiveness of the reserved service of a real-time request is determined by the probability of
executing at least one of the generated copies of the request in the maximum allowable time without
losing it due to errors and waiting time restrictions in the queues of cluster nodes.

2. The probability of servicing requests in the cluster when adjusting the time
   of destruction of expired replicas of requests
    A cluster of m nodes is represented as a group of m queuing systems (QMS) with infinite queues
with a limit on the average waiting time t (QMS with impatient customers) [17-19]. If the waiting time
for a request in the queue exceeds the set time, it leaves the system without maintenance (it is destroyed).
It is assumed that the time of destruction of overdue requests has an exponential distribution with an
average t. Note that individual requests can be in the queue for both more and less time t. It should also
be noted that the time t in general does not coincide with the maximum allowable waiting time t0, which
is set based on the requirements for solving real-time application problems.
    Let's consider the service process in some one of the m n-channel QMS with a waiting time limit
[17-19]. In each of the m QMS, a decrease in the number of requests occurs as a result of either the
completion of their service with an intensity of µ=1/v, or as a result of their leaving the queue with an
intensity of τ=1/t. At the same time, v is the average time of their execution,
    The transition from the state with k requests in the Sk system to the state with k-1 requests Sk-1 is
performed at. k