The Timeliness of the Reserved Service in the Cluster with the Regulation of the Time of Destruction of Overdue Requests in the Node Queues Vladimir Bogatyrev 1,2, Stanislav Bogatyrev 1,3 and Anatoly Bogatyrev 3 1 ITMO University, Kronverksky Pr. 49, bldg. A, Saint-Petersburg, 197101, Russia 2 Saint-Petersburg State University of Aerospace Instrumentation, 67, Bolshaya Morskaia str. St Petersburg, Russia 3 JSC NEO Saint Petersburg Competence Center, 1-Ya Sovetskaya, house 6 str. St. Petersburg, Russia Abstract With the increasing complexity of distributed control tasks based on their intellectualization, there are problems of insufficient time and computing resources for functioning in real time. In this regard, there is a need to develop methods for organizing distributed real-time computer systems, based on the consolidation of distributed computing resources with their integration into clusters. The possibilities of increasing the probability of timely servicing of waiting- critical requests in the cluster as a result of query replication and controlling the time of destruction of potentially expired replicas in node queues are investigated. The cluster is represented as a group of queuing systems with infinite queues with a limited average waiting time. The effectiveness of the reserved service of a real-time request is determined by the probability of executing at least one of the generated copies of the request in the maximum allowable time without losing it due to errors and waiting time limits in the queues of cluster nodes. It is shown that there is an optimal multiplicity of query replication with a significant influence of the choice of restrictions on the waiting time for requests in queues before they are destroyed. Keywords 1 Replication, redundant service, timeliness, real-time, cluster 1. Introduction One of the key trends in the development of modern automated control systems is their intellectualization and construction on the basis of distributed computer systems for storing, transmitting and processing data. Ensuring the high quality of distributed management is based on the concept of multi-agent systems. In multi-agent systems, decision-making in changing operating conditions involves the interaction of agents characterized by autonomy, intelligence, purposefulness and activity of behavior. The strategy of interaction of agents for making coordinated decisions, especially when solving difficult-to-formalize tasks, is currently increasingly based on neural network technology [1-3]. With the increasing complexity of automated control tasks, there are problems of insufficient time and computing resources for the functioning of a neural network in real time. In this regard, there is a need to develop methods for organizing distributed computer systems in real time, based on the consolidation of distributed computing resources with their integration into clusters [4-6]. For distributed real-time computer systems, it is fundamental to support the reliability and timeliness of the computing process, which can be considered as a key condition for their operability. [7- 10]. The reliability and timeliness of executing requests that are critical to delays in computer systems and networks can be increased as a result of increasing the computing resources of their consolidation GraphiCon 2021: 31st International Conference on Computer Graphics and Vision, September 27-30, 2021, Nizhny Novgorod, Russia EMAIL: vabogatyrev@corp.ifmo.ru (V. Bogatyrev); stanislav@nspcc.ru (S. Bogatyrev); anatoly@nspcc.ru (A. Bogatyrev) ORCID: 0000-0003-0213-0223 (V. Bogatyrev); 0000-0003-0836-8515 (S. Bogatyrev); 0000-0001-5447-7275 (A. Bogatyrev) ©️ 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) and load balancing and priority distribution. Additional opportunities to increase the reliability and probability of timely execution of requests in real time are provided by their replication with redundant service [11-13]. Real-time redundant service is successful when at least one replica (copy) of the request is completed in a time less than the established limits [11-14]. The organization of redundant service in the cluster, of course, allows you to increase the reliability of calculations, but it is associated with the need to resolve a technical contradiction. Indeed, replication (copying) of requests executed in different nodes of the cluster can lead to both a decrease in the time until the first result is received, and to its increase due to an increase in the total load of the cluster. The efficiency and expediency of redundant query execution in a cluster represented by a group of queuing systems with infinite queues is shown in [11-15]. The possibilities of combining the reserved service of waiting-critical requests with the regulation of the number of places in the queues of cluster system nodes are analyzed in [16]. The effect analyzed in [16] is achieved as a result of reducing the load achieved by destroying requests received when the queue length exceeds the established limits. In this case, the queue increases above the set limit when the system load increases. The boundary length of the queue at which requests are not accepted for service at the node is set based on maximizing the probability of timely servicing at least one of the copies of requests. At the same time, [16] shows the existence of an optimal multiplicity of reservation requests and an adjustable queue length, at which the maximum probability of timely error-free service of requests is achieved. Potentially, the effect of increasing the probability of timely service can also be achieved by combining the reservation of requests with the regulation of the average waiting time for requests before they are destroyed in queues. Moreover, the specified average time before the destruction of requests in queues may in principle not coincide with the maximum allowable time t0 of waiting for requests in queues, set based on the requirements of application tasks performed in real time. The purpose of the work is to study the possibilities of increasing the probability of timely servicing of waiting-critical requests in the cluster as a result of query replication and regulating the time of destruction of potentially expired replicas in node queues. The effectiveness of the reserved service of a real-time request is determined by the probability of executing at least one of the generated copies of the request in the maximum allowable time without losing it due to errors and waiting time restrictions in the queues of cluster nodes. 2. The probability of servicing requests in the cluster when adjusting the time of destruction of expired replicas of requests A cluster of m nodes is represented as a group of m queuing systems (QMS) with infinite queues with a limit on the average waiting time t (QMS with impatient customers) [17-19]. If the waiting time for a request in the queue exceeds the set time, it leaves the system without maintenance (it is destroyed). It is assumed that the time of destruction of overdue requests has an exponential distribution with an average t. Note that individual requests can be in the queue for both more and less time t. It should also be noted that the time t in general does not coincide with the maximum allowable waiting time t0, which is set based on the requirements for solving real-time application problems. Let's consider the service process in some one of the m n-channel QMS with a waiting time limit [17-19]. In each of the m QMS, a decrease in the number of requests occurs as a result of either the completion of their service with an intensity of µ=1/v, or as a result of their leaving the queue with an intensity of τ=1/t. At the same time, v is the average time of their execution, The transition from the state with k requests in the Sk system to the state with k-1 requests Sk-1 is performed at. k