=Paper=
{{Paper
|id=Vol-1689/paper6
|storemode=property
|title=On the Modeling and Performance Evaluation of Cloud Computing Centers Using M/G/c/c+r Queuing System
|pdfUrl=https://ceur-ws.org/Vol-1689/paper6.pdf
|volume=Vol-1689
|authors=Assia Outamazirt,Mohamed Escheikh,Djamil Aïssani,Kamel Barkaoui,Ouiza Lekadir
|dblpUrl=https://dblp.org/rec/conf/vecos/OutamazirtEABL16
}}
==On the Modeling and Performance Evaluation of Cloud Computing Centers Using M/G/c/c+r Queuing System ==
Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Assia Outamazirt Mohamed Escheikh Research unit LaMOS University of Bejaia SYS’COM ENIT Tunis Bejaia, Algeria Tunis, Tunisia outamazirt.assia@gmail.com mohamed.escheikh@gmail.com Djamil Aı̈ssani Kamel Barkaoui Ouiza Lekadir Research unit LaMOS, University of Bejaia CEDRIC, CNAM Research unit LaMOS University of Bejaia Bejaia, Algeria Paris, France Bejaia, Algeria lamos bejaia@hotmail.com kamel.barkaoui@cnam.fr ouizalekadir@gmail.com Analytical resolution of complex queuing systems remains nowadays an open and challenging issue and may be extensively used in modeling and representing dynamic behavior of sophisticated systems. This is particularly the case of M/G/c/c + r queue where exact analytical solution is difficult to reach. In this paper, we propose a new approximate analytical model in order to evaluate the performance of cloud computing center using M/G/c/c + r queuing system. The adopted modeling approach combines two models. The first one is a transform-based analytical model whereas the second relies on an approximate Markov chain. This combination enables to compute the one-step transition probabilities for the system M/G/c/c + r. Cloud Computing, Performance Evaluation, M/G/c/c+r queue, Transition Matrix, Embedded Markov Chain 1. INTRODUCTION throughput, security, performance indicators (Wang et al. (2010)). The US National Institute of Standards and Technology (NIST) has defined cloud computing Providing quality services require a solid model as a model for enabling convenient, on-demand that provides detailed insights of computing centers. network access to a shared pool of configurable Assessing the QoS offered by cloud providers computing resources (e.g., networks, servers, necessitates the accurate performance evaluation of storage, applications, and services) that can be the cloud center. Khazaei et al. (2012) adopted rapidly provisioned and released with minimal M/G/c/c + r queuing system as the abstract model management effort or service provider interaction for performance evaluation due to the characteristics (Mell et al. (2009)). of cloud computing centers: having Poisson arrival of task requests, generally distributed service time, Cloud services are usually divided into main large number of servers and the finite capacity of three categories. Software-as-a-Service (SaaS) is system. a software delivery model in which software and associated data are centrally hosted on the cloud. However analyzing of M/G/c/c + r queuing system Platform-as-a Service (PaaS) provides a computing is not an easy task, an exact solution for this platform like execution runtime, database, web model is only possible for special cases, such as servers, development tools etc. In infrastructure-as- exponential service, a single server, or no waiting a-Service (IaaS), cloud providers offer computers, queue at all. Consequently, an extensive research on as physical or more often as virtual machines, performance evaluation of M/G/c queuing systems servers, storage, load balancers, networks, etc. was described in the literature (Nozaki et al. (1978), (Furht (2010)). Typically a service level agreement Yao (1985)). According to Khazaei et al. (2012), (SLA) gives all the aspects of usage of cloud service the M/G/c approximation approaches that were and the obligations of service providers and clients. It proposed in the literature (Kimura (1983), Tijms et also includes various descriptors known as quality of al. (1981), Boxma et al. (1979)) are not suitable service (QoS) which includes availability, reliability, for performance evaluation of cloud center. They ⃝ c The Authors. Published by BISL. 1 Proceedings of . . . Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir cited three issues that make it difficult to apply these Ani Brown Mary et al. (2013) modeled a cloud center approximations: as an [(M/G/1) : (∞/GD model)] queuing system with a single task arrivals and a task request buffer • a cloud center can have a large number of of infinite capacity. They used analytical methods to facility (server) nodes (Amazon (2010)); evaluate the performance of queuing system and solve it to obtain important performance factors • the complex task service times and higher like mean number of tasks, blocking probability coefficient of variation (CoV) of service time and probability of immediate service. Mean as well distribution (Zhang et al. (2011), Reiss et al. as standard deviation of the number of tasks is (2012)); computed. • due to the dynamic nature of cloud environ- ments, diversity of users requests and time de- Analysis of queuing systems with multiple servers pendency of load, cloud centers must provide and general distributed service time is more expected quality of service at widely varying complex. For these queuing systems the steady loads (Reiss et al. (2012), Xiong et al. (2009)). state probability, the distributions of response time and the queue length cannot be obtained in A new approximate approach used the combination closed form. Consequently, several researchers of a transform-based analytical model and an have developed many methods for approximating its embedded Markov chain model for the steady state solution. probabilities of the M/G/c/c + r queueing system was proposed by Khazaei et al. (2012). This new Kimura (1983) developed a diffusion approximation approach is suitable for cases of large number model for the queue M/G/c. The main idea is to of servers and when the distribution of service approximate formulas for the distributions of the time has a coefficient of variation more than one. number of customers. Kimura (1996a) described Improvements for this approach was proposed by an approximation for the steady state queue length Chang et al. (2014). In these approaches, the distribution in M/G/c queue with finite waiting instants of regeneration of the embedded Markov spaces. chain were misinterpreted and the system state A similar approach in the context of M/G/c queues transition behavior was not describe accurately. was described by Kimura (1996b), but extended so This paper makes some progress towards a good as to approximate the blocking probability and, thus, approximation for the computation of the one-step to determine the smallest buffer capacity such that transition probabilities of the system M/G/c/c + r. the rate of lost tasks remains under predefined level. This enables to enhance previous approximations. Nozaki et al. (1978) proposed an approximation The rest of the paper is organized as follows. for the average queuing delay in a M/G/c/c + r Section 2 presents a brief overview of related work queue based on the relationship of joint distribution on cloud performance evaluation and performance of remaining service time to the equilibrium service characterization of queuing systems. In Section 3, distribution. Smith (2003) proposed a different we present the analytical model in detail. In Section approximation for the blocking probability based on 4, we conclude by outlining same possible new the exact solution for finite capacity M/M/c/c + r directions for future work. queues. The estimate of the blocking probability is used to guide the allocation of buffers so that the blocking probability remains below a specific 2. RELATED WORK threshold. Xiong et al. (2009) modeled a cloud center as the However, the most of the approximations mentioned classic open network, from which the distribution above are not directly applicable to performance of response time was obtained by using Laplace analysis of cloud computing center due these transformation. Using the distribution of response following limitations (Khazaei et al. (2012)): time, the authors found the relationship between the maximum number of customers, the minimal service • for example, approximations proposed by resources and the highest level of services Kimura (1996a), Smith (2003) are reasonably accurate when the number of servers is small, Yang et al. (2009) proposed the M/M/c/c + r below 10 or so. They are not being suitable queuing system for modeling the cloud center, which for the cloud computing centers with more than indicates that both inter-arrival and service times are 100 servers; exponential, the system has a finite buffer of size r and its distribution of response time was obtained. • approximations proposed by Nozaki et al. (1978), Yao (1985) are inaccurate when the 2 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir coefficient of variation of the service time, CoV, The M/G/c/c + r queuing system is a non- is above 1.0; markovian process and can be analyzing by using the embedded Markov chain (EMC) technique • approximation errors are particularly pro- Kleinrock (1975). This will be discussed below. nounced when the traffic intensity ρ is small, and/or when both the number of servers c Task request arrivals follow a Poisson process, which and the CoV of the service time are large means that task inter-arrival time AX (x) , P [X ≤ (Kimura (1983), Tijms et al. (1981), Boxma x] is exponentially distributed with a rate of λ, its et al. (1979)). probability density function (pdf) is a(x) = λe−λx and its Laplace transform is As these approximations mentioned above are ∫ ∞ not directly applicable to performance analysis of λ cloud computing center, Khazaei et al. (2012) A∗ (s) = e−sx a(x)dx = . 0 λ + s described a new approximate analytical model using M/G/c/c + r queueing system. In order to evaluate its performances, they used a combination of a Task service times are identically and independently transform-based analytical model and an embedded distributed according to a general distribution Markov chain model, which obtained a complete HY (y) , P [Y ≤ y] with a mean service time equal probability distribution of response time and number to h = µ1 . its pdf is hY (y) and its Laplace transform is of task in the system. ∫ ∞ ∗ Chang et al. (2014) proposed an approximate H (s) = e−sx h(x)dy. analytical model using M/G/c/c + r queueing 0 system for performance evaluation of cloud center The traffic intensity is ρ , cµ λ . The Residual task close to the proposed by Khazaei et al. (2012), service time, H+ , is the time interval from an both divided the transition probabilities matrix of this arbitrary point (an arrival point) during a service time system into four regions. The difference between to the end of the service time. This time is necessary them lie in the approximation formula of the transition for our model since it represents time distribution probabilities matrix in regions 3 and 4, because between a task arrival and departure of the task according to these authors, the approximate formula which was in service when task arrival occurred. proposed by Khazaei et al. (2012) is an inaccurate The Laplace transform of H+ is calculated by Takagi approximation for the transition probabilities in these (1991) as: regions. ∗ 1 − H ∗ (s) H+ (s) = . (1) sh 3. THE ANALYTICAL MODEL 3.2. Model Analysis 3.1. Model description We use the same EMC technique adopted by We consider a c-server queueing system with Khazaei et al. (2012) for analyzing the M/G/c/c + r Poisson arrivals, general service times, and a queuing system. This EMC technique consists of capacity limit of c + r for the number of tasks in selecting the Markov renewal points at the instant the system, for modeling a cloud data center. In this of a new task arrival to the system. We choose two model we assume that: consecutive arrivals to be the observation interval for the EMC. The basic structure of the EMC is • all c servers render service in order of task shown in Fig.1. Therefore, we model the number request arrivals (FCFS); of the tasks in the system (both those in the service and those in the waiting queue) at the • each busy server is independent of the other moments immediately before the new task arrival, busy serves; if we enumerate these instances as 0, 1, ..., c + r, • if the waiting queue is empty and there is no we obtain a homogeneous Markov chain with state new task request arrival, the server enters in space S = {0, 1, 2, ..., c + r}. This Markov chain is the idle state; ergodic because it is irreducible, recurrent non-null, and aperiodic (Khazaei et al. (2012)). • if the task arrives while the system capacity has already been attained, this task will depart Let tn and tn+1 the moments of nth and (n + immediately without service; 1)th arrivals to the system respectively, while Xn and Xn+1 indicate the number of tasks found in • each task is serviced by a single server. the system immediately before these arrivals, Tn denotes the inter-arrival time between tn and tn+1 , 3 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir Bn+1 indicates the number of tasks which depart • Px is the probability of completing the service from the system between tn and tn+1 . of a task, which has already been in service In the rest of this paper we use T to denote any inter- during the previous observation interval and is arrival time. completed in the current interval. We must calculate the transition probabilities • Py is the probability of completing the service associated with EMC, and so we define of a task, which begins to be serviced in the current interval and is finished within the same pij , P (Xn+1 = j|Xn = i), (2) interval. otherwise, we must calculate • If a server completes the service of a task P (Bn+1 = i + 1 − j|Xn = i), (3) which has already been begun during the previous observation interval, in the current i.e., the probability that i − j + 1 tasks are serviced interval, this server will be idle. If the waiting during T . It is clear that queue is nonempty, that server as well may pij = 0, for j > i + 1. (4) complete a second service in the current interval, and if the waiting queue is still nonempty, a new service may be completed, and so on until the waiting queue gets empty. The probability of k services are completed by a single server is given by Pz,k . After defining the departure probabilities Px , Py and Pz,k in the embedded Markov chain, we describe the four different regions of the transition probability matrix P . These regions are schematically shown in Fig. 2, where the numbers on horizontal and vertical axes correspond to the number of tasks in the system immediately before a task request arrival Figure 1: Embedded Markov points. (i) and immediately before the next task request arrival (j), respectively. Since the EMC is ergodic, an equilibrium probability distribution π = [π0 , π1 , π2 , ..., πm+r ] exists for the number of tasks present at the arrival instants with πi = lim P (Xn = i) for 0 ≤ i ≤ m + r, and it is the n→∞ solution of equation π = πP , where P is the matrix whose elements are the transition probabilities pij . 3.2.1. Transition Probability Matrix To find the elements of the transition probability matrix P , we need to count the number of tasks departing from the system in an observation interval. Each server has zero or more departures in T . The transition probability matrix P has four regions as that defined by Khazaei et al. (2012) and Chang et al. (2014). Before presenting pij in each region, we first define the departure probabilities Px , Py and Figure 2: Range of validity for pij . Pz,k as follows: ∗ Px , P (A > H+ ) = H+ (λ), (5) Region 1: ∗ Py , P (A > H) = H (λ), (6) For i + 1 < j, pij = 0, since j cannot exceed i + 1 (Eq. 4). [∏ k ] Pz,k = P (A > H|A > (k − i)H + H+ ) · P (A > H+ ), i=2 Region 2: (7) For i + 1 ≤ c, 0 ≤ j ≤ c, and i + 1 ≥ j, in this region, all tasks are in the service (no waiting). The where: 4 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir probability that i − j + 1 tasks are served during T is: i−j i−j Ci Px j−1 (1 − Px )j Py + Cii−j+1 Pxi−j+1 (1 − Px ) (1 − Py ), if i + 1 > j; Pij = (8) min(w,c) { ∑ M1 (i, j, w) = Cm Px (1 − Px )c−s1 s1 s1 (1 − Px )j , if i + 1 = j. s1 =c−j min(w−s1 ,i−c+1) [ ∑ Region 3: Css12 Pz,2 s2 s2 =min(w−s1 ,c−j) ]} For c ≤ i ≤ c + r, c ≤ j ≤ c + r, and i + 1 ≥ j, all (1 − Pz,2 )i−c+1−s2 , (14) servers are busy during T . Let ω = i − j + 1 denotes the number of departures in the system, with ω ≥ 0 and, we assume each single server completes no min(w,c) { ∑ more than three services of tasks in T . We define M2 (i, j, w) = Ccs1 Pxs1 (1 − Px )c−s1 the transition probabilities for this region by: s1 =c−j ∑ min(w−s1 ,s1 ) [ M (i, j, w) · χ, if i < c + r; Css12 Pz,2 s2 (1 − Pz,2 )s1 −s2 s2 =min(w−s1 ,c−j) 1 −s2 w−s1 −s2 pij = M (i, j, w − 1) · χ, if i = c + r; (9) Csw−s Pz,3 2 ]} (1 − Pz,3 )s2 −(w−s1 −s2 ) . (15) 0, if i > c + r, where: 3.2.2. Discussion { Between two successive task arrivals (i.e. during T ) ∑ min(w,c) M (i, j, w) = Ccs1 Pxs1 (1 − Px )m−s1 there may be ω = i + 1 − j departures from the s1 =min(w,1) system. In the region 2 there will be at most one ∑ min(w−s1 ,s1 ) [ departure from each server of the system, however, s2 Css12 Pz,2 (1 − Pz,2 )s1 −s2 in regions 3 and 4, there can be more than one s2 =min(w−s1 ,1) departure from any given server. 1 −s2 w−s1 −s2 Csw−s Pz,3 ]} 2 Let us now examine in detail the equations of the (1 − Pz,3 )s2 −(w−s1 −s2 ) , (10) transition probabilities for regions 2, 3 and 4 that we propose and we compare these equations with those proposed by Khazaei et al. (2012) and Chang et al. and χ is the indicator function (2014). 1, if w − s1 − s2 ≤ s2 ; • Region 2 (i + 1 ≤ c, 0 ≤ j ≤ c, and i + 1 ≥ j): χ= (11) In this region, the nth arrival finds the waiting 0, if w − s1 − s2 > s2 . queue empty and i servers busy (all i tasks in the system are in service), then as i ≤ c−1 < c, it will find an idle server and will immediately Region 4: enter into service. However, the (n+1)th arrival For c ≤ i ≤ c + r, 0 ≤ j < c, and i + 1 ≥ j, all servers finds exactly j on arrival, i.e. there are i + are busy at the beginning of T , and c − j servers 1 − j tasks leave the system between two are idle at the end of T . In this region, the transition successive arrivals. We distinguish two cases: probabilities are defined by: – Case 1: If among these i busy servers, i − j of them completed the services of M (i, j, w) · χ, if i < c + r; tasks and, the service of the nth arrival is also completed before the (n+1)th arrival, pij = M (i, j, w − 1) · χ, if i = c + r; (12) then the probability of having i + 1 − j departures from the system in this case 0, if i > c + r, is equal to: where: Cii−j Pxi−j (1 − Px )j Py . M1 (i, j, w), if s1 ≥ i − c + 1; – Case 2: If among these i busy servers, M (i, j, w) = (13) i + 1 − j of them completed the services M2 (i, j, w), if n1 < i − c + 1, of tasks and, the service of the nth arrival 5 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir is not completed, then the probability of al. (2012), with the probability (1 − Pz,3 )s2 . having i+1−j departures from the system According to this authors this equation is equal in this case is equal to: to: ∑ min(w,c) { Cii−j+1 Pxi−j+1 (1 − Px )j−1 (1 − Py ). pij = Ccs1 Pxs1 (1 − Px )m−s1 s1 =min(w,1) In the two cases cited above, there are ∑ min(w−s1 ,s1 ) [ i + 1 − j tasks that leave the system Css12 Pz,2 s2 (1 − Pz,2 )s1 −s2 between two successive arrivals, thus the s2 =min(w−s1 ,1) probability of having i + 1 − j departures 1 −s2 w−s1 −s2 from the system during T is equal to: Csw−s Pz,3 ]} 2 Cii−j Pxi−j (1 − Px )j Py + (1 − Pz,3 )s2 . (17) i−j+1 i−j+1 Ci Px (1 − Px )j−1 (1 − Py ). (16) However, it is impossible to find exactly j tasks This equation was proposed by Khazaei et al. in the system at the end of T in this equation (2012) and Chang et al. (2014), but when we because the authors considered the number of have i + 1 = j, i.e. there is the same number servers that are still busy processing the third of tasks in the system at the beginning and service is set to s2 . Consequently, Chang et al. at the end of T , Eq. 16 can not be used to (2014) proposed a new equation for this region, compute the conditional probability P (Xn+1 = defined as: i + 1|Xn = i). This probability can be obtained { ∑ min(w,c) in the following manner: pij = Ccs1 Pxs1 (1 − Px )m−s1 s1 =min(w,1) (1 − Px )j . ∑ min(w−s1 ,s1 ) [ s2 Css12 Pz,2 (1 − Pz,2 )s1 −s2 That is presents the probability of no having s2 =min(w−s1 ,1) a departure between two successive arrivals. 1 −s2 w−s1 −s2 Csw−s Pz,3 ]} 2 Consequently, the transition probabilities for the region 2 defined in this paper is given by (1 − Pz,3 )s2 −(w−s1 −s2 ) . (18) Eq. 8. • Region 3 (c ≤ i ≤ c + r, c ≤ j ≤ c + r, and In this equation, the remaining ω − s1 − s2 i + 1 ≥ j): tasks that will must leave the system, can be The nth arrival finds all c servers are busy serviced by the subset of s2 servers that have and i − c tasks in the waiting queue. If the completed two services. When the number of number of tasks in the system is strictly less s2 servers is small, then it can happen that than c+r (system capacity), then the nth arrival the number of remaining tasks exceed this will be allowed entry. Therefore, there should number, therefore, in order that the assumption be i − c + 1 tasks in the waiting queue at the to have ”each single server completes no more beginning of T . than three services of tasks in T ” can verify, an In this region, there can be more than one indicator function must be used and it is equal departure from any given server. Among the to 1 if the number of remaining tasks in the c servers, s1 of them complete at least one system is less than the number of s2 servers, service during T . Note that all these completed is equal to 0, otherwise (Eq. 11). services must have already been in service at Also, in this region, if the nth arrival finds the the beginning of T . Among these s1 servers, system full (i = c + r), then it will be lost. s2 of them will complete a second service during T . As we assumed that each single Therefore, there will be i − j tasks that will server completes no more than three services leave the system between the nth arrival and of tasks in T , then the remaining ω − s1 − the (n + 1)th arrival instead of i + 1 − j tasks. s2 services must be completed in T . These Accounting for the above analysis we propose services will complete by a subset of these s2 in this paper a new approximate formula for servers. The number of servers in the subset the computation of the element pc+r,j , for all is ω − s1 − s2 , that is, there are ω − s1 − j. Consequently, a new accurate approximation s2 servers, each of which completes exactly of the transition probabilities for the region 3 is three services in T and, the number of servers defined in this paper by Eq. 9. This enables to that are still busy processing the third service enhance previous approximations. should be set to s2 − (ω − s1 − s2 ). This number is set to s2 in the equation of the • Region 4 (c ≤ i ≤ c + r, 0 ≤ j < c, and transition probabilities proposed by Khazaei et i + 1 ≥ j): 6 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir In this region, at the beginning of T there are In the case where s1 ≥ i − c + 1, the Eq. 20 will i − c + 1 tasks in the waiting queue and all c be equal to: servers are busy, while at the end of T , the min(ω,c) { ∑ waiting queue is empty and there are c − j pij = 1 Ccs1 Pxs1 (1 − Px )c−s1 servers are idle. (1 − Pz,3 )c−j s =c−j 1 The equation of the transition probabilities ∑ min(ω−s1 ,s1 ) [ proposed by Khazaei et al. (2012) for this s2 Ci−c+1 s2 Pz,2 (1 − Pz,2 )s1 −s2 region is given by: s2 =min(ω−s1 ,c−j) ω−s1 −s2 min(ω,c) { Cmin(max(i−c+1−s P ω−s1 −s2 ∑ 1 ,0),s2 ) z,3 ]} pij = Ccs1 Pxs1 (1 − Px )c−s1 (1 − Pz,3 )s2 −max(i−c+1−s1 −s2 ,0) , (21) s1 =c−j ∑ min(w−s1 ,s1 ) [ therefore, the max(i − c + 1 − s1 , 0) = 0, Css12 Pz,2 s2 (1 − Pz,2 )s1 −s2 min(max(i − c + 1 − s1 , 0), s2 ) = 0 and max(i − s2 =min(ω−s1 ,c−j) c + 1 − s1 − s2 , 0) = 0. As the waiting queue is 1 −s2 w−s1 −s2 Csω−s Pz,3 empty (all i − c + 1 tasks in waiting queue enter ]} 2 (1 − Pz,3 )s2 . (19) service because s1 ≥ i−c+1), then the number of servers that will complete three services during T is equal to 0 (i.e. ω − s1 − s2 =0). In this equation, Khazaei et al. (2012) had not Therefore, the number of servers that will be taken into account the number of tasks in the idle at the end of T is equal to s2 , i.e. c−j = s2 . waiting queue at the beginning of T . However, Thus, Eq. 21 will be equal to: in this region we distinguish two cases: – case 1: If s1 ≥ i − c + 1, there are at most min(ω,c) { ∑ i−c+1 tasks that begin service at a subset pij = Ccs1 Pxs1 (1 − Px )c−s1 of the s1 servers that have completed one s1 =c−j services in T and, there are s1 − (i − c + 1) ∑ min(ω−s1 ,s1 ) [ s2 s2 servers remain idle because the waiting Ci−c+1 Pz,2 s2 =min(ω−s1 ,c−j) queue is empty. ]} – case 2: If s1 < i − c + 1, there are s1 tasks (1 − Pz,2 )s1 −s2 . (22) begin service at a subset of the s1 servers that have completed one services in T . In this equation, if the number of servers that Chang et al. (2014) proposed a new equation complete the second service is equal to i−c+1, for the transition probabilities for this region: then at the end of T , the number of servers remain busy processing the second service is min(ω,c) { ∑ equal to i − c + 1 − s2 , while in the Eq. 22, 1 pij = Ccs1 Pxs1 (1 − Px )c−s1 this number is equal to s1 − s2 , consequently, (1 − Pz,3 )c−j s =c−j 1 the number of tasks in the system at the ∑ min(ω−s1 ,s1 ) [ end of T exceeds j. Therefore, the coefficient s2 P s2 (1 − Pz,2 )s1 −s2 (1−Pz,3 )c−j is not valid when s1 ≥ i − c + 1, but Cmin(i−c+1,s 1 1 ) z,2 s2 =min(ω−s1 ,c−j) it is valid just when s1 < i − c + 1. ω−s1 −s2 Cmin(max(i−c+1−s P ω−s1 −s2 1 ,0),s2 ) z,3 ]} Accounting for the above analysis for this (1 − Pz,3 )s2 −max(i−c+1−s1 −s2 ,0) . (20) region we study the two cases cited above separately and we propose in this paper a new In this equation, the authors considered the approximate formula for each case (Eq. 14, number of servers that are still busy processing Eq. 15 resp.). Consequently, a new accurate the third service is equal to s2 − max(i − c + approximation of the transition probabilities for 1 − s1 − s2 , 0), because they assumed that all the region 4 is defined in this paper by Eq. 12. servers that enter in the idle state during T they do not complete the service, even though the servers are idle. In order to correctly account 4. CONCLUSION for the c − j idle servers at the end of T , In this paper we used the same analytical model, they multiplied the equation of the transition 1 M/G/c/c + r queuing system, proposed by Khazaei probabilities for this region by (1−Pz,3 )c−j . et al. (2012) for modeling the cloud computing center. However, we focused on the one-step transition matrix of this model because we constated 7 Stochastic Model for Cloud Dada Center with M/G/c/c+r Queue Outamazirt • Escheikh • Aı̈ssani • Barkaoui • Lekadir that the analysis of this model differs from an Amazon (2010) Amazon Elastic Compute Cloud, author to another (see the analysis of Khazaei User Guide, API Version ed., Amazon Web et al. (2012), Chang et al. (2014)). So, we Service LLC or Its Affiliate. Available from proposed a new approximation formulas for the http://aws.amazon.com/documentation/ec2. one-step state-transition probabilities. Compared to the existing approximation formulas for the one- Zhang Q., Hellerstein J. and Boutaba R. (2011) step state-transition probabilities, our approximation Characterizing task usage shapes in Google’s formulas improved the computation of the transition compute clusters, LADIS Workshop. probabilities of the M/G/c/c + r transition matrix. Reiss C., Tumanov A., Ganger G. R., Katz R., and Kozuch M. (2012) Heterogeneity and Dynamicity Our future work, will be consecrate to the of Clouds at Scale: Google Trace Analysis. Proc. presentation of the numerical results. We also plan ACM Symposium on Cloud Computing. to extend the model M/G/c/c + r queuing system with one-step transition matrix by considering the Xiong K. and Perros H. (2009) Service Performance services with different priorities and batch-task and Analysis in Cloud Computing. proceedings of arrivals. the 2009 Congress on Services. Los Angeles, CA, 6-10 July 2009. IEEE. 693-700. REFERENCES Chang X., Wang B., Muppala J. K., Liu J. (2014) Modeling active virtual machines on IaaS clouds Mell P., and Grance T. (2009) The NIST using an M/G/m/m+K queue. IEEE Transactions Definition of Cloud Computing. Available from on Services Computing, vol. PP. 1-1. http://www.cloudbook.net/resources/stories/the- nist-definition-of-cloud-computing Yang B., Tan F., Dai Y. and Guo S. (2009) Perfor- mance Evaluation of Cloud Service Considering Furht B. (2010) Cloud Computing Fundamentals. In: Fault Recovery. proceedings First International Furht B. and Escalante A. (Eds.). Handbook of Conference, CloudCom 2009. Beijing, China, 1-4 Cloud Computing. Springer. 3-19. December 2009. Springer Berlin Heidelberg. 571- Wang L., Von Laszewski G., Younge A. , He X., 576 Kunze M., Tao J., and Fu C. (2010) Cloud Ani Brown Mary N. and Saravanan K. (2013) Computing: A Perspective Study. New Generation Performance Factors of Cloud Computing Data Computing, vol. 28. 137-146. Centers Using [(M/G/1) : (∞/GD M ODEL)] Khazaei H., Misic J. , and Vojislav B. M. (2012) Queuing Systems. International Journal of Grid Performance analysis of cloud computing centers Computing & Applications IJGCA, Vol 4, 1-9. using M/G/m/m + r queueing systems. IEEE Kimura T. (1996a) A Transform-Free Approximation Transactions on Parallel and Distributed Systems, for the Finite Capacity M/G/s Queue. Operations vol. 23. 936-943. Research, vol. 44, 984-988. Nozaki S.A. and Ross S.M. (1978) Approximations in Kimura T. (1996b) Optimal Buffer Design of an Finite-Capacity Multi-Server Queues with Poisson M/G/s Queue with Finite Capacity. Comm. in Arrivals. Applied Probability, vol. 15, 826-834. Statistics Stochastic Models, vol. 12, 165-180. Yao D.D. (1985) Refining the diffusion approximation Smith J.M. (2003) M/G/c/K Blocking Probability for the M/G/m queue. Operations Research, vol. Models and System Performance. Performance 33, 1266-1277. Evaluation, vol. 52, 237-267. Kimura T. (1983) Diffusion Approximation for an Kleinrock L. (1975) Queueing Systems vol. 1, M/G/m Queue. Operations Research, vol. 31, Theory. Wiley-Interscience. 304-321. Takagi H. (1991) Queueing Analysis. vol. 1, Vacation Tijms H.C., Hoorn M.H.V. and, Federgru A. (1981) and Priority Systems. North-Holland. Approximations for the Steady-State Probabilities in the M/G/c Queue. Advances in Applied Probability, vol. 13, 186-206. Boxma O.J., Cohen J.W. , and, Huffel N. (1979) Approximations of the Mean Waiting Time in an M/G/s Queueing System. Operations Research, vol. 27, 1115-1127. 8