=Paper=
{{Paper
|id=Vol-2815/CERC2020_paper24
|storemode=property
|title=Operational Management of Data Centers Energy Efficiency by Dynamic Optimization -Based on a Vector Autoregressive Model- Reinforcement Learning(VAR-RL) Approach
|pdfUrl=https://ceur-ws.org/Vol-2815/CERC2020_paper24.pdf
|volume=Vol-2815
|authors=Yang Hai,Truong Van Nguyen,Youngseok Choi,Mohammed Bahja,Habin Lee
|dblpUrl=https://dblp.org/rec/conf/cerc/HaiNCBL20
}}
==Operational Management of Data Centers Energy Efficiency by Dynamic Optimization -Based on a Vector Autoregressive Model- Reinforcement Learning(VAR-RL) Approach==
Business and Society Operational Management of Data Centers Energy Efficiency by dynamic optimization -Based on a Vector Autoregressive Model- Reinforcement Learning(VAR- RL) approach Yang Hai1[0000-0003-4656-1594] ,Truong Van Nguyen2[0000-0001-9380-5710], Youngseok Choi3, Mohammed Bahja4[0000-0002-2138-1784], and Habin Lee5[0000-0003-0071-4874] 1,2,3,5 Brunel University London, Middlesex UB8 3PH, UK 4 University of Birmingham, Edgbaston, Birmingham B15 2TT, UK Yang.Hai@Brunel.ac.uk TruongVan.Nguyen@Brunel.ac.uk Youngseok.Choi@Brunel.ac.uk M.Bahja@Bham.ac.uk Habin.Lee@Brunel.ac.uk Abstract. With the increasing demands of digital computing, Data Centers (DCs) have become a leading scheme for global energy issues. Major efforts that can be observed for DC energy efficacy solutions are focusing on relatively problematic infrastructure designs. Nevertheless, we emphasised the managerial strategies of using the existing facilities to achieve energy efficiency through active interven- tion. It is believed that there exists a trade-off between the cooling devices and IT devices. Accordingly, the Vector Autoregressive Model- Reinforcement Learning(VAR-RL) approach will be proposed as a combination of traditional multivariate time series modeling technique and the artificial intelligence tech- nique which allows us to predict and adjust the prediction of an error would help to explore the complex dynamic interrelationships between the two types of de- vices. Moreover, an optimization decision support system will also be conducted subsequently to optimize Power Usage Effectiveness (PUE) by controlling the combination of Air Conditioners (ACs). The proposed VAR-RL approach would not only increase the forecasting accuracy but also would adapt to the environ- ment changes dynamically, this would give a better foundation for the DC energy efficiency optimization. The data we adopted is the real-time data from a DC located in Turkey. Consequently, the novel of this study would save the DC en- ergy consumption tremendously. Keywords: DC, Energy consumption, optimisation. 1 Introduction DCs are energy-intensive industries and they are taking 1-1.5% of global electricity usage every year [1]. There are two main units of DCs that are supplied the energy, IT, Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License 371 CERC 2020 Attribution 4.0 International (CC BY 4.0). Business and Society 2 and cooling units. Widely cited studies and conservatively evaluation demonstrated the fact that there are around 40% of energy has been taken by the cooling system in a typical air-cooling DC [2]. IT equipment, along with its respective power supply, cre- ates heat when being in use and will raise the surrounding ambient temperature. In con- sequence, the equipment is likely to fail if the temperature becomes too high. Previous to 2004, IT companies were too fixated upon the performance of their equipment and adjusted the equipment’s environment with only reliability in mind, with little weighting to energy costs. Nowadays the temperature range has been wider to 18 − 27℃ (ASHRAE 2016)[3]. However, there are over 90% of DCs still keeping a constant temperature which means that the DCs are over-cooled and energy inefficiency [4]. Since most of the energy consumption lies in cooling, efforts have been made on reducing the cooling energy consumption in the DCs. It has been found that configura- tion design predominantly affects on energy consumption. As long as there is a design revolution, the application still requires a long time. As a result, traditional air-cooling will still domain the DC cooling system in the next few decades [5]. Therefore, we are seeking DC cooling efficiency solutions from a different aspect. As we mentioned ear- lier, cooling devices are the biggest energy consumers in the DC however there is no guideline on how to make the optimal usage of them. The common operations in DCs are still following traditional rules by turning off some certain number of ACs to save energy in winter. But to our knowledge, there is no sophisticated analysis so far to guide the optimal use of ACs combination, which gives us ideas to fill up this gap. According to the relationship of energy consumption units both IT and Air condi- tioning devices, a lower temperature will increase the energy supplied, inversely, will reduce the energy consumption of IT devices due to an increasing computing effi- ciency. Therefore, smart operation management on DCs to find out the optimal solution on temperature control to minimize the energy consumption without affecting the per- formance of IT devices and meeting the service-level agreement has been investigated. Therefore, this has become our motivation for this study. The question can be modeled as a Linear Programming (LP) problem. However, applying the LP method to the prob- lem requires understanding the complex interactions among many variables within the DC. To solve and simplify this issue, we adopt VAR model to identify such complex interactions. It has been taken to account for common features of the industry big data Changes rapidly in the structures. Changes in server workload, outside environment, device locations or human intervention all can be reasons that lead to structural breaks of the series. Numerous empirical studies put attention on post-event detection, which wildly used for economic or business analysis, while rarely of them are looking at this issue in a real-time. Instead, we expect the model would able to autonomous evaluate itself and correct the mistake once it notices it. In this study, we adopt RL approach for dynamic real-time adjustment of VAR model. It would take the responsibility to detect the structural break and trigger the parameter re-estimated system. With the proposed VAR-RL approach, the subsequent optimization problem can be solved with the con- sideration of the changes in the environment in real-time. Our contributions to the literature including the following: (1) Give the DC energy- efficient solution without changing DC configurations. (2) A dynamic simulation sys- CERC 2020 372 Business and Society 3 tem based on VAR-RL approach has been made, this will provide an efficient and ac- curate forecast for the complex environment with the adjustment of structural changes in the data. (3) Real-time optimization will be conducted based on the simulation result and future optimization also can be made by the forecasted data set. (4) This study will also arouse the environmental awareness of energy saving. 2 Literatures review We have searched the empirical studies on DC effective cooling strategy of simulation- based optimization. Among the ten results, eight of them adopted the Computational Fluid Dynamic (CFD) while and the rest of them used Data-driven Models (DDM) model and other configuration design simulation tools for airflow simulation. Most of them are from a pure configuration design and layout aspect, ie. [6] on sen- sors placement strategy and [8] on air aisle and racks layouts [7]. There are only [8, 9, 10] among the results that looking at the optimal temperature solutions, however [9] it doesn’t take the trade-off relationships between cooling and IT into considerations and [8] is an equation-based simulation that looking at system network control. Also [10] studied the combination of water and airflow in Indirect Adiabatic Cooling (IAC) DC. Numerous studies that use the First principle (FP) in terms of DC objectives largely rely on pre-defined algorithms. However, in practical, there are a variety of unknown relationships that cannot be acquired from physical principles. Data-Driven Models (DDMs) avoid this problem by adopting experimental data to train a system. There is study compares temperature prediction performance of four different types of DDMs including Artificial Neural Networks (ANN), Support Vector Regression (SVR), Gaussian Process Regression (GPR) as well as Proper Orthogonal Decomposition (POD) in a DC, the training data is given by CFD simulation and the result demon- strated that most of them can give a relatively accurate prediction however only ANN could handle multiple output points in one model. Because of the unknown features of the system and multi-dimensional problem need to be solved in one model, so that it requires a large volume of data to feed in the model and moreover, all these types of models are facing similar difficulties which are computational expensive practical cases and relatively time-consuming [11]. We conservatively conclude that our VAR-RL approach would be the first study that further extended Linear Regression (LR) based RL to analyze complex industrial envi- ronment, then apply the simulation result to real-time optimization in industrial practi- cal case. Due to the limited resources, we reviewed similar studies that used the similar method for different problems. RL approach has been used for an auto-select different combination of data streams to feed to the parameters-fixed LRs and practical applica- tion on typhoon rainfall prediction shows a better performance than traditional LRs [12]. More comprehensively, a Multi-Agent reinforcement learning (P-MARL) on pre- dicting the future environment which allows the agents to adapt to the changes off-line by the combination of ANN and Autoregressive Integrated Moving Average (ARIMA) models, this joint approach also increased the prediction accuracy of the agents [13]. Moreover, efforts have been made on adapting RL to the side of the sensor to reduce 373 CERC 2020 Business and Society 4 the energy transmission cost in the wireless sensor networks for signal prediction [14]. These approaches provided evidences that with RL adjustment, the prediction accuracy would be largely increased and gives the model more flexibility by self-learning during prediction without any training data which would require large storage and costs for computing. Our VAR-RL approach would take advantage of these empirical studies, moreover, we will adapt this into DC industrial practice: (1) Our model parameters will be dynam- ically changed according to the feedback from the learning process that would allow our model to adapt to the environment changes. (2) We will propose a time-series linear regression model that will not only consider the previous one step (MACOV) but the whole period which has an effect on the present. (3) RL tool will be plugin to determine whether the model should be reused or rebuilt. (4) We will also avoid using the tech- nique which has a black-box property such as Artificial Neuron Network (ANN) be- cause it hides the interrelationships into the black-box procedure which limited the in- terpretative of the model. (4) Also, as we expect to perform a light, fast, and efficient model to adapt to the rapid industrial practice. we will avoid the use of techniques that requires huge size of training data and local storage. (5) We will not change any DC configurations including the sensors, only managerial strategies will be applied to the DC energy-saving practice. 3 Methodology. 3.1 VAR model- A fundamental simulator Based on the field study in TUKSAT DC target IT room, we identified the main factors that participated in the IT room computing environment. We are going to include ceil- ing sensors temperatures, Server rack inlet and outlet temperatures, air conditioner out- flow temperatures as well as PDU values of the servers as our endogenous variables. To our knowledge, IT room objectives are mutually affected by each. As the graph shows below, we can infer that each variable can affect the others in two directions (direct or indirect), shown as a circulation (Fig.1). Statistical test (Granger causality test) results also confirmed the underlined inferences. Therefore, we briefly include all the related variables into one VAR model. Fig. 1. The DC variables and relationships Here we present how VAR modelling the above dynamic. VAR assumes all the var- iables to be endogenous and explain those endogenous variables one by one by all their CERC 2020 374 Business and Society 5 past values. This allows us to use the estimated model to predict the future values of variables. A 𝑝𝑡ℎ order 𝑉𝐴𝑅(𝑝) can be represented as: 𝑋𝑡 = ∑𝑘𝑖=1 П𝑖 𝑋𝑡−𝑖 + 𝐶 + 𝑢𝑡 (1) Assume we have 𝑁 variables, then 𝑋𝑡 is the 𝑁 × 1 order time series vector, 𝐶 is the 𝑁 × 1 order constant vector, the П𝑖 is the 𝑁 × 𝑁 order parameter matrix, 𝑢𝑡 is the 𝑁 × 1 order random error vector. We can extend the above equation to the matrix for- mula as following (The lag length will be selected by the combination of "AIC", "HQ", "SC", "FPE" criteria): (2) As VAR model is a dynamic forecasting model. We can use it to simulate the DC environment as well as forecast the future values of each variable. After we get the model parameters by real-time estimation, we will feed the data that cover the lag length and forecast the future value of each variable. To make it clear, here we summarize the procedure to train a VAR model and use it as a simulator to forecast DC environment in a flowchart (See Fig.2). Fig. 2. The structure of the simulator We first downloaded data from the historical Application Programming Interface (API) (Step 1). After data processing (Step 2), we use this data set to train a VAR model (Step 3). Then download historical data again (Step 4), after processing the data for another time (Step 5), we extract the most recent data which cover our lags, then input to the VAR model. The simulator will carry out iterations until reach to the requested forecasting length. 375 CERC 2020 Business and Society 6 3.2 RL -A dynamic environment adaptor As we mentioned earlier, the data estimated in DC has a rapidly changing feature, there- fore our fundamental simulator VAR may not apply for some special cases: ie., sud- denly changing load by holidays or online exams, temperatures changes, or other hu- man interventions. Therefore, we need to adjust our model to ensure the prediction ac- curacy and be able to detect the environment changes and adapt itself to the changing world. Reinforcement learning as an environment adaptor to VAR model will be intro- duced in this section. The process is shown in the following graph (Fig.3). Fig. 3. The environment adaptor based on RL With every prediction, we will have an evaluation of the accuracy. And we will give a reward (or punishment) to each prediction. The accumulated reward would be: 𝑅𝑐𝑢𝑚 = ∑𝑛𝑔=1 𝑅𝑡+𝑔 (3) Where 𝑅𝑐𝑢𝑚 is the total cumulated reward values, 𝑅 is the reward for each forecast eval- uation. With the number of time steps increasing, the difficulty level to predict would be increasing too, to make it fair enough for the judgement, we assign the weight to each reward, and the weight of the reward would be decreasing over the time. 𝑅𝑐𝑢𝑚 = ∑∞ 𝑘 𝑘=0 𝛾 𝑅𝑡+𝑘+1 , 𝛾 ∈ [0,1) (4) Where, 𝛾 is the weight. Time windows will be plugin at every seasonality changing point. And the system will trigger RL to evaluate the prediction result. In this case, the prediction result from the fundamental simulator VAR will be evaluated by the error rate. A reward will be given to each evaluation. When the accumulated reward value reaches to a certain boundary, the environment changes will be detected. Then the RL adaptor will trigger the alarm then the VAR model parameters and features will be rebuilt. 3.3 LP approach- An energy efficiency optimizer Empirical evidences show that increasing the AC setpoint by a single degree can result in 4-5% energy cost savings; and increasing the setpoint by 10 degrees, which is also a realistic number, can result in savings of over 40% [15]. Although this sounds straight- forward and simple, considering the complex nature of DC assets, it is hardly the case. Increasing the AC setpoints blindly can jeopardize the health of servers and other hard- ware, as existing hot spots may become even hotter and higher hot aisle temperature CERC 2020 376 Business and Society 7 may activate server fans and offset efficiency gains. Therefore, a rigorous plan for op- timizing the AC temperature setpoint is critical to increasing the energy efficiency of the DC. Particularly, we aim to optimize the energy efficiency in the DC by determining the optimal combination of the supplied temperature of AC units, while taking into consideration the dynamic nature of IT power consumption, as well as satisfying the temperature constraints. We will use the following notations: 𝑆 = {𝑆1 , 𝑆2 , … , 𝑆𝑚 } denotes the Server rack number 1 to 𝑚 𝐶 = {𝐶1 , 𝐶2 , … , 𝐶𝑙 } is a set of ACs unit number 1 to 𝑙. 𝑠𝑢𝑝 𝑇𝑥 (𝑡): the temperature supply of the 𝑥 𝑡ℎ AC unit at time 𝑡. 𝑇𝑗𝑖𝑛 (𝑡): the inlet temperature of the 𝑗𝑡ℎ server rack at time 𝑡. 𝑇𝑗𝑜𝑢𝑡 (𝑡): the outlet temperature of the 𝑗𝑡ℎ server rack at time 𝑡. 𝑇 𝑟𝑜𝑜𝑚 (𝑡): the room temperature at time 𝑡. 𝐶𝑜𝑚𝑝 𝑃𝑗 (𝑡): the computational power (PDU) for the 𝑗𝑡ℎ server rack at time 𝑡 . 𝑃𝑥𝐶𝑜𝑜𝑙 (𝑡): the computational power for the 𝑥 𝑡ℎ AC unit at time 𝑡. 𝐸[𝑊𝑗 (𝑡)]: the estimated workload (CPU usage) for the 𝑗𝑡ℎ server rack at time 𝑡 . 𝐶𝑜𝑃𝑥 : the coefficient of performance. 𝐶𝑇𝐼𝑗𝑥 is the thermal correlation index. The IT consumption . Assumingly, the IT power consumption would not only be influenced by the compu- tational workload, but also will be affected by the working temperature because the temperature will affect its working performance. Hence, at any time 𝑡, the total IT com- puting power of server rack 𝑆𝑗 is the function of power spent on executing IT jobs and the rack inlet temperature. 𝐶𝑜𝑚𝑝 𝑃𝑗 (𝑡) = 𝛼𝑗 𝐸[𝑊𝑗 (𝑡)] + 𝛽𝑗 𝑇𝑗𝑖𝑛 (𝑡) (5) Where 𝛼𝑗 and 𝛽𝑗 are weight coefficients. The cooling consumption . Based on [16,17,18], the cooling cost of AC device 𝐶𝑥 ∈ 𝐶 can be presented as: 𝐶𝑜𝑚𝑝 𝐶𝑇𝐼𝑥𝑗 ∑𝑚 𝑗=1 𝑃𝑗 (𝑡) 𝑃𝑥𝐶𝑜𝑜𝑙 (𝑡) = 𝑠𝑢𝑝 (6) 𝐶𝑜𝑃𝑥 (𝑇𝑥 (𝑡)) Where 𝐶𝑜𝑃𝑥 is the performance coefficient, shown as the ratio of the amount of heat the AC device 𝐶𝑥 needs to remove to the energy it needs to consume to perform the removal. 𝐶𝑜𝑃𝑥 indicates the efficiency of the AC device, and is typically a non-linear, 𝑠𝑢𝑝 increasing function of the supplied cold air temperature, 𝑇𝑥 (𝑡). It means that operat- ing the AC system at a higher temperature is saving energy, as providing colder air requires the AC to work harder and consume more energy to remove heat. Hence, we can minimise 𝑃𝑥𝐶𝑜𝑜𝑙 (𝑡) by maximise the allowable supplied cold air temperature, 𝑠𝑢𝑝 𝑇𝑥 (𝑡) that satisfies the constraint of redline thresholds. The simulation approach will 377 CERC 2020 Business and Society 8 𝑠𝑢𝑝 also be used to get the function of 𝐶𝑜𝑃𝑥 (𝑇𝑥 (𝑡)). 𝐶𝑇𝐼𝑗𝑥 is the thermal correlation in- ∆𝑇𝑗𝑖𝑛 dex, 𝐶𝑇𝐼𝑗𝑥 = 𝑠𝑢𝑝 , which represents the influence of each AC unit 𝐶𝑥 on inlet temper- ∆𝑇𝑥 ature of server rack 𝑆𝑗 . As defined in Eq. (6), it quantifies the response of the server 𝑠𝑢𝑝 𝑆𝑗 ’s inlet temperature 𝑇𝑗𝑖𝑛 to a step-change in the supply temperature 𝑇𝑥 of 𝐶𝑥 . 𝐶𝑇𝐼𝑗𝑥 is a static metric, which is stable with time but based on the physical configura- tion of the DC. Hence, we use the simulation approach to get the value of 𝐶𝑇𝐼𝑗𝑥 . The detailed explanation of this metric can be seen in [17,18]. Thermal modelling . According to the law of energy conservation, almost all the computing power con- sumed by a server is transformed into heat, hence the relationship between the power consumption and inlet/outlet temperature of server rack 𝑆𝑗 can be presented as: 𝐶𝑜𝑚𝑝 𝑇𝑗𝑜𝑢𝑡 (𝑡) = 𝑇𝑗𝑖𝑛 (𝑡) + 𝐾𝑗 𝑃𝑗 (𝑡) (7) Where 𝐾𝑗 = 𝑝𝑓𝑗 𝑐 is the thermal-physical term. This can be estimated by our data ob- tained in DC. Typically, the server’s inlet temperature (𝑇𝑗𝑖𝑛 ) tends to be higher than the AC’s sup- 𝑠𝑢𝑝 plied air temperature (𝑇𝑥 ) due to the phenomenon so-called heat recirculation where the hot air from the server and the supplied cool air from the AC are mixed then recir- culats in the room. Based on the energy conservation as described in Eq. (8) and the assumption of the fixed airflow pattern in the computer room, prior studies (eg. [19]) characterise this phenomenon with a heat distribution matrix 𝐴 = {𝑎𝑗𝑜 } where 𝑎𝑗𝑜 is the temperature increase at the inlet of server rack 𝑀𝑗 due to the heat emitted at the outlet of the server rack 𝑀𝑜 . Here we adjust this to matrix 𝐴 = {𝑎𝑗 } denotes the heat increased at the inlet of server rack 𝑆𝑗 caused by the heat recirculated inside the rack due to computation. Hence, the inlet temperature of a server rack 𝑆𝑗 comes from the combination of the supplied cold air from the AC and hot air recirculated inside the rack. This relationship can be written as: 𝑠𝑢𝑝 𝑇𝑗𝑖𝑛 (𝑡) = ∑𝑙𝑥=1 𝑐𝑗𝑥 𝐶𝑇𝐼𝑗𝑥 𝑇𝑥 (𝑡) + 𝑑𝑗 𝑃𝑗𝐶𝑜𝑚𝑝 (𝑡) (8) Where 𝑐𝑗𝑥 is a binary variable which equals to 1 if the AC unit 𝐶𝑥 is assigned to supply cold air to the rack slot of server rack 𝑆𝑗 , and 0 otherwise. As the DC layout is fixed, the value of 𝑐𝑗𝑥 will be given by VAR estimation. 𝑠𝑢𝑝 With equations (5) and (8), we can transfer 𝑇𝑗𝑖𝑛 (𝑡) to the function of 𝑇𝑥 (𝑡). 𝑠𝑢𝑝 ∑𝑙𝑥=1 𝑐𝑗𝑥 𝐶𝑇𝐼𝑗𝑥 𝑇𝑥 (𝑡)+𝛼𝑗 𝐸[𝑊𝑗 (𝑡)] 𝑇𝑗𝑖𝑛 (𝑡) = (9) 1−𝛽𝑗 Optimization solution CERC 2020 378 Business and Society 9 . PUE value is a measurement of the power utilization efficiency of DCs that is adopted internationally. It is the ratio of total power consumed by the DCs to the power consumed by the IT load. 𝑃𝑡𝑜𝑡𝑎𝑙 (𝑡) 𝑥 𝑃𝐶𝑜𝑜𝑙 (𝑡) 𝑃𝑈𝐸 = 𝐶𝑜𝑚𝑝 = 1 + 𝐶𝑜𝑚𝑝 (10) 𝑃𝑗 (𝑡) 𝑃𝑗 (𝑡) The closer the PUE value is to 1, the higher the greenness of a DC. 𝑃𝑥𝐶𝑜𝑜𝑙 (𝑡) Let 𝐺 ∗ = 𝐶𝑜𝑚𝑝 (11) 𝑃𝑗 (𝑡) Then to optimize 𝑃𝑈𝐸 is equal to the question to minimize 𝐺 ∗ . Let denote [𝑡1 , 𝑡2 ] be the interval of interest. Based on the Eq. (5, 6) above, our objec- tive function can be written as: 𝐶𝑜𝑚𝑝 𝐶𝑇𝐼𝑥𝑗 ∑𝑚 𝑗=1 𝑃𝑗 (𝑡) ∑𝑙𝑥=1(∑𝑚 𝑗=1( 𝑠𝑢𝑝 )) 𝑃𝐶𝑜𝑜𝑙 𝑡 ∑𝑙 𝑃𝐶𝑜𝑜𝑙 (𝑡) 𝑡 𝐶𝑜𝑃𝑥 (𝑇𝑥 (𝑡)) 𝑀𝑖𝑛 𝐺 ∗ = 2 𝑥=1 𝑥 𝐶𝑜𝑚𝑝 = ∫𝑡 ( 𝑚 𝐶𝑜𝑚𝑝 ) 𝑑𝑡 = ∫𝑡 2 𝑑𝑡 = 𝑃 1 ∑𝑗=1 𝑃𝑗 (𝑡) 1 ∑𝑚 𝑖𝑛 𝑗=1(𝛼𝑗 𝐸[𝑊𝑗 (𝑡)]+𝛽𝑗 𝑇𝑗 (𝑡)) ( ) 𝐶𝑇𝐼𝑥𝑗 (∑𝑚 𝑖𝑛 𝑖=1(𝛼𝑗 𝐸[𝑊𝑗 (𝑡)]+𝛽𝑗 𝑇𝑗 (𝑡)) ∑𝑙𝑥=1(∑𝑚 𝑗=1 ( 𝑠𝑢𝑝 )) 𝑡2 𝐶𝑜𝑃𝑥 (𝑇𝑥 (𝑡)) ∫𝑡 𝑑𝑡 (12) 1 ∑𝑚 𝑖𝑛 𝑗=1(𝛼𝑗 𝐸[𝑊𝑗 (𝑡)]+𝛽𝑗 𝑇𝑗 (𝑡)) ( ) Align Eq.(12) with Eq.(9), we can transfer the objective function to the function of the 𝑠𝑢𝑝 cooling temperature combinations 𝑇𝑥 (𝑡). Hence, to minimise 𝐺 ∗ , we can optimise 𝑠𝑢𝑝 𝑇𝑥 (𝑡), which is also the decision variable in this model. Nevertheless, the adjustment of the supplied cold air temperature is subject to the con- straint that the inlet temperatures of all server racks are below the redline temperature threshold specified by the device manufacturers (i.e. typically below 25oC). Hence, based on Eq. (8)(5), the constraint of redline threshold (𝑇 𝑟𝑒𝑑1 ) can be presented as: 𝑠𝑢𝑝 𝐶𝑜𝑚𝑝 𝑇𝑗𝑖𝑛 (𝑡) = ∑𝑙𝑥=1 𝑐𝑗𝑥 𝐶𝑇𝐼𝑗𝑥 𝑇𝑥 (𝑡) + 𝑑𝑗 𝑃𝑗 (𝑡) = ∑𝑙𝑥=1 𝑐𝑗𝑥 𝐶𝑇𝐼𝑗𝑥 𝑇𝑥𝑠𝑢𝑝 (𝑡) + 𝑑𝑗 . (𝛼𝑗 𝐸[𝑊𝑗 (𝑡)] + 𝛽𝑗 𝑇𝑗𝑖𝑛 (𝑡) ≤ 𝑇 𝑟𝑒𝑑1 (13) Also, we will restrict the room temperature to be within the allowance (18~27 oC ac- cording to AHREA 2016). As we define the room temperature as the function of ACs temperatures and the Server racks outlet temperatures, based on Eq.(7)(5), the temper- ature with thresholds (𝑇 𝑟𝑒𝑑2 ) and (𝑇 𝑟𝑒𝑑3 ) will be represented as: 𝑠𝑢𝑝 𝑇 𝑟𝑒𝑑2 ≤ 𝑇 𝑟𝑜𝑜𝑚 (𝑡) = ∑𝑙𝑥=1 ℎ𝑥 𝑇𝑥 (𝑡) + ∑𝑚 𝑗=1 𝑔𝑗 𝑇𝑗 𝑜𝑢𝑡 (𝑡) = ∑𝑙𝑥=1 ℎ𝑥 𝑇𝑥𝑠𝑢𝑝 (𝑡) + 𝐶𝑜𝑚𝑝 ∑𝑚 𝑖𝑛 𝑗=1 𝑔𝑗 (𝑇𝑗 (𝑡) + 𝐾𝑗 𝑃𝑗 (𝑡)) = ∑𝑙𝑥=1 ℎ𝑥 𝑇𝑥𝑠𝑢𝑝 (𝑡) + ∑𝑚 𝑖𝑛 𝑗=1 𝑔𝑗 (𝑇𝑗 (𝑡) + 379 CERC 2020 Business and Society 10 𝑠𝑢𝑝 𝐾𝑗 (𝛼𝑗 𝐸[𝑊𝑗 (𝑡)] + 𝛽𝑗 𝑇𝑗𝑖𝑛 (𝑡)) = ∑𝑙𝑥=1 ℎ𝑥 𝑇𝑥 (𝑡) + ∑𝑚 𝑖𝑛 𝑗=1((𝑔𝑗 + 𝑔𝑗 𝛽𝑗 )𝑇𝑗 (𝑡) + 𝑔𝑗 𝐾𝑗 𝛼𝑗 𝐸[𝑊𝑗 (𝑡)]) ≤ 𝑇 𝑟𝑒𝑑3 (14) Moreover, for each AC, there are thresholds (𝑇 𝑟𝑒𝑑4 ) and (𝑇 𝑟𝑒𝑑5 ), which will restrict the AC temperatures to be within the range (0 − 30℃). 𝑠𝑢𝑝 𝑇 𝑟𝑒𝑑4 ≤ 𝑇𝑥 (𝑡) ≤ 𝑇 𝑟𝑒𝑑5 (15) Similarly, by aligning with Eq. (9), we can transfer the constraints functions Eq. (13,14) 𝑠𝑢𝑝 to the function of 𝑇𝑥 (𝑡). In short, the operational problem that we will address at the first stage is to optimize the objective function in Eq. (12) by determining the optimal supplied cold temperature of the AC devices, given the constraints in Eq. (13, 14, 15). The optimization would start once we estimate the DC may perform inefficiently by VAR-RL forecasting, and target temperature combination will be set at 𝑡1 − 𝛥𝑡 periods on the timeline (Where 𝛥𝑡. is also defined by VAR-RL). 4 Future Studies Future studies will be made on forecasting verification and model adjustment. An ap- plication UI (User Interface) will be applied for the DC managers to make sustainable DC management. Field trial studies will also be conducted subsequently, we will mod- ify our models and further studies will be done accordingly. References 1. A. Andrae and T. Edler: On Global Electricity Usage of Communication Technology: Trends to 2030. Challenges, 6(1). 117–157, (2015). 2. A. Capozzoli and G. Primiceri: Cooling systems in data centers: state of art and emerging technologies. Energy Procedia, 83, 484–493, (2015). 3. ASHRAE: Data Center Power Equipment Thermal Guidelines and Best Practices. pp.1–60, (2016). 4. M.K. Stansberry: JulianUptime Institute 2012. Data Center Industry Survey, Uptime Insti- tute (2013). 5. Lu, H., Zhang, Z., Yang, L: A review on airflow distribution and management in data center. Energy & Buildings, 179, 264–277, (2018). 6. Wang, X., Xing, G., Chen, J., Lin, C. X., Chen, Y. Intelligent sensor placement for hot server detection in data centers. IEEE Transactions on Parallel and Distributed Systems, 24(8), 1577–1588, (2013). 7. Volk, E., Rathgeb, D., Oleksiak, A., Volk, E., Rathgeb, D, Rathgeb, D., Oleksiak, A.: CoolE- mAll-optimising cooling efficiency in data centres. Compute Sci Res Dev, 29, 253–26, (2014). CERC 2020 380 Business and Society 11 8. Fu, Y., Zuo, W., Wetter, M., VanGilder, J. W., Han, X., Plamondon, D.: Equation-Based object-oriented modeling and simulation for data center Cooling: A case study. Energy and Buildings. 186, 108–125, (2019). 9. Song, M., Chen, K., & Wang, J.: Numerical Study on the Optimized Control of CRACs in a Data Center Based on a Fast Temperature-Predicting Model. Journal of Energy Engineer- ing, 143(5), 1–8, (2017). 10. A. Beghi, L. Cecchinato, G. Dalla, and M. Lionello: Modelling and control of a free cooling system for Data Centers. Energy Procedia, vol. 140, 447–457, (2017). 11. J. Athavale, M. Yoda, and Y. Joshi: Comparison of data driven modeling approaches for temperature prediction in data centers. Int. J. Heat Mass Transf., vol. 135, 1039–1052, (2019). 12. S. Y. Lin: Reinforcement learning-based prediction approach for distributed Dynamic Data- Driven Application Systems.Inf. Technol. Manag., 16(4). 313–326, (2015). 13. A. Marinescu, I. Dusparic, and S. Clarke: Prediction-based multi-agent reinforcement learn- ing in inherently non-stationary environments, ACM Trans. Auton. Adapt. Syst., 12(2), 2017. 14. H. Nazaktabar, K. Badie, and M. N. Ahmadabadi: RLSP: a signal prediction algorithm for energy conservation in wireless sensor networks. Wireless Networks, 23(3), 919–933, 2017. 15. DCR Homepage, https://datacenterresources.com/articles/increasing-crac-set-points/. last accessed 2020/07/11 16. Q. Tang, S. K. S. Gupta, and G. Varsamopoulos: Energy-efficient thermal-aware task sched- uling for homogeneous high-performance computing data centers: A cyber-physical ap- proach. IEEE Trans. Parallel Distrib. Syst., 19 (11),1458–1472, (2008). 17. R. Zhou and Z. Wang: Modeling and control for cooling management of data centers with hot aisle containment. HP Lab. Tech. Rep., 82, (2011). 18. A. Beghi, L. Cecchinato, G. Dalla, and M. Lionello: Modelling and control of a free cooling system for Data Centers. Energy Procedia, 140, pp.447–457, (2017). 19. H. Sun, P. Stolf, J. M. Pierson, and G. Da Costa: Energy-efficient and thermal-aware re- source management for heterogeneous datacenters. Sustain. Comput. Informatics Syst., 4(4), 292–306, 2014. 381 CERC 2020