=Paper=
{{Paper
|id=Vol-2792/paper4
|storemode=property
|title=Optimal Strategy Modelling in an Online Auction for the Rent of Computing Resources
|pdfUrl=https://ceur-ws.org/Vol-2792/paper4.pdf
|volume=Vol-2792
|authors=Anna Ivashko,George Safonov
}}
==Optimal Strategy Modelling in an Online Auction for the Rent of Computing Resources==
<pdf width="1500px">https://ceur-ws.org/Vol-2792/paper4.pdf</pdf>
<pre>
      Optimal Strategy Modelling in an Online
    Auction for the Rent of Computing Resources

                      Anna Ivashko1,2     and George Safonov2
1
     Institute of Applied Mathematical Research of the Karelian Research Centre of the
    Russian Academy of Sciences, Pushkinskaya Str., 11, Petrozavodsk, 185910, Russia
      2
        Petrozavodsk State University, Lenina Str., 33, Petrozavodsk, 185910, Russia
                               aivashko@krc.karelia.ru
                                jiri.safonov@gmail.com


         Abstract. We consider the problem of optimal bid pricing in an auc-
         tion for cloud computing resource allocation. The multistage model con-
         nected with the full-information best-choice problem is investigated. In
         this model, users set bids as a sequence of thresholds at each step to
         minimize the expected resource rent price. A form of the spot price dis-
         tribution function is proposed based on real-life Amazon EC2 spot price
         dynamics. The optimal bid values, mean instance rent price and average
         step for buying the resource were found through simulations.

         Keywords: Cloud Computing · Spot Instance · Mathematical Modeling
         · Best-choice Problem · Amazon EC2 · Threshold Strategies


1      Introduction

Cloud computing has a leading position in the IT-market, and is constantly
increasing its market share. Public cloud computing resources are accessible, and
allow users who require complex computing or other hardware and software tools
to avoid software and hardware expenses and other costs. Also, these services
respond flexibly and quickly to the changing computing needs of the users. Cloud
services have also become increasingly popular in the modern world, where it
became necessary to work remotely. Therefore, the demand for such resources
has lately increased significantly. With cloud services available, a user only has
to pay the rent of the computing resource he/she needs and can then start using
it immediately.
    One of the major computing resources providers on the web is the cloud
computing platform Amazon EC2 [1]. Virtual machines on this platform are in
the form of so-called “instances”. Amazon EC2 offers its users various terms
for renting instances. A new type of instances is Spot Instances (SI). SI are
purchased through auction. Users can bid on unused Amazon EC2 capacity.
The spot price changes all the time, and users can monitor purchase prices over
previous periods. Therefore, the problem of optimal bid pricing in an auction
for renting a certain computing resource is of high relevance.


Copyright © 2020 for this paper by its authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).
    In this paper, the following model of an auction for renting a spot resource
(see Ivashko et al. [2]) is investigated. The user observes spot prices at discrete
moments of time and wants to buy the instance within a predetermined time
period. At each stage he/she can set their bid. If that bid exceeds the current
spot price, the instance is sold to the user. The objective is to find the optimal
strategy (minimal bid) that would minimize the cost of getting the instance
within a specific time period. This model is reduced to the best-choice problem,
which is well-known in the optimal stopping theory.
    This paper is structured as follows. Section 2 offers a brief review of related
works. In Section 3, we outline the spot auction mechanism. The mathematical
model of spot instance auctions is considered, and the strategy for minimizing
the expected spot price is described in Section 4. Next, in Section 5, we investi-
gate the spot price dynamics and conduct parameter fitting for the distribution
function of real data from real-like Amazon SI auctions. Finally, in Section 6, we
present the findings and conclusions, and draft plans for the future.


2    Related Works

Different schemes based on competitive prices are used when buying/selling or
renting resources. One can name various auctions and tenders, competition for
computing resources or storage capacity. Cloud computing resources are gaining
significance in the IT world. Therefore, a lot of research is going on in the area.
    Research papers are dedicated to both exploring the existing pricing mech-
anisms and behavior of real data, and proposing new mechanisms. There are
various approaches to analyzing cloud resource pricing. Different probabilistic
and statistical pricing models are considered in papers by Kumar et al. [3], Ab-
hishek et al. [4], and by Xu et al. [5]. Probabilistic models are grounded in the
assumption that spot prices are independent and identically distributed random
variables. Amazon EC2 spot price dynamics has been analyzed by Ben-Yehuda
et al. [6], Cheng et al. [7], and by Javadi et al. [8]. It is also noted in these papers
that spot prices have a random nature.
    An important task is to model the competitive behavior of participants in
various types of auctions. Online auction is the best example of modern markets.
In an auction, it is necessary for a participant to determine his/her optimal
strategy in order to increase the chance of winning.
    Auction designs vary. Mechanisms for bidding in auctions for a computing re-
source are proposed in papers by Karunakaran et al. [9], Sowmya, Sundarraj [10],
Tang et al. [11], and Wang et al. [12].
    Optimal stopping models are often used to study the behavior of partici-
pants in socio-economic systems. The optimal stopping problems where the par-
ticipants have to take the decision about choosing an item that fulfills certain
criteria are also known as best-choice problems. One of the first to describe the
classical statement of the best-choice problem was Moser [13]. Such problems
often arise in economics, finance, sociology and politics.
    The users’ optimal strategies are often derived using the dynamic program-
ming method, in which a complex problem can be solved by breaking it down
into simpler sub-problems represented as a recursive sequence. This method has
various applications in different spheres, such as the house-selling problem, the
job-search problem, or the mate-choice problem, and is successfully used in eco-
nomic contexts for best-choice problems and auctions.
    In some schemes of online auctions for buying an item or resource participants
do not know in advance the exact current price. Each customer can set the
threshold price value (bid) at which he/she agrees to buy the resource. During
the auction the deal is made if the price offered by a customer exceeds the current
price. An important task is to analyze the behavior of participants in this type
of auction. The optimal behavior of participants of various online auctions has
been studied by the best-choice problem in papers by Harrall et al. [14], Mazalov,
Ivashko [15], Babaioff et al. [16].
    Ivashko et al. [2] applied the best-choice model to determine the optimal
bid value in order to win the auction. The authors considered this probabilistic
model for the cases where spot prices have a uniform distribution on [0,1] or a
normal distribution. However, further studies prove that the distribution of spot
prices has a more sophisticated structure.
    In this paper, we use the model proposed by Ivashko et al. [2]. We assume that
the price dynamics of each spot instance is modeled by a mixture of Gaussian
distributions. This assumption is supported by studies in a paper by Javadi
et al.[8]. We estimated the unknown parameters of the mixture of Gaussian
distributions using the EM-algorithm. Optimal bid values in the auction were
determined and numerically simulated.


3   Amazon Spot Auctions

Elastic Compute Cloud platform (Amazon EC2) [2] is one of the earliest and
most popular web services offering the computing resources of the cloud com-
puting system AWS (Amazon Web Services) for use. The AWS system is located
in several data centers in different regions of the world so that the user can find
the most technically available resource. Resources are available to users in the
form of instances (virtual machines). A client pays a certain per hour fee. He/she
can also choose the type of instances needed for the calculations and decide how
to purchase instances. There are three alternative ways to buy instances. On-
demand Instances are charged full price. A user can purchase an On-demand
Instance when he/she needs it with no long-term commitments or upfront pay-
ments. Reserved Instances are rented for a fixed duration. They provide the
user with a significant discount (up to 75%) compared to On-Demand Instance
pricing. The Reserved Instance will always be available in the availability zone
in which it was purchased. Spot Instances (SI) are idle EC2 instances available
for less than the On-Demand price (up to 90% saving). A Spot Instance runs
whenever capacity is available but, unlike On-demand and Reserved Instances,
can be interrupted by the AWS system at any time.
    SIs are in high demand among users who do not need to use the resources
continuously. SIs are well-suited for data analysis, batch jobs, background pro-
cessing, and optional tasks. SIs are rented through an auction for unused capac-
ity. Customers bid the maximum cost of instance per hour of use they would
be ready to pay. If their bids exceed the spot price, they win the auction. All
winning customers pay the same price, which is equal to the value of the lowest
winning bid. This closed auction is a Vickrey auction.


4   Optimal Strategy
This paper examines SIs and the auction for their allocation. The mathematical
best-choice model proposed by Ivashko et al. [2] is used to determine the optimal
bid value in order to win the auction.
    The optimal stopping theory deals with the problems of choosing the time to
make a decision based on observing sequentially the random values. The aim is
to maximize the expected payoff. In the optimal stopping theory there is a class
of best-choice problems. They arise from real choice processes. The best-choice
problem with full-information is directly related to the problem of optimal bid
pricing in a spot auction.
    We describe the best-choice model as follows [2]. A customer willing to rent
an SI within a certain time interval observes sequentially the spot prices X1 ,
X2 , X3 , . . . as a sequence of independent and identically distributed random
variables from a known continuous cumulative distribution function F (x) on the
interval [pmin , pmax ]. pmin corresponds to a minimal possible SI price at the
auction. pmax is a maximal possible price not exceeding the on-demand price.
There are n steps available to win a SI. The customer’s strategy is a sequence of
thresholds (bids). He/she makes the bid τi before i-th step, i = 1, ..., n. His/her
aim is to minimize the expected spot price for the whole period of time.
    Full-information best-choice problems with finite horizon are solved by the
dynamic programming method. Let us find the optimal behavior of a customer.
If the customer does not win a SI within the period n, he/she will have to buy
it for the maximum price pmax . It makes sense for the customer to buy the SI
at any price below or equal to pmax . So, the bid at the last step is τn = pmax .
    Because spot prices are independent and identically distributed random vari-
ables with a continuous cumulative distribution function F (x), the expected SI
                                      Rτn
price at the last step n is equal to      xdF (x). At the step n − 1 the customer’s
                                    pmin
bid τn−1 is the minimum between the current SI price and the expected cost at
the n-th step:

                           pZ
                            max                         Zτn               pZ
                                                                           max


τn−1 = E[min{X, τn }] =          min{x, τn }dF (x) =          xdF (x) +        τn dF (x).
                          pmin                         pmin               τn

Here, E[X] is the expectation of the random variable X.
    Continuing the process, we get that the optimal bid value τi at each step i
is rendered by the system of recurrent equations:


τn = pmax ,
     pmax
      R                         τR
                                 i+1            pmax
                                                 R
τi =      min{x, τi+1 }dF (x) =      xf (x)dx +      τi+1 f (x)dx, i = 1, ..., n − 1,
     pmin                                pmin                τi+1
                                                                                 (1)
where f (x) is the probability density function of the distribution F (x).
    If the customer’s bid τ1 at the first step does not win, the next bid τ2 is used
at the second step. Continuing this process, the customer is guaranteed to get
an instance during the time period n with the minimum expected spot price.


5   Optimal Strategy Modelling

We describe a method to derive the strategy for optimal bid pricing. We use real
Amazon EC2 spot prices data [18]. Based on this history, we build the prices’
probability density function and derive the optimal threshold bids using formula
(1). Wolfram Mathematica platform is used for modelling.
    Price histograms are plotted using spot price statistics. Figure 1 shows ex-
amples of price histograms for the instances us-east-1b m4.xlarge linux/UNIX
and us-east-1b m4.2xlarge linux/UNIX.


        Fig. 1. Histograms and probability density functions of spot prices


    Analyzing the data one can construct the prices probability density function,
see Figure 1. We used the following truncated mixture of Gaussian distributions:

                       
                                        −(x−a1 )2                  −(x−a2 )2
                               C            2             C            2
       fb1 ,b2 (x) =       ω1 √2πσ  e      2σ1
                                                    + ω2 √2πσ  e      2σ2
                                                                               , x ∈ [b1 , b2 ],   (2)
                                  1                          2
                       0, x ∈
                             / [b , b ];
                                   1     2
                                    1
here C = b             −(x−a1 )2              −(x−a2 )2
                                                                   .
         R2
               √ ω1 e                  ω2
                             2
                           2σ1                      2
                                                  2σ2
                2πσ1
                                    + √2πσ e                   dx
                                          2
          b1

   Take for example the optimal strategy modelling for the us-east-1e p2.8xlarge
Linux/UNIX instance over the period of July12–August 30, 2019. We have
N = 200 values of spot prices (see Figure 2).


                             Fig. 2. Spot pricing dynamics


   Having the data we construct the histogram of prices (Figure 3). The step in
the histogram is 1 + [log2 N ], N = 200.


Fig. 3. Histogram and probability density function of spot prices for the us-east-1e
p2.8xlarge Linux/UNIX instance
    Using the data we get in formula (2) b1 = pmin = 2.216, b2 = pmax = 3.048,
where pmin corresponds to the minimum price over the entire period of time,
and pmax corresponds to the maximum price.
    We tested the statistical hypothesis using Pearson’s chi-squared test (χ2 ).
The null hypothesis H0 stated that the frequency distribution of certain events
observed in a sample (using histogram) is consistent with a mixture of two Gaus-
sian distributions. The alternative hypothesis H1 is that the frequency distribu-
tion does not have the form of a mixture of two Gaussian distributions. Having
run the χ2 test at a significance level α = 0.05, we got P -V alue = 0.801189. As
P -V alue > α, so normally we would not reject the null hypothesis.
    We estimate the unknown distribution parameters a1 , σ1 , a2 , σ2 , ω1 , ω2 using
an Expectation Maximization (EM) algorithm. The EM-algorithm is used to
find maximum likelihood parameters of a statistical model in cases where the
equations cannot be solved directly.
    We consider the vector of the estimated parameters θ = (ω1 , ω2 ; a1 , a2 ; σ1 , σ2 ).
    The density function of the normal distribution has the form:
                                                             (x−aj )2
                                                    1    −
                                                               2σ 2
                           N (x, aj , σj ) = √          e        j      , j = 1, 2.
                                                   2πσj
   Let us consider two steps of the EM-algorithm:
   The expectation (E) step:

           jω N (xi ,aj ,σj )
   gij = P
         2                           , i = 1, ..., m;
                ωs N (xi ,as ,σs )
          s=1

   The maximization (M) step:
             m
        1
             P
   ωj = m       gij ,
            i=1
              m
         1
              P
   aj = mω j
                       gij xi ,
                 i=1
                  m
          1
   σj2 = mω            gij (xi − aj )2 ,
                  P
            j
                                              j = 1, 2,
                 i=1
here, m is the number of elements in the sample, gij is the probability that
the observation xi comes from the component j of the mixture of two Gaussian
distributions.
    We should iterate the steps E and M until convergence: until the value dif-
ference between a1 , a2 at step h and a1 , a2 at step h − 1 is less than 10−4 .
    The application of the EM-algorithm yielded the estimations shown in Ta-
ble 1.
    Next, we get the optimal threshold values (bids) for n = 15 steps of the model
using formula (1) (Table 2). As one can see, the sequence of bids is increasing
with i, i = 1, ..., n.
    To illustrate the optimal behavior let us consider the time period September
5 – 28, 2019 with the number of steps n = 15. In Figure 4 the optimal thresholds
                        ω1           ω2            a1           a2        σ1        σ2
                       0.0701       0.9299       2.2404       2.6492     0.0149   0.1707

                                    Table 1. Parameter estimations


            Table 2. Bid values generated by the optimal strategy for n = 15

 i      1          2            3            4            5          6     7       8        9       10
 τi   2.5772    2.5775       2.5779       2.5784        2.579    2.58    2.581    2.583    2.585   2.588

 i     11       12        13          14          15
 τi   2.592    2.598    2.607       2.622        3.048


(grey line) and spot prices (black line) are presented at each step. Here, the
acceptance step is i = 13, and the bid value which exceeded the spot price is
equal to 2.5828.


                     Fig. 4. Spot prices and threshold values for n = 15


    In Table 3 the results on the optimized mean spot price and mean acceptance
step are given for n = 5, n = 10, n = 15, and for 90 price values from time period
September 7, 2019 – October 2, 2019.
    As illustrated by Table 3, if the number of steps available in an auction
increases, the mean acceptance step also grows, while the mean spot price de-
creases. As a result, the more steps are available for participating in the auction,
the lower is the price at which a user can rent the resource.
                         Table 3. Simulations for different n

                     n   Mean spot price   Mean acceptance step
                     5       2.46043                 2.3
                    10       2.45081                 3.9
                    15       2.43388                  4


6   Conclusion
This paper examines the problem of optimal bid pricing by cloud service users.
The problem is connected with the full-information best-choice problem. Based
on real data dynamics, the spot price distribution is modeled as a mixture of
Gaussian distributions. Thus, we apply the theoretic model to real spot price
dynamics. The optimal bid values, mean instance rent price and the acceptance
step for buying the instance were found through simulations. These values allow
winning the auction while minimizing the expected SI rent price.
   In the future, new models for optimal bid pricing in spot auctions can be
proposed. Various strategies can be compared to identify their respective benefits
and limitations. We collected the price history during one year and we can use
these data to further research. Also the model where the prices are dependent
random variable can be considered. For this case a new model can be constructed.


References
 1. Amazon      Inc.    Amazon     Elastic   Compute      Cloud    (Amazon       EC2).
    http://aws.amazon.com/ec2
 2. Ivashko, E.E., Tchernykh, A., Ivashko, A.A., Safonov, G.R.: Cost-efficient strategy
    in clouds with spot price uncertainity. Automation and Remote Control 81(4),
    731–745 (2020). https://doi.org/10.1134/S000511792004013X.
 3. Kumar, D., Baranwal, G., Raza, Z., Vidyarthi, D. P.: A Survey on Spot Pricing in
    Cloud Computing. Journal of Network and Systems Management, 809–856 (2018).
    https://doi.org/10.1007/s10922-017-9444-x
 4. Abhishek, V., Kash, I.A., Key, P.: Fixed and Market Pricing for Cloud Services.
    In:Proceedings of the 7th Workshop Economics of Networks System Computer
    (NetE- con 2012), IEEE Computer Society, Orlando, 20 March 2012, 157–162.
 5. Xu, H., Li, B.: Maximizing revenue with dynamic cloud pric-
    ing: The infinite horizon case. 2012 IEEE International Confer-
    ence on Communications (ICC), Ottawa, ON, 2929–2933 (2012).
    https://doi.org/10.1109/ICC.2012.6364013
 6. Ben-Yehuda, A. O., Ben-Yehuda, M., Schuster, A., Tsafrir, D.: Deconstructing
    Amazon EC2 Spot Instance Pricing. Cloud Computing Technology and Science
    (Cloud-Com), 2011 IEEE Third International Conference on, Athens, 304–311
    (2011). https://doi.org/10.1109/CloudCom.2011.48
 7. Cheng, H.K., Li, Z., Naranjo, A.: Cloud computing spot pricing dynamics: Latency
    and limits to arbitrage. Information Systems Research 27(1), 145–165 (2016).
 8. Javadi, B., Ruppa K. Thulasiram, Rajkumar Buyya: Characterizing spot price dy-
    namics in public cloud environments. Future Generation Computer Systems 29(4),
    988–999 (2013). ISSN 0167-739X, http://dx.doi.org/10.1016/j.future.2012.06.012.
 9. Karunakaran, S., Sundarraj, R. P.: Bidding Strategies for Spot Instances in
    Cloud Computing Markets. in IEEE Internet Computing 19(3), 32–40 (2015).
    https://doi.org/10.1109/MIC.2014.87
10. Sowmya, K., Sundarraj, R.P.: Strategic bidding for cloud resources under dynamic
    pricing schemes. Proceedings – 2012 International Symposium on Cloud and Ser-
    vices Computing, ISCOS 2012. art. no. 6481231, 25–30 (2013).
11. Tang, S., Yuan, J., Li, X.Y.: Towards Optimal Bidding Strategy for
    Amazon EC2 Cloud Spot Instance. Cloud Computing (CLOUD), 2012
    IEEE 5th International Conference on, Honolulu, HI, 91–98 (2012)
    https://doi.org/10.1109/CLOUD.2012.134
12. Wang, W., Liang, B., Li, B.: Revenue maximization with dy-
    namic auctions in IaaS cloud markets. Quality of Service (IWQoS),
    IEEE/ACM 21st International Symposium on, Montreal, QC, 1–6 (2013)
    https://doi.org/10.1109/IWQoS.2013.6550265
13. Moser, L.: On a problem of Cayley. Scripta Math. 22(5), 289–292 (1956).
14. Harrell, G., Harrison, J., Mao, G., Wang, J.: Online Auction and Secretary Prob-
    lem. Int’l Conf. Scientific Computing. 241–244 (2015).
15. Mazalov, V.V., Ivashko, A.A.: Online Auction and Optimal Stopping
    Game with Imperfect Observation. Intelligent Information and Database
    Systems. ACIIDS 2020, LNCS, volume 12033. Springer, 145–156 (2020).
    https://doi.org/10.1007/978-3-030-41964-6 13
16. Babaioff, M., Immorlica, N., Kempe, D., Kleinberg, R.: Online auctions
    and generalized secretary problems. SIGecom Exch 7(2), 1–11 (2008).
    http://doi.acm.org/10.1145/1399589.1399596
17. Amazon EC2 Spot Instances. http://aws.amazon.com/ec2/spot/
18. Spot     Instance    Pricing   History    Amazon     Elastic   Compute     Cloud
    http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/using-spot-instances-
    history.html

</pre>