<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Self-similar Traffic Research Experiment*</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Russian State Hydrometeorological University</institution>
          ,
          <addr-line>ul. Voronezhskaya, 79, 192007 St. Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Saint-Petersburg State University of Aerospace Instrumentation</institution>
          ,
          <addr-line>Bolshaya Morskaya str. 67, 190000 St. Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The article discusses the procedure of researching network traffic with a self-similar structure. It is a mixture of voice traffic, data and multimedia. Selfsimilar traffic covers very different time scales. Considered the characteristic properties of self-similar traffic both in geometric and statistical terms. Self-similar traffic is represented by the Pareto distribution. Received the characteristics of queuing systems using simulation. Results presented in this article make it possible to substantiate the prospective requirements for network node equipment. Obtained the evaluations quality of service for self-similar traffic in terms of buffer length and time delay in the network node. An experiment proving the operability of the ARIMA(p,d,q) model in the study of self-similar traffic was conducted</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Packet network traffic is the integration of voice, data and multimedia. Such traffic
covers very different time scales - from microseconds till seconds and even minutes
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Mixture of data streams different on content and properties generates to so named
self-similar traffic.
      </p>
      <p>
        Self-similar traffic at any time scale is longtime dependence − availability of
pulsations - activity periods, divided to less active periods [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        In classical models of information streams, such as Poisson stream, Erlang,
gammadistribution and other pulsations are strongly smoothed on large time scales, which
makes the property of long-time dependence is missed. As a result, the classical models
are not allowed to appreciate the volumes of calculation resources of systems when
servicing pulsating traffic [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>According to this fact, the present day task is simulation of network nodes with
selfsimilar traffic at the entrance. The aim of this modification is to reconsider
characteristics received in classical models of information streams.</p>
      <p>In addition, the self-similarity allows one to compose forecast models in different
time scales, which allows a long-term forecast of incoming traffic.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Properties of self-similar traffic</title>
      <p>Self-similarity can be characterized as geometrically and so statistically.</p>
      <p>Self-similarity as geometric concept underlines that fact that process keeps the
structure on different time scales.</p>
      <p>In fig. 1-4 can be seen daily data of different traffic types from 08.20.2018. The data
are given by mobile operator MTS in St. Petersburg. They demonstrate the persistence
traffic structure over time.</p>
      <p>
        Self-similarity as a statistical understanding is characterized by such properties:
─ slowly damped dispersion;
─ a long-time dependence;
─ availability of distribution with heavy "tails" of time intervals between two
consistent events [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ].
      </p>
      <p>400
tyeB 300
G
,em200
u
l
cvo 100
i
f
fraT 0
120000
100000
e
tyB 80000
,M60000
e
lum40000
vo 20000
iffca 0
r
T
1 35 501 157 209 261 313 365 417 469 521 573 25 776 972 781 833 885 937 989 0141 1093 1145 1197 1249 1301 1353 1405 1457
t,6min
(1)
(2)
(3)</p>
      <p>The property slowly damped dispersion is that the dispersion of the sampling
average dampens slower than quantity reciprocal of sampling size</p>
      <p>D(X(n)(t)) =σ2n2H−2, n → ∞,
where σ2 − dispersion of process X (t);
n − the sampling size;
H − Hurst index.</p>
      <p>For meaning H&gt; 0.5 the direction of process dynamics most likely will not change;
for H &lt;0.5 prognoses that process will change direction;
for H = 0.5, we have uncertainty − Brownian moving.</p>
      <p>Availability of a long-time dependence shows that the self-similar process has
hyperbolically damped correlation function
A(k) − changing function on infinity, for which</p>
      <p>R(k) ≅ k(2H−2)A(k), ∀k ≥1, k → ∞,</p>
      <p>kli→m∞ AA((kkx)) =1 for all x&gt; 0</p>
      <p>Property of availability of distribution with heavy "tail" treat to random variable X
(4)
(5)
where 0 &lt;α &lt;2 − parameter of distribution form, the smaller meaning α, the heavier the
"tail" of the distribution;
c − some positive constant.</p>
      <p>For α≤2, distribution has endless dispersion; for α≤1 distribution has an endless
mean.</p>
      <p>
        Adequate description of self-similar traffic is given by probability distributions with
heavy "tails", in particular, by the Pareto distribution [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
      </p>
      <p>Pareto distribution is done by following function</p>
      <p>F (t)</p>
      <p> K α
=− 1  , t ≥ K ,</p>
      <p> t 
where α − form parameter (simply parameter);
K − a border parameter, specifies minimum meaning of random variable x, plays a role
of a scale coefficient.</p>
      <p>In further, the Pareto distribution will be denoted as P(α, K) or simply P.</p>
      <p>Mathematical expectation M(x) of a random variable x distributed on Pareto is
determined by the expression M(x)=αK/(α-1).
3</p>
      <p>
        Analysis of queuing systems M|M|1 and P|M|1
As known bufferisation is the main strategy provide resources. Research are
concentrated around statistical characteristics of the queues [
        <xref ref-type="bibr" rid="ref5 ref6">5,6</xref>
        ]. It is obvious, that for
selfsimilar traffic needs buffers by more size, then predicted classical queue analysis [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Let us show it by example of estimating the characteristics of queuing systems (QS)
type M|M|1 and P|M|1. With this purpose were worked out simulation models QS in
program AnyLogic.
      </p>
      <p>Experiment on simulation model was carried out in follow limits: buffer size L = ∞;
average processing time of application in service node T = 0.02 s; service node load
factor varied ρ∈ [0,2; one]; K = 0.01; α∈ [1.1; 2].</p>
      <p>Evaluate difference in characteristics QS type M|M|1 and P|M|1 in form of relations:
─ average queues lengths LP and LE with Pareto and exponential distribution of time
intervals between two arrivals of traffic packets, accordingly;
─ maximum queues lengths LˆP and LˆE with Pareto and exponential distribution of
time intervals between two arrivals of traffic packets, accordingly;
─ average waiting time Tw and stay time Ts packets into QS type M|M|1 and P|M|1 for
different load.</p>
      <p>
        Since different QS type M|M|1 and P|M|1 are compared, that besides using one and
the same generator of random numbers and one the same scale coefficient, needs to use
one the same intensity arrivals of traffic packets [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8-10</xref>
        ].
      </p>
      <p>For QS type P|M|1, the intensity may be done through parameter α. If suppose that
the mathematical expectation of exponential distribution tends to mathematical
expectation of Pareto distribution, so 1 = αK , so then α = (1 − Kλ )−1 . Thus, if the intensity
λ α −1
λ = 25 packets / s, K =0.01, then for QS type P|M|1 with the same intensity, parameter
α= 1.33, that is not against property of distribution with heavy tails.</p>
      <p>The results of statistical characteristics queuing systems type M|M|1 and P|M|1 are
done in Table. 1. The number of experiments are 5104. Average queue length come to
whole rounded on the right.
Analysis of results from the table. 1 allows to make following conclusions.</p>
      <p>Difference in required buffer length for QS type M|M|1 and P|M|1 is obvious, starting
for α = 1.1. For α ∈ [1,1; 1.7], this difference is stable when comparing the maximum
queue lengths and this difference rise for α &gt; 1.7. At comparison of average queue
lengths, the buffer length for P|M|1 begins show the double rise already at α&gt; 1.2.</p>
      <p>The rise of queue is influenced not so much by the distribution of time intervals, as
by correlation structure of process.
4</p>
      <p>Traffic research experiment using the ARIMA(p,d,q) model
A feature of the self-similarity is also the ability to predict the amount of data in the
network, which helps prevent data loss associated with a denial of service.</p>
      <p>
        Based on [
        <xref ref-type="bibr" rid="ref11 ref12">11–13</xref>
        ], it was concluded that the autoregressive integrated moving
average model (ARIMA) can be used to predict self-similar traffic.
      </p>
      <p>An experiment was conducted, which task was to show the operability of the
ARIMA(p,d,q) model in the study of traffic. For the experiment, there was taken the
data of LTE traffic for six days received from MTS (Fig.5).</p>
      <p>The purpose of the experiment: to build an ARIMA forecast model based of traffic
data for 5 days for the sixth day and compare it with traffic data for this period.
It can be seen from the graph in Figure 5 that there is seasonality with a period of 24
hours, the graph fluctuates around a certain value, and it is also seen that there is a
number of peak values. An autocorrelation analysis of the existing time series was
carried out and an autoregression function was constructed with the number of steps equal
to 48 (Fig.6).
On the graph there was a peak of the first value of the time lag. This peak was removed
by taking a difference of the order of 1 (D-1) and the autocorrelation function for the
transformed time series and the partial autocorrelation function were constructed
(Fig.7). The parameters p, d, q were determined.</p>
      <p>Since the sequential difference operation was applied once, d=1. The parameter p
was chosen from the partial autocorrelation model p=1 (the first significant value of the
series of functions). The parameter q was chosen similarly from the autocorrelation
model and q=1.
As a result, the ARIMA(1,1,1) model was built and a forecast for one period was
obtained (Fig. 8).
Since, due to the properties of self-similarity, the structure of the time series is
preserved in different time scales, the constructed model is also suitable for other periods
of time (month, year, etc.).</p>
      <p>This experiment is an example for constructing an ARIMA model and illustrates the
order of forecasting using this model. A feature of the study of the time series using the
ARIMA model is a mandatory expert assessment of the obtained autocorrelation
models. On the one hand, this is a drawback, because forecasting is a labor-intensive process
and it requires the participation of a specialist, on the other hand, expert assessment
allows you to more accurately build a forecast model.
5</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>The rise in the share of multi-service traffic in networks actualizes the problem of
meeting customer requirements for the quality of network services provided.</p>
      <p>It was conducted the simulation experiment for QS type M|M|1 and P|M|1. The
obtained evaluations quality of service for self-similar traffic in terms of the buffer length
and the time delay in the network node.</p>
      <p>The results presented in the article make it possible to substantiate the prospective
requirements for network node equipment.</p>
      <p>An experiment was conducted proving the operability of the ARIMA(p,d,q) model
in the study of self-similar traffic.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bogatyrev</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bogatyrev</surname>
            ,
            <given-names>S. V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parshutina</surname>
            ,
            <given-names>Bogatyrev A. V.</given-names>
          </string-name>
          :
          <article-title>Model and Interaction Efficiency of Computer Nodes Based on Transfer Reservation at Multipath Routing</article-title>
          .
          <source>In: 2019 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          (
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .1109/WECONF.
          <year>2019</year>
          .8840647
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Tatarnikova</surname>
          </string-name>
          , T.M.
          <article-title>Statistical methods for studying network traffic</article-title>
          .
          <source>In Informatsionno-Upravliaiushchie Sistemy</source>
          , vol.
          <volume>96</volume>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>35</fpage>
          -
          <lpage>43</lpage>
          (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .31799/
          <fpage>1684</fpage>
          -8853-2018-5-
          <fpage>35</fpage>
          -43
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Tanenbaum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wetherall</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : Computer Networks. 5th ed.
          <source>Prentice Hall</source>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Kutuzov</surname>
            <given-names>O. I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tatarnikova</surname>
            <given-names>T. M.</given-names>
          </string-name>
          <article-title>Model of a self-similar traffic generator and evaluation of buffer storage for classical and fractal queuing system</article-title>
          .
          <source>In Moscow Workshop on Electronic and Networking Technologies, MWENT 2018 - Proceedings 1</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Leland</surname>
            ,
            <given-names>W.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taqqu</surname>
            ,
            <given-names>M.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willinger</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          and Wilson, D.V.
          <article-title>On The Self-Similar Nature of Ethernet Traffic</article-title>
          .
          <source>In Proc. ACM SIGCOMM'93</source>
          , pp.
          <fpage>183</fpage>
          -
          <lpage>193</lpage>
          . San-Fransisco (
          <year>1993</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Zwart</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          :
          <article-title>Queueing Systems with Heavy Tails</article-title>
          . Eindhoven University of Technology Publ. (
          <year>2001</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Poymanova</surname>
            ,
            <given-names>E.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tatarnikova</surname>
            ,
            <given-names>T. M.</given-names>
          </string-name>
          <string-name>
            <surname>Models</surname>
          </string-name>
          and
          <article-title>Methods for Studying Network Traffic</article-title>
          .
          <source>In 2018 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .1109 / WECONF.
          <year>2018</year>
          .8604470
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Bogatyrev</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <article-title>A. Protocols for dynamic distribution of requests through a bus with variable logic ring for reception authority transfer</article-title>
          .
          <source>In Automatic Control and Computer Sciences</source>
          , vol.
          <volume>33</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>57</fpage>
          -
          <lpage>63</lpage>
          (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Tatarnikova</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolbanev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Statement of a task corporate information networks interface centers structural synthesis</article-title>
          .
          <source>In IEEE EUROCON</source>
          <year>2009</year>
          , pp.
          <fpage>1883</fpage>
          -
          <lpage>1887</lpage>
          . St.
          <string-name>
            <surname>Petersburg</surname>
          </string-name>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Bogatyrev</surname>
            ,
            <given-names>V. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinokurova</surname>
            <given-names>M. S.</given-names>
          </string-name>
          <string-name>
            <surname>Control</surname>
          </string-name>
          and
          <article-title>Safety of Operation of Duplicated Computer Systems</article-title>
          . In Communications in Computer and Information Science,
          <source>IET - 2017</source>
          , vol.
          <volume>700</volume>
          , pp.
          <fpage>331</fpage>
          -
          <lpage>342</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          12.
          <string-name>
            <surname>Крюков</surname>
            <given-names>Ю.А.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Чернягин</surname>
            <given-names>Д</given-names>
          </string-name>
          .
          <article-title>В. ARIMA - модель прогнозирования значений трафика // Информационные технологии и вычислительные системы</article-title>
          .
          <year>2011</year>
          /2, с.
          <fpage>41</fpage>
          -
          <lpage>49</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          13.
          <string-name>
            <surname>Соловьев</surname>
            <given-names>А.Ю.</given-names>
          </string-name>
          <article-title>О задаче прогнозирования самоподобных сетевых процессов // III Международная научная конференция «Современные проблемы информатизации в системах моделирования, программирования и телекоммуникациях»</article-title>
          . URL: http://econf.rae.ru/article/4745
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>