<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On improving the quality of VoIP connections</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A A Bukatov</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D Y Polukarov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>N D Zaitsev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A M Sukhov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Don State Technical University</institution>
          ,
          <addr-line>Lenina str. 69, Rostov-on-Don, Russia, 344079</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Samara National Research University</institution>
          ,
          <addr-line>Moskovskoe Shosse 34А, Samara, Russia, 443086</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Southern Federal University</institution>
          ,
          <addr-line>Stachki str. 200/1, k.213, Rostov-on-Don, Russia, 344090</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>366</fpage>
      <lpage>371</lpage>
      <abstract>
        <p>Improving the quality of VoIP connections is a very important goal in the area of telecommunications. The proportion of multimedia traffic in relation to the total traffic supported by providers is constantly increasing. To identify and troubleshoot issues with VoIP connections, network providers need both criteria and a methodology for assessing connection quality. We offer a methodology for assessing the quality of VoIP connections. A comparative analysis of VoIP codecs is also made.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>2. Related work</title>
      <p>
        Objective methods for assessing the quality of a voice signal are widely studied in [
        <xref ref-type="bibr" rid="ref4 ref5 ref6 ref7 ref8">4, 5, 6, 7, 8</xref>
        ]. In
these works, various methods are offered: Emodel, PSQM / PSQM +, PESQ, P.563. These will be
described in more detail below.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] methods are proposed for constructing tools for monitoring the quality of voice flow
transmissions. The development of these tools were based on the use of VoIP telephony systems to
support the activities of educational institutions using distance learning forms to enhance the distance
learning process.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] a new mechanism for providing online assessments of VVoIP quality of service is
introduced. This operates on network paths without the participation of users. The mechanism uses the
"GAP-model", which is a model for measuring the QoE in terms of measurable network factors such
as bandwidth, delay, jitter and loss (see above).
      </p>
    </sec>
    <sec id="sec-2">
      <title>3. Overview of assessment methods</title>
      <p>
        Speech quality assessment methods for VoIP systems are subdivided into subjective and objective
metrics [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Subjective methods require that an expert evaluates the situation in question, and therefore
are unacceptable in relation to automatic evaluation. Objective methods for assessing the quality of
voice transmission are divided into two groups. The first group assesses the quality of transmission of
primary data streams. This group includes Emodel [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The second group evaluates the quality of the
audio stream transmission itself. The second group includes the PSQM/PSQM+ method (Perceptual
Speech Quality Measure) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which has been further developed into the PESQ method (Perceptual
Evaluation of Speech Quality) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]; this second group also includes the method P.563 [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ].
      </p>
      <p>
        The results of a comparative analysis of voice quality estimation methods for VoIP systems are
given in Table 1 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>Table 1.Comparison of objective speech quality analysis methods.</p>
      <sec id="sec-2-1">
        <title>The absence of excess traffic</title>
      </sec>
      <sec id="sec-2-2">
        <title>The possibility of one-way data flow analysis</title>
      </sec>
      <sec id="sec-2-3">
        <title>The possibility of analyze the types of distortion of the received speech stream</title>
      </sec>
      <sec id="sec-2-4">
        <title>Emodel yes yes no</title>
      </sec>
      <sec id="sec-2-5">
        <title>PESQ no no yes</title>
        <p>P.563
yes
yes
yes</p>
        <sec id="sec-2-5-1">
          <title>4. Comparing codecs and choosing the best one</title>
          <p>There are network routes that are comparatively long but connect, end-to-end, quite closely spaced
nodes.</p>
          <p>For example, consider these cases.</p>
          <p>Case 1: a subscriber VoIP phone is connected to the corporate telecommunications network of the
Southern Federal University (SFU) through the LTE (mobile 4G network) of the mobile operator
MTS. The VoIP subscriber is located in the city of Rostov-on-Don.</p>
          <p>
            The route shown in Figure 2 can be described by the following sequence of cities: Rostov-on-Don
Moscow - St. Petersburg - Helsinki - Stockholm - Amsterdam - St. Petersburg - Moscow –
Rostov-onDon. It should be noted that the shorter return route from Amsterdam to St. Petersburg (which reduces
total length of the route) is due to the highly developed infrastructure of the international channels of
the branch network of the Ministry of Education and Science of the Russian Federation RUNNet [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ].
          </p>
          <p>
            It is known that the transmission of information flows through "long" routes subjects such streams
to certain kinds of distortion [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ], such as those caused by delays in packet delivery (tempo
distortions) and those caused by packet loss — in the case of data transfer protocols based on RTP
protocols / RTTP data transfer for VoIP telephony, and over the transport protocol UDP. Regarding
delay, we note that the most "destructive" distortion imposed on the transmitted real-time signal
(which is the VoIP telephony signal) is not so much the delay per-se, as the variation in this value (or
jitter) [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ]. In Figure 2, the maximum delay value reaches 200 ms (in the 18th line), and the jitter
value is 76 ms. Note that the packet loss level is not displayed by the route trace command.
          </p>
          <p>Case 2: a subscriber VoIP phone is connected to the corporate telecommunications network of the
Southern Federal University (SFU) through the home Wi-Fi network. The VoIP subscriber is located
in the city of Rostov-on-Don. The sequence of cities, given in Figure 3, is the same as that of the
previous case: Rostov-on-Don - Moscow - St. Petersburg - Helsinki - Stockholm - Amsterdam - St.
Petersburg - Moscow – Rostov-on-Don.
subscribers is of great importance for the purpose of further improving the parameters affecting the
quality of voice transmission.</p>
          <p>There are known methods for assessing the quality of voice transmission, based both on the
analysis of primary distortions of the transmitted data stream, and on the analysis of secondary
distortions characteristic of voice transmission specifically. This article discusses the causes and types
of such distortions; compares the methods for assessing the presence of such distortions in order to
choose the method most suitable for use in monitoring the quality of speech; and suggests methods for
improving voice quality across corporate VoIP telephony systems by improving both the system
VoIP-telephony, and its environment. The article ends with general conclusions concerning the results
obtained.</p>
          <p>
            One of the most promising methods for improving the quality of VoIP connections is associated
with the use of more efficient codecs [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ].
          </p>
          <p>
            The use of the more modern Speex codec [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] allows for a significantly weakening of the
requirements, i.e., the threshold values for the primary data transmission quality indicators in terms of
what is needed in order to provide acceptable voice quality. Specifically, when using this codec, a
satisfactory quality of voice transmission is ensured even with delays of up to 150 ms, a jitter value of
up to 15 ms, and losses of up to 10% of packets. Thus, in comparison with the recommendations of
ITU-T G.712 [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ], the threshold value for the permissible variation in delays is increased by a factor
of 1.5, and the percentage of admissible data loss is increased tenfold. [
            <xref ref-type="bibr" rid="ref2 ref9">2, 9</xref>
            ].
          </p>
          <p>
            Note, however, that the Speex codec is not the best currently available. The electronic resource
devoted to this codec [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] provides the information that the Speex codec is surpassed, according to all
indicators, by the new freely distributed Opus codec [
            <xref ref-type="bibr" rid="ref16">16</xref>
            ]. Note also that the Opus codec, developed in
2011 (the latest version of this codec was released in July 2016), has already been standardized by the
IETF (Internet Engineering Task Force) as standard RFC 6716 [
            <xref ref-type="bibr" rid="ref17">17</xref>
            ]. This standard combines the
technologies of such well-known codecs as Skype SILK [
            <xref ref-type="bibr" rid="ref18">18</xref>
            ] and Xipn.Org CELT [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ]. Because of
the noted advantages of the codec, Opus decided to implement it in the VoIP-PBX IP4Tel system,
which is the basis of the system, proposed here, for monitoring the quality of voice transmission across
corporate VoIP-telephony systems.
          </p>
        </sec>
        <sec id="sec-2-5-2">
          <title>5. Conclusions and future work</title>
          <p>For this study, a comparative analysis and a development of the methods for assessing the quality of
voice transmission across VoIP telephony networks was carried out. Also, a comparative analysis of
the following VoIP codecs was performed: Speex, Opus, Skype SILK and Xipn.Org CELT. This was
a comparison by the following criteria: allowable delay, allowable losses and allowable jitter (delay
variation). This comparison showed that the Opus codec is the best choice at present. Thus, the Opus
codec was chosen for the further development of the VoIP-telephony system of the Southern Federal
University.</p>
          <p>We believe that once our program of implementation of the methods proposed in this work has
been completed, the results will be of considerable interest to many organizations using corporate
VoIP telephony systems that allow remote access of subscribers through networks of third-party
telecommunications operators.</p>
        </sec>
        <sec id="sec-2-5-3">
          <title>Acknowledgements</title>
          <p>This work falls within the public tasks allotted to the Ministry of Education and Science of the Russian
Federation (2.974.2017/4.6) and was carried out with the support of grant RFBR 16-07-00218a.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Sukhov</surname>
            <given-names>A M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sultanov</surname>
            <given-names>T G</given-names>
          </string-name>
          and
          <string-name>
            <surname>Polukarov D Y 2012</surname>
          </string-name>
          <article-title>The method of available bandwidth estimation of IP-connection based on variable packet size</article-title>
          <source>Elektrosvyaz</source>
          <volume>11</volume>
          <fpage>39</fpage>
          -
          <lpage>42</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bukatov</surname>
            <given-names>A A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaitsev N D and Berezovsky A N 2017</surname>
          </string-name>
          <article-title>Methods for monitoring and improving the quality of voice transmission for the system of corporate VoIP telephony SFU Modern information technologies: trends and development prospects</article-title>
          .
          <source>Materials of the XXIV scientific conference 47-52</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Calyam</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ekici</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            <given-names>C G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haffner</surname>
            <given-names>M</given-names>
          </string-name>
          and
          <string-name>
            <surname>Howes N 2007</surname>
          </string-name>
          <article-title>A “GAP-Model” based framework for online VVoIP QoE measurement</article-title>
          <source>Journal of Communications and Networks</source>
          <volume>9</volume>
          (
          <issue>4</issue>
          )
          <fpage>446</fpage>
          -
          <lpage>456</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>107: E-model: a computational model for use in transmission planning (Access mode</article-title>
          : http://www.itu.int/rec/T-REC-G.
          <volume>107</volume>
          /) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>P.</surname>
          </string-name>
          861:
          <article-title>Objective quality measurement of telephone-band (300-3400 Hz) speech codecs (Access mode: https://www</article-title>
          .itu.int/rec/T-REC-P.861/en/) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>P.</surname>
          </string-name>
          862:
          <article-title>Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs (Access mode</article-title>
          : http://www.itu.int/rec/T-REC-P.862/en/) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>P.</surname>
          </string-name>
          563:
          <article-title>Single-ended method for objective speech quality assessment in narrow-band telephony applications (Access mode</article-title>
          : http://www.itu.int/rec/T-REC-P.
          <fpage>563</fpage>
          -200405-I/)
          <article-title>(</article-title>
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Kurittu</surname>
            <given-names>A</given-names>
          </string-name>
          2006
          <string-name>
            <surname>Validation of ITU-T P</surname>
          </string-name>
          .
          <article-title>563 single-ended objective speech quality measurement</article-title>
          <source>Journal of the Audio Engineering Society</source>
          <volume>54</volume>
          (
          <issue>11</issue>
          )
          <fpage>1092</fpage>
          -
          <lpage>1098</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Lukinskikh K S 2015 Unbiased Assessment</surname>
          </string-name>
          <article-title>Method For Voice Communication in Cloud</article-title>
          <source>VoIPTelephony Journal of Theoretical and Applied Information Technology</source>
          <volume>80</volume>
          (
          <issue>1</issue>
          )
          <fpage>75</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Gugel Yu</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Izhvanov Yu</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurakin</surname>
            <given-names>D V</given-names>
          </string-name>
          <year>2010</year>
          <article-title>Development of international connectivity of the federal computer network RUNNet Proceedings of the XVII All-Russian Scientific</article-title>
          and
          <string-name>
            <surname>Methodical Conference</surname>
          </string-name>
          "Telematics'
          <year>2010</year>
          "
          <fpage>273</fpage>
          -
          <lpage>275</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Bolot J C 1993</surname>
          </string-name>
          <article-title>Characterizing end-to-end packet delay and loss in the internet</article-title>
          <source>Journal of High Speed Networks</source>
          <volume>2</volume>
          (
          <issue>3</issue>
          )
          <fpage>305</fpage>
          -
          <lpage>323</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Karam M J and Tobagi</surname>
            <given-names>F A</given-names>
          </string-name>
          <year>2001</year>
          <article-title>Analysis of the delay and jitter of voice traffic over the internet Twentieth annual joint conference of the IEEE Computer</article-title>
          and
          <source>Communications Societies</source>
          <volume>2</volume>
          <fpage>824</fpage>
          -
          <lpage>833</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Singh</surname>
            <given-names>H P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            <given-names>S</given-names>
          </string-name>
          and
          <string-name>
            <surname>Singh</surname>
            <given-names>J 2011</given-names>
          </string-name>
          <article-title>Real time analysis of VoIP system under pervasive environment through spectral parameters</article-title>
          <source>International Journal of Computer Applications</source>
          <volume>31</volume>
          (
          <issue>2</issue>
          )
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Speex</surname>
            :
            <given-names>A Free</given-names>
          </string-name>
          <string-name>
            <surname>Codec For Free</surname>
          </string-name>
          <article-title>Speech (Access mode: https://speex</article-title>
          .org/) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>712: Transmission performance characteristics of pulse code modulation channels (Access mode: https://www</article-title>
          .itu.int/rec/T-REC-G.
          <fpage>712</fpage>
          -199209-S/en/) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Opus</given-names>
            <surname>Codec</surname>
          </string-name>
          (Access mode: http://opus-codec.
          <source>org/)</source>
          (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <article-title>RFC 6716 - Definition of the Opus Audio Codec (Access mode: https://tools</article-title>
          . ietf.org/html/rfc6716/)
          <article-title>(</article-title>
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>SILK - Wikipedia</surname>
          </string-name>
          (Access mode: https://en.wikipedia.org/wiki/SILK/)
          <article-title>(</article-title>
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <article-title>CELT: new audio codec from xipn.org (Access mode: https://www</article-title>
          .linux.org.ru/ news/ opensource/2601080/) (
          <year>2017</year>
          -21-04)
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>