=Paper=
{{Paper
|id=Vol-1623/paperapp9
|storemode=property
|title=Methods Assessment the Probability Density of Discrete Signals in Telecommunications
|pdfUrl=https://ceur-ws.org/Vol-1623/paperapp9.pdf
|volume=Vol-1623
|authors=Yuriy Kropotov,Aleksey Belov
|dblpUrl=https://dblp.org/rec/conf/door/KropotovB16
}}
==Methods Assessment the Probability Density of Discrete Signals in Telecommunications==
<pdf width="1500px">https://ceur-ws.org/Vol-1623/paperapp9.pdf</pdf>
<pre>
 Methods Assessment the Probability Density of Discrete
            Signals in Telecommunications

                                 Yuriy Kropotov, Aleksey Belov

    Murom Institute (branch) "Vladimir State University named after Alexander and Nicholay
                                  Stoletovs", Murom, Russia

                                     kaf-eivt@yandex.ru


         Abstract. This paper investigates problems and methods of modeling acoustic
         signals in information and control systems for audio-exchange communications.
         It addresses the estimation and approximation of probability density functions,
         which can help to distinguish acoustic speech signals from external acoustic
         noise. We consider direct and indirect methods, histogram estimation techniques,
         and ways of dealing with ill-posed problems.

         Keywords: Probability density, discrete signals, telecommunication systems,
         distribution function.


1        Introduction

Estimates of the distributions of speech signals and noise, like those of data of any
other nature, are based on empirical data obtained from measurements [1]. There are many
methods for obtaining such estimates; they are divided into parametric and nonparametric
methods and into direct and indirect methods.
   Parametric, or classical, methods are those in which the probability density is known
up to its parameters, that is, it has the form $f(x, \theta) = f_\theta(x)$, where
$x \in R^n$ and $\theta \in R^m$ are, respectively, the vectors of random variables and
of unknown parameters [2].
   Specifying the distribution in this way is typical, for example, of signal detection
and signal estimation problems.
   In detection problems it is assumed that the observed data belong to one of two or
more classes, each characterized by its own a priori known probability density $f_k(x)$
or, in particular, by its own set of parameters $\theta_k$, so that
$f_k(x) = f(x, \theta_k)$. The problem is then to assign the observed data to one of the
known distributions.
   In estimation problems, on the contrary, the parameter vector $\theta$ is considered
unknown, although the function $f(x, \theta)$ itself may be a known probability density.


    Copyright © by the paper's authors. Copying permitted for private and academic purposes.
    In: A. Kononov et al. (eds.): DOOR 2016, Vladivostok, Russia, published at http://ceur-ws.org


   If the function $f(x, \theta)$ is not a known probability density, the methods of
estimating the parameter vector $\theta$ are considered non-parametric. In this case the
task is one of approximating the observed data. The resulting approximating function
$f(x, \theta)$ must satisfy the constraints [1, 3]

\[ f(x, \theta) \ge 0 \quad \text{and} \quad \int_{-\infty}^{\infty} f(x, \theta)\,dx = 1. \tag{1} \]

   A clear distinction between parametric and non-parametric methods is not always
possible. Thus, the problem of approximating the data by a mixture of known distributions
with density functions $\varphi_k(x, \theta_k)$,

\[ f(x, \theta) = \sum_k a_k \varphi_k(x, \theta_k), \quad \sum_k a_k = 1, \]

is more appropriately classified as non-parametric. However, if the coefficients
$a_k \ge 0$ are known, the task can be regarded as parametric. Non-parametric problems
also include least-squares problems and linear and nonlinear regression. Methods for
solving such problems are also called projection methods. It should be noted that the
above definition of non-parametric methods is used only in mathematical statistics. In
systems theory, optimization and approximation theory such methods are, on the contrary,
called parametric [4, 7], because the task is to find a finite number of unknown
parameters.
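
   As a rough illustration of a mixture density and of the constraints (1), the sketch
below builds a two-component normal mixture and checks numerically that it is
non-negative and integrates to one. The component parameters, the weights and the helper
names normal_pdf and mixture_pdf are illustrative assumptions, not values from the paper.

```python
import math

def normal_pdf(x, mean, std):
    """Density of a normal distribution (used here as the known component)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def mixture_pdf(x, weights, params):
    """f(x) = sum_k a_k * phi_k(x, theta_k), with theta_k = (mean, std)."""
    return sum(a * normal_pdf(x, m, s) for a, (m, s) in zip(weights, params))

# Hypothetical mixture: weights a_k >= 0 that sum to one.
weights = [0.4, 0.6]
params = [(-1.0, 0.5), (1.5, 0.7)]

# Trapezoidal check of constraints (1) on a wide grid.
lo, hi, steps = -10.0, 10.0, 4000
dx = (hi - lo) / steps
values = [mixture_pdf(lo + i * dx, weights, params) for i in range(steps + 1)]
integral = dx * (sum(values) - 0.5 * (values[0] + values[-1]))
is_nonnegative = min(values) >= 0.0
```

   Because every component is itself a density and the weights sum to one, both
constraints hold for any admissible choice of parameters; the grid check only confirms
this for the example.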


2        Direct and indirect methods of estimating the probability
         density

In a number of studies, methods of estimating the probability density are divided into
direct and indirect. The hallmark of direct methods is that they use a direct relation
between the required density and the empirical data. For example, direct methods include
those based on solving the integral equation that relates the probability density to the
empirical distribution function,

\[ \int_{-\infty}^{\infty} I(x - v) f(v)\,dv = F_n(x), \tag{2} \]

where $F_n(x)$ is the empirical distribution function, which is a step function. The
solution of equation (2) gives the desired estimate of the probability density.
The empirical distribution function is given by

\[ F_n(x) = \frac{1}{N} \sum_{l=1}^{N} I_{(-\infty, x]}(x_l), \tag{3} \]

where $I_{(-\infty, x]}(x_l)$ is the indicator of the set $(-\infty, x]$,

\[ I_{(-\infty, x]}(x_l) = \begin{cases} 1, & x_l \in (-\infty, x] \\ 0, & x_l \notin (-\infty, x] \end{cases} \]

and $N$ is the sample size.
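
   The empirical distribution function (3) can be sketched in a few lines. The helper
name empirical_cdf and the sample values are hypothetical; the code only counts how many
sample points fall in $(-\infty, x]$ and divides by the sample size.

```python
import bisect

def empirical_cdf(sample):
    """Return F_n as a function: the share of sample values x_l <= x."""
    data = sorted(sample)
    size = len(data)

    def f_n(x):
        # bisect_right counts the values x_l with x_l <= x, i.e. the sum
        # of the indicators I_(-inf, x](x_l).
        return bisect.bisect_right(data, x) / size

    return f_n

sample = [0.3, -1.2, 0.3, 2.0, 0.9]   # hypothetical measurements
F = empirical_cdf(sample)
```

   The returned function is a step function with jumps at the sample points, which is
why differentiating it directly does not give a usable density estimate.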


   Solving equation (2) with the function (3) belongs to the class of ill-posed problems
and requires special techniques. The ill-posedness is especially pronounced for small
sample sizes [5]. Yet the need to recover the probability density from a limited amount
of data arises frequently, for example in the analysis and segmentation of nonstationary
signals, in particular speech signals, whose statistical characteristics can be
considered constant only over intervals of similar sounds.
   Unlike direct methods, indirect methods are based on minimizing an average-risk
functional of the form

\[ R(\theta) = \int Q(x, \theta)\,dF(x), \]

or the corresponding empirical functional

\[ R_n(\theta) = \frac{1}{N} \sum_{l=1}^{N} Q(x_l, \theta). \]
   By this criterion, indirect methods include, for example, the maximum likelihood
method [6].
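
   To make the link between the empirical risk and maximum likelihood concrete, the
sketch below assumes the loss $Q(x, \theta) = -\ln f(x, \theta)$ and a normal model with
$\theta = (\text{mean}, \text{std})$. Both choices, as well as the data, are assumptions
made for illustration; the paper does not fix a particular $Q$.

```python
import math

def neg_log_normal(x, mean, std):
    """Loss Q(x, theta) = -ln f(x, theta) for a normal density (an assumption)."""
    z = (x - mean) / std
    return 0.5 * z * z + math.log(std) + 0.5 * math.log(2.0 * math.pi)

def empirical_risk(sample, mean, std):
    """R_n(theta) = (1/N) * sum_l Q(x_l, theta)."""
    return sum(neg_log_normal(x, mean, std) for x in sample) / len(sample)

sample = [0.8, 1.9, 1.1, 2.4, 1.3, 0.5, 1.7]   # hypothetical data

# For the normal model the minimizer of R_n is known in closed form.
mean_ml = sum(sample) / len(sample)
std_ml = math.sqrt(sum((x - mean_ml) ** 2 for x in sample) / len(sample))

risk_at_ml = empirical_risk(sample, mean_ml, std_ml)
risk_shifted = empirical_risk(sample, mean_ml + 0.3, std_ml)
risk_wider = empirical_risk(sample, mean_ml, 1.5 * std_ml)
```

   With this loss, minimizing the empirical risk is the same as maximizing the
log-likelihood, so any perturbation of the closed-form estimates increases the risk.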
   In principle, other methods are direct as well, such as histogram techniques and
methods based on approximating the $\delta$-function by a regular function in the
expression

\[ f(x) = \int_{-\infty}^{\infty} \delta(x - v) f(v)\,dv. \tag{4} \]

   However, a clear distinction between direct and indirect methods is, in general, not
always possible, because in both cases finding the density estimate can be reduced, in
one way or another, to minimizing a functional of the empirical data, in particular of
the empirical distribution function.


3       On kernel and projection estimates of the probability density

The kernel method of density estimation is based on approximating the $\delta$-function
under the integral sign in (4) by a function $K(x)$ defined on some interval of the
argument. This function must satisfy the condition

\[ \lim_{h \to 0} \frac{1}{h} K\!\left(\frac{x}{h}\right) = \delta(x). \]

   The following expressions are frequently used as the function $K(x)$:

\[ K(x) = \begin{cases} 1/2, & |x| \le 1 \\ 0, & |x| > 1 \end{cases}, \quad
   K(x) = \frac{1}{\pi (x^2 + 1)}, \quad
   K(x) = \frac{1}{\sqrt{\pi}} e^{-x^2}, \quad
   K(x) = \frac{1}{\pi} \frac{\sin x}{x}. \]


   After this substitution the right-hand side of equation (4) is the expectation of the
function $\frac{1}{h} K\!\left(\frac{x - v}{h}\right)$, which can be replaced by the
empirical mean. If we take into account that the parameter $h$ is chosen depending on the
sample size, the probability density estimate in accordance with equation (4) can be
written as

\[ \hat{f}(x) = \frac{1}{n h(n)} \sum_{l=1}^{n} K\!\left(\frac{x - x_l}{h(n)}\right). \tag{5} \]

   The convergence of this estimate to the desired density is ensured by the conditions:
1) $h(n) \to 0$ as $n \to \infty$, and 2) for any number $a > 0$ the inequality

\[ \sum_{n=1}^{\infty} e^{-a n h(n)} < \infty \]

holds.
   It follows from the definition of the function $K(x)$ that decreasing the parameter
$h$ increases the accuracy with which the $\delta$-function is approximated, but at the
same time increases the chance that the estimate is erroneously assigned to the class of
multimodal densities. Conversely, increasing this parameter may lead to the estimate
being erroneously assigned to the unimodal densities. The resulting problem of choosing
the parameter $h$ stems from the ill-posedness of the density estimation problem and for
this reason has no unique solution. We can only assert that estimating unimodal
distributions requires larger values of $h$ than estimating multimodal ones.
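
   The effect of the parameter $h$ on the estimate (5) can be sketched as follows. The
Gaussian-type kernel, the synthetic bimodal sample, the grid and the two values of $h$
are all assumptions chosen for illustration; count_modes is a hypothetical helper that
counts local maxima of the estimate on a grid.

```python
import math
import random

def gaussian_kernel(u):
    """A Gaussian-type kernel normalized to integrate to one."""
    return math.exp(-u * u) / math.sqrt(math.pi)

def kde(x, sample, h):
    """Estimate (5): (1 / (n h)) * sum_l K((x - x_l) / h)."""
    n = len(sample)
    return sum(gaussian_kernel((x - xl) / h) for xl in sample) / (n * h)

def count_modes(sample, h, lo=-4.0, hi=5.0, steps=900):
    """Count local maxima of the estimate on a regular grid."""
    dx = (hi - lo) / steps
    ys = [kde(lo + i * dx, sample, h) for i in range(steps + 1)]
    return sum(1 for i in range(1, steps) if ys[i - 1] < ys[i] >= ys[i + 1])

# Hypothetical bimodal sample: mixture of N(-1, 0.5^2) and N(1.5, 0.5^2).
rng = random.Random(0)
sample = [rng.gauss(-1.0, 0.5) if rng.random() < 0.5 else rng.gauss(1.5, 0.5)
          for _ in range(100)]

modes_small_h = count_modes(sample, h=0.05)   # undersmoothed estimate
modes_large_h = count_modes(sample, h=3.0)    # heavily smoothed estimate
```

   A very small $h$ produces many spurious peaks, while a very large $h$ merges the two
true modes, which is the trade-off described above.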
   Equation (4) can also be used when estimating the probability density by the
projection method. In this method the unknown probability density is represented by a
polynomial in a system of normalized orthogonal functions $\{\varphi_k(x)\}_1^m$, and
the estimate has the form [1]

\[ \hat{f}(x) = \sum_{k=1}^{m} a_k \varphi_k(x). \tag{6} \]
   Substituting this polynomial into (4) gives

\[ f(x) = \sum_{k=1}^{m} a_k \varphi_k(x) \quad \text{and} \quad
   a_k = \int_c^d \varphi_k(x) f(x)\,dx \approx \frac{1}{n} \sum_{l=1}^{n} \varphi_k(x_l). \]

   Substituting this expression into (6) leads to the formula

\[ \hat{f}(x) = \frac{1}{n} \sum_{l=1}^{n} \sum_{k=1}^{m} \varphi_k(x) \varphi_k(x_l). \]

   Finally, if the kernel function $K(x, x_l) = \sum_{k=1}^{m} \varphi_k(x) \varphi_k(x_l)$
is introduced, the density estimate takes a form similar to (5):

\[ \hat{f}(x) = \frac{1}{n} \sum_{l=1}^{n} K(x, x_l). \]
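
   A minimal sketch of the projection estimate (6) is given below. It assumes the
interval $[c, d] = [0, 1]$ and an orthonormal cosine system as the functions
$\varphi_k$; the basis, the number of terms and the synthetic sample are illustrative
choices, not the ones used in the paper.

```python
import math
import random

def phi(k, x):
    """Orthonormal cosine system on [0, 1]: phi_1 = 1, phi_k = sqrt(2) cos((k-1) pi x)."""
    if k == 1:
        return 1.0
    return math.sqrt(2.0) * math.cos((k - 1) * math.pi * x)

def projection_coefficients(sample, m):
    """a_k ~ (1/n) * sum_l phi_k(x_l), the empirical mean replacing the integral."""
    n = len(sample)
    return [sum(phi(k, xl) for xl in sample) / n for k in range(1, m + 1)]

def projection_estimate(x, coeffs):
    """Estimate (6): sum_k a_k * phi_k(x)."""
    return sum(a * phi(k, x) for k, a in enumerate(coeffs, start=1))

# Hypothetical sample on [0, 1].
rng = random.Random(1)
sample = [rng.betavariate(2.0, 5.0) for _ in range(500)]
coeffs = projection_coefficients(sample, m=6)

# The estimate integrates to one because only phi_1 has a non-zero integral
# and its coefficient equals one.
steps = 1000
dx = 1.0 / steps
ys = [projection_estimate(i * dx, coeffs) for i in range(steps + 1)]
integral = dx * (sum(ys) - 0.5 * (ys[0] + ys[-1]))
```

   Note that such an estimate is not guaranteed to be non-negative everywhere, so the
first of the constraints (1) may need to be enforced separately.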
   The use of projection methods in which the estimate is represented by formula (6) is
not limited to the case considered. There are problems that rely equally on projection
methods and on the integral equation (2). In one such approach the estimate is built
from smoothed data, the smoothing being provided by a non-degenerate linear operator of
the form

\[ B g(x) = \int_c^d K(x, v) g(v)\,dv. \]
   Applying this operator to (2) leads to the equation

\[ G f(x) = Q_n(x), \tag{7} \]

where

\[ G f(x) = \int_c^d K(x, z) \int_c^d I(z - v) f(v)\,dv\,dz, \]

\[ Q_n(x) = B F_n(x) = \frac{1}{n} \sum_{l=1}^{n} \int_{x_l}^{d} K(x, v)\,dv, \]

and (6) is an expansion in the eigenfunctions $\{\varphi_k(x)\}_1^m$ of the operator
$G^H G$.


   Because equation (7) is ill-posed, its solution is reduced to the problem of
minimizing the functional

\[ J(\hat{f}) = \int_c^d \left( G \hat{f}(x) - Q_n(x) \right)^2 dx
   + \alpha_j \int_c^d \hat{f}^2(x)\,dx. \tag{8} \]

   It can be shown that this functional attains its minimum when the coefficients of
the polynomial (6) are

\[ a_k = \frac{\lambda_k b_k}{\lambda_k^2 + \alpha_j}, \]

where $b_k = \int_c^d Q_n(x) \varphi_k(x)\,dx$, and $\varphi_k(x)$, $\lambda_k$ are the
eigenfunctions and eigenvalues of the operator $G^H G$.
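
   The role of the regularization term in (8) can be seen from a small numerical sketch.
It reads the coefficient formula above as $a_k = \lambda_k b_k / (\lambda_k^2 + \alpha)$
and compares it with the unregularized ratio $b_k / \lambda_k$. The eigenvalues, the
projections $b_k$ and the value of $\alpha$ are invented for illustration.

```python
def regularized_coefficient(lam, b, alpha):
    """a_k = lam * b / (lam^2 + alpha): the damped solution coefficient."""
    return lam * b / (lam * lam + alpha)

# Hypothetical spectrum: eigenvalues decay, projections b_k carry small noise.
lams = [1.0, 0.3, 0.05, 0.001]
bs = [0.50, 0.12, 0.02, 0.001]
alpha = 1e-3   # regularization parameter, chosen arbitrarily here

# Without regularization (alpha = 0) small eigenvalues amplify the noise.
naive = [b / lam for lam, b in zip(lams, bs)]
damped = [regularized_coefficient(lam, b, alpha) for lam, b in zip(lams, bs)]
```

   Components with $\lambda_k^2 \gg \alpha$ are almost unchanged, while components with
$\lambda_k^2 \ll \alpha$ are strongly suppressed, which is what stabilizes the solution
of the ill-posed equation.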
   In the particular case when the kernel of the operator $B$ has the form
$K(x, v) = K(x - v)$, the operator becomes a convolution,

\[ B f(x) = \int_c^d K(x - v) f(v)\,dv. \]

   This makes it possible to use the Fourier transform in minimizing the functional (8)
[1, 6, 7]. The density estimate then takes the form

\[ \hat{f}(x) = \frac{1}{n} \sum_{l=1}^{n} g_{\alpha_j}(x - x_l). \]

   Here the function

\[ g_{\alpha_j}(u) = \frac{1}{2\pi} \int_{-\infty}^{\infty} g(\omega) e^{j \omega u}\,d\omega \]

is the inverse Fourier transform of

\[ g(\omega) = \frac{K(\omega) K^{*}(\omega)}{K(\omega) K^{*}(\omega) + \alpha_j \omega^2},
   \qquad K(\omega) = \int_{-\infty}^{\infty} K(u) e^{-j \omega u}\,du. \]


4       Histogram estimation of the probability density

A histogram is a bar chart of the distribution of a random variable. The height of each
bar represents the number of values of the random variable falling within the
corresponding interval; the intervals generally have different widths (see Fig. 1).
The ratio of the number $n_l$ of values of the random variable in the interval
$(x_{l-1}, x_l]$ to the total number of values $N$ is the empirical probability of the
event $x \in (x_{l-1}, x_l]$.
        [Figure: histogram; vertical axis shows counts from 0 to 3500, horizontal axis
        runs from -2 to 3]

                   Fig. 1. Histogram of a mixture of two normal distributions

The theoretical value of this probability is expressed through the probability density as

\[ P\{x \in (x_{l-1}, x_l]\} = \int_{x_{l-1}}^{x_l} f(x)\,dx. \]

   If we equate the theoretical and empirical probabilities and assume that the change
in the probability density within each interval can be neglected, the density estimate
can be written as

\[ f_l = \frac{n_l}{\Delta x_l N}, \quad l = 1, \ldots, q, \tag{9} \]

where $\Delta x_l = x_l - x_{l-1}$ is the length of the $l$-th interval.
When the range $(c, d]$ of the random variable is split into $q$ equal intervals of
length $\Delta x_l = (d - c)/q$, formula (9) can be written as

\[ f_l = \frac{n_l q}{(d - c) N}, \quad l = 1, \ldots, q. \tag{10} \]
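
   Formula (10) can be sketched directly. The function name histogram_density and the
data are hypothetical; the intervals are half-open on the left, $(x_{l-1}, x_l]$, as in
the text.

```python
import math

def histogram_density(sample, c, d, q):
    """Estimates f_l = n_l * q / ((d - c) * N) on q equal intervals of (c, d]."""
    width = (d - c) / q
    counts = [0] * q
    for x in sample:
        if c < x <= d:
            # Index of the interval (x_{l-1}, x_l] that contains x,
            # clamped to guard against rounding at the end points.
            idx = int(math.ceil((x - c) / width)) - 1
            counts[min(max(idx, 0), q - 1)] += 1
    total = len(sample)
    return counts, [n_l * q / ((d - c) * total) for n_l in counts]

# Hypothetical data on (0, 2] split into q = 4 intervals of length 0.5.
sample = [0.1, 0.4, 0.45, 0.5, 0.9, 1.0, 1.2, 1.6, 1.95, 2.0]
counts, density = histogram_density(sample, c=0.0, d=2.0, q=4)
area = sum(density) * (2.0 - 0.0) / 4
```

   When all sample values lie inside $(c, d]$, the estimates multiplied by the interval
length sum to one, so the piecewise-constant estimate satisfies the constraints (1).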


   The values obtained by formula (9) or (10) can be used to approximate the
probability density. The nodes corresponding to these estimates can, to a first
approximation, be found from the expressions

\[ \bar{x}_l = \frac{1}{n_l} \sum_{x_i \in (x_{l-1}, x_l]} x_i, \quad l = 1, \ldots, q. \]

   The graph of the approximated probability density is built from the points
$(\bar{x}_l, f_l)$, $l = 1, \ldots, q$, in the coordinates $x$, $f$. Depending on the
amount of data, either numerical interpolation or function approximation techniques can
be used. In both cases the problem is to construct a polynomial $P(x, a)$ in the system
of functions $\varphi_i(x)$, $i = 1, \ldots, m$,

\[ P(x, a) = \sum_{i=1}^{m} a_i \varphi_i(x) = a^T \varphi(x), \]

where $a = (a_1, \ldots, a_m)^T$ and
$\varphi(x) = (\varphi_1(x), \ldots, \varphi_m(x))^T$.
   In the case of interpolation the vector $a$ is the solution of the system of
equations

\[ a^T \varphi(\bar{x}_l) = f_l, \quad l = 1, \ldots, q = m. \]

   In matrix form this system is

\[ \Phi^T a = f, \tag{11} \]

where $f = (f_1, \ldots, f_m)^T$ and
$\Phi = (\varphi(\bar{x}_1), \varphi(\bar{x}_2), \ldots, \varphi(\bar{x}_m))$.

   The generalized method of local interpolation can also be used to obtain the
estimate. In this method a sequence of systems of the form (11) is defined for the
corresponding sequence of interpolation intervals. These systems are supplemented by
constraints that provide the required matching conditions between the local solutions,
and the order of the polynomial need not match the number of points $(\bar{x}_l, f_l)$,
that is, $q \ne m$.
   Approximating the probability density by smoothing is a least-squares problem. The
task is to minimize the sum of squared residuals between the smoothing polynomial and
the density estimates $f_l$ at the nodes $\bar{x}_l$. The functional to be minimized is
written as

\[ J(a) = \sum_{l=1}^{q} \left( a^T \varphi(\bar{x}_l) - f_l \right)^2. \tag{12} \]
   To smooth the data one can, as with interpolation, use local approximation methods,
generalizing them as required, in particular to obtain smoothly joined polynomials that
are defined on a sequence of intervals and minimize functionals of the form (12) under
the constraints set by the matching conditions.
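
   The least-squares problem (12) and its interpolation limit (11) can be sketched as
follows. The monomial basis, the histogram points and the helper names are assumptions
for illustration; the fit is obtained from the normal equations with a small Gaussian
elimination routine so that the example needs no external libraries.

```python
def basis(x, m):
    """phi(x) = (1, x, ..., x^(m-1)), a hypothetical choice of functions phi_i."""
    return [x ** i for i in range(m)]

def solve_linear(a, b):
    """Solve a small dense system by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def least_squares_fit(xs, fs, m):
    """Minimize sum_l (a^T phi(x_l) - f_l)^2 through the normal equations."""
    rows = [basis(x, m) for x in xs]
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(m)] for i in range(m)]
    rhs = [sum(r[i] * f for r, f in zip(rows, fs)) for i in range(m)]
    return solve_linear(gram, rhs)

def residual_sum(a, xs, fs):
    """Value of the functional J(a) from (12)."""
    return sum((sum(ai * p for ai, p in zip(a, basis(x, len(a)))) - f) ** 2
               for x, f in zip(xs, fs))

# Hypothetical histogram points (x_l, f_l).
xs = [-1.5, -0.5, 0.5, 1.5, 2.5]
fs = [0.10, 0.30, 0.25, 0.28, 0.07]

a_smooth = least_squares_fit(xs, fs, m=3)          # q > m: smoothing
a_interp = least_squares_fit(xs, fs, m=len(xs))    # q = m: interpolation
```

   With $q = m$ the residual vanishes and the fit reproduces every point, including
sampling noise; with $q > m$ the polynomial smooths the estimates at the cost of a
non-zero residual.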
   Histogram methods [1, 6] of estimating the probability density, especially those
based on interpolation, share an inherent problem: the set of values of the random
variable has to be partitioned into intervals, which is difficult for small sample
sizes. Fig. 2 a, b shows two histograms of the same mixture of normal distributions as
in Fig. 1, built from a sample of 100 values.


Fig. 2. Histograms of a mixture of normal distributions obtained by partitioning the range of
values into (a) 20 and (b) 10 intervals

The figure shows that when the set of values of the random variable is partitioned into
20 intervals (Fig. 2 a), the interpolation approach does not recover the true shape of
the distribution and does not allow correct conclusions to be drawn. The situation
improves when the range is split into 10 intervals (Fig. 2 b): the graph is then similar
in shape to the true bimodal probability density. In principle, this problem can be
solved by an adaptive partition of the set of values of the random variable into
intervals that are not necessarily of equal length. The optimal partition is then found
by varying the lengths and centers of the intervals and comparing the results, possibly
followed by averaging them.
   With local approximation, including generalized local approximation, the partition
problem is less acute and is, on the contrary, a matter of providing enough intervals
for smoothing. However, a new question arises: the choice of an algorithm that ensures
the optimal degree of smoothing of the empirical estimates (9). In principle, this
question can be resolved by varying the free parameters of the algorithm and then
selecting the best variant according to some evaluation criterion.
   Another problem for interpolation and smoothing methods is ensuring that the
estimate $\hat{f}(x)$ belongs to the class of probability density functions. Within
local approximation these conditions can be taken into account by introducing the
corresponding constraints into the problem of minimizing the functional (12), and within
the interpolation approach by varying the lengths and centers of the partition
intervals.
   Finding the coefficients of the polynomial (6) by optimization methods is a linear
regression problem. In practice, however, these polynomials are often built on systems
of standard probability densities that depend nonlinearly on a certain set of
parameters. In this case, introducing the parameter vector
$\theta = (\theta_1, \ldots, \theta_r)^T$, one can define the polynomial

\[ P(x, a, \theta) = \sum_{k=1}^{m} a_k \varphi_k(x, \theta) = a^T \varphi(x, \theta), \]

where $\varphi(x, \theta) = (\varphi_1(x, \theta), \ldots, \varphi_m(x, \theta))^T$.
    Accordingly, the density estimate can be written as

\[ \hat{f}(x) = \sum_{k=1}^{m} \hat{a}_k \varphi_k(x, \hat{\theta})
   = \hat{a}^T \varphi(x, \hat{\theta}), \]

    where the parameter estimates are the solution of the minimization problem

\[ (\hat{a}, \hat{\theta}) = \arg\min_{a, \theta}
   \sum_{l=1}^{q} \left( a^T \varphi(\bar{x}_l, \theta) - f_l \right)^2. \]

   Finding the parameter vectors $a$ and $\theta$ in this case is a nonlinear problem,
which is usually solved by constrained optimization methods.


5       Conclusion

This paper studies direct and indirect methods of estimating the probability density of
acoustic signals and interference in information and control telecommunication systems.
Kernel and projection models of probability density estimation, based on approximating
the probability density of signals, are examined for unimodal and multimodal
distributions. Applying the histogram method to a mixture of normal distributions shows
that it is not always possible to recover the true shape of the distribution when the
set of values of the random variable is partitioned into different numbers of
intervals. A solution is provided by an adaptive optimal partition obtained by varying
the lengths and centers of the partition intervals.


References
 1. Kropotov Y.A., Paramonov A.A. Methods of designing information processing telecom-
    munications systems sharing audio algorithms: monograph. Moscow-Berlin: Direct Media,
    2015. 226 p. (in Russian).
 2. Kropotov Y.A. The time interval determine the probability distribution of the amplitude of
    the speech signal law. Radiotekhnika, 2006. № 6. pp. 97-98 (in Russian).
 3. Ermolaev V.A., Kropotov Y.A. About correlation estimating model parameters of acoustic
    echo. Questions electronics. Vol. 1. № 1. pp. 46-50 (in Russian).
 4. Kropotov Y.A., Bykov A.A. Algorithm acoustic noise suppression and interference with
    concentrated formant distribution rejection bands. Questions electronics, 2010. Vol. 1.
    № 1. pp. 60-65 (in Russian).
 5. Kropotov Y.A., Bykov A.A. Approximation of law probability distribution of acoustic
    noise signal samples. Radio engineering and telecommunication systems, 2011. № 2.
    pp. 61-67 (in Russian).
 6. Ermolaev V.A., Eremenko V.T., Karasev O.E., Kropotov Y.A. Identification of model of
    discrete linear systems with variable, slowly varying parameters. Radio Engineering and
    Electronics, 2010. Vol. 55. № 1. pp. 57-62 (in Russian).
 7. Ermolaev V.A., Karasev O.E., Kropotov Y.A. Interpolation method filtration in problems
    of speech signal processing in the time domain. Journal of Computer and Information
    Technology, 2008. № 7. pp. 12-17 (in Russian).


</pre>