=Paper=
{{Paper
|id=Vol-1427/paper6
|storemode=property
|title=A Narrowband Sound Signal Frequency Estimation with Impulsive Noise Filtering
|pdfUrl=https://ceur-ws.org/Vol-1427/paper6.pdf
|volume=Vol-1427
}}
==A Narrowband Sound Signal Frequency Estimation with Impulsive Noise Filtering==
<pdf width="1500px">https://ceur-ws.org/Vol-1427/paper6.pdf</pdf>
<pre>
                     A Narrowband Sound Signal Frequency Estimation
                             with Impulsive Noise Filtering

                                            Iurii Chyrka
   Mathematical Methods for Sensor Information Processing Department of Institute of Information and
                  Communication Technologies of Bulgarian Academy of Sciences
                          25 A, Acad. G. Bonchev str., 1113 Sofia, Bulgaria
                                       Yurasyk88@gmail.com

                          Abstract                                    estimation in such situation. Hence there is need to
                                                                      recover an original data from the degraded observations
      The new stochastic approach of impulsive                        before main processing stage.
      noise filtering, based on the input sample                         Filtering of an impulsive noise generally gives
      separation by modified clusterization criterion,                positive effect not only on frequency estimation, but
      is proposed. It is shown that preliminary                       also for NSI algorithms. The acoustical array is a
      filtration by the proposed procedure provides                   multichannel system and almost simultaneous
      robust narrowband sound frequency estimation                    occurrence of impulses in many different channels can
      and eliminates failures of the estimation                       completely corrupt calculation results.
      algorithm caused by the impulsive noise.
                                                                      2. Problem Statement
 1. Introduction
                                                                      In view of NSI application, attention in the paper is
    Digital audio systems are widely applied today in                 paid to a class of narrowband signals which parameters
 many areas of human activity. One of the biggest class               are changed slowly in time. The problem of the
 of them are acoustic arrays that perform measurement                 instantaneous frequency estimation can be interpreted
 and processing of sound field. One of the key branch in              as estimation on the limited observation interval
 digital sound processing is Noise Source Identification              (usually less than two periods of signal) during which
 (NSI) techniques. They play an important role in the                 the parameters are changed slowly and a narrowband
 acoustic camera, which is aimed to locate and                        signal is considered as a harmonic one.
 characterize sound sources. It fuses images, received by             For description of a mixture of a digital narrowband
 the sensors in a complex image. This image consists of               signal si with a white Gaussian noise ηi of power σ2 and
 a camera image as a background and contour lines                     impulsive noise ζ the following typical additive model
 describing sound field as a foreground [Bil76].                      of a data sample is used:
    Most of NSI techniques have a “narrowband” nature.                   xi = si + ηi + υ i ζ = ρ i sin(ωi τ(i − 1) + ϕ 0 ) + ηi + υi ζ ,
 This means that the sound map must be recalculated for
 each frequency of interest. Usually calculations are                     i = 1, N ,                                        (1)
 performed for the predefined set of bounds and                       where υi is a sign function υi =sgn(pi), that can get
 corresponding center frequencies. The difference                     values –1, 1, 0 with corresponding probabilities pζ/2,
 between actual frequency and defined one leads to                    pζ/2, 1–pζ/2; pζ is an impulsive noise appearance
 inaccurate results. Therefore, knowledge of the original             probability; ζ is an impulsive noise amplitude
 signal frequency is important for precise estimation of              (considered as constant for all impulses); ρi is a signal
 sound field parameters.                                              instantaneous amplitude; ϕ0 is an initial phase; ωi is an
    The estimate can be easily obtained in the steady                 instantaneous angular frequency; τ is a sampling
 state from long enough Fast Fourier Transform, but in                interval, N is a sample size. Further, the normalized
 non-stationary case it must be estimated instantly in                frequency γ=ωτ is used to omit τ. Such the mixture
 real time. For fast, effective and precise estimation of             model is close to a Bernoulli–Gaussian model of an
 any instantaneous parameter, the data sample must be                 impulsive noise process [Vas08].
 as short as possible; therefore, appearance of impulsive             The mixture sample (1) can be represented by the next
 noise has the most substantial influence on the                      probability density function
                                                                          f Ξ (x n | γ , ρ, ϕ 0 , σ, p ζ , ζ , υ n ) = (1 − p ζ ) ⋅ f η (x n | γ , ρ, ϕ 0 , σ )
Copyright © 2015 for the individual papers by the papers' authors.                                                                                                (2)
Copying permitted only for private and academic purposes.
                                                                          + p ζ ⋅ f ηζ (x n | γ , ρ, ϕ 0 , σ, ζ , υ n ).
In: A. Bădică, M. Colhon (eds.): Proceedings of the 2015 Balkan       It corresponds to a Tukey-Huber model, which is the
Conference on Informatics: Advances in ICT                            mostly used one for investigation of robust methods. It

                                                                     40
supposes that majority of sample counts have an                  3.2 Impulse noise filtering techniques
expected distribution fη and some of counts belong to
fηζ. The latter ones are outliers that produce “tails” of            Conventional global filtering approach like a low-
the distribution. The aim of the paper is describing of          pass filtering assumes that both corrupted and
the robust instantaneous frequency estimation                    uncorrupted samples must be processed. Median filters
procedure with removing of distorting outliers in (2).           and other order statistics filters, that process a localized
                                                                 area, also typically modify uncorrupted samples as the
                                                                 transversal filtering is applied uniformly over the whole
3. Methodological Basis                                          signal [Vas08]. In addition, median filter eliminates
                                                                 changes in the input signal with a duration less than a
3.1 Brief description of NSI techniques                          half size of the filter window and does not properly
                                                                 filter a set of consequent impulses longer than a half
   The NSI techniques can be divided onto two
                                                                 size. Some modifications of the classic median filter,
categories: near-field acoustic holography (NAH) and
                                                                 that eliminate some its disadvantages, were developed
beamforming [Bai13]. NAH is aimed to reconstruction
                                                                 [Geo11].
of a sound field in the 3D space. Beamforming gives a
                                                                     Some detection methods perform processing in time
map of a sound intensity by measurement of the signal
                                                                 and frequency domains simultaneously, for example
response from a variety of directions. The main concept
                                                                 wavelet-based method in [Non08]. Yet another filtering
of the both NSI techniques is the next: the sound
                                                                 approach uses fuzzy impulse detection, but mostly for
pressures measured by the microphones (more rarely –
                                                                 images [Sch06].
particle velocities, captured by probes) are processed
                                                                     Impulsive noise usually distorts a relatively small
by an imaging algorithm of either type to calculate an
                                                                 amount of total counts in the sample. Since usually a
acoustic map of the sound pressure or sound intensity
                                                                 relatively large part of the signal counts remain
with a snap to physical coordinates.
                                                                 unaffected by the impulsive noise. Hence, it is
   When NAH performs near-field imaging of noise
                                                                 advantageous to replace only the noisy ones, leaving
sources, beamforming generally works in far field.
                                                                 the uncorrupted counts unchanged. This ideology is
Unlike beamforming, that carries out spatial filtering
                                                                 implemented by the another approach that performs
and maximizes the signal power from certain direction,
                                                                 model-based two-stage filtering by using a linear
NAH provides, based on measurements over a two-
                                                                 prediction system [Esq02]. For audio signals, the most
dimensional aperture, a reconstruction of the three-
                                                                 often used models are autoregressive (AR) or
dimensional sound field from the source’s boundary out
                                                                 autoregressive moving average (ARMA) [Oud14]. In
to the far field. Precision of reconstruction depends
                                                                 this case, the system consists of two main parts:
mostly on microphone spacing distance, sound
                                                                 detector and interpolator that perform individual
frequency and distance between source surface and
                                                                 processing of the each element of the data set.
measurement plane. NAH operates in a low frequency
                                                                     Another independent class of methods is stochastic
range, upper boundary of which is limited by a distance
                                                                 ones [Bas88]. They are close to model-based ones, but
between microphones. On the other hand, beamforming
                                                                 the statistical properties of data samples are used. A
works in the full frequency range but its use is
                                                                 similar approach is proposed in the paper for pre-
reasonable only at high frequencies when it gives better
                                                                 filtering before frequency estimation.
resolution than holography.
   Beamforming provide the best performance on the
irregular arrays, which geometry (positions of                   3.3 Frequency Estimation
microphones) is optimized in order to get the lowest                In this paper frequency estimation is carried out with
possible level of false responses. As opposite to                using of the algorithm mentioned in the previous work
beamforming, NAH is usually performed on a regular               [Pro12]. It has been synthesized with using an AR
array grid that is also the requirement for the classic          model for a single-tone harmonic signal mixed with a
Fourier NAH. Many modern NAH approaches perform                  white Gaussian noise:
calculation in the time domain and usually do not                            si = αsi −1 − si− 2 = 2 cos( γ ) si −1 − si− 2 , (3)
require location of microphones on some equidistant
positions. Hence, it allows reconstructing of sound              where α=2cos(γ) is a parameter of auto-regression.
fields even with irregular array geometries.                        The algorithm is based on the solution of the
                                                                 quadratic equation


                                                            41
                               α 2 − 2Bα − 2 = 0 ,                       (4)        minimal sum of cluster variances, which for a two-
with coefficient B calculated on the basis of the input                             component sample can be written as:
signal sample (x ) as:                                                                                           { (          )      (       )}
                                                                                            y ( opt ) = arg min σ l2 y ( v −1), N + σ r2 y ( v ), N .
                                                                                                            v
                                                                                                                                                      (6)
                   (xi+1 + xi −1 )2 − 2xi2 / 2 (xi+1 xi + xi xi−1 ) .(5)
              N −1                              N −1
 B(x ) =  ∑[  i =2
                                       ] ∑
                                               i =2                  
                                                                                       In order to increase sensitivity to presence of the
                                                                                    second “impulsive” subsample the modified criterion is
   The              equation          (4)       has      two          roots         considered:
α ∗ ( + , − ) = B ± B 2 + 2 . Finally, frequency is estimated
as γ * = arccos (α ∗( + ) / 2) .
                                                                                                                 { (          )      (       )}
                                                                                             y ( opt ) = arg min σ l4 y ( v −1), N + σ r4 y ( v ), N .
                                                                                                            v
                                                                                                                                                       (7)

   Generally, this algorithm is robust in many cases                                   It allows correct treatment of mixtures that contain
with only the impulsive noise and without the Gaussian                              small amount of impulses.
one [Pro09], but it becomes highly sensitive when                                      Finally, the proposed impulse filtering procedure can
Gaussian and impulsive noises occur simultaneously                                  be represented in the form of four consequent steps of
and are multiplied during calculations. Therefore,                                  calculations. On the first step of processing in order to
removing of impulses is a necessary condition before                                facilitate detection of the sample with bipolar pulse it is
estimation procedure starts.                                                        necessary to take the absolute value z n = xn ,
                                                                                     n = 0, N − 1 . Taking into account that time order of
4. Impulsive noise detection                                                        discrete values (1) is insignificant, it can be
                                                                                                                ()
                                                                                    transformed via y = ℜ z into an ordered statistics
   The NSI Robust parameters estimation usually                                                        y = ( y(1), N , y ( 2 ), N ,... y( N ), N ) , (8)
consists of the next processing stages [Hub09]:
   1) Data “errors” or corrupted counts detection;                                  which must be separated onto two parts with
   2) Processing of detected counts by removing them                                corresponding standard deviation values: σl, σr.
from the sample or their restoration with the help of                                  The next step includes the calculation of values of
neighbor ones;                                                                      the above mentioned criterion for each discrete rank
   3) Pure robust estimation using the restored sample.                             (v),N. (the function (7) is used here) The separation
   Here steps 1) – 2) belong to impulse noise filtering.                            threshold is simply found as argmin among all these
   The new stochastic approach based on the cluster                                 values. When the threshold is obtained, all counts that
analysis theory [Eve11] is proposed for performing                                  exceed its value are marked as defective or i. e.
impulses detection. The key idea is to apply                                        containing a noise impulse.
clusterization methods for analysis of the input sample                                On the last stage of processing the detected impulses
probability distribution and separate all counts in the                             must be replaced by interpolation or extrapolation
sample onto (two) independent clusters: the subsample                               using “good” neighbor counts. In view of assumption
with normal counts and relatively small subsample with                              on small duration of the impulses, only the simplest
impulsive noise counts. The important assumption here                               linear interpolation is used in this paper.
is that the signal amplitude is relatively small in
comparison to impulse amplitude, that allows to                                     5. Simulation results
distinguish these clusters. The similar approach was
                                                                                       The effectiveness of impulsive noise filtering was
proposed earlier by the author in application to
                                                                                    analyzed in connection to its influence on the frequency
electroencephalogram signals analysis [Pro13].
                                                                                    estimation process by the aforementioned algorithm.
   Generally, clusterization is carried out on the basis
                                                                                    The flowchart of the frequency estimation process
of some optimization criterion or an objective function.
                                                                                    including the impulse filtering one can see in Fig. 1.
Usually it is an extent of objects density inside the
                                                                                       Statistical simulations by the Monte-Carlo approach
cluster or an extent of distance between different
                                                                                    were done under the next conditions: a signal sample
clusters. Actually, most of cluster separation
                                                                                    size N=50, the sample contains one period of the signal,
procedures can be considered as exact or approximate
                                                                                    hence the normalized frequency γ = 2π / 50 ≈ 0.126 ,
algorithm of some objective function optimization and
                                                                                    SNR=20 dB, signal to impulsive noise ratio is –20 dB,
finding a threshold.
                                                                                    number of numerical simulations for each plot is
   The most widespread methods for optimal threshold
                                                                                    10000.
v calculation [Kor89] are based on the criterion of the


                                                                               42
                                                                           Figure 2: Comparison of algorithm failure
                                                                         probabilities for different filtration methods


           Figure 1: Flowchart of the frequency
                  estimation process

    Fig. 2 shows plots of dependencies of estimation
algorithm failure probability on impulsive noise
appearance probability. Three cases are regarded:
without filtering, with filtration by a classic median
filter and by using the proposed separation. From the
presented figure one can see that the appearance of the
impulsive noise with big power significantly worsen
performance of the frequency estimation algorithm.
Small decreasing of failure rate at higher noise
probability can be explained by the fact that some
impulses start to appear one after another and such a
sequence has less influence on the algorithm.
    The median filter of size 5 makes a situation better
and substantially decreases probability of failure, at low
appearance probability in particular, but its rate is still
                                                                        Figure 3: Comparison of precision indicators of
high enough. Longer filter window allows getting
additional suppression of many impulses but there must              frequency estimations for different filtration methods
be tradeoff between noise filtering and the signal
                                                                       Similarly to situation with failures, the estimation
deformation due influence of the filter. The proposed
                                                                   algorithm does not give reliable estimates, when the
separation procedure for impulse noise filtration makes
                                                                   probability of noise appearance is high. The median
failures caused by impulses action almost impossible.
                                                                   filter reduces the deviation of a mean in about two
    In addition, the comparison of the precision of
                                                                   times. The separation procedure provides a mean and a
frequency estimation was carried out in three
                                                                   standard deviation close to constant values.
aforementioned cases without any filtering, with
                                                                       The carried out statistical studies have shown that
filtration by the classic median filter and by using the
                                                                   the proposed method works well with relatively big
separation. The plots of mean and standard deviation
                                                                   impulses, when signal to impulsive noise ratio is –10
are shown in Fig. 3.


                                                              43
dB or lower. When the amplitude of impulses is low,                     USA, March 31 - April 4, 2008). ICASSP '08.
two clusters of good and corrupted counts are located                   IEEE, Las Vegas, NV, 1593–1596, 2008.
very close to each other and the polymodality of
                                                                [Sch06] S. Schulte, M. Nachtegael, V. De Witte, D.
impulse distribution vanishes. It is hard to find the
threshold in this case.                                                 Van der Weken, E. Kerre. A Fuzzy Impulse
                                                                        Noise Detection and Reduction Method. IEEE
                                                                        Transactions on Image Processing. 15(5):
6. Conclusions                                                          1153–1162, May 2006.
   The proposed impulse noise filtering procedure               [Esq02] P. Esquef, M. Karjalainen and V. Valimaki.
provides better frequency estimation quality in                         Detection of clicks in audio signals using
comparison to conventional median filter when the                       warped linear prediction. In Proceedings of
impulse to signal amplitude ratio is 10 dB or more.
                                                                        the 14th International Conference on Digital
This is caused by better sensitivity of the procedure to
appearance of big impulses in the sample. At the same                   Signal Processing (New York, USA, March
time, it provides constant estimation precision, when                   31 - April 4, 2008). ICDSP '02. IEEE, New
impulse appearance probability is not bigger than 0.15.                 York, NY, 2: 1085–1088, 2002.
This is a consequence of bigger number of points that           [Oud14] L. Oudre. Automatic detection and removal of
have to be interpolated. Therefore, linear interpolation                impulsive noise in audio signals. Image
distorts the signal structure and worsen frequency                      Processing On Line. Preprint. February 2014.
estimation at bigger appearance probabilities.
                                                                [Bas88] M. Basseville. Detecting Changes in Signals
                                                                        and Systems-A Survey. Automatica. 24(3):
Acknowledgments                                                         309–326, May 1988.
The research work reported in the paper was partly              [Pro12] I. G. Prokopenko, I. P. Omelchuk, and Y. D.
supported by the Project AComIn "Advanced                                Chyrka. Radar signal parameters estimation in
Computing for Innovation"; grant 316087, funded by                       the MTD tasks. International Journal of
the FP7 Capacity Programme.                                              Electronics and Telecommunications (JET).
                                                                         58(2): 159–164, June 2012.
References                                                      [Pro09] I. G. Prokopenko, I. P. Omelchuk and
                                                                         G. Y. Sokolov. Robustness of quasioptimal
[Bil76] J. Billingsley, R. Kinns. 1976. The acoustic
                                                                         frequency estimator to impulsive noise.
         telescope, Journal of Sound and Vibration, 48,
                                                                         Electronics and Control Systems. 20(2): 69–
         485–510, 1976.
                                                                         74, 2009. (In Ukrainian).
[Vas08] Saeed V. Vaseghi. Advanced Digital Signal
                                                                [Hub09] P. Huber, E. Ronchett. Robust Statistics – 2nd
        Processing and Noise Reduction – 4th ed. A
                                                                        ed. A John Wiley and Sons Ltd, 18–20, 2009.
        John Wiley and Sons Ltd., Singapore, 2008.
                                                                [Eve11] B. S. Everitt, S. Landau, M. Leese, D. Stahl.
[Bai13] R. M. Bai, J.G. Ih, J. Benesty. Acoustic Array
                                                                        Cluster Analysis – 5th ed. A John Wiley and
        Systems. John Wiley & Sons Singapore Pte.
                                                                        Sons Ltd., London. 2011.
        Ltd., Singapore, 2013.
                                                                [Pro13] I. G. Prokopenko, I. P. Omelchuk, and Y. D.
[Geo11] Geoffrine Judith M. C., N. Kumarasabapathy.
                                                                         Chyrka. Pat. № 84833, UA, IPC G06F7/06.
        Study and analysis of impulse noise reduction
                                                                         Method for electroencephalogram spike
        filters. Signal & Image Processing: An
                                                                         detection. Publ. date: 11.11.2013 (In
        International Journal (SIPIJ). 2(1): 82–92,
                                                                         Ukrainian).
        March 2011.
                                                                [Kor89] E. A. Korniliev, I. G. Prokopenko, V. M.
[Non08] R. C. Nongpiur. Impulse noise removal in
                                                                        Chuprin. Robust algorithms in automated
        speech using wavelets. In Proceedings of the
                                                                        systems of information processing. Technics,
        IEEE International Conference on Acoustics,
                                                                        Kyiv, 1989 (In Russian).
        Speech and Signal Processing (Las Vegas,


                                                           44

</pre>