<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Robust object detection in images corrupted by impulse noise</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Polotsk State University</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Blokhina st.</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Novopolotsk</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Republic of Belarus</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>r.bogush</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>adamovskiy.y }@pdu.by</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Belarusian State University</institution>
          ,
          <addr-line>Nezavisimosti avenue 4, Minsk</addr-line>
          ,
          <country>Republic of Belarus</country>
        </aff>
      </contrib-group>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This paper proposes two effective normalized similarity functions for robust object detection in very high density impulse noisy images. These functions form an integral similarity estimate based on relations of minimum by maximum values for all pairs of analyzed image features. To provide invariance under the constant brightness changes, zero-mean additive modification is used. We explore properties of our functions and compare them with other commonly used for object detection in images corrupted by impulse noise. The efficiency of our approach is illustrated and confirmed by experimental results.</p>
      </abstract>
      <kwd-group>
        <kwd>similarity functions</kwd>
        <kwd>object detection</kwd>
        <kwd>impulse noise</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Impulse noise is frequently encountered in digital image and video due to
transmission errors, defective pixels in camera sensors, faulty memory locations, and timing
errors during the conversion [1, 2]. An important property for this noise type is that
the corrupted pixels can take only the maximum and minimum values from dynamic
range and that only part of the pixels is corrupted, and the rest are noise-free. Noise
reduction increases the computational cost and it does not allow to get clear image for
high noise level. Therefore, it is important to use image processing noise reducing
methods for solving the mentioned tasks.</p>
      <p>Object detection and precise definition of object location in images and videos are
widely used for many applied tasks: industrial inspection and medical diagnostics,
stereo vision, reference marks detection and localization in space images of the earth's
surface, tracking targets of airborne radar stations, etc [3, 4]. Absolute and relative
object position in image or video can be determined after their localization. Camera
calibration, image georeferencing, formation of stereo images by computer methods
and other applications contain similar tasks. Therefore, many scientific works are
devoted to the development of methods for detecting and localizing objects in images
and video. Generally, many approaches calculate the similarity of reference object
and an input image and compared with a threshold value.</p>
      <p>Unluckily, there is no a single similarity function that works very well for various
images and for all tasks. Different tasks require different measure properties.
Therefore, traditional similarity metrics are improved [5, 6] or new functions are provided
[7, 8, 9] considering expansion of applications of image processing, and different
similarity functions are selected for various applications [10, 11]. For example, image
detection in large databases requires high speed and low resistance to deformation of
the image, but people tracking in video desires robustness to various types of noise
and high accuracy of comparing selected features.</p>
      <p>Even in solving only one task uncertainty may also occur. For example, to
correctly identity tracked people in a current frame, a maximum accuracy of the
estimation for people feature similarity with previous frames is required [12]. However, high
accuracy for high density noise or overlapping parts of people by objects will lead to
loss their index. It means that a person from previous frames will be identified as
another person who has entered into the surveillance area and displayed on the current
frame. This problem is also related to the fact that the number of possible grayscale
and color images of the same size is huge, their correlation characteristics in practice
are not perfect. This leads to false identification or inaccurate object localization when
detection is performed in presence of various types of noise components in video or
image.</p>
      <p>In this paper, we present a new effective function similarity for image and video
corrupted by impulsive noise. These functions form an integral normalized similarity
estimate based on sequential division of minimum by maximum values for all pairs of
features. In Section 2, comprehensive analysis of main similarity functions used in
image processing is presented. Section 3 presents our normalized similarity functions
and computational costs for them. Section 4 presents experimental results. Finally, the
conclusion and feature work details are provided in Section 5.
2</p>
      <p>Main similarity functions for image processing
If two images O and B are identical, then value of the normalized metric
M (O, B)  0 , and the value of the normalized similarity function S (O, B)  1 . For
another extreme case, when the differences between the images are maximized, the
normalized metric M (O, B)  1 , and the similarity function usually S (O, B)  0 or
S (O, B)  1 , if the mean value of the brightness levels is not taken into account.</p>
      <p>To compare two images O  oij  and B  bij , N  N size, several similarity
functions are often used [13]. It is known that linear relation between brightness of
images allows to efficiently apply the correlation coefficient for image corrupted by
additive noise [14]. The value of the normalized cross-correlation function (NCC)
varies from 0 to 1 and is calculated as:</p>
      <p>S NCC </p>
      <p>N N
  oij bij
i1 j1
N N N N
  oij 2   bij 2
i1 j1 i1 j1</p>
      <p>Similarity function based on the Euclidean distance (ED) is characterized by a
lower computational cost and is defined as:</p>
      <p>The normalization embodied into the zero-mean normalized cross-correlation
function (ZNCC) allows one to tolerate linear brightness variations. Also, due to the
subtraction of the local mean, the ZNCC provides better robustness than the NCC
since it tolerates uniform brightness variations as well. In this case, ZNCC takes a
value of (-1) to (+1) and is calculated by formula:</p>
      <p>S ZNCC </p>
      <p>N N
  oij  o bij  b 
i1 j1
N N
  oij  o 2
i1 j1</p>
      <p>
        N N
  bij  b 
i1 j1
2
where o and b – mean values for images O and B (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ):
o 
1
N 2
      </p>
      <p>N 1 N 1
  oij , b 
i0 j0
1
N 2</p>
      <p>
        N 1 N 1
  bij
i0 j0
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
S ED  1 
      </p>
      <p>N 1 N 1
  oij  bij 2
i0 j0
S SSD  1 
1 N N</p>
      <p>  oij  bij 2</p>
      <p>LN 2 i1 j1</p>
      <p>Sum of squared differences (SSD) function is robust to Gaussian noise [15] and is
calculated as:
where L – of valid brightness values range.</p>
      <p>The function based on the weighted sum of squared differences (SSDW) is more
stable to linear distortion of levels of analyzed characteristics compared to previous
functions:
i1 j1</p>
      <p>N N
  oij  bij 2
i1 j1
N N N N
  oij 2   bij 2
i1 j1</p>
      <p>The Hausdorff metric-based function (Hd) can only be used for low noise, as
increasing noise level quickly reduces the estimation accuracy. This function is
determined by the formula:</p>
      <p>S Hd  1  1</p>
      <p>L</p>
      <p>maxij oij  bij</p>
      <p>In general case, to detect an object in an image, the similarity value is calculated
for features of all subimages and object. The decision on object presence is made
based on the comparison of obtained values with a threshold. If the condition is
satisfied, decision about correspondence on subimage and object is made. For accurate
localization similarity function value must exceed threshold only if the object is
correctly positioned. However, ensuring the minimum probability of missing object
(false-negative error) requires a lower threshold level, but this leads to an increase in
the number of subimages, including near the correct location of the object in the
image that does not correspond to it, known as false-positive error.
3</p>
      <p>New normalized similarity functions
The proposed two similarity functions use relationship calculation between the
minimum and maximum values for all pairs of analyzed features. Summation of the
calculated values is used to obtain an integral normalized value that characterizes similarity
of two images. Relationship calculation between descriptors will better emphasize
local differences compared to subtraction. High resistance to noise is achieved
through the use of summation when obtaining a complex normalized value. Minimum
or maximum attribute is necessary to determine when searching a relationship
between them, therefore, the proposed functions are called normalized minimax
similarity functions.</p>
      <p>To estimate image similarity of object O  oij , N  N size, and image B  bij ,</p>
    </sec>
    <sec id="sec-2">
      <title>N  N size, functions are described as:</title>
      <p>• normalized minimax additive similarity p-function (MMADDP):</p>
      <p>S MMADDP 
1 N N minoipj , bipj , p  Z , p  1</p>
      <p>
        NN i1 j1 maxoipj , bipj 
• normalized zero-mean minimax additive similarity p-function (ZMMADDP) :
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        )
(
        <xref ref-type="bibr" rid="ref7">7</xref>
        )
(
        <xref ref-type="bibr" rid="ref8">8</xref>
        )
 2
 N1N iN1 jN1  obiijj  bo  , if oij  o  bij  b

S ZMMADDP  
S (O, B)  1,  O  B;
S (O, B)  S (B,O);
S (O, B)  0,  oij  0,bij  L
(
        <xref ref-type="bibr" rid="ref9">9</xref>
        )
(
        <xref ref-type="bibr" rid="ref10">10</xref>
        )
For both proposed minimax functions, the following basic properties are obvious:
Presented functions are universal, as MMADDp returns a normalized value from
(0) to (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) and ZMMADDp returns a value from (-1) to (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) for randomly selected
features of a pair of images. Table 1 shows a comparison of computational costs for
similarity functions for images with size N  N pixels.
      </p>
      <p>Thus, in comparison with function of normalized correlation, the offered minimax
function provides reduction of calculation complexity, as minimum twice. As result,
minimax similarity functions can be used for search of objects in a static image and
for detecting moving objects in video sequences.</p>
      <p>Experimental results</p>
      <p>Parameter determination</p>
      <p>It is proposed to use the following parameters for the similarity functions
analytical assessment when detecting objects in image:
─ function value calculated for object and subimage ( A – main peak). Value should
aim to 1;
─ main peak variance ( DA ). The value should aim to zero, meaning smaller value
deviations A from expected value;
─ function maximum value from all side peak ( SL ) can be used to determine
threshold T , which should be less than T .
─ maximum side peak variance ( DSL ) should also be as less as possible;
─ side peak number at levels higher than 0.95 ( N SL ). The parameter can be used to
evaluate the possible false-positive detection results number if the threshold value
is incorrectly selected.</p>
      <p>Research software was developed using the MatLab package for experimental
determination of normalized minimax similarity functions qualitative characteristics.
The software is based on an object detection algorithm in a grayscale image using
pixel brightness as features. It includes the following steps:
1. Selecting a subimage Bkl (k  0..M  m,l  0..N  n) , m  n size, from the
upper left of the bitmap image;
2. Similarity function calculation for a reference object O of size m  n and
the selected subimage Bkl :
where F is the mathematical transformation operator;
3. Deciding on object availability by the rule:</p>
      <p>S  F (O, Bkl )
if

else</p>
      <p>S  T then</p>
      <p>Bkl  O</p>
      <p>
        Bkl  O
(
        <xref ref-type="bibr" rid="ref11">11</xref>
        )
(
        <xref ref-type="bibr" rid="ref12">12</xref>
        )
where T is the threshold;
4. If the shift number is less than (M  m)  (N  n) , move right or down one
pixel and go to step 1, otherwise the search is finished. The analyzed total
number of fragments is defined as (M  m  1)  (N  n  1) .
      </p>
      <p>All parameters were calculated for 20 different images without noise and
distortion of size 150 × 150. For each of them 20 reference objects of 15×15 size were used
with obtained values averaging. Values A  1 and DA  1 are obtained for all
similarity functions.</p>
      <p>The resulting values of the remaining parameters are given in Table 2. Analysis of
the Table 2 shows that the proposed ZMMADDp function has the best characteristics
for accurately determining object coordinates in images without noise.</p>
    </sec>
    <sec id="sec-3">
      <title>SSDV Hd</title>
      <p>
        MMADDp (p=2)
ZMMADDp (p=1)
Impulse noise is one of the interference types and it appears as random white or black
dots in an image, i.e. corrupted pixels take a maximum or minimum valid value, for
example, 255 and 0 for 8-bit images. Interference with an amplitude value greater
than the useful signal dynamic range occurs, for example, during a fast transient, and
is the cause of the appearance of impulse noise in the image. The impulse noise
probability density function of a random variable z is given by:
Pa

p(z)  Pc
for z  a
for z  c
else 0
(
        <xref ref-type="bibr" rid="ref13">13</xref>
        )
      </p>
      <p>A pixel with a brightness value of c  a looks like a white point when c (?), and
a pixel with a brightness value of c looks like a black point in image. Почитай.
Здесь совсем непонятно.</p>
      <p>Impulse noise is a combination of white and black points (“salt and pepper”).
Impulse noise can be characterized by the noise density  , which determines the
percentage of the corrupted pixels n to a total number, i.e. the probability of distortion
for each pixel. For impulse noise,  values [0.1, 0.3, 0.5, 0.8, 0.95] was chosen.
Objects for detection in Fig. 1. are presented.</p>
      <p>The difference values between similarity function main and maximum side peaks
show an assessment completeness and comparison of similarity functions in terms of
resistance to various noise types (Table 3-4).</p>
      <p>The ZMMADDp (  =1) provides invariance under the constant change in
brightness. Object detection is possible for this function at  = 0.3 when the image is
distorted by single-level impulse noise and  = 0.5 for salt and pepper noise.
5</p>
      <p>Conclusion</p>
      <p>
        Normalized similarity functions for image evaluation are proposed. They are
characterized by satisfactory computational costs and can improve the detection and
object localization in image and video corrupted by impulse noise. Proposed functions
are universal, they return a normalized value from (0) to (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) for randomly selected
features. Similarity function resistant to a linear change of analyzed feature values of
the compared images is proposed that returns a normalized value from (-1) to (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ).
Experiments were performed to compare these functions with the known ones for
images corrupted by single-level impulse and salt and pepper noise. Pixel brightness
values are used as image features for experiments. The obtained results have
confirmed efficiency of presented similarity functions.
6
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Gonzalez</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woods</surname>
            <given-names>R.</given-names>
          </string-name>
          :
          <source>Digital Image Processing, 3rd Edition</source>
          . Prentice Hall (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Pitas</surname>
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venetsanopoulos</surname>
            <given-names>A.</given-names>
          </string-name>
          :
          <source>Order statistics in digital image processing Proceedings of the IEEE</source>
          <volume>80</volume>
          (
          <issue>12</issue>
          ) pp
          <fpage>1893</fpage>
          -
          <lpage>1921</lpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rezaei</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fränti</surname>
            <given-names>P.</given-names>
          </string-name>
          :
          <source>Set Matching Measures for External Cluster Validity Transactions on Knowledge and Data Engineering</source>
          <volume>28</volume>
          (
          <issue>8</issue>
          ) pp
          <fpage>2173</fpage>
          -
          <lpage>86</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ges</surname>
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Starovoitov</surname>
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Distance-based functions for image comparison Pattern Recogn</article-title>
          .
          <source>Lett</source>
          .
          <volume>20</volume>
          <fpage>207</fpage>
          -
          <lpage>14</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Wang</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <source>Feng J. On the Euclidean Distance of Images IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>27</volume>
          (
          <issue>8</issue>
          ) pp
          <fpage>1334</fpage>
          -
          <lpage>39</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Nakhmani</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tannenbaum</surname>
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>New Distance Measure Based on Generalized Image Normalized Cross-Correlation for Robust Video Tracking and Image Recognition Pattern Recogn</article-title>
          .
          <source>Lett</source>
          .
          <volume>34</volume>
          (
          <issue>3</issue>
          ) pp
          <fpage>315</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Zagoruiko</surname>
            <given-names>N. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borisova</surname>
            <given-names>I. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dyubanov</surname>
            <given-names>V. V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kutnenko</surname>
            <given-names>O. A.</given-names>
          </string-name>
          :
          <article-title>A quantitative measure of compactness and similarity in a competitive space J</article-title>
          . Appl. Industr. Math.
          <volume>5</volume>
          (
          <issue>1</issue>
          ) pp
          <fpage>144</fpage>
          -
          <lpage>54</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lv</surname>
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>A novel similarity measure for matching local image descriptors IEEE Access 6</article-title>
          pp
          <fpage>55315</fpage>
          -
          <lpage>25</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kim</surname>
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Hyun</given-names>
            <surname>Ch</surname>
          </string-name>
          .,
          <string-name>
            <surname>Han</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Evaluation of Matching Costs for High-Quality SeaIce Surface Reconstruction from</article-title>
          Aerial
          <source>Images Remote Sensing</source>
          <volume>11</volume>
          (
          <issue>9</issue>
          ) pp
          <fpage>1055</fpage>
          -
          <lpage>72</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Zitova</surname>
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flusser</surname>
            <given-names>J</given-names>
          </string-name>
          .:
          <article-title>Image registration methods: a survey Image and</article-title>
          vision computing
          <volume>21</volume>
          (
          <issue>11</issue>
          ) pp
          <fpage>977</fpage>
          -
          <lpage>1000</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Cha</surname>
            <given-names>S. H.</given-names>
          </string-name>
          :
          <article-title>Comprehensive survey on distance/similarity measures between probability density functions Int</article-title>
          .
          <source>J. of Mathematical Models and Methods in Applied Sciences</source>
          <volume>1</volume>
          (
          <issue>4</issue>
          ) p
          <fpage>300</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Bohush</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <source>Zakharava I.: Robust Person Tracking Algorithm Based on Convolutional Neural Network for Indoor Video Surveillance Communications in Computer and Information Science</source>
          <volume>1055</volume>
          pp
          <fpage>289</fpage>
          -
          <lpage>300</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Goshtasby</surname>
            <given-names>A. A.</given-names>
          </string-name>
          :
          <article-title>Similarity and dissimilarity measures in Image Registration Principles, Tools and Methods. Advances in Computer Vision</article-title>
          and Pattern
          <string-name>
            <surname>Recognition</surname>
          </string-name>
          (London) pp
          <fpage>7</fpage>
          -
          <lpage>66</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Voronov</surname>
            <given-names>S. V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tashlinskii</surname>
            <given-names>A. G.</given-names>
          </string-name>
          :
          <article-title>Efficiency analysis of information theoretic measures in image registration Pattern Recognit</article-title>
          .
          <source>Image Anal</source>
          .
          <volume>26</volume>
          <fpage>502</fpage>
          -
          <lpage>5</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Voronov</surname>
            <given-names>S. V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tashlinskii</surname>
            <given-names>A. G.</given-names>
          </string-name>
          :
          <article-title>Analysis of objective functions in the problem of estimating mutual geometric deformations of images Pattern Recognit</article-title>
          .
          <source>Image Anal</source>
          .
          <volume>24</volume>
          (
          <issue>4</issue>
          )
          <fpage>575</fpage>
          -
          <lpage>582</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>