<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A technique for detecting concealed objects in terahertz images based on information measure</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>D M Murashov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A A Morozov</string-name>
          <email>morozov@cplire.ru</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>F D Murashov</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Federal Research Center “Computer Science and Control” of RAS</institution>
          ,
          <addr-line>Vavilov str., 44-2, Moscow, Russia, 119333</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Kotel'nikov Institute of Radio Engineering and Electronics of RAS</institution>
          ,
          <addr-line>Mokhovaya, 11-7, Moscow, Russia, 125009</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Moscow Aviation Institute (National Research University)</institution>
          ,
          <addr-line>Volokolamskoe Shosse, 4, Moscow, Russia, 125993</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>269</fpage>
      <lpage>274</lpage>
      <abstract>
        <p>In this paper, a new technique for detecting concealed objects in the images acquired by a passive THz imaging system is proposed. The technique is based on a method for mutual information maximization successfully used for image matching. For reducing computational expenses, we propose to analyze the mutual information at local maxima of the crosscorrelation function computed in the Fourier domain. The proposed technique does not require parameter tuning. A computing experiment approved the efficiency of the proposed technique and the possibility of its implementation in security systems.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>One of the areas of public safety protection is the development of concealed object detection systems
in images obtained in the terahertz band (300 GHz - 3 THz). As a rule, these images are characterized
by a low signal-to-noise ratio, low resolution, low contrast, and fuzziness of objects (see Figure 1 (a)).</p>
      <p>
        One of the approaches to solving the problem for detecting concealed objects in the terahertz
images consists of image segmentation and recognition of selected objects. Many of the proposed
segmentation methods are based on the assumption that there are three areas in the terahertz image
(background, a human body, and hidden objects) characterized by their radiometric temperature range
corresponding to the ranges of gray tone levels. Gaussian mixture models are usually used to represent
the images. The authors of works [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1-3</xref>
        ] used these models as the basis for developing multi-level
segmentation algorithms. Noise suppression algorithms are applied at the preprocessing stage to
improve the segmentation quality. For example, the anisotropic diffusion algorithm and the nonlocal
means (NL-means) algorithm are used in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. In paper [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the authors developed a multi-level EM
algorithm to localize objects. At the first level, the algorithm segments two objects: the background
and the human body. At the second level, the EM algorithm highlights the hidden object over the
body. In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the author proposed an algorithm based on the maximum likelihood method for
recognizing hidden objects. In order to reduce the effects of noise and low contrast, it may be useful to
combine images taken in different spectral ranges. To increase the performance of the concealed
objects detection, the authors of work [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] combined images obtained in the terahertz and visible
spectral bands. In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the objects hidden under clothes are localized using the images obtained in four
channels of a system operating in 3- and 8-millimeter wavelengths with vertical and horizontal
polarization. A multilevel segmentation algorithm combines the k-means algorithm, the EM algorithm
for estimating the parameters of a Gaussian mixture, and the Bayesian classifier. An alternative
approach to object detecting in terahertz images is to use matching the template image of an object and
the input image. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], a method based on the localization of the maximum of the correlation
coefficient was applied to detect objects. In this paper, for detecting objects in terahertz images, we
propose to apply an image matching method based on the mutual information maximization [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Problem statement</title>
      <p>Suppose there is a set of images of objects that should be detected under the clothes of people, using a
passive terahertz camera. Objects that are visible in the image should be compared with the templates,
and it is necessary to make a decision about their identity or difference. Suppose there are N template
images of objects U n , n  1, 2,..., N . Let the image V be the input image. It is necessary to identify the
object U n located in the image V .</p>
      <p>To solve such a problem, the segmentation of the input image is usually preliminarily performed,
feature descriptions of the images of objects U n are formed, and then in the feature space, the
separating surfaces between the classes of the objects under consideration should be found.</p>
      <p>
        The properties of terahertz images listed above complicate the segmentation of objects. Therefore,
it is preferable to use methods that do not require segmentation. In this paper, we propose to solve the
object detection problem as an image matching problem [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Let the similarity measure of the template
and the objects in the image V be the criterion J (Un ,V ) . Then the problem can be formulated as
follows. It is necessary to find a template image V
similarity measure J (Un ,V ) :
which provides the maximum value of the
Um  arg max{ J (Un ,V )} , n  1, 2,..., N . (1)
      </p>
      <p>Un</p>
      <p>
        Several methods that do not require segmentation of images are known, in particular, methods
based on the maximization of the cross-correlation function values, maximization of the correlation
coefficient [
        <xref ref-type="bibr" rid="ref7 ref9">7,9</xref>
        ], and maximization of the mutual information [
        <xref ref-type="bibr" rid="ref10 ref8">8,10</xref>
        ]. These methods do not require a
priori knowledge about the relationship between the input image and the template as well as
information on the statistical characteristics of the images. However, the correlation matching of
terahertz images fails in some cases (see Figure 1). The mutual information is calculated directly from
the joint distribution of the gray levels of these images and involves the data contained in the
compared images more completely than the correlation parameters. The mutual information is a
quantitative characteristic of the statistical dependence of images. The greater similarity of the
template and the input image produces a greater value of the mutual information. In this work, the
mutual information is used as a similarity measure to solve the problem (1).
      </p>
      <p>
        To apply the information-theoretic approach, a stochastic model of the dependency between the
template image and the input image is necessary. Let the grayscale values in the compared images at
the point x with coordinates (x, y) be described as discrete random variables U (x) and V (x) with
values u and v quantized into a finite number of levels K and L respectively. If images U (x) and
V (x) represent the same scene, there is a relationship between the variables U (x) and V (x) . We
will use a model similar to that proposed in [
        <xref ref-type="bibr" rid="ref11 ref8">8,11</xref>
        ]:
      </p>
      <p>V (Tr(x))  F(U (x))  (x), (2)
where Tr is a coordinate transformation (for registered images V (Tr(x))  U (x) ); F is a grayscale
value conversion function that simulates the relationship between two images of an object;  (x) is a
random variable that simulates noise. Model (2) can be considered as a model of a discrete stochastic
information system with the input U and the output V .</p>
      <p>a) b) c)
Figure 1. The result of the correlation image matching: (a) an image of a human with a hidden object
obtained from the THERZ-7A complex; (b) the image of the template object; (c) the result of an
erroneous correlation matching. The contour of the template is shown in red.</p>
      <p>The mutual information between the system input and output is defined by the expression:
I (U ;V )  K L p(uk , vl ) log  p(uk , vl )  ,</p>
      <p>k1 l1  p(uk ) p(vl ) 
where the functions p(u) , p(v) , and p(u, v) define one-dimensional and two-dimensional discrete
probability distributions of the gray tone levels of the images U and V . If mutual information (3) is
used as a similarity measure, the problem (1) can be formulated as follows. It is necessary to find an
image of the template U n maximizing the criterion (3):</p>
      <p>Um  arg max{I (Un;V )} .</p>
      <p>Un
In the next section, an algorithm for solving the problem is proposed.
(3)
(4)</p>
    </sec>
    <sec id="sec-3">
      <title>3. Comparison of the template images with the images obtained in the terahertz spectral band</title>
      <p>
        The direct computing of the mutual information values on the set of pixels corresponding to the
intersection of the images U n and V at different positions of Un relative to V requires large
computational expenses. Therefore, in practice, methods for accelerated computing of the mutual
information magnitude are necessary. For example, in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], a differential equation was obtained for
searching for the extremum of function (3) in the image matching problem.
      </p>
      <p>
        In this paper, we propose to compute the mutual information values I (Un;V ) at the points of the
input image V corresponding to the local maxima of the cross-correlation function [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], where the
content of the input image V matches the template image Un . The number of the local maxima is
significantly less than the number of pixels in the input image V and the cross-correlation function of
the images can be rapidly computed in the frequency domain. These circumstances make it possible to
significantly speed up the computing of mutual information and, accordingly, solving the problem (4).
Since the cross-correlation function is sensitive to changes in the amplitudes of the grayscale levels in
the compared images, we will use contour preparations of the U n and V images for its computing
because the boundaries of the objects have a fairly stable high grayscale value in the contour
preparations. The cross-correlation function is determined as follows:
cUn,V (x, y)  CUn (s,t)CV (x  s, y  t) , (5)
      </p>
      <p>s t
where CUn and CV are the contour preparations of the images Un and V , x and y are the spatial
coordinates of the points of the image C , s and t are the coordinates of the points in the</p>
      <p>V
intersection area of the images CUn and C . Given the high noisiness of terahertz images, masks of</p>
      <p>
        V
stochastic gradients [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] for signal-to-noise ratio SNR = 1 will be used to obtain contour preparations.
The computing of the function (5) will be carried out in the frequency domain.
      </p>
      <p>
        Thus, the search algorithm on the terahertz images of an object corresponding to condition (4)
includes the following operations: (a) smoothing the image V and obtaining contour preparations
CU1 and CV of the images Un and V respectively; (b) calculating the cross-correlation function of
the contour preparations CU1 and CV by the formula (7); (c) finding the points of local maxima of
the cU1,V (x, y) function; (d) calculating the mutual information I (U1;V ) at the cU1,V (x, y) local
maxima points; (e) finding the maximum of mutual information Imax (U1;V ) ; (f) repeating steps (a) –
(e) for the images U1 , ..., U N ; finding the maximum of the mutual information Imax (Um;V ) among the
values of Imax (U1;V ) , Imax (U2;V ) , ..., Imax (UN ;V ) and the corresponding template image Um which
will be the solution of the problem (4). It should be noted that an operation of impulse noise filtering
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] can be applied at step (a) for input image enhancement.
      </p>
      <p>The proposed algorithm has no parameters and does not require tuning. Figure 2 illustrates the
main stages of localizing the objects. The input image captured by the THERZ-7A complex is shown
in Figure 2 (a), and the image of the template object is given in Figure 2 (b). Contour preparations of
these images are presented in Figures 2 (c) and 2 (d). Figure 2 (e) visualizes the cross-correlation
function of the contour preparations. The figure shows the local maxima of the surface formed by the
function cUn,V (x, y) . The result of localizing the object in the input image is shown in Figure 2 (f).</p>
      <p>a) b) c) d) e) f)
Figure 2. The main stages of localizing the objects: (a) image captured by the THERZ-7A complex;
(b) the image of the template object; (c) the contour preparation of the input image; (d) the contour
preparation of the template image; (e) visualized cross-correlation function of contour preparations
cUn,V (x, y) ; (f) the result of localizing the object in the input image, corresponding to the maximum
of the mutual information I (Un;V ) .</p>
    </sec>
    <sec id="sec-4">
      <title>4. Experiment</title>
      <p>To estimate the applicability of the above approach to solve the problem of localizing and identifying
concealed objects, a computational experiment was carried out. The experiment was organized as
follows. We obtained template images of objects, which should be detected in images captured at the
output of the terahertz imaging system. Template images are shown in Figure 3.</p>
      <p>We consistently placed objects under the clothes of a person and recorded image sequences using
the THERZ-7A system. Using the proposed method, we compared the template images shown in
Figure 3 (a) with frames of the obtained video sequences and fixed the values of the similarity
measure (3). The shooting range was about 1.5 meters and varied slightly, so the template images were
used with the constant value of the scale factor. The cross-correlation function and the mutual
information are not invariant with respect to rotation. Therefore, it is also necessary to analyze the
rotated and flipped variants of the template images. The results of the object localizing are shown in
Figures 4 (a-e), and the computed values of mutual information are presented in Table 1. It follows
from Figure 4 and Table 1 that concealed objects are successfully localized by the proposed method
based on a combination of correlation matching and the mutual information maximum matching.</p>
      <p>From the data in Table 1, we can draw the following conclusion. When comparing images with
concealed objects obtained at the output of the terahertz imaging system and the images of template
objects, the maximum of the mutual information is achieved if the template corresponds to the
concealed object. Consider an example. Suppose a terahertz image is obtained in which a hidden
object is visible (for example, a Walther handgun). This image is matched with the AK submachine
gun, TT handgun, and Walther handgun templates. The computed values of the mutual information
between the matched images are given in the third row of Table 1. The maximum value of the mutual
information is obtained when comparing the input image and the Walther template. The system then
makes a decision that the concealed object is a Walther handgun. From this, it follows that the
proposed method can be used to detect and recognize prohibited concealed objects in security systems.
To eliminate false alarms, the calculated maximum value of the mutual information is compared with
a predetermined threshold value. If the found maximum of the criterion (3) is less than the threshold,
then the decision is made that the prohibited object is not found. The threshold value is determined
empirically.</p>
      <p>a) b) c) d) e)
Figure 4. The result of the matching images of template objects and the images of hidden objects: (a)
TT handgun hidden under the clothes and its template; (b) hidden Walther handgun and its template;
(c) hidden AK submachine gun and TT template; (d) hidden Walther handgun and TT template;
(e) hidden AK submachine gun and its template.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>The paper proposes a new algorithm for localization and identification of objects in images taken
using the passive terahertz imaging system. The proposed algorithm combines matching methods
based on maximizing the value of the cross-correlation function and the mutual information
magnitude. To reduce computational expenses, the criterion of the mutual information maximum is
analyzed at points of local maxima of the cross-correlation function computed in the frequency
domain. The proposed algorithm does not require parameter tuning. Future research will be aimed at
improving the quality of detection and recognition of concealed objects using preprocessing of
terahertz images and applying detection methods that are invariant to geometric transformations.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Authors are grateful to the Astrohn Technology Ltd and OOO ASoft who provided us with the
THERZ-7A terahertz scanning device. This research was supported in part by the Russian Foundation
for Basic Research (grants No 18-07-01295 and No 16-29-09626).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Shen</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dietlein</surname>
            <given-names>C R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grossman</surname>
            <given-names>E</given-names>
          </string-name>
          , Popovic Z and Meyer F G 2008
          <source>IEEE transactions on Image Processing</source>
          <volume>17</volume>
          <fpage>2465</fpage>
          -
          <lpage>2475</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Lee</surname>
            <given-names>D S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yeom</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Son J Y and Kim S H 2010 Optics Express</surname>
          </string-name>
          18
          <fpage>10659</fpage>
          -
          <lpage>10667</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Martinez</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferraz</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Binefa</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomez</surname>
            <given-names>I</given-names>
          </string-name>
          and
          <string-name>
            <surname>Dorronsoro C 2010</surname>
          </string-name>
          <article-title>Concealed object detection and segmentation over millimetric waves images</article-title>
          <source>IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Workshops 31-37</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Antsiperov</surname>
            <given-names>V E</given-names>
          </string-name>
          <year>2016</year>
          <article-title>Automatic target recognition algorithm for low-count terahertz images</article-title>
          <source>Computer Optics</source>
          <volume>40</volume>
          (
          <issue>5</issue>
          )
          <fpage>746</fpage>
          -
          <lpage>751</lpage>
          DOI: 10.18287/
          <fpage>2412</fpage>
          -6179-2016-40-5-
          <fpage>746</fpage>
          -751
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Kowalski</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palka</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piszczek</surname>
            <given-names>M</given-names>
          </string-name>
          and
          <string-name>
            <surname>Szustakowski M 2013 Acta Physica Polonica</surname>
            <given-names>A</given-names>
          </string-name>
          124
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Yeom</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Son</surname>
            <given-names>J</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gushin</surname>
            <given-names>V P</given-names>
          </string-name>
          <year>2013</year>
          Progress in Electromagnetics Research 39
          <fpage>169</fpage>
          -
          <lpage>180</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Trofimov</surname>
            <given-names>V A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trofimov</surname>
            <given-names>V V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shestakov</surname>
            <given-names>I L</given-names>
          </string-name>
          and
          <string-name>
            <surname>Blednov R G 2016</surname>
          </string-name>
          <article-title>Concealed object detection using the passive thz image without its viewing Passive and Active Millimeter-Wave Imaging XIX (International Society for Optics</article-title>
          and Photonics)
          <volume>9830</volume>
          98300E
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Viola</surname>
            <given-names>P</given-names>
          </string-name>
          and
          <string-name>
            <surname>Wells III W M 1997</surname>
          </string-name>
          <article-title>International journal</article-title>
          of computer vision
          <volume>24</volume>
          <fpage>137</fpage>
          -
          <lpage>154</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Gonzalez R C and Woods R E 2008 Digital Image</surname>
          </string-name>
          <article-title>Processing (New Jersey: Pearson Education</article-title>
          , Inc)
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Maes</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Collignon</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vandermeulen</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marchal</surname>
            <given-names>G</given-names>
          </string-name>
          and
          <string-name>
            <surname>Suetens</surname>
            <given-names>P 1997</given-names>
          </string-name>
          <source>IEEE transactions on Medical Imaging</source>
          <volume>16</volume>
          <fpage>187</fpage>
          -
          <lpage>198</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Escolano</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suau</surname>
            <given-names>P</given-names>
          </string-name>
          and
          <string-name>
            <surname>Bonev</surname>
            <given-names>B 2009</given-names>
          </string-name>
          <article-title>Information theory in computer vision and pattern recognition (Springer Science</article-title>
          &amp; Business Media)
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Jain</surname>
            <given-names>A K</given-names>
          </string-name>
          <year>1989</year>
          <article-title>Fundamentals of digital image processing (Englewood Cliffs</article-title>
          , NJ: Prentice Hall)
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Chervyakov</surname>
            <given-names>N I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lyakhov</surname>
            <given-names>P A</given-names>
          </string-name>
          and
          <string-name>
            <surname>Orazaev A R 2018</surname>
          </string-name>
          <article-title>Two methods of adaptive median filtering of impulse noise in images</article-title>
          <source>Computer Optics</source>
          <volume>42</volume>
          (
          <issue>4</issue>
          )
          <fpage>667</fpage>
          -
          <lpage>678</lpage>
          DOI: 10.18287/
          <fpage>2412</fpage>
          -6179- 2018-42-4-
          <fpage>667</fpage>
          -678
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>