<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multichannel Image Segmentation Algorithm for Agricultural Crop Detection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna Denisova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrey Kuznetzov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nikolay Glumov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Samara National Research University 34</institution>
          ,
          <addr-line>Moskovskoe shosse, Samara, 443086, Russian Federation</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this article we propose an algorithm for multichannel image segmentation within the prede ned region of interest. The algorithm allows to determine whether the region of interest is homogeneous or not and it also produces a partition of the region of interest into homogeneous subsets and re nes objects borders. The proposed method has been applied to remote sensing images of agricultural elds as a preliminary step of crop detection mechanism. Segmentation quality has been estimated by means of a speci c measure proposed in this article.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        There are a lot of applications that require automatic segmentation to be
performed as initial step [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. For example, in the case of automatic crop detection on
remote sensing images the initial agricultural eld borders usually correspond to
particular farmers eld with several planted crops. To de ne areas occupied by
each crop an automatic segmentation should be performed. Then the recognition
process can be carried out as well as vegetation stages and planting conditions are
determined. Another potential use of automatic segmentation in agriculture is
acreage control. Often farmers do not trace all the processing steps for each eld
and a particular worker may slightly digress borders while sowing the plants.
So that automatic segmentation technique might be able to assist farmers to
control acreage independently using remote sensing data.
      </p>
      <p>
        Existing image segmentation methods can be regarded in the scope of
further classi cation of popular segmentation techniques given in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Edge-based
segmentation applies contour detection [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. For remote sensing images, it is
supplementary mechanism because the most important information lies in color
and texture properties of the objects. It is rather di cult to apply edge-based
segmentation to any natural object because of the complex border form.
Regionbased segmentation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] includes di erent clustering techniques on the rst step
and then produce segmentation by means of homogeneity functional
optimization. These algorithms usually exploit only texture features and they are
essentially pixel based methods. Due to the per pixel character of segmentation, it is
more probable to achieve tiny over segmented contours on the nal stage. More
reliable segmentation methods correspond to per- eld and multi-resolution
segmentation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For the rst of them, it is assumed that initial object borders
are known and they can assist the segmentation. The latter analyzes image in
di erent scales simultaneously. The key idea behind the multi-scale approach
is to take into account properties of data with di erent resolution. There are
some segmentation algorithms which use neural networks [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ],[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The common
disadvantage of neural network algorithms is huge training data set that must
represent all possible variety of data. It is a big challenge to create such data
set for any particular case. Therefore, the methods based on neural networks are
not exible in terms of novel types of data and features and could not be applied
for internal borders re nement of the particular object.
      </p>
      <p>In our article we assume that the database of objects' contours is available.
This database may be presented in the form of vector map from
geoinformation system (GIS). We explore the case of automatic analysis of internal map
object structure and its border re nement, thus we consider only the explored
object and its surroundings as input data. The aim of the segmentation in our
case is to de ne signi cant object's parts that di er from each other by their
spectral-spatial characteristics, i.e. to clarify an internal object structure.
Resulting contours should separate an initial object into meaningful parts. The outside
object border is supposed to be known from the vector map and is referred as
the region of interest (ROI). It is assumed that multiple features are available.</p>
      <p>
        As far as we suppose that it is more important to identify borders inside the
ROI rather than to re ne its contour, we have based our algorithm on a merge
approach [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. This approach includes two stages i.e. splitting the image into
numerous tiny locally homogeneous areas and their merging into more complicated
ones. As a result, the outside ROI contour will be initially presented with the
precision of tiny homogeneous regions and the inside borders will be de ned with
the precision of the merging rule.
      </p>
      <p>Managing the splitting and merging process allows our algorithm to work
adaptively with each particular contrast and texture, which is highly determined
by image resolution. Such opportunity is very important for the agricultural
elds, because usually images are very di erent not only in sense of planted crops
but also as a result of relief and image acquisition properties. Relief changes lead
to texture variations within the elds with the same crop class. And acquisition
parameters signi cantly determine particular image contrast properties.</p>
      <p>The proposed algorithm has been revised by means of segmentation quality
measure on the set of remote sensing images. Experiments have shown successful
results in most of the cases and the best pair of the algorithm parameters has
been recommended.</p>
      <p>The article includes three main parts: problem statement, algorithm
description and experimental evaluation of the method on remote sensing images. The
article ends with conclusions and acknowledgements sections.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Problem statement</title>
      <p>In our notations initial image of L features with size N1
symbol Xl (n1; n2) ; 0 n1 N1 1; 0 n2 N2 1; 0
and n2 are the coordinates of image pixel. The general segmentation problem is
formulated as optimization problem:</p>
      <p>S = fSkg ; 0
k</p>
      <p>KS
1;
= f ig ; 0
i</p>
      <p>K
1; [ Sk ;</p>
      <p>k
Q (S; ) 7! max:
(1)
(2)
where Q (S) is the segmentation quality measure, S = fSkg ; 0 k KS 1 is
a set of pixel subsets de ning the results of segmentation, KS is the number of
subsets, = f ig ; 0 i K 1 is an ideal segmentation and K is ideal
number of subsets.</p>
      <p>
        We o er the following measure segmentation quality evaluation:
Q (S; ) =
where j:::j is the number of pixels per subset. The segmentation quality measure
is based on the quantities introduced in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. It takes values from 0.5 to 1. The
lower bound is achieved when one of the values KS or K is equal to one and
the other is total number of image pixels. The upper bound of Q corresponds to
the precise segmentation.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Algorithm description</title>
      <p>Let us denote the source remote sensing image as Fm (n1; n2) ; 0 m M
1; 0 n1 N1 1; 0 n2 N2 1. Here by we describe all of the processing
steps of our algorithm including preprocessing and feature calculation.</p>
      <p>Step 1. Input data preparation.</p>
      <p>We clip initial remote sensing image within the bounding box of vector map
object to be analyzed using standard GIS instruments. The resulting piece of
the image represents the region of interest and contains all pixels corresponding
to analyzed object and its surroundings.</p>
      <p>Step 2. Feature extraction.</p>
      <p>This stage includes calculation of initial feature image Xl (n1; n2) ; 0 n1
N1 1; 0 n2 N2 1; 0 l L 1 using local averages and variations for
each pixel block with W W size in the source image. The number of features
L = 2M is two times higher than the original image channel count, because
these features are computed separately per each image component.</p>
      <p>Step 3. Feature variance normalization.</p>
      <p>To exclude the impact of di erent dynamic ranges for the computed features
we normalize each component using its global standard deviation l:
xl (n1; n2) =</p>
      <p>Xl (n1; n2) :
l
(4)
Step 4. Splitting image into tiny regions.</p>
      <p>4.1. Before splitting initial image with segment indexes I (n1; n2) ; 0 n1
N1; 0 n2 N2 is initialized by zero: I (n1; n2). Current number of segments
is set to zero S = 0 too. Further analysis is produced for the sequence of grids
de ned by shifting parameter a = 1;
4.2. For each feature vector in the grid x (n1; n2) ; (n1; n2) 2 T with step aW :
T = f(n1; n2) : n1 mod aW = 0; n2 mod aW = 0g ;
(5)
where mod is the remainder after division, the neighborhood is determined. This
neighborhood contains pixels located at distance aW per each spatial coordinate
from the current pixel. There are three possible cases. First case is when the
neighborhood pixels have di erent indexes of segments. Second case is when
only one pixel has non zero segment index s for the current neighborhood. And
the last case is when aW = 1 and all neighborhood pixels have the same index s.
If the second and third situations take place, then the condition of the similarity
between feature vectors is checked:
(x (n1; n2) ; x (n1
aW; n2
aW )) &lt; "; (n1; n2) 2 T
(6)
where is some distance metric i.e. Euclidean distance. If the condition (6) is
ful lled, the segment index I (n1; n2) is set to s. Otherwise, it is assigned to
smax + 1, where smax is the biggest segment index at the current time.</p>
      <p>In the rst case, when there are several di erent values of segment index in
the neighbourhood, current feature vector have to be compared with an average
feature vectors for each of the presented segments:
(7)
(8)
(9)
(x (n1; n2) ; E (s)) &lt; "; (n1; n2) 2 T;
s = argmins2fI(n1 aW;n2 aW )g (x (n1; n2) ; E (s))
where E (s) is an average feature vector for the segment with index s. If this
condition is ful lled for some index s, it is stored in I (n1; n2) as a current
estimate of the segment number.</p>
      <p>4.3 The grid step is decreased in twice a = a2 and the steps from 4.1 to 4.3
are repeated while the condition aW 1 is valid.</p>
      <p>Step 5. Merging.</p>
      <p>For each pair of the adjacent segments two segments are merged if their
average feature vectors are closer than the threshold value:</p>
      <p>(E (s1) ; E (s2)) &lt; "
Step 6. ROI border re nement.</p>
      <p>This step is intended to the analysis of non zero points on the segment border.
This analysis includes three steps listed below:</p>
      <p>6.1 Distance map calculation for each feature vector. This map contains the
distances between current feature vector and average feature vectors of the
segments corresponding to pixels within the local window W W .</p>
      <p>6.2 Parabolic ltering of the distance map. Each value of the distance
within the window is multiplied by the coe cients of parabolic lter with same
size W W and then the minimum is used as a result of ltration for current
window position. The coe cients of parabolic lter are de ned by following
equation:</p>
      <p>P (w1; w2) = w1 w1 + w2 w2;
(10)
where w1 and w2 take values from the range W2 ; W2 .</p>
      <p>6.3 Final index of the segment is de ned as the index of segment with the
minimum ltered distance which is also less than ".</p>
      <p>The algorithm produces an image with the segments' indexes. Step 2 can be
modi ed if it is necessary to apply any other features.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Experimental evaluation</title>
      <p>The proposed algorithm has been used for agricultural elds analysis. Input
images were acquired by sensor UK DMC for the Samara region. Image resolution
was 22 meters per pixel. Test images were obtained during the period from the
1st of April 2012 till the 15th of June.</p>
      <p>The aim of our experimental research was to de ne which parameter values
deliver the best method performance. The following groups of factors may
signi cantly a ect segmentation quality. The rst group accumulates images which
are partially clouded and with shadows from the clouds. For these images clouds
and shadows always are distinguished as di erent segments and leads to over
segmentation. This is irremovable error, therefore only cloudless images should
be used. The second group accumulated images of elds with the rough relief
structure. If there are some ditches, channels, ravines or the slope and
curvature is high, it would be di cult to make segmentation properly. The last group
includes rough-textured elds. Usually, non hybrid eld has the rough texture
of its surface because of the agricultural management events. In this case there
can be regular lines or spots on the images inside the borders of the eld. If the
texture elements are enough big, they may lead to over segmentation too. Thus,
the results of the algorithm are highly dependent on input data and can not be
applied without calibration.</p>
      <p>For the agricultural eld border monitoring we have used infrared, red and
green spectral channels. The features were local averages and variances for each
spectral channel.</p>
      <p>We have used a sample set of 120 images for the typical problem cases listed
above (30 images per category) and 30 images with simple non hybrid structure
(as an example of the case of good conditions for the algorithm). The ideal
segmentation has been done manually. The examples of test images are presented
in g. 1.</p>
      <p>The quality of segmentation for each image has been estimated by means
of Q criterion described above. We have tested algorithm parameters for the
a)
b)
c)
d)</p>
      <p>To inspect the relationship between each category and quality of
segmentation we have assigned the index g to each of the four image categories: g = 1 for
hybrid eld, g = 2 for non hybrid eld with complex relief, g = 3 for simple non
hybrid eld, g = 4 for textured hybrid eld. Image index in the particular group
is denoted as j and there are J = 30 images per each category.</p>
      <p>For all pairs of parameters W and " an averages and variances of Q per group
and for the whole image set have been calculated using formulas:
mQg (W; ") =</p>
      <p>Qgj (W; ") ;</p>
      <p>J
1 X
J</p>
      <p>j=1
v</p>
      <p>J
Qg (W; ") = tuu J1 X (Qgj (W; ")
j=1</p>
      <p>mQg (W; "))2;
mQ (W; ") =</p>
      <p>For each category we have selected an optimal pair of parameters Poptg as a
pair that ful ll the following condition:</p>
      <p>Poptg = f(W; ") : mQg (W; ")
mg
gg ;
where
mg = max(W;") [mQg (W; ")],
g = max(W;")2Pg [ Qg (W; ")],
(11)
(12)
(13)
(14)
(15)
Pg = f(W; ") : mQg (W; ") = mgg.</p>
      <p>The table of mg and g values for each category is shown below.
Category g
hybrid eld 1
non hybrid eld with complex relief 2
simple non hybrid eld 3
textured hybrid eld 4</p>
      <p>The optimal pairs of the parameters for the whole image set have been found
as intersection of optimal parameters sets for each category Poptg .</p>
      <p>The table 2 contains optimal for the whole sample set parameters and
average segmentation quality measure mQ (W; ") for them. mQ1 (W; ") is an average
value of the segmentation quality measure only for the hybrid eld samples.
Fig. 2. Segmentation results with the best pair of parameters W = 5 and " = 0:6 for
the images from the g. 1</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this article, we have proposed an algorithm that helps to make a segmentation
of particular objects. It is based on two steps: excess segmentation and
merging. The quality of segmentation depends on the similarity threshold and initial
degree of over segmentation that is controlled by the window size. Algorithm
has been tested on agricultural eld images and the best pair of parameters
have been selected. The control parameters of the algorithm allow to use it for
another kind of data with di erent resolution and contrast properties, but the
calibration must be carried out in advance.</p>
      <p>Acknowledgements This study was nancially supported by RFBR projects
16-37-00043 mola, 16-29-09494 o m.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Narkhede</surname>
            ,
            <given-names>H.P.</given-names>
          </string-name>
          :
          <article-title>Review of image segmentation techniques</article-title>
          .
          <source>International Journal of Science and Modern Engineering</source>
          .
          <volume>1</volume>
          (
          <issue>8</issue>
          ),
          <volume>54</volume>
          {
          <fpage>61</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Blaschke</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et al.:
          <article-title>Object-oriented image processing in an integrated GIS remote sensing environment and perspectives for environmental applications</article-title>
          .
          <article-title>Environmental information for planning, politics and the public</article-title>
          .
          <volume>2</volume>
          ,
          <issue>555</issue>
          {
          <fpage>570</fpage>
          . (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rydberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgefors</surname>
          </string-name>
          , G.:
          <article-title>Integrated method for boundary delineation of agricultural elds in multispectral satellite images</article-title>
          .
          <source>IEEE Transactions on Geoscience and Remote Sensing</source>
          .
          <volume>39</volume>
          (
          <issue>11</issue>
          ),
          <volume>2514</volume>
          {
          <fpage>2520</fpage>
          . (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mueller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Segl</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaufmann</surname>
          </string-name>
          , H.:
          <article-title>Edge-and region-based segmentation technique for the extraction of large, man-made objects in high-resolution satellite imagery</article-title>
          .
          <source>Pattern recognition</source>
          .
          <volume>37</volume>
          (
          <issue>8</issue>
          ),
          <volume>1619</volume>
          {
          <fpage>1628</fpage>
          . (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Blaschke</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burnett</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pekkarinen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Image segmentation methods for objectbased analysis and classi cation. Remote sensing image analysis: Including the spatial domain</article-title>
          .
          <volume>211</volume>
          {
          <fpage>236</fpage>
          . (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ganguly</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mukhopadhyay</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>DeepSat A Learning framework for Satellite Imagery</article-title>
          .
          <source>Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems</source>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , L.-C.,
          <string-name>
            <surname>Papandreou</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kokkinos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuille</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          :
          <article-title>DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs</article-title>
          .
          <source>Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems</source>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sonka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hlavac</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyle</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Image processing, analysis, and machine vision</article-title>
          .
          <source>IEEE Transactions on Geoscience and Remote Sensing. Cengage Learning</source>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Weidner</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Contribution to the assessment of segmentation quality for remote sensing applications</article-title>
          .
          <source>International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences</source>
          .
          <volume>37</volume>
          (
          <issue>B7</issue>
          ),
          <volume>479</volume>
          {
          <fpage>484</fpage>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>