<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>CEUR Workshop Proceedings</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.18287/1613-0073-2016-1638-419-427</article-id>
      <title-group>
        <article-title>CROPS IDENTIFICATION BY USING SATELLITE IMAGES AND ALGORITHM FOR CALCULATING ESTIMATES</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>N.S. Vorobiova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Samara National Research University, Samara, Russia Image Processing Systems Institute - Branch of the Federal Scientific Research Centre “Crystallography and Photonics” of Russian Academy of Sciences</institution>
          ,
          <addr-line>Samara</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <volume>1638</volume>
      <fpage>419</fpage>
      <lpage>427</lpage>
      <abstract>
        <p>The paper proposes a crop identification method based on the algorithm for calculating estimates using satellites images. The classification features are the set of time series values, built by the sequence of satellite images, and geographic coordinate of field - latitude. Method was tested by using Terra/MODIS images and ground-based information. The comparison of the classification quality of proposed method with classification quality of the classifier based on Mahalanobis Distance is given.</p>
      </abstract>
      <kwd-group>
        <kwd>time series</kwd>
        <kwd>vegetation index</kwd>
        <kwd>NDVI</kwd>
        <kwd>satellite images</kwd>
        <kwd>crops identification</kwd>
        <kwd>crops recognition</kwd>
        <kwd>algorithm for calculating estimates</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Currently, management in agricultural sector is an area of active development and
implementation of methods that apply remotely sensed data (hereinafter - RSD) to
solve production tasks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. One of such tasks is the crop identification by using
satellite images. The solution of this task is very important for such events:
 real-time monitoring;
 control of crop lands usage,
 verification of information provided by farmers about crops seeded on fields;
 mapping land usage in areas with no information from the farmers about crops
seeded on fields.
      </p>
      <p>
        The literature suggests a large number of methods to solve crop recognition task by
using satellite images [
        <xref ref-type="bibr" rid="ref2 ref3 ref4 ref5 ref6">2-6</xref>
        ]. The most important methods are those methods which are
suitable for usage over large areas (regional scale and more). Territory of large area,
in contrast to the farm or group of farms, is difficult to explore by land, and remotely
sensed data provide this opportunity [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ].
      </p>
      <p>In this paper crop recognition method based on algorithm for calculating estimates is
proposed. This method is suitable for usage in areas of regional scale. The method
allows to take into account the geographical position of the field and to use the time
series with gaps in observation days. Gaps occur due to the presence of clouds on
satellite images at a certain day of observation. The ability to use data with gaps does
not require an additional procedure of time series interpolation. The proposed
detection method is based on an evaluation of the proximity between classified and
learning objects. The quality of the proposed method is compared with the quality of the
classifier based on the Mahalanobis distance.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Input data and preprocessing</title>
      <p>All data for research were obtained for the Samara Region. The paper uses satellite
images and ground data about fields for the year 2014. The description of input data
and preprocessing methods is given below.
2.1</p>
      <sec id="sec-2-1">
        <title>Satellite images</title>
        <p>
          Data from satellite Terra/MODIS are used to construct time series. The raw data
recorded on the satellite are passed through ground processing and are saved in the form
of so-called products [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. The product is the result of some processing run on the raw
data or on data already processed to a certain level. In this paper red and near-infrared
channels of daily MOD09GQ product are used to construct time series. The product
MOD09GQ has passed through radiometric and atmospheric correction, is
georeferenced, and is a minimum level of combination of daily 250 m data.
Preprocessing of MOD09GQ products (for each day) includes the following steps:
1. Mosaic composing from two tiles (h20v03, h21v03) which cover the territory of
the Samara region. Tile is the cell of MODIS sinusoidal cell.
2. Pixel-by-pixel synchronization with the images for previous days: setting of
uniform map projection and resolution; setting of uniform horizontal and vertical size;
setting a uniform binding of the corner points.
3. Formation of cloud mask.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Ground data</title>
        <p>
          Ground data about fields is provided by farmers and are necessary for training the
classifier and evaluation of its quality. The following ground data is provided for each
field:
 the boundary and the square of the field;
 crops seeded on the field or type of field usage such as fallows or unused lands.
The boundaries of the fields were superimposed on a cover composed by satellite
images of medium resolution (20-30 m) in order to verify the homogeneity of fields.
The heterogeneity is caused by the following factors: the presence of several crops or
types of usage on the field, the use of only part of the field for sowing, sprouts
heterogeneity. All identified inhomogeneous fields were divided into the corresponding
number of homogeneous regions. The study of homogeneity was conducted using a
segmentation method [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. As a result, new boundaries of plots were obtained. New
plots boundaries are used for the time series calculation and crops identification.
In this study, all crops and types of field usage have been divided into the following
groups: perennial grasses, unused land, winter crops, fallows, early spring crops, late
spring crops. Such division into classes will be used further in the classification
algorithm based on calculating estimates.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Time series</title>
        <p>Normalized difference vegetation index NDVI was selected for the construction of
time series. Value of index is calculated according to the formula:
NDVI  NIR  RED ,
NIR  RED
(1)
channels  NIR ,  RED ;
ing to the formula (1).
where  NIR ,  RED – values of reflected radiation in the near infrared and the red
regions of the spectrum, respectively.</p>
        <p>Terra/MODIS images received during the period from March 1 to September 30 for
year 2014 were used to construct the NDVI time series. An index value for each plot
is calculated as follows:
 calculation of the values averaged by all pixels of the plot for red and near-infrared
 calculation of the NDVI value on the plot by averaged values NIR ,RED
accordNDVI values are sorted by the date when the image was taken. Sorted NDVI values
form NDVI time series of the plot.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Crop identification method</title>
      <p>
        To detect crops it is offered to use the classification method based on the algorithm
for calculating estimates (hereinafter – ACE). Class of recognition algorithms based
on the calculating estimates was proposed by Zhuravlev YI [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. ACE is not a
predetermined algorithm but a model of recognition algorithm. And in each case the model
of recognition algorithm must be specified. A description of the proposed method for
crop identification, which is a specification of ACE model, is given below.
      </p>
      <sec id="sec-3-1">
        <title>Model of ACE for crop identification</title>
        <p>The input data for the algorithm is a set of reference objects and recognizable objects.
All objects are characterized by the following features: time series values and
geographical coordinate – latitude. The task is to classify a set of recognizable objects in
the predefined classes. A priori information is given in a table 1.
– the value of the time series of the object  p in a day n ; N – the total number of
observation days. Both reference objects and recognizable objects can have gaps in
the values of the time series. Specifying ACE model assumes setting the following
sub-paragraphs.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>System of support feature sets</title>
        <p>In this method system of support feature sets consists of a single set comprising all the
features.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Proximity function</title>
        <p>ed as follows:
  ek11k 2  ,
The proximity  of the recognized object a and the reference object  p is
calculat(2)
where 1 – the value that characterizes proximity of two objects by the time series
values,  2 – the value that characterizes proximity of two objects by latitude, k –
parameter determining the weight 1 and  2 in the final value of proximity. Values
1 and  2 are calculated by the formulas (3) and (4), respectively.</p>
        <p>N 1
1    p,n  an 2</p>
        <p>n0
2   p  a
(3)
(4)
(5)
(6)
The following conventions are used in the formulas (3) and (4): an , n  0, N  1 and
 a – the features of object a : set of time series values and latitude, respectively.
Value 1 is calculated only for the days n on which both the object  p and the
object a have the values of time series  p,n and an , respectively.</p>
        <p>The value of proximity function f p , a between reference object  p and
recognized object a is calculated as follows:
f  p , a  1,   T ,</p>
        <p> 0,   T
where T – proximity threshold.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Estimate of proximity for a class.</title>
        <p>Estimate of proximity  j  of object a to a class  j is calculated so:
 j  </p>
        <p> f  p , a
 p  j
3.5</p>
      </sec>
      <sec id="sec-3-5">
        <title>Decision rule</title>
        <p>1) c  arg max</p>
        <p>m0, M 1
ments in a class  m .</p>
        <p>Classification of recognized object a will be done in class c according to the
decision rule. We define two variants of decision rules:</p>
        <p>m  ; 2) c  arg mm0a,Mx1 qmm  , where qm – the number of
ele</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experiments</title>
      <sec id="sec-4-1">
        <title>Set of objects for testing</title>
        <p>Testing sample of 6424 plots has been formed to assess the quality of the proposed
classifier. Quality evaluation has been conducted using cross-validation. Testing
sample was divided five times into training and control sample in the ratio of 2: 1.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Classification by using algorithm for calculating estimates</title>
        <p>
          Procedure of searching the parameters k and T has been executed for each option of
the decision rule. Searching procedure allows finding such parameters k and T on
which the best classification results are achieved. Search has been made by a brute
force method with a step of 0.01 for parameter k and 0.001 for parameter T . The
best results of classification for the decision rule 1 are achieved when k  0.98 and
T  0.9945 . For decision rule 2 best results are achieved when k  0.97 and
T  0.9945 . The results of crop identification using the ACE are shown in Tables 2
and 3.
To assess the quality of the proposed classifier based on ACE the classification of the
test sample according to Mahalanobis distance [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] has been made using
crossvalidation scheme mentioned above. The results of classification from class to class
are given in Table 4. The total probability of correct classification is 0.64.
Decision rule 2, k  0.98 , T  0.9945 , Q = 0.72
perennial unused winter fallows early
grasses lands crops spring
crops
0.40 0.42 0.08 0.03 0.01
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>The paper proposes a method of crop identification based on the algorithm for
calculating estimates. The advantages of the proposed algorithm: the usage of time series
with gaps, accounting of the geographical position of the field. As seen from the
results of classification, ACE method gives the total probability of correct classification
(Q = 0.72) higher than the classifier by Mahalanobis Distance (0.64). However, the
value 0.72 is not satisfactory, and improvement of the quality of the classification is
required. A detailed study of the testing sample showed that some of the data are
unreliable: incorrect crops for some fields are given as well as the division of crops into
classes was carried out nonoptimally. Therefore, the direction of future research is to
study the issue of division crops into groups, so that the best classification quality is
achieved at the highest possible splitting crops into classes.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>This work was financially supported by RFBR, project № 16-37-00043_mol_a
«Development of methods of using data from geoinformation systems in remote sensing
data processing» and project № 16-29-09494_ofi_m «Methods of computer
processing of multispectral remote sensing data for vegetation areas detection in special
forensics».</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Agriculture</surname>
          </string-name>
          . URL: http://sovzond.ru/industry-solutions/agriculture/ [in Russian]
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Wardlow</surname>
            <given-names>BD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egbert</surname>
            <given-names>SL</given-names>
          </string-name>
          .
          <article-title>Large-area crop mapping using time-series MODIS 250 m NDVI data: An assessment for the U.S. Central Great Plains</article-title>
          .
          <source>Remote sensing of environment</source>
          ,
          <year>2008</year>
          ;
          <volume>112</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1096</fpage>
          -
          <lpage>1116</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Wardlow</given-names>
            <surname>BD</surname>
          </string-name>
          .
          <article-title>An Evaluation of Time-Series MODIS 250-meter Vegetation Index Data for Crop Mapping in the U.S. Central Great Plains</article-title>
          . Michigan: ProQuest Information and Learning Company,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Brown</surname>
            <given-names>JC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kastens</surname>
            <given-names>JH</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coutinho</surname>
            <given-names>AC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Victoria</surname>
            <given-names>DC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bishop</surname>
            <given-names>CR</given-names>
          </string-name>
          .
          <article-title>Classifying multiyear agricultural land use data from Mato Grosso using time-series MODIS vegetation index data</article-title>
          .
          <source>Remote Sensing of Environment</source>
          ,
          <year>2013</year>
          ;
          <volume>130</volume>
          :
          <fpage>39</fpage>
          -
          <lpage>50</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Plotnikov</surname>
            <given-names>DE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bartalev</surname>
            <given-names>SA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zharko</surname>
            <given-names>VO</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mihailov</surname>
            <given-names>VV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prosyannikova</surname>
            <given-names>OI</given-names>
          </string-name>
          .
          <article-title>An experimental assessment of crop types recognisability using time-series of intra-seasonal spectral reflectance measurements by satellite sensor. Current problems in remote sensing of the Earth from space</article-title>
          ,
          <year>2011</year>
          ;
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>199</fpage>
          -
          <lpage>208</lpage>
          . [in Russian]
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Plotnikov</surname>
            <given-names>DE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bartalev</surname>
            <given-names>SA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Loupian</surname>
            <given-names>EA</given-names>
          </string-name>
          .
          <article-title>The recognition features to map arable lands based on multi-annual MODIS Earth observation data. Current problems in remote sensing of the Earth from space</article-title>
          ,
          <year>2010</year>
          ;
          <volume>7</volume>
          (
          <issue>1</issue>
          ):
          <fpage>330</fpage>
          -
          <lpage>341</lpage>
          . [in Russian]
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>MODIS</given-names>
            <surname>Website</surname>
          </string-name>
          . URL: http://modis.gsfc.nasa.gov/
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Zhuravlev</surname>
            <given-names>JI</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamilov</surname>
            <given-names>MM</given-names>
          </string-name>
          <article-title>Tulyaganov ShE</article-title>
          .
          <article-title>Algorithms for calculating estimates</article-title>
          .
          <source>Tashkent: “FAN” Publisher</source>
          ,
          <year>1974</year>
          ; 119 p. [in Russian]
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Vorobiova</surname>
            <given-names>NS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denisova</surname>
            <given-names>AYu</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuznetsov</surname>
            <given-names>AV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Belov</surname>
            <given-names>AM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chernov</surname>
            <given-names>AV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Myasnikov</surname>
            <given-names>VV</given-names>
          </string-name>
          .
          <article-title>How to Use Geoinformation Technologies and Space Monitoring for Controlling the Agricultural Sector in Samara Region. Pattern Recognition and Image Analysis</article-title>
          .
          <source>Advances in Mathematical Theory and Applications</source>
          ,
          <year>2015</year>
          ;
          <volume>25</volume>
          (
          <issue>2</issue>
          ):
          <fpage>347</fpage>
          -
          <lpage>353</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Vorobiova</surname>
            <given-names>NS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Timbay</surname>
            <given-names>EI</given-names>
          </string-name>
          .
          <article-title>Geoinformation system of agricultural lands inventory and control development</article-title>
          .
          <source>Computer Optics</source>
          ,
          <year>2009</year>
          ;
          <volume>33</volume>
          (
          <issue>3</issue>
          ):
          <fpage>340</fpage>
          -
          <lpage>344</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>