<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Data assimilation of LAI improved crop growth modeling: Comparison between in-situ measurements and satellite estimations⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marija Kopanja</string-name>
          <email>marija.kopanja17@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gordan Mimić</string-name>
          <email>gordan.mimic@biosense.rs</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hilde Vaessen</string-name>
          <email>hilde.vaessen@wur.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nevena Stevanović</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nikola Stanković</string-name>
          <email>nikola.stankovic@biosense.rs</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maša Buđen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dragana Blagojević</string-name>
          <email>dragana.blagojevic@biosense.rs</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bernardo Maestrini</string-name>
          <email>bernardo.maestrini@wur.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nataša Ljubičić</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sanja Brdar</string-name>
          <email>sanja.brdar@biosense.rs</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frits van Evert</string-name>
          <email>frits.vanevert@wur.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>BioSense Institute</institution>
          ,
          <addr-line>Zorana Đinđića 1, Novi Sad, 21000</addr-line>
          ,
          <country country="RS">Serbia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Wageningen University &amp; Research</institution>
          ,
          <addr-line>Droevendaalsesteeg 4, Wageningen, 6708 PB</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <fpage>84</fpage>
      <lpage>89</lpage>
      <abstract>
        <p>A digital twin of crops was created by integrating real-time measurements into existing crop growth models. The study compared the performance of the WOFOST model in predicting yield for maize and soybean, with and without data assimilation of leaf area index (LAI) measurements. Data assimilation improved the model's performance, especially when using in-situ measured LAI, leading to more accurate yield predictions.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Digital twin</kwd>
        <kwd>Data assimilation</kwd>
        <kwd>WOFOST</kwd>
        <kwd>Ensemble Kalman Filter</kwd>
        <kwd>LAI 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        A concept of digital twin originates from engineering, and it has been developed to describe
physical and chemical processes in the machines. Recently, it is finding the application in the green
life sciences, and particularly it can be used in digital farming [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Gathering data on weather, soil
conditions, and key plant growth parameters throughout the season enables running models for
predicting crop growth and yield under specific abiotic conditions. A digital twin of a crop can be
created by the integration of near real-time measurements from various sensors into existing crop
growth models. The purpose of the digital twin in digital farming is to simulate crop growth in
nearreal time and adjust the state variables to reflect the situation in the field.
      </p>
      <p>
        A systematic review about the current status and future perspectives of assimilation of remote
sensing data into crop growth models gave insight into a number of challenges ahead, some of which
are coming from the observation [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The first challenge is to combine observations from different
sensors, with different spatial and temporal resolution, in a harmonized way, and the second one is
to provide accurate estimates of the uncertainty of the retrieved variables. However, it is shown that
data assimilation (DA) is a valuable technique for estimating variables related to crop growth, such
as: soil moisture, biomass and leaf area index (LAI), by using satellite remote sensing data [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
Some authors developed and successfully validated a process-based soybean model for a prediction
of within-field yield variability by coupling LAI retrieval from Sentinel-2 into the crop model, and
with that approach they managed to optimize some soil-related parameters [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In [6], the authors
assimilated sensor data in a crop growth model of potato, using Planet topsoil moisture C-band data
and canopy reflectance from Sentinel-2 to calculate LAI, and as a result the final yield estimate is
being corrected to a value closer to what was measured in the field at the end of the season.
      </p>
      <p>In this study, the ensemble Kalman filter was exploited. We used data collected at field level
during the 2023 growing season and the WOFOST crop simulation model for a prediction of maize
and soybean yield. We investigated the performance of a model after DA of LAI, by comparing
insitu measurements and satellite estimations.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Data acquisition</title>
      <p>The experiment was conducted in Čenej, Vojvodina (Serbia), covering an area of 56 hectares,
where soybeans and maize were cultivated, each occupying 28 hectares. The field
(latitude=45.410290, longitude=19.804531) was divided by partnering agricultural company into 20
different management zones based on multi-year average values of the NDVI in growing season and
the zones were used as stratification for the soil texture sampling. In this study, six zones were
designated for each crop. The field topography is fairly plain and the existing soil types are as follows:
loam (medium), clay loam (medium fine) and clay (fine). Before planting, soil samples were taken
from each zone to conduct a basic soil chemical analysis. The planting of both crops was carried out
on April 27, 2023. Automatic weather station (PrecMet, Kite ZRT, Hungary) was positioned at the
edge of the field to gather daily data on air temperature, precipitation, air humidity, wind speed and
direction. Surface solar radiation was downloaded from ERA-5 reanalysis dataset available at Climate
Data Store [7].</p>
      <p>To track the progress of plant development, the LAI of soybean and maize was consistently
monitored throughout the growing season. For soybean, measurements were taken from the V4 stage
until harvest, while for maize, measurements were conducted from the V6-V7 stages until harvest,
with readings taken biweekly, resulting in four observed values. In each zone, plant samples were
collected from a 1 m² area and the whole sample was passed through an area meter (LI - 3100C) to
measure LAI. At the corresponding time points LAI was also estimated using cloud-free Sentinel-2
images processed through Sentinel Application Platform (SNAP), resulting in 10 values during the
growing season. Harvesting of crops was performed with the John Deere S770 combine equipped
with the John Deere Harvest Monitor™ system for yield monitoring during harvest. This system is
integrated with GPS, enabling us to track reliable yield data at appropriate locations in the field and
thoroughly analyze our management zones (Figure 1). For each zone, the average yield was
calculated and further compared with the results of the crop growth model.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <sec id="sec-3-1">
        <title>3.1. WOFOST</title>
        <p>The WOFOST is a dynamic, explanatory model that simulates crop growth with a temporal
resolution of one day. The crop growth is simulated based on several eco-physiological processes,
including phenological development, CO2-assimilation, transpiration etc. In general, three different
hierarchical levels of crop growth (potential, limited, reduced) can be distinguished that correspond
to three levels of crop production [8]. In WOFOST implementation there are three levels of crop
production: potential, water-limited, and nutrient-limited. The focus of our work was on the
waterlimited crop growth model due to environmental constraints regarding the availability of water. For
that, the implementation of WOFOST version 7.2 is used in the Python Crop Simulation Environment
(PCSE) 5.4.0. Accordingly, a yield estimation is retrieved from the variable Total Weight Storage
Organs (TWSO) that represent the harvestable product of the crop as dry weight in kg/ha.</p>
        <p>In our analysis, the WOFOST water-limited crop growth model is used for two crops: maize, and
soybean. The modeling concept is the same for both crops, while the differences are expressed by
differences in the value of model parameters. Three different types of model parameters are required
for running the WOFOST. First group of parameters consist of a set of crop parameters, a set of soil
parameters, and a set of site parameters. Second group is related to weather data. Lastly, parameters
related to agro management actions are required to specify the farm activities that will take place on
the field.</p>
        <p>The crop parameter files Grain_maize_203 and Soybean_902 are chosen for maize and soybean
from a given list of pre-defined crop files, appropriate to the varieties used in the field experiment,
based on the growing season duration. Variables related to soil moisture required by WOFOST enable
a connection between the WOFOST crop simulation model and the underlying soil model. The
knowledge of the agronomists is used to determine the most appropriate soil file from the list of
available soil types. In the data acquisition stage it is ensured that soil types are known for each zone.
Soil data files are named EC-1 (coarse), EC-2 (medium), EC-3 (medium fine), EC-4 (fine), and EC-5
(very fine). For our experimental analysis it is determined that EC-2, EC-3 and EC-4 correspond to
the soil types in the field. More precisely, for six zones of maize are used EC-2 and EC-3, while for
six zones of soybean are used EC-2, EC-3 and EC-4.</p>
        <p>Additionally, two parameters that needed to be defined before running WOFOST are maximum
soil-limited rooting depth (RDMSOL) and initial amount of available water in total rootable zone
(WAV). A RDMSOL was set to be 150 cm based on knowledge of local pedology from the field
agronomist. The sensitivity analysis is used to determine WAV and a value of 40 is obtained for both
maize and soybean.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Ensemble Kalman filter and data assimilation</title>
        <p>Ensemble Kalman filter is a sequential DA method, meaning the update of the state variables of
the crop model is done for each date until an observation is available. The study focused on the LAI,
and investigated the performance of the crop growth model after DA of LAI, obtained in two different
ways: in-situ measured and estimated from satellite images. Adjustment of each model state was
based on standard deviation of the observed and the simulated LAI, as a measure of the uncertainty
of the observation and model state.</p>
        <p>DA was performed using the ensemble Kalman filter, with 50 ensemble members, varying the
total initial dry weight (TDWI) as initial condition, and life span of leaves (SPAN) as a parameter,
both related to the leaves. The higher value of SPAN has a positive effect on LAI during the final
growth period (crop maturity) while on the other hand, during the initial phase of crop
establishment, LAI at emergence is calculated from TDWI [8]. The TDWI and SPAN are treated as
Gaussian random variables with a mean value and a standard deviation which are determined from
several runs of the model for each crop individually. For maize, the mean value and standard
deviation of the TDWI distribution are 50 and 10, respectively, while for soybean corresponding
values are 90 and 10. Regarding the distribution of SPAN, for maize the corresponding values of
mean and standard deviation are 40 and 3, whereas for soybean the mean and standard deviation are
50 and 3.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and discussion</title>
      <p>The WOFOST model was run for each zone and the results for yield prediction were compared in
three different cases: simulation without DA, DA of in-situ measured LAI (M), and DA of LAI
estimated from satellite images (E). Examples for the ensemble Kalman filter DA of in-situ measured
LAI and satellite LAI are given in Figure 3 for maize and soybean.</p>
      <p>Performance of the WOFOST model was evaluated for the final yield, comparing it to the observed
yield, by using standard error metrics such as mean absolute error (MAE) and root mean squared
error (RMSE). The average yield for maize is 10 670 kg/ha, while the average yield for soybean is 3
800 kg/ha. In Table 1 are reported averaged results over all subzones for a given crop. Notably, for
maize the results indicate that model with DA of LAI estimated from satellite images overperformed
two other cases, while for soybean the model with in-situ measured LAI performed better compared
to the model without DA and model with DA of satellite LAI.</p>
      <p>Visual representation of the yield estimation using WOFOST simulation with/without DA versus
field measured yield for each subzone (annotated next to corresponding dot) and both crops, maize,
and soybean, is given in Figure 2.</p>
      <p>It is shown in Table 1 that in both cases DA improved the performance of the WOFOST model.
DA adjusted the state variable such as LAI to reflect the situation in the field. This can be seen in
Figure 3 where in both cases the first assimilated measurement of LAI decreased unrealistically high
values in the early stages of the growing season. Thus, in this case the adjustment of LAI partially
corrected the wrong emergence date estimation from the model. Nonetheless the impact of a wrong
emergence date extends beyond the LAI curve, and it involves calculation of thermal time. Thus,
having a correct estimate of emergence day remains of major importance when modeling crop
growth.</p>
      <p>
        Our study demonstrates one successful application and validation of the WOFOST crop growth
model with DA for two important crops in Serbia: maize and soybean. Experimental work for the
87
soybean is generally important as there is limited testing and validation of the WOFOST model for
this crop [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work was supported by the ANTARES project funded from the European Union’s Horizon
2020 research and innovation programme under grant agreement No. 739570. Authors express
sincere gratitude to KITE d.o.o. for the company's collaboration and generous sharing of their field
notes to ensure relevant data in our research.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.
[6] Maestrini, B., Vaessen, H. M., van Hoeven, A. J. A., Sijbrandij, F. D., van Oort, P. A. J., Kempenaar,
C., &amp; van Evert, F. K. (2023) Using assimilation of sensor data in crop growth models.
https://edepot.wur.nl/638921.
[7] Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J.,
Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., Thépaut, J-N. (2023)
ERA5 hourly data on single levels from 1940 to present. Copernicus Climate Change Service
(C3S) Climate Data Store (CDS), DOI: 10.24381/cds.adbb2d47 (Accessed on 29-4-2024).
[8] de Wit, A., Boogaard, H. 2021. A gentle introduction to WOFOST. Wageningen Environmental
Research.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Knibbe</surname>
            ,
            <given-names>W.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Afman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boersma</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bogaardt</surname>
          </string-name>
          , M.-J.,
          <string-name>
            <surname>Evers</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evert</surname>
          </string-name>
          , F.,
          <string-name>
            <surname>van der Heide</surname>
          </string-name>
          , J. et al. (
          <year>2022</year>
          )
          <article-title>Digital Twins in the Green Life Sciences</article-title>
          .
          <source>NJAS: Impact in Agricultural and Life Sciences</source>
          <volume>94</volume>
          , no.
          <issue>1</issue>
          ,
          <fpage>249</fpage>
          -
          <lpage>79</lpage>
          . doi:
          <volume>10</volume>
          .1080/27685241.
          <year>2022</year>
          .
          <volume>2150571</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gómez-Dans</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Ma, H.,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>P.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xue</surname>
            ,
            <given-names>J.- H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Assimilation of remote sensing into crop growth models: Current status and perspectives</article-title>
          ,
          <source>Agricultural and Forest Meteorology</source>
          ,
          <fpage>276</fpage>
          -
          <lpage>277</lpage>
          , 107609, https://doi.org/10.1016/j.agrformet.
          <year>2019</year>
          .
          <volume>06</volume>
          .008.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Jindo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozan</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Wit</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2023</year>
          )
          <article-title>Data Assimilation of Remote Sensing Data into a Crop Growth Model</article-title>
          . In: Cammarano, D., van
          <string-name>
            <surname>Evert</surname>
            ,
            <given-names>F.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kempenaar</surname>
          </string-name>
          , C. (eds) Precision Agriculture: Modelling. Progress in Precision Agriculture. Springer, Cham. https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -15258-
          <issue>0</issue>
          _
          <fpage>8</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Gaso</surname>
            ,
            <given-names>D.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paudel</surname>
          </string-name>
          , D.,
          <string-name>
            <surname>de Wit</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puntel</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mullissa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kooistra</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2024</year>
          )
          <article-title>Beyond assimilation of leaf area index: Leveraging additional spectral information using machine learning for site-specific soybean yield prediction</article-title>
          ,
          <source>Agricultural and Forest Meteorology</source>
          ,
          <volume>351</volume>
          , 110022, https://doi.org/10.1016/j.agrformet.
          <year>2024</year>
          .
          <volume>110022</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Gaso</surname>
            ,
            <given-names>D.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Wit</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Berger</surname>
            ,
            <given-names>A.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kooistra</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2021</year>
          )
          <article-title>Predicting within-field soybean yield variability by coupling Sentinel-2 leaf area index with a crop growth model</article-title>
          ,
          <source>Agricultural and Forest Meteorology</source>
          ,
          <fpage>308</fpage>
          -
          <lpage>309</lpage>
          , 108553, https://doi.org/10.1016/j.agrformet.
          <year>2021</year>
          .
          <volume>108553</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>