<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Deep Learning Approaches for Forecasting Strawberry Yields and Prices Using Satellite Images and Station-Based Soil Parameters</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mohita Chaudhary</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Sadok Gastli</string-name>
          <email>ms2gastli@uwaterloo.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lobna Nassar</string-name>
          <email>lnassar@uwaterloo.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fakhri Karray</string-name>
          <email>karray@uwaterloo.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Electrical and Computer Engineering, University of Waterloo</institution>
          ,
          <addr-line>Ontario</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Computational tools for forecasting yields and prices for fresh produce have been based on traditional machine learning approaches or time series modeling. We propose here an alternate approach based on deep learning algorithms for forecasting strawberry yields and prices in Santa Barbara county, California. Building the proposed forecasting model comprises three stages: first, the station-based ensemble model (ATT-CNN-LSTM-SeriesNet_Ens) with its compound deep learning components, SeriesNet with Gated Recurrent Unit (GRU) and Convolutional Neural Network LSTM with Attention layer (AttCNN-LSTM), are trained and tested using the station-based soil temperature and moisture data of Santa Barbara as input and the corresponding strawberry yields or prices as output. Secondly, the remote sensing ensemble model (SIM_CNN-LSTM_Ens), which is an ensemble model of Convolutional Neural Network LSTM (CNN-LSTM) models, is trained and tested using satellite images of the same county as input mapped to the same yields and prices as output. These two ensembles forecast strawberry yields and prices with minimal forecasting errors and highest model correlation for five weeks ahead forecasts. Finally, the forecasts of these two models are ensembled to have a final forecasted value for yields and prices by introducing a voting ensemble. Based on an aggregated performance measure (AGM), it is found that this voting ensemble not only enhances the forecasting performance by 5% compared to its best performing component model but also outperforms the Deep Learning (DL) ensemble model found in literature by 33% for forecasting yields and 21% for forecasting prices.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Deep Learning</kwd>
        <kwd>Satellite Images</kwd>
        <kwd>Price</kwd>
        <kwd>Yield</kwd>
        <kwd>Forecasting</kwd>
        <kwd>Fresh Produce</kwd>
        <kwd>Attention</kwd>
        <kwd>Series-Net</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In Fresh produce Supply Chain Management (FSCM), a crucial part of the procurement
process is to find a model which helps in precisely predicting the farmers’ prices. These prices
are highly afected by the yields hence the availability of accurate yields values to train the
forecasting model is crucial [1].
This study focuses on strawberries as fresh produce, whose yield depends on various
parameters related to weather, soil, synthetic factors, irrigation, and others. These factors are quite
uncertain therefore the forecasting of strawberry yields and prices is a challenging task.
Moreover, from a humanitarian point of view, the United Nations World Food Programme has
reported that around 821 million people around the world sufer from hunger [2], and that
number has been growing drastically since the start of the COVID-19 pandemic. This is part of
the reason why the United Nations have included ending hunger and the betterment of food
security as part of their main goals in their 2030 Agenda for Sustainable Development [3]. A
key aspect to overcome these issues and a significant challenge facing food security is the
ability to reliably estimate crop yields using forecasting models; the main objective of this work.
It should be noted that such forecasting models for strawberries can be applied on numerous
similar fresh produces for better yields estimates to sustain food security.</p>
      <p>The acquisition of data can be very expensive with limited availability. California is chosen
since the data for its strawberry yields, prices, and soil parameters can be acquired from various
publicly available online sources. Due to the frequent absence of localized data, using remote
sensing data such as satellite images is important since they can cover larger geographic areas.
Santa Maria, which lies within Santa Barbara county, is primarily considered in this work
because it is one of the largest stations for strawberry produce in California; it is considered the
leading state producer for strawberries [4]. The yields and prices are predicted by manipulating
historical input data to extract features that can capture as many yields and prices trends as
possible. Currently used forecasting tools for fresh produce have limited performance since
they do not consider an extensive set of influential factors afecting yields and prices. They
are also incapable of capturing the complex patterns in big data sources of prices transactions
which provide valuable information on the underlying processes afecting fresh produce prices
and quantities; this became feasible by the advent of the state-of-the-art machine learning and
DL techniques.</p>
      <p>The scope of this work is forecasting both strawberry yields and prices using input features
related to soil parameters. The yields and prices values are forecasted 5 weeks ahead. It is
found that the past 20 weeks values of the soil parameters afect the yields and hence this is
the lag considered to forecast the yields and prices. An aggregated measure is used to gauge
the performance of the forecasting models, which are compared to a simple LSTM model and a
compound DL ensemble model proposed in [5]. The voting regressor ensemble of
ATT-CNNLSTM-SeriesNet_Ens and SIM_CNN-LSTM_Ens models outperforms its compound component
models as well as simple DL models such as LSTM. Moreover, it enhances the performance of
the compound DL ensemble model described in [5] by up to 33%.</p>
      <p>Section 2 highlights literature review and previous work done in this area, it also summarizes
major findings and limitations. The details of the assembled forecasting compound DL models
are provided in Section 3. The datasets, data preprocessing, and results of the conducted
experiments are presented and analyzed in Section 4. Finally, the drawn conclusion along with
future work are found in Section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature Review</title>
      <p>Various methods are used for yields prediction like Artificial Neural Networks [6, 7], K-Nearest
Neighbors [8, 4] and Simple Long Short Term Memory Networks (LSTM) [9]. The weather
parameters are used as input parameters to predict the yields in [10]; while in [11, 5], the
corresponding prices to strawberry yields are predicted using various DL compound models
like ConvLSTM, CNN-LSTM, CNN-LSTM-GRU with attention along with DL ensemble models.
Adding a self-attention layer improves the yield prediction results to quite an extent. Moreover,
the DL models are recommended over nondeep ML and non-ML models for forecasting. In
[12], various machine learning and DL imputation techniques for missing values in datasets
are discussed. In [9], the weather parameters, soil moisture dynamic data and soil quality static
data across various counties in Iowa, USA are used as input to forecast the yearly corn yields
using a basic LSTM network. The static parameters remain constant over years for a specific
region and they are useful only when diferent regions are considered. The work presented
here considers the dynamic parameters solely as input and ignores the static ones since only the
Santa Maria county is considered for forecasting yields and prices. In [13], counter-propagation
artificial neural networks (CP-ANNs) and Supervised Kohonen Networks (SKNs) are used to
predict the wheat yields using a found set of influential soil parameters.</p>
      <p>As for the use of remote sensing in crop yields forecasting, existing approaches are
investigated. Three recent methods and applications are presented in [14, 15, 16]. Their general
framework involves using a preprocessed collection of satellite-based data to train neural
networks for yields prediction. The first approach is conducted in [14] by J. You et al., where they
use satellite images that are publicly available to predict annual soybean yields for specific
counties in the USA. The images used comprise 7 bands of surface reflectance and 2 bands of
temperature. The authors preprocessed these images into histograms due to their large size,
then fed them into several prediction models for comparison. The models used are
Convolutional Neural Network (CNN) and LSTM. Their results indicate that the CNN models are better
than LSTM in predicting yields. However, it should be noted that this approach investigates
yearly yields and not daily forecasts. This is further investigated by J. Phongpreecha in [15],
who considers moisture in addition to surface reflectance and temperature satellite images to
predict annual corn yields. He also implements dimensionality reduction to histograms as
described by [14]. The investigated models are a Custom CNN-LSTM, a Separable CNN-LSTM,
a CNN-LSTM, a 3D-CNN, and a CNN-LSTM-3D CNN network. The ConvLSTM is found to be
the best performing model in forecasting annual corn yields. A similar approach is conducted
in [16] where the authors investigate such application in southern Brazil on annual soybean
yields. Their approach considers satellite images in addition to precipitation data obtained
from weather stations. Three models are tested, namely multivariate OLS linear regression,
random forest, and LSTM where, LSTM outperforms the other two models in forecasting.</p>
      <p>From the literature review, it is evident that neural networks, specifically CNN, LSTM, and
combinations of both, are best suited for the forecasting application. The limitations of the
presented approaches are either considering data collected from localized stations for
forecasting, such data is not readily available for all croplands on earth, or giving annual predictions of
yields using weekly satellite images. Thus, daily predictions are essential with more complex
deep leaning models for higher performance.
Furthermore, the satellite images should be used alongside the stationary data to overcome the
frequent problem of data scarcity.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Models and Methodology</title>
      <p>Two models are used for forecasting which are then fed into an ensemble to combine their
outputs into a final forecasted value. The first model uses station-based soil data as input
while the other relies on soil data captured by satellite images instead. Compound DL models
are proved to perform better than simple DL and nondeep ML models as described in [5]. The
simple DL model, LSTM, and the ensemble model described in [5] are compared to the proposed
model.</p>
      <sec id="sec-3-1">
        <title>3.1. The Station-based Model (ATT-CNN-LSTM-SeriesNet_Ens)</title>
        <p>This model uses the station-based soil data to forecast the strawberry yields and prices. After
preprocessing, the data is fed into two compound DL models namely Att-CNN-LSTM and
SeriesNet with GRU. The forecasted output values of each of these two models are then fed
as input into a voting ensemble which outputs the final forecasted yields or prices.</p>
        <sec id="sec-3-1-1">
          <title>3.1.1. Att-CNN-LSTM</title>
          <p>The self-attention layer helps in focusing on the essential details in the input data. In the
proposed model, the attention layer uses additive attention and the sigmoid activation function.
  = ∑  , ′   ′
(1)
(2)
(3)
(4)
where  is the element-wise sigmoid function and   and  
are the weight matrices
correof   and  
tion value  
sponding to   and   .</p>
          <p>, while
 
′   is the weight matrix corresponding to the non-linear combination</p>
          <p>and   are the bias vectors [17]. Equation (4) shows how the
attencalculated first to find the value of attention. The compatibility score is calculated using the
is computed. The probability distribution  
and compatibility score  , ′ must be
hidden representation ℎ, ′ of   and   ′ . The use of Attention layer contributes significantly to

performance by reducing the forecasting error of the DL models. The improvement in
performance is evident in the results reported by [17, 18, 19, 20] in domains such as natural language
processing, fresh produce related predictions and healthcare questionnaires.</p>
          <p>Stacking CNNs and LSTMs together helps in utilizing the strength of each of these models
[21, 18]. The CNNs help in extracting the spatial features whereas the LSTMs help in
extracting the temporal features and using their combination highly improves the forecasting
performance. The attention layer is added to the CNN-LSTM compound network to improve
its performance.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>3.1.2. SeriesNet with GRU</title>
          <p>In this model, the GRU is used alongside the dilated causal convolution. The GRU network is a
two-layer network with attention module in between. The output at the end is flattened and fed
into a single neuron dense layer. The number of layers and neurons alter as per the application.
The architecture of the model is illustrated in Figure 2. Gated Recurrent Unit (GRU) [22] is a
newer version of RNNs and is quite similar to LSTMs [23]. GRUs use hidden states rather than
using the cell state or memory to transfer the information.
They have two gates, a reset gate, and an update gate. The update gate acts similarly to the
forget gate and input gates of LSTM. The reset gate, on the contrary, is used to decide how much
past information has to be forgotten. GRUs are faster than LSTMs since they have fewer tensor
operations. Both LSTMs and GRUs are designed to overcome the short-term memory issues
faced by RNNs. Since RNNs face vanishing or exploding gradient issues, GRUs are introduced
to mitigate those issues. The structure is similar to those of LSTMs and consists of gates that
ensure that the issue of gradient is not encountered.</p>
          <p>The traditional time series forecasting models are unable to efectively extract essential data
features; thus the authors in [24] came up with a novel forecasting architecture called
SeriesNet. The SeriesNet consists of two networks, an LSTM network and a dilated causal
convolution network. The dilated convolution handles the loss of resolution or coverage due to the
down-sampling operation in image semantic segmentation [25]. It uses dilated convolutions
to systematically aggregate multi-scale contextual information and improve the accuracy of
image recognition. The causal convolution ensures that the convolution kernel of CNN can
perform convolution operations exactly in time sequence [26], and that the convolution kernel
only reads the current and historical information. The LSTM network aims to learn holistic
features and to reduce dimensionality of multi-conditional data. The combined results of the
networks help the models to learn multi-range and multilevel features from time series data,
hence it has higher predictive accuracy compared to other models. The SeriesNet with GRU
model uses residual learning as well as batch normalization to improve generalization.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. The Remote Sensing Ensemble Model (SIM_CNN-LSTM_Ens)</title>
        <p>This model is an ensemble of CNN-LSTM. It incorporates the capabilities of both CNN and
LSTM by treating them as layers. The architecture for the CNN-LSTM model used for yields
forecasting is presented in Figure 3, which difers from the prices forecasting model. It should
also be noted that the models are fine tuned through trial and error of multiple configurations.
As for the CNN-LSTM prices forecasting model, its architecture is presented in Figure 4. The
preprocessing is implemented using the Geospatial Data Abstraction Library (GDAL) [27] in
Python to import the images into a processable format. After preprocessing, the features are
fed into the designed models. These models are implemented uisng Python’s Tensoflow Keras
library [28] due to its user-friendliness.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Ensemble of ATT-CNN-LSTM-SeriesNet_Ens and SIM_CNN-LSTM_Ens</title>
        <p>The previous approaches extract diferent features from their data, therefore averaging their
forecasted outputs could potentially lead to a better overall forecast. Hence, a voting
ensemble is required to achieve this averaging. Figure 5 illustrates the overall architecture of the
ifnal proposed model. The outputs of both compound models are combined by the averaging
ensemble to obtain the final forecast.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Evaluation Metrics</title>
        <p>The metrics used for analysis are the Mean Absolute Error (MAE), the Root Mean-Squared Error
(RMSE), R-Squared coeficient (  2), and the Aggregated Measure (AGM). The unit in which
MAE is measured is pounds/acre for yield and US dollars for price. The AGM is a measure
composed of all the previously mentioned metrics, whose purpose is to attempt to incorporate
the information captured by all three metrics into one metric to simplify the process of deciding
the best performing model [11, 29]. The measure is negatively-oriented, meaning lower scores
indicate better performance. It is mathematically defined in (5).</p>
        <p>=
 
+  
2
× (1 −  2)
(5)</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Datasets, Experiments and Analysis</title>
      <sec id="sec-4-1">
        <title>4.1. Datasets and Preprocessing</title>
        <sec id="sec-4-1-1">
          <title>4.1.1. Station-based Data</title>
          <p>There are various parameters related to the soil afecting the yields and prices of fresh produce
namely: soil moisture, soil temperature, solar radiation, surface temperature, PDSI (Palmer
Drought Severity Index) [30, 31],... etc. Choosing the most influential parameters is a major
challenge due to the high correlation amongst those parameters. Hence, the Random Forest
feature selection method in Python with scikit-learn is used and the soil moisture and
temperature are selected accordingly as the most important parameters. Moreover, soil moisture
and temperature are the two soil parameters that can be obtained from satellite images as well.
The station-based soil data is downloaded from the National Oceanic and Atmospheric
Administration website [32]. The data for strawberry yields and prices are downloaded from the
California Strawberry Commission website [33]. A lag of the past 20 weeks, i.e. 140 days, of
soil parameters values is found to afect the yields forecasting and prices values 5 weeks ahead.
Two daily parameters are considered, soil temperature and moisture, hence the total number
of input parameters adds up to 280 (2 parameters x 140 days). After normalizing the data, the
Principal Component Analysis (PCA) [34] is applied on the 280 parameters and it is found that
the first 36 parameters gave the maximum proportion of variance therefore are chosen to train
and test the forecasting model along with the corresponding yields or prices output. The total
number of available samples is 2812 (from year 2011 to 2019) out of which 80% are used for
training and 20% for testing.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Remote Sensing Data</title>
          <p>The remote sensing data contains two sets of satellite images; one for surface temperature
data and the other for moisture data. The images in the temperature dataset, obtained from
[35], are taken daily, whereas those in the moisture dataset, obtained from [36], are taken
every 3 days. Therefore, each moisture image is duplicated twice, once for the day before the
original image and once for the day after, in order to have daily moisture values along with the
daily temperature. This approach is an approximation of the missing days based on the closest
known data to those days. Samples of the original images obtained from the datasets for Santa
Barbara county are presented in Figure 6. The moisture bands are for the mainland moisture
levels, which is why the data is not available for the islands and coastal areas. In addition, a
land cover mask is applied to all images, obtained from the MODIS database [35]. The mask
maintains the pixel values of the image parts of interest, while setting pixels not within the
mask to a zero value, to ensure that the model only trains on pixel values that correspond to
crop lands hence minimizing the number of pixels processed. The total number of samples is
2977 (from year 2011 to 2019) out of which 80% are used for training and 20% for testing.</p>
          <p>As shown in [14], the images are too large to be fed directly into the model. Thus,
dimensionality reduction is applied by converting them into histograms of pixel frequency counts.
This reduction is based on the assumption of permutation invariance, meaning that shufling
the pixels has no efect on the information retained in the image. The histograms are then
normalized based on their pixel values. Surface reflectance bands used in [14] and [15] are
omitted due to their negligible improvement to the prediction performance for the considered
county and fresh produce. The number of bins,  , is set to be 32; as it is a reasonable spread
of pixel counts as found in [14]. Moreover, the range of pixel values for each band is chosen
to maximize the spread of the pixels count distribution. The structure is visualized in Figure 7;
where the three axes represent the three dimensions of the histogram. Before being fed to the
models, the bins and bands dimensions are flattened into a single dimension to fit the models.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Experiments and Analysis</title>
        <p>The block diagram in Figure 5 depicts the proposed final model, which has a voting ensemble
which averages the forecasted yields or prices resulting from the two proposed DL compound
ensembles described in Section 3.1 and Section 3.2. The main idea behind this averaging is that
the final forecast should pick the trends forecasted by both of the component models.</p>
        <p>For yield forecasting, Figure 8 shows the true yields versus the yields forecasted by
ATTCNN-LSTM-SeriesNet_Ens, SIM_CNN-LSTM_Ens, and their averaging ensemble. The figure
shows that the voting ensemble successfully follows the general trend of the true values despite
its inability to describe the sharp fluctuations. Table 1 presents diferent metrics scores of the
forecasted values for the diferent models. Although SIM_CNN-LSTM_Ens outperforms
ATTCNN-LSTM-SeriesNet_Ens in AGM, the averaging ensemble outperforms both of its weaker
component models with its lowest AGM. This indicates that there is some information obtained
by the latter that enhances the performance compared to the individual models. Based on AGM,
the voting ensemble enhances the forecasting performance over that of the DL ensemble model
in [5] by 33% and that of the simple LSTM DL by 56%.</p>
        <p>For prices forecasting, Figure 9 depicts the true prices versus the prices forecasted by
ATTCNN-LSTM-SeriesNet_Ens, SIM_CNN-LSTM_Ens, and the averaging ensemble of both. Table
2 presents diferent performance metrics scores for the tested models. Although
ATT-CNNLSTM-SeriesNet_Ens outperforms SIM_CNN-LSTM_Ens with a noticeable AGM diference,
the voting ensemble outperforms both of these compound components with its lowest AGM.
The voting ensemble persists to be the highest performing compared to the LSTM model, 53%
improvement, and to the ensemble model in [5], 21% improvement in AGM.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future Work</title>
      <p>This paper explores the application of multiple DL models in forecasting strawberry yields and
prices. Station-based data is used to train the ATT-CNN-LSTM-SeriesNet_Ens, while remote
sensing-based data is used to train the SIM_CNN-LSTM_Ens for forecasting. It is found that
the SIM_CNN-LSTM_Ens model does better at yields forecasting while the
ATT-CNN-LSTMSeriesNet_Ens model is better at prices forecasting. The voting ensemble of these models
outperforms its individual components since each component provides a diferent forecasting
behavior. Moreover, it further proves to perform better by up to 33% compared to the most recent
DL ensemble forecasting model. An acknowledged limitation is the restricted ability of the
deployed models to capture steep fluctuations in yields and prices, making them not predictable
with the available tools. Moreover, having an ensemble of various compound DL models is
computationally expensive. Potential future work is needed to further improve the ability of
the models to capture steep fluctuations in both yields and prices. Generalization of the models
application to other fresh produces is also required; this is achieved through transfer learning
to forecast yields and prices of FPs similar to strawberries eficiently with minimal retraining.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments References</title>
      <p>The authors would like to acknowledge the financial support provided by Loblaws corporation,
NSERC CRD program and Mitacs.</p>
      <p>[1] R. Waldick, L. Bizikova, D. White, K. Lindsay, An integrated decision-support process for
adaptation planning: climate change as impetus for scenario planning in an agricultural
region of canada, Regional Environmental Change 17 (2017) 187–200.
[2] UN World Food Programme, 2019 - hunger map, 2019. URL: https://www.wfp.org/
publications/2019-hunger-map, accessed 27.11.2020.
[3] United Nations General Assembly, Transforming our World: The 2030 Agenda for
Sustainable Development, 2015. URL: https://sustainabledevelopment.un.org/post2015/
transformingourworld/publication, accessed 27.11.2020.
[4] T. B. Pathak, S. K. Dara, A. Biscaro, Evaluating correlations and development of
meteorology based yield forecasting model for strawberry, Advances in Meteorology (2016).
[5] I. Okwuchi, Machine Learning based Models for Fresh Produce Yield and Price Forecasting
for Strawberry Fruit, Master’s thesis, University of Waterloo, 2020.
[6] F. O. Karray, C. W. De Silva, Soft computing and intelligent systems design: theory, tools,
and applications, Pearson Education, 2004.
[7] P. Doganis, A. Alexandridis, P. Patrinos, H. Sarimveis, Time series sales forecasting for
short shelf-life food products based on artificial neural networks and evolutionary
computing, Journal of Food Engineering 75 (2006) 196–204.
[8] M. L. Maskey, T. B. Pathak, S. K. Dara, Weather based strawberry yield forecasts at field
scale using statistical and machine learning models, Atmosphere 10 (2019) 378.
[9] Z. Jiang, C. Liu, N. P. Hendricks, B. Ganapathysubramanian, D. J. Hayes, S. Sarkar,
Predicting county level corn yields using deep long short term memory models, arXiv preprint
arXiv:1805.12044 (2018).
[10] M. Kaul, R. L. Hill, C. Walthall, Artificial neural networks for corn and soybean yield
prediction, Agricultural Systems 85 (2005) 1–18.
[11] L. Nassar, I. E. Okwuchi, M. Saad, F. Karray, K. Ponnambalam, P. Agrawal, Prediction
of strawberry yield and farm price utilizing deep learning, in: 2020 International Joint
Conference on Neural Networks (IJCNN), IEEE, 2020, pp. 1–7.
[12] M. Saad, M. Chaudhary, F. Karray, V. Gaudet, Machine learning based approaches for
imputation in time series data and their impact on forecasting, in: 2020 IEEE International
Conference on Systems, Man, and Cybernetics (SMC), IEEE, 2020, pp. 2621–2627.
[13] X. E. Pantazi, D. Moshou, T. Alexandridis, R. L. Whetton, A. M. Mouazen, Wheat yield
prediction using machine learning and advanced sensing techniques, Computers and
Electronics in Agriculture 121 (2016) 57–65.
[14] J. You, X. Li, M. Low, D. Lobell, S. Ermon, Deep gaussian process for crop yield prediction
based on remote sensing data, in: Thirty-First AAAI Conference on Artificial Intelligence,
2017, p. 4559–4565.
[15] J. Phongpreecha, Early corn yields prediction using
satellite images, 2018. URL: https://towardsdatascience.com/
early-corn-yields-prediction-using-satellite-images-dcf49b24efab, accessed 27.11.2020.
[16] R. A. Schwalbert, T. Amado, G. Corassa, L. P. Pott, P. Prasad, I. A. Ciampitti, Satellite-based
soybean yield forecast: Integrating machine learning and weather data for improving crop
yield prediction in southern Brazil, Agricultural &amp; Forest Meteorology 284 (2020) 107886.
[17] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I.
Polosukhin, Attention is all you need, in: Advances in neural information processing systems,
2017, pp. 5998–6008.
[18] Y. Zhang, J. Zheng, Y. Jiang, G. Huang, R. Chen, A text sentiment classification modeling
method based on coordinated cnn-lstm-attention model, Chinese Journal of Electronics
28 (2019) 120–126.
[19] S. De Alwis, Y. Zhang, M. Na, G. Li, Duo attention with deep learning on tomato yield
prediction and factor interpretation, in: Pacific Rim International Conference on Artificial
Intelligence, Springer, 2019, pp. 704–715.
[20] R. Cai, B. Zhu, L. Ji, T. Hao, J. Yan, W. Liu, An cnn-lstm attention approach to
understanding user query intent from online health communities, in: 2017 IEEE International
Conference on Data Mining Workshops (ICDMW), IEEE, 2017, pp. 430–437.
[21] L. Zheng, W. Xue, F. Chen, P. Guo, J. Chen, B. Chen, H. Gao, A fault prediction of
equipment based on cnn-lstm network, in: 2019 IEEE International Conference on Energy
Internet (ICEI), IEEE, 2019, pp. 537–541.
[22] R. Dey, F. M. Salemt, Gate-variants of gated recurrent unit (gru) neural networks, in: 2017
IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS), IEEE,
2017, pp. 1597–1600.
[23] R. Fu, Z. Zhang, L. Li, Using lstm and gru neural network methods for trafic flow
prediction, in: 2016 31st Youth Academic Annual Conference of Chinese Association of
Automation (YAC), IEEE, 2016, pp. 324–328.
[24] Z. Shen, Y. Zhang, J. Lu, J. Xu, G. Xiao, SeriesNet: A Generative Time Series Forecasting</p>
      <p>Model, in: International Joint Conference on Neural Networks (IJCNN), IEEE, 2018.
[25] F. Yu, V. Koltun, Multi-scale context aggregation by dilated convolutions, arXiv preprint
arXiv:1511.07122 (2015).
[26] A. v. d. Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner,
A. Senior, K. Kavukcuoglu, Wavenet: A generative model for raw audio, arXiv preprint
arXiv:1609.03499 (2016).
[27] F. Warmerdam, et al., GDAL, n.d. URL: https://gdal.org/, accessed 27.01.2021.
[28] F. Chollet, et al., Keras, https://keras.io, 2015.
[29] L. Nassar, M. Saad, I. E. Okwuchi, M. Chaudhary, F. Karray, K. Ponnambalam, Imputation
impact on strawberry yield and farm price prediction using deep learning, in: IEEE
International Conference on Systems, Man, and Cybernetics (SMC), IEEE, 2020, pp. 3599–3605.
[30] Details on the Soil Parameters, n.d. URL: https://www1.ncdc.noaa.gov/pub/data/uscrn/
products/hourly02/README.txt, accessed 27.11.2020.
[31] Palmer Drought Severity Index and Palmer Z-Index, n.d. URL: http://www.</p>
      <p>worldwindsinc.com/palmer.htm, accessed 27.11.2020.
[32] National Oceanic and Atmospheric Administrations, n.d. URL: https://www.noaa.gov/,
accessed 27.11.2020.
[33] The California Strawberry Commission website, n.d. URL: https://www.calstrawberry.</p>
      <p>com/en-us/, accessed 27.11.2020.
[34] H. Abdi, L. J. Williams, Principal component analysis, Wiley interdisciplinary reviews:
computational statistics 2 (2010) 433–459.
[35] LP DAAC MODIS, n.d. URL: http://lpdaac.usgs.gov/, accessed 27.11.2020.
[36] NASA-USDA global soil moisture data, n.d. URL: https://earth.gsfc.nasa.gov/hydro/data/
nasa-usda-global-soil-moisture-data, accessed 27.11.2020.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>