<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Figure</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Prediction of Yield Fluctuations Caused by Extreme Weather in Greenhouse Tomato Cultivation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Reo Kamiyama</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ken-ichi Minamino</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Software and Information Science, Iwate Prefectural University</institution>
          ,
          <addr-line>152-52 Sugo, Takizawa-shi, Iwate, 0200693</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>6</volume>
      <issue>2024</issue>
      <fpage>20</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>In recent years, the efects of climate change have become more pronounced, and extreme weather events have significantly afected the agricultural sector. In this study, we aimed to analyze the impact of record-high temperatures between June to August of 2023 and 2024 on the yield of greenhouse tomato cultivation in Iwate Prefecture, Japan. Random Forest regression model was used to analyze the yield prior to the occurrence of exceptional fluctuations and to forecast such fluctuations based on the deviations from actual yields. The results revealed that the method was efective in forecasting the signs of yield fluctuation. In the future, this method can be applied to risk prediction and pest control measures in greenhouse environments.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Smart Agriculture</kwd>
        <kwd>Tomato Cultivation</kwd>
        <kwd>Machine Learning</kwd>
        <kwd>Yield Fluctuations</kwd>
        <kwd>Extreme Weather</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Between 2023 and 2024, both the frequency and severity of extreme weather events increased,
driven by the ongoing progression of global warming. In agriculture, production management
based on empirical rules (such as reliance on conventional weather patterns) is no longer
suficient to cope with this situation, making it increasingly dificult to ensure stable yields
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Even under seemingly controlled environments, such as greenhouses, changes in external
temperature and solar radiation afect the internal climate, which result in yield fluctuations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        In Japan, crop growth is increasingly afected by several factors, such as heat stress during
summer, insuficient sunlight caused by prolonged rainfall during the rainy season, and the
emergence of new pests and diseases. Exceptional fluctuations in yield and quality have been
reported, even in greenhouse tomato cultivation, where environmental conditions are typically
controlled [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. In recent years, yield prediction models that are developed using environmental
and yield data collected before the frequent occurrence of extreme weather events have become
less accurate. During the summer of 2024, prolonged heatwaves made it dificult to control the
internal temperature of greenhouses, leading to serious quality issues, such as poor fruit set and
decreased sugar content. Furthermore, in recent years, complex and unprecedented factors have
caused abnormal plant development and unanticipated reductions in yield; these factors include
the simultaneous outbreak of new pests (e.g., tobacco whitefly) and diseases (e.g., powdery
mildew and late blight) that have adapted to warming climates [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. These fluctuations not
only destabilize the income of producers but also pose significant challenges in securing a
stable food supply at the regional and national levels. Existing studies have analyzed yield
lfuctuations using statistical analyses such as multiple regression and machine learning models
such as regression [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ]. However, few studies have been conducted to predict abnormal
yield fluctuations caused by global warming.
      </p>
      <p>
        In this study, we aimed to investigate the nature of exceptional yield fluctuations and develop
a prediction method. Using environmental and yield data collected from IoT sensors at the
Iwate Wakae Farm, Inc [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] (Morioka City, Iwate Prefecture, Japan), we aimed to develop a
yield prediction model based on conditions observed before the frequent occurrence of extreme
weather events. We then quantitatively aimed to evaluate the discrepancies between the
predicted and actual yields and identify early signs and timing of anomalies, with the goal
of developing a method to detect yield fluctuations in advance. Iwate Wakae Farm, Inc is a
large-scale facility equipped with a Venlo-type greenhouse that employs a long-term multiphase
cultivation system. The greenhouse covered an area of 2,160 m², and a hydroponic system using
rockwool as the growing medium has been employed in the greenhouse. The facility is equipped
with several environmental sensors, including solar radiation, temperature, humidity, and CO2,
allowing for high-precision monitoring of the cultivation environment. The comprehensive data
collection could significantly enhance the reliability of the analyses conducted in this study.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Yield prediction model before the frequent occurrence of extreme weather events (Normal model)</title>
      <sec id="sec-2-1">
        <title>2.1. Explanatory and Objective Variables</title>
        <p>
          The primary explanatory variables used in this study were solar radiation and average daily
temperature. Over the past four weeks, solar radiation had accumulated, which is closely associated
with photosynthetic activity and fruit enlargement. Previous studies have demonstrated that this
time span is highly correlated with the progression of growth stages from fruit set to maturation
(Figure 2) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. We aimed to capture the distribution of the daily average temperatures rather
than relying on simple mean values. Therefore, we introduced a histogram-based representation
(hereafter referred to as the H function), which divided the daily average temperatures over
the past eight weeks into 1 °C bins and used the relative frequency of each bin as a feature
variable (Figure 3) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. Weekly cumulative yield for the following week was set as the objective
variable (Figure 4). Daily yields were estimated using a three-week moving average to account
for practical conditions, such as scheduled harvest breaks (e.g., once a week), which lead to
periodic gaps in yield records. This smoothing approach facilitated the extraction of underlying
yield trends. Previous studies have confirmed that the influence of harvest breaks is typically
confined to period spanning one week before and one week after the break [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Model Building and Evaluation</title>
        <p>
          The yield prediction model was constructed using a random forest regression algorithm, which
is well-suited for capturing nonlinear relationships and ofers strong robustness against external
disturbances [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. A normal model (model18–22) was trained on data obtained between
December 2018 and August 2022, the period before the frequent occurrence of extreme weather events.
For comparison, a model with extended period (model18–23) was also developed using data up
to August 2023, including the potentially abnormal year 2023. The test datasets consisted of data
from the year following each training period: December 2022 to August 2023 for model18–22,
and December 2023 to August 2024 for model18–23. Model performance was evaluated using
the coeficient of determination (R²) and root mean square error (RMSE).
        </p>
        <p>
          2 = 1 −
∑︀=1( − ˆ )2
∑︀=1(−)¯ 2
  = √︂ 1 ∑︁=1( − ˆ )2 (2)
Where n is the number of observations,  is the i-th observed value, ˆ is the corresponding
predicted value, and¯ is the mean of the observed values. Table 1 presents the training results of
the developed prediction models. Significant diferences in the coeficient of determination and
RMSE were observed between model18-22 and model18-23. This indicates that in recent years,
when weather patterns have become increasingly non-stationary due to extreme events, accurate
yield prediction models must be constructed by selecting long-term training datasets that
comprehensively capture exceptional weather conditions and the resulting growth abnormalities
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. However, the accumulation of extensive datasets during transitional periods remains
challenging. Therefore, this study proposed a method for continuously monitoring deviations
from the predictions generated by a normal model. This approach could enable the early
detection of anomaly signs and associated risks [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
(1)
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Analysis of exceptional yield fluctuations</title>
      <p>
        Outbreaks of pest infestations have traditionally been limited to greenhouse whiteflies [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
This pest is relatively cold-tolerant, easy to manage, and generally does not cause severe damage.
However, in recent years, a new pest species, the tobacco whitefly [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] has emerged; this species
is highly tolerant to high temperatures and the damage it causes has been rapidly increasing.
Tobacco whiteflies are dificult to control using conventional pest management strategies and
become particularly active in warmer environments, leading to significant efects on yield
during hot periods. Additionally, disease damage caused by pathogens, such as sooty mold
and powdery mildew [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ], has also been reported. The simultaneous occurrence of pests
and diseases imposes substantial stress on plant growth, leading to significant yield reduction.
Whitefly infestations in the facility reached their first peaks on May 11, 2023, and April 13, 2024
for 2022–2023 and 2023–2024 periods, respectively (Figure 8). Using a multiple regression model
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] combined with the efective accumulated temperature [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], we developed a prediction
model and determined that the period from egg to larval stage occurred around May 1, 2023,
for the former peak and around April 1, 2024, for the latter. They were used as the explanatory
variables: temperature, humidity, the number of pests in the previous week, Leaf Area Index,
and yield. The infestation in 2023 is primarily caused by the greenhouse whitefly, which is
efectively controlled by conventional pest management methods. In contrast, the 2024 outbreak
was mainly due to the tobacco whitefly, which resisted conventional control measures, leading
to continuous infestation spanning from the first peak to the second peak. Figure 9 shows a
graph of the diferences between the predicted value obtained using the normal model and actual
yield in 2022-2023. The blue lines in Figure 9 represent the range of diferences for the past five
years from 2018 to 2022 (maximum: +151.4 kg, minimum: -76.5 kg). In other words, results
that exceed or fall below these blue lines indicate irregular yield fluctuations compared with
typical years. Because the prediction model forecasts one week in advance, it enables proactive
responses. Moreover, combining this early warning with predictions of the underlying causes
of the exceptional yield fluctuations described earlier, rapid and efective countermeasures have
become possible.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Summary</title>
      <p>In this study, we developed a yield prediction model for facility-grown tomatoes and analyzed
yield fluctuations by examining the diferences between the predicted and actual yields. We
also explored the potential to predict the early signs of such variability. The analysis revealed
that irregular yield fluctuations were mainly caused by insuficient solar radiation, pests, and
diseases. Future work will focus on developing a model that can forecast the likelihood of
exceptional yield fluctuations and estimate the extent of yield reduction in advance, thereby
enabling efective countermeasures at the production site.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>We would like to express our deepest gratitude to Iwate Wakae Farm, Inc. for their generous
cooperation in this research. They willingly provided us with valuable in-greenhouse
environmental and yield data and kindly accommodated our numerous requests for meetings and data
collection. Their support was essential for the successful completion of this study.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Ministry of Agriculture, Forestry and Fisheries (MAFF), Eforts to address global warming in the agriculture, forestry and fisheries sector</article-title>
          , https://www.maf.go.jp/j/kanbo/kankyo/s eisaku/attach/pdf/index-87.pdf,
          <year>2024</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Japan</given-names>
            <surname>Meteorological</surname>
          </string-name>
          <article-title>Agency (JMA), Weather summary for summer 2024 (june-august)</article-title>
          , https://www.data.jma.go.jp/cpd/longfcst/seasonal/202408/202408s.html,
          <year>2024</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Nishimura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Okamoto</surname>
          </string-name>
          ,
          <article-title>Current status and challenges of climate change adaptation information platform</article-title>
          ,
          <source>Journal of Agricultural Meteorology</source>
          <volume>75</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Takahashi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Sakata</surname>
          </string-name>
          ,
          <article-title>Analysis of the relationship between meteorological factors and growth abnormalities in tomato cultivation</article-title>
          ,
          <source>Environment Control in Biology</source>
          <volume>29</volume>
          (
          <year>2020</year>
          )
          <fpage>105</fpage>
          -
          <lpage>112</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K.</given-names>
            <surname>Nakajima</surname>
          </string-name>
          ,
          <article-title>Occurrence and impact of the tobacco whitefly in protected horticulture</article-title>
          ,
          <source>Journal of the Japanese Society for Horticultural Science</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Hashimoto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Honda</surname>
          </string-name>
          ,
          <article-title>Pest and disease management and its efectiveness in tomato cultivation during high-temperature periods</article-title>
          ,
          <source>Agriculture and Environment</source>
          <volume>45</volume>
          (
          <year>2021</year>
          )
          <fpage>33</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Oibayashi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Minamino</surname>
          </string-name>
          ,
          <article-title>Improving agricultural management using a yield prediction model for greenhouse tomato cultivation</article-title>
          ,
          <source>in: Proceedings of the 85th National Convention of the Information Processing Society of Japan</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>523</fpage>
          -
          <lpage>524</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Endo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Warisawa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yamamoto</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Yamada,</surname>
          </string-name>
          <article-title>Study on prediction method (calculation model) of harvest date and yield in low-truss dense planting of tomato</article-title>
          ,
          <source>in: Proceedings of the 2017 Annual Meeting of the Japanese Society of Agricultural, Biological, and Environmental Engineering</source>
          , volume
          <volume>A03</volume>
          ,
          <year>2017</year>
          , pp.
          <fpage>14</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hisekida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nishina</surname>
          </string-name>
          ,
          <article-title>Study on improving productivity in large-scale tomato greenhouses: Harvest prediction of tomatoes based on cumulative solar radiation</article-title>
          ,
          <source>Japanese Society of Agricultural, Biological and Environmental Engineers and Scientists</source>
          <volume>19</volume>
          (
          <year>2007</year>
          )
          <fpage>11</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <article-title>Iwate wakae farm, inc</article-title>
          , https://www.facebook.com/wakaefarm/?locale=ja_JP,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L.</given-names>
            <surname>Breiman</surname>
          </string-name>
          , Random forests,
          <source>Machine learning 45</source>
          (
          <year>2001</year>
          )
          <fpage>5</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kamiyama</surname>
          </string-name>
          , K.-i. Minamino,
          <article-title>Analysis of exceptional yield variations caused by extreme weather in greenhouse tomato cultivation</article-title>
          ,
          <source>2025. Graduate School of Software and information Science</source>
          , Iwate Prefectural University.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>[13] National Research and Development Agency</source>
          ,
          <article-title>National Institute for Environmental Studies, Invasive species of japan</article-title>
          , https://www.nies.go.jp/biodiversity/invasive/DB/detail/60330e. html,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[14] National Research and Development Agency</source>
          ,
          <article-title>National Institute for Environmental Studies, Invasive species of japan</article-title>
          , https://www.nies.go.jp/biodiversity/invasive/DB/detail/60320e. html,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <article-title>Institute of Vegetable and Floriculture Science, National Agriculture</article-title>
          and Food Research organization, Sooty mold information, https://www.naro.go.jp/laboratory/nivfs/kakibyo/ plant_search/ha/botan/post_567.html,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <article-title>Institute of Vegetable and Floriculture Science, National Agriculture</article-title>
          and Food Research organization, Powdery mildew information, https://www.naro.go.jp/laboratory/nivfs/kaki byo/plant_search/ha/hanamizuki/post_132.html,
          <year>2025</year>
          . Accessed:
          <fpage>2025</fpage>
          -08-25.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C. R.</given-names>
            <surname>Rao</surname>
          </string-name>
          ,
          <article-title>Linear statistical inference and its applications</article-title>
          , 2nd ed., John Wiley &amp; Sons,
          <year>1973</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hirano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kamiyama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Minamino</surname>
          </string-name>
          ,
          <article-title>Development of a whitefly occurrence prediction model using machine learning in greenhouse tomato cultivation</article-title>
          ,
          <source>in: Proceedings of the 87th The National Convention of the Information Processing Society of Japan, 6ZL-03</source>
          ,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>