<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ProfIT AI</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Semi-Supervised European Forest Types Mapping using High-Fidelity Satellite Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Bohdan Yailymov</string-name>
          <email>yailymov@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hanna Yailymova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nataliia Kussul</string-name>
          <email>nataliia.kussul@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrii Shelestov</string-name>
          <email>andrii.shelestov@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"</institution>
          ,
          <addr-line>Beresteiskyi ave 37, 03056, Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Space Research Institute NAS of Ukraine and SSA of Ukraine</institution>
          ,
          <addr-line>Glushkov ave 40, 4/1, 03187, Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Maryland</institution>
          ,
          <addr-line>College Park, MD 20742</addr-line>
          ,
          <country country="US">US</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>4</volume>
      <fpage>25</fpage>
      <lpage>27</lpage>
      <abstract>
        <p>Accurate and up-to-date forest type maps are crucial for effective monitoring and management of forest ecosystems across Europe. However, the availability of up to date high-resolution forest type maps has been limited. This study introduces an innovative semi-supervised approach for mapping European forest types by harnessing the power of high-resolution Sentinel-1 and Sentinel-2 satellite data from the Copernicus program. The novelty of the approach lies in the integration of various data sources for training dataset creation and the utilization of the Random Forest classifier on the Google Earth Engine cloud computing platform. This innovative combination enables efficient processing and classification of vast amounts of satellite imagery for large-scale forest type mapping. In particular, the LUCAS Copernicus 2018 and 2022 datasets were employed for training and validation, ensuring the robustness of the classification model. The resulting forest type map for 2022 has a fine spatial resolution of 10 meters and distinguishes between three key classes: broadleaved, coniferous, and mixed forests. Accuracy assessment using independent validation data demonstrated the reliability of the proposed approach, yielding an impressive overall accuracy of 93%. Comparative analysis with existing forest products revealed both consistencies and differences, underscoring the dynamic nature of forest ecosystems. The generated map fills a gap in up to date geospatial information on European forest types, empowering informed decision-making in forest management, conservation efforts, and environmental impact assessment. This study demonstrates the potential of synergizing cutting-edge remote sensing, cloud computing, and machine learning technologies to tackle complex environmental challenges at a continental scale, paving the way for future advancements in forest monitoring and management.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Forest type classification</kwd>
        <kwd>Sentinel-1</kwd>
        <kwd>Sentinel-2</kwd>
        <kwd>Random Forest</kwd>
        <kwd>Google Earth Engine</kwd>
        <kwd>Europe1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        This study was carried out within the scope of the Horizon Europe project Satellites for Wilderness
Inspection and Forest Threat Tracking (SWIFTT - https://swiftt.eu/), the main tasks of which are
developing and improving models for forest health monitoring and damage detection (windthrow
damage, tree health, forest volumes and data for fire risk mapping and early fire detection). Having
an up-to-date geospatial map of forest types is critical to effectively monitor forest damage such as
fires [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], logging, windstorms, disease [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and other natural and anthropogenic forest events [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Geospatial information makes it possible to accurately determine the location of forest ecosystems
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and determine their types. This provides the possibility of prompt response to potential threats
to forests, which allows preserving biodiversity and ecological balance [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Geospatial mapping also
aids in resource management and conservation planning by providing the ability to track changes in
forest cover over time [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The use of modern technologies of geospatial analysis allows collecting,
processing and analyzing data in real time, which is key to timely response to events that may affect
the health and sustainability of forests. Therefore, within the SWIFTT project there is a pressing
need for the timely and cost-effective creation of contemporary forest maps, particularly those that
classify European forests by type.
      </p>
      <p>In the next section, existing products containing forest maps or forest type maps for European
countries were analyzed. Their main gap is that there is no single product for 2022 that would include
different types of forests (coniferous, deciduous, mixed) and would have a high spatial resolution (10
meters). The task of the SWIFTT project was to obtain a modern map of forest types for the entire
territory of Europe using time series of open Sentinel-1,2 satellite data with a maximum spatial
resolution of 10 meters.
1.1. Overview of Available Forest Cover Products</p>
      <p>
        At present, there exists a range of geospatial solutions featuring forest mapping layers. The
Copernicus program marks a significant stride forward by integrating satellite imagery and machine
learning algorithms to generate highly detailed and accurate forest classification maps. A noteworthy
example is the ESA WorldCover initiative, which has released comprehensive global land cover
products for 2020 [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and 2021 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], boasting a spatial resolution of 10 meters. Leveraging Sentinel-1
and Sentinel-2 data, these products are crafted and verified in near-real time. The classification
scheme encompasses 11 distinct classes, aligning with the Land Cover Classification System (LCCS)
devised by the Food and Agriculture Organization (FAO) of the United Nations (UN). Independent
validation conducted by Wageningen University demonstrates an overall accuracy of 74.4% for the
WorldCover product [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] across the globe, with continent-specific accuracy rates ranging from 68%
to 81%. While this map includes a forest class, it lacks a division into specific forest types, which is
important information for monitoring the dynamics of the state of forests and their damage. For our
task within the project, the division of the forest into different types of forests is important, because
there are different pests for coniferous and deciduous forests and these forests behave differently
throughout the year.
      </p>
      <p>
        The University of Maryland's Global Land Analysis and Discovery (GLAD) laboratory, in
collaboration with Global Forest Watch (GFW), offers annually updated global forest loss data,
utilizing Landsat time-series imagery with a resolution of 30 meters [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This dataset spans from
2000 to 2022 and is segmented into 10×10-degree tiles, each containing seven files. The data,
represented in unsigned 8-bit values, has a spatial resolution of 1 arc-second per pixel, or roughly 30
meters per pixel at the equator. It includes a tree cover map expressed as a percentage per grid cell
(ranging from 0 to 100), a forest gain map for 2000-2012, and a forest loss map for 2000-2022. The
disadvantage lies in its 30-meter spatial resolution, inability to differentiate between coniferous and
deciduous forests, and variations in forest mask determination across countries, leading to some
inaccuracies compared to other products.
      </p>
      <p>
        Additionally, Landsat satellite data has been employed to derive annual forest disturbances
among 35 European countries, covering the period 1986-2020. This includes maps of disturbance
severity (up to 2016) and a forest mask [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], utilizing the LandTrend time-series segmentation
approach in the Google Earth Engine environment [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        The CGLS-LC100 product, part of the Copernicus Global Land Service, offers a global land cover
map based on PROBA-V 100 m satellite data for 2015-2019, with various forest types represented
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. However, its 100-meter spatial resolution remains a limitation, though the product maintains
an accuracy of 80%.
      </p>
      <p>
        Copernicus Land Monitoring Service provides the Forest Type (2018) product [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] for the EEA39
countries, offering 10-meter rasters with a forest classification into three thematic classes
(nonforest, broadleaved forest, coniferous forest) with a minimum 90% accuracy for both forest classes,
albeit only for 2018. In [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] have created a global forest cover product for 2017 utilizing Sentinel-1
satellite data, including distribution between broadleaved and coniferous forests.
      </p>
      <p>
        There's been a notable increase in geospatial products formed by integrating existing datasets.
For instance, two earth-observation products [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] have been combined with statistical data to
produce a new pan-European forest map at a 1 km spatial resolution, aligning with official forest
inventory statistics at national and/or regional levels.
      </p>
      <p>Given the SWIFTT project aimed at creating a classification map based on Sentinel satellite data
with a 10-meter spatial resolution, products like WorldCover (which has the appropriate spatial
resolution and updated to the nearest 2022) and Forest Type 2018 (which has the appropriate spatial
resolution and has a distribution between coniferous and deciduous forests) serve as pertinent
references for both training dataset creation and validation of the resulting products, considering
their suitable spatial resolution and updated information up to 2022.</p>
      <p>Within the scope of the project, monitoring of all forests in Europe is planned. Therefore, this
paper analyzes various algorithms and cloud platforms for building a forest classification map.
Taking into account the amount of satellite data and the speed of the algorithm for large-scale
construction of the forest map, the Random Forest algorithm was chosen in free Google Earth Engine
cloud platform, since all satellite data are already ready for use in it. The obtained forest maps and
area was compared with existing products.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Study Area</title>
      <p>
        The forest type classification maps were created for European countries. The information about
countries boundary were the Geographic Information System of the Commission [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. A critical
aspect of this study area lies in its varied topography, including mountainous terrains that introduce
shadows and elevation variations. These geographical features can complicate the interpretation of
satellite imagery, influencing the accuracy of forest classification. Additionally, the presence of snow
cover during certain seasons further adds complexity to the satellite data analysis, impacting the
differentiation between land cover types. Also, taking into account the area of all countries included
in the study area - 6.5 MLN km2, the question of solving the problem of big data arises. To optimize
the work, the countries were divided into groups based on their geo-location, size of the countries,
and availability of the training data. The distribution of countries by groups is shown in the Table 1
and in the Figure 1.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Satellite Data Used</title>
      <p>
        For each group from the Table 1, 12-day mean composites of SAR Sentinel-1 satellite data with VV,
VH bands with 10-meters spatial resolution were created [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The SAR Sentinel-1 preprocessing
steps were: apply orbit, border noise removal, thermal noise removal, radiometric calibration,
orthorectification, filter box 3x3 were performed as preliminary processing. As a result, the 12-day
time series of Sentinel-1 was formed for every group of countries.
      </p>
      <p>Moreover, in the classification process, Sentinel-2 data with preprocessing Level-2A and a spatial
resolution of 10 meters are employed, utilizing four bands: Red (B4), Green (B3), Blue (B2), and
Infrared (B8). The revisit time of Sentinel-2 is every 5 days; however, due to significant cloud cover,
three composites are generated for each group of countries. These composites represent the median
value of all available data within every 5-day period for the respective bands. To mitigate the impact
of clouds on optical data, a Scene Classification Map (SCL) band with a spatial resolution of 20 meters
is utilized for cloud masking. The specific dates utilized for both Sentinel-1 and Sentinel-2 are detailed
in Table 2.</p>
      <p>Consequently, our input consists of a stack of raster images, comprising a time series of radar
and optical satellite data, encompassing a total of 76 spectral bands. This includes 32 VV bands, 32
VH bands from Sentinel-1, and three composites featuring four bands each (red, green, blue, and
infrared) from Sentinel-2.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Train and Validation Data</title>
      <p>
        For training and testing the creation of forest type maps for Europe, the LUCAS Copernicus 2018
open dataset [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] serves as the primary resource. Despite being based on 2018 data, this dataset
remains suitable for forest type classification due to the relatively slow change in forest types over
time. A five-year span is not considered extensive for a land cover type like forests. To update this
dataset for 2022, the global land cover data from WorldCover 2021 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] was utilized. Samples that
exhibited a change in class between 2018 and 2021 were subsequently excluded from consideration.
The great advantage of LUCAS Copernicus 2018 data set is that for each sample there were 5 photos
that confirm the correctness of the class that is entered for this sample.
      </p>
      <p>As we can see from Figure 2, some countries are not covered by the given data set (Norway,
Switzerland, Serbia, Montenegro, Albania, Bosnia and Herzegovina, North Macedonia, Turquie,
Iceland, and Ukraine). Therefore, additionally, based on the image interpretation of the Sentinel-2,
forest and another land cover samples were added for those areas where there was a lower coverage
with the initial Lucas Copernicus data set. Also, two classes — rocks and snow — were added to the
data set, because due to the mountainous terrain, there was a confusion of forest classes due to
shadows and slopes.</p>
      <p>The Figure 3 shows an example of the difference between broadleaved and coniferous forest in
summer and winter season, as well as an example of a classification map that separates them. This
confirms the need to use a time series of satellite data and necessarily composites of optical data.</p>
      <p>
        For Ukraine, we used a data set that we have been collecting along the roads to create a land cover
map [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. In 2022, while collecting data along the roads, we also separated different types of forests
(coniferous, broadleaved and mixed), which gave us the opportunity to build a map of forest types
for the territory of Ukraine. Note that no other data, such as LUCAS 2018, forest type map 2018,
which divide forests into different types and have a high spatial resolution, were not available for
Ukraine. Besides that, this dataset was extended with new samples using satellite image
interpretation approach. The total number of samples in the dataset for Ukraine is 934 forests and
7862 other land and is shown in the Table 3. The generated data set was divided into training and
test dataset in a ratio of 80:20 for each oblast of Ukraine.
      </p>
      <p>a) 10.06.23
b) 08.02.23
c)</p>
      <p>The Table 4 displays the distribution of data across European countries by class, while Figure 4
illustrates the geospatial distribution of the resulting dataset for three types of forests (broadleaved,
coniferous, and mixed) within selected groups, as Table 1. To train the model and validate the
resulting product, the dataset was divided into an 80:20 ratio within each distinct group.</p>
      <p>
        Around the beginning of October 2023, after the SWIFTT project was started and forest type
classification map were created, the LUCAS preliminary micro data (2022) [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] was published as
open data source. This dataset is similar to LUCAS Copernicus 2018 but contains more samples. In
this regard, we have used this dataset for validation, and for the correctness of the experiment, we
removed from this data set those elements that were used for training and validation of the obtained
forest map and used the obtained data set as a completely independent data set for testing the
obtained product. The last column of Table 4 shows the distribution of elements by land cover classes
and forest types of LUCAS 2022.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Methodology</title>
      <p>Within the project, we have developed an automatic technology for forest type classification map
for Europe in the Google Earth Engine cloud platform, which is repeated for each individual group
of countries listed in the Table 1.</p>
      <p>
        The prepared stack of satellite Sentinel-1 and Sentinel-2 composites (section 3) together with
prepared pre-filtered LUCAS Copernicus 2018 train data (section 4) were used as input data for forest
type classification [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. All data (satellite and train data set) is contained in cloud and we don’t
need extra resource to train the classifier. For each separate group of countries, we trained different
Random Forest models due to GEE capacity with 100 number of trees.
      </p>
      <p>The model generates a raster georeferenced image as output, with each pixel representing the
respective land cover or forest type class. Notably, this approach offers an advantage as it enables
the creation of maps even in regions with limited training data, such as Turkey and Iceland. This is
achieved through the utilization of a common Random Forest model for each group of countries.
Leveraging the cloud platform Google Earth Engine facilitates seamless scalability and utilization of
the model across extensive areas, particularly throughout Europe, as demonstrated by previous
studies [23], [24].</p>
      <p>The next steps no require large computing resources and can be done on a personal computer
(PC).</p>
      <p>
        The list of classes on the received map corresponds to the list of classes in the educational data
(Table 4). Aggregation of the received forest types into three generalizing classes (Broadleaved,
Coniferous, and Mixed) took place according the [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] (Table 5). All other classes not related to forest
cover were set to 0.
      </p>
      <p>A portion of the LUCAS 2018 validation dataset (outlined in section 4), along with the LUCAS
2022 data, served as independent data for validating the aggregated forest type map. To evaluate the
accuracy of the land cover classification maps, a confusion matrix [25] derived from an independent
test sample was utilized. This matrix is presented as a rectangular table, where each cell represents
the number of pixels nij belonging to the actual class "i" but classified as class "j" on the classification
map. Additionally, the assessment included metrics such as Overall Accuracy (OA), Producer
Accuracy (PA), and User Accuracy (UA). UA and PA values are ways of representing the accuracies
of the individual classes. The UA value is the probability that the pixel class on the classification map
corresponds to the sample class in the test data, whereas PA indicates the probability that a pixel
from the test data is recognized correctly on the map. By analyzing the correlation between these
quantities, the map user can obtain information on reliability of class recognition on the map, as well
as assess the quality of the geospatial product itself.</p>
      <p>Overall Accuracy (OA) is an indicator of the overall quality of land cover map. In fact, this is the
ratio of the sum of the elements of the main diagonal (i.e., correctly classified pixels) and the sum of
all elements in the error matrix.</p>
      <p>The User, Producer, and Overall accuracies are calculated according to the formulas 1-3:
!!
where q is the number of classes on the land cover map and test data.</p>
      <p>One more metric for the assessment of the classifier quality is F1-score. It reduces the two other
metrics, UA and PA, down to one number, and it is defined as an average-weighted harmonic mean
value between them (formula 4).</p>
      <p>=
! = # ;</p>
      <p>∑"$% "!
" = #"" ;
∑!$% "!</p>
      <p>#
∑"$% ""
# #
∑"$% ∑!$% "!</p>
      <p>,
% = 2</p>
      <p>PA ∗ UA
PA + UA
(1)
(2)
(3)
(4)</p>
      <p>The above-described methodology for obtaining a classification map of forest types for Europe
is schematically presented in the Figure 5.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Machine Learning Model Selection</title>
      <p>
        The Random Forest classifier was chosen as a basis in this work, because it showed relatively good
results compared to deep neural networks, it is built into the GEE cloud platform and can be used in
contrast to deep networks, and it is also significantly superior in algorithm execution time. In the
study [26], researchers introduced and investigated two modifications of Random Forest (RF) models
and two modifications of Convolutional Neural Network (CNN) U-Net for forest type classification.
The raster Forest Type (2018) map [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] served as training labels. During the development of these
models, it was observed that RF might not be the optimal tool for addressing this problem, primarily
due to its architectural constraints, which hinder training the model on large datasets. However, for
point-based datasets, this method remains reasonable due to its relatively low learning and execution
time. The Random Forest model exhibited an overall accuracy ranging from 86% to 86.5% for the pilot
territories in Germany. Conversely, within this study, the U-Net model demonstrated significantly
improved overall accuracy, ranging from 91% to 91.7%, along with higher prediction speed.
Nonetheless, as with deep learning methods, the learning process of the U-Net model extends to tens
of hours, unlike the minutes required for RF. In this work, all experiments were conducted on the
CREODIAS cloud platform (within ORCE project [27]), which contains ready-to-use satellite data,
but still requires time and resources for their partial pre-processing (cloud extraction, filtering, etc.),
unlike the GEE cloud platform.
      </p>
      <p>In the work [28] authors compare the productivity of cloud platforms AWS and CREODIAS for
the land cover classification for Ukraine using Multilayer perceptron. On average, it takes 400
minutes and 50-80 euros to build a classification map based on a time series of satellite data for 1
year, covering an area of about 200,000 km2. Taking into account the territory area of 6.5 MLN km2,
for which we need to obtain a forest type classification map, the necessary time to obtain the map
will be about 9 days of continuous work of the instance and 1700 - 2600 euros for one iteration of
obtaining the classification map. Given the complexity of the mountainous terrain in some countries,
sometimes there is a need to adjust the sampling in difficult areas and rerun the training model
several times.</p>
      <p>Therefore, taking into account the area for which we need to obtain a classification map, in this
work the authors decided to use the Random Forest algorithm in the GEE cloud platform.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Results</title>
      <p>The main outcome of this study is the forest type classification map for the year 2022 covering the
European territory, featuring a spatial resolution of 10 meters. This map was generated utilizing a
time series of Synthetic Aperture Radar (SAR) Sentinel-1 and optical Sentinel-2 satellite data. The
resulting map includes 3 forest type classes (broadleaved, coniferous and mixed) and presented in
the Figure 6. The Table 6 shows the accuracy assessment of the obtained map on the test data, which
was formed on the basis of the Lucas 2018 data set.</p>
      <p>The class of mixed forests turned out to be the most problematic, since it contains various types
of trees that can be contained in other classes. Accordingly, the confusion for this class is the highest,
and the recognition accuracy is the lowest. Accordingly, since it may contain trees of other classes,
it also leads to an underestimation of the accuracy of other classes of forest types.</p>
      <p>In contrast to the Lucas 2018 dataset, the Lucas 2022 dataset contains sampling elements on a
grid. Very often there are cases when the point, which is responsible for the forest class, falls into a
forest strip or in a city in a cluster of a few of trees. Or a point from the Lucas 2022 data set is very
close to the cluster of trees, but does not physically fall into the forest mask, as well as the sample
could be incorrect. In such cases, given the spatial resolution of the satellite data we work with, the
created classification map may not correspond to the class given in the Lucas 2022 set. These
examples certainly affect the classification accuracy estimation, but in this case we have an accuracy
estimate from the bottom of our product, which is 88.5% on independent data, which is a good
indicator at the level of Europe as a whole.</p>
      <p>Also, we compared the resulting areas of forest types and total forest in 2022 with the
corresponding existing global products. With Forest type 2018, the areas of broadleaved and
coniferous forest are compared (Figure 7). There is a decrease in the area of coniferous forests
compared to 2018. This may be due to the fact that coniferous forests suffer more from bark beetles
and various diseases. Trees dry out, and as a result, they are cut down.</p>
      <p>If we compare the total area of the forest (Figure 8, Figure 9), there is a difference between the
Hansen 2022 data set (Hansen underestimates the forest compared to our product). For Hansen's data
set, a threshold of 30% of pixels belonging to the forest was chosen, below which we consider it
inappropriate to take. The reason for the lower forest area in Hansen's data set is that every year
deforestation is subtracted from the forest area, and the last layer of forest growth (gain layer) was
only in 2012.</p>
    </sec>
    <sec id="sec-8">
      <title>8. Product Availability</title>
      <p>The source codes are available for downloading at the link:
https://github.com/IPTMMDA/Forest_type_classification. The code is available in Google Earth Engine cloud platform: for
forest type classification - https://code.earthengine.google.com/0eb34e8a84988ad3dcea334d4ef59805,
for resulting forest type map using
https://code.earthengine.google.com/baa95cfdd09e0630e21f6aa4ffe87eaa, forest type visualization
within Google App: https://ee-swiftt.projects.earthengine.app/view/foresttype.</p>
    </sec>
    <sec id="sec-9">
      <title>9. Discussion</title>
      <p>The results obtained in this study demonstrate the effectiveness of employing satellite data and
machine learning techniques for large-scale forest type mapping across Europe. The application of
the Random Forest classifier in the Google Earth Engine cloud platform has proven to be a reliable
and efficient approach, addressing the challenges associated with processing and analyzing of big
satellite data.</p>
      <p>One of the key strengths of the proposed methodology lies in its ability to overcome the
limitations posed by the lack of ground truth data in certain regions. By leveraging the LUCAS
Copernicus dataset and satellite image interpretation, we were able to create a comprehensive
training dataset, ensuring adequate representation of different forest types and land cover classes
across the entire study area.</p>
      <p>The resulting forest type map exhibits a high overall accuracy of 93% when evaluated against an
independent test dataset. However, it is essential to acknowledge the challenges encountered in
accurately distinguishing mixed forests from other classes. The inherent complexity of mixed forests,
comprising various tree species, likely contributed to the lower accuracy observed for this particular
class.</p>
      <p>It is noteworthy that the accuracy assessment based on the LUCAS 2022 dataset revealed certain
inconsistencies, potentially attributable to the grid-based sampling approach employed in this
dataset. The mismatch between the spatial resolution of the satellite data and the sampling points
may have resulted in instances where the assigned class did not align with the actual land cover
observed in the satellite imagery. Nevertheless, the overall accuracy of 88.5% obtained using the
LUCAS 2022 dataset provides a conservative estimate of the product's performance, further
validating its reliability.</p>
      <p>The comparison of the obtained forest areas with existing global products, such as the Forest
Type 2018 and WorldCover 2021, highlights the dynamic nature of forest ecosystems and the
importance of regularly updating forest maps. The observed decrease in coniferous forest areas
compared to 2018 may be attributed to factors such as pest infestations, diseases, and subsequent
deforestation efforts. Conversely, the differences in total forest area compared to the Hansen dataset
can be explained by the consideration of deforestation and the limited availability of forest gain data
in the Hansen product after 2012.
10. Conclusions</p>
      <p>This study successfully developed and implemented a semi-supervised approach for mapping
European forest types using high-resolution satellite data from the Copernicus program. The
integration of Sentinel-1 and Sentinel-2 data, combined with the Random Forest classifier deployed
on the Google Earth Engine cloud platform, enabled efficient and accurate classification of forest
types across the European continent.</p>
      <p>The resulting forest type map, with a spatial resolution of 10 meters, provides a comprehensive
and up-to-date representation of the distribution of broadleaved, coniferous, and mixed forests for
the year 2022. This product addresses a critical need for contemporary geospatial information,
facilitating informed decision-making in various domains, including forest management,
conservation efforts, and the assessment of environmental impacts.</p>
      <p>The accuracy assessment, conducted using independent validation data from the LUCAS
Copernicus dataset, demonstrated an overall accuracy of 93%, highlighting the reliability of the
proposed methodology. While the mixed forest class exhibited lower accuracy due to its inherent
complexity, the overall performance of the classification model was satisfactory.</p>
      <p>By leveraging cloud computing resources and machine learning techniques, this study overcame
the challenges associated with processing and analyzing large volumes of satellite data, enabling the
efficient generation of a high-resolution forest type map for the entire European region.</p>
      <p>Overall, this study demonstrates the potential of integrating cutting-edge technologies and data
sources for addressing complex environmental challenges at a continental scale. The developed
methodology can be further refined and applied to other regions, contributing to a better
understanding of global forest dynamics and supporting sustainable forest management practices.</p>
      <p>Future research could explore the integration of additional data sources, such as LiDAR or
hyperspectral imagery, to enhance the accuracy of forest type classification, particularly for mixed
forests. Additionally, the development of automated monitoring systems, leveraging the capabilities
of cloud computing and machine learning, could facilitate the timely detection of forest disturbances
and support proactive conservation efforts.</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgements</title>
      <p>The study was supported by the HORIZON Europe project SWIFTT No. 101082732, “Satellites for
Wilderness Inspection and Forest Threat Tracking.”
Telecommunications, 2021, pp. 101-118. Cham: Springer International Publishing. DOI:
10.1007/978-3-031-16368-5_5.
[23] N. Kussul, et al, Land cover changes analysis based on deep machine learning technique. Journal
of Automation and Information Sciences, 2016, 48(5).
[24] A. Shelestov, et al, Large scale crop classification using Google earth engine platform. In 2017
IEEE international geoscience and remote sensing symposium (IGARSS), 2017, pp. 3696-3699.</p>
      <p>DOI: 10.1109/IGARSS.2017.8127801.
[25] R. Congalton, A review of assessing the accuracy of classifications of remotely sensed data.</p>
      <p>Remote sensing of environment, 1991, 37(1), 35-46.
[26] Y. Salii, et al, Machine learning models and technology for classification of forest on satellite
data. In IEEE EUROCON 2023-20th International Conference on Smart Technologies, 2023, pp.
93-98.
[27] A. Shelestov, E. Siemens, OCRE project: consortium, main goals, approaches and opportunities.</p>
      <p>In 2022 12th International Conference on Dependable Systems, Services and Technologies
(DESSERT), 2022.
[28] N. Kussul, et al, Cloud Platforms and Technologies for Big Satellite Data Processing, in
International scientific and practical conference Information and communication technologies
and sustainable development, 2023, pp. 303-321. Cham: Springer Nature Switzerland.
[29] R. Andrimont, et al, LUCAS Copernicus 2018. Earth Observation relevant in-situ data on land
cover throughout the European Union. Earth System Science Data Discussions, 2020, 1-19,
https://doi.org/10.5194/essd-13-1119-2021.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Yailymov</surname>
          </string-name>
          , et al.,
          <article-title>Google Earth Engine framework for satellite data-driven wildfire monitoring in Ukraine</article-title>
          . Fire,
          <year>2023</year>
          ,
          <volume>6</volume>
          (
          <issue>11</issue>
          ),
          <fpage>411</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Meentemeyer</surname>
          </string-name>
          ,
          <article-title>Remote sensing of forest damage by diseases and insects</article-title>
          .
          <source>Remote sensing for sustainability</source>
          ,
          <year>2016</year>
          ,
          <fpage>145</fpage>
          -
          <lpage>162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Matsala</surname>
          </string-name>
          , et al,
          <article-title>War drives forest fire risks and highlights the need for more ecologicallysound forest management in post-war Ukraine</article-title>
          .
          <source>Scientific reports</source>
          ,
          <year>2024</year>
          ,
          <volume>14</volume>
          (
          <issue>1</issue>
          ),
          <fpage>4131</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Verkerk</surname>
          </string-name>
          , et al,
          <article-title>Spatial distribution of the potential forest biomass availability in Europe</article-title>
          .
          <source>Forest Ecosystems</source>
          ,
          <year>2019</year>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Senf</surname>
          </string-name>
          ,
          <string-name>
            <surname>R. Seidl R.</surname>
          </string-name>
          ,
          <source>Mapping the forest disturbance regimes of Europe. Nature Sustainability</source>
          ,
          <year>2021</year>
          ,
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <fpage>63</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmero-Iniesta</surname>
          </string-name>
          , et al,
          <article-title>Changes in forest landscape patterns resulting from recent afforestation in Europe (1990-2012): Defragmentation of pre-existing forest versus new patch proliferation</article-title>
          .
          <source>Annals of Forest Science</source>
          ,
          <year>2020</year>
          ,
          <volume>77</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zanaga</surname>
          </string-name>
          ,et al,
          <source>ESA WorldCover 10</source>
          m
          <year>2020</year>
          v100. URL: https://doi.org/10.5281/zenodo.5571936.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zanaga</surname>
          </string-name>
          , et al,
          <source>ESA WorldCover 10</source>
          m
          <year>2021</year>
          v200. URL: https://doi.org/10.5281/zenodo.7254221.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kerchove</surname>
          </string-name>
          ,
          <article-title>Release of the 10 m worldcover map</article-title>
          . URL: https://blog.vito.be/remotesensing/release-of-the-10
          <string-name>
            <surname>-</surname>
          </string-name>
          m
          <article-title>-worldcover-map.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hansen</surname>
          </string-name>
          , et al,
          <source>High-Resolution Global Maps of 21st-Century Forest Cover Change. Science 342</source>
          ,
          <year>2013</year>
          ,
          <fpage>850</fpage>
          -
          <lpage>53</lpage>
          . URL: https://glad.earthengine.app/view/global-forest-change.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C.</given-names>
            <surname>Senf</surname>
          </string-name>
          ,
          <source>European forest disturbance maps (1.0.0)</source>
          ,
          <year>2020</year>
          . [Data set].
          <source>Zenodo</source>
          . https://doi.org/10.5281/zenodo.3924381.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Buchhorn</surname>
          </string-name>
          , et al.,
          <source>Copernicus Global Land Cover Layers-Collection 2. Remote Sensing</source>
          ,
          <year>2020</year>
          ,
          <volume>12</volume>
          ,
          <issue>108</issue>
          , 1044. doi:
          <volume>10</volume>
          .3390/rs12061044.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Forest</surname>
            <given-names>Type 2018</given-names>
          </string-name>
          <source>(raster 10 m)</source>
          ,
          <year>2020</year>
          . URL: https://land.copernicus.eu/pan-european/highresolution-layers/forests/forest-type-1/status-maps/forest-type-
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dostálová</surname>
          </string-name>
          , et al,
          <article-title>European wide forest classification based on sentinel-1 data</article-title>
          , Remote Sensing,
          <year>2021</year>
          ,
          <volume>13</volume>
          (
          <issue>3</issue>
          ),
          <fpage>337</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.</given-names>
            <surname>Kempeneers</surname>
          </string-name>
          , et al,
          <article-title>Data fusion of different spatial resolution remote sensing images applied to forest-type mapping</article-title>
          .
          <source>IEEE Transactions on Geoscience and Remote Sensing</source>
          ,
          <year>2011</year>
          ,
          <volume>49</volume>
          (
          <issue>12</issue>
          ),
          <fpage>4977</fpage>
          -
          <lpage>4986</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>R.</given-names>
            <surname>Päivinen</surname>
          </string-name>
          , et al,
          <article-title>Combining earth observation data and forest statistics</article-title>
          ,
          <source>Joensuu: European Forest Institute</source>
          .
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Countries</surname>
          </string-name>
          ,
          <year>2020</year>
          . URL: https://ec.europa.eu/eurostat/web/gisco/geodata/referencedata/administrative-units
          <article-title>-statistical-units/countries.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>N.</given-names>
            <surname>Kussul</surname>
          </string-name>
          , et al,
          <article-title>Data assimilation technique for flood monitoring and prediction</article-title>
          .
          <source>International Journal Information Theories &amp; Applications</source>
          .
          <year>2008</year>
          ,
          <volume>15</volume>
          ,
          <fpage>76</fpage>
          -
          <lpage>83</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>F.</given-names>
            <surname>Waldner</surname>
          </string-name>
          , et al.
          <article-title>Roadside collection of training data for cropland mapping is viable when environmental and management gradients are surveyed</article-title>
          .
          <source>International Journal of Applied Earth Observation and Geoinformation</source>
          <volume>80</volume>
          (
          <year>2019</year>
          ):
          <fpage>82</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <article-title>LUCAS preliminary micro data 2022</article-title>
          . URL: https://ec.europa.eu/eurostat/web/lucas/database/
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>A.</given-names>
            <surname>Shelestov</surname>
          </string-name>
          , et al,
          <article-title>Cloud approach to automated crop classification using Sentinel-1 imagery</article-title>
          .
          <source>IEEE Transactions on Big Data</source>
          ,
          <year>2019</year>
          ,
          <volume>6</volume>
          (
          <issue>3</issue>
          ),
          <fpage>572</fpage>
          -
          <lpage>582</lpage>
          . DOI:
          <volume>10</volume>
          .1109/TBDATA.
          <year>2019</year>
          .
          <volume>2940237</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A.</given-names>
            <surname>Shelestov</surname>
          </string-name>
          , et al,
          <article-title>Cloud-Based Technologies for Data Processing in Ukraine: International Context</article-title>
          . In International Scientific and Technical Conference-Modern Challenges in
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>