<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Digital technologies for forest monitoring in the Baikal natural territory</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Igor</forename><forename type="middle">V</forename><surname>Bychkov</surname></persName>
							<email>bychkov@icc.ru</email>
							<affiliation key="aff0">
								<orgName type="department">Matrosov Institute for System Dynamics and Control Theory</orgName>
								<orgName type="institution">Siberian Branch of Russian Academy of Sciences</orgName>
								<address>
									<addrLine>Lermontov st. 134</addrLine>
									<settlement>Irkutsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gennady</forename><forename type="middle">M</forename><surname>Ruzhnikov</surname></persName>
							<email>rugnikov@icc.ru</email>
							<affiliation key="aff0">
								<orgName type="department">Matrosov Institute for System Dynamics and Control Theory</orgName>
								<orgName type="institution">Siberian Branch of Russian Academy of Sciences</orgName>
								<address>
									<addrLine>Lermontov st. 134</addrLine>
									<settlement>Irkutsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Roman</forename><forename type="middle">K</forename><surname>Fedorov</surname></persName>
							<email>fedorov@icc.ru</email>
							<affiliation key="aff0">
								<orgName type="department">Matrosov Institute for System Dynamics and Control Theory</orgName>
								<orgName type="institution">Siberian Branch of Russian Academy of Sciences</orgName>
								<address>
									<addrLine>Lermontov st. 134</addrLine>
									<settlement>Irkutsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Anastasia</forename><forename type="middle">K</forename><surname>Popova</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Matrosov Institute for System Dynamics and Control Theory</orgName>
								<orgName type="institution">Siberian Branch of Russian Academy of Sciences</orgName>
								<address>
									<addrLine>Lermontov st. 134</addrLine>
									<settlement>Irkutsk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Digital technologies for forest monitoring in the Baikal natural territory</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">D5AE2F805104BEB5660BE0764522CF06</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T02:14+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Machine learning</term>
					<term>remote sensing</term>
					<term>forest monitoring</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The paper considers the problem of forest resources monitoring over large areas on the example of the Baikal natural territory. As the main data source, we use Sentinel-2 remote sensing data due to their regularity, broad coverage, multispectral parameters of the resulting image. The Random forest and Support Vector Machines (SVM) machine learning algorithms were used to classify land cover from the Sentinel-2 products. Both methods have shown good results with a fairly high accuracy. The training was carried out with data labeled manually into 12 classes.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Forest monitoring is an assessment and forecasting system for the forest fund state in space and time for the rational use, protection and reproduction of forests, increasing their ecological functions. Monitoring supports tracking the forest resources dynamics caused by forest management, natural and anthropogenic impacts, compiling predictive and analytical models for their protection and use, sustainable development of forest economics. The effectiveness of forest monitoring directly depends on the completeness and accuracy of observations data of various environment elements.</p><p>The Baikal Natural Territory (BNT) covers Lake Baikal, the water protection zone around it, specially protected natural areas and adjacent areas 200 km to the west and northwest from the lake. The area of the BNT is 386 thousand km², there are 31 specially protected natural areas, including 3 reserves, 2 national parks, 6 recreational areas and more than 128 natural monuments.</p><p>Among the most important natural resources of the BNT are forest resources, which ensure the sustainability of environment, performing water and soil protection, water regulation functions. The area of BNT lands covered with forest vegetation is about 8350.73 thousand hectares and 92% of these lands are covered by forests, represented by two groups of forest-forming species: coniferous and deciduous trees. BNT forests are negatively affected by fires, forest diseases, insect pests, unfavorable weather conditions, which can lead to the loss of forest biological stability.</p><p>Forest monitoring of the BNT has poor efficiency and limited access to in-situ data, which complicates the support of decision-making and the conduct of interdisciplinary research. Official forest inventory information is not always up-to-date, there is no unified system for storing and processing forest monitoring data at the regional level <ref type="bibr" target="#b0">[1]</ref>.</p><p>This determines the relevance of forest monitoring in a digital format, which is essential for sustainable forest management in the BNT and compliance with the requirements of continuous, rational use of forests, their reproduction, and conservation of resource, recreational, ecological potential and biological diversity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Organization of digital monitoring</head><p>The advantages of the digital forest monitoring system of the BNT are a large number of participants and their information resources; the participants' emphasis on their strengths and the transfer of non-core activities and services to outsourcing to others; efficiency of updating open digital information resources, attraction of scientific knowledge. Planning and implementation of several services provided by different participants, including third-party ones, reducing the cost of obtaining services helps to increase the complexity and validity of management decisions.</p><p>Digital forest monitoring of the BNT is based on an information and analytical environment that provides collection, transmission, search, storage of spatio-temporal data from forest monitoring, the ability to assess, model and forecast the state of forest resources of the BNT <ref type="bibr" target="#b1">[2]</ref><ref type="bibr" target="#b2">[3]</ref>. Such an environment should contain spatial and thematic data of forest monitoring, including remote sensing data, unified reference books and classifiers. A catalog of services is intended for processing monitoring data: providing data, assessing forest dynamics, machine learning, publishing results in the form of maps and diagrams. The scheme of such digital forest monitoring system is shown in the figure <ref type="figure">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 1: Scheme of a digital forest monitoring system</head><p>Remote sensing is an important source of data for digital forest monitoring <ref type="bibr" target="#b3">[4]</ref><ref type="bibr" target="#b4">[5]</ref>. Traditional forest in-situ data are not always able to provide up-to-date data for a large area, such as BNT, they are often based on a sample of small areas, or contain aggregated information without an accurate spatial description. Remote sensing data creates opportunities to obtain forest data in a more efficient way, provides information about their spatial species distribution with wide temporal coverage and higher refresh rates.</p><p>Medium resolution satellite imagery such as Landsat and Sentinel-2 allow map large areas in economical manner. The resolution of 10 m in the main Sentinel-2 bands allows to detect a number of forest parameters quite accurately, making them more preferable than Landsat images with a 30 m resolution. Sentinel-2 satellites are equipped with MultiSpectral Instrument with 13 spectral bands, covering channels from blue to short wave infrared (SWIR) with a resolution of 10 to 60 m. Provides global coverage on average every 5 days.</p><p>Intelligent analysis of remote sensing data provides an opportunity to identify changes in the forest fund as a result of anthropogenic impacts and environmental disturbances, fires, damage to forests by pests, diseases, windblows, etc. At the initial stage, it is necessary to classify satellite images for the study area by compiling land cover maps. In this study, we use machine learning methods for automated land cover classification.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results and discussion</head><p>Machine learning algorithms are used to process large datasets consisting of multi-temporal images with spectral metrics <ref type="bibr" target="#b5">[6]</ref><ref type="bibr" target="#b6">[7]</ref>. These methods are effective for classifying complex multidimensional data, providing nonlinear and nonparametric classifications. The most popular machine learning algorithms used in remote sensing research are Random Forest (RF) and Support Vector Machines (SVM).</p><p>RF is a nonparametric ensemble machine learning algorithm based on decision trees. It can process various data such as satellite images and numerical data. Each decision tree produces a classification result for samples not selected as training samples. The decision tree chooses some class, and the final class is determined by the highest number of votes.</p><p>SVM is a machine learning method developed based on the theory of statistical learning and the principle of minimizing structural risks. Compared with traditional teaching methods, it has high accuracy, fast computation speed, and strong generalizability, which is widely used in image mapping and land classification.</p><p>Study area in this research covers south of the lake Baikal. Little cloudy Sentinel-2A MSI granules used in the study were freely acquired on 25 June 2017 and downloaded from the Copernicus Scientific Data Hub as a Level-1C product. Figure <ref type="figure">2</ref> present Sentinel-2 image RGB composite for the study area.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 2: Sentinel-2 image for study area</head><p>To monitor the forest resources of the BNT, a part of the study area was previously labeled manually with 12 classes: felling, shrubs, coniferous forest, woodland, deciduous forest, mixed forest, rocks, pastures, arable land, residential area, clouds, water. For this step, polygon-shaped samples were generated based on visual interpretation of the high resolution satellite images and expert knowledge.</p><p>All 13 bands of the Sentinel-2 image were used for training. The labeled sample was randomly divided into training and test parts in a proportion 70/30. We used implementations of machine learning algorithms from the "scikit-learn" Python library. The Random forest method uses 200 decision trees, SVM parameters are: kernel = "linear", C = 1.0. The estimation of the accuracy of the algorithms is given in the table 1. We used macro average values for precision, recall, and f1-score parameters. The Random forest made the most mistakes in the Lightwood and Deciduous Forest classes. SVM misclassifies Pasture, Woodland, and Deciduous Forest classes. Errors occurred due to the similarity of spectral characteristics in the classes associated with vegetation. In the future, it is necessary to expand the training dataset, filling it with a large number of samples of different forest species.</p><p>The classification results are presented in Figures <ref type="figure">3 and 4</ref>. It is visible that the Random forest method misclassifies "Living area" class (highlighted in gray). The SVM method also misclassifies the "Logging" class (lower left corner of the map, highlighted in red). At the same time, the value of the accuracy of both algorithms in these classes is quite high (96-97%), which can be explained by the insufficient size of the test sample, on which the accuracy was assessed. In general, Random forest showed the best result in the classification of remote sensing data for BNT. To improve the calculations of the SVM algorithm, more complex non-linear kernel in the method parameters can be used.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>Effective management of forest resources is impossible without full and timely information about their condition. Remote sensing images are well suited for regular monitoring of large areas due to their high repeatability, wide coverage, and easy accessibility. 13 spectral bands of Sentinel-2 satellites images are quite enough to distinguish various tree species.</p><p>The work compares the results of two machine learning algorithms -Random forest and SVM -to classify the land cover. The test array with 12 classes on the BNT area training samples were generated and labeled manually. The learning showed high results: 98.92% OAA for Random forest and 93.79% for SVM. The main calculation errors are associated with an insufficient number of test samples, which does not allow the methods to separate accurately from each other classes with similar spectral characteristics. We plan to expand sample dataset to improve the classification results.</p><p>The resulting classification of land cover can be used for BNT forest monitoring. Fast tracking of logging, burnt-out areas, and reforestation will allow to assess the forest resources dynamics and to make management decisions.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 3 : 4 :</head><label>34</label><figDesc>Figure 3: Result of the RF Figure 4: Result of the SVM</figDesc><graphic coords="4,69.40,325.75,222.25,223.45" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="2,142.25,292.82,309.95,217.85" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="3,150.38,330.77,294.24,294.60" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Accuracy assessment for the classification algorithms</figDesc><table><row><cell>Algorithm</cell><cell>Precision</cell><cell>Recall</cell><cell>F1-score</cell></row><row><cell>RF</cell><cell>98.92%</cell><cell>0.95</cell><cell>0.95</cell></row><row><cell>SVM</cell><cell>93.79%</cell><cell>0.89</cell><cell>0.87</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Acknowledgements</head><p>The results were obtained within the framework of the State Assignment of the Ministry of Education and Science of the Russian Federation for the project "Methods and technologies of cloudbased service-oriented platform for collecting, storing and processing large volumes of multi-format interdisciplinary data and knowledge based upon the use of artificial intelligence, model-guided approach and machine learning" (state registration number 121030500071-2). Results are achieved using the Centre of collective usage «Integrated information network of Irkutsk scientific educational complex».</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Forest Resources of the Baikal Region: Vegetation Dynamics Under Anthropogenic Use, Information Technologies in the Research of Biodiversity</title>
		<author>
			<persName><forename type="first">Anastasia</forename><forename type="middle">K</forename><surname>Popova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Evgeny</forename><forename type="middle">A</forename><surname>Cherkasin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Igor</forename><forename type="middle">N</forename><surname>Vladimirov</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-11720-7_14</idno>
	</analytic>
	<monogr>
		<title level="m">Springer Proceedings in Earth and Environmental Sciences</title>
				<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="96" to="106" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Digital platform for forest resources monitoring in the BAIKAL natural territory</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">V</forename><surname>Bychkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Ruzhnikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">K</forename><surname>Fedorov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Popova</surname></persName>
		</author>
		<idno type="DOI">10.1088/1742-6596/1864/1/012111</idno>
	</analytic>
	<monogr>
		<title level="m">13th Multiconference on Control Problems (MCCP 2020) 6-8</title>
				<meeting><address><addrLine>Saint Petersburg, Russia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020-10">October 2020. 2020</date>
			<biblScope unit="volume">1864</biblScope>
			<biblScope unit="page">12111</biblScope>
		</imprint>
	</monogr>
	<note>J. Phys.: Conf. Ser.</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Organization of digital monitoring of the Baikal natural territory</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">V</forename><surname>Bychkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M</forename><surname>Ruzhnikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">K</forename><surname>Fedorov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E</forename><surname>Khmelnov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Popova</surname></persName>
		</author>
		<idno type="DOI">10.1088/1755-1315/629/1/012067</idno>
	</analytic>
	<monogr>
		<title level="m">Environmental transformation and sustainable development in Asian region 8-10</title>
				<meeting><address><addrLine>Irkutsk,</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020-09">September 2020. 2021</date>
			<biblScope unit="volume">629</biblScope>
			<biblScope unit="page">12067</biblScope>
		</imprint>
	</monogr>
	<note>IOP Conf. Ser.: Earth Environ. Sci.</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Sentinel-2 data in an evaluation of the impact of the disturbances on forest vegetation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lastovicka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Svec</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Paluba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Kobliuk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Svoboda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hladky</surname></persName>
		</author>
		<idno type="DOI">10.3390/rs12121914</idno>
	</analytic>
	<monogr>
		<title level="j">Remote Sens</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page">1914</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Evaluating the potential of Sentinel-2, Landsat-8, and IRS satellite images in tree species classification of hyrcanian forest of Iran using Random forest</title>
		<author>
			<persName><forename type="first">L</forename><surname>Soleimannejad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ullah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Abedi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dees</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Koch</surname></persName>
		</author>
		<idno type="DOI">10.1080/10549811.2019.1598443</idno>
	</analytic>
	<monogr>
		<title level="j">J Sustain Forest</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="page" from="615" to="628" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Evaluation of machine learning algorithms for forest stand species mapping using Sentinel-2 imagery and environmental data in the Polish Carpathians</title>
		<author>
			<persName><forename type="first">E</forename><surname>Grabska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Frantz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ostapowicz</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.rse.2020.112103</idno>
	</analytic>
	<monogr>
		<title level="j">Remote Sens Environ</title>
		<imprint>
			<biblScope unit="volume">251</biblScope>
			<biblScope unit="page">112103</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">System dynamic modelling and simulation for cultivation of forest land: Case study Perum Perhutani, Central Java, Indonesia</title>
		<author>
			<persName><forename type="first">C</forename><surname>Musi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Anggoro</surname></persName>
		</author>
		<author>
			<persName><surname>Sunarsih</surname></persName>
		</author>
		<idno type="DOI">10.12911/22998993/74307</idno>
	</analytic>
	<monogr>
		<title level="j">J Ecol Eng</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="page" from="25" to="34" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
