=Paper= {{Paper |id=Vol-3006/65_short_paper |storemode=property |title=Using remote sensing data in population density estimation |pdfUrl=https://ceur-ws.org/Vol-3006/65_short_paper.pdf |volume=Vol-3006 |authors=Evgeniy G. Shvetsov,Nadezhda M. Tchebakova,Elena I. Parfenova }} ==Using remote sensing data in population density estimation== https://ceur-ws.org/Vol-3006/65_short_paper.pdf
Using remote sensing data in population density
estimation
Evgeniy G. Shvetsov1,2 , Nadezhda M. Tchebakova3 and Elena I. Parfenova4
1
    V.N. Sukachev Institute of Forest, Federal Research Center “Krasnoyarsk Science Center SB RAS”, Krasnoyarsk, Russia
2
    Siberian Federal University, Krasnoyarsk, Russia


                                         Abstract
                                         In recent decades, remote sensing methods have often been used to estimate population density, especially
                                         using data on nighttime illumination. Information about the spatial distribution of the population is
                                         important for understanding the dynamics of cities and analyzing various socio-economic, environmental
                                         and political factors. In this work, we have formed layers of the nighttime light index, surface temperature
                                         and vegetation index according to the SNPP/VIIRS satellite system for the territory of the central and
                                         southern regions of the Krasnoyarsk krai. Using these data, we have calculated VTLPI (vegetation
                                         temperature light population index) for the year 2013. The obtained values of the VTLPI calculated for a
                                         number of settlements of the Krasnoyarsk krai were compared with the results of the population census
                                         conducted in 2010. In total, we used census data for 40 settlements. Analysis of the data showed that the
                                         relationship between the value of the VTLPI index and the population density in the Krasnoyarsk krai
                                         can be adequately fitted (𝑅2 = 0.65) using a linear function. In this case, the value of the root-mean-
                                         square error was 345, and the relative error was 0.09. Using the obtained model equation and the spatial
                                         distribution of the VTLPI index using GIS tools, the distribution of the population over the study area
                                         was estimated with a spatial resolution of 500 meters. According to the obtained model and the VTLPI
                                         index, the average urban population density in the study area exceeded 500 people/km2 . Comparison of
                                         the obtained data on the total population in the study area showed that the estimate based on the VTLPI
                                         index is about 21% higher than the actual census data.

                                         Keywords
                                         Population density, VIIRS, nighttime lights, VTLPI, Krasnoyarsk krai.




1. Introduction
Detailed information on the spatial distribution of the population is important for understanding
the dynamics of cities and analyzing various socio-economic, environmental and political
factors [1, 2]. The most readily available source of population data is usually census data. This
method, however, is expensive and time consuming, and therefore, population censuses are
rarely conducted [3].
   In recent decades, remote sensing methods have often been used to estimate population
density, especially using nighttime light (NTL) data. In addition, in recent years, to improve the
accuracy of the population density estimates, nighttime lights data have been used in conjunction
with other data sources (vegetation indices, data on types of vegetation cover) [4]. We have
used several sets of remote sensing data to assess the spatial distribution of the population

SDM-2021: All-Russian conference, August 24–27, 2021, Novosibirsk, Russia
" eugeneshvetsov11@yandex.ru (E. G. Shvetsov)
                                       © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073       CEUR Workshop Proceedings (CEUR-WS.org)



                                                                                                         550
Evgeniy G. Shvetsov et al. CEUR Workshop Proceedings                                    550–556


density in the southern and central regions of the Krasnoyarsk krai. The day night band (DNB)
of the VIIRS radiometer located on the Suomi NPP satellite was used as a source of nighttime
lights data. In addition, data on the Earth’s surface temperature and the NDVI vegetation index
were used to improve the accuracy of the population density estimates. These datasets were
also obtained using measurements made by the SNPP/VIIRS radiometer. The VTLPI (vegetation
temperature light population index) index [5] was used as a composite index to estimate the
population density.
   The main purpose of this work was the estimation of the population density in the central
and southern regions of the Krasnoyarsk krai using remote sensing data. This included the
following objectives: (1) creation of the composite index, including data on nighttime lights in
the study area, surface temperature and vegetation index; (2) estimation of population density
using a model derived from the 2010 census.


2. Calculation of the Vegetation Temperature Light Population
   Index from satellite data
We have generated the GIS layers containing nighttime lights; surface temperature and vegeta-
tion index using data from the SNPP/VIIRS satellite system.
   The study area included the central and southern regions of the Krasnoyarsk krai, covering
an area from 52∘ N to 58∘ N and from 88∘ E to 99∘ E. (Figure 1). Its total area was about
250 thousand km2 , including three forest zones: taiga, forest-steppe, and the South Siberian
mountain zone [6]. According to the vegetation map [7], the area of forested lands is about 50%.
   We have used satellite data obtained using the VIIRS radiometer of the SNPP satellite. As a
source of nighttime lights data, we have used a product generated by the National Geophysical
Data Center (NOAA, USA) and available at (https://www.ngdc.noaa.gov/eog/viirs/download_
dnb_composites.html). This product contains information on the light intensity from areas of
anthropogenic activity, as well as certain other objects or phenomena with a spatial resolution
of 15 arc seconds. We used data on nighttime lights for the territory of the Krasnoyarsk krai for
2013, which were averaged over 12 months, so each image pixel represented the mean annual
illumination value. After the averaging procedure, the final layer was re-projected into the
Albers Equal-Area Conical Projection with a spatial resolution of 500 meters.
   To calculate the vegetation indices, we used the VNP13A1 product, containing vegetation
indices, such as NDVI, EVI, as well as reflectance values in the visible and near infrared wave
ranges. During data preprocessing, we also have excluded areas containing clouds or cloud
shadows. Further, a composite image was created using the maximum NDVI values obtained
between the beginning of June and the end of August of the year 2013.
   Additionally, to calculate the composite VTLPI index, we also used data on the earth’s
surface temperature. The VNP21A1 product created using the VIIRS radiometer data was
used as a source of surface temperature data. Using product quality flags we have excluded
pixels containing clouds, cloud shadows, or low quality data. The final temperature layer was
calculated as the mean annual temperature. Further, the GIS layers containing the NDVI values
and surface temperature were also re-projected into the Albers equal-area conical projection
with a spatial resolution of 500 meters using the nearest neighbor method.



                                               551
Evgeniy G. Shvetsov et al. CEUR Workshop Proceedings                                          550–556




Figure 1: GIS layer containing VTLPI spatial distribution for the territory of the central and southern
regions of the Krasnoyarsk krai, obtained using VIIRS radiometer data.


  To exclude areas occupied by water bodies, a map of land surface types was used [7]. To
assess the population density, we used the VTLPI index [5], calculated as follows:
                                         √
                                           DNBnorm
                               VTLPI =              · LSTnorm ,
                                          NDVImax
where DNBnorm is the normalized DNB value, NDVImax is the maximum NDVI value, and
LSTnorm is the normalized surface temperature value. The normalized DNBnorm and LSTnorm
values were calculated as follows:
                                                LST − LSTmin
                                  LSTnorm =                    .
                                               LSTmax − LSTmin
  Here LSTnorm is the normalized surface temperature, ranging from 0 to 1. LSTmax and LSTmin
are the maximum and minimum temperature values within the study area, respectively, and
LST is the temperature value of the analyzed pixel.
                                           √         √
                                             DNB − DNBmin
                             DNBnorm = √
                          √︀
                                                       √         .
                                            DNBmax − DNBmin




                                                 552
Evgeniy G. Shvetsov et al. CEUR Workshop Proceedings                                    550–556


   Here DNBnorm is the normalized value of the nighttime light for the area, ranging from 0 to 1.
DNBmax and DNBmin are the maximum and minimum nighttime light values within the study
area, respectively, and DNB is the value of the analyzed pixel.
   Unlike [5], we did not correct the VTLPI value for the height of the terrain above sea level,
since there were no mountain regions within our study area.

2.1. Modeling population density using remote sensing data
To use the VTLPI to assess the population density of the Krasnoyarsk krai, a model relationship
relating these two data sets was obtained. The materials of the year 2010 census were used as a
source of data on the actual population density. Census materials are publicly available on the
Rosstat website (https://rosstat.gov.ru/free_doc/new_site/perepis2010/croc/perepis_itogi1612.
htm).
   We compared the averaged VTLPI values calculated for a number of settlements within
the Krasnoyarsk krai with the population values obtained from the census data. To calculate
the mean VTLPI value, we used geoinformation layer of the settlements boundaries. Using
these boundaries, we have extracted fragments from the VTLPI raster layer corresponding to
settlements for which several statistical parameters were calculated. In total, we used data for
40 settlements in the central and southern regions of the Krasnoyarsk krai.
   According to the census data, the number of people in the analyzed settlements ranged
from 756 to 973826. At the same time, the mean VTLPI value for the considered settlements
ranged from 0.7 to 17. Figure 2 shows a diagram of the VTLPI values and population density
in the analyzed settlements according to the census data. Analysis of the data showed that
the relationship between these data series can be fitted using a linear function, such as 𝑃 =
136.5 * VTLPI − 105, which reasonable well describes the dependence between the population
density and the value of the composite VTLPI index (𝑅2 = 0.65). Here 𝑃 is the population
density (people/km2 ). In this case, the value of the root-mean-square error was 345, and the
median relative error was 0.09. The relative error was calculated as the difference between the
census data and the model data referred to the census data.




Figure 2: The relationship between the VTLPI index and population density according to the 2010
census. Each point corresponds to one settlement. The solid line represents the linear fit.




                                               553
Evgeniy G. Shvetsov et al. CEUR Workshop Proceedings                                     550–556




Figure 3: Spatial distribution of population density in the study area.


   Using the obtained model equation and the spatial distribution of the VTLPI index, we have
estimated the distribution of the population over the territory of the study area with a spatial
resolution of 500 m (Figure 3). According to the resulting model and the VTLPI index, the highest
population density was observed in cities, which corresponds to the actual census data. The
average density of the urban population in the considered territory exceeded 500 people/km2 .
   Comparing our results with the census data for the study area we found that the VTLPI-based
estimate is about 21% higher than the actual census data. Thus, according to the model estimate,
the total population was 2.9 million, while according to the census data, this was approximately
2.4 million.
   There are several factors that can influence the accuracy of estimates. First, the VTLPI index
does not allow excluding the influence of industrial objects, which are characterized by rather
high values of the nighttime light measurements, temperature, as well as low values of the NDVI
index. For example, on the territory of the Achinsk refinery, the population density, according
to the model estimate, reached 4–5 thousand people/km2 , which is obviously associated with
significant emissions of thermal and light radiation. Also, errors in determining the boundaries
of settlements used to create a sample in the process of the model parameterizing can have an
impact on the accuracy of estimates. In addition, the time difference between census data (2010)
and the time of satellite data acquisition (2013) is likely to have influenced the results.




                                                  554
Evgeniy G. Shvetsov et al. CEUR Workshop Proceedings                                     550–556


3. Conclusions
Using the archive of thematic products, generated using the data of the VIIRS radiometer,
we have created the GIS layers of the nighttime lights, the vegetation index NDVI and the
land surface temperature for the central and southern regions of the Krasnoyarsk krai for the
year 2013. Using these layers, we have calculated the VTLPI index related to population density.
   According to the year 2010 census data obtained for 40 settlements of the Krasnoyarsk krai, a
training sample was created, which was used to parameterize a linear model relating the VTLPI
index and the actual population density.
   Using the obtained model equation and the spatial distribution of the VTLPI index, the
population density for the study area was estimated with a spatial resolution of 500 m. At the
same time, comparison with the census data showed that the estimate based on the VTLPI index
is about 21% higher than the actual data.


Acknowledgments
This work was financially supported by the Russian Foundation for Basic Research, the Krasno-
yarsk Regional Fund for the Support of Scientific and Scientific and Technical Activities, project
No. 19-45-240004-r_a “Predictions of the ecological-economic potential for possible “climatic”
migrations in the Angara-Yenisei macroregion in a changing climate of the 21st century”.


References
 [1] Ahola T., Virrantaus K., Krisp J.M., Hunter G.J. A spatio-temporal population model to
     support risk assessment and damage analysis for decision-making // International Journal
     of Geographical Information Science. 2007. Vol. 21. P. 935–953.
 [2] Aubrecht C., Özceylan D., Steinnocher K., Freire S. Multi-level geospatial modeling of
     human exposure patterns and vulnerability indicators // Natural Hazards. 2013. Vol. 68.
     P. 147–163.
 [3] Yang X., Huang Y., Dong P., Jiang D., Liu H. An updating system for the gridded population
     database of China based on remote sensing, GIS and spatial database technologies //
     Sensors. 2009. Vol. 9. P. 1128–1140.
 [4] Roy Chowdhury P.K., Maithani S., Dadhwal V.K. Estimation of urban population in Indo-
     Gangetic Plains using night-time OLS data // International Journal of Remote Sensing.
     2012. Vol. 33. Is. 8. P. 2498–2515.
 [5] Luo P., Zhang X., Cheng J., Sun Q. Modeling population density using a new index derived
     from multi-sensor image data // Remote sensing. 2019. Vol. 11. P. 2620.
 [6] Decree of the Ministry of Natural Resources of Russia “On approval of the list of forest
     zones of the Russian Federation and the list of forest regions of the Russian Federation”
     No. 367 dated 18.08.2014 (as amended on 19.02.2019). Moscow, 2014. (In Russ.)
 [7] Bartalev S.A., Belward A., Erchov D.V., Isaev A.S. A new SPOT4-VEGETATION derived
     land cover map of Northern Eurasia // International Journal of Remote Sensing. 2003.
     Vol. 24. Is. 9. P. 1977–1982.



                                               555