<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using remote sensing data in population density estimation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Evgeniy G. Shvetsov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nadezhda M. Tchebakova</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elena I. Parfenova</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Siberian Federal University</institution>
          ,
          <addr-line>Krasnoyarsk</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>V.N. Sukachev Institute of Forest, Federal Research Center “Krasnoyarsk Science Center SB RAS”</institution>
          ,
          <addr-line>Krasnoyarsk</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <fpage>550</fpage>
      <lpage>556</lpage>
      <abstract>
        <p>In recent decades, remote sensing methods have often been used to estimate population density, especially using data on nighttime illumination. Information about the spatial distribution of the population is important for understanding the dynamics of cities and analyzing various socio-economic, environmental and political factors. In this work, we have formed layers of the nighttime light index, surface temperature and vegetation index according to the SNPP/VIIRS satellite system for the territory of the central and southern regions of the Krasnoyarsk krai. Using these data, we have calculated VTLPI (vegetation temperature light population index) for the year 2013. The obtained values of the VTLPI calculated for a number of settlements of the Krasnoyarsk krai were compared with the results of the population census conducted in 2010. In total, we used census data for 40 settlements. Analysis of the data showed that the relationship between the value of the VTLPI index and the population density in the Krasnoyarsk krai can be adequately fitted ( 2 = 0.65) using a linear function. In this case, the value of the root-meansquare error was 345, and the relative error was 0.09. Using the obtained model equation and the spatial distribution of the VTLPI index using GIS tools, the distribution of the population over the study area was estimated with a spatial resolution of 500 meters. According to the obtained model and the VTLPI index, the average urban population density in the study area exceeded 500 people/km2. Comparison of the obtained data on the total population in the study area showed that the estimate based on the VTLPI index is about 21% higher than the actual census data.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Population density</kwd>
        <kwd>VIIRS</kwd>
        <kwd>nighttime lights</kwd>
        <kwd>VTLPI</kwd>
        <kwd>Krasnoyarsk krai</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Detailed information on the spatial distribution of the population is important for understanding
the dynamics of cities and analyzing various socio-economic, environmental and political
factors [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. The most readily available source of population data is usually census data. This
method, however, is expensive and time consuming, and therefore, population censuses are
rarely conducted [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        In recent decades, remote sensing methods have often been used to estimate population
density, especially using nighttime light (NTL) data. In addition, in recent years, to improve the
accuracy of the population density estimates, nighttime lights data have been used in conjunction
with other data sources (vegetation indices, data on types of vegetation cover) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. We have
used several sets of remote sensing data to assess the spatial distribution of the population
density in the southern and central regions of the Krasnoyarsk krai. The day night band (DNB)
of the VIIRS radiometer located on the Suomi NPP satellite was used as a source of nighttime
lights data. In addition, data on the Earth’s surface temperature and the NDVI vegetation index
were used to improve the accuracy of the population density estimates. These datasets were
also obtained using measurements made by the SNPP/VIIRS radiometer. The VTLPI (vegetation
temperature light population index) index [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] was used as a composite index to estimate the
population density.
      </p>
      <p>The main purpose of this work was the estimation of the population density in the central
and southern regions of the Krasnoyarsk krai using remote sensing data. This included the
following objectives: (1) creation of the composite index, including data on nighttime lights in
the study area, surface temperature and vegetation index; (2) estimation of population density
using a model derived from the 2010 census.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Calculation of the Vegetation Temperature Light Population</title>
    </sec>
    <sec id="sec-3">
      <title>Index from satellite data</title>
      <p>We have generated the GIS layers containing nighttime lights; surface temperature and
vegetation index using data from the SNPP/VIIRS satellite system.</p>
      <p>
        The study area included the central and southern regions of the Krasnoyarsk krai, covering
an area from 52∘ N to 58∘ N and from 88∘ E to 99∘ E. (Figure 1). Its total area was about
250 thousand km2, including three forest zones: taiga, forest-steppe, and the South Siberian
mountain zone [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. According to the vegetation map [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], the area of forested lands is about 50%.
      </p>
      <p>We have used satellite data obtained using the VIIRS radiometer of the SNPP satellite. As a
source of nighttime lights data, we have used a product generated by the National Geophysical
Data Center (NOAA, USA) and available at (https://www.ngdc.noaa.gov/eog/viirs/download_
dnb_composites.html). This product contains information on the light intensity from areas of
anthropogenic activity, as well as certain other objects or phenomena with a spatial resolution
of 15 arc seconds. We used data on nighttime lights for the territory of the Krasnoyarsk krai for
2013, which were averaged over 12 months, so each image pixel represented the mean annual
illumination value. After the averaging procedure, the final layer was re-projected into the
Albers Equal-Area Conical Projection with a spatial resolution of 500 meters.</p>
      <p>To calculate the vegetation indices, we used the VNP13A1 product, containing vegetation
indices, such as NDVI, EVI, as well as reflectance values in the visible and near infrared wave
ranges. During data preprocessing, we also have excluded areas containing clouds or cloud
shadows. Further, a composite image was created using the maximum NDVI values obtained
between the beginning of June and the end of August of the year 2013.</p>
      <p>Additionally, to calculate the composite VTLPI index, we also used data on the earth’s
surface temperature. The VNP21A1 product created using the VIIRS radiometer data was
used as a source of surface temperature data. Using product quality flags we have excluded
pixels containing clouds, cloud shadows, or low quality data. The final temperature layer was
calculated as the mean annual temperature. Further, the GIS layers containing the NDVI values
and surface temperature were also re-projected into the Albers equal-area conical projection
with a spatial resolution of 500 meters using the nearest neighbor method.</p>
      <p>
        To exclude areas occupied by water bodies, a map of land surface types was used [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. To
assess the population density, we used the VTLPI index [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], calculated as follows:
VTLPI =
√DNBnorm
NDVImax
      </p>
      <p>· LSTnorm,
LSTnorm =</p>
      <p>LST − LSTmin .</p>
      <p>LSTmax − LSTmin
where DNBnorm is the normalized DNB value, NDVImax is the maximum NDVI value, and
LSTnorm is the normalized surface temperature value. The normalized DNBnorm and LSTnorm
values were calculated as follows:</p>
      <p>Here LSTnorm is the normalized surface temperature, ranging from 0 to 1. LSTmax and LSTmin
are the maximum and minimum temperature values within the study area, respectively, and
LST is the temperature value of the analyzed pixel.</p>
      <p>√DNB −
√︀DNBnorm = √DNBmax −
√DNBmin .</p>
      <p>√DNBmin</p>
      <p>Here DNBnorm is the normalized value of the nighttime light for the area, ranging from 0 to 1.
DNBmax and DNBmin are the maximum and minimum nighttime light values within the study
area, respectively, and DNB is the value of the analyzed pixel.</p>
      <p>
        Unlike [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], we did not correct the VTLPI value for the height of the terrain above sea level,
since there were no mountain regions within our study area.
2.1. Modeling population density using remote sensing data
To use the VTLPI to assess the population density of the Krasnoyarsk krai, a model relationship
relating these two data sets was obtained. The materials of the year 2010 census were used as a
source of data on the actual population density. Census materials are publicly available on the
Rosstat website (https://rosstat.gov.ru/free_doc/new_site/perepis2010/croc/perepis_itogi1612.
htm).
      </p>
      <p>We compared the averaged VTLPI values calculated for a number of settlements within
the Krasnoyarsk krai with the population values obtained from the census data. To calculate
the mean VTLPI value, we used geoinformation layer of the settlements boundaries. Using
these boundaries, we have extracted fragments from the VTLPI raster layer corresponding to
settlements for which several statistical parameters were calculated. In total, we used data for
40 settlements in the central and southern regions of the Krasnoyarsk krai.</p>
      <p>According to the census data, the number of people in the analyzed settlements ranged
from 756 to 973826. At the same time, the mean VTLPI value for the considered settlements
ranged from 0.7 to 17. Figure 2 shows a diagram of the VTLPI values and population density
in the analyzed settlements according to the census data. Analysis of the data showed that
the relationship between these data series can be fitted using a linear function, such as  =
136.5 * VTLPI − 105, which reasonable well describes the dependence between the population
density and the value of the composite VTLPI index (2 = 0.65). Here  is the population
density (people/km2). In this case, the value of the root-mean-square error was 345, and the
median relative error was 0.09. The relative error was calculated as the diference between the
census data and the model data referred to the census data.</p>
      <p>Using the obtained model equation and the spatial distribution of the VTLPI index, we have
estimated the distribution of the population over the territory of the study area with a spatial
resolution of 500 m (Figure 3). According to the resulting model and the VTLPI index, the highest
population density was observed in cities, which corresponds to the actual census data. The
average density of the urban population in the considered territory exceeded 500 people/km2.</p>
      <p>Comparing our results with the census data for the study area we found that the VTLPI-based
estimate is about 21% higher than the actual census data. Thus, according to the model estimate,
the total population was 2.9 million, while according to the census data, this was approximately
2.4 million.</p>
      <p>There are several factors that can influence the accuracy of estimates. First, the VTLPI index
does not allow excluding the influence of industrial objects, which are characterized by rather
high values of the nighttime light measurements, temperature, as well as low values of the NDVI
index. For example, on the territory of the Achinsk refinery, the population density, according
to the model estimate, reached 4–5 thousand people/km2, which is obviously associated with
significant emissions of thermal and light radiation. Also, errors in determining the boundaries
of settlements used to create a sample in the process of the model parameterizing can have an
impact on the accuracy of estimates. In addition, the time diference between census data (2010)
and the time of satellite data acquisition (2013) is likely to have influenced the results.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Conclusions</title>
      <p>Using the archive of thematic products, generated using the data of the VIIRS radiometer,
we have created the GIS layers of the nighttime lights, the vegetation index NDVI and the
land surface temperature for the central and southern regions of the Krasnoyarsk krai for the
year 2013. Using these layers, we have calculated the VTLPI index related to population density.</p>
      <p>According to the year 2010 census data obtained for 40 settlements of the Krasnoyarsk krai, a
training sample was created, which was used to parameterize a linear model relating the VTLPI
index and the actual population density.</p>
      <p>Using the obtained model equation and the spatial distribution of the VTLPI index, the
population density for the study area was estimated with a spatial resolution of 500 m. At the
same time, comparison with the census data showed that the estimate based on the VTLPI index
is about 21% higher than the actual data.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work was financially supported by the Russian Foundation for Basic Research, the
Krasnoyarsk Regional Fund for the Support of Scientific and Scientific and Technical Activities, project
No. 19-45-240004-r_a “Predictions of the ecological-economic potential for possible “climatic”
migrations in the Angara-Yenisei macroregion in a changing climate of the 21st century”.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Ahola</surname>
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Virrantaus</surname>
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krisp</surname>
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hunter</surname>
            <given-names>G.J.</given-names>
          </string-name>
          <article-title>A spatio-temporal population model to support risk assessment and damage analysis for decision-making //</article-title>
          <source>International Journal of Geographical Information Science</source>
          .
          <year>2007</year>
          . Vol.
          <volume>21</volume>
          . P.
          <volume>935</volume>
          -
          <fpage>953</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Aubrecht</surname>
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Özceylan</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steinnocher</surname>
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freire</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>Multi-level geospatial modeling of human exposure patterns</article-title>
          and vulnerability indicators // Natural Hazards.
          <year>2013</year>
          . Vol.
          <volume>68</volume>
          . P.
          <volume>147</volume>
          -
          <fpage>163</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Yang</surname>
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dong</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiang</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            <given-names>H.</given-names>
          </string-name>
          <article-title>An updating system for the gridded population database of China based on remote sensing, GIS and spatial</article-title>
          database technologies // Sensors.
          <year>2009</year>
          . Vol.
          <volume>9</volume>
          . P.
          <volume>1128</volume>
          -
          <fpage>1140</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Roy</given-names>
            <surname>Chowdhury P.K</surname>
          </string-name>
          .,
          <string-name>
            <surname>Maithani</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dadhwal</surname>
            <given-names>V.K.</given-names>
          </string-name>
          <article-title>Estimation of urban population in IndoGangetic Plains using night-time OLS data //</article-title>
          <source>International Journal of Remote Sensing</source>
          .
          <year>2012</year>
          . Vol.
          <volume>33</volume>
          . Is. 8. P.
          <volume>2498</volume>
          -
          <fpage>2515</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Luo</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>X</given-names>
          </string-name>
          ., Cheng J.,
          <string-name>
            <surname>Sun</surname>
            <given-names>Q</given-names>
          </string-name>
          .
          <article-title>Modeling population density using a new index derived from multi-sensor image data // Remote sensing</article-title>
          .
          <source>2019</source>
          . Vol.
          <volume>11</volume>
          . P.
          <volume>2620</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>[6] Decree of the Ministry of Natural Resources of Russia “On approval of the list of forest zones of the Russian Federation and the list of forest regions of the Russian Federation”</article-title>
          <source>No. 367 dated 18.08.2014 (as amended on 19.02</source>
          .
          <year>2019</year>
          ). Moscow,
          <year>2014</year>
          . (In Russ.)
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Bartalev</surname>
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Belward</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erchov</surname>
            <given-names>D.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isaev</surname>
            <given-names>A.S.</given-names>
          </string-name>
          <article-title>A new SPOT4-VEGETATION derived land cover map of Northern Eurasia //</article-title>
          <source>International Journal of Remote Sensing</source>
          .
          <year>2003</year>
          . Vol.
          <volume>24</volume>
          . Is. 9.
          <string-name>
            <surname>P.</surname>
          </string-name>
          <year>1977</year>
          -
          <fpage>1982</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>