<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Segmentation of Illuminated Areas of Light Using CNN and Large-Scale RGB+D Dataset for Augmented and Mixed Reality Systems*</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>M. Sorokin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>D. Zhdanov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Zhdanov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ITMO University</institution>
          ,
          <addr-line>St. Petersburg</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1982</year>
      </pub-date>
      <fpage>0000</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>This work is devoted to the problem of restoring realistic rendering for augmented and mixed reality systems. Finding the light sources and restoring the correct distribution of scene brightness is one of the key parameters that allows to solve the problem of correct interaction between the virtual and real worlds. With the advent of such datasets as, "LARGE-SCALE RGB + D," it became possible to train neural networks to recognize the depth map of images, which is a key requirement for working with the environment in real time. Additionally, in this work, convolutional neural networks were trained on the synthesized dataset with realistic lighting. The results of the proposed methods are presented, the accuracy of restoring the position of the light sources is estimated, and the visual difference between the image of the scene with the original light sources and the same scene. The speed allows it to be used in real-time AR/VR systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Augmented Reality</kwd>
        <kwd>Mixed Reality</kwd>
        <kwd>Fully Convolutional Network</kwd>
        <kwd>Segmentation</kwd>
        <kwd>Deep Learning</kwd>
        <kwd>CNN</kwd>
        <kwd>Computer Vision Algorithms</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Augmented and mixed reality systems are being used in many tasks, however the
incorrect illumination of the virtual world objects may cause discomfort in the perception
of the reality, in which objects of the real and virtual worlds are mixed and as a result
this limits the time that a person can be in the mixed reality, and further it restricts the
practical use of the mixed reality systems in various areas, for example - in education
or training.</p>
      <p>This article is devoted to convolutional neural network methods (CNN) for solving
the global scientific problem in the field of the physically correct and effective
restoration of illumination conditions and optical properties of real-world objects during the
synthesis of images of the virtual world.</p>
      <p>First, this work is focused on determining the real power of illumination of the light
flux and its position in an environment. For this a manually synthesized sample of
images with realistic optical parameters of the medium were used. Although the sample
consists of only 260 images (221 were used for training, and39 for the test), the neural
network at the output classifies with good accuracy the real optical parameters of the
illumination of the medium. These are usually divided by the strength of illumination
into 5 classes, where the first is 0 lumens, which means it is not lit at all, while grade 5
is the source of illumination of an ordinary room lamp.</p>
      <p>Moreover, for the reconstruction of the depth of an environment the "Large-Scale
RGB + D Dataset" was used, which was obtained using Kinect v2 and Zed stereo
camera and their disparity maps.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related works</title>
      <p>
        The idea of environment segmentation is already trivial, but every year new approaches,
neural network architectures and datasets appear that improve the previous results. An
analysis of outdoor lighting using a fully convolution network is presented in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
where analyzes panoramic images of the environment in an open air as input and
embeds the image under these environmental conditions. The work [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] also analyzes the
environment and builds the shadows of objects as they should be. The convolution
network is also used in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], but to determine where the object is located: outdoors or
indoors. The following article [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] presents its own architecture and solves three different
problems: predicting depth, evaluating surface normals, and semantic
marking. Many works [
        <xref ref-type="bibr" rid="ref6 ref7 ref8 ref9">6, 7, 8, 9</xref>
        ] were aimed at detecting objects using convolutional
neural networks.
      </p>
      <p>
        Works [
        <xref ref-type="bibr" rid="ref11 ref12 ref13 ref14">11,12,13,14</xref>
        ] as well as this work are aimed at restoring lighting for
augmented reality systems, but for different purposes and tasks. Among these articles,
methods for analyzing direct lighting are considered, without taking into account the
secondary lighting for objects of virtual reality. Also, these works are aimed at finding
the direct observer in the augmented reality and lighting system relative to the user. The
difference between the neural network described in this paper is that the data set is
generated using a powerful renderer - “Lumicept” [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which were used to train the
neural network, restoring segmented sections of light similar to reference images with
ground truth using the categorical cross entropy object function. The main task of the
current work is to determine and classify the real illumination power of a real room and
the light source position.
      </p>
      <p>
        The “LARGE-SCALE RGB+D DATABASE” dataset contains synchronized RGBD
frames from both Kinect v2 and Zed stereo camera. For the outdoor scene, they first
generated disparity maps using an accurate stereo matching method and converted them
using calibration parameters. A per-pixel confidence map of disparity is also provided.
The scenes are captured at various places, e.g., offices, rooms, dormitory, exhibition
center, street, road etc., from Yonsei University and Ewha University. This dataset has
been used to train convolutional neural networks in projects [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and in papers[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ],
      </p>
      <p>
        Segmentation of Illuminated Areas of Light Using CNN and Large-Scale RGB+D Dataset 3
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] "High quality 2D-to-multiview contents generation from large-scale
RGB-D database".
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Implementation</title>
      <p>To train a neural network for classification depth maps a sorted dataset "Large-Scale
RGB + D Dataset" of indoor images was used, which consists of 1609 images for
training and 503 images for testing. These images were obtained using Kinect v2 and Zed
stereo cameras, with the calculation of their disparity maps and restoration of the pixel
confidence. Sorted "Large-Scale RGB + D Dataset" comes with high-resolution and
low-resolution quality, in current work was used the low-resolution pack with
squeezing all images to 224*224 pixels, so that it would be possible to train them on the
architecture of VGG 16-Net.</p>
      <p>The working distance of the Kinect is up to 6 meters. The level of gradation between
the distances of measurement levels is about ~30 cm. In total, in this approach with the
neural network of the VGG 16-Net architecture 20 output neurons were used, to
produce distance measurement with a 20-level heat map. As a convolution network
architecture, it was decided to use the VGG16 Net architecture because it was successfully
used in many classification tasks, it consists of 5 blocks with convolution, pooling, and
“ReLU” activation function between layers and the optimization method is “Nesterov”.</p>
      <p>For lighting classification method was used manually synthesized dataset of lighting
of different rooms. The task of a fully convolutional network is to classify each pixel
of an image into one class. That is, passing through all convolutional layers, the network
characterizes a certain area of the image to one class in accordance with the power of
illumination.</p>
      <p>The architecture is presented on figure 1.
The neural network training for the classification of light took 50 epochs, for depth map
- 200 epochs. The training took place on a GeForse GTX 1080Ti video card. The
original image was fed with 5 light area masks and 20 depth data masks. Dataset images
are shown in figures 2 and 3.</p>
      <p>Fig. 3. The training dataset images for depth classification.</p>
      <p>Segmentation of Illuminated Areas of Light Using CNN and Large-Scale RGB+D Dataset 5
The history of training and results of work are presented in figures 4 and 5.
In this work, neural networks were trained to restore light sources and depth maps of
the indoor scene. The architecture of this network is good for classifying data with
many classes. The speed allows it to be used in real-time systems. The recognition
accuracy of light sources in some scenes turned out to be quite good; in the future, it is
planned to improve the coordinate recovery algorithm for greater accuracy, which can
easily restore the distance to any point in the image, but working with images
significantly faster than with 3D models.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Further work</title>
      <p>In further work it is planned to improve the results by using other neural network
architectures and image processing using computer vision algorithms to strengthen the
output results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Hold-Geoffroy</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sunkavalli</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hadap</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gambaretto</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lalonde</surname>
            ,
            <given-names>J.-F.</given-names>
          </string-name>
          ,
          <article-title>"Deep outdoor illumination estimation,"</article-title>
          <source>In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lalonde</surname>
            ,
            <given-names>J.-F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Efros</surname>
            ,
            <given-names>A. A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Narasimhan</surname>
            ,
            <given-names>S. G.</given-names>
          </string-name>
          ,
          <article-title>"Estimating the natural illumination conditions from a single outdoor image,"</article-title>
          <source>International Journal of Computer Vision</source>
          ,
          <volume>98</volume>
          (
          <issue>2</issue>
          ),
          <fpage>123</fpage>
          -
          <lpage>145</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gardner</surname>
            , M.-
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sunkavalli</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yumer</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gambaretto</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gagne</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lalonde</surname>
            ,
            <given-names>J.-F.</given-names>
          </string-name>
          ,
          <article-title>"Learning to predict indoor illumination from a single image,"</article-title>
          <source>ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia)</source>
          ,
          <source>preprints (</source>
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Lombardi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Nishino</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>"Reflectance and Illumination Recovery in the Wild,"</article-title>
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>38</volume>
          ,
          <fpage>129</fpage>
          -
          <lpage>141</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Eigen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Fergus</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <article-title>"Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture, "</article-title>
          <source>International Conference on Computer Vision</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Girshick</surname>
            ,
            <given-names>R. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Donahue</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darrell</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Malik</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <article-title>"Rich feature hierarchies for accurate object detection and semantic segmentation," CVPR (</article-title>
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Sermanet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eigen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mathieu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fergus</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and LeCun, Y.,
          <article-title>"Overfeat: Integrated recognition, localization and detection using convolutional networks," ICLR (</article-title>
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>"Very deep convolutional networks for large-scale image recognition," CoRR</article-title>
          , abs/1409.1556 (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sermanet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reed</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anguelov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erhan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vanhoucke</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Rabinovich</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>"Going deeper with convolutions," CoRR</article-title>
          , abs/1409.4842 (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Heymann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smolic</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Froehlich</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <article-title>"Illumination reconstruction from realtime video for interactive augmented reality,"</article-title>
          <source>International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Bruno Augusto Dorta,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Rafael</given-names>
            <surname>Rego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Cristina</surname>
          </string-name>
          <string-name>
            <given-names>Nader</given-names>
            , Vasconcelos.,
            <surname>Esteban</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          ,
          <article-title>"Deep light source estimation for mixed reality,"</article-title>
          <source>VISIGRAPP 2018 - Proceedings of the 13th International Joint Conference on Computer Vision</source>
          , Imaging and
          <source>Computer Graphics Theory and Applications</source>
          , (
          <volume>303</volume>
          -
          <fpage>311</fpage>
          ) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Salma</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Philippe</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eric</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <article-title>"Illumination Estimation Using Cast Shadows for Realistic Augmented Reality Applications,"</article-title>
          <source>Adjunct Proceedings of the 2017 IEEE International Symposium on Mixed and Augmented Reality</source>
          ,
          <string-name>
            <surname>ISMAR-Adjunct</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Frahm</surname>
            ,
            <given-names>Jan-Michael.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koeser</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grest</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koch</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <article-title>"Markerless Augmented Reality with Light Source Estimation for Direct Illumination,"</article-title>
          <source>European Conference on Visual Media Production</source>
          , (
          <volume>211</volume>
          -
          <fpage>220</fpage>
          ) (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <article-title>"Lumicept | Integra Inc.," Integra Inc</article-title>
          .,
          <year>2019</year>
          , &lt;https://integra.jp/en/products/lumicept&gt; (
          <source>Last accessed 19 Oct</source>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <article-title>Digital Image Media Laboratory (DIML), “DIML/CVL RGB-D technical_report”, This material is presented to provide a detailed description about the DIML/CVl RGB-D dataset</article-title>
          . https://dimlrgbd.github.
          <source>io/downloads/technical_report.pdf (Last accessed 27 July</source>
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ham</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oh</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>"Structure selective depth super-resolution for RGB-D cameras,"</article-title>
          <source>IEEE Trnas. on Image Processing</source>
          , vol.
          <volume>25</volume>
          , no.
          <issue>11</issue>
          , pp.
          <fpage>5527</fpage>
          -
          <lpage>38</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Min</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ham</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>"Deep Stereo Confidence Prediction for Depth Estimation,"</article-title>
          <source>IEEE International Conference on Image Processing</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jung</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Min</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>"Deep Monocular Depth Estimation via Integration of Global and Local Predictions,"</article-title>
          <source>IEEE Trnas. on Image Processing</source>
          , vol.
          <volume>27</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>4131</fpage>
          -
          <lpage>43</lpage>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Cho</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Min</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sohn</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>"A Large RGB-D Dataset for Semi-supervised Monocular Depth Estimation", Submitted to the IEEE Transactions on Image Processing (</article-title>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>