<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Spatial Knowledge and Information Canada</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Landcover classification using texture- encoded convolutional neural networks: peeking inside the black box</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>MALIK KARIM</string-name>
          <email>malikkarim360@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>COLIN ROBERTSON</string-name>
          <email>crobertson@wlu.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Geography and Environmental, Studies, Wilfrid Laurier University</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>7</volume>
      <issue>6</issue>
      <abstract>
        <p />
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Pattern recognition, object detection, and
image classification are typical areas in
which contemporary computer vision
algorithms are being deployed. In
highresolution remotely sensed image
classification problems, texture features can
be crucial to the recognition of different
landcover classes. Using a classical
convolutional neural network (CNN) and
texture-encoded CNN variant, early
concatenation CNN (EC-CNN), we explore
the relevance of texture-based features in
landcover classification. In this paper, we
demonstrate the utility of using shallow
layers of a CNN for learning discriminative
local texture features in very high-resolution
images. We apply these models to a case
study problem involving ground lichen
classification in a tundra ecosystem and
found that the texture EC-CNN
outperformed the non-texture based classical
CNN. Given that deep learning models are
often perceived as “black-boxes”, in order to
illustrate how effective texture models
represent landcover features, we extract
feature maps from each model to provide a
visual interpretation of the texture patterns
learned by the various models. The CNN
model saliency maps contain more localized
patterns which are not easily interpretable,
visually. The EC-CNN model on the other
hand contain patterns that are more
intuitive and representative of fine-grained
textures. Furthermore, almost all filters in
the model detected locally significant
patterns in the landscape. This finding
suggests the potential generalizability of
texture-based CNNs and that classification
errors associated with such models might be
lower than that of traditional CNNs.</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        Landcover classification is becoming
increasingly vital and more sophisticated as
remotely sensed data are now available at
high temporal and spatial resolutions.
Landcover information is crucial for
monitoring, and reporting on transitions in
vegetation types across time and space
        <xref ref-type="bibr" rid="ref1">(Albert and Gonz, 2017)</xref>
        . Remotely sensed
image classification represents a domain in
which computer vision methods are now
widely applied. Landcover type
discrimination is a typical task in which
machine learning algorithms are frequently
deployed. The recent successes of artificial
neural networks in object detection, pattern
recognition and scene classification tasks
have resulted in growing interest in
exploring the capabilities of these models
        <xref ref-type="bibr" rid="ref8">(Hinton et al., 2012)</xref>
        . While traditional
landcover mapping methods focus on
spectral homogeneity of classes as the basis
for discrimination, there are several cases
where the spatial arrangement of features
on the landscape is a key discriminant
feature. Forest fragmentation, thermokarst,
and vegetation patterns in arid ecosystems
are all typically recognized by their texture
(i.e., spacing and arrangement) rather than
their spectral characteristics. Most
traditional classification techniques and
classifiers are unable to achieve high
accuracy in these settings due to high data
dimensionality and scale dependencies
        <xref ref-type="bibr" rid="ref4">(Basu et al., 2018)</xref>
        . On recognizing this
limitation, there has been increasing
interest in approaching texture mapping
using ANN techniques
        <xref ref-type="bibr" rid="ref10">(Lloyd et al., 2004)</xref>
        .
        <xref ref-type="bibr" rid="ref2">Andrearczyk and Whelan (2016)</xref>
        found that
a texture explicit CNN model out-performed
the state-of-the art models in texture
datasets. In a related study,
        <xref ref-type="bibr" rid="ref5">Cimpoi and
Vedaldi (2015)</xref>
        demonstrated CNN models
capability to detect textural classes in scenes
characterized by diverse texture categories .
Research in landcover classification and
landscape comparison has demonstrated
the potential of CNNs, especially in
determining similarities in urban land use
patterns
        <xref ref-type="bibr" rid="ref1">(Albert and Gonz, 2017)</xref>
        . Despite
the level of improvements achieved using
CNNs, the fact that higher hierarchical
features are often used to perform
classifications in classical CNNs raises
questions pertaining to their
generalizability, transferability and error
rate across domains, locations, and datasets.
Higher-level CNN features or activations
lack geometric invariance, thus diminishing
their robustness and generalization power
for classification and mapping across
variable scene configurations
        <xref ref-type="bibr" rid="ref7">(Gong et al.,
2014)</xref>
        . Lower-layer feature maps however
contain significant local information that
capture texture and hence extracting and
concatenating such features could build
texture explicit models which are robust and
less prone to classification errors. Research
on the use of dense lower CNN features
information in classification task is limited.
This necessitates the specification of explicit
texture models capable of learning and
accurately representing textured landcovers.
We specify a texture-encoded CNN for
feature extraction and classification of
lichen for a landscape in a low-tundra
ecosystem in Northwest Territories, Canada.
We further compare a classical CNN with
that of our texture-based model. Given that
CNNs are to some degree, considered “black
boxes”, we adapt existing approaches to
compare activation feature maps from the
CNN models
        <xref ref-type="bibr" rid="ref13 ref9">(Jacobs and Goldman, 2010;
Selvaraju et al., 2017)</xref>
        . The contributions of
this paper are therefore two-fold: (a) design
of a simple texture-encoded CNN for
landcover classification, and (b) use of
computer vision techniques to visualize and
gain insight into how well the models learn
texture patterns.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. Experimental methods</title>
      <p>
        2.1 Texture-encoded CNN model design
CNNs consist of filter banks capable of
extracting hierarchical spatial features using
a weight-sharing framework
        <xref ref-type="bibr" rid="ref5">(Cimpoi et al.,
2015)</xref>
        . Texture analysis and synthesis has
been implemented in CNNs and proven to
represent discriminative local features
        <xref ref-type="bibr" rid="ref14 ref6">(Gatys et al., 2015; Ustyuzhaninov et al.,
2016)</xref>
        . Our design of a texture-based CNN is
motivated by previous findings that
lowerlayers pool dense orderless features and
capture relevant local patterns as compared
to higher-layers which contain global shape
information
        <xref ref-type="bibr" rid="ref5">(Cimpoi et al., 2015)</xref>
        . Our
approach is also inspired by
        <xref ref-type="bibr" rid="ref2">Andrearczyk
and Whelan (2016)</xref>
        technique that derives
an energy feature vector from the
penultimate pooling layer and concatenates
them with the first fully connected layer.
In the context of landscape or landcover
classification and similarity search, global
shape information present in fully
connected layers is of little relevance as
spatial patterns often bear no uniquely
defined geometry across space and time.
Our method encompasses the concatenation
of multi-layer features and learning a
representation of the data generating
process concurrently. In the feature fusion
framework, feature maps from three
hierarchical layers are concatenated; these
are subsequently flattened to feature vectors
to construct the first fully connected (FC)
layer. Here, we attempt to derive a
nonexisting fully connected layer FC1 via
merging features from all preceding layers
(i.e., pool1, pool2, and pool3). The classical
CNN is also designed in parallel to enable
performance comparison with the texture
CNN. Figure 1 depicts the architecture of a
classical CNN and a texture-encoded CNN.
The architecture of both CNNs (i.e., the size
of hidden layers) is adapted from the VGG
model
        <xref ref-type="bibr" rid="ref12">(Simonyan and Zisserman, 2014)</xref>
        ,
with slight modifications in the first and
second hidden layers to learn 96 features.
Table 1.0 provides further information
regarding the model architecture.
2.2 Sample preparation and model
training
Imagery from an unmanned aerial vehicle
(UAV) as well as georeferenced ground
photographs from a study site at Daring
Lake, Northwestern Territories, Canada are
used in this study. Spatial resolutions of the
sample imagery are 0.05 cm and 1 cm for
the ground photos and UAV, respectively.
To generate the training samples, high
resolution ground-truth photos from the
study site were tiled into 225 × 225 patches
of homogenous landcover categories. We
also generated 225 × 225 non-homogeneous
tiles comprising all the three vegetation
categories. These are reserved for probing
into CNN layers to unravel what the models
have learned. The tile dimensions used are
significant enough to contain relevant
discriminative features in the 5 × 5 receptive
field of the CNN filters
        <xref ref-type="bibr" rid="ref3">(Basu et al., 2015)</xref>
        .
The lichen class contains 1300 samples
while the green and colored vegetation
categories comprise 1000 images in each
class, resulting in 3300 patches. The green
vegetation class mainly comprise sedges,
birch and alder shrubs and grasses, while
the colored vegetation category represents
primarily dwarf shrubs in the genus
Arctostaphylos (i.e., bearberry). Figure 2.0
depicts samples of the landcover classes.
The ground photos are used exclusively for
model training, and classification is
implemented on 450 × 450 tiles of UAV
imagery. The rectified linear unit (ReLU)
method is employed in the convolutional
layers and fully connected layers to effect
non-linear input transformation. All images
are standardized to zero mean and unit
variance. Image standardization is essential
to mitigating the ReLU signal saturation
and ensuring convergence of the gradient
descent algorithm
        <xref ref-type="bibr" rid="ref11">(Nair and Hinton, 2010)</xref>
        .
In training, the stochastic gradient
algorithm with cross-entropy cost function
is used. The learning rate, momentum and
number of iterations are set to 0.01, 0.9 and
50, respectively. In order reduce overfitting,
a 50% drop-out is applied to both
convolutional and fully connected layers.
Shutting 50% hidden units (drop-out) is an
effective regularization technique for
minimizing potential overfitting. The
learning rate, and momentum were held
unchanged. An epoch of 50 was selected
after training the models for incremental
iterations of 20, 30, 40, and 50. The models
performance did not improve appreciably
after 40 epochs, thus the selection of 50
epochs for training both models. The
training, validation and test data comprise
60%, 20% and 20%, respectively. The
experiment is conducted with CPU
implementation using Keras + Tensorflow
in Python. Training duration is 9 hours on
i7-3930K CPU @ 3.20GHz, and 52.0GB
windows computer.
      </p>
    </sec>
    <sec id="sec-4">
      <title>3. Results and discussion</title>
      <p>The model accuracy reports on the test
sample are 97.50% and 96.52%,
respectively, for EC-CNN and CNN models;
this suggests a marginal improvement in
performance of the texture-based model
compared to the classical CNN, and
probably accounts for the close similarity in
the classification results outputted by the
models. A plausible cause of this slight
difference in test accuracy can be attributed
to the relatively small training sample size
used. With sample size of 2640 (60% of
3300) used in training and validation (20%
of 3300), the traditional CNN is likely to
demonstrate competitive performance.
Figure 3.0 shows training/validation losses
and training/ validation accuracies for both
models. It can be observed that the EC-CNN
losses and accuracies peak and stabilize
after 25 epochs. The CNN on other hand has
its losses and accuracies peaking after 40
epochs but do not exhibit clear stability. It is
possible that the CNN requires additional
training time to learn. However, it should be
noted that CNNs are susceptible to over
training, resulting in poor performance on
unseen data.</p>
      <p>a)
b)
generalizability of the models on an unseen
dataset.</p>
      <p>Our approach to probing the performance of
the models in the classification of the
various landcover categories surprisingly
reveals that the texture-based model
prediction of the landcover types is visually
intuitive and interpretive; in fact, visual
inspection shows that the model has learned
relevant feature maps about lichen
composition and configuration, and that it is
easier for users to link, for example, what
the model thinks is lichen in the original
map to the class activation maps extracted
from layer three (cov3) of EC-CNN model.
This implies the model predictions may be
more robust and accurate despite the
competing performance seen in the CNN. In
Figure 5.0, class activation maps (CAM)
extracted from lichen filters found in the
models (cov3 layers) shed light on the
patterns learned by each model to
discriminate lichen. It can be observed that
the EC-CNN features are denser than the
CNN features which exhibit sparsity and
smoothness.
a)</p>
      <p>Original Maps</p>
      <p>EC-CNN
c)</p>
      <p>CNN
b)
d)</p>
      <p>Prediction agreement
Green Vegetagtion
Color Vegetation
Lichen</p>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusion</title>
      <p>
        In this paper, we demonstrate the
robustness of texture-based CNN in a lichen
dominated tundra ecosystem classification
task. Using gradient-based class activation
maps as in
        <xref ref-type="bibr" rid="ref13">(Selvaraju et al., 2017)</xref>
        , we show
that our texture-encoded CNN can be more
effective and less error prone in
classification problems as its CAMs are
more intuitive and visually depict a
particular landcover class the model
predicts. Further research is essential to
testing the accuracy of the texture model on
benchmark texture datasets. It is also worth
exploring the performance of different
architectures of the texture model. The
effectiveness of texture-based features in
change detection and landscape similarity
search is another research area worth
investigating.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>The authors acknowledge
supporting this research.
NSERC for</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Gonz</surname>
            ,
            <given-names>M. C.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Using Convolutional Networks and Satellite Imagery to Identify Pa erns in Urban Environments at</article-title>
          a Large Scale,
          <fpage>1357</fpage>
          -
          <lpage>1366</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Andrearczyk</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Whelan</surname>
            ,
            <given-names>P. F.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Using filter banks in Convolutional Neural Networks for texture classification R</article-title>
          .
          <source>Pattern Recognition Letters</source>
          ,
          <volume>84</volume>
          ,
          <fpage>63</fpage>
          -
          <lpage>69</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ganguly</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mukhopadhyay</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>DiBiano</given-names>
            , R.,
            <surname>Karki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            , &amp;
            <surname>Nemani</surname>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>DeepSat-A Learning framework for Satellite Imagery</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Basu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mukhopadhyay</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>DiBiano</given-names>
            , R.,
            <surname>Ganguly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Nemani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            , &amp;
            <surname>Gayaka</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Deep neural networks for texture classification-A theoretical analysis</article-title>
          .
          <source>Neural Networks</source>
          ,
          <volume>97</volume>
          ,
          <fpage>173</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Cimpoi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maji</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Vedaldi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>Deep filter banks for texture recognition and segmentation</article-title>
          .
          <source>Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <source>2015 IEEE Conference On</source>
          ,
          <fpage>3828</fpage>
          -
          <lpage>3836</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Gatys</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ecker</surname>
            ,
            <given-names>A. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Bethge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <source>Texture Synthesis Using Convolutional Neural Networks</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Gong</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lazebnik</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Multi-Scale Orderless Pooling of Deep Convolutional Activation Features Yunchao</article-title>
          .
          <source>In European Conference on Computer Vision</source>
          ,
          <fpage>392</fpage>
          -
          <lpage>407</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srivastava</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krizhevsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakhutdinov</surname>
            ,
            <given-names>R. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yosinski</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lipson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Improving neural networks by preventing co-adaptation of feature detectors</article-title>
          ,
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Jacobs</surname>
            ,
            <given-names>D. E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Goldman</surname>
            ,
            <given-names>D. B.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Cosaliency : Where People Look When Comparing Images</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Lloyd</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berberoglu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curran</surname>
            ,
            <given-names>P. J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Atkinson</surname>
            ,
            <given-names>P. M.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>A comparison of texture measures for the per-field classification of Mediterranean land cover</article-title>
          .
          <source>International Journal of Remote Sensing</source>
          ,
          <volume>25</volume>
          (
          <issue>19</issue>
          ),
          <fpage>3943</fpage>
          -
          <lpage>3965</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Nair</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G. E.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Rectified Linear Units Improve Restricted Boltzmann Machines</article-title>
          .
          <source>Proceedings of the 27th International Conference on Machine Learning, (3)</source>
          ,
          <fpage>807</fpage>
          -
          <lpage>814</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Very deep convolutional networks for largescale image recognition</article-title>
          .
          <source>arXiv preprint arXiv:1409</source>
          .
          <fpage>1556</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Selvaraju</surname>
            ,
            <given-names>R. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cogswell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vedantam</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parikh</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Batra</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization</article-title>
          .
          <source>Proceedings of the IEEE International Conference on Computer Vision</source>
          , 2017- Octob,
          <fpage>618</fpage>
          -
          <lpage>626</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Ustyuzhaninov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brendel</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gatys</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Bethge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Texture Synthesis Using Shallow Convolutional Networks with Random Filters, 1-9</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>