<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Heuristic Architecture Search Using Network Morphism for Chest X-Ray Classification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Khmelnytskyi National University</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Instytuts'ka str.</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Khmelnytskyi</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ukraine</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Karabük University, Kılavuzlar/Karabük Merkez/Karabük</institution>
          ,
          <addr-line>78050</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
      </contrib-group>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>Nowadays, the demand for medical image computing is exceptionally high. This growth was mostly driven by the manual development of machine learning models, in particular neural networks. However, due to the constant evolution of domain requirements, manual model development has become insufficient. The present study proposes a heuristic architecture search that can be in an excellent service for the task of medical image classification. We implemented a novel approach called network morphism to the search algorithm. The proposed search method utilizes the enforced hill-climbing algorithm and functional-saving modifications. As a result of computational experiments, the search method found the optimal architecture in 28 GPU hours. The model formed by the found architecture achieved performance of 73.2% in validation accuracy and 84.5% in AUC on the validation dataset that is competitive to the state-of-the-art hand-crafted networks. Moreover, the proposed search method managed to find the architecture that contains four times fewer parameters. Besides, the model requires almost ten times less physical memory, which may indicate the practical usefulness of our method in medical image analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>heuristic search</kwd>
        <kwd>neural architecture search</kwd>
        <kwd>network morphism</kwd>
        <kwd>medical image classification</kwd>
        <kwd>Chest X-Ray</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Over the past decade, there has been a dramatic increase in the manual development
of deep convolutional neural network (DCNN) architectures that show significant
results in image processing. The most recognized hand-crafted DCCNs are VGG [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
ResNet [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and DenseNet [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. These networks were customized to solve
multiclassification tasks, e.g., the Imagenet classification benchmark [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] with 1000 classes.
      </p>
      <p>
        In contrast, practical tasks usually require models that can perform well on various
data, either small or large images, with two or many classes, based on numerous or
few training data. For instance, state-of-the-art architectures [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1–3</xref>
        ] show excellent
performance in image classification tasks with many classes. However, these
networks contain a significant number of parameters that make them heavy in terms of
memory and training time and, therefore, redundant for many practical tasks.
Moreover, different cases may involve various features in the implemented models. Medical
image processing is a representative example of a real challenge that requires both
flexibility and reliability in classification tasks.
      </p>
      <p>
        Automated machine learning (AutoML) and its branch neural architecture search
(NAS) might be an excellent solution to the mentioned-above issues. To date,
AutoML and NAS methods have been actively developed and employed in various
fields. For instance, NAS allows deploying neural networks into classification and
segmentation tasks with little domain expertise and small datasets. The most
recognized NAS approaches employ search strategies based on genetic algorithms [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
reinforcement learning techniques [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Bayesian optimization [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], gradient-based
methods [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. According to recent reviews [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ], the most promising in terms of
efficiency are architecture search methods based on genetic algorithms.
      </p>
      <p>Despite significant advances in NAS, its impact on the practical tasks in healthcare
is yet not clear. There is still much uncertainty about the efficiency of NAS
approaches in medical screening. To address this issue, the present study examines the
application of automated search methods to find optimal neural architecture medical image
classification.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Works</title>
      <p>
        Genetic or so-called evolutionary algorithms have significantly advanced and
expanded the use of AutoML. In the field of NAS, genetic algorithms are designed to
construct new sets of neural architectures, called population [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The algorithm starts
with establishing underlying networks, either random or predefined. Each structure is
then trained and evaluated based on the requirements of a particular task, such as
image classification or segmentation. After that, the most suitable network may serve
as a parent for further search. At the following stage, the algorithm creates offspring
by implementing modifications (mutations) in the structure of the parent network. The
algorithm ends when new adjustments cannot be applied to an offspring as it reaches
the best performance in the task.
      </p>
      <p>
        Evolutionary algorithms have become particularly popular due to the use of a vast
search space, which has considerably improved NAS techniques. Even though genetic
algorithms allow achieving decent results in NAS [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], neural models based on
evolutionary search require to train an enormous number of architectures [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] that lead to
significant time expenses [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Also, many real-world applications cannot afford high
computational cost due to technical limitations. Thus, the development of methods
that would be effective at computational and time constraints is a relevant task at
present.
      </p>
      <p>
        Over the past years, researchers have proposed various methods to solve primary
drawbacks of any NAS methods: significant time costs and the considerable weight of
an output network. For example, Veniat et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] applied the gradient descent
method to the budgeted learning function, which includes the maximum allowable cost.
They could train a neural network capable of predicting well in less than 100
milliseconds on both CIFAR10 and CIFAR100 datasets. In [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], Li et al. presented a
pruning method to cut search space by including profile information about the output
speed on the target dataset. This approach was able to provide an automated
architecture search with an excellent compromise in speed and accuracy on the ImageNet
dataset. In [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], Tan et al. suggested a scaling technique that uniformly expanded all
dimensions using a compound coefficient. Also, the authors rescaled MobileNets and
ResNet up to obtain a family of efficient models. This approach showed decent results
in the relevant transfer training stands.
      </p>
      <p>
        Other researchers moved in a slightly different direction. In [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], Wei et al.
systemized preserved functions and introduced a set of parametric operations that could
enhance the morphing of any continuous nonlinear activation neurons. Their approach
showed decent results on the CIFAR10 and CIFAR100 datasets. In [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], Elsken et al.
evolved network morphism by combining it with a simple hill-climbing search
algorithm. Furthermore, the authors conducted optimization runs by cosine annealing after
each operation. This method achieved a stunning 94% validation accuracy in only 12
hours on a single GPU on the CIFAR10 dataset. Gordon et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] presented a novel
framework called MorphNet that could iteratively shrink and expand a neural
network. Their method was adaptable to specific resource constraints and could improve
network performance on different data sets.
      </p>
      <p>
        Several studies have addressed the efficiency of NAS on different medical datasets.
Gessert et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] proposed an efficient NAS by subsequently transferring
lowdimensional data to high-dimensional one for OCT image segmentation. The authors
achieved an 87.5% reduction in search time on one-dimensional data, compared to
two-dimensional data. In [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], the authors proposed a NAS framework based on
particle swarm optimization technique that could temporally evolve and finally
converged to a feasible optimal architecture. The framework ensured robust architecture
design having been trained on a single GPU card and tested on the volumetric fMRI
data. Kwasigroch et al. [22] employed network morphism operations to the evolution
strategy in order to diversify the exploring network without reducing validation
accuracy. Such an approach provided a significant reduction in computational cost on the
skin lesion dataset.
      </p>
      <p>From the analysis of the literature, we assumed that network morphism could be an
excellent solution to reduce the running time of the search method and facilitate the
output model. Therefore, in this work, we apply network morphism to a genetic
search for Chest X-ray classification.
3</p>
    </sec>
    <sec id="sec-3">
      <title>The Problem Statement</title>
      <p>The main goal of the present research is to investigate the efficiency of the network
morphism approach in the medical classification task. To achieve the goal, the
following tasks are due to be resolved:
1. To consider network morphism and adjust it to a genetic algorithm.
2. To select an appropriate dataset of medical images for the multiclassification task.
3. To construct a CNN as a baseline architecture and investigate its impact on the
output architecture.
4. To implement a heuristic algorithm based on network morphism to search for an
optimal neural architecture.
5. To evaluate an optimal architecture found automatically with objective metrics and
compare it with state-of-the-art manual networks on the medical image dataset.
In this paper, we address the issue of searching for the optimal neural network
architecture for the target data set D = {Dtran , Dtest , Dval } , where Dtran is a training dataset,
Dtest stands for a test dataset, and Dval is a validation dataset. Let us consider the
target problem as a two-level multipurpose optimization problem. The function of
multiobjective bilevel optimization can be presented in a general form as follows:
subject to
aopt = min {Lval ( A, w* ) , C ( A)}</p>
      <p>A∈A
w* ∈ arg min {Ltran ( w, a)} ,
w∈W
(1)
where aopt represents the final architecture among optimized ones A from the search
space of all possible architectures of A , C ( A) is the function of the complexity of
the optimized architecture and other related values, w stands for the weights of the
neural network from the weight space of W , Ltran and Lval are loss functions on</p>
      <sec id="sec-3-1">
        <title>Dtran and Dval , respectively.</title>
        <p>Below, we describe the proposed NAS method that was designed and assessed step
by step by the conceptual framework from [23].
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Heuristic Architecture Search Algorithm</title>
      <p>
        In this study, we investigate a heuristic architecture search inspired by network
morphism operators proposed in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The morphism allows modifying a neural
network without losing the obtained information about network structure and its
hyperparameters. Created by the use of morphism, neural architectures can achieve
performance measures as their predecessors but concurrently show encouraging
computational capabilities. Fig. 1 illustrates the idea behind network morphism.
Fig. 1. The procedure of network morphism [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. According to the algorithm, the child network
will inherit all knowledge (A–F) from the parent network while maintaining the network
function. A combination of morphing modifications can provide diverse network morphism.
In contrast, our paper applied a morphism algorithm to the classification task with
large medical images. In this regard, we implemented various functional-saving
operations into the algorithm to achieve the best training performance.
4.1
      </p>
      <sec id="sec-4-1">
        <title>Functional-Saving Modifications</title>
        <p>
          Network morphism allows avoiding training of each new network from scratch.
Thereby, researchers can test many models in a short time. In this study, we applied
several modifications to ensure model efficiency. Concatenation operations and add
nodes serve as in [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Convolutional layers were expanded by the morphism
functions, according to [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. However, in contrast with previous studies, we implemented
a combination of an additional operational layer with an extra node to provide skip
connection in training. Also, we applied a new modification by inputting noise into
the weights of the newly generated network after all previously mentioned
modifications. Besides the introduced operations, symmetry disruption in the neural network
parameter sequence can also lead to significant performance improvements in the
NAS algorithm.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>The Enforced Hill-Climbing Search</title>
        <p>In this section, we briefly describe a heuristic search that executes the task (1). As a
search strategy, we utilized a genetic algorithm called the enforced hill-climbing
search [23]. Fig. 2 depicts the scheme of the search.</p>
        <p>
          The algorithm begins with a data D input. Then, a pre-trained original neural network
serves as a baseline architecture for climbing a so-called hill. In response to the
application of functional-saving modifications, the original network produces a certain
number of descendants. These modifications ensure that each descendant performs the
same as its parent network. Moreover, due to the genetic algorithm, the descendants
suit better for training than their parents. Each descendant is trained on a small
number of epochs and then evaluated by a precision measure – the best descendent moves
to the next step, where they form new descendants. The search procedure ends after a
certain number of epochs. The algorithm presents the so-called best architecture aopt ,
which then serves for final long-term training. Fig. 3 demonstrates a heuristic
architecture search based on enforced hill-climbing algorithm inspired by [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]:
function HAS( model0 , nsteps , nneigh , nmorph , epochneigh , epochfinal , lrate )
# model0 – initial model, nsteps – number of hill-climbing steps;
# nneigh – number of neighbors, nmorph – number of func.-saving
# modifications, epochneigh – number of epochs to train each neighbor,
# epochfinal – number of epochs for final training,
# lrate – value of learning rate during model optimization.
modelbest := model0
# start enforced hill climbing
for i:= 1,nsteps do
# modifications to modelbest
for j:= 1,nneigh - 1 do
# get nneigh neighbors of model0 by applying nmorph func.-saving
modelj := ApplyFuncSav(modelbest , nmorph )
# train the model for several epochs on the training dataset
modelj := Train(modelj , epochneigh , lrate )
end for
# in case the last neighbor is the best
modelnneigh := Train(modelbest , epochneigh , lrate )
# receive the best model on the validation dataset
modelbest := argmax{peformanceval (modelj )}
        </p>
        <p>j=1,…,nneigh
end for
# train the final model both on training and validation datasets
modelbest := Train(modelbest , epochneigh , lrate )
return modelbest
end function
The algorithm does not have to choose a new model at each iteration, and can also
hold the one from the previous step if others do not improve. Consequently, the
current best model at iteration may also be considered a child network. It is assumed that
epochneigh should be small, as the algorithm is forced to train numerous networks.
Thus, to check the possibility of overfitting, we conducted two experiments with
small and large numbers of epochs.</p>
        <p>The algorithm presented above is a straightforward implementation of the
hillclimbing method. It can be interpreted as a simple genetic algorithm with one
organism, population size of nneigh , and without crossover. While climbing, the
functionalsaving modifications of network morphism serve as mutations. Besides, the selection
part considers only members of the population with the best performance as a parent
for the next generation.
5
5.1</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Implementation Details</title>
      <sec id="sec-5-1">
        <title>Benchmark Dataset and Data Augmentation</title>
        <p>To evaluate the proposed heuristic search, we chose the CheXpert benchmark dataset
[25]. The whole dataset comprises 224,316 chest radiographs with a size of 320×320
pixels excluded from 65,240 patients. The images are labeled for 14 diseases as
negative, positive, or uncertain. In the original dataset, positive and negative cases were
marked as ones, dubious images – as zeros. Fig. 4 presents an example of the
CheXpert dataset.</p>
        <p>For the experiment, we formed a subgroup of five diseases, namely atelectasis,
cardiomegaly, consolidation, edema, and pleural effusion. Therefore, the number of classes
was set to five. We split the subset into 70% training, 20% testing, and 10%
validation images. Furthermore, several data augmentation techniques, such as random
flips, translations, and rotations, were applied to the pictures of the training dataset.
Here, we describe the baseline architecture, which serves as an original network for
further optimization. According to the recent comprehensive overviews [26,27],
CNNs are the type of neural architectures that suite the best for the classification of
medical images. Therefore, this work is devoted to applying heuristic architecture
search only to CNNs. Hence, guided by [28,29], a small baseline convolutional
architecture was represented as follows</p>
        <p>Input → {16 ⋅ 2i−1 × ConvLi →</p>
        <p>MPLi }i=1 → 128 × ConvL → SML ,
3
where ConvL stands for convolutional layer, from 16 to 128 filters of size 3×3 and
stride 1, ReLU is an activation function, MPL stands for MaxPool layer of size 2×2
and stride 2, and SML represents SoftMax function for the probability distribution of
the output result.</p>
        <p>The large-scale convolutional architecture was set as
(2)
(3)
Input → {16 ⋅ 2i−1 × ConvLi →</p>
        <sec id="sec-5-1-1">
          <title>ReLUi →</title>
          <p>MPLi }i=1
3
→ 128 × ConvL →</p>
          <p>BN →</p>
          <p>ReLU → SML,
where BN represents batch normalization. All other parts of architecture (2) denote
the corresponding elements in the network (1).
5.3</p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>Parameters Setup</title>
        <p>In this section, we describe the setup of the training parameters. In the hill-climbing
algorithm, we set the number of search epochs of 10, the number of organisms in the
population of 10, the number of epochs for training each element of 5, the number of
mutations of 5. We also limited the possible size of the model. If the sample model
became too large, the agent would reject the investigated architecture and choose
another one.</p>
        <p>For the training procedure, we employed Adam optimization method with the
learning rate of 10−3 , the weight decay of 0.5 ⋅10−3 , the momentum of 0.9, and a
batch size of 256. According to experimental results in [30], this setup of training
parameters can assure excellent model approximation in training. The original
network was pre-trained by ten epochs on one fold of the training dataset, while the final
architecture was optimized by one hundred runs on each fold from scratch. Overall,
the training was performed five times, and the milestone results were averaged.</p>
        <p>All experiments were performed in Python v3.6, using the TensorFlow v.1.13
backend [31]. The hardware setup consists of 8 core Ryzen 2700 and a single
NVIDIA GeForce GTX1080 GPU with 8 GB memory. The working code is
opensourced and available by [32].
5.4</p>
      </sec>
      <sec id="sec-5-3">
        <title>Evaluation Criteria</title>
        <p>In this study, we evaluate the output architecture and compare it with other networks
by several statistical measures, which are recall (REC), precision (PREC), accuracy
(ACC), and area under the curve (AUC). Let us consider the number of real positive
(P) and real negative (N) cases in the data. As it is known from the theory of statistics
[33], the classification results are distributed as true positive (TP), true negative (TN),
false positive (FP) and false negative (FN) cases. Thus, the evaluation metrics used in
this study are as follows</p>
        <p>REC =
PREC =</p>
        <p>TP
TP + FN</p>
        <p>TP
TP + FN
,
,
ACC =</p>
        <p>TP + TN
TP + TN + FP + FN
.</p>
        <p>For binary classification, AUC is set as</p>
        <p>A = 1  FP2
2  FP2 − TN2
−</p>
        <p>FP1  ×  TP2
FP1 − TN1   TP2 − FN2
−</p>
        <p>TP1  .</p>
        <p>TP1 − FN1 
In the case of five classes, AUS is as follows
(4)
(5)
(6)
(7)
5
AUC5 = ∑µ (ci ) p (ci ) ,</p>
        <p>i=1
where µ (ci ) stands for AUC under the ROC curve of class ci , p (ci ) is the prior
probability of class ci .</p>
        <p>Also, the different architectures were evaluated and compared by network size,
number of parameters, number of training epochs, and training time.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Experiments and Results</title>
      <p>In this section, we describe how the initial type of network can influence the results of
the heuristic architecture search. Also, we compare the result of our approach with
two different numbers of epochs to the set of hand-crafted architectures.
6.1</p>
      <sec id="sec-6-1">
        <title>Impact of a Baseline Architecture on the Search Process</title>
        <p>The conducted experiments revealed that selecting a small original network lead to a
lengthy search process. Moreover, small network (2) required numerous mutations,
and thus, the time to evolve into an optimal architecture. In contrast, a large-scale
architecture (3) could limit the search space with large structures, neglecting smaller
ones that might also be suitable for search.</p>
        <p>We conducted two representative experiments to investigate the above-mentioned
hypothesis. In the first run, we checked the small original network (2), in the second –
the large one (3). Fig. 5 shows a comparison of the results of two possible original
architectures.</p>
        <p>
          According to Fig. 5, a large-scale network provides higher model accuracy.
In this section, we examine the final architecture optimized by our NAS algorithm.
We employed three hand-crafted convolutional networks commonly used in medical
image tasks. These networks are VGG19 [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. Inception v4 [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and DenseNet [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. The
final network generated by the search algorithm can be observed via [32].
        </p>
        <p>We trained the networks on the selected dataset and compared them with the final
architecture of our algorithm. Besides, we conducted two separate experiments with
different numbers of epochs to investigate the probability of overfitting. All networks
were evaluated by metrics (4)–(7). The hyperparameters remained the same. Table 1
presents averaged evaluation results of k-fold validation ( k = 5 ).
According to Table 1, the architecture found by our heuristic search demonstrates
competitive results to state-of-the-art models in the classification tasks. The NAS
architecture comprises various branches such as concatenates, skip nodes, adds. In
contrast, hand-crafted architectures usually lack additional layers as it requires
numerous experiments. While most state-of-the-art models have a regular structure, that
is, they consist of multiple blocks that are repeated in the architecture, the NAS
architecture has a structure without a noticeable repeating pattern. This approach allows for
creating flexible architectures for different datasets. Table 2 reveals more details of
the comparison.
The proposed NAS algorithm allowed finding the optimal architecture in 28 GPU
hours, while the manual search can require weeks of tedious attempts and
experiments. Furthermore, the final model fewer number of parameters with sufficiently
high accuracy compared to the state-of-the-art. Thus, the model requires less physical
memory and could ensure efficient use in practice.
7</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Discussion and Conclusion</title>
      <p>In this study, we proposed a heuristic algorithm of neural architecture search in
medical image analysis. To investigate the issue of medical image classification, we
employed the CheXpert benchmark dataset and considered a multiclassification task. The
core of the search method was the enforced hill-climbing algorithm enhanced with
network morphism.</p>
      <p>Firstly, we composed and examined two hand-crafted CNNs as baseline
architecture. As a result of numerical experiments, the large-scale network turned out to be a
better solution. Secondly, we implemented a heuristic search with network morphism
modifications in order to find optimal neural architecture. To check if the final
network was subjected to overfitting, we conducted two separate experiments with
numbers of epochs 21 and 69. As a result, even taking into account the almost three-fold
difference between the number of epochs, both statistical indicators and
computational costs and weight were approximately equal. This outcome could occur due to either
the use of one organism within the algorithm or lack of crossover. The authors aim to
investigate this knowledge gap in future work.</p>
      <p>We evaluated the final architecture found automatically with an objective statistical
metrics and compared it with recognized hand-crafted networks on CheXpert dataset.
Our search algorithm managed to find the optimal architecture in total in 28 GPU
hours. The optimized architecture in the second experiment achieved the performance
of 73.2% in accuracy and 84.5% in AUC on the validation dataset, yielding the
competitive results to the state-of-the-art hand-crafted networks. Moreover, the optimized
model contained four times fewer parameters and required almost ten times less
physical memory compared to other networks. In summary, these results could indicate the
practical usefulness of the proposed heuristic search method.</p>
      <p>Further work needs to be done to determine the influence of sensitivity and
specificity on the proposed heuristic search. A reasonable approach to tackle this issue
could be the inclusion of diverse medical datasets of both X-Ray images and
computer tomography scans. Further research should also focus on the optimization of
hyperparameters within the heuristic architecture search.
22. Kwasigroch, A., Grochowski, M., Mikołajczyk, A.: Neural architecture search for skin
lesion classification. IEEE Access. 8, 9061–9071 (2020).</p>
      <p>doi:10.1109/ACCESS.2020.2964424
23. Radiuk, P.M., Hrypynska, N.V.: A framework for exploring and modeling neural
architecture search methods. Paper presented at the 4th International Conference on Computational
Linguistics and Intelligent Systems (COLINS-2020), Lviv, Ukraine, 23–24 April 2020.</p>
      <p>CEUR-Workshop Proceedings, vol. 2604, pp. 1060–1074. CEUR-WS.org (2020)
24. Edelkamp, S., Schrödl, S.: Memory-restricted search. In: Edelkamp, S. and Schrödl,
S.B.T.-H.S. (eds.) Heuristic Search: Theory and Applications. pp. 227–281. Morgan
Kaufmann, San Francisco (2012). doi:10.1016/B978-0-12-372512-7.00006-7
25. Irvin, J., et al.: CheXpert: A large chest radiograph dataset with uncertainty labels and
expert comparison. In: Proceedings of the 33d AAAI Conference on Artificial Intelligence
2019 (AAAI-2019), Honolulu, Hawaii, USA, 27 January – 1 February 2019. vol. 33(1),
pp. 590–597. Association for the Advancement of Artificial Intelligence (2019).
doi.10.1609/aaai.v33i01.3301590
26. Shen, J., Zhang, C.J.P., Jiang, B., Chen, J., Song, J., Liu, Z., He, Z., Wong, S.Y., Fang,
P.H., Ming, W.-K.: Artificial intelligence versus clinicians in disease diagnosis: A
systematic review. JMIR Med. Informatics. 7(3), e10010 (2019). doi:10.2196/10010
27. Kim, M., Yan, C., Yang, D., Wang, Q., Ma, J., Wu, G.: Deep learning in biomedical image
analysis. In: Feng, D.D. (eds.) Biomedical Information Technology. pp. 239–263. San
Diego, Elsevier (2020). doi:10.1016/B978-0-12-816034-3.00008-0
28. Romanuke, V.V.: An attempt of finding an appropriate number of convolutional layers in
CNNs based on benchmarks of heterogeneous datasets. Electr. Control Commun. Eng.
14(1), 51–57 (2018). doi:10.2478/ecce-2018-0006
29. Romanuke, V.V.: Smooth non-increasing square spatial extents of filters in convolutional
layers of CNNs for image classification problems. Appl. Comput. Syst. 23(1), 52–62
(2018). doi:10.2478/acss-2018-0007
30. Radiuk, P.M.: Impact of training set batch size on the performance of convolutional neural
networks for diverse datasets. Inf. Technol. Manag. Sci. 20(1), (2017).</p>
      <p>doi:10.1515/itms-2017-0003
31. Abadi, M., et al.: TensorFlow: A system for large-scale machine learning. In: Keeton, K.
and Roscoe, T. (eds.) Proceedings of 12th USENIX Symposium on Operating Systems
Design and Implementation (OSDI-2016), Savannah, GA, USA, 2–4 November 2016.
pp. 265–283. USENIX Association (2016)
32. Heuristic NAS. GitHub, Inc. https://github.com/soolstafir/Heuristic-NAS (2020). Accessed
27 Apr 2020
33. Fawcett, T.: An introduction to ROC analysis. Pattern Recognit. Lett. 27(8), 861–874
(2006). doi:10.1016/j.patrec.2005.10.010</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Very deep convolutional networks for large-scale image recognition</article-title>
          .
          <source>Paper presented at the 3rd International Conference on Learning Representations (ICLR-2015)</source>
          , San Diego, CA, USA,
          <fpage>7</fpage>
          -9 May 2015
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ioffe</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vanhoucke</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alemi</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Inception-v4, Inception-ResNet and the impact of residual connections on learning</article-title>
          .
          <source>In: Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI-2017)</source>
          , San Francisco, California, USA,
          <fpage>4</fpage>
          -
          <issue>10</issue>
          <year>February 2017</year>
          . pp.
          <fpage>4278</fpage>
          -
          <lpage>4284</lpage>
          . AAAI Press (
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .5555/3298023.3298188
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Der Maaten</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weinberger</surname>
            ,
            <given-names>K.Q.</given-names>
          </string-name>
          :
          <article-title>Densely connected convolutional networks</article-title>
          .
          <source>In: Proceedings of the 30th IEEE Conference on Computer Vision</source>
          and
          <article-title>Pattern Recognition (CVPR-</article-title>
          <year>2017</year>
          ), Honolulu,
          <string-name>
            <surname>HI</surname>
          </string-name>
          , USA,
          <fpage>21</fpage>
          -
          <issue>26</issue>
          <year>July 2017</year>
          . pp.
          <fpage>2261</fpage>
          -
          <lpage>2269</lpage>
          . IEEE Inc.
          <article-title>(</article-title>
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .1109/CVPR.
          <year>2017</year>
          .243
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fei-Fei</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>ImageNet: A large-scale hierarchical image database</article-title>
          .
          <source>In: Proceedings of 2009 IEEE Conference on Computer Vision</source>
          and
          <article-title>Pattern Recognition (CVPR-</article-title>
          <year>2009</year>
          ), Miami, FL, USA,
          <fpage>20</fpage>
          -
          <lpage>25</lpage>
          June 2009. pp.
          <fpage>248</fpage>
          -
          <lpage>255</lpage>
          . IEEE Inc.
          <article-title>(</article-title>
          <year>2009</year>
          ). doi:
          <volume>10</volume>
          .1109/CVPR.
          <year>2009</year>
          .5206848
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Angeline</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saunders</surname>
            ,
            <given-names>G.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pollack</surname>
            ,
            <given-names>J.B.</given-names>
          </string-name>
          :
          <article-title>An evolutionary algorithm that constructs recurrent neural networks</article-title>
          .
          <source>IEEE Trans. Neural Networks</source>
          .
          <volume>5</volume>
          (
          <issue>1</issue>
          ),
          <fpage>54</fpage>
          -
          <lpage>65</lpage>
          (
          <year>1994</year>
          ). doi:
          <volume>10</volume>
          .1109/72.265960
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Bello</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zoph</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vasudevan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          <article-title>V: Neural optimizer search with reinforcement learning</article-title>
          . In: Precup,
          <string-name>
            <given-names>D.</given-names>
            and
            <surname>Teh</surname>
          </string-name>
          , Y.W. (eds.)
          <source>Proceedings of the 34th International Conference on Machine Learning (ICML-2017)</source>
          , Sydney, Australia,
          <fpage>6</fpage>
          -
          <issue>11</issue>
          <year>August 2017</year>
          . vol.
          <volume>70</volume>
          , pp.
          <fpage>459</fpage>
          -
          <lpage>468</lpage>
          . JMLR.org (
          <year>2017</year>
          ). doi:
          <volume>10</volume>
          .5555/3305381.3305429
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Kandasamy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neiswanger</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schneider</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Póczos</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xing</surname>
            ,
            <given-names>E.P.</given-names>
          </string-name>
          :
          <article-title>Neural architecture search with bayesian optimisation and optimal transport</article-title>
          . In: Bengio,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Wallach</surname>
          </string-name>
          , H.M. (eds.)
          <source>Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS-2018)</source>
          , Montreal, QC, Canada,
          <fpage>3</fpage>
          -
          <issue>8</issue>
          <year>December 2018</year>
          . pp.
          <fpage>2020</fpage>
          -
          <lpage>2029</lpage>
          . Curran Associates Inc. (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .5555/3326943.3327130
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>DARTS: Differentiable architecture search</article-title>
          .
          <source>Paper presented at the 7th International Conference on Learning Representations (ICLR-2019)</source>
          , New Orleans, LA, USA,
          <fpage>6</fpage>
          -9 May 2019
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kerschke</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoos</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trautmann</surname>
          </string-name>
          , H.:
          <article-title>Automated algorithm selection: Survey and perspectives</article-title>
          .
          <source>Evol. Comput</source>
          .
          <volume>27</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>45</lpage>
          (
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .1162/evco_a_
          <fpage>00242</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Elsken</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzen</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hutter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Neural architecture search: A survey</article-title>
          .
          <source>J. Mach. Learn. Res</source>
          .
          <volume>20</volume>
          (
          <issue>55</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Radiuk</surname>
            ,
            <given-names>P.M.:</given-names>
          </string-name>
          <article-title>Neuroevolution of convolutional neural networks for the classification of lung cancer images</article-title>
          .
          <source>Her. Khmelnytskyi Natl. Univ</source>
          .
          <volume>267</volume>
          (
          <issue>6</issue>
          ),
          <fpage>188</fpage>
          -
          <lpage>192</lpage>
          (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .31891/
          <fpage>2307</fpage>
          -5732-2018-267-
          <issue>6</issue>
          (
          <issue>2</issue>
          )-188-192
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Floreano</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dürr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mattiussi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Neuroevolution: From architectures to learning</article-title>
          .
          <source>Evol. Intell</source>
          .
          <volume>1</volume>
          ,
          <fpage>47</fpage>
          -
          <lpage>62</lpage>
          (
          <year>2008</year>
          ).
          <source>doi:10.1007/s12065-007-0002-4</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Chugh</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sindhya</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hakanen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miettinen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A survey on handling computationally expensive multiobjective optimization problems with evolutionary algorithms</article-title>
          .
          <source>Soft Comput</source>
          .
          <volume>23</volume>
          (
          <issue>9</issue>
          ),
          <fpage>3137</fpage>
          -
          <lpage>3166</lpage>
          (
          <year>2019</year>
          ).
          <source>doi:10.1007/s00500-017-2965-0</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Veniat</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Denoyer</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Learning time/memory-efficient deep architectures with budgeted super networks</article-title>
          .
          <source>In: Proceedings of 2018 IEEE/CVF Conference on Computer Vision</source>
          and
          <article-title>Pattern Recognition (CVPR-</article-title>
          <year>2018</year>
          ), Salt Lake City,
          <string-name>
            <surname>UT</surname>
          </string-name>
          , USA,
          <fpage>18</fpage>
          -
          <lpage>23</lpage>
          June 2018. pp.
          <fpage>3492</fpage>
          -
          <lpage>3500</lpage>
          . IEEE Inc.
          <article-title>(</article-title>
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .1109/CVPR.
          <year>2018</year>
          .00368
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
          </string-name>
          , J.:
          <article-title>Partial order pruning: for best speed/accuracy trade-off in neural architecture search</article-title>
          .
          <source>In: Proceedings of 2019 IEEE/CVF Conference on Computer Vision</source>
          and
          <article-title>Pattern Recognition (CVPR-</article-title>
          <year>2019</year>
          ), Long Beach, CA, USA, USA,
          <fpage>15</fpage>
          -
          <lpage>20</lpage>
          June 2019. pp.
          <fpage>9137</fpage>
          -
          <lpage>9145</lpage>
          . IEEE Inc.
          <article-title>(</article-title>
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .1109/CVPR.
          <year>2019</year>
          .00936
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Tan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          :
          <article-title>EfficientNet: Rethinking model scaling for convolutional neural networks</article-title>
          . In: Chaudhuri,
          <string-name>
            <given-names>K.</given-names>
            and
            <surname>Salakhutdinov</surname>
          </string-name>
          , R. (eds.)
          <source>Proceedings of the 36th International Conference on Machine Learning (ICML-2019)</source>
          , Long Beach, California, USA,
          <fpage>10</fpage>
          -
          <lpage>15</lpage>
          Jun
          <year>2019</year>
          . vol.
          <volume>97</volume>
          , pp.
          <fpage>6105</fpage>
          -
          <lpage>6114</lpage>
          . PMLR Inc.
          <article-title>(</article-title>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rui</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>C.W.</given-names>
          </string-name>
          :
          <article-title>Network morphism</article-title>
          . In: Balcan,
          <string-name>
            <given-names>M.F.</given-names>
            , and
            <surname>Weinberger</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.Q</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the 33rd International Conference on Machine Learning (ICIML-2016)</source>
          , New York City, NY, USA,
          <fpage>19</fpage>
          -
          <lpage>24</lpage>
          Jun
          <year>2016</year>
          . vol.
          <volume>48</volume>
          , pp.
          <fpage>564</fpage>
          -
          <lpage>572</lpage>
          . JMLR.org (
          <year>2016</year>
          ). doi:
          <volume>10</volume>
          .5555/3045390.3045451
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Elsken</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzen</surname>
            ,
            <given-names>J.-H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hutter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Simple and efficient architecture search for convolutional neural networks</article-title>
          .
          <source>Paper presented at the 6th International Conference on Learning Representations (ICLR-2018)</source>
          , Vancouver, BC, Canada,
          <volume>30</volume>
          <fpage>April</fpage>
          - 3 May 2018
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Gordon</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eban</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nachum</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
          </string-name>
          , E.:
          <article-title>MorphNet: Fast &amp; simple resource-constrained structure learning of deep networks</article-title>
          .
          <source>In: Proceedings of 2018 IEEE/CVF Conference on Computer Vision</source>
          and
          <article-title>Pattern Recognition (CVPR-</article-title>
          <year>2018</year>
          ), Salt Lake City,
          <string-name>
            <surname>UT</surname>
          </string-name>
          , USA,
          <fpage>18</fpage>
          -
          <lpage>23</lpage>
          June 2018. pp.
          <fpage>1586</fpage>
          -
          <lpage>1595</lpage>
          . IEEE Inc.
          <article-title>(</article-title>
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .1109/CVPR.
          <year>2018</year>
          .00171
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Gessert</surname>
            ,
            <given-names>N.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlaefer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Efficient neural architecture search on low-dimensional data for OCT image segmentation</article-title>
          .
          <source>In: Proceedings of Conference on Medical Imaging with Deep Learning (MIDL-2019)</source>
          , London, The UK,
          <fpage>8</fpage>
          -
          <issue>10</issue>
          <year>July 2019</year>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . Medizintechnische Systeme E-1
          <string-name>
            <surname>Institute</surname>
          </string-name>
          (
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .15480/882.2731
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Qiang</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ge</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Neural architecture search for optimizing deep belief network models of fMRI data</article-title>
          . In: Li,
          <string-name>
            <given-names>Q.</given-names>
            ,
            <surname>Leahy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            , and
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>X</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the 1st International Workshop on Multiscale Multimodal Medical Imaging (MMMI-2019)</source>
          , Shenzhen, China, 13
          <year>October 2019</year>
          . vol.
          <volume>11977</volume>
          , pp.
          <fpage>26</fpage>
          -
          <lpage>34</lpage>
          . Springer International Publishing (
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -37969-
          <issue>8</issue>
          _
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>