<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Gene Selection for Cancer Diagnosis via Iterative Graph Clustering-based Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mehrdad Rostami</string-name>
          <email>Mehrdad.Rostami@oulu.fi</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mourad Oussalah</string-name>
          <email>Mourad.Oussalah@oulu.fi</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre of Machine Vision and Signal Processing, Faculty of Information Technology, University of Oulu</institution>
          ,
          <addr-line>Oulu</addr-line>
          ,
          <country country="FI">Finland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Research Unit of Medical Imaging, Physics, and Technology, Faculty of Medicine, University of Oulu</institution>
          ,
          <country country="FI">Finland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <abstract>
        <p>The development of microarray devices has led to the accumulation of DNA microarray datasets. Through this technological advance, physicians are able to examine various aspects of gene expression for cancer diagnosis. As data accumulation rapidly increases, the task of machine learning faces considerable challenges for high-dimensional DNA microarray data classification. Gene selection is a popular and powerful approach to deal with these high-dimensional cancer data. In this paper, a novel graph clustering-based gene selection approach is developed. The developed approach has two main objectives, consisting of relevance maximization and redundancy minimization of the selected genes. In this method, in each iteration, one subgraph is extracted, and then among the existing genes in this cluster, appropriate genes are selected using filter-based measure. The reported results on five cancer datasets indicate that the developed gene selection approach can improve the accuracy of cancer diagnosis.</p>
      </abstract>
      <kwd-group>
        <kwd>Approach</kwd>
        <kwd>Cancer diagnosis</kwd>
        <kwd>Microarray data classification</kwd>
        <kwd>Gene selection</kwd>
        <kwd>Feature selection</kwd>
        <kwd>Graph clustering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Worldwide, cancer remains the leading cause of death for both men and women. Cancer
diagnosis is important to increase survival chances since early treatment is available to the
patients, provided successful early diagnosis. As a result, much research has been conducted to
develop better strategies to prevent, diagnose, and treat this disease to decrease the mortality
[
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Using the emerging DNA microarray data, in an experiment, multiple aspects of gene
expression can be investigated to diagnose or detect diferent kinds of cancer.
Existing machine learning and pattern recognition approaches to handle large volume of DNA
microarray have been challenged by the high-dimensional structure of a such data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The high
data volume makes many genes irrelevant or redundant to cancer diagnosis or classification
task. Gene selection is a powerful and eficient technique in microarray data to deal with this
challenge [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. By using this strategy, the training process can be simplified, which, in turn,
enhances machine learning performance, and, thereby, the general diagnosis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
The main goal of our study is to develop an eficient clustering-based approach to choose a
subset of primary genes where one cluster of genes is chosen at each iteration. Then, among
the existing genes in this cluster, appropriate ones are selected using a filter-based measure.
AIxIA 2021 SMARTERCARE Workshop, November 29, 2021, Milan, IT
nEvelop-O
CEUR
This process of finding a cluster and selecting a candidate gene from each cluster is iterated
until all clusters are selected. We expect that our model, in addition to selecting genes with the
least amount of redundancy, will also maximize the relevancy of selected genes.
The remainder of this paper is organized as below: Section 2 reviews some related works. The
developed prediction method is presented in Section 3. The experimental results are reported
in section 4 and finally, section 5 present the conclusion and future works.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <p>
        A significant challenge in handling microarray data for cancer diagnosis is their high-dimensionality
where the number of genes is much greater than the number of patterns [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ], which leads
to a well-known problem known as “curse of dimensionality”. Gene selection is one
popular technique to eliminate irrelevant and/or redundant genes [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Previously, gene selection
approaches were classified as filter, wrapper, hybrid, and embedded approaches. In the filter
techniques, relevant genes are evaluated without a learning model. As a result, these techniques
are typically fast [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. There have been many filter-based approaches for gene selection in cancer
diagnosis such as, Filtering and ranking techniques [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], Simplified Swarm Optimization (SSO)
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], Tabu Asexual Genetic Algorithm (TAGA) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], Feature Clustering and Feature
Discretization assisting gene selection (FCFD) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], Least Loss (LL) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], etc. The wrapper gene selection
strategies employ a learning model to evaluate the eficiency of the chosen gene subset [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
In this category, an iterative search algorithm-based process is applied to seek the optimum
gene subset, and at each step, a subset of original genes are chosen and with a fitness function
determining which genes are the best. Despite the fact that wrapper strategies choose a efective
subset of original genes, they are computationally complex and may present challenges within
analysis of DNA microarray datasets [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Hybrid strategies combine the benefits of both filter
and wrapper strategies [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. In addition, the embedded strategies make use of gene selection in
the learning process.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Method</title>
      <p>Our novel gene selection algorithm is introduced by combining the notions of Graph Clustering
with Feature Weighting (GCFW). We can group our algorithm under the category of filter
gene selection technique and this algorithm measure relevancy and redundancy notions in
its selection mechanism. GCFW consists of two main phases including (1) Gene similarity
calculation, (2) Iterative subgraph finding and gene selection. In the reminder of this section
the details of these two phases are explained.</p>
      <sec id="sec-3-1">
        <title>3.1. Gene similarity calculation</title>
        <p>
          Initially, microarray datasets are represented as a weighted graph. In this demonstration, each
gene is indicated by a node and the value of each edge shows by gene similarities. In this graph
representation, the Pearson correlation coeficient criterion [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] is used to measure the similarity
values between two genes. This similarity measure maps the gene space of a microarray dataset
into a fully weighted and connected graph. To make the graph sparser, the edges with values
lower than the threshold  are deleted.  is an adjustable parameter and takes its value in the
range [0 1]. In our experiment  is set to 0.6.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Iterative subgraph finding and gene selection</title>
        <p>In order to avoid choosing similar genes, initial genes are divided to several groups to to reduce
the possibility of selecting redundant genes. Moreover to select a subset of relevant genes, a
feature weighting strategy is developed. In the proposed method, in each iteration, a subgraph
is identified and then among these present genes in this subgraph, the suitable ones are chosen
using gene relevance.</p>
        <p>
          The purpose of the subgraph discovery is to divide the genes into clusters so that the genes of
each cluster are as similar as possible. Previous gene clustering algorithms sufer from inherent
shortcomings, like the need to specify the number of clusters, ignoring the distribution of genes
and equal consideration of all genes. To deal with these issues, quite diferent from existing
feature clustering algorithms, a fast algorithm for subgraph discovery [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] is employed for gene
clustering. This algorithm yields grouping faster than previous gene clustering method.
Moreover, in this proposed method, Fisher Score weighting technique is used to rank the
genes of each subgraph and select the relevant genes for representing the genes of that cluster.
Therefore, it can claim that the genes selected satisfy both qualities: maximum relevancy and
minimal redundancy. In other words, the use of subgraph discovery criteria avoids to choose
the redundant genes and utilization of the notion of feature weighting results in the selection of
appropriate genes.Then, from each subgraph, the relevant gene is selected by performing the
feature weighting technique. The purpose of feature weighting is to select a representative gene
that is most relevant to the cancer diagnosis task. Fisher Score (FS) is a supervised univariate
iflter which gives higher values to those genes that have a separation property. The Fisher Score
of gene k is calculated as below:
  (  ) =
∑∈   ( ̄ −  ̄ )2
  (  (  ))2
(1)
where  ̄ is the average value of all the patterns related to the gene   ,  is a set of all classes in
a dataset,   is the size of samples on the class v, and   (  ) and  ̄ indicates the variance and
mean of gene   on class v, respectively.
        </p>
        <p>According to the algorithm design, a subgraph of genes is extracted in each iteration using a
repetitive process, and then the appropriate genes are selected from the existing genes in this
subgraph using a feature-weighting strategy. This process is repeated again for all remaining
genes after deleting the genes present in the extracted subgraph.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experimental results</title>
      <p>
        To investigate the performance of our cancer diagnosis algorithm, various experiments are
performed.The eficiency of our algorithm is compared with three new methods of filter-based
cancer diagnosis: Tabu Asexual Genetic Algorithm (TAGA) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], Feature Clustering and
Feature Discretization assisting gene selection (FCFD) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], Least Loss (LL) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Moreover, the
experiments in this study use a variety of datasets with diferent properties to demonstrate the
efectiveness of the developed algorithm. These microarray data include of Colon, Leukemia,
SRBCT, Prostate Tumor, and Lung Cancer [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The primary characteristics of these datasets are
detailed in Table 1. Additionally, to assess the flexibility of the proposed algorithm on diferent
classifiers, we also examined the performance of two frequent used classifiers, including Support
Vector Machine (SVM) and AdaBoost (AB).
      </p>
      <p>
        In our experiments, the eficiency of our algorithm is measured using the mentioned classifiers.
Figure 1 summarizes the average classification accuracy over ten separate and autonomous
runs of the diferent gene selection algorithms. The reported results indicate that in all cancer
datasets, the developed gene selection performs better than those of other alternative approaches.
For example, for the Colon dataset on SVM classifier, the proposed algorithm obtained a 88.91 %
accuracy while for TAGA [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], FCFD [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], LL [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], this value is 81.52%, 87.82% and 87.24%,
correspondingly. Furthermore, the reported results for AB classifier were similar to SVM classifier,
and in all cases the developed algorithm was more precise than the other compared algorithms.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In this paper, an eficient dimensionality reduction algorithm in cancer diagnosis has been
developed utilizing the subgraph discovery and feature weighting. The main aim of our algorithm
is to choose a subset of appropriate and non-redundant genes that are most closely associated
to the target class of microarray data classification. The proposed algorithm has been compared
to the recent gene selection algorithms on the cancer microarray datasets. The experimental
results show that our cancer diagnosis algorithm gained the highest performance.
In future work, our goals are (1) integrate our proposed gene selection approach along with deep
learning techniques for accurate cancer diagnosis, and (2) study explainable artificial intelligence
in detail to see how explainable artificial intelligence can improve further interpretability and
transparency of diagnosis.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This project is supported by the Academy of Finland Profi5 DigiHealth project, which is gratefully
acknowledged.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xiao</surname>
          </string-name>
          , L. Cao,
          <article-title>Recent advances in heat shock proteins in cancer diagnosis, prognosis, metabolism and treatment</article-title>
          ,
          <source>Biomedicine &amp; Pharmacotherapy</source>
          <volume>142</volume>
          (
          <year>2021</year>
          )
          <fpage>112074</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Daneshjou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ouyang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Zou</surname>
          </string-name>
          ,
          <article-title>How to evaluate deep learning for cancer diagnostics - factors and recommendations</article-title>
          ,
          <source>Biochimica et Biophysica Acta (BBA) - Reviews on Cancer</source>
          <year>1875</year>
          (
          <year>2021</year>
          )
          <article-title>188515</article-title>
          . doi:https://doi.org/10.1016/j.bbcan.
          <year>2021</year>
          .
          <volume>188515</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gao</surname>
          </string-name>
          , J. Han,
          <string-name>
            <surname>L</surname>
          </string-name>
          . Shao,
          <article-title>Flexible unsupervised feature extraction for image classification</article-title>
          ,
          <source>Neural Networks</source>
          <volume>115</volume>
          (
          <year>2019</year>
          )
          <fpage>65</fpage>
          -
          <lpage>71</lpage>
          . doi: https://doi.org/10.1016/ j.neunet.
          <year>2019</year>
          .
          <volume>03</volume>
          .008.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Forouzandeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <article-title>Presentation of a recommender system with ensemble learning and graph embedding: a case on movielens</article-title>
          ,
          <source>Multimedia Tools and Applications</source>
          <volume>80</volume>
          (
          <year>2021</year>
          )
          <fpage>7805</fpage>
          -
          <lpage>7832</lpage>
          . URL: https://doi.org/10.1007/s11042-020-09949-5. doi:
          <volume>10</volume>
          .1007/s11042- 020- 09949- 5.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Nasiri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Forouzandeh</surname>
          </string-name>
          ,
          <article-title>Review of swarm intelligencebased feature selection methods</article-title>
          ,
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>100</volume>
          (
          <year>2021</year>
          )
          <article-title>104210</article-title>
          . URL: https://www.sciencedirect.com/science/article/pii/S0952197621000579. doi:https://doi.org/10.1016/j.engappai.
          <year>2021</year>
          .
          <volume>104210</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Forouzandeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Soltani</surname>
          </string-name>
          ,
          <article-title>Integration of multi-objective pso based feature selection and node centrality for medical datasets</article-title>
          ,
          <source>Genomics</source>
          <volume>112</volume>
          (
          <year>2020</year>
          )
          <fpage>4370</fpage>
          -
          <lpage>4384</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S088875432030224X. doi:https://doi.org/10.1016/j.ygeno.
          <year>2020</year>
          .
          <volume>07</volume>
          .027.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Forouzandeh</surname>
          </string-name>
          ,
          <article-title>A novel method of constrained feature selection by the measurement of pairwise constraints uncertainty</article-title>
          ,
          <source>Journal of Big Data</source>
          <volume>7</volume>
          (
          <year>2020</year>
          )
          <article-title>83</article-title>
          . URL: https://doi.org/10.1186/s40537-020-00352-3. doi:
          <volume>10</volume>
          .1186/s40537- 020- 00352- 3.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Forouzandeh</surname>
          </string-name>
          ,
          <article-title>A novel community detection based genetic algorithm for feature selection</article-title>
          ,
          <source>Journal of Big Data</source>
          <volume>8</volume>
          (
          <year>2021</year>
          )
          <article-title>2</article-title>
          . URL: https://doi.org/10. 1186/s40537-020-00398-3. doi:
          <volume>10</volume>
          .1186/s40537-020-00398-3.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Labani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Moradi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ahmadizar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jalili</surname>
          </string-name>
          ,
          <article-title>A novel multivariate filter method for feature selection in text classification problems</article-title>
          ,
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>70</volume>
          (
          <year>2018</year>
          )
          <fpage>25</fpage>
          -
          <lpage>37</lpage>
          . URL: https://www.sciencedirect.com/science/article/ pii/S0952197617303172. doi:https://doi.org/10.1016/j.engappai.
          <year>2017</year>
          .
          <volume>12</volume>
          .014.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>W. De Smet</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. De Loof</surname>
            ,
            <given-names>P.</given-names>
            De Vos, P.
          </string-name>
          <string-name>
            <surname>Dawyndt</surname>
          </string-name>
          , B. De Baets,
          <article-title>Filtering and ranking techniques for automated selection of high-quality 16s rrna gene sequences</article-title>
          ,
          <source>Systematic and Applied Microbiology</source>
          <volume>36</volume>
          (
          <year>2013</year>
          )
          <fpage>549</fpage>
          -
          <lpage>559</lpage>
          . URL: https://www.sciencedirect.com/science/ article/pii/S0723202013001495. doi:https://doi.org/10.1016/j.syapm.
          <year>2013</year>
          .
          <volume>09</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>C.-M. Lai</surname>
            ,
            <given-names>H.-P.</given-names>
          </string-name>
          <string-name>
            <surname>Huang</surname>
          </string-name>
          ,
          <article-title>A gene selection algorithm using simplified swarm optimization with multi-filter ensemble technique</article-title>
          ,
          <source>Applied Soft Computing</source>
          <volume>100</volume>
          (
          <year>2021</year>
          )
          <article-title>106994</article-title>
          . URL: https://www.sciencedirect.com/science/article/pii/S1568494620309339. doi:https://doi. org/10.1016/j.asoc.
          <year>2020</year>
          .
          <volume>106994</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Salesi</surname>
          </string-name>
          , G. Cosma,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mavrovouniotis</surname>
          </string-name>
          , Taga:
          <article-title>Tabu asexual genetic algorithm embedded in a filter/filter feature selection approach for high-dimensional data</article-title>
          ,
          <source>Information Sciences 565</source>
          (
          <year>2021</year>
          )
          <fpage>105</fpage>
          -
          <lpage>127</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/ S0020025521000475. doi:https://doi.org/10.1016/j.ins.
          <year>2021</year>
          .
          <volume>01</volume>
          .020.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>H.-Y. Lin</surname>
          </string-name>
          ,
          <article-title>Feature clustering and feature discretization assisting gene selection for molecular classification using fuzzy c-means and expectation-maximization algorithm</article-title>
          ,
          <source>The Journal of Supercomputing</source>
          <volume>77</volume>
          (
          <year>2021</year>
          )
          <fpage>5381</fpage>
          -
          <lpage>5397</lpage>
          . URL: https://doi.org/10.1007/s11227-020-03480-y. doi:
          <volume>10</volume>
          .1007/s11227-020-03480-y.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>F.</given-names>
            <surname>Thabtah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Kamalov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hammoud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Shahamiri</surname>
          </string-name>
          ,
          <article-title>Least loss: A simplified filter method for feature selection</article-title>
          ,
          <source>Information Sciences 534</source>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          . URL: https: //www.sciencedirect.com/science/article/pii/S0020025520304242. doi:https://doi.org/ 10.1016/j.ins.
          <year>2020</year>
          .
          <volume>05</volume>
          .017.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Rostami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Moradi</surname>
          </string-name>
          ,
          <article-title>A clustering based genetic algorithm for feature selection</article-title>
          ,
          <source>in: 2014 6th Conference on Information and Knowledge Technology (IKT)</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>112</fpage>
          -
          <lpage>116</lpage>
          . doi:
          <volume>10</volume>
          .1109/IKT.
          <year>2014</year>
          .
          <volume>7030343</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>García-Díaz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Sánchez-Berriel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Martínez-Rojas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Diez-Pascual</surname>
          </string-name>
          ,
          <article-title>Unsupervised feature selection algorithm for multiclass cancer classification of gene expression rna-seq data</article-title>
          ,
          <source>Genomics</source>
          <volume>112</volume>
          (
          <year>2020</year>
          )
          <fpage>1916</fpage>
          -
          <lpage>1925</lpage>
          . URL: https://www.sciencedirect.com/science/article/ pii/S0888754319304811. doi:https://doi.org/10.1016/j.ygeno.
          <year>2019</year>
          .
          <volume>11</volume>
          .004.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>M. M. Kabir</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Shahjahan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Murase</surname>
          </string-name>
          ,
          <article-title>A new local search based hybrid genetic algorithm for feature selection</article-title>
          ,
          <source>Neurocomputing</source>
          <volume>74</volume>
          (
          <year>2011</year>
          )
          <fpage>2914</fpage>
          -
          <lpage>2928</lpage>
          . URL: https: //www.sciencedirect.com/science/article/pii/S0925231211002748. doi:https://doi.org/ 10.1016/j.neucom.
          <year>2011</year>
          .
          <volume>03</volume>
          .034.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bressan</surname>
          </string-name>
          ,
          <article-title>Faster algorithms for counting subgraphs in sparse graphs</article-title>
          ,
          <source>Algorithmica</source>
          <volume>83</volume>
          (
          <year>2021</year>
          )
          <fpage>2578</fpage>
          -
          <lpage>2605</lpage>
          . URL: https://doi.org/10.1007/s00453-021-00811-0. doi:
          <volume>10</volume>
          .1007/ s00453-021-00811-0.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>