<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Feature Compression for Predicting Effective Drug Combination</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Guocai Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaoqian Jiang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>W. Jim Zheng</string-name>
          <email>Wenjin.j.zheng@uth.tmc.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Biomedical Informatics, University of Texas Health Science Center</institution>
          ,
          <addr-line>Houston, TX</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Computational approach to predict effective drug combination can significantly improve drug efficacy while reducing drug toxicity. In this work, we employed a deep feature compression approach on gene expression data, pathway information and Ontology Fingerprints to improve the performance of a deep learning framework for effective drug combination. Our method indicates that the deep feature compression approach is an effective way to improve the performance of drug combination prediction.</p>
      </abstract>
      <kwd-group>
        <kwd>Feature compression</kwd>
        <kwd>drug combination</kwd>
        <kwd>Ontology Fingerprints</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Combined use of drugs may have extra synergistic effect that could lead to the reduced
drug dosage and hence the toxicity. Computational methodologies to predict the
effective drug combination make it possible to discover extensive drug combinations
without expansive and time-consuming experiments. However, despite of our previous
work to integrate ontology, literature and experimental data, the lack of experimental
data for drug combination impeded the improvement of the accuracy of the
computational methodologies for drug combination prediction, especially those methods based
on deep learning approaches.</p>
      <p>
        The performance of deep learning methods relies on the use of large amount of high
quality, annotated data generated specifically for a specific task. However, the quantity
of high quality training data for a targeted problem is often limited in most of the
circumstances such as for drug combination prediction. This issue has been explored in
many ways. Transfer learning [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is one of the approaches to deal with the lack of
training case problem where deep models are trained with relevant training data that has
been well-studied and are largely available and is then amended and re-trained for the
targeted task with small amount of data. The prerequisite of the use of Transfer learning
is the availability of large amount training data in the relevant field. Another method to
enable the effective use of deep learning approach for small training dataset is deep
feature compression [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2-4</xref>
        ]. By reducing the dimension of the inputs in the feature
domain, feature compression decreases the number of weights that need to be trained for
the deep model and thus the need of large amount of training data.
      </p>
      <p>
        Several machine learning methods have been applied for the detection of effective
drug combination in the AstraZeneca-Sanger Drug Combination Prediction DREAM
Challenge (www.synapse.org/#!Synapse:syn4231880, DREAM2015) [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], including
regression, decision trees, random forests, Gaussian processes, SVM, neural networks,
text mining, mechanistic network-based and others. Preuer K. et al [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] designed a
feedforward neural network with the integration of the heterogeneous resources as input to
predict the drug synergy. Janizek J. introduced an extreme gradient boosted based
approach, TreeCombo [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] to predict synergy of novel drug combinations.
      </p>
      <p>
        Built upon our previous work employing a Stacked Restricted Boltzmann Machine
to predict effective drug combinations from ontology, literature and experimental data
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], we applied deep feature compression on the input features for this deep belief
network and significantly improved the performance of this model for the drug
combination prediction.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methods and Data materials</title>
      <sec id="sec-2-1">
        <title>Data</title>
        <p>
          The data used in this project include Ontology Fingerprints derived from Gene
Ontology and literature, transcription profiling data and the drug sensitivity data, as
described in our previous work [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The training datasets from experiments we used in
this project contains 2199 pairs of drugs for 83 cell lines, which is sourced from
DREAM2015 (www.synapse.org/#!Synapse:syn4231880). The features are compiled
with Ontology Fingerprints [
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref9">9-13</xref>
          ], KEGG pathways [
          <xref ref-type="bibr" rid="ref14 ref15 ref16 ref17">14-17</xref>
          ] and gene expression data
provided by DREAM2015.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Methods</title>
      </sec>
      <sec id="sec-2-3">
        <title>Feature compilation</title>
        <p>
          As described previously [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], features are compiled specifically for each cell line for
effective genes. Each feature is a combination of the rank of the targeted gene for the
cell line normalized with the minmax algorithm, the Ontology Fingerprint similarity
[
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref18">10-13, 18</xref>
          ] and the inverse distance of the targeted genes in the extended KEGG
pathway measured with InfoMap [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Ontology
Fingerprin&gt;tUID 207</p>
        <p>GO:00075</p>
        <p>48
GO:00484</p>
        <p>77
GO:00072
83
…</p>
        <sec id="sec-2-3-1">
          <title>Deep feature compression</title>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Feature compression and Deep learning with RBM</title>
        <p>
          The vectors for the drug combination in the feature domain are fed into a deep
Autoencoder as shown in Fig 1. The output in the middle layer is then extracted as the
representative feature. These representative features are then used as the inputs to the
Restricted Boltzmann Machine [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] for the drug combinations prediction.
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Experiment and Evaluation</title>
        <p>The model is trained and evaluated in 3 ways:</p>
        <sec id="sec-2-5-1">
          <title>1. One model for one cell line.</title>
          <p>2. One model for all cell lines with feature compressed independently for each cell line.
3. One model for all cell lines.</p>
          <p>The datasets are evaluated using leave one out validation method.</p>
          <p>A typical combination of hyperparameters we used is:</p>
          <p>For deep Autoencoder, three hidden layers were used. The size of each layer is one
10th of the size of the previous layer. If the size of the layer is less than 10, then double
it. The other parameters are all default values.</p>
          <p>For the Stacked RBM model we used 3 layer neural networks as well – the
dimensions are the input size, 60 and 1 respectively. The other parameters are Weight cost
0.0001, Drop-out rate 0.5, Step ratio 0.01 Batch size 100. The SparseQ and
SparseLambda may be slightly adjusted to balance the precision and recall.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>
        Comparing with the result we obtained previously [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]—precision 71.5%, recall 60.2%,
f score 65.4%, as shown in table 1 the improved results are significantly better after
applying deep feature compression. For method I, the overall precision is 77.1%, recall
is 68.0% and f score is 72.2%. The f scores for 43 out of 83 cell lines are greater than
70% where we only had 32 out of 83 cell lines reported previously [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. For method II,
the overall precision is 73.3%, recall is 55.5% and f score is 63.2%. For method III,
the overall precision is 72.7%, recall is 58.3% and f score is 64.7%. While the methods
II and III show similar results as previously reported, we believe this is due to the
sharing of a single model for all 83 cell lines. The diversity of these cell lines makes our
approaches used in method II and III not very effective in prediction.
We applied deep feature compression for the purpose of predicting effective drug
combination. Our results indicate that reducing the dimension of the feature domain by deep
feature compression can significantly improve the performance of the deep learning
model we previously developed to predict effective drug combination [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Torrey</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Shavlik</surname>
          </string-name>
          ,
          <article-title>Transfer learning, in Handbook of research on machine learning applications and trends: algorithms, methods</article-title>
          , and techniques.
          <year>2010</year>
          ,
          <string-name>
            <given-names>IGI</given-names>
            <surname>Global</surname>
          </string-name>
          . p.
          <fpage>242</fpage>
          -
          <lpage>264</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Watanabe</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <article-title>Feature compression</article-title>
          ,
          <source>in Advances in information systems science. 1970</source>
          , Springer. p.
          <fpage>63</fpage>
          -
          <lpage>111</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.
          <article-title>Context-aware deep feature compression for high-speed visual tracking</article-title>
          .
          <source>in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source>
          .
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>H. and I.V.</given-names>
          </string-name>
          <string-name>
            <surname>Bajić</surname>
          </string-name>
          .
          <article-title>Deep feature compression for collaborative object detection</article-title>
          .
          <source>in 2018 25th IEEE International Conference on Image Processing (ICIP)</source>
          .
          <year>2018</year>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Menden</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          , et al.,
          <article-title>Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen</article-title>
          .
          <source>Nature communications</source>
          ,
          <year>2019</year>
          .
          <volume>10</volume>
          (
          <issue>1</issue>
          ): p.
          <fpage>2674</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.,
          <article-title>Predict effective drug combination by deep belief network and Ontology Fingerprints</article-title>
          .
          <source>Journal of biomedical informatics</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Preuer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , et al.,
          <article-title>DeepSynergy: Predicting anti-cancer drug synergy with Deep Learning</article-title>
          . Bioinformatics,
          <year>2017</year>
          . 1: p.
          <fpage>9</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Janizek</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.D.</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Celik</surname>
            , and
            <given-names>S.-I. Lee,</given-names>
          </string-name>
          <article-title>Explainable machine learning prediction of synergistic drug combinations for precision cancer medicine</article-title>
          .
          <source>bioRxiv</source>
          ,
          <year>2018</year>
          : p.
          <fpage>331769</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.,
          <article-title>Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature</article-title>
          .
          <source>Database</source>
          ,
          <year>2014</year>
          . bav034: p.
          <source>doi: 10</source>
          .1093/database/bav034.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et al.,
          <article-title>Signaling network prediction by the Ontology Fingerprint enhanced Bayesian network</article-title>
          .
          <source>BMC Systems Biology</source>
          ,
          <year>2012</year>
          .
          <volume>6</volume>
          (
          <issue>Suppl 3</issue>
          ): p.
          <fpage>S3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et al.,
          <article-title>Finding pathway-modulating genes from a novel Ontology Fingerprint-derived gene network</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <year>2014</year>
          .
          <volume>42</volume>
          (
          <issue>18</issue>
          ): p.
          <fpage>e138</fpage>
          -
          <lpage>e138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Tsoi</surname>
            ,
            <given-names>L.C.</given-names>
          </string-name>
          , et al.,
          <article-title>Evaluation of genome-wide association study results through development of ontology fingerprints</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <year>2009</year>
          .
          <volume>25</volume>
          (
          <issue>10</issue>
          ): p.
          <fpage>1314</fpage>
          -
          <lpage>1320</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Tsoi</surname>
            ,
            <given-names>L.C.</given-names>
          </string-name>
          , et al.,
          <article-title>Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior</article-title>
          .
          <source>BMC Bioinformatics</source>
          ,
          <year>2011</year>
          .
          <volume>12</volume>
          : p.
          <fpage>438</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Fabris</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A.A.</given-names>
            <surname>Freitas</surname>
          </string-name>
          ,
          <article-title>New KEGG pathway-based interpretable features for classifying ageing-related mouse proteins</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <year>2016</year>
          .
          <volume>32</volume>
          (
          <issue>19</issue>
          ): p.
          <fpage>2988</fpage>
          -
          <lpage>2995</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Kanehisa</surname>
            , M. and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Goto</surname>
          </string-name>
          ,
          <article-title>KEGG: kyoto encyclopedia of genes and genomes</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <year>2000</year>
          .
          <volume>28</volume>
          (
          <issue>1</issue>
          ): p.
          <fpage>27</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Kanehisa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.,
          <article-title>KEGG for representation and analysis of molecular networks involving diseases and drugs</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <year>2009</year>
          .
          <volume>38</volume>
          (
          <issue>suppl</issue>
          _1): p.
          <fpage>D355</fpage>
          -
          <lpage>D360</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kanehisa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.,
          <article-title>The KEGG resource for deciphering the genome</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <year>2004</year>
          .
          <volume>32</volume>
          (
          <issue>suppl</issue>
          _1): p.
          <fpage>D277</fpage>
          -
          <lpage>D280</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.,
          <article-title>Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature</article-title>
          .
          <source>Database</source>
          ,
          <year>2015</year>
          .
          <year>2015</year>
          : p.
          <fpage>bav034</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Burk</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Horton</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Amat</surname>
          </string-name>
          ,
          <article-title>Infomap: a complete guide to discovering corporate information resources</article-title>
          . Revista
          <string-name>
            <surname>Española de Documentación Científica</surname>
          </string-name>
          ,
          <year>1994</year>
          .
          <volume>17</volume>
          (
          <issue>2</issue>
          ): p.
          <fpage>235</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . and
          <string-name>
            <given-names>C.</given-names>
            <surname>Igel</surname>
          </string-name>
          .
          <article-title>An introduction to restricted Boltzmann machines</article-title>
          .
          <source>in Iberoamerican Congress on Pattern Recognition</source>
          .
          <year>2012</year>
          . Springer.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>