<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Machine and Deep Learning Innovations for Protein Structure Quality Assessment</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Loubna Terra</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fouzia Benchikha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Hachem Kermani</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Laboratory LIRE, Abdelhamid Mehri University Constantine 2</institution>
          ,
          <addr-line>Constantine</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratory LIRE, National Polytechnic School</institution>
          ,
          <addr-line>Constantine</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The importance of protein structures in biomedical research, especially in the drug discovery and design process, cannot be overlooked. The accuracy of these structures is crucial to ensure the success of research endeavors. However, experimental determination of protein structures is expensive and timeconsuming, and computational predictions are not flawless. Therefore, assessing the quality of protein models has become a vital step in filtering the most reliable options before further exploration. To meet this need, various structural bioinformatics labs have developed methods for Evaluating Model Quality (EMQ). Applying machine learning (ML) to EMQ has emerged as one of the most efective approaches, as evidenced by the results of the CASP challenge, which is widely recognized within the scientific community. This article ofers a systematic analysis of the leading ML-based EMQ methods developed in recent years. We categorize these methods based on the ML technology used and examine their relevance from a methodological perspective. We also introduce the fundamentals of EMQ. Overall, this article aims to serve as a starting point for exploring current research on protein quality evaluation while discussing future prospects in this rapidly evolving field.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;protein structure prediction</kwd>
        <kwd>model quality assessment</kwd>
        <kwd>machine learning (ML)</kwd>
        <kwd>deep learning (DL)</kwd>
        <kwd>CASP</kwd>
        <kwd>EMQ</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The fascinating process through which amino acids fold into three-dimensional protein
structures is a natural wonder that plays a critical role in the myriad of functions executed by
proteins within living organisms. Delving into the exact structures of proteins is essential
for the advancement of molecular biology, biochemistry, and pharmacology, ofering deep
insights into the molecular mechanisms of life and fostering innovation in drug development,
disease treatment, and the emerging field of synthetic biology. Traditionally, the determination
of protein structures relied heavily on experimental methods such as X-ray crystallography,
nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy (cryo-EM) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
While these techniques have provided invaluable data, they are often constrained by high costs,
technical challenges, and inherent limitations, such as the dificulty in crystallizing certain
proteins or the extensive time requirements for data collection and analysis.
      </p>
      <p>
        The emergence of computational methods for predicting protein structure (PSP) marks a
paradigm shift, ofering the promise of accelerating the pace of discovery while circumventing
the limitations associated with traditional experimental approaches. Over recent decades, the
ifeld of PSP has witnessed substantial advancements, evolving from basic homology modeling
techniques to sophisticated machine learning algorithms capable of predicting structures from
amino acid sequences with remarkable accuracy. The integration of deep learning technologies,
exemplified by the development of DeepMind’s AlphaFold2 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], represents a monumental leap
in our ability to predict protein structures with near-experimental accuracy across a wide
range of proteins. This breakthrough has set new benchmarks in the Critical Assessment of
Structure Prediction (CASP) competitions, highlighting a significant stride forward in the realm
of computational biology.
      </p>
      <p>Concurrently, the importance of model quality assessment (QA) has been increasingly
recognized, as it is essential for determining the reliability of predicted protein structures. QA
methods enable the discernment of the most plausible models from a plethora of predictions,
ofering a measure of confidence in the models utilized for further biological interpretation or
drug design. The evolution of QA methodologies has mirrored the advancements in PSP, with a
notable shift towards the application of machine learning and deep learning techniques for a
more nuanced analysis and interpretation of structural data.</p>
      <p>This article endeavors to synthesize and compare various significant works that showcase
the ongoing evolution and current status of PSP and QA methodologies. Each piece of work
discussed represents a distinct contribution to the overarching efort to accurately predict and
evaluate protein structures. By providing a comprehensive summary of these key ideas and
methodologies, the article aims to ofer a panoramic view of the advancements and challenges
within the PSP and QA fields. It highlights the transformative impact of machine learning
and deep learning technologies on our capabilities to predict and evaluate protein structures,
paving the way for groundbreaking discoveries and applications in biology and medicine. As
we continue to refine these computational tools, their integration into the broader ecosystem of
structural biology promises to unlock new horizons in our understanding and utilization of the
proteome.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <sec id="sec-2-1">
        <title>2.1. Machine learning and deep learning</title>
        <p>
          Machine learning is a field of study within artificial intelligence (AI) focused on designing,
analyzing, developing, and implementing methods that allow a machine (broadly defined) to
evolve through a data-driven process rather than traditional deterministic algorithms. Machine
learning approaches can be broadly classified into four types: supervised learning, unsupervised
learning, semi-supervised learning, and reinforcement learning [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. Supervised learning acquires
knowledge from training data with labeled responses. The learning process iteratively and
automatically adjusts the internal parameters of the prediction model, aiming to minimize
prediction errors. Most model quality assessment (MQA) methods are based on supervised
machine learning algorithms.
        </p>
        <p>
          Deep learning is a newer research domain within machine learning, introduced with the goal
of bringing ML closer to its ultimate objective: artificial intelligence. It involves algorithms
inspired by the structure and function of the brain [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Deep learning encompasses a set of
machine learning algorithms attempting to learn multiple levels of representation in order
to model complex relationships between data. It has the capability to extract features from
raw data through multiple layers of processing, consisting of multiple linear and nonlinear
transformations, and to learn about these features gradually across each layer with minimal
human intervention [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>Machine learning/deep learning (ML/DL) algorithms and their use in MQA methods will be
discussed in Section 3.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Protein structure prediction</title>
        <p>
          Protein structure prediction, crucial for understanding biological functions, remains a key
challenge in structural bioinformatics. This task involves inferring the three-dimensional
structure of a protein from its amino acid sequence, with three main approaches distinguished:
homology modeling [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] ,fold recognition[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] and ab initio prediction [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Recent advancements
include the use of residue contact prediction, enriched by co-evolutionary analysis from multiple
sequence alignments (MSA), significantly improving model accuracy [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          The advent of deep learning, especially with algorithms such as DeepMind’s AlphaFold [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ],
MULTICOM and RaptorX Contact, has marked a significant breakthrough, enabling accurate
prediction of complex protein structures [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. These methods leverage deep sequential and
structural features to predict inter-residue distances and spatial configurations in a global
context, setting new accuracy standards in the field.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Critical Assessment of Structure Prediction (CASP)</title>
        <p>
          The Critical Assessment of Protein Structure Prediction (CASP), is a global competition aimed
at evaluating protein structure prediction methods and fostering progress in the field[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. Since
its inception in 1994, CASP has played a crucial role in evaluating and advancing methods for
predicting protein structures. Beginning with CASP7, the focus has extended to model quality
evaluation (MQA) methods, which assess both global and local quality of protein structures
submitted by prediction servers [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The integration of deep learning techniques, especially
from CASP13, has significantly improved prediction accuracies, exemplified by methods like
AlphaFold and RaptorX Contact [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. These advancements challenge MQA methods to keep
pace with the continuously improving model quality[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Prominent MQA methods in recent
evaluations include FaeNNz, ModFOLD7, ProQ3D, and several others [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Model Quality Assessment Metrics</title>
        <p>MQA methods are crucial for selecting the most accurate protein structure models from
predictions, thus supporting biomedical research, particularly in drug discovery.</p>
        <p>
          The evaluation metrics used to judge the accuracy of protein structure predictions include the
Global Distance Test Total Score (GDT-TS)[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], Template Modeling (TM) score [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], local-Distance
Diference Test (lDDT) score [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]and RMSD (Root Mean Square Deviation)[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. These metrics
assess the similarity between the predicted model and a reference experimental structure, focusing
on the ability to overlay sets of residues and measure the accuracy of inter-residue contacts.also,
allow a comprehensive and nuanced evaluation of predicted model quality, contributing to the
continuous improvement of protein structure prediction methods and the efectiveness of ML
and DL-based MQA methods in CASP challenges.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. EMQ methods based on ML and DL</title>
      <p>This section compares several Model Quality Assessment (MQA) applications selected for their
high popularity, immediate availability, and performance in CASP. Most of these methods are
based on artificial neural networks (CNNs, GNNs).</p>
      <p>Table 1 shows the details of these ML and DL-based MQA methods.</p>
      <sec id="sec-3-1">
        <title>3.1. Method based on 3DCNN</title>
        <p>
          The advanced approach using three-dimensional convolutional neural networks (3DCNN)
represents a significant innovation in assessing protein structure quality, focusing on the
detailed analysis of local quality to predict the overall model quality [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. Using CASP dataset
collections, this method enhances prediction accuracy through careful feature selection and
optimized network topologies. It demonstrates the efectiveness of deep learning in protein
structure evaluation, promising substantial advancements in structural bioinformatics [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
3.2. TopQA
TopQA introduces an innovative method for protein structure quality assessment based on
topology and employing machine learning. By leveraging a unique topological representation
and applying a CNN to predict the GDT-TS score, TopQA surpasses traditional methods in
accuracy, as evidenced by a correlation of 0.41 on CASP12 [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Developed from data in the
CASP10 and CASP11 competitions, this approach optimizes the use of structural features for
model training. TopQA, accessible via GitHub, signifies a progression in protein model
evaluation by emphasizing their topological structure, opening new research avenues in structural
bioinformatics [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.3. SynthQA</title>
      </sec>
      <sec id="sec-3-3">
        <title>3.4. DeepUMQA</title>
        <p>
          SynthQA represents a breakthrough in protein model quality evaluation, utilizing a hierarchical
architecture based on machine learning to analyze multi-scale features, from energetic scores to
protein topology[
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. This method enhances evaluation accuracy over traditional approaches
by analyzing and generating new features for optimized model training[
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
DeepUMQA is a cutting-edge method for evaluating protein structure quality, using ultra-rapid
shape recognition (USR) and deep learning to efectively combine multi-scale features [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
It stands out by surpassing well-established methods through its ability to detail structural
Name
3DCNN
TopQA
SynthQA
        </p>
        <p>
          Year
and ref
information of residues, proven by superior performance on the CASP13, CASP14, and CAMEO
datasets [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
3.5. EnQA
EnQA utilizes an innovative 3D equivariant graph neural network to assess protein model quality,
leveraging advanced features from AlphaFold2 for accurate and transformation-insensitive
evaluation [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. This method exceeds the performance of traditional approaches and AlphaFold2
in quality assessment, illustrating its potential to transform protein structure evaluation in
structural bioinformatics [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion</title>
      <p>Contemporary protein model quality assessment techniques like TopQA, SynthQA, DeepUMQA,
and EnQA face significant challenges. These methods typically depend heavily on specific
databases, which may not accurately represent the diversity of protein structures, potentially
leading to biased outcomes. Additionally, they require substantial computational resources,
restricting their use in environments with limited capabilities. Although these tools incorporate
advanced deep learning and hierarchical architectures to enhance their evaluations, they often
struggle to apply their findings beyond the initial training datasets. Consequently, models
generally perform well on familiar data but fail to replicate this success on new, unseen datasets.
This lack of robustness underscores the urgent need for innovation in structural bioinformatics
to develop methods that are adaptive, less data-dependent, and more eficient, thus enhancing
their reliability and practical utility across varied scenarios.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The evaluation of protein structures using ML and DL has demonstrated progress but also
presents challenges, particularly in terms of generalization and data dependency. The limitations
of current methods such as EMQ, observed during CASP competitions, underscore the need to
explore more sophisticated and adaptive deep learning architectures. By adopting convolutional
, residual or graph neural networks, we can expect improvements in prediction accuracy and an
enhanced ability to process complex protein structures. The future of structural bioinformatics
will heavily depend on our ability to integrate these advanced technologies, thereby ensuring
significant advances in understanding biological functions and developing new therapies.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT for certain translation tasks.
After using this tool, the authors reviewed and edited the content as needed, and assume full
responsibility for the published material.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lengyel</surname>
          </string-name>
          , E. Hnath,
          <string-name>
            <given-names>M.</given-names>
            <surname>Storms</surname>
          </string-name>
          , T. Wohlfarth,
          <article-title>Towards an integrative structural biology approach: combining Cryo-TEM, X-ray crystallography, and NMR</article-title>
          ,
          <source>Journal of Structural and Functional Genomics</source>
          <volume>15</volume>
          (
          <year>2014</year>
          )
          <fpage>117</fpage>
          -
          <lpage>124</lpage>
          . URL: http://link.springer.
          <source>com/10.1007/ s10969-014-9179-9</source>
          . doi:
          <volume>10</volume>
          .1007/s10969-014-9179-9.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Senior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Jumper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kirkpatrick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Sifre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Green</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Žídek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. W. R.</given-names>
            <surname>Nelson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bridgland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Penedones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Petersen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Simonyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Crossan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kohli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Silver</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kavukcuoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hassabis</surname>
          </string-name>
          ,
          <article-title>Improved protein structure prediction using potentials from deep learning</article-title>
          ,
          <source>Nature</source>
          <volume>577</volume>
          (
          <year>2020</year>
          )
          <fpage>706</fpage>
          -
          <lpage>710</lpage>
          . URL: https://www.nature.com/ articles/s41586-019
          <article-title>-1923-7</article-title>
          . doi:
          <volume>10</volume>
          .1038/s41586-019-1923-7.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Sara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Loubna</surname>
          </string-name>
          , T. Zineeddine,
          <article-title>Approche Deep Learning basée sur le réseau de neurones résiduel pour la prédiction des séquences d'ADN virales</article-title>
          . (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Hardin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. V.</given-names>
            <surname>Pogorelov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Luthey-Schulten</surname>
          </string-name>
          ,
          <article-title>Ab initio protein structure prediction (</article-title>
          <year>2002</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <surname>S. W. I. Siu</surname>
          </string-name>
          ,
          <source>Machine Learning Approaches for Quality Assessment of Protein Structures, Biomolecules</source>
          <volume>10</volume>
          (
          <year>2020</year>
          )
          <article-title>626</article-title>
          . URL: https://www.mdpi.com/2218-273X/10/4/626. doi:
          <volume>10</volume>
          .3390/biom10040626.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Moult</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Fidelis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kryshtafovych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Schwede</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tramontano</surname>
          </string-name>
          ,
          <article-title>Critical assessment of methods of protein structure prediction (CASP) - round x</article-title>
          ,
          <source>Proteins: Structure, Function, and Bioinformatics</source>
          <volume>82</volume>
          (
          <year>2014</year>
          )
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . URL: https://onlinelibrary.wiley.com/doi/10.1002/prot. 24452. doi:
          <volume>10</volume>
          .1002/prot.24452.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Zhang,</surname>
          </string-name>
          <article-title>How significant is a protein structure similarity with TM-score = 0.5?, Bioinformatics</article-title>
          <volume>26</volume>
          (
          <year>2010</year>
          )
          <fpage>889</fpage>
          -
          <lpage>895</lpage>
          . URL: https://academic.oup.com/bioinformatics/article/ 26/7/889/213219. doi:
          <volume>10</volume>
          .1093/bioinformatics/btq066.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>V.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Biasini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barbato</surname>
          </string-name>
          , T. Schwede,
          <article-title>lDDT: a local superposition-free score for comparing protein structures and models using distance diference tests</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>29</volume>
          (
          <year>2013</year>
          )
          <fpage>2722</fpage>
          -
          <lpage>2728</lpage>
          . URL: https://academic.oup.com/bioinformatics/article/29/21/2722/ 195896. doi:
          <volume>10</volume>
          .1093/bioinformatics/btt473.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ishida</surname>
          </string-name>
          ,
          <article-title>Protein model accuracy estimation based on local structure quality assessment using 3D convolutional neural network</article-title>
          ,
          <source>PLOS ONE 14</source>
          (
          <year>2019</year>
          )
          <article-title>e0221347</article-title>
          . URL: https: //dx.plos.
          <source>org/10</source>
          .1371/journal.pone.0221347. doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0221347</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Conover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Stephenson</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Eickholt,</surname>
          </string-name>
          <article-title>TopQA: a topological representation for single-model protein quality assessment with machine learning (</article-title>
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Korovnik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hippe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Si</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kishaba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cao</surname>
          </string-name>
          , Synthqa - Hierarchical
          <source>Machine Learning-Based Protein Quality Assessment</source>
          ,
          <year>2021</year>
          . URL: http://biorxiv.org/lookup/doi/10. 1101/
          <year>2021</year>
          .01.28.428710. doi:
          <volume>10</volume>
          .1101/
          <year>2021</year>
          .01.28.428710.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.-S.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.-G.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.-J. Zhang,</surname>
          </string-name>
          <article-title>DeepUMQA: ultrafast shape recognition-based protein model quality assessment using deep learning</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>38</volume>
          (
          <year>2022</year>
          )
          <fpage>1895</fpage>
          -
          <lpage>1903</lpage>
          . URL: https://academic.oup.com/bioinformatics/article/38/7/1895/6520805. doi:
          <volume>10</volume>
          .1093/ bioinformatics/btac056.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Morehead</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wu</surname>
          </string-name>
          , J. Cheng, 3D
          <article-title>-equivariant graph neural networks for protein model quality assessment</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>39</volume>
          (
          <year>2023</year>
          )
          <article-title>btad030</article-title>
          . URL: https: //academic.oup.com/bioinformatics/article/doi/10.1093/bioinformatics/btad030/6986970.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>