<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Biomedical association inference on pandemic knowledge graphs: A comparative study⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mengjia Wu</string-name>
          <email>mengjia.wu@uts.edu.au</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chao Yu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jian Xu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ying Ding</string-name>
          <email>ying.ding@ischool.utexas.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yi Zhang</string-name>
          <email>yi.zhang@uts.edu.au</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Australian Artificial Intelligence Institute, Faculty of Engineering and Information Technology, University of Technology Sydney</institution>
          ,
          <addr-line>15 Broadway, Ultimo, NSW</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Information Management, Sun Yat-sen University</institution>
          ,
          <addr-line>Guangzhou</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Information, University of Texas</institution>
          ,
          <addr-line>Austin, TX</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Acquiring insights and understanding from historical pandemics is crucial for reducing the likelihood of their recurrence. The utilization of knowledge graphs stands as an essential tool for researchers, with knowledge inference emerging as a prominent task within these graphs to deduce previously unidentified connections between entities. This study endeavors to construct a knowledge graph centered on pandemic research and to evaluate the eficacy of various mainstream methodologies in the context of biomedical association inference. Our findings indicate that techniques for graph representation hold significant promise in executing these tasks and heterogeneous graph representation techniques demonstrate high predicting accuracy. Nonetheless, the advancement in this area of research necessitates more refined experimental designs and the adoption of more adaptive learning strategies.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Biomedical knowledge graph</kwd>
        <kwd>graph representation</kwd>
        <kwd>knowledge inference</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Biomedical entity association inference is a long-term task
for scientific researchers and industry practitioners to
understand the relationships between biomedical entities and
propose first-hand literature-based evidence for further
investigations [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Severe Acute Respiratory Syndrome (SARS),
Middle East Respiratory Syndrome (MERS) and Coronavirus
Disease 2019 (COVID-19), the three notorious pandemics in
public health history, presented huge threats to human lives
and social stability [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. Uncovering knowledge inference
from the pandemic knowledge foundation encompassing
tremendous coronavirus-related research articles published
in human history may bring insights to uncover the
evolutionary mechanisms of coronavirus for reducing public
uncertainties towards and developing precautions for future
infectious disease crises [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. However, the complexity,
heterogeneity and intricate associations of biomedical entities
present a challenge in exploring newly emerging
knowledge.
      </p>
      <p>
        Knowledge graphs, which are extensively used to depict
intricate data relationships, serve as the foundation for
analyzing and inferring associations [
        <xref ref-type="bibr" rid="ref2 ref6 ref7 ref8">2, 6, 7, 8</xref>
        ]. These graphs
represent biomedical entities such as genes, diseases,
chemicals, and drugs as nodes, with their relationships illustrated
as either directed or undirected edges, sometimes
accompanied by supplementary descriptive attributes. Leveraging
network analysis techniques, various methods have been
introduced to investigate patterns of association and predict
previously unknown relationships.
      </p>
      <p>In this study, we developed a knowledge graph from
scholarly articles on SARS, MERS, and COVID-19, comprising
9,142 nodes and 81,707 connections. We conducted a
validation test to assess how well various mainstream techniques
could predict relationships within this graph. By masking
10% of the connections of each type, we applied five diferent
methods to the masked graph to identify the hidden
connections from an equal mix of randomly inserted non-existent
connections. The findings revealed the diverse efectiveness
of these methods in identifying the obscured connections,
with HetGNN proven as the most efective. Nonetheless, the
lfexibility and applicability of diferent graph representation
methods across varied contexts need enhancement. This
research illustrates the application of multiple prominent
methods in deducing associations in knowledge graphs and
verifies the precision of these methods.</p>
      <p>The following of this paper is organized as follows: We
introduced the pandemic knowledge graphs and examined
methods in the section Data and Method, followed by
Experimental Settings and Results. We concluded the study
and anticipated some future directions in the section of
Discussion and Conclusions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Data and Method</title>
      <p>
        The integrative Biomedical Knowledge Hub (iBKH) is a
knowledge graph dataset that curates the associations of
11 categories of biomedical entities from 17 publicly
available data sources [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Using the iBKH as the global dataset,
we searched scholarly articles across PubMed using search
strategies from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and cross-matched the search results to
iBKH. By extracting the nodes and edges relevant to papers
in the search results, we constructed a pandemic-specific
sub-graph of the iBKH dataset. The overall description of
sub-graphs relevant to each pandemic is given in Table 1.
      </p>
      <p>
        The pandemic graph is denoted as  = (, ), and
 = {, , }
 = {, , , , , }
(1)
(2)
where , , and  respectively represent the node
set of diseases, drugs and genes. (,  ∈ {, , })
denotes the edge set of associations between nodes of types
 and . Entity association inference on this pandemic graph
aims to predict emerging associations between nodes in 
that have not yet appeared in .
• Random Walk with Restart (RWR) is a commonly
used method for inferring relationships within
graphs, particularly in the biomedical field. It
models a random walking process that begins at node 
and calculates the likelihood of reaching node  as
a measure of relevance between nodes  and . To
avoid the walk from becoming trapped in local areas,
it introduces a restart probability , which allows
the walk to restart from node  at each step, thereby
ensuring broader exploration of the graph.
• Resource allocation (RA) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]: RA is a link
prediction algorithm that conceptualizes the graph as a
transportation network, viewing edges as channels
for resource difusion. Under this model, the
likelihood of forming a link between any two nodes is
approximated by the total resources these nodes are
expected to receive through their shared neighbors.
This approach leverages the idea that the more
resources two nodes can exchange via their common
connections, the higher the probability they will
establish a direct link.
• Node2Vec [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]: Node2Vec is a scalable graph
representation technique that utilizes random walks
to learn low-dimensional vector representations of
nodes within a graph. It operates by optimizing an
objective that aims to preserve neighborhood
relationships, ensuring that nodes with similar network
neighborhoods are close to each other in the vector
space.
• Heterogeneous graph neural networks (HetGNN)
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]: HetGNN is a graph representation technique
designed to work with heterogeneous graphs,
characterized by their inclusion of various types of nodes,
each possessing diverse content attributes such as
text and images. It introduces a novel two-step
information aggregation process aimed at efectively
learning from the information presented by
neighboring nodes, both of the same and diferent types.
This process allows HetGNN to capture the complex
structural and content heterogeneity of the graph,
enabling the model to generate more accurate and
meaningful representations of each node.
• Heterogeneous graph neural network with
cocontrastive learning (HeCo) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]: HeCo is a
selfsupervised learning technique designed for
heterogeneous graph representation, which utilizes
contrastive learning to derive node representations.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Experiment settings</title>
      <p>The setup for the experiment is detailed in Figure 1. The
objective was to assess the eficacy of various algorithms
in predicting associations between biomedical entities. To
this end, a validation experiment was structured in the
following manner: From each category of edges, denoted
as  where ,  belong to the set , , 
(representing disease, drug-gene, and gene respectively), 10% of
the edges were randomly selected and removed. The
resulting graph, with these edges removed, was labeled as
 = (, ). The edges that were removed are
represented by  = |,  ∈ , , , and these were
considered the ’true’ associations for the purposes of this
experiment. In addition to this, an equivalent number of
node pairs, which were not connected by edges in the
original graph , were randomly chosen. These pairs are
denoted by  = |,  ∈ , , ,  ∩  = ∅,
and they were defined as the negative sample set for this
study. This methodical approach enabled a balanced
evaluation, comparing the algorithms’ abilities to correctly infer
both existing and non-existing associations, thereby
providing a comprehensive understanding of their performance in
the context of biomedical entity association inference.</p>
      <p>Subsequently, each candidate algorithm was applied to
the modified graph  to ascertain the likelihood of edge
formation between every pair of nodes within both 
and . In the cases of the Random Walk with Restart
(RWR) and resource allocation algorithms, this procedure
involved computing the random walk probability and the
resource allocation score, respectively, for each node pair.
Conversely, for the three graph representation techniques,
the process entailed converting every node in the set  into
embedding vectors. The representation for edges was then
determined through an average pooling strategy, which
involves aggregating the features of node embeddings to
form a single representation for each edge.</p>
      <p>Following the generation of these probabilities or
representations, the combined dataset of  and  was divided,
with 80% allocated for training and the remaining 20% for
testing. This division was employed to train a logistic
regression classifier, the purpose of which was to predict the
likelihood of edge formation between node pairs in the test
set. The predictions made by the logistic regression model
were then used to calculate the Area Under the Curve (AUC)
metric for each method. By focusing exclusively on the test
data, which comprised 20% of the total dataset, a
standardized evaluation criterion was established. This approach
allowed for a fair comparison of the five candidate methods,
with the AUC metric serving as a measure of each method’s
ability to accurately classify node pairs as either connected
or not connected, based on the generated classification
probabilities.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>Table 2 presents the AUC scores for the five candidate
methods. It is noted that HeCo needs a metapath definition to
function, and a gene-based metapath was chosen for this
purpose. Consequently, HeCo’s evaluation was limited to
gene-related associations. It was found that HetGNN
outperformed others in recovering the removed links.Compared
to RWR and RA, the three graph representation methods
demonstrated better accuracy in identifying connections.
Yet, their advantage is not definitive because they utilize a
supervised learning approach, requiring both positive and
negative samples to train a classifier, whereas RWR and
RA can be applied directly to the existing graph structure
without any pre-existing knowledge of it.</p>
      <p>From the perspective of edge types, the analysis of
genedrug and drug-drug connections showed superior outcomes.
Importantly, both RWR and RA displayed similar levels of
efectiveness as graph representation techniques in the task
of deducing disease-disease associations. This suggests that
inferring disease similarities might be distinct from other
tasks, meriting additional investigation.</p>
      <p>Among the graph representation strategies, two methods
tailored for heterogeneous networks achieved superior AUC
scores over Node2Vec. This superiority results from their
training mechanisms being specifically designed for
heterogeneous networks, as seen in this research and commonly
in biomedical entity graphs. These methods incorporate
the significance of node types into the computation,
employing either type-specific or metapath-based aggregation
strategies for information. While this heterogeneity-focused
approach is beneficial, it limits the model’s applicability
and increases the cost of adaptation. Changes in the
heterogeneous graph’s structure necessitate adjustments to
HetGNN’s data inputs and HeCo’s metapaths, along with
significant methodological revisions. Additionally, HeCo’s
performance is influenced by the setting of a positive sample
threshold and the definition of metapaths, which vary per
case and afects the outcome significantly. Node2Vec, in
contrast, ofers a more generalized solution applicable to a
wide range of graph types.</p>
      <p>In conclusion, while heterogeneous graph representation
methods hold promise for deducing relationships within
pandemic knowledge graphs, enhancing their flexibility and
general applicability remains a challenge.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion and Conclusions</title>
      <p>
        This study explores the performance of diferent methods of
association inference and provides insights into the
potential of graph representation methods. Despite some existing
entity-relationship summarization tools like PubTator 3 [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ],
graph representation methods still hold the potential to infer
more accurate biomedical associations but need
improvement on adaptability and generalisability. Future work will
modify the inference framework and perform real-world
association inference on the built pandemic graph.
      </p>
      <p>We anticipated the following future directions
aligning with some limitations of the current study: 1) This
study ofered some preliminary understandings on selected
baselines of graph representation learning in inferring the
pandemic knowledge graph, but further customized
redevelopment based on the unique features of the pandemic
knowledge graph to enhance its performance might be
beneficial. 2) Investigating the scientific community of a
pandemic and its collaborative patterns will bring insights to
analyze the societal context of a pandemic crisis and provide
evidence-based decision support in terms of science policy,
public health, and public administration.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was supported by the Commonwealth Scientific
and Industrial Research Organization (CSIRO), Australia, in
conjunction with the National Science Foundation (NSF) of
the United States, under CSIRO-NSF #2303037.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Henry</surname>
          </string-name>
          ,
          <string-name>
            <surname>B. T. McInnes</surname>
          </string-name>
          ,
          <article-title>Literature based discovery: Models, methods, and trends</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>74</volume>
          (
          <year>2017</year>
          )
          <fpage>20</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , G. Zhang, J. Lu,
          <article-title>Exploring the genetic basis of diseases through a heterogeneous bibliometric network: A methodology and case study</article-title>
          ,
          <source>Technological Forecasting and Social Change</source>
          <volume>164</volume>
          (
          <year>2021</year>
          )
          <fpage>120513</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Haghani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Bliemer</surname>
          </string-name>
          , Covid
          <article-title>-19 pandemic and the unprecedented mobilisation of scholarly eforts prompted by a health crisis: Scientometric comparisons across sars, mers and 2019-ncov literature</article-title>
          ,
          <source>Scientometrics</source>
          <volume>125</volume>
          (
          <year>2020</year>
          )
          <fpage>2695</fpage>
          -
          <lpage>2726</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. V.</given-names>
            <surname>Fry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <article-title>Topic evolution, disruption and resilience in early covid-19 research</article-title>
          , Scientometrics
          <volume>126</volume>
          (
          <year>2021</year>
          )
          <fpage>4225</fpage>
          -
          <lpage>4253</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Porter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Wu, Tracking and mining the covid-19 research literature</article-title>
          ,
          <source>Frontiers in Research Metrics and Analytics</source>
          <volume>5</volume>
          (
          <year>2020</year>
          )
          <fpage>594060</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Markley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cassidy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Porter</surname>
          </string-name>
          , Covid
          <article-title>-19 knowledge deconstruction and retrieval: An intelligent bibliometric solution</article-title>
          ,
          <source>Scientometrics</source>
          (
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grosser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tipper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Venter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <article-title>Profiling covid-19 genetic research: A data-driven study utilizing intelligent bibliometrics</article-title>
          ,
          <source>Frontiers in Research Metrics and Analytics</source>
          <volume>6</volume>
          (
          <year>2021</year>
          )
          <fpage>683212</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Soo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Grosser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Venter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , et al.,
          <string-name>
            <surname>Artificial</surname>
          </string-name>
          intelligence
          <article-title>-driven biomedical genomics</article-title>
          ,
          <source>KnowledgeBased Systems</source>
          (
          <year>2023</year>
          )
          <fpage>110937</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rajendran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Maasch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Abedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Bai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cuturrufo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Guo</surname>
          </string-name>
          , et al.,
          <article-title>Biomedical discovery through the integrative biomedical knowledge hub (ibkh</article-title>
          ),
          <source>Iscience</source>
          <volume>26</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lü</surname>
          </string-name>
          , Y.-C. Zhang, Predicting missing links via local information,
          <source>The European Physical Journal B</source>
          <volume>71</volume>
          (
          <year>2009</year>
          )
          <fpage>623</fpage>
          -
          <lpage>630</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Grover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Leskovec</surname>
          </string-name>
          , node2vec:
          <article-title>Scalable feature learning for networks</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>855</fpage>
          -
          <lpage>864</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Swami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. V.</given-names>
            <surname>Chawla</surname>
          </string-name>
          ,
          <article-title>Heterogeneous graph neural network</article-title>
          ,
          <source>in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>793</fpage>
          -
          <lpage>803</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Liu</surname>
          </string-name>
          , H. Han,
          <string-name>
            <surname>C</surname>
          </string-name>
          . Shi,
          <article-title>Self-supervised heterogeneous graph neural network with co-contrastive learning</article-title>
          ,
          <source>in: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery &amp; Data Mining</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1726</fpage>
          -
          <lpage>1736</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>C.-H. Wei</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Allot</surname>
            , P.-T. Lai,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Leaman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Lu</surname>
          </string-name>
          ,
          <article-title>Pubtator 3.0: An aipowered literature resource for unlocking biomedical knowledge</article-title>
          ,
          <source>arXiv preprint arXiv:2401.11048</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>