<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Fast and scalable learning of neuro-symbolic representations of biomedical knowledge</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Asan Agibetov</string-name>
          <email>asan.agibetov@meduniwien.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matthias Samwald</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Section for Arti cial Intelligence and Decision Support; Center for Medical Statistics</institution>
          ,
          <addr-line>Informatics, and Intelligent Systems;</addr-line>
          <institution>Medical University of Vienna</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this work we address the problem of fast and scalable learning of neuro-symbolic representations for general biological knowledge. Based on a recently published comprehensive biological knowledge graph (Alshahrani, 2017) that was used for demonstrating neurosymbolic representation learning, we show how to train fast (under 1 minute) log-linear neural embeddings of the entities. We utilize these representations as inputs for machine learning classi ers to enable important tasks such as biological link prediction. Classi ers are trained by concatenating learned entity embeddings to represent entity relations, and training classi ers on the concatenated embeddings to discern true relations from automatically generated negative examples. Our simple embedding methodology greatly improves on classi cation error compared to previously published state-of-the-art results, yielding a maximum increase of +0:28 F-measure and +0:22 ROC AUC scores for the most di cult biological link prediction problem. Finally, our embedding approach is orders of magnitude faster to train ( 1 minute vs. hours), much more economical in terms of embedding dimensions (d = 50 vs. d = 512), and naturally encodes the directionality of the asymmetric biological relations, that can be controlled by the order with which we concatenate the embeddings.</p>
      </abstract>
      <kwd-group>
        <kwd>knowledge graphs</kwd>
        <kwd>neural embeddings</kwd>
        <kwd>biological link prediction</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Over the last decade there has been a very popular trend of merging neural
and symbolic representations of knowledge for the large, general-purpose
knowledge graphs such as FreeBase [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and WordNet [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The utilized methods can
be roughly divided into two groups: i) multi-relational knowledge graph
embeddings [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ] and ii) graph embeddings [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. The former aims at learning
representations of both entities and relations, while the latter focus on the
untyped graphs, where each relation's type can be dropped without introducing
ambiguities. Both approaches aim at solving the problem of link prediction, i.e.,
modeling the probability of an instance of a relation (e.g., (u; v) 2 r) based on
d-dimensional vector representations (e.g., e(u); e(v); e(r) 2 Rd) and binary
operations de ned on them. Thus, in the case of multi-relational knowledge graphs
we seek to embed both entities and relations into d-dimensional vector space,
and we model the probability of a triple (labeled arc of a graph) (u; r; v) as
P ((u; v) 2 r) = he(u) + e(r); e(v)i (Euclidean dot product). In the case of
unlabeled graphs we drop the labels of the arcs (or edges in case the relations can
be treated as symmetric), we therefore do not embed the relations, and model
one single arc (or edge) directly as P (u; v) = he(u); e(v)i. The Euclidean dot
product is only of the many ways to model a probability of having a link (with
a label r in the multi-relational case) between the two entities u; v. In fact, the
underlying geometry may not necessarily be Euclidean, for more in-depth survey
of link prediction methodologies please see [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In the context of Semantic Web
technologies and the Resource Description Framework (RDF) and Web Ontology
(OWL) technology stack specialized knowledge graph embedding methodologies
have also recently been proposed [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ].
      </p>
      <p>
        In the bioinformatics domain Alshahrani et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] recently proposed a novel
methodology for representing nodes and relations from structured biological
knowledge that operates directly on Linked Data resources, leverages
ontologies, and yields neuro-symbolic representations amenable for down-stream use
in machine learning algorithms. The authors base their methodology on the
DeepWalk algorithm, which performs random walks on the unlabeled and
undirected graphs (i.e., with symmetric relations) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and embeds entities through an
approach inspired by the popular Word2Vec algorithm [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This methodology
is further tuned for multi-relational data by explicitly encoding the sequences
of intermingled entities and relations. Such complex intermingled sequences
alleviate the innate undirected nature of the random walks, at the expense of
increased number of parameters to train. Unfortunately, training such models is
computationally expensive (hours on a modern intel core i7 desktop machine)
and requires relatively large embedding dimensions (d = 512). This manuscript
builds upon this seminal work and proposes a more economical, fast and scalable
way of learning neuro-symbolic representations. The neural embeddings obtained
with our approach outperform published state-of-the-art results, with speci c
assumptions on the structure of the original knowledge graph, and with the smart
encoding of links based on the embeddings of the entities. Among other things
the contributions of this work are based on the following hypotheses:
{ There is no need for a sophisticated labeled DeepWalk [
        <xref ref-type="bibr" rid="ref5 ref9">5, 9</xref>
        ] to account for
all the complexity of the interconnectivity of biological knowledge, since
all (considered) biological relations have clear non-overlapping domain and
range separations,
{ We can train faster and more economical log-linear neural embeddings with
StarSpace [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], whose quality is comparable to the state-of-the-art results
(improves on all but one link prediction task) when considering standard
classi ers based on logistic regression as in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
{ Using the concatenation of the neural embeddings naturally encodes the
directionality of the asymmetric biological relations, and fully exploits the
non-linear patterns that can be uncovered by the neural network classi ers.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Materials and methods</title>
      <p>
        Dataset and evaluation methodology for link prediction used
In this work we consider the curated biological knowledge graph, presented in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
This knowledge graph is based on the three ontologies: Gene Ontology [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
Human Phenotype Ontology [13] and the Disease Ontology [14]. It also incorporates
the knowledge from several biological databases, including human proteins
interactions, human chemical-protein interactions and drug side e ects and drug
indications pairs. We refer the reader to [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for the detailed description on
provenance of the data, and on data processing pipelines employed to obtain the nal
graph. For the purpose of this work, we summarize the number of biological
relation instances present in this knowledge graph in Table 1.
      </p>
      <p>relation</p>
      <p>number of instances</p>
      <p>
        Our goal is to train fast neural embeddings of the nodes of this knowledge
graph, such that we could use these embeddings to perform link prediction.
That is, we try to estimate the probability that an edge with label l (e.g., l =
has-function) exists between the nodes v1; v2 (e.g., v1 = TRIM28 gene and v2 =
negative regulation of transcription by RNA polymerase II) given their vector
representations (v1); (v2). As in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] we build separate binary prediction models
for each relation in the knowledge graph. Note that, in this work we only focus
on the link prediction problem where the embeddings are trained on the
knowledge graph, in which we remove the 20% of the edges for a given relation (this
corresponds to the rst link prediction problem reported in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). We then use
these embeddings to train classi ers (logistic regression and multi-layer
perceptron (MLP)) on 80% of the positive true edges (i.e., relation instances) and on
the same amount of generated negative edges. These classi ers are then tested
on the remaining 20% positive and generated negative edges (which have not
been used in the embeddings generation). For a fair comparison with the
stateof-the-art results, we use the same methodology for negative sample generation,
and we use 5-fold cross validation for the training of embeddings and subsequent
link prediction classi ers, precisely the same way as in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. For all of our
experiments we do not use any deductive inference, and compare our obtained results
with the results obtained without inference in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
2.2
      </p>
      <p>
        Assumptions on the structure of the Knowledge Graph
Our methodology exploits the fact that the full biomedical knowledge graph
KG we are using only contains relations that can be inferred from the types
of the entities that are object and subject of the relation. This means that arc
labels can be safely dropped without the loss of semantics and without the
introduction of ambiguous duplicated pairs of nodes (6 9rj :(u; rj ; v) 2 KG; rj 6=
ri and (u; ri; v) 2 KG). Therefore, we can atten our graph without the risk of
having more than one relation connecting the same source and target nodes, i.e.,
we can simply consider our knowledge graph as a set of pairs of nodes (u; v).
As opposed to DeepWalk employed by [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], our methodology does not rely on
random walks on knowledge graphs [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; instead of producing sequences of labeled
entities (nodes and arc labels mixed together), we directly consider pairs of
connected nodes. Furthermore, we simplify the structure of the knowledge graph by
removing anonymous instances that were introduced by the creator of the
knowledge graph to assert relation instances in the ABox, i.e., we directly connect
OWL classes to de-clutter the graph used to train embeddings. In the original
knowledge graph, Alshahrani et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] commit to strict OWL semantics when
modeling biological relations by asserting anonymous instances, for example a
relation instance of has-function (domain: Gene/Protein, range: Function)
would be encoded as in Listing 1.1, where we present a speci c instance of a
relation that asserts that the TRIM28 gene has the function of negative regulation
of transcription by RNA polymerase II.
gene : &lt;http :// www . ncbi . nlm . nih . gov / gene /&gt;
obo : &lt;http :// purl . obolibrary . org / obo /&gt;
go : &lt;http :// aber - owl . net / go /&gt;
rdf : &lt;http :// www . w3 . org /1999/02/22 - rdf - syntax - ns #&gt;
gene :10155 obo : RO_0000085 go : instance_106358 &gt; .
aber - owl : go / instance_106358 rdf : type obo : GO_0000122 .
      </p>
      <p>Listing 1.1. Biological knowledge representation with OWL semantics commitment
We simplify the knowledge graph by removing all anonymous instances of type
&lt;http://aber-owl.net/go/instance 106358&gt; and connecting entities directly through
object relations, i.e., we rewrite all triples of the form presented above (Listing 1.1)
to the form that only contains object property assertions as demonstrated below
(Listing 1.2).
gene : &lt;http :// www . ncbi . nlm . nih . gov / gene /&gt;
obo : &lt;http :// purl . obolibrary . org / obo /&gt;
gene :10155 obo : RO_0000085 obo : GO_0000122 .</p>
      <p>Listing 1.2. Relaxed biological knowledge representation without OWL semantics
commitment
We admit such a relaxation in the OWL semantics commitment of the
knowledge graph, because we do not leverage any OWL reasoning for our tasks. This
relaxation does not change the statistics of the number of biological relation
instances present in the knowledge graph (Table 1).
2.3</p>
      <p>
        Training fast log-linear embeddings with StarSpace
As opposed to the approach taken by Alshahrani et al [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] we employ another
neural embedding method which requires fewer parameters and is much faster
to train. Speci cally, we exploit the fact that the biological relations have well
de ned non-overlapping domain and ranges, and therefore the whole knowledge
graph can be treated as an untyped directed graph, where there is no ambiguity
in the semantics of any relation. To this end, we employ the neural embedding
model from the StarSpace toolkit [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], which aims at learning entities, each of
which is described by a set of discrete features (bag-of-features) coming from
a xed-length dictionary. The model is trained by assigning a d-dimensional
vector to each of the discrete features in the set that we want to embed directly.
Ultimately, the look-up matrix (the matrix of embeddings - latent vectors) is
learned by minimizing the following loss function
      </p>
      <p>X</p>
      <p>
        Lbatch(sim(a; b); sim(a; b1 ); : : : ; sim(a; bk )):
(a;b)2E+;b 2E
In this loss function, we need to indicate the generator of positive entry pairs
(a; b) 2 E+ { in our setting those are entities (u; v) connected via a relation
r { and the generator of negative entities bi 2 E , similar to the k-negative
sampling strategy proposed by Mikolov et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In our setting, the negative
pairs (u; v ) are the so-called negative examples, i.e., pairs of entities (u; v )
that do not appear in the knowledge graph. The similarity function sim is
taskdependent and should operate on d-dimensional vector representations of the
entities, in our case we use the standard Euclidean dot product. Please note
that the aforementioned embedding scheme is di erent from a multi-relational
knowledge graph embedding task. The main di erence is that we do not require
the embeddings for the relations.
      </p>
      <p>
        Based on the embeddings of the nodes of the graph, we can come up with
di erent ways of representing a link between a node u and v, as a binary
operation de ned on the nodes of the graph (see [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for more detail). In particular,
we employ the so-called concatenation of the embeddings u; v to represent each
relation instance as a concatenated vector [u v]T (Figure 1).
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>
        In Table 2 we report the state-of-the-art evaluation scores as provided in
Alshahrani et al [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Throughout the rest of this manuscript we refer to these
Retained graph
      </p>
      <sec id="sec-3-1">
        <title>StarSpace</title>
      </sec>
      <sec id="sec-3-2">
        <title>Classifier</title>
        <p>
          results as SOTA results for convenience. We further use these state-of-the-art
results to contrast our classi cation results in Tables 3 and 4. To simplify the
interpretation of our results, both Tables 3, 4 report only di erences in F-measure
and ROC AUC scores for our approach wrt. the SOTA results. Classi cation
results are divided into two parts, di erentiated by the classi er used: i)
(Table 3) logistic regression (as in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]), and ii) (Table 4)) MLP. The two classi ers
are trained on concatenated embeddings of entities (nodes), which are obtained
from the attened graphs for each biomedical relations via StarSpace [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], as
described in Section 2. All classi cation results presented here are averaged over
5 folds to be directly and fairly compared with the results in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
relation
has-disease-annotation
has-disease-phenotype
has-function
has-gene-phenotype
has-indication
has-interaction
has-side-e ect
has-target
        </p>
        <p>
          F-measure ROC AUC
0.89
0.72
0.85
0.84
0.72
0.82
0.86
0.94
0.95
0.78
0.95
0.91
0.79
0.88
0.93
0.97
Overall, we are able to outperform SOTA results on all relations except for
has-target (Table 3). It is important to notice that we improve signi cantly on
has-indication and has-disease-phenotype - the two worst performing
relations in Alshahrani et al [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. We speci cally consider the embeddings of rather
small sizes ([5; 10; 20; 50]) to emphasize the rapidity and scalability of training
embeddings using log-linear neural embedding approaches [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. For all
embedding dimensions we train our embeddings for at most 10 epochs, which keeps
overall training time of embeddings for one speci c biomedical relation under
1 minute on a Core i7 desktop with 32GB of RAM. It is also important to
notice that the SOTA results were obtained via the extended DeepWalk
algorithm [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] with 512 dimensions for the embeddings, which takes several hours to
train on our machine. Moreover, our learned embeddings are more consistent, as
they have a 0.92-0.99 F-measure and ROC AUC range for all relations, whereas
SOTA embeddings range from 0.72 to 0.94.
        </p>
        <p>F-measure
5
10
20
50
5</p>
        <p>ROC AUC
10
20
50
has-disease-annotation -0.027 +0.013 +0.033 +0.071 -0.088 -0.047 -0.028 +0.012
has-disease-phenotype +0.239 +0.260 +0.274 +0.279 +0.180 +0.200 +0.214 +0.219
has-function +0.013 +0.028 +0.067 +0.117 -0.077 -0.066 -0.030 +0.017
has-gene-phenotype +0.148 +0.156 +0.159 +0.159 +0.078 +0.086 +0.089 +0.089
has-indication +0.186 +0.262 +0.270 +0.275 +0.112 +0.192 +0.200 +0.205
has-interaction +0.010 +0.147 +0.179 +0.180 -0.034 +0.088 +0.119 +0.120
has-side-e ect +0.091 +0.105 +0.128 +0.137 +0.021 +0.036 +0.059 +0.067
has-target -0.107 -0.077 -0.047 -0.018 -0.109 -0.083 -0.057 -0.034
Table 3. Di erences in F-measure and ROC AUC scores for our classi cation results
for logistic regression models trained on our neural embeddings wrt. the SOTA results.
In light gray are the minimal embedding dimension with the better scores than the
state of the art (excluding has-target relation). Rows colored with dark gray represent
the worst performing SOTA relations, which we outperform signi cantly.
3.2</p>
        <p>MLP and biomedical link prediction
We hypothesize that our approach of augmented embedding dimension via
concatenation of entity embeddings is more suited for neural network architectures.
Indeed, we are able to obtain very good biological link prediction classi ers by
using concatenated embeddings and multi-layer perceptrons. We experimeted
with di erent shallow and deep architectures (hidden layer sizes ([200], [20, 20,
20], [200, 200, 200]), which yielded almost similar performances. The results of
a shallow neural networks with one hidden layer consisting of 200 neurons are
summarized in Table 4, that empirically show that the concatenation of the
neural embeddings to represent a link between the two entities fully exploits the
non-linearity patterns, which can be uncovered by the neural network classi ers.
As a result, we are able to improve the SOTA results for all the biological link
prediction tasks.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion and conclusion</title>
      <p>
        Recent trends of neuro-symbolic embeddings continue the long-sought quest of
the arti cial intelligence community to unify the two disparate worlds, where
10
20
50
has-disease-annotation +0.095 +0.109 +0.110 +0.110 +0.035 +0.049 +0.050 +0.050
has-disease-phenotype +0.272 +0.279 +0.280 +0.280 +0.212 +0.219 +0.220 +0.220
has-function +0.148 +0.150 +0.149 +0.150 +0.048 +0.050 +0.049 +0.050
has-gene-phenotype +0.160 +0.160 +0.160 +0.160 +0.089 +0.090 +0.090 +0.090
has-indication +0.276 +0.278 +0.279 +0.279 +0.206 +0.208 +0.209 +0.209
has-interaction +0.180 +0.180 +0.180 +0.180 +0.120 +0.120 +0.120 +0.120
has-side-e ect +0.128 +0.137 +0.139 +0.139 +0.058 +0.067 +0.069 +0.069
has-target -0.024 +0.006 +0.023 +0.033 -0.040 -0.016 -0.003 +0.006
Table 4. Di erences in F-measure and ROC AUC scores for our classi cation results
with MLP models with one hidden layer consisting of 200 hidden units, trained on our
neural embeddings wrt. the SOTA results. In light gray are the minimal embedding
dimension with the better scores for all relations than the state of the art. Rows colored
with dark gray represent relations where the previous SOTA approach performs worst
and where our approach outperforms signi cantly.
the reasoning is performed either in a discrete symbolic space or in a
continuous vector space. As a community, we are still somewhere along this road, and
up to date there has still been no evidence of a clear way of combining the
two approaches. The neuro-symbolic representations based on random walks on
RDF data for the general biological knowledge as introduced by [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] are an
important rst development. The methodology allows for leveraging the existing
curated and structured biological knowledge (Linked Data), incorporating OWL
reasoning, and enabling the inference of hidden links that are implicitly encoded
in the biological knowledge graphs. However, as our results demonstrate, it is
possible to obtain improved classi cation results for link prediction if we relax
the constraints of multi-relational biological knowledge structure, and consider
all arcs as part of one semantic relation. Such a relaxation gives rise to faster
and more economical generation of neural embeddings, which can be further
used in scalable downstream machine learning tasks. While our results
demonstrate excellent prediction performance (all F-measure and ROC AUC scores
range in 0.92-0.99), they outline that having very well-structured input data is
a core ingredient. Indeed, the biological knowledge graph curated by Alshahrani
et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] implicitly encodes signi cant biological knowledge available to the
community, and simple log-linear embeddings coupled with shallow neural networks
are enough to obtain very good prediction results for the transductive link
prediction problems. Unfortunately, the quest of merging symbolic and continuous
representations cannot be ful lled to its advertised limits, as was already
mentioned in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], symbolic inference (OWL-EL reasoning) do not yield signi cant
improvements on link prediction tasks. Indeed, we managed to get very good
scores without any deductive completion of the Abox of the knowledge graph.
Another important aspect which we implicitly emphasized in our work is the
evaluation strategy of the neural embeddings. When dealing with big and rich
knowledge graphs one has to meticulously generate train and test splits, which
avoid potential leakage of information between the two sets. Failing to do so
might lead to the models which over t and are unable to truly perform link
predictions. As part of our future work we would like to focus on the creation
of di erent evaluation strategies that test the quality of the neural embeddings,
their explainability, and we would like to consider not only transductive link
prediction problems, but also focus on the more challenging inductive cases.
13. Kohler, S., Doelken, S.C., Mungall, C.J., Bauer, S., Firth, H.V., Bailleul-Forestier,
I., Black, G.C.M., Brown, D.L., Brudno, M., Campbell, J., FitzPatrick, D.R.,
Eppig, J.T., Jackson, A.P., Freson, K., Girdea, M., Helbig, I., Hurst, J.A., Jhn, J.,
Jackson, L.G., Kelly, A.M., Ledbetter, D.H., Mansour, S., Martin, C.L., Moss, C.,
Mumford, A., Ouwehand, W.H., Park, S.M., Riggs, E.R., Scott, R.H., Sisodiya, S.,
Van Vooren, S., Wapner, R.J., Wilkie, A.O.M., Wright, C.F., Vulto-van Silfhout,
A.T., de Leeuw, N., de Vries, B.B.A., Washingthon, N.L., Smith, C.L., Wester eld,
M., Scho eld, P., Ruef, B.J., Gkoutos, G.V., Haendel, M., Smedley, D., Lewis, S.E.,
Robinson, P.N.: The human phenotype ontology project: linking molecular biology
and disease through phenotype data. Nucleic Acids Res 42(Database issue) (jan
2014) D966{74
14. Kibbe, W.A., Arze, C., Felix, V., Mitraka, E., Bolton, E., Fu, G., Mungall, C.J.,
Binder, J.X., Malone, J., Vasant, D., Parkinson, H., Schriml, L.M.: Disease
ontology 2015 update: an expanded and updated database of human diseases for linking
biomedical knowledge through disease data. Nucleic Acids Res 43(Database issue)
(jan 2015) D1071{8
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bollacker</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturge</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , J.:
          <article-title>Freebase: A collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08</source>
          , New York, New York, USA, ACM Press (jun
          <year>2008</year>
          )
          <volume>1247</volume>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>G.A.</given-names>
          </string-name>
          :
          <article-title>Wordnet: a lexical database for english</article-title>
          .
          <source>Commun ACM</source>
          <volume>38</volume>
          (
          <issue>11</issue>
          ) (nov
          <year>1995</year>
          )
          <volume>39</volume>
          {
          <fpage>41</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usunier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Duran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yakhnenko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Translating embeddings for modeling multi-relational data</article-title>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Nickel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tresp</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrilovich</surname>
          </string-name>
          , E.:
          <article-title>A review of relational machine learning for knowledge graphs</article-title>
          .
          <source>Proc. IEEE</source>
          <volume>104</volume>
          (
          <issue>1</issue>
          ) (jan
          <year>2016</year>
          )
          <volume>11</volume>
          {
          <fpage>33</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Perozzi</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Rfou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skiena</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Deepwalk:
          <article-title>Online learning of social representations</article-title>
          .
          <source>In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '14</source>
          , New York, New York, USA, ACM Press (aug
          <year>2014</year>
          )
          <volume>701</volume>
          {
          <fpage>710</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Grover</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leskovec</surname>
          </string-name>
          , J.: node2vec:
          <article-title>Scalable feature learning for networks</article-title>
          .
          <source>KDD</source>
          <year>2016</year>
          (aug
          <year>2016</year>
          )
          <volume>855</volume>
          {
          <fpage>864</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ristoski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Rdf2vec: Rdf graph embeddings for data mining</article-title>
          . In Groth, P.,
          <string-name>
            <surname>Simperl</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krtzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lecue</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flck</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gil</surname>
          </string-name>
          , Y., eds.:
          <source>The semantic web ISWC 2016. Volume 9981 of Lecture notes in computer science</source>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2016</year>
          )
          <volume>498</volume>
          {
          <fpage>514</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cochez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ristoski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Global rdf vector space embeddings</article-title>
          . In d'Amato,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Fernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Tamma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Lecue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Cudr-Mauroux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>He</surname>
          </string-name>
          <string-name>
            <surname>in</surname>
          </string-name>
          , J., eds.:
          <source>The semantic web ISWC 2017. Volume 10587 of Lecture notes in computer science</source>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2017</year>
          )
          <volume>190</volume>
          {
          <fpage>207</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Alshahrani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maddouri</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kinjo</surname>
            ,
            <given-names>A.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Queralt-Rosinach</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoehndorf</surname>
          </string-name>
          , R.:
          <article-title>Neuro-symbolic representation learning on biological knowledge graphs</article-title>
          .
          <source>Bioinformatics</source>
          <volume>33</volume>
          (
          <issue>17</issue>
          ) (sep
          <year>2017</year>
          )
          <volume>2723</volume>
          {
          <fpage>2730</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>arXiv (</source>
          oct
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chopra</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
          </string-name>
          , J.: Starspace:
          <article-title>Embed all the things! arXiv (sep</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ball</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blake</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Botstein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherry</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dolinski</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dwight</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eppig</surname>
            ,
            <given-names>J.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>D.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>IsselTarver</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kasarskis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matese</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ringwald</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rubin</surname>
            ,
            <given-names>G.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sherlock</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Gene ontology: tool for the uni cation of biology. the gene ontology consortium</article-title>
          .
          <source>Nat Genet</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ) (may
          <year>2000</year>
          )
          <volume>25</volume>
          {
          <fpage>29</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>