<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Improving Transitive Embeddings in Neural Reasoning Tasks via Knowledge-Based Policy Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Shervin Mehryar</string-name>
          <email>shervin.mehryar@utoronto.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Remzi Celebi</string-name>
          <email>remzi.celebi@maastrichtuniversity.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Electrical &amp; Computer Engineering, University of Toronto</institution>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Data Science, Maastricht University</institution>
          ,
          <addr-line>Maastricht</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper proposes an approach to embed ontologies in order to deal with reasoning based on transitive relations, using the datasets provided for the SemRec Challenge at ISWC 2022. Knowledge Graph Embedding (KGE) methods provide a low-dimensional representation of the entities and relationships extracted from the knowledge graph and have been successfully used for a variety of applications such as question answering, reasoning, inference, and link prediction. However, most KGE methods cannot handle the underlying constraints and characteristics of ontologies, preventing them from performing important reasoning tasks such as subsumption and instance checking. We propose to extend translation-based embedding methods to include subsumption and instance checking reasoning by leveraging transitive relations. Experimental results show that our approach can achieve Hits@10 as high as %73 using samples generated by a policy network.</p>
      </abstract>
      <kwd-group>
        <kwd>ontology embedding</kwd>
        <kwd>knowledge graph embedding</kwd>
        <kwd>reasoning</kwd>
        <kwd>policy network</kwd>
        <kwd>neural reasoning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Neural-symbolic computing, an approach to combine symbolic approaches with
neural models, can provide reasoning and explainability capabilities while being
highly scalable in analyzing and inferring new data from a large volume of data.
Deep learning has achieved significant success in many areas in recent years, but
the success of deep learning models is limited by the availability of suficient
quality labeled data for the task at hand. Deep Learning models input-output
relationships by generating neural representation of data, but these neural
representations often fail to consider common sense or acquired domain knowledge.
A promising recent approach to neural-symbolic integration is Knowledge Graph
Embeddings (KGE), which map the symbolic representation of a domain data
into vector spaces. KGE can perform approximate reasoning that takes place in
vector space to reduce the complexity of reasoning time.</p>
      <p>
        There are many existing KGE methods to try to preserve certain properties
of KG [
        <xref ref-type="bibr" rid="ref1 ref11 ref13 ref2">1,13,11,2</xref>
        ]. TransE, being one of the simple and successful knowledge
graph embedding methods, proposes a scoring function to measure plausibility
of a fact [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The TransE scoring function tries to minimize the distance between
the translated head vector embedding (h + r) and the tail vector embedding
(t) where the facts are represented as a triplet &lt; h, r, t &gt; (head, relation, tail).
The authors later proposed rTransE, an extension of TransE, which learns rules
that compose relationships as a sequence of translations in the embedding space.
Particularly, rTransE embeds the composition of two relationships (l1, l2) as the
sum of their embeddings and calculates the distance between the translated head
embedding (h + l1 + l2) and the tail embedding (t) as a scoring function.
      </p>
      <p>
        KGs can take the form of Ontologies, which are in turn meant to represent
more complex relationships (i.e. negation, conjunction, disjunction and
quantiifers). However, most KGE methods cannot handle the underlying constraints
and characteristics of the ontologies. As a result, researchers have recently tried
to address this issue by incorporating the geometric structure of the EL
description logic into the embedding space (EmEl) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] or modifying translation-based
embeddings that can handle complex many-many roles in the EL embeddings
(EmElvar)[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>In this work, we extend TransE and rTransE to make subsumption and
instance checking reasoning possible whereby reasoning can take place in the vector
space by leveraging transitive relations. We further improve the quality of
embeddings using multi-hop samples generated by an agent’s policy network. The
agent is a neural network that takes as input an entity vector embedding (i.e.
state) and outputs a relationship (i.e. action). Through careful design of a
reward function, the agent learns a good policy to choose actions that lead to more
meaningful and longer sequence of translations. We investigate qualitatively and
quantitatively the behaviour and performance of the proposed policy-based
embedding method, which we call aTransE.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], Liu et al. proposed a neural-symbolic reasoning approach PoLo that is
based on reinforcement learning and logical rules. Their method extends
MINERVA framework [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], which trains a reinforcement learning agent to perform
a random walk with the goal of reaching a target entity. They addressed the
issue of noisy reward signals by incorporating logical rules (used as meta-paths)
in their reward function for agent to perform policy-guided random walk on a
background knowledge graph. They evaluated their method on a link prediction
task where hold-out drug-disease links were predicted in the drug repurposing
setting. They reported that their approach outperformed the the state-of-the-art
methods on the Hetionet biomedical dataset.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], the authors combined embedding and rule-based approaches using
ensemble learning for knowledge graph completion tasks. They claim that these
two methods complement each other to improve the performance of knowledge
graph completion methods. These methods are combined with linear blending
function at the relation level. Also the paper introduced RuleN, a rule learning
method, which supports Type Pn (n-length path rules) and Type C rules (rules
with constants in their heads). To show advantages of the combined approaches,
a sort of experiment is conducted. Their ensemble method outperformed
embedding methods, including HolE, RESCAL, and TransE, and rule-based learning
methods, AMIE and RuleN, in the evaluation. Their methodology difers from
ours in that it combines two classes of methods with linear blending function
and a classifier is trained with an equal number of positive triplets and generated
negative triplets. Whereas we combine rules and embeddings into labeling
functions and ensemble of output labels from each labeling function. While we used
labeling functions to generate labels for a triplet from each rule-guided walk and
integrated labels with a probabilistic model, in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] logistic regression was used
to train on the input data of as normalized scores from each individual model
(either rule and embedding based model).
      </p>
      <p>
        Zang et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] proposed a novel framework IterE which aims to predict new
triplets by iteratively combining embedding and rule learning. To address the
quality issue in the embeddings trained on sparse entities, IterE uses ontological
axioms to enrich the knowledge graph. The framework takes a knowledge graph
and relation embedding R as input and produces a set of axioms and confidence
scores. After new axioms (triplets) with high scores are added to the knowledge
graph, an embedding learning step is initiated with the updated KG. This process
continues with k iterations to improve the embeddings.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Method</title>
      <p>We propose an algorithm that given a set of triplets describing facts from a
(partial) KG, learns embeddings for the entities and relations. The embeddings are
subsequently used to perform reasoning tasks, such as link prediction. In order
to account for the case where multiple relation types are present in the data (see
Figure 1 left), we train a policy agent that generates heterogenous paths across
the KGE. The one-to-one relations are embedded into a vector space directly
using the subject, relation, and objects relations in the training data. In order
to further capture and embed properties that increase the expressivity of the
model, we incorporate two additional sampling techniques. We generate
quadruples using the transitive property in which triplets (1 hop) in the training data
are extended to quadruples (2 hops), by following a second predicate sharing a
common subject (i.e. 2-hop reasoning). We believe that this is a critical
consideration to allow the model to infer subsumption and instance assertions. The
details are given in the following.</p>
      <p>
        More formally, we formulate the problem as an optimization problem for
which optimal vector embeddings are learned for entities and relations so that
given a knowledge base graph G represented with a relation r between entities
e1 and e2, the following relation also holds in the corresponding vector space:
e⃗2 = e⃗1 + ⃗r. This is a representation of single triplets in the ontology in the
form (e1, r, e2) ∈ G. We further extend this notation to include K-hop
reasoning, i.e. (e1, {rk}kK=1, eK ) ∈ G. Adopting the notation in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the energy for this
path is given by d(e1, {rk}kK=1, eK ) = ||⃗sK (e1, {rk}kK=1, eK ) − ⃗eK ||2. Here, ⃗sk is
the vector entity starting from entity e0 and following relations r1, · · · , rk, i.e.
⃗sK (e1, {rk}kK=1, eK ) = ⃗e1 +⃗r1 · · · +⃗rK . In this view, we proceed to minimize the
following loss functions in order to learn good entity and relation embeddings:
      </p>
      <p>Triple Loss: given a set S of triplets representing facts in the KG, the Triple
loss refers to the following criterion:</p>
      <p>Ltri =</p>
      <p>X
(s,r,o)∈S
||⃗s + ⃗r − ⃗o||2,
where as before, ⃗s, ⃗r, and ⃗o are vector representations in Rd corresponding to
source entity, relation, and object entity in the ontology. These representations
are learned using the triplets in the data and embeded as a d-dimensional vector
similar to the process in TransE.</p>
      <p>Quadruple Loss: to improve the quality of representations, we consider a
secondary loss corresponding to paths with length K = 2. Given triplets (s, r1, o)
and (o, r2, t), the 2-hop path loss referes to the following crietrion:
Lquad =</p>
      <p>
        X
(s,r1,r2,t)∈S
||⃗s + r⃗1 + r⃗2 − ⃗t||2,
(1)
(2)
where as before, ⃗s, r⃗1, r⃗2, ⃗t are vector representations in Rd corresponding to
ifrst source entity, first relation, second relation, and second tail entity. This
structure ensure that learned vector representations are near each other for
entities at two links away following the composition rule [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Note that with this
setting, diversity in type of relations to the second order degree is achievable.
However, this introduces a squared polynomial level of complexity in the number
of relations. In other words, given the edge diversity size (i.e. number of edge
types) R and maximum path length support K, the time and space requirements
grow in the order of O(RK ) and O(dRK ), respectively. We next propose a policy
agent in the following in order to circumvent this issue and provide expressivity
in large KG graphs. For KG facts generation in order to compute this loss, we
form quadruples that share a common middle node, i.e. (s, r1, o) and (o, r2, t)
from the the triplet (s, l, t), which we generate from the all the present triplets
and store as quintuple samples in the form (s, r1, o, r2, t).
      </p>
      <p>
        Agent Loss: to accommodate path lengths K &gt; 2, we propose a Policy
Agent similar to [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] that is parameterized by a neural network to provide action
ak at time step k &gt; 2 corresponding to one of the relations. The agent learns
a distribution over all possible actions based on the current entity relation and
the path taken up to then. First, we train the agent on the K=1 and K=2 paths
in a supervised fashion using the representations learned above. We then retrain
the agent using memory replay and the following reward function:
R(k) =
0.0


+0.1 (s, ak, t) ∈/ S,
      </p>
      <p>s = t for (s, ak, t)
+1.0


− 0.1
k &gt; 2 and (s, {a1 · · · ak}, t) ∈ S
otherwise
in which the first condition allows self-loop (but doesn’t encourage or prohibit it),
the second condition encourages unseen but potential new paths, and the third
condition rewards paths of length k &gt; 2 for which a corresponding triplet exists
in the KG (i.e. new paths). In section 5 we provide quantitative examples of these
cases. In general, the agent is a multi-layer neural network with ReLu activity
functions, and a soft-max output, parameterized by θ . It outputs probabilities
according to distribution π (a|s) for each action given the current state. The
state corresponds to the current entity node along the path, which in our case
is represented by an embedding vector of dimension d from TransE. The action
corresponds to the relation type conditioned on the state at path length k. The
given reward function aims to maximize the following expected objective:
Lagent(θ ) = X X π (ak|sk; θ )R(k).</p>
      <p>k a</p>
      <p>In order to learn the parameters of the agent’s policy consistent with the
underlying structure of the KGE, at first the agent is trained in a supervised
manner given available triplets. In this fashion, the agent learns to make myopic
decisions or 1-hop reasoning by design (i.e. k &lt; 2). For the case of k &gt; 2, at
each level the agent makes a choice among R actions according to a soft-maxed
distribution conditioned on the current input state. In contrast to the worst case
scenario where the agent would have been making a choice among RK branches,
the agent instead makes local decisions here and receives a reward at the end
of each episode. We define episodes by the agent taking actions up to a
prespecified maximum length K or when another node is reached as a valid target
following actions a1 to ak. At that point, the agent receives a maximum reward
of 1 in our framework if successful. Since in fact at step k the choice will depend
on the total number of sub-branches at step k + 1 and this is computationally
intractable, we train in parallel a value function that estimates the value of next
state.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>
        We use three datasets, namely OWL2Bench, ORE, and CaLiGraph [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], in which
training, validation, and test data are given in triplet format as part of the
SemREC Challenge at ISWC 2022. The OWL2Bench is further provided in two
separate sub-datasets, namely OWL2Bench1 containing 7989 class assertions in
training data and 157 in test data as well as 105 subclass relations in training
(3)
(4)
data and 64 in test data; and OwL2Bench2 containing 15526 assertions in
training data and 146 in test data as well as 105 class relations in training data and
53 in test data. ORE is further provided in three subsets, namely ORE1 with
8194 training and 9073 test, ORE2 with 8204 training and 9369 test, and ORE3
with 8187 training and 9122 test data points of subclass relations, as well as,
53048 training and 42851 test assertions for ORE1, 53081 training and 42432
test assertions for ORE2, and 53014 training and 43181 test assertions for ORE3.
The last dataset is CaLiGraph which contains more ontology types, again with
subclass and type forming the majority of data, provided in diferent subsets of
size 104 and 105 (as well as the full ontology which we do not consider due to
memory requirements). The distribution of all predicates for both is shown in
Figure 2.
      </p>
      <p>
        During the training phase, each unique entity is represented by a vector
embedding of dimension size D = 30, which we found to be the most efective
across all datasets from the validation data. Initially, vector representations are
generated randomly. During training we further normalize the magnitude of each
vector representation to one [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Similarly, each relation type is represented with
a vector of the same size, i.e. D = 30, in order to simplify vector operations
as described in Section 3. Overall, for ORE there are about 6650-6673 entity
vectors and 2 relation vectors. For OwL2Bench1 there are about 113-115 unique
entity vectors and again 2 relation vectors that we consider. For CaLiGraph10e4
and CaLiGraph10e5 there are 10311 and 75195 with multiple relation types (see
Figure 2). Batch sizes of 25 from triplets and quadruple are selected in order to
minimize the the loss functions in relations 1 and 2, respectively. An stochastic
gradient optimizer such as ADAM with a learning rate of 10− 5 is used to update
the vector representations as model parameters.
      </p>
      <p>One of the important contributions from this work is additional sample
generation for the training process in order to improve the quality of transitive
embeddings. There are three types of samples generated in order to learn good
representations for the structure of the underlying knowledge graph data. The
ifrst set of samples is taken by converting the training data directly into a
subject, predicate, and object triplet in a vectorized form. These samples intuitively
provide local information as pertaining to a single hop reasoner. In order to
boost the reasoning power of the model, a second set of samples is generated
by considering extended link prediction and extracting subject-object relations
from the training data at a 2 hops distance. This second type of sample
generation is utilized to minimize the loss in relation 2. In order to still improve the
predictive power of the model by taking into account longer paths in a third
set, we train a policy network agent with two hidden layers of size 60 and 64,
followed by a ReLU activation function following each layer. The final layer is
a fully connected one with output size corresponding to the number of
predicate/relations for each task. We first train the agent in a supervised manner on
the triple and quadruple samples. Afterward, the agent is trained according to
the reward function described in Section 3 and is capable of generating relation
and entity candidates at a length greater than 2 hops away. Specifically, we limit
this length to a maximum of the path length 5 in each episode.</p>
      <p>During the test phase, the learned embeddings are used to perform the task
of link prediction on the test triplets provided in each dataset as follows. For
each test triplet, the tail embedding is obtained by adding the corresponding
embedding vectors of given head and relation which can be retrieved directly by
the model. The resulting embedding is then compared and ranked using cosine
similarly against all entities present in the dataset. We report success per case
if the true tail entity is in the top K predictions, i.e. Hits@K. In particular,
we report Hits@1 and Hits@10. For each dataset, we further consider subclass,
class assertion, and all relationships together for better analysis of the learned
models. The training and validation code3 are made available publicly.</p>
      <p>Table 1 shows the results reported for each dataset for diferent embedding
learning methods and relationship type. In particular, we report the TransE
performance metrics using triplets as learned from the loss optimization in relation
1. The embeddings learned through optimization of the loss relation in equation
2 using quadruple sampling in addition, corresponding to the recurrent TransE,
or rTransE, are also evaluated and reported. We consider embeddings learned for
class and assertion types separately first (middle columns) as well as the
combination of all relation types (last two columns). In this latter case, the three
loss functions are minimized in combination with each other. The performance
of model in terms of the learned embeddings using agent’s generated samples in
addition to triplets and quadruples, named aTransE, are included for this last
case since improvements otherwise we found to be negligible. It can be observed
that in each case, by including quadruple and agent’s samples, the overall
embedding quality is improved with highest performance achieved at %68.8 Hits@1
and %73 Hits@10 over all. It is concluded then that by including more relation
combinations and simultaneously minimizing the combined losses including the
agent’s generated samples the performance is consistently improved.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>
        One of the key contributions of this work in order to improve the quality of
entity and relation embeddings learned by the model is attributed to the
quality of sample generation, which is deemed important for many down-stream
applications such as question answering [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. While in the previous section we
discussed quantitative results obtained across diferent tasks, in Table 2 we also
provide qualitative examples of such samples from the CaLiGraph dataset. The
triplets are unique samples directly taken from the training data. The
quadruple relations are generated by joining diferent triplets sharing a common node.
Samples of length 2 or more generated through the agent’s policy network are
shown in the last row. We observe quantitatively that these samples are
meaningful which justifies the improvements. In one case, the agent follows subclass
relationships from an entity ’Environmental Organization’ down to an instance
’Non-renewable Resource Company Disetablished in 1911’ at length 5 away,
attesting to the quality of policy learned based the subsumption rule. In another
case, the agent’s policy is able to realize heterogeneity in relationships for
example by following subclass relations from ’organization’ down to ’philosophy
organization’, and location/country to instance ’Human Rights Organization
based in China’.
3 https://github.com/MaastrichtU-IDS/SemREC
(a) Average, maximum, and minimum (b) Average, maximum, and minimum
acpath lengths cumulated rewards
(c) Agent’s predictive entropy conditioned on the agent’s state, according to which the
agent is more likely to choose a relation with a higher value
Fig. 3: Path length, reward, and predictive behaviour of the the trained agent
on CaLiGraph datasets. The agent is trained for 50 episodes and results are
averaged over 100 runs, including maximum and minimum values.
      </p>
      <p>Moreover, the quality of the agent’s generated samples can be assessed based
on the length of paths generated corresponding to the number of reasoning hops
as well as the reward achieved during each run. In Figure 3a, the minimum,
average, and maximum path lengths are shown for the agent during training up
to 50 episodes, over 100 diferent runs. It can be seen while on average the agent
tends to produce paths of length two (corresponding to quadruple samples), often
it learns to explore new paths of higher lengths. In Figure 3b, the average reward
over one hundred runs of the training algorithm upto 50 episodes is depicted.
Consistent with the previous figure and the reward function defined in Section
3, it can be observed that the agent is able to reach average reward of 1 when
exploring path lengths greater than 2.</p>
      <p>Lastly, in order to better understand the quality of the learned policy by the
agent, we evaluate the predictive entropy over the domain of relations across the
CaLiGraph datasets which contain many more possibilities with respect to the
choice of the relationship types. This quantity can be interpreted as the likelihood
that the agent will generate a sample by choosing any relation type given a
specific state embedding as input. In other words, the agent essentially learns a
distribution over the action space conditioned on the state its in, which in our
development thus far corresponds to entity and relation pairs. As shown in Figure
3c, given any state then the agent has learned a meaningful distribution over all
relations which consequently allows for diverse sample generation. Interestingly
as compared to Figure 2, the agent’s behaviour is reasonably well matched to
the distribution seen in the original datasets. It’s believed that with a better
policy network (e.g. more layers and reward function) as well as more episode
training this performance can significantly improve.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Learning good embeddings in ontological data is challenging due to underlying
structure, incompleteness, and heterogeneity. While many attempts have been
made to address these issues, data specific development of good models using
neural reasoners remains an open problem. In this work, we propose a
learning algorithm for three datasets, namely ORE, OWL2Bench, and CaLiGraph,
which focuses on the transitive properties in the underlying knowledge base and
improves the quality of learned embeddings through multi-hop sampling and
combined loss minimization. Through experiments, we show that our approach
can achieve Hits@10 metric as high as %73 using samples generated by a policy
network.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usunier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Duran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yakhnenko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Translating embeddings for modeling multi-relational data</article-title>
          . In: Burges,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Bottou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Welling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Ghahramani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            ,
            <surname>Weinberger</surname>
          </string-name>
          ,
          <string-name>
            <surname>K</surname>
          </string-name>
          . (eds.)
          <source>Advances in Neural Information Processing Systems</source>
          . vol.
          <volume>26</volume>
          . Curran Associates, Inc. (
          <year>2013</year>
          ), https://proceedings. neurips.cc/paper/2013/file/1cecc7a77928ca8133fa24680a88d2f9-Paper.pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Celebi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yasar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uyar</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gumus</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dikenelli</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction using linked open data (</article-title>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dhuliawala</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaheer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vilnis</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Durugkar</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krishnamurthy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smola</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning</article-title>
          .
          <source>arXiv preprint arXiv:1711.05851</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Garıac</surname>
          </string-name>
          ´ -Duarn
          <string-name>
            <surname>´</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usunier</surname>
          </string-name>
          , N.:
          <article-title>Composing relationships with translations</article-title>
          .
          <source>Ph.D. thesis</source>
          , CNRS,
          <string-name>
            <surname>Heudiasyc</surname>
          </string-name>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Heist</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>The caligraph ontology as a challenge for owl reasoners (</article-title>
          <year>2021</year>
          ). https://doi.org/10.48550/ARXIV.2110.05028, https://arxiv.org/ abs/2110.05028
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kulmanov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu-Wei</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoehndorf</surname>
          </string-name>
          , R.:
          <article-title>El embeddings: Geometric construction of models for the description logic el++</article-title>
          .
          <source>In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI19</source>
          . pp.
          <fpage>6103</fpage>
          -
          <lpage>6109</lpage>
          .
          <source>International Joint Conferences on Artificial Intelligence Organization</source>
          (7
          <year>2019</year>
          ). https://doi.org/10.24963/ijcai.
          <year>2019</year>
          /845, https://doi.org/10. 24963/ijcai.
          <year>2019</year>
          /845
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hildebrandt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joblin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ringsquandl</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raissouni</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tresp</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Neural multi-hop reasoning with logical rules on biomedical knowledge graphs</article-title>
          . In: Verborgh,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Hose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Paulheim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Champin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.A.</given-names>
            ,
            <surname>Maleshkova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Corcho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Ristoski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Alam</surname>
          </string-name>
          , M. (eds.)
          <article-title>The Semantic Web</article-title>
          . pp.
          <fpage>375</fpage>
          -
          <lpage>391</lpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Meilicke</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fink</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rufinelli</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gemulla</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
          </string-name>
          , H.:
          <article-title>Fine-grained evaluation of rule-and embedding-based systems for knowledge graph completion</article-title>
          . In: International semantic web conference. pp.
          <fpage>3</fpage>
          -
          <lpage>20</lpage>
          . Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Mohapatra</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhatia</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mutharaju</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srinivasaraghavan</surname>
          </string-name>
          , G.:
          <article-title>Emelvar: A neurosymbolic reasoner for the el++ description logic</article-title>
          .
          <source>In: SemREC@ ISWC</source>
          . pp.
          <fpage>44</fpage>
          -
          <lpage>51</lpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ning</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ammar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yilmaz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehryar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Celebi</surname>
          </string-name>
          , R.:
          <article-title>Semantic answer type prediction by using bert classifier and rule-based ranking strategies</article-title>
          .
          <source>In: Proceedings of the SeMantic Answer Type and Relation Prediction Task at ISWC</source>
          <year>2021</year>
          <article-title>Semantic Web Challenge (SMART2021): co-located with the 20th International Semantic Web Conference (ISWC</article-title>
          <year>2021</year>
          ). p.
          <fpage>66</fpage>
          . CEUR Workshop Proceedings (
          <year>2022</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nie</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
          </string-name>
          , J.: Rotate:
          <article-title>Knowledge graph embedding by relational rotation in complex space</article-title>
          .
          <source>In: 7th International Conference on Learning Representations, ICLR</source>
          <year>2019</year>
          ,
          <article-title>New Orleans</article-title>
          , LA, USA, May 6-
          <issue>9</issue>
          ,
          <year>2019</year>
          . OpenReview.net (
          <year>2019</year>
          ), https://openreview.net/forum?id=HkgEQnRqYQ
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mao</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          :
          <article-title>Modeling inter-claim interactions for verifying multiple claims</article-title>
          .
          <source>In: Proceedings of the 30th ACM International Conference on Information &amp; Knowledge Management</source>
          . pp.
          <fpage>3503</fpage>
          -
          <lpage>3507</lpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , Zhang, J.,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Knowledge graph embedding by translating on hyperplanes</article-title>
          .
          <source>In: Proceedings of the AAAI conference on artificial intelligence</source>
          . vol.
          <volume>28</volume>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paudel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Bernstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chen</surname>
          </string-name>
          , H.:
          <article-title>Iteratively learning embeddings and rules for knowledge graph reasoning</article-title>
          .
          <source>In: The World Wide Web Conference</source>
          . pp.
          <fpage>2366</fpage>
          -
          <lpage>2377</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>