<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Evaluation of the Impact of Training Data Locality on the E ectiveness of Knowledge Graph Embeddings Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marc Mellotte</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Conor Hayes</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Science Institute, National University of Ireland Galway</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>As the volume and variety of data that many modern organisations deal with continue to grow, graphs are becoming increasingly important and relevant as a means of organising this data. This work looks at a possible way to improve the training of some state-of-the-art machine learning models in the area of knowledge graph embeddings. Where the interest of the user is on the ability to predict the existence of a particular link type as opposed to predicting links generally, subsets or sub-graphs could possibly be used to train the model more e ectively than the entire graph. We evaluate the performance of two state-ofthe-art knowledge graph embedding models on the task of predicting a speci c link type. The models are rst trained with all of the available training data and subsequently with subsets or sub-graphs based on the locality of the link type we wish to predict. We nd that there is evidence that using less training data can in some cases actually improve the performance of the model. Finally, we look at some graph features and examine if there is any correlation between these and the accuracy/performance of the machine learning models. While no strong correlation is found, the results point to further work being required to understand this phenomenon.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Modern organisations deal with ever-increasing amounts of data from
multiple sources and in many di erent formats and structures. In a widely quoted
statistic, Grimes reports that 80% of the most business-relevant data comes in
unstructured form, mostly text[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. A key challenge then is the ability to
organise and establish links between diverse data types. One way to achieve this is
through the use of Natural Language Processing (NLP) to extract \facts\ from
unstructured text and and graph technology to link these facts together into an
overall knowledge base or Knowledge Graph.
      </p>
      <p>Once such knowledge graphs are constructed, challenges such as how to keep
them up to date and how to mine them for insights arise. The task of predicting
links between nodes (people, organisations, etc) in the graph can lead to insights
where the relation was not believed to exist previously. However, as opposed to
generally predicting links for a graph overall, we are interested in the ability
to predict speci c link types. For example, in a knowledge graph of movies,
we might be interested in predicting who directed a movie, i.e. the \director"
relation between a person and a movie; instead of generally predicting links
between actors, movies, places, etc. Similarly, a large commercial organisation
might be interested in predicting the existence of links between customers and
new product lines. A pharmaceutical research organisation may want to predict
a potential link between a drug and a speci c protein, which may indicate the
genetic pro le of the population segment that the drug might be e ective for.
In all of these cases, we are interested in the e ectiveness of predicting speci c
link types within the overall graph.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Knowledge Graphs (KGs) are knowledge bases that store factual information in
the form of graph data [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. As such, a knowledge graph is a directed,
multirelational graph. These graphs are typically represented as a set of triples in
the form of (subject, predicate, object) where each triple represents a single fact
and the predicate is the link or relation between the subject and object entities.
Figure 1 shows a very simple sub-graph and Table 1 describes the triples. Such
facts are frequently, but not necessarily, encoded using the Resource Description
Framework (RDF) triples format.
      </p>
      <p>
        KGs continue to grow in popularity. In addition to well-know instances on
the Web such as the Google knowledge graph [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], (which has incorporated an
earlier KG, Freebase [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]), WordNet [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], DBPedia [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], BabelNet [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and NELL
[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], KGs are nding increasing use within within organisations such as Amazon
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]and were added to the Gartner Hype Cycle for Emerging technologies in 2018
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>
        One of the main problems with all knowledge graphs to date is that they
are very far from complete. For example, Freebase has no place of birth for over
70% of the person entities it contains. Even amongst the 100k most frequent
person entities on Freebase, almost a quarter (24%) do not have a profession
recorded [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. This incompleteness has lead to much research e ort in the area
of knowledge graph completion. Knowledge graph completion generally breaks
down into the sub-tasks of entity creation or extraction and link prediction [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
The former involves adding new entities or nodes to the graph while the latter
involves adding new links between existing entities.
      </p>
      <p>
        Statistical Relational Learning: The prediction of missing links or
relations has been the focus of Statistical Relational Learning (SRL), a sub- eld
of machine learning that focuses on graph data. SRL covers a number of
areas including collective classi cation, link prediction, link-based clustering,
social network modelling and object identi cation [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. According to [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], there
are two main classes of SRL techniques. The rst contains those that capture
the correlation between nodes (or links) based on the observable features of
the graph. The second captures the correlation using latent variables. Models
based on observable graph features can be divided into two approaches: a
nearest neighbour approach based on node similarity using measures such as cosine
similarity, mutual information, Dice coe cient, distance-based measurements,
binary classi ers and more [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] or a topological approach where prediction
is based on patterns extracted from local or global topology such as Common
Neighbours, Jaccard Index, Adamic Adar [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], Resource Allocation Index [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ],
Katz Index [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], Hitting Time [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Rooted PageRank [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and SimRank [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In
models based on latent variables, each node in the knowledge graph is assumed
to have a latent feature vector which must be learned; the relationships between
nodes are explained by these latent features [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. As an example, Nickel et al.
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] propose that Alec Guinness received the Academy Award because he was a
\good" actor. The property \good" here is a latent feature of the Alec Guinness
entity because it is not directly observable in the data. A key task of all latent
feature models is to learn the latent features, or \embeddings", for the nodes in
the graph. The \embeddings" refers to how entities and relations in a knowledge
graph are embedded into continuous vector spaces. This gives advantages such
as simplifying manipulation of the graph, while at the same time preserving its
inherent structure [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ].
      </p>
      <p>
        Knowledge Graph Embeddings: Wang et al [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] summarise the
approaches to KG embeddings into two classes :\Semantic Matching Models" and
\Translational Distance Models". Translational distance models exploit
distancebased scoring functions. The Translating Embeddings or TransE approach[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
models each entity as a vector in space Rd and each relation as a vector
transformation that maps the subject to the object of the relationship. The accuracy
of a relation or link is measured as the distance between the entities after the
transformation has been applied. TransE would allow for Alf redHitchcock +
DirectorOf P sycho and J amesCameron + DirectorOf Avatar. Semantic
matching models di er from translational-distance models in that they use a
similarity scoring approach as opposed to distance-based scoring. The likelihood
of a fact or triple is calculated using the latent semantics of the entities and
relations. RESCAL [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and its derivatives/extensions such as DistMult [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ],
Holographic Embeddings (HolE) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] Complex Embeddings (ComplEx) [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] are
examples of this approach.
      </p>
      <p>
        Model Evaluation: A common approach to evaluating the accuracy of
embeddings models is described by Bordes et al [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For all training and testing
triples (s, p, o), the subject entity, s, is removed and the likelihood of all possible
triples/facts (e, p, o) is calculated where e 2 De and D is the entire dictionary
of entities in the graph. Triples other than the original correct one are referred to
as the corrupted triples. The results are then sorted in decreasing order of their
likelihood according to the model and the rank of the correct triple is recorded.
The process is repeated for the object entity o. Evaluation is then similar to
that used for question answering in information retrieval where the Mean Rank
of the correct triple is measured.
      </p>
      <p>
        Bordes et al [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] report the mean of the predicted rank of the subject and
object entities and the Hits@10, i.e. the proportion of correct triples in those
ranked as the top 10. One potential issue with this metric is that there may
be triples amongst the corrupted triples that are correct. For example, if the
original triple was the one relating to the actor Alex Guinness in Table 1, i.e.
Alec Guinness, characterIn, Star Wars, then we would remove the subject entity
(Alec Guinness ) and predict the accuracy of all other entities in the graph (i.e.
Freebase in this case) against the same predicate (relation) and object entity.
However, there were more characters in the same movie so some of the corrupted
triples will actually be true. In order to avoid this scenario, Bordes et al [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
removed all valid triples that appear in either the training, test or validation data
sets from the corrupted triples. They referred to this as the \Filtered" setting
and report metrics against both the original, \Raw", data and the \Filtered"
version. They then reported both the Mean Rank and Hits@n (e.g. Hits@1,
Hits@3 &amp; Hits@10) for both the original triples (raw ) and those with the valid
triples removed ( ltered ). Nickel et al [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] followed this evaluation approach and
measured the quality of the ranking using the Mean Reciprocal Rank (MRR)
which is less sensitive to outliers than the mean rank. Again, this is reported for
both the raw and ltered settings.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Research Question</title>
      <p>
        This work seeks to address whether more (relevant) training data is always
better (i.e. produces higher accuracy in tests) when training a knowledge graph
embedding model to predict a speci c link/relation type. Our research question
is thus - Does locality (i.e. the local sub-graph around the relation type we are
interested in) of training data have a disproportionate e ect on the accuracy of
the model and, if so, how much local data can e ectively train the model ? Our
null hypothesis is that more data will always lead to better accuracy as this is
commonly the case for machine learning models. From some of the most recent
work in the knowledge graph embedding space, we choose two models for this
study - ComplEx and DistMult. Both models perform very well in benchmark
tests in the literature [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. We select two datasets that are representative of
realworld knowledge graphs upon which to carry out the experiments - Freebase
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] &amp; NELL[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. We adopt the version of Freebase that has been re ned for link
prediction experiments by Totuanova and Chen [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. This dataset, often referred
to as FB15k-237, contains approximately 15k entities with 237 relations. Table
2 shows a summary of the two datasets used in this work.
      </p>
      <p>Sub-Graph Selection: Figure 2 describes visually the rst level of the
subgraph. At the centre of the diagram, the graph nodes shown in dark colour
with the bold link between them represent an instance of the relation that we
are interested in. Starting with this relation, we gather all of the relations that
connect into or out of either end of this one. We refer to these as \Level 1"
relations. This set consists of the relations and nodes inside the circle with the
darker shading.</p>
      <p>The next sub-graph extracted is called the \Level 2" sub-graph. The process
for extracting it is similar to that described for Level 1. Again we start with the
relation type of interest and select all of the adjacent relations. For Level 2, we
also select all of the relations adjacent to the Level 1 relations. In other words,
we select all of the relations that are 2 hops from the relation type of interest.
This is shown visually by the darker shaded circles in Figure 3. Level 2 contains
all of the relations in Level 1 plus those immediately adjacent to them. Lastly,
the Level 3 sub-graph contains all of the relations in Level 2 as well as the set
of relations immediately adjacent to these, i.e. all relations that are 3 hops from
the relation type of interest. This is shown visually by the darker-shaded circles
in Figure 4.</p>
      <p>If we were interested in predicting the existence of the / lm/ lm/genre
relation type from the Freebase dataset, we would nd that there are 3756 instances
of it in the training data. When the sub-graph levels are extracted we get
subgraphs of the sizes shown in Table 3. As we can see, the Level 3 sub-graph is close
to the the size of the full training graph/dataset (full graph has approximately
272k relations).</p>
      <p>Normally, the evaluation of a model is against the full set of test data. In
this case, we are only interested in the performance of the trained model on the
speci c relation we are interested in. For example, if we are interested in the
relation / lm/ lm/genre, we lter the test dataset to only contain this relation.
We are not interested in the model performance on the prediction of any of the
other relations or the overall graph.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Implementation and Evaluation Details</title>
      <p>
        All of the experiments in this work were carried out on a Linux server with
Intel(R) Core(TM) i70.4790K CPU 4.00GHz processor, 32 GB RAM, and an
nVidia Titan X GPU. The operating system of the server was Ubuntu Server
16.04.5 LTS. The code was written in Python 3.5. The KGE models were
developed on top of the TensorFlow[
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] (GPU) framework. The implementation of
the KGE models has been provided by SK Mohamed et al. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. This
implementation is not (yet) available as an open source release. It is planned to release
the software code developed for this work as open-source once the underlying
KGE libraries have also been released.
      </p>
      <p>Evaluation Metrics: We evaluate our results using the Mean Reciprocal
Rank (MRR) and the Hits@10 on the Filtered dataset (MRR FIL). The MRR
measures the mean of the reciprocal of the rank of the correct triple as
predicted by the model. Hits@10 records the number of correct triples in the top
10 predicted by the model.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>In the experiments, both models performed poorly when trained with the Level
1 sub-graphs when compared with the same model trained with the Level 0
graph. The models trained with Level 1 sub-graphs resulted in average accuracies
ranging from 50% to over 80% less than the same model trained on the Level 0
graph. This contrasted with signi cantly better relative performance seen when
the models were trained on Level 2 &amp; 3 sub-graphs. We focus on these results
here, starting with the ComplEx model (Table 4).</p>
      <p>The most immediately interesting results are where the ComplEx models
trained on a sub-graph outperformed those trained on the full Level 0 graph. As
Table 4 shows, the ComplEx model trained with either the Level 2 or 3 NELL
sub-graph outperformed the same model trained with the entire Level 0 NELL
sub-graph. This suggests that there are scenarios where less training data can
lead to a more accurate link prediction model for speci c links of interest.</p>
      <p>We do not see the same overall outperforming of the Level 0 model for a
full Level 2 or 3 when ComplEx was trained with Freebase data. We do notice
that the drop in accuracy of Levels 2 &amp; 3 from Level 0 is far less than the
corresponding reduction in the amount of training data used. For example, the
models trained with Level 2 sub-graphs experienced an average drop in accuracy
of 24% versus Level 0 for a reduction in training data of 71%. This suggests that
there isn't a linear relationship between the amount of training data and the
model performance.</p>
      <p>These averages for the full level also mask some additional ndings within
that level. For example, the Freebase Level 2 sub-graphs contained a range of
relations from approximately 2k to 200k relations. When this was broken out
into ranges of relation counts, we saw that ComplEx models trained on Level 2
sub-graphs with a size of up to 30k relations (approximately) outperformed the
model trained on the overall Level 0 graph. Drilling into the Level 3 results in
a similar way shows that the ComplEx models trained on Level 3 sub-graphs of
sizes from 20-50k relations also outperformed the overall average.</p>
      <p>Results of the experiments conducted with the DistMult model are
summarised in Table 5. As was the case with the ComplEx model, we see scenarios
where the model trained with a sub-graph outperforms the model trained with
the full Level 0 graph. In this case, the Freebase Level 2 and NELL Level 3
average accuracy (MRR FIL) outperforms the overall Level 0 average.</p>
      <p>For the other scenarios - DistMult trained with Freebase Level 3 or NELL
Level 2 - we again see that the reduction in accuracy versus Level 0 is signi cantly
less than the reduction in the training data used, which again supports the
suggestion that there is not a linear relationship between the two here. As before,
we can drill into these results to get a detailed breakdown by ranges of relations
within both Level 2 &amp; 3. For the Freebase Level 3 average, the breakdown by
range of relations shows that DistMult models trained with Level 3 sub-graphs
of size up to 150k relations outperform the overall Level 0 average. It is when the
Level 3 sub-graph sizes go beyond 200k relations (up to a max of 300k relations)
that the average accuracy drops well below the overall average. Looking at the
NELL Level 2 averages breakdown, we see that DistMult models trained with
70-80k relations outperformed the overall average. However, most of the NELL
Level 2 sub-graph ranges performed less well than the overall average.
5.1</p>
      <p>Correlating Model Accuracy with Graph Features
The results discussed so far show that there are some scenarios where the models
trained with a subgraph outperform those trained with the overall graph in the
prediction of speci c link types. Why might this be? Is there some feature of these
subgraphs that distinguishes them from others? To examine this, the average
clustering coe cient and the density were calculated and recorded alongside the
other results. The charts in Figure 5 show the variation in both the average
clustering coe cient and the density of the subgraphs against the accuracy of
the trained ComplEx model (using MRR FIL for accuracy).</p>
      <p>Table 6 below summarises the observations from these charts. There is a
slight trend of models showing higher accuracy being those trained with
subgraphs of a lower clustering coe cient. The density of the sub-graphs remains
relatively constant regardless of the resulting accuracy of the trained models.</p>
      <p>It would seem that the density of the sub-graph is independent of the
accuracy of the model trained on that data. The additional relations in the subgraphs
of higher density do not seem to be providing any additional value in training
the model. With clustering coe cient, there is a slight pattern of the subgraphs
used to train models with higher accuracy having a lower average clustering
coefcient. However, the pattern is not strong enough to draw con dent conclusions
from.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>The ndings from the experiments show a pattern where models trained with
sub-graphs of certain sizes either outperformed the model trained with the entire
graph or at least showed a decrease in accuracy that was signi cantly smaller
than the decrease in the amount of training data used. However, there was no
consistent pattern in terms of the optimal sub-graph to use to train the model.
What is it then that causes some \levels" of sub-graph to perform better than
others in training the model? Two speci c graph properties were identi ed as
potential candidates - average clustering coe cient and density; however our results
do not point towards either property being clearly correlated with the accuracy
of the resulting trained model. Further work is required to look at additional
graph features/characteristics that may be correlated with model accuracy, such
as Average Node Degree, Edge Betweenness or Graph Connectivity. Finally,
edge/link direction could be considered in the selection of the sub-graph to train
the model. The experiments in this work used sub-graph levels that consisted of
adjacent links/relations that were extracted independently of the direction of the
relation. Future work will examine if relation directionality should be considered
when constructing sub-graphs.</p>
      <p>Acknowledgements: This publication has emanated from research
supported in part by a research grant from Science Foundation Ireland (SFI) under
Grant Number SFI/12/RC/2289 P2, co-funded by the European Regional
Development Fund.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Adamic</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adar</surname>
          </string-name>
          , E.:
          <article-title>Friends and neighbors on the web</article-title>
          .
          <source>Social Networks</source>
          <volume>25</volume>
          ,
          <issue>211</issue>
          {
          <volume>230</volume>
          (07
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobilarov</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ives</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>DBpedia: A Nucleus for a Web of Open Data</article-title>
          .
          <source>In: The Semantic Web</source>
          , pp.
          <volume>722</volume>
          {
          <fpage>735</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (11
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bollacker</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturge</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , J.:
          <source>Freebase. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08</source>
          . p.
          <fpage>1247</fpage>
          . ACM Press (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usunier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Duran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yakhnenko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Translating embeddings for modeling multi-relational data</article-title>
          .
          <source>In: Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2</source>
          . pp.
          <volume>2787</volume>
          {
          <fpage>2795</fpage>
          . NIPS'
          <volume>13</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Collobert</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Learning structured embeddings of knowledge bases</article-title>
          .
          <source>In: Twenty-Fifth AAAI Conference on Arti cial Intelligence</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Brightwell</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winkler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Maximum hitting time for random walks on graphs</article-title>
          .
          <source>Random Structures &amp; Algorithms</source>
          <volume>1</volume>
          (
          <issue>3</issue>
          ),
          <volume>263</volume>
          {
          <fpage>276</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Carlson</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Betteridge</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kisiel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Settles</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hruschka</surname>
          </string-name>
          , Jr., E.R., Mitchell, T.M.:
          <article-title>Toward an architecture for never-ending language learning</article-title>
          .
          <source>In: Proceedings of the Twenty-Fourth AAAI Conference on Arti cial Intelligence</source>
          . pp.
          <volume>1306</volume>
          {
          <fpage>1313</fpage>
          . AAAI'
          <volume>10</volume>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Grimes</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Unstructured Data and the 80 Percent Rule (</article-title>
          <year>2008</year>
          ), http://breakthroughanalysis.com/
          <year>2008</year>
          /08/01/ unstructured
          <article-title>-data-and-</article-title>
          <string-name>
            <surname>the-</surname>
          </string-name>
          80
          <string-name>
            <surname>-</surname>
          </string-name>
          percent-rule/
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Jeh</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Widom</surname>
          </string-name>
          , J.:
          <article-title>Simrank: A measure of structural-context similarity</article-title>
          .
          <source>In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          . pp.
          <volume>538</volume>
          {
          <fpage>543</fpage>
          . KDD '
          <volume>02</volume>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>L.:</given-names>
          </string-name>
          <article-title>A new status index derived from sociometric analysis</article-title>
          .
          <source>Psychometrika</source>
          <volume>18</volume>
          (
          <issue>1</issue>
          ),
          <volume>39</volume>
          {43 (Mar
          <year>1953</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Krishnan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Making search on Amazon easier (</article-title>
          <year>8 2018</year>
          ), https://blog. aboutamazon.com/innovation/making-search-easier
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Liben-Nowell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kleinberg</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The link prediction problem for social networks</article-title>
          .
          <source>In: Proceedings of the Twelfth International Conference on Information and Knowledge Management</source>
          . pp.
          <volume>556</volume>
          {
          <fpage>559</fpage>
          . CIKM '
          <volume>03</volume>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>An information-theoretic de nition of similarity</article-title>
          .
          <source>In: Proceedings of the Fifteenth International Conference on Machine Learning</source>
          . pp.
          <volume>296</volume>
          {
          <fpage>304</fpage>
          . ICML '
          <volume>98</volume>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>G.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          , G.:
          <article-title>WordNet: a lexical database for English</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>38</volume>
          (
          <issue>11</issue>
          ),
          <volume>39</volume>
          {
          <volume>41</volume>
          (11
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Mohamed</surname>
            ,
            <given-names>S.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Novacek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Link Prediction Using Multi Part Embeddings</article-title>
          .
          <source>In: The Semantic Web - 16th International Conference</source>
          , fESWCg
          <year>2019</year>
          , Portoroz, Slovenia, June 2-6,
          <year>2019</year>
          , Proceedings. pp.
          <volume>240</volume>
          {
          <fpage>254</fpage>
          .
          <string-name>
            <given-names>NUI</given-names>
            <surname>Galway</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Navigli</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.:</given-names>
          </string-name>
          <article-title>BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>193</volume>
          ,
          <fpage>217</fpage>
          {
          <volume>250</volume>
          (12
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Nickel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tresp</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrilovich</surname>
          </string-name>
          , E.:
          <article-title>A review of relational machine learning for knowledge graphs</article-title>
          .
          <source>Proceedings of the IEEE</source>
          <volume>104</volume>
          (
          <issue>1</issue>
          ),
          <volume>11</volume>
          {
          <fpage>33</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nickel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosasco</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poggio</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Holographic Embeddings of Knowledge Graphs</article-title>
          .
          <source>In: Proceedings of the Thirtieth AAAI Conference on Arti cial Intelligence</source>
          . pp.
          <year>1955</year>
          {
          <year>1961</year>
          . AAAI'
          <fpage>16</fpage>
          , AAAI Press (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Nickel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tresp</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kriegel</surname>
            ,
            <given-names>H.P.:</given-names>
          </string-name>
          <article-title>A three-way model for collective learning on multi-relational data</article-title>
          .
          <source>In: Proceedings of the 28th International Conference on International Conference on Machine Learning</source>
          . pp.
          <volume>809</volume>
          {
          <fpage>816</fpage>
          . ICML'
          <volume>11</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Panetta</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>Gartner: 5 Trends Emerge in the Gartner Hype Cycle for Emerging Technologies, 2018 - Smarter With Gartner (</article-title>
          <year>2018</year>
          ), https://www.gartner.com/smarterwithgartner/ 5
          <article-title>-trends-emerge-in-gartner-hype-cycle-for-emerging-technologies-2018/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Richardson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domingos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Markov logic networks</article-title>
          .
          <source>Machine Learning 62(1-2 SPEC. ISS.)</source>
          ,
          <volume>107</volume>
          {
          <fpage>136</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Rossi</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mcdowell</surname>
            ,
            <given-names>L.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aha</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neville</surname>
          </string-name>
          , J.:
          <article-title>Transforming Graph Data for Statistical Relational Learning</article-title>
          .
          <source>Journal of Arti cial Intelligence Research</source>
          <volume>45</volume>
          ,
          <volume>363</volume>
          {
          <fpage>441</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Singhal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Introducing the knowledge graph: things, not strings</article-title>
          .
          <source>O cial google blog</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24. Tensor Flow Team,
          <string-name>
            <surname>G.R.</surname>
          </string-name>
          :
          <source>TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems</source>
          (
          <year>2015</year>
          ), www.tensorflow.org.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Observed versus latent features for knowledge base and text inference</article-title>
          .
          <source>In: 3rd Workshop on Continuous Vector Space Models and Their Compositionality. ACL - Association for Computational Linguistics (July</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Trouillon</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welbl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riedel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaussier</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bouchard</surname>
          </string-name>
          , G.:
          <article-title>Complex embeddings for simple link prediction</article-title>
          .
          <source>In: Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48</source>
          . pp.
          <year>2071</year>
          {
          <year>2080</year>
          . ICML'
          <volume>16</volume>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Knowledge graph embedding: A survey of approaches and applications</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>29</volume>
          (
          <issue>12</issue>
          ),
          <volume>2724</volume>
          {2743 (Dec
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>West</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrilovich</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Knowledge base completion via search-based question answering</article-title>
          .
          <source>In: Proceedings of the 23rd International Conference on World Wide Web</source>
          . pp.
          <volume>515</volume>
          {
          <fpage>526</fpage>
          . WWW '
          <volume>14</volume>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yih</surname>
          </string-name>
          , S.W.t.,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Embedding entities and relations for learning and inference in knowledge bases</article-title>
          .
          <source>In: Proceedings of the International Conference on Learning Representations (ICLR)</source>
          <year>2015</year>
          (May
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            <given-names></given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.C.</surname>
          </string-name>
          :
          <article-title>Predicting missing links via local information</article-title>
          .
          <source>The European Physical Journal B</source>
          <volume>71</volume>
          (
          <issue>4</issue>
          ),
          <volume>623</volume>
          {630 (Oct
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>