<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Entity Enrichment by Leveraging Multilingual Descriptions for Link Prediction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Genet Asefa Gesese</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mehwish Alam</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Harald Sack</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FIZ Karlsruhe</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Karlsruhe Institute of Technology, Institute AIFB</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Leibniz Institute for Information Infrastructure</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Most Knowledge Graphs (KGs) contain textual descriptions of entities in various natural languages. These descriptions of entities provide valuable information that may not be explicitly represented in the structured part of the KG. Based on this fact, some link prediction methods which make use of the information presented in the textual descriptions of entities have been proposed to learn representations of (monolingual) KGs. However, these methods use entity descriptions in only one language and ignore the fact that descriptions given in di erent languages may provide complementary information and thereby also additional semantics. In this position paper, the problem of e ectively leveraging multilingual entity descriptions for the purpose of link prediction in KGs will be discussed along with potential solutions to the problem.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Knowledge Graphs (KGs) such as Freebase [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], DBpedia [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], and Wikidata [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]
have been created in order to share linked data which describes entities and the
relationships between them. The availability of these various cross-domain KGs
has sparked interest in undertaking research directions such as KG completion
using tasks like link prediction. Hence, di erent Knowledge Graph Embedding
(KGE) approaches, which map KGs to a low dimensional vector space, based
on a link prediction task have been published. Link prediction is widely used
because in the Open World Assumption the knowledge explicitly represented in
a KG is never complete, there are always missing facts which can be predicted
using link prediction.
      </p>
      <p>
        DistMult [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and ConvE [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] are among those KGE models which are trained
on a link prediction task but without making use of the textual descriptions of
entities. On the other hand, there are some models such as DKRL [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] which
leverage the textual descriptions of entities for the link prediction task on
(monolingual) datasets like FB15K [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and FB15K-237 [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and have demonstrated that
using the textual descriptions enhance the embeddings of entities [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However,
Copyright © 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
despite the fact that the entities in these KGs have multilingual entity
descriptions, all the existing models which use descriptions of entities focus on using
descriptions written in only one natural language.
      </p>
      <p>
        In most of the popular KGs, a single entity can have descriptions in two or
more languages where the contents of the descriptions are di erent. This fact is
demonstrated in Figure 1 using, as an example, a triple from FB15K with some
of the descriptions of its head and tail entities extracted from Freebase. In this
example, it can be seen that, for both the head (`m.02rcdc2') and tail (`m.019f4v')
entities, the description provided in one language contains information that is
not available in the description given in the other language. Hence, a KGE
model which uses entity descriptions only in one language discards the extra
information provided in the descriptions in other languages.
Few attempts have been made to combine the structured part of KGs with entity
descriptions to learn KGE models. Among these models, DKRL [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], MKBE [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
and Jointly [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] use neural network encoders (either CNN or LSTM) to represent
entity descriptions. The other models are SSP [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and LiteralE [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] which rely on
document embedding approaches to get representations for entity descriptions.
In DKRL, CNN is used to encode entity descriptions using word embeddings as
an input. MKBE, same as in DKRL, uses CNN to encode textual descriptions of
entities. The descriptions that are used in both DKRL and MKBE are provided
in only one language (English).
      </p>
      <p>
        Jointly [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] is a KGE method which combines structural and textual
encoding as in DKRL but using (attentive) LSTM encoder instead of CNN. In this
approach, the embedding of a word is initialized by taking the average of the
embeddings of the entities whose description include this word. Initialising in this
way does not work well for multilingual descriptions because it is not capable
of capturing words which are from di erent languages but semantically similar
and are not linked to the same set of entities.
      </p>
      <p>
        SSP is another KGE approach which jointly learns from structured
information and entity descriptions. This method adopts the Non-negative Matrix
Factorization (NMF) topic model to generate a representation for an entity based
on its description, i.e., treating each entity description as a document and taking
the topic distribution of the document as the representation of the corresponding
entity. However, the approach would not perform well with multilingual entity
descriptions since the adopted topic model does not deal with multilinguality. In
LiteralE, entity descriptions are represented using a document embedding
technique proposed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. This document embedding technique works by rst
mapping the whole document (i.e., entity description) and also every word present
in the document into corresponding unique vectors and then taking the average
or concatenation of the paragraph vector and word vectors so as to predict the
next word in a context.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Proposed Methodology</title>
      <p>In order to address the issues with the existing KGE models in using
multilingual descriptions, this study provides the following insights into the potential
solutions.
3.1</p>
      <sec id="sec-2-1">
        <title>Applying Language Translators</title>
        <p>The straight forward way to incorporate multilingual entity descriptions in the
existing neural network encoder based KGE models (i.e., DKRL, MKBE, and
Jointly) is rst to convert all the descriptions into one language (English) with
a language translator and then to pass as inputs to the encoder pre-trained
embeddings of the words present in the descriptions. The pre-trained word
embeddings can be obtained from any monolingual word embedding model. The
main challenge with this method is the errors that occur during machine
translation (converting multilingual descriptions into one language) will be propagated
to the encoder. One way to address this issue is to use the multilingual word
embeddings instead of applying machine translation.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Using Multilingual Word Embeddings</title>
        <p>
          KDCoE [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] is a KGE approach which leverages a weakly aligned multilingual KG
for semi-supervised cross-lingual learning using descriptions of entities. With this
approach, the authors have demonstrated that a very good performance can
be achieved for an entity alignment task by using an Attentive Gated
Recurrent Unit encoder (AGRU) to encode multilingual descriptions with multilingual
word embeddings as inputs. The multilingual embedding model used is a
crosslingual Bilbowa word embedding [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] trained on the cross-lingual parallel corpora
Europarl v7 [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] and monolingual corpora of Wikipedia dump. The results from
KDCoE show that multilingual text encoders can bene t from multilingual word
embeddings. It would also be interesting to adopt the same approach, which is
used to encode multilingual entity descriptions for cross-lingual entity alignment
task in KDCoE, for a link prediction task on di erent monolingual datasets such
as FB15K and FB15K-237. This approach allows to capture as much information
as possible from entity descriptions present in multiple languages.
        </p>
        <p>Furthermore, the existing models such as DKRL and Jointly can be
improved by leveraging multilingual entity descriptions by passing as inputs to
the encoders the embeddings of the words in the descriptions obtained by a
multilingual word embedding model like MUSE 3. For instance, Figure 2 shows
how the CNN encoder part of DKRL can be modi ed to take pre-trained word
embeddings from multilingual descriptions as inputs.
4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>In this position paper, the problem of leveraging multilingual entity descriptions
for link prediction task on KGs is discussed. As mentioned in Section 1 and
Section 2, the available link prediction models on monolingual datasets such as
FB15K-237 use only monolingual entity descriptions and ignore the fact that the
descriptions in other languages may contain additional semantics. Thus, in this
study, some insights into potential solutions to this problem are provided. These
solutions enable the existing link prediction models to leverage multilingual
entity descriptions. The solutions proposed in this study for link prediction task
can also be adopted for other KG completion tasks such as triple classi cation
and entity classi cation. In order to come up with an even better solution to the
problem, conducting detailed analysis on the nature and quality of the
multilingual entity descriptions available in di erent KGs such as DBpedia, Wikidata,
and Freebase would be bene cial.
3 https://github.com/facebookresearch/MUSE</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bollacker</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Evans</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paritosh</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sturge</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , J.:
          <article-title>Freebase: a collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data</source>
          . pp.
          <volume>1247</volume>
          {
          <issue>1250</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usunier</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Duran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yakhnenko</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Translating Embeddings for Modeling Multi-Relational Data</article-title>
          . In: NIPS (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skiena</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaniolo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Co-training embeddings of knowledge graphs and entity descriptions for cross-lingual entity alignment</article-title>
          . arXiv preprint arXiv:
          <year>1806</year>
          .
          <volume>06478</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Dettmers</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Minervini</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stenetorp</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riedel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Convolutional 2d knowledge graph embeddings</article-title>
          .
          <source>In: Thirty-Second AAAI Conference on Arti cial Intelligence</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gesese</surname>
            ,
            <given-names>G.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Biswas</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alam</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sack</surname>
          </string-name>
          , H.:
          <article-title>A survey on knowledge graph embeddings with literals: Which model links better literal-ly? arXiv preprint</article-title>
          arXiv:
          <year>1910</year>
          .
          <volume>12507</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gouws</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
          </string-name>
          , G.:
          <article-title>Bilbowa: Fast bilingual distributed representations without word alignments (</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Koehn</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Europarl: A parallel corpus for statistical machine translation</article-title>
          .
          <source>In: MT summit</source>
          . vol.
          <volume>5</volume>
          , pp.
          <volume>79</volume>
          {
          <fpage>86</fpage>
          .
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kristiadi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lukovnikov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Incorporating literals into knowledge graph embeddings</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>347</volume>
          {
          <fpage>363</fpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Le</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Distributed representations of sentences and documents</article-title>
          .
          <source>In: International conference on machine learning</source>
          . pp.
          <volume>1188</volume>
          {
          <issue>1196</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isele</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jakob</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jentzsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontokostas</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendes</surname>
            ,
            <given-names>P.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hellmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morsey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Kleef</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>Dbpedia{a large-scale, multilingual knowledge base extracted from wikipedia</article-title>
          .
          <source>Semantic Web</source>
          <volume>6</volume>
          (
          <issue>2</issue>
          ),
          <volume>167</volume>
          {
          <fpage>195</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Pezeshkpour</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Embedding multimodal relational data for knowledge base completion</article-title>
          .
          <source>In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <volume>3208</volume>
          {
          <fpage>3218</fpage>
          .
          <article-title>Association for Computational Linguistics (Oct-Nov</article-title>
          <year>2018</year>
          ), https://www.aclweb.org/anthology/D18-1359
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Observed versus latent features for knowledge base and text inference</article-title>
          .
          <source>In: Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality</source>
          . pp.
          <volume>57</volume>
          {
          <fpage>66</fpage>
          . Association for Computational Linguistics (
          <year>Jul 2015</year>
          ), https://www.aclweb.org/anthology/W15-4007
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Vrandecic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Krotzsch, M.:
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>57</volume>
          (
          <issue>10</issue>
          ),
          <volume>78</volume>
          {
          <fpage>85</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Ssp: semantic space projection for knowledge graph embedding with text descriptions</article-title>
          .
          <source>In: Thirty-First AAAI Conference on Arti cial Intelligence</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Xie</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Representation learning of knowledge graphs with entity descriptions</article-title>
          .
          <source>In: Thirtieth AAAI Conference on Arti cial Intelligence</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qiu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Knowledge graph representation with jointly structural and textual encoding</article-title>
          . pp.
          <volume>1318</volume>
          {
          <issue>1324</issue>
          (08
          <year>2017</year>
          ). https://doi.org/10.24963/ijcai.
          <year>2017</year>
          /183
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yih</surname>
          </string-name>
          , W.t.,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Embedding entities and relations for learning and inference in knowledge bases</article-title>
          .
          <source>arXiv preprint arXiv:1412.6575</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>