<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>RDF Graph Embeddings for Content-based Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jessica Rosati1;2</string-name>
          <email>jessica.rosati@unicam.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Petar Ristoski</string-name>
          <email>petar.ristoski@informatik.uni-</email>
          <email>petar.ristoski@informatik.unimannheim.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommaso Di Noia Renato De Leone</string-name>
          <email>renato.deleone@unicam.it</email>
          <email>tommaso.dinoia@poliba.it</email>
          <email>tommaso.dinoia@poliba.it renato.deleone@unicam.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heiko Paulheim</string-name>
          <email>heiko@informatik.uni-</email>
          <email>heiko@informatik.unimannheim.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>1University of Camerino -</institution>
          ,
          <addr-line>Piazza Cavour 19/f - 62032, Camerino</addr-line>
          ,
          <country country="IT">Italy</country>
          ,
          <institution>2Polytechnic University of Bari</institution>
          ,
          <addr-line>- Via Orabona, 4 - 70125, Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Data and Web Science Group, University of Mannheim</institution>
          ,
          <addr-line>B6, 26, 68159 Mannheim</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Polytechnic University of Bari University of Camerino -, - Via Orabona</institution>
          ,
          <addr-line>4 - 70125 Piazza Cavour 19/f - 62032, Bari, Italy Camerino</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>recommendation approaches is that the information on which Linked Open Data has been recognized as a useful source they rely is generally insu cient to elicit user's interests and of background knowledge for building content-based rec- characterize all the aspects of her interaction with the sysommender systems. Vast amount of RDF data, covering tem. This is the main drawback of the approaches built multiple domains, has been published in freely accessible on textual and keyword-based representations, which candatasets. In this paper, we present an approach that uses not capture complex relations among objects since they lack language modeling approaches for unsupervised feature ex- the semantics associated to their attributes. A process of traction from sequences of words, and adapts them to RDF \knowledge infusion" [40] and semantic analysis has been graphs used for building content-based recommender sys- proposed to face this issue, and numerous approaches that tem. We generate sequences by leveraging local information incorporate ontological knowledge have been proposed, givfrom graph sub-structures and learn latent numerical rep- ing rise to the newly de ned class of semantics-aware contentresentations of entities in RDF graphs. Our evaluation on based recommender systems [6]. More recently the Linked two datasets in the domain of movies and books shows that Open Data (LOD) initiative [3] has opened new interesting feature vector representations of general knowledge graphs possibilities to realize better recommendation approaches. such as DBpedia and Wikidata can be e ectively used in The LOD initiative in fact gave rise to a variety of open content-based recommender systems. knowledge bases freely accessible on the Web and being part of a huge decentralized knowledge base, the LOD cloud, where each piece of little knowledge is enriched by links to reCategories and Subject Descriptors lated data. LOD is an open, interlinked collection of datasets H.3.3 [Information Systems]: Information Search and Re- in machine-interpretable form, built on World Wide Web trieval Consortium (W3C) standards as RDF1, and SPARQL2. Currently the LOD cloud consists of about 1; 000 interlinked</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for profit or commercial advantage and that copies bear this notice and the full
citation on the first page. Copyrights for components of this work owned by others than
ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or
republish, to post on servers or to redistribute to lists, requires prior specific permission
and/or a fee. Request permissions from permissions@acm.org.</p>
      <p>CBRecSys 2016, September 16, 2016, Boston, MA, USA.</p>
      <p>Copyright remains with the authors and/or original copyright holders
c 2016 ACM. ISBN 978-1-4503-2138-9.</p>
      <p>DOI: 10.1145/1235
posed approaches in the literature are supervised or
semisupervised, which means cannot work without human
interaction.</p>
      <p>In this work, we adapt language modeling approaches for
latent representation of entities in RDF graphs. To do so, we
rst convert the graph into a set of sequences of entities
using graph walks. In the second step, we use those sequences
to train a neural language model, which estimates the
likelihood of a sequence of entities appearing in the graph. Once
the training is nished, each entity in the graph is
represented with a vector of latent numerical values. Projecting
such latent representation of entities into a lower
dimensional feature space shows that semantically similar entities
appear closer to each other. Such entity vectors can be
directly used in a content-based recommender system.</p>
      <p>
        In this work, we utilize two of the most prominent RDF
knowledge graphs [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], i.e. DBpedia [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and Wikidata [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ].
DBpedia is a knowledge graph which is extracted from
structured data in Wikipedia. The main source for this extraction
are the key-value pairs in the Wikipedia infoboxes.
Wikidata is a collaboratively edited knowledge graph, operated
by the Wikimedia foundation3 that also hosts various
language editions of Wikipedia.
      </p>
      <p>The rest of this paper is structured as follows. In
Section 2, we give an overview of related work. In Section 3, we
introduce our approach, followed by an evaluation in
Section 4. We conclude with a summary and an outlook on
future work.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>
        It has been shown that LOD can improve recommender
systems towards a better understanding and representation
of user preferences, item features, and contextual signs they
deal with. LOD has been used in content-based,
collaborative, and hybrid techniques, in various recommendation
tasks, i.e., rating prediction, top-N recommendations and
improving of diversity in content-based recommendations.
LOD datasets, e.g. DBpedia, have been used in
contentbased recommender systems in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. The former
performs a semantic expansion of the item content based on
ontological information extracted from DBpedia and
LinkedMDB [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], the rst open semantic web database for movies,
and tries to derive implicit relations between items. The
latter involves DBpedia and LinkedMDB too, but is an
adaptation of the Vector Space Model to Linked Open Data: it
represents the RDF graph as a 3-dimensional tensor where each
slice is an ontological property (e.g. starring, director,...)
and represents its adjacency matrix. It has been proven
that leveraging LOD datasets is also e ective for hybrid
recommender systems [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], that is in those approaches that
boost the collaborative information with additional
knowledge, such as the item content. In [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] the authors propose
SPRank, a hybrid recommendation algorithm that extracts
semantic path-based features from DBpedia and uses them
to compute top-N recommendations in a learning to rank
approach and in multiple domains, movies, books and
musical artists. SPRank is compared with numerous
collaborative approaches based on matrix factorization [
        <xref ref-type="bibr" rid="ref17 ref34">17, 34</xref>
        ]
and with other hybrid RS, such as BPR-SSLIM [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], and
exhibits good performance especially in those contexts
characterized by high sparsity, where the contribution of the
      </p>
      <sec id="sec-2-1">
        <title>3http://wikimediafoundation.org/</title>
        <p>
          content becomes essential. Another hybrid approach is
proposed in [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ], which builds on training individual base
recommenders and using global popularity scores as generic
recommenders. The results of the individual recommenders are
combined using stacking regression and rank aggregation.
Most of these approaches can be referred to as top-down
approaches [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], since they rely on the integration of external
knowledge and cannot work without human intervention.
On the other side, bottom-up approaches ground on the
distributional hypothesis [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] for language modeling, according
to which the meaning of words depends on the context in
which they occur, in some textual content. The resulting
strategy is therefore unsupervised, requiring a corpora of
textual documents for training as large as possible.
Approaches based on the distributional hypothesis, referred to
as discriminative models, behave as word embeddings
techniques where each term (and document) becomes a point
in the vector space. They substitute the term-document
matrix typical of Vector Space Model with a term-context
matrix on which they apply dimensionality reduction
techniques such as Latent Semantic Indexing (LSI) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] and the
more scalable and incremental Random Indexing (RI) [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ].
The latter has been involved in [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] and [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] to de ne the
so called enhanced Vector Space Model (eVSM) for
contentbased RS, where user's pro le is incrementally built
summing the features vectors representing documents liked by
the user and a negation operator is introduced to take into
account also negative preferences.
        </p>
        <p>
          Word embedding techniques are not limited to LSI and RI.
The word2vec strategy has been recently presented in [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]
and [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], and to the best of our knowldge, has been applied
to item recommendations in a few works [
          <xref ref-type="bibr" rid="ref21 ref28">21, 28</xref>
          ]. In
particular, [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] is an empirical evaluation of LSI, RI and word2vec
to make content-based movie recommendation exploiting
textual information from Wikipedia, while [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] deals with
check-in venue (location) recommendations and adds a
nontextual feature, the past check-ins of the user. They both
draw the conclusion that word2vec techniques are promising
for the recommendation task. Finally there is a single
example of product embedding [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], namely prod2vec, which
operates on the arti cial graph of purchases, treating a purchase
sequence as a \sentence" and products within the sequence
as words.
3.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>APPROACH</title>
      <p>
        In our approach, we adapt neural language models for
RDF graph embeddings. Such approaches take advantage
of the word order in text documents, explicitly modeling
the assumption that closer words in the word sequence are
statistically more dependent. In the case of RDF graphs, we
follow the approach sketched in [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ], considering entities and
relations between entities instead of word sequences. Thus,
in order to apply such approaches on RDF graph data, we
have to transform the graph data into sequences of entities,
which can be considered as sentences. After the graph is
converted into a set of sequences of entities, we can train
the same neural language models to represent each entity in
the RDF graph as a vector of numerical values in a latent
feature space. Such entity vectors can be directly used in a
content-based recommender system.
3.1
      </p>
    </sec>
    <sec id="sec-4">
      <title>RDF Graph Sub-Structures Extraction</title>
      <p>We propose random graph walks as an approach for
conwords w1; w2; w3; :::; wT , and a context window c, the
objective of the CBOW model is to maximize the average log
probability:</p>
      <p>T
1 X log p(wtjwt c:::wt+c); (1)</p>
      <p>T t=1
where the probability p(wtjwt c:::wt+c) is calculated using
the softmax function:
exp(vT vw0t )
p(wtjwt c:::wt+c) = PVw=1 exp(vT vw0) ; (2)
where vw0 is the output vector of the word w, V is the
complete vocabulary of words, and v is the averaged input vector
of all the context words:</p>
      <p>v = 21c X vwt+j (3)
3.2.2</p>
      <p>c j c;j6=0</p>
      <p>Skip-Gram Model</p>
      <p>The Skip-Gram model does the inverse of the CBOW
model and tries to predict the context words from the
target words. More formally, given a sequence of training words
w1; w2; w3; :::; wT , and a context window c, the objective of
the skip-gram model is to maximize the following average
log probability:</p>
      <p>T t=1 c j c;j6=0
where the probability p(wt+jjwt) is calculated using the
softmax function:
log p(wt+jjwt);</p>
      <p>T
1 X</p>
      <p>X
(4)
(5)
verting graphs into a set of sequences of entities.</p>
      <p>Definition 1. An RDF graph is a graph G = (V, E),
where V is a set of vertices, and E is a set of directed edges.</p>
      <p>The objective of the conversion functions is for each vertex
v 2 V to generate a set of sequences Sv, where the rst
token of each sequence s 2 Sv is the vertex v followed by a
sequence of tokens, which might be edges, vertices, or any
substructure extracted from the RDF graph, in an order
that re ects the relations between the vertex v and the rest
of the tokens, as well as among those tokens.</p>
      <p>In this approach, for a given graph G = (V; E), for each
vertex v 2 V we generate all graph walks Pv of depth d
rooted in the vertex v. To generate the walks, we use the
breadth- rst algorithm. In the rst iteration, the algorithm
generates paths by exploring the direct outgoing edges of the
root node vr. The paths generated after the rst iteration
will have the following pattern vr -&gt;e1i, where i 2 E(vr).
In the second iteration, for each of the previously explored
edges the algorithm visits the connected vertices. The paths
generated after the second iteration will follow the following
pattern vr -&gt;e1i -&gt;v1i. The algorithm continues until d
iterations are reached. The nal set of sequences for the
given graph G is the union of the sequences of all the vertices
Sv2V Pv.
3.2</p>
    </sec>
    <sec id="sec-5">
      <title>Neural Language Models – word2vec</title>
      <p>Until recently, most of the Natural Language Processing
systems and techniques treated words as atomic units,
representing each word as a feature vector using a one-hot
representation, where a word vector has the same length as the
size of a vocabulary. In such approaches, there is no notion of
semantic similarity between words. While such approaches
are widely used in many tasks due to their simplicity and
robustness, they su er from several drawbacks, e.g., high
dimensionality and severe data sparsity, which limit the
performance of such techniques. To overcome such limitations,
neural language models have been proposed, inducing
lowdimensional, distributed embeddings of words by means of
neural networks. The goal of such approaches is to estimate
the likelihood of a speci c sequence of words appearing in a
corpus, explicitly modeling the assumption that closer words
in the word sequence are statistically more dependent.</p>
      <p>
        While some of the initially proposed approaches su ered
from ine cient training of the neural network models, with
the recent advancements in the eld several e cient
approaches has been proposed. One of the most popular and
widely used is the word2vec neural language model [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ].
Word2vec is a particularly computationally-e cient two-layer
neural net model for learning word embeddings from raw
text. There are two di erent algorithms, the Continuous
Bag-of-Words model (CBOW) and the Skip-Gram model.
3.2.1
      </p>
      <sec id="sec-5-1">
        <title>Continuous Bag-of-Words Model</title>
        <p>The CBOW model predicts target words from context
words within a given window.The input layer is comprised
from all the surrounding words for which the input vectors
are retrieved from the input weight matrix, averaged, and
projected in the projection layer. Then, using the weights
from the output weight matrix, a score for each word in the
vocabulary is computed, which is the probability of the word
being a target word. Formally, given a sequence of training
p(wojwi) =</p>
        <p>exp(vw0Tovwi)
PVw=1 exp(vw0T vwi)
;
where vw and vw0 are the input and the output vector of the
word w, and V is the complete vocabulary of words.</p>
        <p>
          In both cases, calculating the softmax function is
computationally ine cient, as the cost for computing is
proportional to the size of the vocabulary. Therefore, two
optimization techniques have been proposed, i.e., hierarchical
softmax and negative sampling [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. The empirical studies
show that in most cases negative sampling leads to better
performances than hierarchical softmax, which depends on
the selected negative samples, but it has higher runtime.
        </p>
        <p>Once the training is nished, semantically similar words
appear close to each other in the feature space. Furthermore,
basic mathematical functions can be performed on the
vectors, to extract di erent relations between the words.
4.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>EVALUATION</title>
      <p>We evaluate di erent variants of our approach on two
distinct datasets, and compare them to common approaches
for creating content-based item representations from LOD
and with state of the art collaborative approaches.
Furthermore, we investigate the use of two di erent LOD datasets
as background knowledge, i.e., DBpedia and Wikidata.
4.1</p>
    </sec>
    <sec id="sec-7">
      <title>Datasets</title>
      <p>In order to test the e ectiveness of our proposal, we
evaluate it in terms of ranking accuracy and aggregate diversity
on two datasets belonging to di erent domains, i.e.
Movielens 1M4 for movies and LibraryThing5 for books. The</p>
      <sec id="sec-7-1">
        <title>4http://grouplens.org/datasets/movielens/</title>
        <p>
          5https://www.librarything.com/
former contains 1 million 1-5 stars ratings from 6,040 users
on 3,883 movies. The LibraryThing dataset contains more
than 2 millions ratings from 7,564 users on 39,515 books.
As there are many duplicated ratings in the dataset, when
a user has rated more than once the same item, we select
her last rating. This choice brings to have 626,000
ratings in the range from 1 to 10. The user-item interactions
contained in the datasets are enriched with side
information thanks to the item mapping and linking to DBpedia
technique detailed in [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], whose dump is available at http:
//sisin ab.poliba.it/semanticweb/lod/recsys/datasets/. In
the attempt to reduce the popularity bias from our nal
evaluation we decided to remove the top 1% most popular
items from both datasets [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Moreover we keep out, from
LibraryThing, users with less than ve ratings and items
rated less than ve times, and to have a dataset
characterized by lower sparsity we retain for Movielens only users
with at least fty ratings, as already done in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Table 1
contains the nal statistics for our datasets.
        </p>
        <p>Number of users
Number of items
Number of ratings
Data sparsity</p>
        <p>Movielens
4,186
3,196
822,597
93.85%</p>
        <p>LibraryThing
7,149
4,541
352,123
98.90%</p>
        <sec id="sec-7-1-1">
          <title>RDF Embeddings</title>
          <p>As RDF datasets we use DBpedia and Wikidata.</p>
          <p>We use the English version of the 2015-10 DBpedia dataset,
which contains 4; 641; 890 instances and 1; 369 mapping-based
properties. In our evaluation we only consider object
properties, and ignore the data properties and literals.</p>
          <p>For the Wikidata dataset we use the simpli ed and
derived RDF dumps from 2016-03-286. The dataset contains
17; 340; 659 entities in total. As for the DBpedia dataset, we
only consider object properties, and ignore the data
properties and literals.
4.2</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Evaluation Protocol</title>
      <p>
        As evaluation protocol for our comparison, we adopted the
all unrated items methodology presented in [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ] and already
used in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Such methodology asks to predict a score for
each item not rated by a user, irrespective of the existence
of an actual rating, and to compare the recommendation list
with the test set.
      </p>
      <p>
        The metrics involved in the experimental comparison are
precision, recall and nDCG as accuracy metrics, and
catalog coverage and Gini coe cient for the aggregate diversity.
precision@N represents the fraction of relevant items in the
top-N recommendations. recall@N indicates the fraction of
relevant items, in the user test set, occurring in the top-N
list. As relevance threshold, we set 4 for Movielens and 8 for
LibraryThing, as previously done in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Although
precision and recall are good indicators to evaluate the accuracy
of a recommendation engine, they are not rank-sensitive.
nDCG@N [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] instead takes into account also the position in
the recommendation list, being de ned as
6http://tools.wm abs.org/wikidata-exports/rdf/index.
php?content=dumpn download.phpn&amp;dump=20160328
(6)
(7)
      </p>
      <p>
        1
iDCG
i=1
where rel(u; i) is a boolean function representing the
relevance of item i for user u and iDCG is a normalization
factor that sets nDCG@ N value to 1 when an ideal ranking
is returned [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. As suggested in [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ] and set up in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], in
the computation of nDCG@N we xed a default \neutral"
value for those items with no ratings, i.e. 3 for Movielens
and 5 for LibraryThing.
      </p>
      <p>
        Providing accurate recommendations has been recognized
as just one of the main task a recommender system must be
able to perform. We therefore evaluate the contribution of
our latent features in terms of aggregate diversity, and more
speci cally by means of catalog coverage and Gini coe
cient [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The catalog coverage represents the percentage of
available candidate items recommended at least once. It is
an important quality dimension for both user and business
perspective [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], since it exhibits the capacity to not settle
just on a subset of items (e.g. the most popular). This
metric however should be supported by a distribution metric
which has to show the ability of a recommendation engine
to equally spread out the recommendations across all users.
Gini coe cient [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is used for this purpose, since it measures
the concentration degree of top-N recommendations across
items and is de ned as
      </p>
      <p>Gini =</p>
      <p>n
2 X
i=1
n + 1
n + 1
i
rec(i)
total</p>
      <p>
        In Equation (7), n is the number of candidate items
available for recommendation, total represents the total
number of top-N recommendations made across all users, and
rec(i) is the number of users to whom item i has been
recommended. Gini coe cient gives therefore an idea of the
\equity" in the distribution of the items. It is worth to
remind that we are following the notion given in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], where
the complement of the standard Gini coe cient is used, so
that higher values correspond to more balanced
recommendations.
4.3
      </p>
    </sec>
    <sec id="sec-9">
      <title>Experimental Setup</title>
      <p>
        The rst step of our approach is to convert the RDF
graphs into a set of sequences. Therefore, to extract the
entities embeddings for the large RDF datasets, we use only
random graph walks entity sequences. More precisely, we
follow the approach presented in [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] to generate only a
limited number of random walks for each entity. For DBpedia,
we experiment with 500 walks per entity with depth of 4
and 8, while for Wikidata, we use only 200 walks per entity
with depth of 4. Additionally, for each entity in DBpedia
and Wikidata, we include all the walks of depth 2, i.e.,
direct outgoing relations. We use the corpora of sequences to
build both CBOW and Skip-Gram models with the
following parameters: window size = 5; number of iterations =
5; negative sampling for optimization; negative samples =
25; with average input vector for CBOW. We experiment
with 200 and 500 dimensions for the entities' vectors. All
the models are publicly available7.
      </p>
      <p>
        We compare our approach to several baselines. For
generating the data mining features, we use three strategies that
7http://data.dws.informatik.uni-mannheim.de/rdf2vec/
take into account the direct relations to other resources in
the graph [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], and two strategies for features derived from
graph sub-structures [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]:
      </p>
      <p>Features derived from generic relations, i.e., we
generate a feature for each incoming (rel in) or outgoing
relation (rel out) of an entity, ignoring the value of the
relation.</p>
      <p>Kernels that count substructures in the RDF graph
around the instance node. These substructures are
explicitly generated and represented as sparse feature
vectors.</p>
      <p>Features derived from speci c relations. In the
experiments we use the relations rdf:type (types), and
dcterms:subject (categories) for datasets linked to
DBpedia.</p>
      <p>Features derived from generic relations-values, i.e, we
generate feature for each incoming (rel-vals in) or
outgoing relation (rel-vals out) of an entity including the
value of the relation.
where ratedItems(u) is the set of items already evaluated
by user u, ru;j indicates the rating for item j by user u
and cosineSim(j; i) is the cosine similarity score between
items j and i. In our experiments, the size of the considered
neighbourhood is limited to 5. The computation of
recommendations has been done with the publicly available library
RankSys9. All the results have been computed @10, that is
considering the top-10 lists recommended to the users:
precision, recall and nDCG are computed for each user and then
averaged across all users, while diversity metrics are global
measures.</p>
      <p>
        Tables 2 and 3 contain the values of precision, recall and
nDCG, respectively for Movielens and LibraryThing, for
each kind of features we want to test. The best approach
for both datasets is retrieved with a Skip-Gram model and
with a size of 200 for vectors built upon DBpedia. For the
sake of truth, on the Movielens dataset the highest value
of precision is achieved using vector size of 500, but the
size 200 is prevalent according to the F1 measure, i.e. the
harmonic mean of precision and recall. A substantial
difference however concerns the exploratory depth of the
random walks, since for Movielens the results related to depth
4 outdo those computed with depth 8, while the tendency
{ The Weisfeiler-Lehman (WL) graph kernel for RDF [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] is reversed for LibraryThing. The advantage of the
Skipcounts full subtrees in the subgraph around the Gram model over the CBOW is a constant both on DBpedia
instance node. This kernel has two parameters, and Wikidata. Moreover, the employment of the Wikidata
the subgraph depth d and the number of itera- RDF dataset turns out to be more e ective for
Librarytions h (which determines the depth of the sub- Thing, where the Skip-Gram vectors with depth 4 exceeds
trees). We use d = 1 and h = 2 and therefore we the corresponding DBpedia vectors. Moving to the features
will indicate this strategy as WL12. extracted from direct relations, the contribution of the
\categories" stands clearly out, together with relations-values
\rel-vals", especially when just incoming relations are
considered. The extraction of features from graph structures,
i.e. WC2 and WL12 approaches, seems not to provide
signi cant advantages to the recommendation algorithm.
{ The Intersection Tree Path kernel for RDF [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
counts the walks in the subtree that span from the
instance node. Only the walks that go through
the instance node are considered. We will
therefore refer to it as the root Walk Count (WC)
kernel. The root WC kernel has one parameter: the
length of the paths l, for which we test 2. This
strategy will be denoted accordingly as WC2.
      </p>
      <p>
        The strategies for creating propositional features from Linked
Open Data are implemented in the RapidMiner LOD
extension8 [
        <xref ref-type="bibr" rid="ref31 ref35">31, 35</xref>
        ].
4.4
      </p>
    </sec>
    <sec id="sec-10">
      <title>Results</title>
      <p>The target of the experimental section of this paper is
two-fold. On the one hand, we want to prove that the
latent features we extracted are able to subsume the other
kind of features in terms of accuracy and aggregate
diversity. On the other hand we aim at qualifying our strategies
as valuable means for the recommendation task, through a
rst comparison with state of the art approaches. Both goals
are pursued implementing an item-based K-nearest-neighbor
method, hereafter denoted as ItemKNN, with cosine
similarity among features vectors. Formally, this method
determines similarities between items through cosine similarity
between relative vectors and then selects a subset of them {
the neighbors { for each item, that will be used to estimate
the rating of user u for a new item i as follows:
r (u; i) =</p>
      <p>X
j2ratedItems(u)
cosineSim(j; i) ru;j
8http://dws.informatik.uni-mannheim.de/en/research/
rapidminer-lod-extension</p>
      <p>To point out that our latent features are able to capture
the structure of the RDF graph, placing closely semantically
similar items, we provide some examples of the neighbouring
sets retrieved using our graph embeddings technique and
used within the ItemKNN. Table 4 is related to movies and
displays that neighboring items are highly relevant and close
to the query item, i.e. the item for which neighbors are
searched for.</p>
      <p>To further analyse the semantics of the vector
representations, we employ Principal Component Analysis (PCA)
to project the \high"-dimensional entities' vectors in a two
dimensional feature space, or 2D scatter plot. For each of
the query movies in Table 4 we visualize the vectors of the
5 nearest neighbors as shown in Figure 1. The gure
illustrates the ability of the model to automatically cluster the
movies.</p>
      <p>
        The impact on the aggregate diversity. As a further
validation of the interactiveness of our latent features for
recommendation task, we report the performances of the ItemKNN
approach in terms of aggregate diversity. The relation
between accuracy and aggregate diversity has gained the
attention of researchers in the last few years and is generally
characterized as a trade-o [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Quite surprisingly, however,
the increase in accuracy, shown in Tables 2 and 3, seems not
to rely on a concentration on a subset of items, e.g. the most
      </p>
      <sec id="sec-10-1">
        <title>9http://ranksys.org/</title>
        <p>Strategy
DB2vec CBOW 200 4
DB2vec CBOW 500 4</p>
        <p>DB2vec SG 200 4</p>
        <p>DB2vec SG 500 4
DB2vec CBOW 200 8
DB2vec CBOW 500 8</p>
        <p>DB2vec SG 200 8</p>
        <p>DB2vec SG 500 8
WD2vec CBOW 200 4
WD2vec CBOW 500 4</p>
        <p>WD2vec SG 200 4
WD2vec SG 500 4</p>
        <p>types
categories
rel in
rel out
rel in &amp; out
rel-vals in
rel-vals out
rel-vals in &amp; out</p>
        <p>WC2</p>
        <p>WL12</p>
        <p>Strategy
DB2vec CBOW 200 4
DB2vec CBOW 500 4</p>
        <p>DB2vec SG 200 4</p>
        <p>DB2vec SG 500 4
DB2vec CBOW 200 8
DB2vec CBOW 500 8</p>
        <p>DB2vec SG 200 8</p>
        <p>DB2vec SG 500 8
WD2vec CBOW 200 4
WD2vec CBOW 500 4</p>
        <p>WD2vec SG 200 4
WD2vec SG 500 4</p>
        <p>types
categories
rel in
rel out
rel in &amp; out
rel-vals in
rel-vals out
rel-vals in &amp; out</p>
        <p>WC2
WL12
popular ones, according to the results proposed in Tables 5
and 6. Here we are reporting, for the sake of concisenesses,
only the best approaches for each kind of features. More
clearly, we are displaying the best approach for latent
features computed on DBpedia, the best approach for latent
features computed on Wikidata and the values for the
strategy involving categories, since it provides the highest scores
among features extracted through direct relations. We are
not reporting the values related to WL12 and WC2
algorithms, since their contribution is rather low also in this
Bambi
Star Trek: Generations</p>
        <p>K Nearest Neighbours
Batman Forever, Batman
Returns, Batman &amp; Robin,
Superman IV: The Quest for
Peace, Dick Tracy
Cinderella, Dumbo, 101
Dalmatians , Pinocchio, Lady and
the Tramp
Star Trek VI: The
Undiscovered Country, Star Trek:
Insurrection, Star Trek III: The
Search for Spock, Star Trek V:
The Final Frontier, Star Trek:</p>
        <p>First Contact (1996)
analysis. For both movies and books domain, the best
approaches found on DBpedia for the accuracy metrics, i.e.
respectively \DB2vec SG 200 4" and \DB2vec SG 200 8",
perform better also in terms of aggregate diversity. For the
LibraryThing dataset the Skip-Gram model computed with
random walks on Wikidata and size vector limited to 200 is
very close to the highest scores retrieved in DBpedia, while
for Movielens is the CBOW model, with depth 4, to gain
the best performance on Wikidata. The contribution of the
categories, despite being lower than the best approach on
each dataset, is quite signi cant for diversity measures too.
Comparison with state of the art collaborative approaches.
It is a quite common belief in the RS eld that using pure
content-based approaches would not be enough to provide
accurate suggestions and that the recommendation engines
must ground on collaborative information too. This
motivated us to explicitly compare the best approaches built on
graph embeddings technique with the well-known state of
the art collaborative recommendation algorithms listed
below, and implemented with the publicly available software
library MyMediaLite10.</p>
        <p>
          Biased Matrix Factorization (MF) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], recognized as
the state of the art for rating prediction, is a
matrix factorization model that minimizes RMSE using
stochastic gradient descent and both user and item
bias.
        </p>
        <p>
          PopRank is a baseline based on popularity. It
recommends the same recommendations to all users
according to the overall items popularity. Recent studies have
point out that recommending the most popular items
could already result in a high performance [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          Bayesian Personalized Ranking (BPRMF) combines a
matrix factorization approach with a Bayesian
Personalized Ranking optimization criterion [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ].
        </p>
        <p>
          SLIM [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] is a Sparse LInear Method for top-N
recommendation that learns a sparse coe cient matrix for
the items involved in the system by only relying on
the users purchase/ratings pro le and by solving a
L1norm and L2-norm regularized optimization problem.
Soft Margin Ranking Matrix Factorization (RankMF)
is a matrix factorization approach for ranking, whose
loss function is ordinal regression [
          <xref ref-type="bibr" rid="ref43">43</xref>
          ].
        </p>
        <p>Tables 7 and 8 provide the comparison results for
Movielens and LibraryThing respectively. Table 7 shows that
matrix factorization techniques and the SLIM algorithm
exceed our approach based only on content information. This
outcome was somehow expected, especially considering that,
in our experimental setting, Movielens dataset retains only
users with at least fty ratings. The community-based
information is unquestionably predominant for this dataset,
whose sparsity would probably be unlikely for most
realworld scenarios. The behaviour however is completely
overturned on the LibraryThing dataset, whose results are
collected in Table 8. In this case, the mere use of our features
vectors (i.e. the \DB2vec SG 200 8" strategy) is able to
outperform the competitor algorithms, which are generally
regarded as the most e cient collaborative algorithms for
both rating and ranking prediction.
10http://www.mymedialite.net</p>
        <p>Strategy
DB2vec SG 200 4</p>
        <p>MF
PopRank
BPRMF</p>
        <p>SLIM
RankMF</p>
        <p>Strategy
DB2vec SG 200 8</p>
        <p>MF
PopRank
BPRMF</p>
        <p>SLIM
RankMF
P@10
0.0768
0.0173
0.0397
0.0449
0.0543
0.0369
R@10
0.1777
0.0209
0.0452
0.0751
0.0988
0.0459
nDCG@10
0.2523
0.1423
0.1598
0.1858
0.2317
0.1714</p>
        <p>In this paper, we have presented an approach for
learning low-dimensional real-valued representations of entities in
RDF graphs, in a completely domain independent way. We
have rst converted the RDF graphs into a set of sequences
using graph walks, which are then used to train neural
language models. In the experimental section we have shown
that a content-based RS relying on the similarity between
items computed according to our latent features vectors,
outdo the same kind of system but grounding on explicit
features (e.g. types, categories,...) or features generated
with the use of kernels, from both perspectives of accuracy
and aggregate diversity. Our purely content-based system
has been further compared to state of the arts collaborative
approaches for rating prediction and item ranking, giving
outstanding results on a dataset with a realistic sparsity
degree.</p>
        <p>
          As future work, we intend to introduce the features
vectors deriving from the graph embeddings technique within a
hybrid recommender system in order to get a fair comparison
against state of the art hybrids approaches such as SPRank
[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and BRP-SSLIM [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ]. In this perspective we could take
advantage of the Factorization Machines [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ], general
predictor working with any features vector, that combine
Support Vector Machines and factorization models. We aim to
extend the evaluation to additional metrics, such as the
individual diversity [
          <xref ref-type="bibr" rid="ref44 ref9">44, 9</xref>
          ], and to provide a deeper insight
into cold-start users, i.e. users with a small interaction with
the system for whom the information inference is di cult to
draw and that generally bene t most of content \infusion".
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Gediminas</given-names>
            <surname>Adomavicius and YoungOk Kwon</surname>
          </string-name>
          .
          <article-title>Improving aggregate recommendation diversity using ranking-based techniques</article-title>
          .
          <source>IEEE Trans. on Knowl. and Data Eng</source>
          .,
          <volume>24</volume>
          (
          <issue>5</issue>
          ):
          <volume>896</volume>
          {
          <fpage>911</fpage>
          , May
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Alejandro</surname>
            <given-names>Bellog n</given-names>
          </string-name>
          , Ivan Cantador, and
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>A comparative study of heterogeneous item recommendations in social systems</article-title>
          .
          <source>Inf. Sci.</source>
          ,
          <volume>221</volume>
          :
          <fpage>142</fpage>
          {
          <fpage>169</fpage>
          ,
          <year>February 2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          , Tom Heath, and
          <string-name>
            <surname>Tim</surname>
          </string-name>
          Berners-Lee.
          <article-title>Linked Data { The Story So Far</article-title>
          .
          <source>International journal on semantic web and information systems</source>
          ,
          <volume>5</volume>
          (
          <issue>3</issue>
          ):1{
          <fpage>22</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Robin</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>Hybrid recommender systems: Survey and experiments</article-title>
          .
          <source>User Modeling</source>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>12</volume>
          (
          <issue>4</issue>
          ):
          <volume>331</volume>
          {
          <fpage>370</fpage>
          ,
          <string-name>
            <surname>November</surname>
          </string-name>
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          , Yehuda Koren, and
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Turrin</surname>
          </string-name>
          .
          <article-title>Performance of recommender algorithms on top-n recommendation tasks</article-title>
          .
          <source>In Proceedings of the Fourth ACM Conference on Recommender Systems, RecSys '10</source>
          , pages
          <fpage>39</fpage>
          {
          <fpage>46</fpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Marco de Gemmis</surname>
            , Pasquale Lops, Cataldo Musto, Narducci Fedelucio, and
            <given-names>Giovanni</given-names>
          </string-name>
          <string-name>
            <surname>Semeraro</surname>
          </string-name>
          .
          <article-title>Semantics-aware content-based recommender systems</article-title>
          . In Francesco Ricci, Lior Rokach, and Bracha Shapira, editors,
          <source>Recommender Systems Handbook</source>
          , pages
          <volume>119</volume>
          {
          <fpage>159</fpage>
          . Springer, 2nd edition,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Gerben</given-names>
            <surname>Klaas Dirk de Vries</surname>
          </string-name>
          and Steven de Rooij.
          <article-title>Substructure counting graph kernels for machine learning from rdf data</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <volume>35</volume>
          :
          <fpage>71</fpage>
          {
          <fpage>84</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Scott</given-names>
            <surname>Deerwester</surname>
          </string-name>
          , Susan T. Dumais, George W. Furnas,
          <string-name>
            <surname>Thomas</surname>
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Landauer</surname>
          </string-name>
          , and Richard Harshman.
          <article-title>Indexing by latent semantic analysis</article-title>
          .
          <source>JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE</source>
          ,
          <volume>41</volume>
          (
          <issue>6</issue>
          ):
          <volume>391</volume>
          {
          <fpage>407</fpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Tomeo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          .
          <article-title>An analysis of users' propensity toward diversity in recommendations</article-title>
          .
          <source>In ACM RecSys '14</source>
          , RecSys '
          <volume>14</volume>
          , pages
          <fpage>285</fpage>
          {
          <fpage>288</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Tomeo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          . Sprank:
          <article-title>Semantic path-based ranking for top-n recommendations using linked open data</article-title>
          .
          <source>ACM Transactions on Intelligent Systems and Technology (TIST)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Tommaso</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Noia</surname>
          </string-name>
          , Roberto Mirizzi, Vito Claudio Ostuni, and
          <string-name>
            <given-names>Davide</given-names>
            <surname>Romito</surname>
          </string-name>
          .
          <article-title>Exploiting the web of data in model-based recommender systems</article-title>
          .
          <source>In Proceedings of the Sixth ACM Conference on Recommender Systems, RecSys '12</source>
          , pages
          <fpage>253</fpage>
          {
          <fpage>256</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Tommaso</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Noia</surname>
          </string-name>
          , Roberto Mirizzi, Vito Claudio Ostuni, Davide Romito, and
          <string-name>
            <given-names>Markus</given-names>
            <surname>Zanker</surname>
          </string-name>
          .
          <article-title>Linked open data to support content-based recommender systems</article-title>
          .
          <source>In Proceedings of the 8th International Conference on Semantic Systems, I-SEMANTICS '12</source>
          , pages
          <issue>1</issue>
          {
          <fpage>8</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Mouzhi</surname>
            <given-names>Ge</given-names>
          </string-name>
          ,
          <article-title>Carla Delgado-battenfeld, and Dietmar Jannach. Beyond accuracy: evaluating recommender systems by coverage and serendipity</article-title>
          . In In RecSys '
          <volume>10</volume>
          ,
          <string-name>
            <surname>page</surname>
            <given-names>257</given-names>
          </string-name>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Mihajlo</surname>
            <given-names>Grbovic</given-names>
          </string-name>
          , Vladan Radosavljevic, Nemanja Djuric, Narayan Bhamidipati, Jaikit Savla, Varun Bhagwan, and
          <string-name>
            <given-names>Doug</given-names>
            <surname>Sharp</surname>
          </string-name>
          .
          <article-title>E-commerce in your inbox: Product recommendations at scale</article-title>
          .
          <source>In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '15</source>
          , pages
          <year>1809</year>
          {
          <year>1818</year>
          , New York, NY, USA,
          <year>2015</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Z. S.</given-names>
            <surname>Harris</surname>
          </string-name>
          .
          <source>Mathematical Structures of Language</source>
          . Wiley, New York, NY, USA,
          <year>1968</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Oktie</given-names>
            <surname>Hassanzadeh</surname>
          </string-name>
          and
          <string-name>
            <given-names>Mariano</given-names>
            <surname>Consens</surname>
          </string-name>
          . M.:
          <article-title>Linked movie data base</article-title>
          .
          <source>In In: Workshop on Linked Data on the Web</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Yehuda</surname>
            <given-names>Koren</given-names>
          </string-name>
          , Robert Bell, and
          <string-name>
            <given-names>Chris</given-names>
            <surname>Volinsky</surname>
          </string-name>
          .
          <article-title>Matrix factorization techniques for recommender systems</article-title>
          .
          <source>Computer</source>
          ,
          <volume>42</volume>
          (
          <issue>8</issue>
          ):
          <volume>30</volume>
          {
          <fpage>37</fpage>
          ,
          <year>August 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Jens</surname>
            <given-names>Lehmann</given-names>
          </string-name>
          , Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas,
          <string-name>
            <given-names>Pablo N.</given-names>
            <surname>Mendes</surname>
          </string-name>
          , Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef,
          <string-name>
            <surname>SA</surname>
          </string-name>
          ~ u}ren Auer, and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          .
          <article-title>DBpedia { A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia</article-title>
          .
          <source>Semantic Web Journal</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Tomas</surname>
            <given-names>Mikolov</given-names>
          </string-name>
          , Kai Chen, Greg Corrado, and
          <article-title>Je rey Dean. E cient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301.3781</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Tomas</surname>
            <given-names>Mikolov</given-names>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado, and
          <string-name>
            <given-names>Je</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <volume>3111</volume>
          {
          <fpage>3119</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Cataldo</surname>
            <given-names>Musto</given-names>
          </string-name>
          , Giovanni Semeraro, Marco De Gemmis, and
          <string-name>
            <given-names>Pasquale</given-names>
            <surname>Lops</surname>
          </string-name>
          .
          <article-title>Word embedding techniques for content-based recommender systems: an empirical evaluation. In RecSys Posters, ser</article-title>
          . CEUR Workshop Proceedings, P. Castells, Ed, volume
          <volume>1441</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Cataldo</surname>
            <given-names>Musto</given-names>
          </string-name>
          , Giovanni Semeraro, Pasquale Lops, and Marco de Gemmis.
          <article-title>Random Indexing and Negative User Preferences for Enhancing Content-Based Recommender Systems</article-title>
          , pages
          <fpage>270</fpage>
          {
          <fpage>281</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Cataldo</surname>
            <given-names>Musto</given-names>
          </string-name>
          , Giovanni Semeraro, Pasquale Lops, and Marco de Gemmis.
          <source>Contextual eVSM: A Content-Based Context-Aware Recommendation Framework Based on Distributional Semantics</source>
          , pages
          <volume>125</volume>
          {
          <fpage>136</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Xia</given-names>
            <surname>Ning</surname>
          </string-name>
          and
          <string-name>
            <given-names>George</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <article-title>SLIM: sparse linear methods for top-n recommender systems</article-title>
          .
          <source>In 11th IEEE International Conference on Data Mining, ICDM</source>
          <year>2011</year>
          , Vancouver, BC, Canada,
          <source>December 11-14</source>
          ,
          <year>2011</year>
          , pages
          <fpage>497</fpage>
          {
          <fpage>506</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>Xia</given-names>
            <surname>Ning</surname>
          </string-name>
          and
          <string-name>
            <given-names>George</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <article-title>Sparse linear methods with side information for top-n recommendations</article-title>
          .
          <source>In Proceedings of the Sixth ACM Conference on Recommender Systems, RecSys '12</source>
          , pages
          <fpage>155</fpage>
          {
          <fpage>162</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>Tommaso</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Noia</surname>
          </string-name>
          , Vito Claudio Ostuni, Jessica Rosati, Paolo Tomeo, Eugenio Di Sciascio, Roberto Mirizzi, and
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Bartolini</surname>
          </string-name>
          .
          <article-title>Building a relatedness graph from linked open data: A case study in the it domain</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>44</volume>
          :
          <fpage>354</fpage>
          {
          <fpage>366</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Mirizzi</surname>
          </string-name>
          .
          <article-title>Top-n recommendations from implicit feedback leveraging linked open data</article-title>
          .
          <source>In ACM RecSys '13</source>
          , pages
          <fpage>85</fpage>
          {
          <fpage>92</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Makbule</surname>
            <given-names>Gulcin</given-names>
          </string-name>
          <string-name>
            <surname>Ozsoy</surname>
          </string-name>
          .
          <article-title>From word embeddings to item recommendation</article-title>
          .
          <source>arXiv preprint arXiv:1601.01356</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          .
          <article-title>Knowledge graph re nement: A survey of approaches and evaluation methods</article-title>
          .
          <source>Semantic Web</source>
          , (Preprint):
          <volume>1</volume>
          {
          <fpage>20</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          and
          <article-title>Johannes Fumkranz. Unsupervised generation of data mining features from linked open data</article-title>
          .
          <source>In Proceedings of the 2nd international conference on web intelligence, mining and semantics, page 31. ACM</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Heiko</surname>
            <given-names>Paulheim</given-names>
          </string-name>
          , Petar Ristoski, Evgeny Mitichkin, and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          .
          <article-title>Data mining with background knowledge from the web</article-title>
          .
          <source>RapidMiner World</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Bryan</surname>
            <given-names>Perozzi</given-names>
          </string-name>
          , Rami Al-Rfou, and
          <string-name>
            <given-names>Steven</given-names>
            <surname>Skiena</surname>
          </string-name>
          . Deepwalk:
          <article-title>Online learning of social representations</article-title>
          .
          <source>In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          , pages
          <volume>701</volume>
          {
          <fpage>710</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <article-title>Ste en Rendle. Factorization machines with libfm</article-title>
          .
          <source>ACM Trans. Intell. Syst. Technol.</source>
          ,
          <volume>3</volume>
          (
          <issue>3</issue>
          ):
          <volume>57</volume>
          :1{
          <fpage>57</fpage>
          :
          <fpage>22</fpage>
          , May
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <surname>Ste</surname>
            <given-names>en Rendle</given-names>
          </string-name>
          , Christoph Freudenthaler, Zeno Gantner, and
          <string-name>
            <surname>Lars</surname>
          </string-name>
          Schmidt-Thieme.
          <article-title>Bpr: Bayesian personalized ranking from implicit feedback</article-title>
          .
          <source>In Proceedings of the Twenty-Fifth Conference on Uncertainty in Arti cial Intelligence</source>
          ,
          <source>UAI '09</source>
          , pages
          <fpage>452</fpage>
          {
          <fpage>461</fpage>
          ,
          <string-name>
            <surname>Arlington</surname>
          </string-name>
          , Virginia, United States,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <surname>Petar</surname>
            <given-names>Ristoski</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          .
          <article-title>Mining the web of linked data with rapidminer</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <volume>35</volume>
          :
          <fpage>142</fpage>
          {
          <fpage>151</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <surname>Petar</surname>
            <given-names>Ristoski</given-names>
          </string-name>
          ,
          <article-title>Eneldo Loza Menc a, and Heiko Paulheim. A hybrid multi-strategy recommender system using linked open data</article-title>
          .
          <source>In Semantic Web Evaluation Challenge</source>
          , pages
          <volume>150</volume>
          {
          <fpage>156</fpage>
          . Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>Petar</given-names>
            <surname>Ristoski</surname>
          </string-name>
          and
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          . Rdf2vec:
          <article-title>Rdf graph embeddings for data mining</article-title>
          . In International Semantic Web Conference (To Appear). Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>Magnus</given-names>
            <surname>Sahlgren</surname>
          </string-name>
          .
          <article-title>An introduction to random indexing</article-title>
          .
          <source>In In Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering</source>
          ,
          <string-name>
            <surname>TKE</surname>
          </string-name>
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <surname>Max</surname>
            <given-names>Schmachtenberg</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Heiko</given-names>
            <surname>Paulheim</surname>
          </string-name>
          .
          <article-title>Adoption of the linked data best practices in di erent topical domains</article-title>
          .
          <source>In International Semantic Web Conference</source>
          , pages
          <volume>245</volume>
          {
          <fpage>260</fpage>
          . Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <surname>Giovanni</surname>
            <given-names>Semeraro</given-names>
          </string-name>
          , Pasquale Lops, Pierpaolo Basile, and Marco de Gemmis.
          <article-title>Knowledge infusion into content-based recommender systems</article-title>
          .
          <source>In Proceedings of the Third ACM Conference on Recommender Systems, RecSys '09</source>
          , pages
          <fpage>301</fpage>
          {
          <fpage>304</fpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>Harald</given-names>
            <surname>Steck</surname>
          </string-name>
          .
          <article-title>Evaluation of recommendations: Rating-prediction and ranking</article-title>
          .
          <source>In Proceedings of the 7th ACM Conference on Recommender Systems, RecSys '13</source>
          , pages
          <fpage>213</fpage>
          {
          <fpage>220</fpage>
          , New York, NY, USA,
          <year>2013</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>Denny</given-names>
            <surname>Vrandecic</surname>
          </string-name>
          and
          <article-title>Markus Krotzsch. Wikidata: a free collaborative knowledgebase</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>57</volume>
          (
          <issue>10</issue>
          ):
          <volume>78</volume>
          {
          <fpage>85</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <surname>Markus</surname>
            <given-names>Weimer</given-names>
          </string-name>
          , Alexandros Karatzoglou, and
          <string-name>
            <given-names>Alex</given-names>
            <surname>Smola</surname>
          </string-name>
          .
          <article-title>Improving maximum margin matrix factorization</article-title>
          .
          <source>Mach</source>
          . Learn.,
          <volume>72</volume>
          (
          <issue>3</issue>
          ):
          <volume>263</volume>
          {
          <fpage>276</fpage>
          ,
          <year>September 2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>Mi</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <string-name>
            <given-names>Neil</given-names>
            <surname>Hurley</surname>
          </string-name>
          .
          <article-title>Avoiding monotony: Improving the diversity of recommendation lists</article-title>
          .
          <source>In Proceedings of the 2008 ACM Conference on Recommender Systems, RecSys '08</source>
          , pages
          <fpage>123</fpage>
          {
          <fpage>130</fpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>