<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Range</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Representing Items as Word-Embedding Vectors and Generating Recommendations by Measuring their Linear Independence</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ludovico Boratto</string-name>
          <email>ludovico.boratto@acm.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Salvatore Carta</string-name>
          <email>salvatore@unica.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gianni Fenu</string-name>
          <email>fenu@unica.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Saia</string-name>
          <email>roberto.saia@unica.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CCS Concepts</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Semantic Analysis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Word Embeddings</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Algorithms</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Metrics.</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dipartimento di Matematica e Informatica, Università di Cagliari Via Ospedale 72</institution>
          ,
          <addr-line>09124 Cagliari -</addr-line>
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <volume>21</volume>
      <issue>40</issue>
      <abstract>
        <p>In order to generate e ective results, it is essential for a recommender system to model the information about the user interests in a pro le. Even though word embeddings (i.e., vector representations of textual descriptions) have proven to be e ective in many contexts, a content-based recommendation approach that employs them is still less e ective than collaborative strategies (e.g., SVD). In order to overcome this issue, we introduce a novel criterion to evaluate the word-embedding representation of the items a user rated. The proposed approach de nes a vector space in which the similarity between an unevaluated item and those in a user pro le is measured in terms of linear independence. Experiments show its e ectiveness to perform a better ranking of the items, w.r.t. collaborative ltering, both when compared to a latent-factor-based approach (SVD) and to a classic neighborhood user-based system. This work is partially funded by Regione Sardegna under project NOMAD (Next generation Open Mobile Apps Development), through PIA - Pacchetti Integrati di Agevolazione \Industria Artigianato e Servizi" (annualita 2013).</p>
      </abstract>
      <kwd-group>
        <kwd>Information systems ! Data mining</kwd>
        <kwd>Recommender systems</kwd>
        <kwd>Learning to rank</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Recommender systems are essential for e-commerce
companies, to lter the huge amounts of items they can provide
and improve the quality and e ciency of the sales
criteria [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In order to perform this task, these systems need to
de ne a set of pro les that model the preferences of their
customers. In this context, the collaborative techniques,
which represent a user with the ratings given to the items
she evaluated, are usually the most e ective. Even though
semantic technologies are moving at a very rapid pace and
state-of-the-art solutions, such as deep learning algorithms
able to extract word embeddings [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] from a text corpus, have
been successfully employed in numerous information
ltering and retrieval tasks, at the moment collaborative ltering
approaches continue to be more accurate at generating
recommendations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The extraction of the word embeddings
from a corpus is made possible thanks to several
state-ofthe-art tools, which create a vector representation of each
word (Google's word2vec1) or document (a word2vec
extension, usually known as doc2vec).
      </p>
      <p>A user pro le represented by a unique vector of features
allows a system to perform quick comparisons, e.g., in a
content-based system, it can be easily compared to the
vector that represents an item with a simple metric, like the
cosine similarity. The idea behind this paper is to represent a
user pro le as a matrix of word-embeddings, where each row
is represented by an item a user positively evaluated, and to
de ne a metric able to evaluate the correlation between an
item not rated by a user and those in her matrix-based user
pro le, in terms of linear independence.</p>
      <p>We observe that if the vector representation of an
unevaluated item is linearly dependent to the items in a user pro le
that have been positively evaluated, their features match,
thus it is similar to the user preferences. This leads to a
higher accuracy of a recommender system w.r.t.
collaborative approaches.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>APPROACH</title>
      <p>Here, we present the steps our approach performs:
Item Vectorization. Given a set I of items, we rst
de ne and train a model by using the doc2vec neural net
(by using as source the textual description of the items in
I). The result is the vector representation of the items in
I, where the cardinality of each vector (item) depends on
the number L of features used in the doc2vec vectorization
process. Given a user, the output of this step will be a
matrix that contains the embeddings of the items positively
evaluated by her (denoted as Iu), plus an empty row to
employ during the ltering process to evaluate the items
not yet evaluated by the user.</p>
      <p>Linear Independence Rate. To evaluate the similarity
between the items in the matrix-based user pro le and an
1http://deeplearning4j.org/word2vec
unevaluated one, we de ne the Linear Independence Rate
(LIR) coe cient. It is the average of the determinants of
all square sub-matrices, de ned by decomposing the user
pro le matrix in square sub-matrices of size jIuj jIuj (with
jIuj L, otherwise we have only a matrix of size L L). We
calculate the LIR value by moving on the entire user
prole matrix, extracting the determinant of each sub-matrix,
without overlaps. We can note that the maximum size of the
square sub-matrices is the cardinality of the vectors, i.e., the
L parameter used to build the doc2vec model. Through this
compositional process, we evaluate the LIR of an item by
placing its vector representation as last element of the user
pro le. We consider closer to the preferences of a user the
items with a LIR value as close as possible to zero.</p>
      <p>Ranking Algorithm. The Algorithm 1 takes as input
the items I, a user u, the items Iu she evaluated, and the
number of features L in the vectors created by doc2vec (i.e.,
the layerSize parameter). It returns as output a list R of the
items not evaluated by the user u (ranked by LIR value).
Algorithm 1 Items evaluation and ranking
Input: I=Set of items, u=User, Iu=Items of u, L=layerSize
Output: R = List of ranked items
1: procedure GetRankedItems(I,u,Iu,L)
2: if jIuj &gt; L then Iu=GetLastLItems(Iu; L)
3: end if
4: V =Doc2VecVectorization(I)
5: M=De neUserPro leMatrix(V ,Iu)
6: M=AddEmptyVectorAsLastRow(M);
7: for each i in I do
8: if i NOT IN Iu then
9: v=GetItemVector(V; i)
10: M=FillLastMatrixRow(M; v)
11: LIR=CalculateLIR(M)
12: R (i; LIR)
13: end if
14: end for
15: Return SortItemsByDescLIR(jRj)
16: end procedure</p>
      <p>It should be noted that our approach is scalable by
employing distributed computing models (e.g., MapReduce).
Indeed, the computation of the LIR metric for each
unevaluated item can be distributed over di erent machines.</p>
    </sec>
    <sec id="sec-3">
      <title>3. EVALUATION</title>
      <p>We compared our approach with two collaborative
ltering approaches: CF , which is based on a classic
neighborhood model, and SV D, based on the latent factor model.
The experiments have been performed by using a dataset
that represents a standard benchmark for recommender
systems, i.e., Movielens 1M, composed by 6,040 users, 3,900
items, and 1,000,209 ratings.</p>
      <p>The criterion adopted for obtaining the training and the
test sets was the K-fold cross validation with K = 3, and
the independent-samples two-tailed Student's t-tests. The
adopted metric is the Mean Reciprocal Rank (M RR), a
statistical measure able to evaluate the ranking generated for
a set of elements that belong to a certain domain.</p>
      <p>The rst experiment evaluates the metric capability to
measure the similarity between a user pro le and an item,
in terms of linear independence (Figure 1: Top). On the
basis of our approach, we rank all the items in decreasing
order, by verifying that almost all of those in the test set
have been placed in the top positions (i.e., 1 20), proving
the LIR capability to e ectively rank the items.
R0:15
R0:10
M0:05
62.02 %</p>
      <p>26.68 %
01.63 %
09.67 %</p>
      <p>CF
SVD
LIR
(75)</p>
      <p>(150) (225)
T ested items ( 1000)
(300)</p>
      <p>The second experiment evaluates the metric capability to
infer the future choices of the users (Figure 1: Bottom), by
comparing the rank assigned to the items in the test set by
using the LIR metric, with those assigned to the same items
by the other state-of-the-art approaches taken into account.</p>
      <p>The t-tests highlighted a statistical di erence between the
results (p &lt; 0:05).
4.</p>
      <p>DISCUSSION AND CONCLUSIONS
The experimental results show the e ectiveness of the
compositional approach used by our LIR metric, as well
as its capability to overcome the canonical state-of-the-art
metrics, in terms of modeling of the user preferences.
Indeed, we have a strong improvement in the process of rating
of the unevaluated items, and this means that our metric
assigns an higher score (w.r.t. the state-of-the-art approaches
to which we compared) to the items positively evaluated by
a user. The use of the LIR metric can be also extended to
other contexts that do not use word embeddings, e.g., those
based on a canonical term-document matrix.</p>
      <p>Future work. We will consider our metric to evaluate
the items negatively evaluated by the users, in order to
obtain additional information in terms of unpreferred items,
exploiting it to improve the recommendations accuracy, e.g.,
by verifying the preferences collision (i.e., very similar items
rated both positively and negatively by a user).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Che</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <article-title>Learning semantic hierarchies via word embeddings</article-title>
          .
          <source>In ACL (1)</source>
          , pages
          <fpage>1199</fpage>
          {
          <fpage>1209</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Musto</surname>
          </string-name>
          , G. Semeraro, M. de Gemmis, and
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          .
          <article-title>Learning word embeddings from wikipedia for content-based recommender systems</article-title>
          .
          <source>In Advances in Information Retrieval</source>
          , pages
          <volume>729</volume>
          {
          <fpage>734</fpage>
          . Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sivapalan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sadeghian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Rahnama</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Madni</surname>
          </string-name>
          .
          <article-title>Recommender systems in e-commerce</article-title>
          .
          <source>In World Automation Congress (WAC)</source>
          ,
          <year>2014</year>
          , pages
          <fpage>179</fpage>
          {
          <fpage>184</fpage>
          . IEEE,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>