<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Moving From Item Rating to Features Relevance in Top-N Recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vito Walter Anelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommaso Di Noia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eugenio Di Sciascio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pasquale Lops</string-name>
          <email>pasquale.lops@uniba.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joseph Trotta</string-name>
          <email>joseph.trottag@poliba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Polytechnic University of Bari</institution>
          ,
          <addr-line>Via E. Orabona, 4, Bai</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Bari \Aldo Moro"</institution>
          ,
          <addr-line>Via E. Orabona, 4, Bai</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Although very e ective in computing accurate recommendations, due to their inner nature, collaborative algorithms work very well with dense matrices but show their limits when they deal with sparse ones. In these cases, using only past ratings may lead to unsatisfactory results in the recommendation list. In this paper we show how to move from a user-item to a user-feature matrix by exploiting original user ratings. We then use matrix factorization techniques to compute recommendations.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Matrix factorization techniques have proven their e ectiveness in improving the
performance of recommendation engines in a pure collaborative approach and
are implemented in many industrial and commercial systems [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Whenever
available, descriptions of the items can be used as a valuable source of information
to augment the knowledge injected in and exploited by the system to compute
the recommendation list of items. More recently, thanks to the Linking Open
Data initiative, many structured data have become freely available to represent
the content of items in di erent knowledge domains and then feed
recommendation engines [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Several works have tried to build recommender systems by
exploiting Linked Open Data (LOD) as side information for representing items,
in addition to the user preferences usually collected through ratings. Properties
gathered from DBpedia, the cornerstone dataset of the LOD cloud, may be used
in di erent ways: (1) to de ne semantic similarity measures for providing more
accurate recommendations [
        <xref ref-type="bibr" rid="ref4 ref8">8, 4</xref>
        ]; (2) to deal with problems as the limited
content analysis or cold-start, e.g. by introducing new relevant features to improve
item representations [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], or to cope with the increasing data sparsity [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; (3)
to provide a good balance between di erent recommendation objectives, such as
An extended version of this paper has been published at [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>
        IIR 2018, May 28-30, 2018, Rome, Italy. Copyright held by the author(s).
accuracy and diversity [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], for instance, e ective strategies to incorporate
item features for top-N recommender systems are developed. Recently, an
interesting approach called Feature Preferences Matrix Factorization (FPMF) has
been proposed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. FPMF incorporates user feature preferences in a matrix
factorization to predict user likes. It is worth to note that the previously
mentioned approaches does not rely on features coming from the Linked Open Data
cloud. Features composing the description of an item, whatever the source, are
not considered per se in the recommendation process but are usually exploited
to evaluate the similarity between items or users. We believe that more attention
should be paid to modeling the recommendation problem with a focus on
recommending features rather then items. Expanding an item in its features brings
with it some interesting side e ects. On the one hand, all features may represent
relations that, e.g., latent factor models we are not able to look at. On the other
hand, features give us a new set of explicit connections between items to be
exploited with collaborative ltering algorithms. Finally, recommending items
via feature recommendation may lead to an easier generation of explanations for
the recommended list of items. Unfortunately, moving from items to features is
not that straight as in a forest of many features, most of them may result not
relevant to a user. Moreover, once we design an algorithm able to compute a
recommendation list of features, we have to go back to the items space, as the
ultimate goal of a recommender systems is to suggest items to a user. In this
paper we present FF (for Features Factorization), a top-N recommendation
algorithm originally introduced in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] that relies on user's feature preferences and
collaborative ltering information in the features space. The main goal of FF is
to compute an ordered list of features preferred by the user and, starting from
such list, to reassemble the relevance values of each returned feature to produce
a top-N list of items to recommend. All the side information adopted by FF with
reference to a speci c item i is retrieved from DBpedia in form of triples hi; p; ei.
For each item in the user pro le we retrieve its features by querying DBpedia
thus getting them as a set of entities e.
      </p>
      <p>The remainder of the paper is structured as follows. In the next section we
introduce and describe FF. We than close the paper with a section devoted to
Conclusion and future works.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Proposed Approach</title>
      <p>Motivation. This work aims at investigating the role of feature rating and
relevance in the item rating process. The main intuition behind FF is that items
can be handled as a collection of features on which the recommendation process
is then performed. If we want to discover the contribution of each single feature in
the evaluation, rst of all, we need to unpack each item in its composing features.
Then, by combining the overall popularity of each feature in the user pro le
(feature relevance) and the rating assigned to items containing that feature we
may estimate the implicit rating the user is giving to that speci c feature. The
second observation we based our work on, is that the relevance of an item in
the user pro le cannot be entirely encoded in its ratings as the single rating
represents a degree of liking about the speci c item.</p>
      <p>Data Model. Each item in the user pro le is associated with a relevance
function we denote with ui( ). Its value represents an estimation of how important
is a particular item to the user u. Analogously, we have a value associated to
each feature in the pro le computed via the function uf ( ) computing the
relevance of the feature f in the user pro le. Actually, each feature is associated
also with a rating ruf ( ) which is inferred by considering the rating of all the
items containing f .</p>
      <p>Problem Formulation. By considering the data associated to the user pro le
as described in the previous section we can move from a rating matrix connecting
user and items to a user-feature matrix where each value is represented by the
pair h uf ( ); ruf ( )i. In other words, we may consider two user-feature matrices:
the one P containing relevance values uf ( ), the other R including the inferred
ratings ruf ( ).</p>
      <p>In FF, the relevance of a feature pe is computed as its probability of belonging
to the set Iu representing the items already rated by a user u. More formally we
have:
uf (pe) =</p>
      <p>Pi2Iu jfhi; p; ei j hi; p; ei 2 DBpediagj
jIuj</p>
      <p>The idea behind this computation is quite straight: the more a feature is
connected to the items in the user pro le, the higher its relevance for the user.</p>
      <p>Once we have computed the relevance of all the features in the user pro le,
we can move to the computation of the relevance for the items i 2 Iu. This can
be computed as the normalized summation of the relevance for all the features
it is composed by. In formulas, we have
ui(i) =</p>
      <p>hi;p;ei2DBpedia
jfhi; p; ei j hi; p; ei 2 DBpediagj</p>
      <p>P
uf (pe)
Given a feature pe, the computation of the feature rating ruf (pe) exploits both
the rating and the relevance of each item i 2 Iu containing pe.</p>
      <p>ruf (pe) =</p>
      <p>P
hi;p;ei2DBpedia rui
P
hi;p;ei2DBpedia</p>
      <p>ui(i)
ui(i)
(1)
top-N Recommendation. The pro les we built contain only the features
the user met before, but usually the number of those features is dramatically
smaller than the overall number of features and this results in P and R being
very sparse. In order to complete the information they contain, we compute, via
Biased Matrix Factorization, the missing values ^uf (pe) for P and r^uf (pe) for
R. We run matrix factorization independently on P and R. ^uf (pe) and r^uf (pe)
represent the predicted relevance and the predicted rating for all those features
not belonging to any of the items in Iu. As the resulting matrices contain both
content-based and collaborative informations (due to the matrix factorization),
we refer to them as hybrid pro le.</p>
      <p>With the hybrid pro le we can estimate a ranked list for all the remaining
items within the collection. In fact, the ranking of an item in the list is
computed by considering the rating of the features belonging to the item and their
relevance.
r^ui(i) =</p>
      <p>X
(hi;p;ei2DBpedia)^(i2Iu)
uf (pe) ruf (pe)+</p>
      <p>
        X
(hi;p;ei2DBpedia)^(i62Iu)
^uf (pe) r^uf (pe)
It is important to point out that these estimations do not correspond to an
actual rating but the correct item ranking is yet preserved. In order to improve
the results of the nal recommendation process, we may reduce the number of
features considered while computing the nal rank based on their relevance and
popularity [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion and Future Works</title>
      <p>
        In this paper we presented FF, a novel algorithm that bases on feature
recommendation as an intermediate step for computing top-N items recommendation
lists. The main idea behind FF is that feature relevance in a user pro le plays a
key role in the selection and rating of an item in a collection. As future work,
we are investigating the behavior of FF with respect to novelty and diversity of
results. We are also interested in exploring the behavior of FF approach with
di erent collaborative ltering algorithms, other than factorization techniques
in the item-feature space and in particular with Factorization Machines [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Anelli</surname>
            ,
            <given-names>V.W.</given-names>
          </string-name>
          , Di Noia,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Lops</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Di Sciascio</surname>
          </string-name>
          , E.:
          <article-title>Feature factorization for top-n recommendation: From item rating to features relevance</article-title>
          .
          <source>In: Proceedings of the 1st Workshop on Intelligent Recommender Systems by Knowledge Transfer &amp; Learning co-located with ACM Conference on Recommender Systems (RecSys</source>
          <year>2017</year>
          ), Como, Italy,
          <year>August 27</year>
          ,
          <year>2017</year>
          . pp.
          <volume>16</volume>
          {
          <issue>21</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bell</surname>
            ,
            <given-names>R.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koren</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Lessons from the net ix prize challenge</article-title>
          .
          <source>Acm Sigkdd Explorations Newsletter</source>
          <volume>9</volume>
          (
          <issue>2</issue>
          ),
          <volume>75</volume>
          {
          <fpage>79</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Mirizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.C.</given-names>
            ,
            <surname>Romito</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Zanker</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Linked open data to support content-based recommender systems</article-title>
          .
          <source>In: Proceedings of the 8th International Conference on Semantic Systems</source>
          . pp.
          <volume>1</volume>
          {
          <issue>8</issue>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.C.</given-names>
            ,
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Tomeo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Di Sciascio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Mirizzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Bartolini</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Building a relatedness graph from linked open data: A case study in the IT domain</article-title>
          .
          <source>Expert Syst. Appl</source>
          .
          <volume>44</volume>
          ,
          <issue>354</issue>
          {
          <fpage>366</fpage>
          (
          <year>2016</year>
          ). https://doi.org/10.1016/j.eswa.
          <year>2015</year>
          .
          <volume>08</volume>
          .038, https://doi.org/10.1016/j.eswa.
          <year>2015</year>
          .
          <volume>08</volume>
          .038
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Musto</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Basile</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lops</surname>
            , P., de Gemmis,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Semeraro</surname>
          </string-name>
          , G.:
          <article-title>Introducing linked open data in graph-based recommender systems</article-title>
          .
          <source>Information Processing &amp; Management</source>
          <volume>53</volume>
          (
          <issue>2</issue>
          ),
          <volume>405</volume>
          {
          <fpage>435</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Nasery</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Braunhofer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ricci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Recommendations with optimal combination of featurebased and item-based preferences</article-title>
          .
          <source>In: Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization</source>
          , UMAP. pp.
          <volume>269</volume>
          {
          <issue>273</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ning</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karypis</surname>
          </string-name>
          , G.:
          <article-title>Sparse linear methods with side information for top-n recommendations</article-title>
          .
          <source>In: Sixth ACM Conference on Recommender Systems, RecSys</source>
          . pp.
          <volume>155</volume>
          {
          <issue>162</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Piao</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breslin</surname>
            ,
            <given-names>J.G.</given-names>
          </string-name>
          :
          <article-title>Measuring semantic distance for linked open data-enabled recommender systems</article-title>
          .
          <source>In: Proceedings of the 31st Annual ACM Symposium on Applied Computing</source>
          . pp.
          <volume>315</volume>
          {
          <issue>320</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Rendle</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Factorization machines</article-title>
          .
          <source>In: Proceedings of the 2010 IEEE International Conference on Data Mining</source>
          . pp.
          <volume>995</volume>
          {
          <fpage>1000</fpage>
          . ICDM '10, IEEE Computer Society, Washington, DC, USA (
          <year>2010</year>
          ). https://doi.org/10.1109/ICDM.
          <year>2010</year>
          .
          <volume>127</volume>
          , http://dx.doi.org/10.1109/ICDM.
          <year>2010</year>
          .127
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Schmachtenberg</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strufe</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Enhancing a location-based recommendation system by enrichment with structured data from the web</article-title>
          .
          <source>In: 4th International Conference on Web Intelligence, Mining and Semantics WIMS</source>
          . pp.
          <volume>17</volume>
          :
          <issue>1</issue>
          {
          <fpage>17</fpage>
          :
          <fpage>12</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>