<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adaptive Diversity in Recommender Systems?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tommaso Di Noia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vito Claudio Ostuni</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jessica Rosati</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Tomeo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eugenio Di Sciascio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Polytechnic University of Bari</institution>
          ,
          <addr-line>Via Orabona, 4, 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Camerino</institution>
          ,
          <addr-line>Piazza Cavour 19/f, 62032 Camerino (MC)</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Milano-Bicocca</institution>
          ,
          <addr-line>Piazza dell'Ateneo Nuovo, 1, 20126 Milano</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The evaluation of a recommendation engine cannot rely only on the accuracy of provided recommendations. One should consider additional dimensions, such as diversity of provided suggestions, in order to guarantee heterogeneity in the recommendation list. In this paper we analyse users' propensity in selecting diverse items, by taking into account content-based item attributes. Individual propensity to diversication is used to re-rank the list of Top-N items predicted by a recommendation algorithm, with the aim of fostering diversity in the nal ranking. We show experimental results that con rm the validity of our modelling approach.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In the recommender systems eld, most of the approaches have been devoted
to maximizing recommendation accuracy. However, it has been recognized that
improving only the predictive accuracy is not enough to judge the e ectiveness
of a recommender system [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], since the most accurate recommendations for a
user are often too similar to each other and attention has to be paid towards
the goal of improving individual diversity, the degree of diversi cation in the
recommendations provided to an individual user. A number of works propose
strategies to enhance the trade-o between accuracy and diversity [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">9, 8, 10</xref>
        ].
      </p>
      <p>The main intuition behind our work is that some users may prefer diversi
cation in suggestions while others may not and they could be inclined to diversify
with respect to not all item attributes. We propose an adaptive attribute-based
diversi cation approach able to customize the degree of individual diversity of
the Top-N recommendation list, using the Entropy measure to represent the
inclination to diversity of the user over di erent content-based item dimensions.
We apply our approach to the movie domain, considering what reasonably leads
a user to choose a movie in a huge collection of items, that is genre, actor,
director and year of release. However not all these factors have the same in uence
on di erent users: by way of example, a user can decide to cling to a particular
director and accept to watch several genres.</p>
      <p>
        The main contributions of this paper are:
{ a representation of user's propensity in diversifying her choices.
{ an adaptive attribute-based re-ranking approach based on the
aforementioned representation.
? An extended version of this paper has been published in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>Adaptive diversi cation</title>
      <p>
        In the recommendation process, after the ratings prediction for unrated items,
the maximization of user's utility and the improvement of individual diversity in
the items list can be pursued through a re-ranking phase [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. There are several
heuristics which let to re-rank items in an e cient way, such as the MMR greedy
strategy [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. MMR iteratively selects the item which maximizes an objective
function fobj , which in turn can deal with the trade-o between accuracy and
diversity and is de ned as
fobj (i; S) = r (u; i) (1 ) max sim(i; j) (1)
j2S
where S is the previously re-ranked list, r is a function to estimate the rating
of user u for item i, sim a similarity measure on item pairs and the parameter
lets to manage the accuracy-diversity balance.
      </p>
      <p>The diversi cation attitude of each user for each item attribute a 2 A is
measured through Shannon's entropy. For each attribute, users are clustered in
four groups, referred to as quadrants, de ned by the medians of the entropy
and user pro le length distributions across all users. For example a user u is in
the rst quadrant for the genre attribute, if her entropy Hgenre(u) is less than
the median of the entropy computed across all users and she has a short user
pro le (her number of ratings is less than the median of users' ratings). The
same user may belong to di erent quadrants in relation to di erent attributes.
Table 1 provides a representation of quadrants. The main modelling hypothesis
behind this classi cation is that users who have explored items with di erent
characteristics in the past are willing to accept diverse recommendations. Given
an attribute a, we interpret a high value of entropy as an attitude of the user
to choose items with di erent values for a. Conversely, a low value of entropy is
read as her willing to consider items similar for that attribute.</p>
      <p>Entropy
hQuadrant 1 Quadrant 2
tgLow Entropy High Entropy
e Small Pro le Small Pro le
n
L
leQuadrant 3 Quadrant 4
oLow Entropy High Entropy
rPLarge Pro le Large Pro le</p>
      <p>Table 1. Quadrants</p>
      <p>Quadrants are used to de ne the similarity measure in Equation (1). Let us
consider a user u and indicate with A the set of item attributes (for example
in the movie domain A = fyear; genre; direction; starringg). We consider a
function qu : A ! f1; 2; 3; 4g, which assigns, for each attribute, the quadrant to
which user u belongs to and then we de ne a quadrant weight !i 2 [0; 1], with
i 2 f1; 2; 3; 4g. The overall similarity between items i and j in Equation (1), for
user u, is tailored to the quadrants she belongs to and is de ned as:
sim(i; j) =</p>
      <p>Pa2A !qu(a) sima(i; j)
m jAj
(2)
with m = maxf!i j i = 1; 2; 3; 4g and sima(i; j) a similarity measure between i
and j with respect to attribute a. The weights associated to user belonging
quadrants in uence the similarity score and hence the resulting objective function of
MMR, eventually varying the diversity.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Experiments and Results</title>
      <p>
        We carried out experiments on Movielens 1M4 dataset, enriched with further
attribute information (actors and directors) extracted from DBpedia5, as in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
We concentrated on users who gave at least fty ratings. The nal dataset
contains 4297 users, 3689 items and 942590 ratings. Training and test sets were
built with a temporal 60-40% split. We compared our approach with two
baselines: no-MMR, user-based kNN Collaborative Filtering algorithm with Pearson
correlation; MMR, re-ranking with Equation 1 of the top 200 recommendations
generated by no-MMR for each user. Our adaptive approach is denoted as
adaptiveMMR. The parameter in Equation 1 was set to 0:5. As similarity measure
for attribute a in (2), we used the Jaccard index. To reduce the number of distinct
attribute values, we divided movies in decades and performed a K-means
clustering for actors and directors on the basis of their DBpedia categories, obtaining
20 clusters. The number of values is 19 and 8 for genre and year, respectively.
      </p>
      <p>
        We used the TestItems evaluation methodology presented in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], with
Precision (P@k ) and nDCG@k for accuracy, ILD@k for diversity and avg(P,ILD) for
the balance between accuracy and diversity, as in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. P@k is chosen instead of
nDCG@k since they have a similar trend.
      </p>
      <p>Firstly, we tested the validity of the hypothesis that users who have explored
di erent items in the past are inclined to diversity. As shown in Table 2, MMR
dominates the no-MMR for quadrant 2 and 4 for both precision and ILD,
demonstrating that users with high entropy bene t from diversi cation. In the other
quadrants (1 and 3) there is a normal decrease of accuracy. Hence users with low
entropy in their user pro les are not inclined to an uncontrolled diversi cation.</p>
      <p>Later, to test the e ectiveness of adaptiveMMR, we conducted a grid search
on !, nding, as a rst result, that our intuition of choosing small values for
!1 and !3 and bigger ones for !2 and !4 is validated by accuracy and ILD
results. Without such constraints, in fact, the accuracy values of adaptiveMMR get
deeply worse. For lack of space we discuss here only three weights con gurations:
A = h0; 0; 0; 1i, B = h0; 1; 0; 1i, C = h0:1; 1; 0:1; 0:75i. The values of list C were
computed via grid search xing !1 and !3 and varying !2 and !4 with a step of
0:05. These con gurations let us deal with emblematic situations: con guration
4 Available at http://grouplens.org/datasets/movielens
5 http://dbpedia.org</p>
      <p>A acts on users who are in quadrant 4 for some attributes and con guration B on
users belonging to quadrant 2 or 4. Table 3 shows the results with k = 10.
AdaptiveMMR gains the best balance between accuracy and diversity, represented
by avg(P,ILD). In terms of accuracy, adaptiveMMR out-performs no-MMR and
MMR, especially adaptiveMMR-C. Remarkably, the con guration C has an ILD
value close to MMR but a signi cantly better accuracy values.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>Results showed in this paper suggest that the individual tendency to diversity,
represented by entropy, is a factor to take into account in the diversi cation
process and should be considered even for users with a small pro le length.
Acknowledgements The authors acknowledge partial support of VINCENTE (PON02 00563 3470993)
and RES NOVAE (PON04a2 E)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>G.</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kwon</surname>
          </string-name>
          .
          <article-title>Improving aggregate recommendation diversity using ranking-based techniques</article-title>
          .
          <source>IEEE TKDE</source>
          ,
          <volume>24</volume>
          (
          <issue>5</issue>
          ):
          <volume>896</volume>
          {
          <fpage>911</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Bellogin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and I.</given-names>
            <surname>Cantador</surname>
          </string-name>
          .
          <article-title>Precision-oriented evaluation of recommender systems: An algorithmic comparison</article-title>
          .
          <source>In ACM RecSys '11</source>
          , pages
          <fpage>333</fpage>
          {
          <fpage>336</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>K.</given-names>
            <surname>Bradley</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          .
          <article-title>Improving Recommendation Diversity</article-title>
          .
          <source>In Irish Conference in Arti cial Intelligence and Cognitive Science</source>
          , pages
          <volume>75</volume>
          {
          <fpage>84</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Tommaso</given-names>
            <surname>Di</surname>
          </string-name>
          <string-name>
            <surname>Noia</surname>
          </string-name>
          , Vito Claudio Ostuni, Jessica Rosati, Paolo Tomeo, and Eugenio Di Sciascio.
          <article-title>An analysis of users' propensity toward diversity in recommendations</article-title>
          .
          <source>In Proceedings of the 8th ACM Conference on Recommender Systems, RecSys '14</source>
          , pages
          <fpage>285</fpage>
          {
          <fpage>288</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Mirizzi</surname>
          </string-name>
          .
          <article-title>Top-n recommendations from implicit feedback leveraging linked open data</article-title>
          .
          <source>In ACM RecSys '13</source>
          , pages
          <fpage>85</fpage>
          {
          <fpage>92</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>U.</given-names>
            <surname>Panniello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gorgoglione</surname>
          </string-name>
          .
          <article-title>Comparing context-aware recommender systems in terms of accuracy and diversity</article-title>
          .
          <source>User Modeling and UserAdapted Interaction</source>
          ,
          <volume>24</volume>
          (
          <issue>1-2</issue>
          ):
          <volume>35</volume>
          {
          <fpage>65</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Exploiting the diversity of user preferences for recommendation</article-title>
          .
          <source>In OAIR '13</source>
          , pages
          <fpage>129</fpage>
          {
          <fpage>136</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <article-title>Enhancing diversity in top-n recommendation</article-title>
          .
          <source>In ACM RecSys '09</source>
          , pages
          <fpage>397</fpage>
          {
          <fpage>400</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Hurley</surname>
          </string-name>
          .
          <article-title>Avoiding monotony: Improving the diversity of recommendation lists</article-title>
          .
          <source>In ACM RecSys '08</source>
          , pages
          <fpage>123</fpage>
          {
          <fpage>130</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>C. Ziegler</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          <string-name>
            <surname>Konstan</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Lausen</surname>
          </string-name>
          .
          <article-title>Improving recommendation lists through topic diversi cation</article-title>
          .
          <source>In WWW '05</source>
          , pages
          <fpage>22</fpage>
          {
          <fpage>32</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>