<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Exploiting Regression Trees as User Models for Intent-Aware Multi-attribute Diversity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Paolo Tomeo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tommaso Di Noia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco de Gemmis</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pasquale Lops</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Semeraro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eugenio Di Sciascio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Polytechnic University of Bari - Via Orabona</institution>
          ,
          <addr-line>4 - 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Bari Aldo Moro - Via Orabona</institution>
          ,
          <addr-line>4 - 70125 Bari</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <volume>20</volume>
      <issue>2015</issue>
      <abstract>
        <p>Diversity in a recommendation list has been recognized as one of the key factors to increase user's satisfaction when interacting with a recommender system. Analogously to the modelling and exploitation of query intent in Information Retrieval adopted to improve diversity in search results, in this paper we focus on eliciting and using the pro le of a user which is in turn exploited to represent her intents. The model is based on regression trees and is used to improve personalized diversi cation of the recommendation list in a multi-attribute setting. We tested the proposed approach and showed its e ectiveness in two di erent domains, i.e. books and movies.</p>
      </abstract>
      <kwd-group>
        <kwd>Personalized diversity</kwd>
        <kwd>Intent-aware diversi cation</kwd>
        <kwd>Regression Trees</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        In the recent years, diversi cation has gained more and
more importance in the eld of recommender systems.
Engines able to get excellent results in terms of accuracy of
results have been proved to be not e ective when we
consider other factors related to the quality of user experience
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. As a matter of fact, when interacting with a system
exposing a recommendation service, the user perceives as
good suggestions those showing also an appropriate degree
of diversity, novelty or serendipity, just to cite a few. The
attitude of populating the recommendation list with
similar items could exacerbate the over-specialization problem
that content-based recommender systems tend to su er from
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], even though it appears also in collaborative- ltering
approaches. Improving diversity is generally a good choice to
foster the user satisfaction as it increases the odds of nding
relevant recommendations [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Here our focus is on both the individual (or intra-list )
diversity, namely the degree of dissimilarity among all items
in the list provided to a user, and the aggregate diversity
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], namely the number and distribution of distinct items
recommended across all users. The item-to-item
dissimilarity can be evaluated by using content-based attributes
(e.g. genre in movie and music domains, product category
in e-commerce) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] or statistical information (e.g. number
of co-ratings) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. Usually, approaches to the diversi
cation take into account only one single attribute while, in the
approach we present here, multiple attributes are selected
to describe the items. The rationale behind this choice is
that we believe there are numerous and heterogeneous item
dimensions conditioning user's interests and choices.
Moreover, depending on the user these dimensions may interact
with each other thus contributing to the creation of her
intents. The question is how to tackle multiple attributes to
address the diversi cation problem.
      </p>
      <p>In this paper we use regression trees as user modeling
technique to infer the individual interests, useful to provide an
intent-aware diversi cation. Compared to approaches where
item attributes are treated independently one to each other,
regression trees make possible to represent user tastes as a
combination of interrelated characteristics. For instance, a
user could have a preference for horror movies of the 80s
irrespective of the director, or for horror movies of the 90s
directed by a a speci c director. In a regression tree,
conditional probability lets to build such inference rules about
user's preferences. We conducted experiments on the movie
and on the book domains to empirically evaluate our
approach. The performance was measured in terms of accuracy
and both individual and aggregate diversity.</p>
      <p>The main contributions of this paper are:
a novel intent-aware diversi cation approach able to
combine multiple attributes. It bases on the use of
regression trees (and rules) to infer and encode the
model of users' interests;
a novel method to combine di erent diversi cation
approaches;
an experimental evaluation which shows the
performance of the proposed approaches with respect to both
accuracy and diversity measures.</p>
      <p>The paper is organized as follows. Section 2 describes the
greedy approach to diversi cation problem, the xQuAD
algorithm and some evaluation metrics. We then continue in
Section 3 by showing how to face the multi-attribute
diversi cation and how to leverage regression trees in the
diversication process with xQuAD to provide more personalized
recommendations. Section 4 describes the experimental
conguration and the datasets used for the experiments while
Section 5 presents and describes the experimental results,
showing the competitive performance of the proposed
approach. In Section 6 we review the related work at the best
of our knowledge. Conclusions close the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>DIVERSITY IN RECOMMENDATIONS</title>
      <p>
        The recommendation step can be followed by a re-ranking
phase nalized to improve other qualities besides accuracy
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Some of re-ranking approaches proposed so far are
based on greedy algorithms designed to handle the balance
between accuracy and diversity in a recommendations list
[
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. Their scheme of work is explained through Algorithm
1, where P = h1; :::; ni is the recommendation list for user u
generated using the predicted ratings and the output is the
re-ranked list S of recommendations, such that S P and
whose length is N n.
      </p>
      <p>Data: The original recommendation list P, N
Result: The re-ranked recommendation list S
n
3
4
5
6 end
7 return S.
1 S = hi;
2 while j S j N do
i = argmax fobj(i; S);</p>
      <p>i2PnS
S = S i ;
P = P n fi g</p>
      <sec id="sec-2-1">
        <title>Algorithm 1: The greedy strategy</title>
        <p>At each iteration, the algorithm selects the item
maximizing the objective function fobj (line 3) { which in turn can
be de ned to deal with the trade-o between accuracy and
diversity { and then adds it to the re-ranked list (line 4).</p>
        <p>
          For our purpose, we focus on the intent-aware approach
xQuAD (eXplicit Query Aspect Diversi cation), with the
aim to diversify the user intents. It was proposed for search
diversi cation in information retrieval by Santos et al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ],
as a probabilistic framework to explicitly model an
ambiguous query as a set of sub-queries that will cover the
potential aspects of the initial query. Then it was adapted for
recommendation diversi cation by Vargas and Castells [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ],
replacing query and relative aspects with user and items
categories, respectively. Hereafter we refer to generic item
features - such as categories - as features, considering the
features as possible instances of a generic attribute.
        </p>
        <p>
          More formally, xQuAD greedily selects diverse
recommendations maximizing the following objective function:
fobj(i; S; u) =
r (u; i) + (1
)div(i; S; u)
(1)
with r (u; i) being the score predicted by the baseline
recommender; the parameter allowing to manage the
accuracydiversity balance, where higher values give more weight to
accuracy, while lower values give more weight to diversity.
The last component in Equation 1 promotes the diversity,
providing a measure of novelty with respect to the items
already selected in S. As for the function div(i; S; u), the
original formulation in [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] is:
divorig(i; S; u) =
        </p>
        <p>X p(ijf )p(f ju) Y(1
p(sjf ))
(2)
f
s2S
where p(ijf ) represents the likelihood of item i being chosen
given the feature f while p(fju) represents the user interest
in the feature.</p>
        <p>
          A number of measures have been proposed to evaluate the
diversity in a recommendation list. Smyth and McClave [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]
proposed the ILD (Intra-List Diversity), that computes the
average distance between each couple of items in the list L:
ILD(L) =
        </p>
        <p>1
jLj (jLj
1)</p>
        <p>
          X
i;j2L;i6=j
(1
sim(i; j))
(3)
The sim function is a con gurable and application-dependent
component which can use content-based item features or
statistical information (e.g. number of co-ratings) to compute
the similarity between items. We used also the metric
nDCG, that is the redundancy-aware variant of Normalized
Discounted Cumulative Gain proposed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. We adopt the
adapted version for recommendation proposed in [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]:
-nDCG(L; u) =
        </p>
        <p>
          1
-iDCG
jLj P
X
r=1
f2F (Lr)(1
log2(1 + r)
)cov(L;f;r 1)
(4)
where cov(L; f; r 1) is the number of items ranked up to
position r 1 containing the feature f . F (Lr) represents
the set of features of the r-th item. The parameter is used
to balance the emphasis between relevance and diversity.
iDCG denotes the value of -nDCG for the best \ideally"
diversi ed list. Considering that the computation of the
ideal value is NP-complete [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], we adopt a greedy approach:
at each step we select solely the item with the highest value,
regardless of the next steps.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. INTENT-AWARE MULTI-ATTRIBUTE</title>
    </sec>
    <sec id="sec-4">
      <title>DIVERSITY</title>
      <p>In this section we show how we address the intent-aware
diversity problem when dealing with multi-attribute item
descriptions. The presentation relies on content-based
attributes (e.g. genres, years, etc. in the movies domain),
but the proposed approach can be used independently of
the attributes types. Therefore, one could also use
statistical information as item attributes, e.g., popularity or rating
variance. As explained in the previous section, we refer to
features as possible instances of a generic attribute. We
tried di erent reformulations of the div function in xQuAD
(Equation 2) to deal with multi-attribute values. After an
empirical evaluation, we chose the best divma (for
multiattribute) in terms of accuracy-diversity balance:
divma(i; S; u) = X
where:</p>
      <p>A2A</p>
      <p>Pf2dom(A) p(ijf )p(f ju)(1</p>
      <p>Pf2dom(A) p(f ju)
avgj2S p(jjf ))
(5)
A is the set of attributes;
for each attribute A 2 A and each feature in the
attribute domain f 2 dom(A), p(ijf ) represents the
importance of f for the item i. It is computed as a binary
function that returns 1 if the item contains f , 0
otherwise;
p(f ju) represents the importance of the feature f for
the user u and is computed as the relative frequency
of the feature f on the rated items from the user u.
Here after we will refer to xQuAD using Equation 5 as basic
xQuAD.</p>
      <p>DivRT. p(jjm) is the average similarity between m
and each rule covered by item j. More formally:
Besides dealing with multi-attribute descriptions, the idea
behind our approach is to infer and model the user pro le
by means of a regression tree, a predictive model where the
user interest represents the target variable, which can take
continuous values. Once a regression tree is produced for a
user u, then it is converted into a set of rules RT (u). Each
rule maps the presence/absence of a categorical feature or a
constraint on a numerical one to a value v in a continuous
interval. This latter indicates the predicted interest of the
user on the items satisfying the rule. In our implementation
we used the interval [1; 5] since the value of the target
variable has been calculated as the rating mean of the training
instances classi ed by the inferred rule. Please note that the
choice of a speci c value interval for the target variable does
not a ect the overall approach. Each rule m has then the
form</p>
      <p>body(m) 7! interest = v
with body(m) = fc1; : : : ; cng. An example of a set of rules
produced for a user is shown in Figure 1.</p>
      <p>1. fhorror 2 dom(genres); western 2= dom(genres);</p>
      <p>DarioArgento 2 dom(directors)g 7! interest = 4:2
2. fhorror 2= dom(genres); thriller 2 dom(genres)g
7! interest = 2:1
3. fyear &gt; 1990; horror 2= dom(genres);
drama 2 dom(genres); Aronofsky 2 dom(directors)g
7! interest = 4:0
4. fyear &lt; 1990; drama 2 dom(genres);</p>
      <p>AlP acino 2 dom(actors)g 7! interest = 3:9
5. fhorror 2= dom(genres)g 7! interest = 3:2</p>
      <p>Eventually, under the assumption that they represent
speci c user interests, the computed rules are used in the
reranking phase as item features to improve the intent-aware
recommendation diversity.</p>
      <p>We propose also a div function for xQuAD so that each
item is evaluated according to the rules it satis es.
divrules(i; S; u) =
Here M (u; i) represents the set of rules for the user u matched
by the item i while p(mju) represents the importance of the
rule m for u and is computed as:
p(mju) =
interestm
jM (u; i)j
In Equation 7, interestm is the normalized predicted
outcome of the regression tree for the rule m. Finally, the last
component in Equation 6 indicates the complement of the
coverage of the rule among the already selected
recommendations. We propose two di erent versions of this adapted
xQuAD.</p>
      <p>RT. p(jjm) is a binary function that returns 1 if the
item j matches the rule, 0 otherwise.
(7)
p(jjm) = avgm02M(u;j) sim(m; m0)
(8)
The rationale behind this formulation is that some
rules may be similar with each other thus not
bringing any actual diversi cation if considered separately.
The computation of sim(m; m0) takes into account the
overlapping between the rules m and m0 as follows:
sim(m; m0) =</p>
      <p>Pci2body(m) overlap(m; m0; ci)</p>
      <p>max(jbody(m)j; jbody(m0)j)
For instance, considering the attributes represented in
Figure 1, we have for actor; genre and director:
overlap(m; m0; ci) =
8 1, ci 2 body(m) ^ ci 2 body(m0)
&lt;
: 0, otherwise
For the numerical attribute year we may adopt a
different formulation for the function overlap(m; m0; ci).
Here we compute, if any, the overlap between the
interval in body(m) and the one in body(m0) normalized
with respect to maximum interval's length. As an
example, if year &gt; 1990 is in body(m) and year &lt; 2010
is in body(m0) we may de ne the overlapping function
as overlap(m; m0; ci) = max(dom(yje1a9r9)0) 2m0i1n0(jdom(year)) .
The functions introduced above have been used in the
experimental setting in order to compute the function
overlap(m; m0; ci) (see Section 4).</p>
      <p>RT and DivRT can be used instead of the basic xQuAD as
diversi cation algorithms in the re-ranking phase.
Alternatively, basic xQuAD and RT or DivRT can be pipelined to
bene t from the strengths of them both. For instance, one
could use xQuAD to select 50 diversi ed recommendations
and then RT to select 20 recommendations from those 50,
or vice versa. Hereafter, we use the syntax X-after-Y, e.g.
xQuAD-after-RT, to indicate that algorithm X is executed
on the results of Y.
4.</p>
    </sec>
    <sec id="sec-5">
      <title>EXPERIMENTS</title>
      <p>We carried out a number of experiments to evaluate the
performance of the methods presented in the Section 3 on
two well known datasets: MovieLens1M and LibraryThing.</p>
      <p>
        MovieLens 1M1 dataset contains 1 million ratings from
6,040 users on 3,952 movies. The original dataset contains
information about genres and year of release, and was
enriched with further attribute information such as actors and
directors extracted from DBpedia2. More details about this
DBpedia enriched version of the dataset are available in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
Because not all movies have a corresponding resource in
DBpedia, the nal dataset contains 998,963 ratings from 6,040
users on 3,883 items. We built training and test sets by
employing a 60%-40% temporal split for each user.
      </p>
      <p>Moreover, we used the LibraryThing3 dataset, which
contains more than 2 million ratings from 7,279 users on 37,232
books. As in the dataset there are many duplicated ratings,
1Available at http://grouplens.org/datasets/movielens
2http://dbpedia.org
3Available at http://www.macle.nl/tud/LT
when a user has rated more than once the same item, we
selected her last rating. The unique ratings are 749,401. Also
in this case, we enriched the dataset by mapping the books
with BaseKB4, the RDF version of Freebase5 and then
extracting three attributes: genre, author and subjects. The
subjects in Freebase represent the topic of the book, for
instance Pilot experiment, Education, Culture of Italy, Martin
Luther King and so on. The dump of the mapping is
available online6. The nal dataset contains 565,310 ratings from
7,278 users on 27,358 books. We built training and test sets
by employing a 80%-20% hold-out split. The di erent ratio
used for LibraryThing respect to Movielens (60%-40%)
depends on its higher sparsity: holding 80% to build the user
pro le ensures a su cient number of ratings to train the
system.</p>
      <sec id="sec-5-1">
        <title>Number of users</title>
        <p>Number of items
Number of ratings
Data sparsity
Avg users per item
Avg items per user
Since the number of distinct values was too large for year,
actors and director attributes in MovieLens and for all the
attributes in LibraryThing, we convert years in the
corresponding decades and performed a K-means clustering for
other attributes on the basis of DBpedia categories7 for
MovieLens and Freebase categories8 for LibraryThing.
Table 2 and 3 report the number of attribute values and
clusters. The number of clusters was decided according to the
calculation of the within-cluster sum of squares (withinss
measure from the R Stats Package, version 2.15.3), that is
picking the value of K corresponding to an evident break in
the distribution of the withinss measure against the number
of clusters extracted.</p>
        <sec id="sec-5-1-1">
          <title>4http://basekb.com</title>
          <p>5https://www.freebase.com
6URL removed to guarantee anonymous submission.
7http://purl.org/dc/terms/subject
8http://www.w3.org/1999/02/22-rdf-syntax-ns#type</p>
          <p>For both datasets, we used the Bayesian Personalized
Ranking Matrix Factorization algorithm (BPRMF) available in
MyMediaLite9 as baseline (using the default parameters).
We performed experiments using other recommendation
algorithms, but we do not report results here since they are
very similar to those obtained by BPRMF.</p>
          <p>We selected the top-200 recommendations for each user to
generate the initial list P used for performing the re-ranking
as shown in Algorithm 1.</p>
          <p>
            Accuracy is measured in terms of Precision, Recall and
nDCG, but we only report nDCG values since the trend of
the other two metrics is very similar. Individual diversity
is measured using ILD and -nDCG (see Section 2) with
= 0:5 to equally balance diversity and accuracy, while
aggregate diversity is measured using both the catalog
coverage { computed as the percentage of items recommended
at least to one user { and the entropy { computed as in
[
            <xref ref-type="bibr" rid="ref3">3</xref>
            ] to analyse the distribution of recommendations among
all users. These two last metrics need to be considered
together, since the coverage gives a indication about the ability
of a recommender to cover the items catalog and the entropy
shows the ability to equally spread out the recommendations
across all the items. Hence, only an improvement of both
those metrics indicates a real increasing of aggregate
diversity, that in turn denotes a better personalization of the
recommendations [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ].
          </p>
          <p>As similarity measure for computing the ILD metric
(Equation 3) we used the Jaccard index. Considering that there
are more attributes for each item, we computed the average
of the Jaccard index value for each attribute shared between
two items. -nDCG is computed as the average of the
Equation 4 for each attribute.</p>
          <p>As presented in Section 3, we propose two novel
diversi cation approaches: RT and DivRT. We also propose a
method to combine in sequence di erent algorithms by means
of a two phase re-ranking procedure, with the aim of
bene ting from the strengths of both. Therefore we
evaluated other two approaches: xQuAD-after-RT and
RT-afterxQuAD, applying the second re-ranking phase on the set
of 50 recommendations provided from the rst phase. We
have also evaluated the combination with xQuAD and
DivRT, but the results are very similar using RT, so they will
not be shown. To evaluate the performances, we compare
the top-10 recommendation list generating from all the
approaches with basic xQuAD, by varying the parameter
from 0 to 0.95 with step xed to 0.05 in Equation 1 (higher
values of give more weight to accuracy, lower values to
diversity).</p>
          <p>
            The rules are produced using M5Rules10 algorithm
available in Weka based on the M5 algorithm proposed by
Quinlan [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] and improved by Wang and Witten [
            <xref ref-type="bibr" rid="ref22">22</xref>
            ]. M5Rules
generates a list of rules for regression problems using a
separate-and-conquer learning strategy. Iteratively it builds
a model tree using M5 and converts the best leaf into a rule.
We decided to use unpruned rules in order to have more
rules matchable with the items.
          </p>
        </sec>
        <sec id="sec-5-1-2">
          <title>9http://mymedialite.net/</title>
          <p>10http://weka.sourceforge.net/doc.dev/weka/
classifiers/rules/M5Rules.html</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>RESULTS DISCUSSION</title>
      <p>Results of the experiments on MovieLens and
LibraryThing are reported in Figure 2 and 3, respectively.</p>
      <p>MovieLens. xQuAD obtains the best results in terms
of ILD (Figure 2(a)) and -nDCG (Figure 2(b)), though
the xQuAD-after-RT results are very close and, with higher
values (namely giving more importance to the accuracy
factor), the di erences between them are not signi cant.
This outcome is due to the fact that the diversity metrics
are attribute-based and xQuAD operates directly
diversifying the attributes values, while the proposed rule-based
approaches do not take into account all the attributes
values. This also explains why the pure rule-based approaches
(RT and DivRT) obtain the worst diversity results, while
the combined algorithms (xQuAD-after-RT and
RT-afterxQuAD) obtain better results. It is noteworthy that these
last two con gurations have no substantial di erence with
ILD, but, in terms of -nDCG, xQuAD-after-RT
considerably overcomes RT-after-xQuAD. This demonstrates that
the pipeline of xQuAD and the rule-based approach
obtains good diversity. Considering coverage (Figure 2(c))
and entropy (Figure 2(d)) to evaluate the aggregate
diversity, the results show that using the rules the
recommendations are much more personalized. It is interesting to
note the compromise provided by xQuAD-after-RT, that
obtains equidistant results between xQuAD and the
rulebased algorithms, unlike RT-after-xQuAD that slightly
overcomes xQuAD. With respect to the baseline, no con
guration is able to give more accurate recommendations (nDCG
= 0.14); all are able to increase the individual diversity
(ILD = 0.34 and -nDCG = 0.27). With nDCG and the
individual diversity, the di erences are always statistically
signi cant (p &lt; 0.001), except using the pure ruled-based
approaches with &gt; 0.65. The situation is more complex
in terms of aggregate diversity, since the coverage grows
very little on the baseline (coverage = 0.29) and the entropy
slightly decreases (entropy = 0.78) with higher values.
According to a comprehensive analysis on MovieLens, the pure
rule-based approaches may give personalized and diversi ed
recommendations, also with small accuracy loss. However,
when individual diversity is more important than aggregate
diversity, combining xQuAD with a previous rule-based
reranking gives a good compromise between individual and
aggregate diversity.</p>
      <p>LibraryThing. At rst glance, the LibraryThing results
appear similar to those on MovieLens. Although they are
generally consistent, there are interesting di erences. Also
in this case, xQuAD obtains the best diversity values, with
ILD (Figure 3(a)) and -nDCG (Figure 3(b)). However,
both the combined approaches obtain really interesting
results, very close to xQuAD, except for the lower
values (namely giving more importance to the diversi cation
factor). Unlike what happens on MovieLens, in this case
RT-after-xQuAD obtains good results also in terms of
nDCG. The pure rule-based approaches still obtain worse
results. Considering coverage (Figure 3(c)) and entropy
(Figure 3(d)) to evaluate the aggregate diversity, the results
show that using the rules the recommendations are much
more personalized than using only xQuAD. The combined
approaches are able to improve the aggregate diversity with
respect to xQuAD, albeit they are still distant from the pure
rule-based approaches, especially in terms of coverage. With
respect to the baseline, all con gurations give a little more
accurate recommendations, with &gt; 0.65, but the di
erences are not statistically signi cant. In terms of individual
diversity, all of them are able to overcome the baseline (ILD
= 0.4 and -nDCG = 0.285) except when using the pure
rule-based approaches in terms of ILD. However they are
able to improve -nDCG. For the latter two metrics, the
di erences are always statistically signi cant (p &lt; 0.001).
In terms of aggregate diversity, xQuAD does not improve
the baseline result (coverage = 0.15 and -nDCG = 0.77),
while using the rules leads to better results. According to
a comprehensive analysis on LibraryThing, the pure
rulebased approaches may give more personalized
recommendations with a better diversity, especially using RT, with also
a small accuracy loss. Similarly to the analysis on
MovieLens, the results on LibraryThing suggest that diversifying
with only the rules is a good choice when aggregate
diversity is more important than individual diversity, conversely
xQuAD remains the best choice to improve the individual
diversity and combined with the rule-based diversi cation
improves also the aggregate diversity.</p>
      <p>The nal conclusions of this analysis are that using a
regression tree to infer rules representing user interests on
multi-attribute values in the diversi cation process with
xQuAD leads to more personalized recommendations but
with a less diversi ed list and that combining
attributebased and rule-based diversi cations in two phase re-ranking
is a good way for taking the advantages of both. The
better degree of personalization may depend on the fact that
the rules are di erent among the users since they represents
their individual interests. The lower individual diversity
values with ILD and -nDCG are due to the nature of these
metrics which are based directly on the attributes values
while the pure rule-based approaches do not take into
account all the attributes values.
6.</p>
    </sec>
    <sec id="sec-7">
      <title>RELATED WORK</title>
      <p>
        There is a noteworthy e ort by the research community in
addressing the challenge of recommendation diversity. That
interest arises from the necessity of avoiding monotony in
recommendations and controlling the balance between
accuracy and diversity, since increasing diversity inevitably puts
accuracy at risk [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. However, a user study in the movie
domain [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] demonstrates that user satisfaction is positively
dependent on diversity and there may not be the intrinsic
trade-o when considering user perception instead of
traditional accuracy metrics.
      </p>
      <p>
        Typically, the proposed approaches aim to replace items
in an already computed recommendation list, by minimizing
the similarity among all items. Some approaches exploit a
re-ranking phase with a greedy selection (see Section 2), for
instance [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], or with other techniques such us the Swap
algorithm [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], which starts with a list of K scoring items and
swaps the item which contributes the least to the diversity
of the entire set with the next highest scoring item among
the remaining items, by controlling the drop of the overall
relevance by a pre-de ned upper bound.
      </p>
      <p>
        Other types of approaches try to directly generate
diversi ed recommendation lists. For instance, [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] proposes a
probabilistic neighborhood selection in collaborative
ltering for selecting diverse neighbors, while in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], an adaptive
diversi cation approach is based on Latent Factor Portfolio
model for capturing the user interests range and the
uncertainty of the user preferences by employing the variance of
the learned user latent factors. In [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] it is proposed a hybrid
method based on evolutionary search following the Strength
Pareto approach for nding appropriate weights for the
constituent algorithms with the nal aim of improving accuracy,
diversity and novelty balance. [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] considers the problem to
improve diversity while maintaining adequate accuracy as
a binary optimization problem and proposes an approach
based on solving a trust region relaxation. The advantages
of this approach is that it seeks to nd the best sub-set of
items over all possible sub-sets, while the greedy selections
nds sub-optimal solutions.
      </p>
      <p>
        Multi-attribute diversity has been substantially non-treated
in the literature of recommender systems. A recent work [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
proposes an adaptive approach able to customize the degree
of recommendation diversity of the top-N list taking into
account the inclination to diversity of the user over di
erent content-based item attributes. Speci cally, entropy is
employed as a measure of diversity degree within user
preferences and used in conjunction with user pro le dimension
for calibrating the degree of diversi cation.
      </p>
      <p>
        Furthermore, increasing attention has been paid to the
intent-aware diversi cation, namely the process of
increasing the diversity taking into account the user interests. Some
approaches are based on adapted algorithms proposed for
the same purpose in the Information Retrieval eld, such as
IA-Select [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and xQuAD [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. An approach for extraction
of sub-pro les re ecting the user interests has been proposed
in [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. There a combination of sub-pro le recommendations
is generated, with the aim of maximizing the number of user
tastes represented and simultaneously avoiding redundancy
in the top-N recommendations. A more recent approach
[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], based on a binomial greedy re-ranking algorithm,
combines global item genre distribution statistics and
personalized user interests to satisfy coverage and non-redundancy
of genres in the nal list.
      </p>
      <p>
        The aggregate diversity, also known as sales diversity, is
considered another important factor in recommendation for
both business and user perspective: the user may receive
less obvious and more personalized recommendations,
comply with the target to help users discover new content [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]
and the business may increase the sales [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] proposes the
concept of aggregated diversity as the ability of a system to
recommend across all users as many di erent items as
possible and proposes e cient and parametrizable re-ranking
techniques for improving aggregate diversity with controlled
accuracy loss. Those techniques are simply based on
statistical informations such us items average ratings, average
predicted rating values, and so on. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] explores the impact
on aggregate diversity and novelty inverting the
recommendation task, namely ranking users for items. Speci cally, two
approaches have been proposed: one based on an inverted
neighborhood formation and the other on a probabilistic
formulation for recommending users to items. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] proposed a
k-furthest neighbors collaborative ltering algorithm to
mitigate the popularity bias and increase diversity, considering
also other factors in user-centric evaluation, such as novelty,
serendipity, obviousness and usefulness.
      </p>
    </sec>
    <sec id="sec-8">
      <title>CONCLUSIONS AND FUTURE WORK</title>
      <p>
        This paper addresses the problem of intent-aware
diversication in recommender systems in multi-attribute settings.
The proposed approach bases on xQuAD [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], a relevant
intent-aware diversi cation algorithm, and leverages
regression trees as user modeling technique. In their rule-based
equivalent representation, they are exploited to foster the
diversi cation of recommendation results both in terms of
individual diversity and in terms of aggregate one.
      </p>
      <p>The experimental evaluation on two datasets in the movie
and book domains demonstrates that considering the rules
generated from the di erent attributes available in an item
description provides diversi ed and personalized
recommendations, with a small loss of accuracy. The analysis of the
results suggests that a pure rule-based diversi cation is a good
choice when the aggregate diversity is more needed than
individual diversity. Conversely, basic xQuAD remains the
best choice to improve the individual diversity while its
combination with the rule-based diversi cation improves also the
aggregate diversity.</p>
      <p>For future work, we would like to evaluate the impact of
our approach also on the recommendation novelty. A way
to improve the novelty could be the expansion of the rules
by exploiting collaborative information.</p>
      <p>Acknowledgements. The authors acknowledge partial
support of PON02 00563 3470993 VINCENTE, PON04a2 E RES
NOVAE, PON02 00563 3446857 KHIRA e PON01 03113
ERMES.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Adamopoulos</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <article-title>On unexpectedness in recommender systems: Or how to expect the unexpected</article-title>
          .
          <source>In in Proc of RecSys '11 Intl. Workshop on Novelty and Diversity in Recommender Systems</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Adamopoulos</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <article-title>On over-specialization and concentration bias of recommendations: Probabilistic neighborhood selection in collaborative ltering systems</article-title>
          .
          <source>In Proceedings of the 8th ACM Conference on Recommender Systems, RecSys '14</source>
          , pages
          <fpage>153</fpage>
          {
          <fpage>160</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G.</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kwon</surname>
          </string-name>
          .
          <article-title>Improving aggregate recommendation diversity using ranking-based techniques</article-title>
          .
          <source>IEEE Trans. Knowl</source>
          . Data Eng.,
          <volume>24</volume>
          (
          <issue>5</issue>
          ):
          <volume>896</volume>
          {
          <fpage>911</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Novelty and Diversity Metrics for Recommender Systems: Choice, Discovery and Relevance</article-title>
          . In International Workshop on Diversity in
          <source>Document Retrieval (DDR 2011) at the 33rd European Conference on Information Retrieval (ECIR</source>
          <year>2011</year>
          ),
          <year>April 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C. L.A.</given-names>
            <surname>Clarke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kolla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Cormack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Vechtomova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ashkan</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <article-title>Buttcher, and I. MacKinnon. Novelty and diversity in information retrieval evaluation</article-title>
          .
          <source>In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '08</source>
          , pages
          <fpage>659</fpage>
          {
          <fpage>666</fpage>
          . ACM,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Tomeo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          .
          <article-title>An analysis of users' propensity toward diversity in recommendations</article-title>
          .
          <source>In ACM RecSys '14</source>
          , RecSys '
          <volume>14</volume>
          , pages
          <fpage>285</fpage>
          {
          <fpage>288</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Ekstrand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Harper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Willemsen</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. A. Konstan.</surname>
          </string-name>
          <article-title>User perception of di erences in recommender algorithms</article-title>
          .
          <source>In Proceedings of the 8th ACM Conference on Recommender Systems, RecSys '14</source>
          , pages
          <fpage>161</fpage>
          {
          <fpage>168</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Fleder</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Hosanagar</surname>
          </string-name>
          .
          <article-title>Blockbuster culture's next rise or fall: The impact of recommender systems on sales diversity</article-title>
          .
          <source>Management science</source>
          ,
          <volume>55</volume>
          (
          <issue>5</issue>
          ):
          <volume>697</volume>
          {
          <fpage>712</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hurley</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <article-title>Novelty and diversity in top-n recommendation { analysis and evaluation</article-title>
          .
          <source>ACM TOIT</source>
          ,
          <volume>10</volume>
          (
          <issue>4</issue>
          ):
          <volume>14</volume>
          :1{
          <fpage>14</fpage>
          :
          <fpage>30</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>McNee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          .
          <article-title>Being accurate is not enough: How accuracy metrics have hurt recommender systems</article-title>
          .
          <source>In CHI '06 Extended Abstracts on Human Factors in Computing Systems, CHI EA '06</source>
          , pages
          <fpage>1097</fpage>
          {
          <fpage>1101</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Ostuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Di</given-names>
            <surname>Noia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Sciascio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Mirizzi</surname>
          </string-name>
          .
          <article-title>Top-n recommendations from implicit feedback leveraging linked open data</article-title>
          .
          <source>In ACM RecSys '13</source>
          , pages
          <fpage>85</fpage>
          {
          <fpage>92</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Quinlan</surname>
          </string-name>
          .
          <article-title>Learning with continuous classes</article-title>
          .
          <source>In 5th Australian Joint Conference on Arti cial Intelligence</source>
          , pages
          <fpage>343</fpage>
          {
          <fpage>348</fpage>
          ,
          <string-name>
            <surname>Singapore</surname>
          </string-name>
          ,
          <year>1992</year>
          . World Scienti c.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lacerda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Veloso</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Ziviani</surname>
          </string-name>
          .
          <article-title>Pareto-e cient hybridization for multi-objective recommender systems</article-title>
          .
          <source>In RecSys '12</source>
          , pages
          <fpage>19</fpage>
          {
          <fpage>26</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Said</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Fields</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Jain</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Albayrak</surname>
          </string-name>
          .
          <article-title>User-centric evaluation of a k-furthest neighbor collaborative ltering recommender algorithm</article-title>
          .
          <source>In Proceedings of the 2013 Conference on Computer Supported Cooperative Work, CSCW '13</source>
          , pages
          <fpage>1399</fpage>
          {
          <fpage>1408</fpage>
          . ACM,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>R. L.T.</given-names>
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Macdonald</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Ounis.</surname>
          </string-name>
          <article-title>Exploiting query reformulations for web search result diversi cation</article-title>
          .
          <source>In WWW '10</source>
          , pages
          <fpage>881</fpage>
          {
          <fpage>890</fpage>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Larson</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanjalic</surname>
          </string-name>
          .
          <article-title>Adaptive diversi cation of recommendation results via latent factor portfolio</article-title>
          .
          <source>In ACM SIGIR '12</source>
          , pages
          <fpage>175</fpage>
          {
          <fpage>184</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>McClave</surname>
          </string-name>
          .
          <article-title>Similarity vs. diversity</article-title>
          .
          <source>In Proceedings of the 4th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development, ICCBR '01</source>
          , pages
          <fpage>347</fpage>
          {
          <fpage>361</fpage>
          . Springer-Verlag,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Baltrunas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Karatzoglou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Coverage, redundancy and size-awareness in genre diversity for recommender systems</article-title>
          .
          <source>In RecSys '14</source>
          , pages
          <fpage>209</fpage>
          {
          <fpage>216</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Baltrunas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Karatzoglou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Coverage, redundancy and size-awareness in genre diversity for recommender systems</article-title>
          .
          <source>In RecSys '14</source>
          , pages
          <fpage>209</fpage>
          {
          <fpage>216</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Exploiting the diversity of user preferences for recommendation</article-title>
          .
          <source>In OAIR '13</source>
          , pages
          <fpage>129</fpage>
          {
          <fpage>136</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Improving sales diversity by recommending users to items</article-title>
          . In Eighth ACM Conference on Recommender Systems, RecSys '14,
          <string-name>
            <surname>Foster</surname>
            <given-names>City</given-names>
          </string-name>
          , Silicon Valley, CA, USA - October 06 -
          <issue>10</issue>
          ,
          <year>2014</year>
          , pages
          <fpage>145</fpage>
          {
          <fpage>152</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          and
          <string-name>
            <given-names>I. H.</given-names>
            <surname>Witten</surname>
          </string-name>
          .
          <article-title>Induction of model trees for predicting continuous classes</article-title>
          .
          <source>In Poster papers of the 9th European Conference on Machine Learning</source>
          . Springer,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>C.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lakshmanan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Amer-Yahia</surname>
          </string-name>
          .
          <article-title>It takes variety to make a world: Diversi cation in recommender systems</article-title>
          .
          <source>In EDBT '09</source>
          , pages
          <fpage>368</fpage>
          {
          <fpage>378</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Hurley</surname>
          </string-name>
          .
          <article-title>Avoiding monotony: Improving the diversity of recommendation lists</article-title>
          .
          <source>In ACM RecSys '08</source>
          , pages
          <fpage>123</fpage>
          {
          <fpage>130</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Kuscsik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.G.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Medo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.R.</given-names>
            <surname>Wakeling</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <article-title>Solving the apparent diversity-accuracy dilemma of recommender systems</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          ,
          <volume>107</volume>
          :
          <fpage>4511</fpage>
          {
          <fpage>4515</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ziegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>McNee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Lausen</surname>
          </string-name>
          .
          <article-title>Improving recommendation lists through topic diversi cation</article-title>
          .
          <source>In WWW '05</source>
          , pages
          <fpage>22</fpage>
          {
          <fpage>32</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>