<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Hybrid algorithms for recommending new items in personal TV</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabio Airoldi</string-name>
          <email>fabio.airoldi@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Cremonesi Roberto Turrin</string-name>
          <email>paolo.cremonesi@polimi.it</email>
          <email>paolo.cremonesi@polimi.it roberto.turrin@moviri.com</email>
          <email>roberto.turrin@moviri.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Politecnico di Milano - DEI Moviri - R&amp;D</institution>
          ,
          <addr-line>P.zza Leonardo da Vinci, 32 Via Schiaffino, 11, Milano, Italy Milano</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Politecnico di Milano - DEI</institution>
          ,
          <addr-line>P.zza Leonardo da Vinci, 32, Milano</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recommending TV programs in the interactive TV domain is a difficult task since the catalog of available items is very dynamic, i.e., items are continuously added and removed. Despite recommender systems based on collaborative filtering typically outperform content-based systems in terms of recommendation quality, they suffer from the new item problem, i.e., they are not able to recommend items that have few or no ratings. On the contrary, content-based recommender systems are able to recommend both old and new items but the general quality of the recommendations in terms of relevance to the users is low. In this article we present two different approaches for building hybrid collaborative+content recommender systems, whose purpose is to produce relevant recommendations, while overcoming the new item issue. The approaches are tested on the implicit ratings collected from 15'000 IPTV users over a period of six months.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>Interactive television allows providers to deliver to their
costumers a huge amount of digital content. Since
discovering interesting items can be difficult in such wide
collections, recommender systems are used to provide the users
with personalized lists of items that they may like.</p>
      <p>
        Recommendations are based on the user preferences,
referred to as ratings, that the system has gathered either
explicitly (typically in the 1 . . . 5 scale) or implicitly (typically
in binary format: 1 if the user likes an item, 0 otherwise). In
the interactive TV (iTV) domain ratings are often implicit
[
        <xref ref-type="bibr" rid="ref12 ref13 ref2">2, 13, 12</xref>
        ], inferred by tracking the users’ activity (e.g., the
purchased movies or the watched TV programs).
      </p>
      <p>Item catalogs in the settings of iTV are intrinsically
subject to frequent modifications: new programs are usually
added as soon as they are available and old content becomes
no longer available. We can identify:
(i) A set of items that are repeated over time, such as
television seasons, weekly talk shows, or reruns of movies.
We can assume that a number of users have watched
these items, thus providing implicit ratings.
(ii) A set of items that are shown for the first time, such
as the first showing of a movie. We can assume that
no ratings have been collected about these items.</p>
      <p>Most recommender systems are based on collaborative
filtering algorithms, i.e., they recommend items on the basis
of the preferences of users similar to the target user.
However, since no ratings have been collected for new programs,
they cannot be recommended by collaborative recommender
systems. This issue is known as the new-item problem.</p>
      <p>
        Alternatively to collaborative filtering, recommender
systems can implement content-based filtering algorithms. Since
content-based approaches base their predictions upon the
description of TV programs in terms of features - such as
genre and actors - they are not influenced by the lack of
ratings. Unfortunately, collaborative algorithms have been
systematically proven to outperform content-based algorithms
in terms of recommendation quality, measured by standard
accuracy (e.g., recall and precision) and error metrics (e.g.,
RMSE and MAE). As an example, if we consider the
stateof-the-art recommender algorithms implemented in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], in
our experiments the collaborative approach reaches a recall
equals to 19%, while the content-based approach does not
go over 3%.
      </p>
      <p>In the settings of iTV, we would like to build a
recommender system not affected by the new-item problem, but
with a quality comparable to collaborative filtering. Several
hybrid algorithms have been proposed in the literature
merging into a unique algorithm both content-based and
collaborative filtering. Some of them have been even proven to
outperform base collaborative recommender systems in therms
of quality. However, the proposed solutions are rarely used
in production environments mainly because of scalability
issues. Furthermore, most approaches are designed to work
in the case of explicit, non-binary ratings, without any
particular focus on the new-item problem.</p>
      <p>In this work we present two families of hybrid
collaborative+content algorithms which are specifically designed
to respect the requirements of commercial real-time
recommender systems. We take into account state-of-the art
collaborative and content-based recommender algorithms
typically used in the presence of implicit, binary ratings. Each
hybrid solutions is composed by one collaborative and one
content-based algorithm.</p>
      <p>The main idea behind the first family of hybrid algorithms
is to augment the existing ratings used for training the
collaborative algorithm with additional ratings estimated with
the content-based algorithm. Diversely, the second family of
hybrid algorithms merges together the item-to-item
similarities computed by the collaborative and the content-based
algorithms.</p>
      <p>As a comparison, we also implemented a state-of-the-art
and a trivial hybrid algorithm. The latter simply mixes in
a unique recommendation list the items recommended with
the collaborative and those recommended with the
contentbased algorithm.</p>
      <p>The recommendation quality of the hybrid algorithms has
been evaluated in terms of recall against the quality of
baseline algorithms on the dataset implicitly collected by an
IPTV provider over a period of six months. A specific
testing methodology has been designed in order to evaluate the
quality of recommender algorithm in presence of new items.</p>
      <p>In Section 2 we present an overview of the existing
panorama regarding hybrid recommender systems, with particular
focus on the algorithms that are most promising in the
dynamic iTV scenario. In Section 3 we describe the reference
state-of-the-art algorithms we included in our experiments.
In Section 4 we illustrate the new algorithms that we
developed. Section 5 details the dataset we used for the
evaluation and on the testing methodologies we adopted. Section 6
shows and discusses the results we obtained. Finally, Section
7 draws the conclusions and leads on some possible future
work.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>
        Recently, several television operators have considered the
integration of a recommender system into their architectures
in order to provide personalized content (e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
      </p>
      <p>
        The need for recommender systems in TV applications
is motivated by the fact that users generally appreciate to
receive personal suggestions generated according to their
dynamically updated profiles, as shown in [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. The same study
reported also that customers prefer to be recommended with
items similar to the ones that they already rated, but also
with items that their friends have enjoyed.
      </p>
      <p>
        Recommender systems used to deliver personalized TV
experiences are typically content-based [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ], collaborative
[
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] or hybrid solutions [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ].
      </p>
      <p>In the next paragraphs we summarize the limitations of
non-hybrid recommender systems and we present an overview
of the most interesting existing algorithms suitable
personalized TV purposes.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Drawbacks of standard algorithms</title>
      <p>The two main families of recommender algorithms are the
content-based and the collaborative filtering.</p>
      <p>The former are based on the analysis of the content of
items (e.g., genre and actors). On the contrary, the latter
suggest items on the basis of the preferences of users similar
to the target user.</p>
      <p>
        Collaborative algorithms have recently received much more
attention than content-based solutions. The main reason is
that they usually reach a higher recommendation quality
than content-based systems [
        <xref ref-type="bibr" rid="ref17 ref8">17, 8</xref>
        ]. Furthermore, while
collaborative algorithms can be easily implemented in any
domain, content-based system are much more complex because
they require to analyze the items’ content (e.g., parsing
textual data) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>
        However, since collaborative algorithms are based on the
user ratings they have the following major drawbacks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]:
• New-item. Collaborative filtering is particularly
affected by the new-item problem, being not able to
recommend items that have received few or no ratings
because the system does not have enough information.
• Popularity bias. Collaborative filtering is biased
towards the most popular items, i.e., items which have
been rated by many users are more likely to be
recommended than items that have few ratings.
2.2
      </p>
      <p>
        During the last years several approaches have tried to
overcome to the drawbacks of single recommender approaches
by combining them into new hybrid recommender algorithms,
from simple implementations (e.g., [
        <xref ref-type="bibr" rid="ref10 ref25 ref34 ref6">34, 25, 10, 6</xref>
        ]) up to
very complex algorithms, such as the BellKor solution
winning the Netflix prize [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], which combines predictions from
107 different baseline recommender systems [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The idea
of merging multiple predictors has been often used in the
settings of machine learning to improve the classification
accuracy (e.g., [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]).
      </p>
      <p>
        Burke performed an extensive survey and classification of
hybrid recommender systems [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ].
      </p>
      <p>
        Mixed algorithms present simultaneously the items
recommended by two different recommender algorithms. As
an example, Smyth and Cotter [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ] show in the same
interface recommendations generated by a content-based and a
collaborative algorithm.
      </p>
      <p>
        In case the confidence of a prediction is measurable, it
is possible to implement a switched algorithm, that selects
the best algorithm to use according to some confidence
estimates. For instance, the system ‘NewsDude’ [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] combines
a content-based nearest-neighbor recommender, a
collaborative recommender, and na¨ıve Bayes classifier. Ratings are
predicted by the algorithm which shows the highest
confidence and then the user is recommended with the items that
have the highest predictions.
      </p>
      <p>
        Weighted algorithms compute a linear combination of the
ratings predicted by two (or more) recommender algorithms.
A similar approach has been proposed by Mobasher et. al. [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ],
that linearly combines item-to-item similarities generated by
different recommender algorithms. Differently from other
hybrid solutions, this method can be used on implicit datasets
(see Section 3.3 for further details).
      </p>
      <p>
        Meta-level recommender systems use the model produced
by an auxiliary algorithm for training the primary algorithm.
As an example, the restaurant recommender proposed by
Pazzani in [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] builds a feature-based model of the users
using content-based information. This model is then used
by a collaborative algorithm to compute recommendations.
      </p>
      <p>
        Melville et al. propose a feature-augmentation algorithm
denoted “Content boosted collaborative filtering” (CBCF)
[
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], which Burke’s survey [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ] reports as one of the best
algorithms. Their approach basically consists in creating
‘augmented user profiles’ by adding ‘pseudo-ratings’ to original
user profiles prior to generating recommendations.
Pseudoratings are generated using a content-based na¨ıve Bayes
classifier and can be interpreted as the ratings that users would
give to unrated items, given the items’ features. Rating
prediction is computed with a variant of the user-based
collaborative approach, where user-to-user similarities are
computed as the Pearson correlation coefficient between the
original user profiles and the augmented user profiles and the
weights assigned to pseudo-ratings depend on the number
of rated and co-rated items for each user. Melville et. al.
reported that their algorithm performed better than the
content-based na¨ıve Bayes and the user-based collaborative
algorithms in terms of MAE (Mean Absolute Error). They
also showed that their algorithm is less susceptible to
problems induced by sparsity: at 99.9% sparsity their algorithm
performed exactly like the content-based baseline.
      </p>
      <p>
        Regardless several hybrid algorithms have been brought
to the scientific attention during the last few years, there is
no exhaustive literature on hybrid recommender algorithms
studied to alleviate the new-item problem, with the only
notable exception being the work of Schein et. al. [
        <xref ref-type="bibr" rid="ref32 ref33">33, 32</xref>
        ].
They describe a generative algorithm with the specific
purpose to have high performance in cold-start situations. In
their work they present a multi-aspect probabilistic model
which is used to compute the probability that an item is liked
or not by a user. The aspects include collaborative data as
well as content-based data and one or more latent aspect.
The hidden aspects are used to model the hypothesis that,
as an example, a user liked a specific movie because of some
particular, latent motivation. This kind of algorithms, along
with association rule mining, neural networks and Boltzman
machines, even if promising, are however difficult to
integrate in existing systems, as they use a completely different
approach from traditional collaborative filtering and do not
properly scale with the number of users and items.
      </p>
      <p>
        Also, hybrid algorithms are subject to a series of
problematics. First, many of them (e.g., weighted hybrids) need to
use underlying algorithms that are able to perform rating
predictions. This makes them useless on implicit domains
like iTV applications: algorithms which rely on predicting
ratings strictly need explicit ratings to work properly as they
are usually optimized with respect to error metrics (such as
MAE or RMSE - e.g., [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]). However, the presence of only
implicit, binary ratings (as in our datasets) does not allow
to use this set of algorithms since we do not have proper
ratings.
      </p>
      <p>
        Additionally, some of the algorithms have scalability
issues. As an example, CBCF [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] requires augmented user
profiles in order to produce recommendations. Since it is
impossible to store the augmented URM (it would be a full
matrix), they need to be generated at real-time. However,
this requires to compute the pseudo ratings for all the user
in the neighborhood using a naive bayes classifier. Using the
neighborhood size suggested by Melville et. al., this would
mean generating 30 recommendation lists using the bayesian
classifier just to produce one hybrid recommendation list.
      </p>
      <p>Other simpler hybrid algorithms belonging to the weighted,
switching, and mixed categories, need to generate at
realtime a number of recommendation lists which is equal to
the number of the underlying baseline recommenders. The
consequence of this is that even the simplest algorithm
belonging to those categories (e.g., the interleaved approach
described in Section 3.3) would have a halved throughput
with respect to a non-hybrid solution.</p>
      <p>
        The only state-of-the-art hybrid approach with a
complexity comparable to non-hybrid algorithms is SimComb [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ];
however, we did not find any reference of this solution
applied to implicit datasets.
      </p>
    </sec>
    <sec id="sec-4">
      <title>STATE-OF-THE-ART ALGORITHMS</title>
      <p>Recommender algorithms usually collect user ratings in a
user-by-item matrix, from here on referred to as User Rating
Matrix (URM) and denoted by R, where rui is the rating
given by user u to item i. The u-th row of this matrix is
denoted by ru and represents the profile of user u.</p>
      <p>The URM is typically very sparse (users rate, on
average, only a few items). In the settings of iTV we assume to
have only implicit feedbacks, so that the URM contains
binary ratings, where the value 1 and 0 indicate, respectively,
whether the TV program has been watched or not.</p>
      <p>
        Collaborative filtering algorithms analyze the URM to
identify similarity between either users (user-based) or items
(item-based), or hidden relations between users and items
(latent factors). Item-based collaborative algorithms have
been proved to outperform user-based approaches in terms
of scalability and recommendation quality [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. They discover
item-to-item similarities using metrics such as the cosine or
the adjusted cosine similarity [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ]. Rating prediction for a
specific item is computed by considering the ratings given
by the target user to items similar to it (denoted neighbors).
Finally, latent factors collaborative algorithms are typically
based on singular value decomposition (SVD) to extrapolate
underlying relations between users and items [
        <xref ref-type="bibr" rid="ref22 ref31">31, 22</xref>
        ].
      </p>
      <p>
        Content-based recommender systems base
recommendation on the features extracted from the items’ content, such
as: the actors, the genre, and the summary of a movie. The
typical bag-of-word approach represents items as vectors of
features stored into a feature-by-item matrix, which is
referred to as Item Content Matrix (ICM) and denoted by W.
Items’ content requires to be processed in order to identify
terms (tokenizing), to remove useless words (stop words),
to normalize words (stemming), and to weight each feature
(weighting). As for the latter, weights are usually computed
using the TF-IDF metric [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. Users can be represented
as feature vectors, composed by the features of the movies
they have watched. Rating prediction can be computed as
the similarity between user and item feature vectors.
      </p>
      <p>In the following section we describe the state-of-the-art
recommender algorithms we have taken into consideration.
They comprise two collaborative filtering algorithms, one
content-based algorithm, and two hybrid algorithms.</p>
      <p>The algorithms we have considered are designed to be used
with implicit, binary datasets. Furthermore, the algorithms
share the property of not requiring any user-specific
parameters to be learned in advance. As a consequence, these
algorithms are able to recommend any user at real-time on
the basis of his/her most recent ratings.</p>
      <p>The state-of-the-art collaborative and content-based
algorithms will be used in Section 4 for defining two families
of hybrid recommender algorithms, each one combining one
content-based and one collaborative filtering: filtered feature
augmentation and similarity injection.
3.1</p>
    </sec>
    <sec id="sec-5">
      <title>Collaborative</title>
      <p>In the following we present one item-based (NNCosNgbr)
and one latent factor (PureSVD) state-of-the-art
collaborative algorithms suitable for predicting top-N
recommendations in the case of binary ratings.</p>
      <sec id="sec-5-1">
        <title>Non-normalized cosine KNN (NNCosNgbr).</title>
        <p>
          This is a state-of-the-art item-based collaborative filtering
algorithm described in [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Note that in the case of binary
ratings we cannot compute similarity metrics such as the
Pearson coefficient and the Adjusted Cosine, thus the
itemitem similarity has been measured as the cosine similarity.
        </p>
        <p>Rating prediction for item i has been computed by
summing up the similarities between item i and its neighbors,
where the set of neighbors - denoted by Dk(u; i) - has been
limited to the k most similar items. This approach is usually
known as knn (k nearest neighborhood). Rating prediction
is computed as:
rˆui =</p>
        <p>X
j∈Dk(u;i)</p>
        <p>
          ruj · siCjF
where siCjF represents the cosine similarity between item i
and j, computed as1:
This algorithm has been proved to have a good quality, as
shown in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] and [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>PureSVD.</title>
        <p>
          PureSVD is a latent factor collaborative algorithm, proved
to grant high recommendation quality in terms of recall [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>Let f denote the number of latent factors used for
representing users and items. Typical values of f are in the
range [50, 300]. SVD allows to factorize the URM into three
f -dimensional matrices - U, Σ, and Q - and to compute a
f -rank approximation of the original URM:</p>
        <p>Rf = U · Σ · Q⊺
where U and Q are orthonormal matrices and Σ is diagonal.
User u and item i are represented by f -dimensional vectors,
respectively: pu and qi. Rating prediction is computed
using the inner product:</p>
        <p>rˆui = pu · qi⊺
rˆui = ru · Q · qi⊺
If we define P = Σ · U, we can observe that, since U and Q
are orthonormal, P = R · Q and (4) can be rewritten as:</p>
        <p>Finally, observe that the inner product qi · qj⊺ represents
the similarity between items i and j.
3.2</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Content-based</title>
      <p>
        Differently to collaborative filtering, content-based
recommender systems rely only on content features. In the
following we describe LSA [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], a well-known approach based on
SVD.
      </p>
      <sec id="sec-6-1">
        <title>Latent Semantic Analysis (LSA). Latent Semantic Anal</title>
        <p>
          ysis [
          <xref ref-type="bibr" rid="ref16 ref19 ref2">16, 19, 2</xref>
          ] uses SVD to reduce the dimensionality of the
ICM and to capture underlying relations - like synonymy
between items. The weights in the ICM are computed
using the TF-IDF metric[
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]. Let l be the number of factors
- typically referred to as latent size - to be used for
representing items’ features. The ICM can be factorized and
approximated as:
        </p>
        <p>Wl = Z · Λ · Y⊺
Let us define B = YΛ. Similarity between items i and j is
denoted by siCjBF and can be computed as the cosine between
vectors bi and bj :
and rating prediction is computed as:
siCjBF =</p>
        <p>bi · bj⊺
kbik2 · kbjk2
ˆrui = ru · B · bi⊺
1x · y denotes the inner product between vectors x and y,
and kxk2 denotes the Euclidean norm of vector x.
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)</p>
        <p>In the following sections we present two families of hybrid
recommender systems based on one collaborative filtering
and one content-based filtering. The requirement for an
algorithm to be used in the second solution is to allow to derive
a similarity measure between items. Thus, all state-of-the
art collaborative and content-based algorithms presented in
Sections 3.1 and 3.2 can be used in the following hybrid
approaches.</p>
      </sec>
      <sec id="sec-6-2">
        <title>Interleaved.</title>
        <p>This algorithm is a trivial hybrid implementation that
forms the recommendation list to suggest to the target user
by alternating, in turn, one item predicted by the
collaborative algorithm and one predicted by the content-based
algorithm.</p>
      </sec>
      <sec id="sec-6-3">
        <title>SimComb.</title>
        <p>
          Mobasher et. al. [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] developed a weighted hybrid
algorithm in which item-to-item similarity values are computed
as the linear combination between content-based and
collaborative similarities:
cij = α · siCjBF + (1 − α) · siCjF
(9)
where siCjBF and siCjF are computed, respectively, using (2)
and (7). Finally, rating prediction can be computed using
(1), where siCjF is to be replaced with cij . We refer to this
algorithm as simComb.
4.
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>PROPOSED ALGORITHMS</title>
      <p>The algorithms we present in the following sections are
designed to overcome some of the limitations of existing
algorithms. Our first priority was to design hybrid
recommender systems able to work on implicit, binary ratings,
and to grant good recommendation quality even when only
content information is available (new items). In addition,
we also focus on scalability in order to design algorithms to
be used in real iTV scenarios.</p>
      <p>In the next two sections we present two families of hybrid
solutions: Filtered Feature Augmentation and Similarity
Injection Knn.
4.1</p>
    </sec>
    <sec id="sec-8">
      <title>Filtered Feature Augmentation (FFA)</title>
      <p>
        Filtered Feature Augmentation is a feature augmentation
algorithm mainly inspired by CBCF (Content Boosted
Collaborative Filtering) [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Differently from Mellville’s
solution, our approach
1. does not need any user-specific parameter to be learned;
2. allows to use different content-based and collaborative
filtering;
3. computes item-item similarities on the basis of the
ratings augmented with pseudo-ratings derived from the
content-based filtering, but use the original user
profiles for predicting ratings (as opposed to computing
item-item similarities using the original URM and
augmenting user profiles for predicting ratings).
      </p>
      <p>Figure 1 shows the learning process: the CBF trains the
content-based algorithm and computes the pseudo-ratings
for unknown rating values to be added to the original URM.</p>
      <p>The augmented URM (namely, aURM) is used as input for
the collaborative filtering.</p>
      <p>
        Since adding all pseudo-ratings would lead to a dense,
very large augmented URM, the filter selects only the most
relevant pseudo ratings. In our experiments we used two
different filters: a simple one which excludes all the
pseudoratings which are lower than a fixed threshold (FFAt) and a
more sophisticated one which use the Gini impurity measure
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] in order to add both high and low pseudo ratings to
increase the intrinsic information to the item profiles (FFAg).
The Gini coefficient is defined as:
2
Gini(v) = 1 − X px
x∈v
where px is the probability of x in v. In our specific case v
is an item profile (i.e., a column of the URM) and px = nnx ,
where nx is the number of ratings equal to x in v and n
is the number of ratings in v. When Gini(v) = 0, v is
pure (and brings almost no information). As we want to
add informative pseudo-ratings, the filter let only pass the
pseudo-ratings that increment the most the Gini index for
each item. This is done until at least g pseudo ratings are
added to each item-profile. The value of g depends on the
number of original ratings for each user profile (denoted by
n):
g =
(nmin − n + (h · n) if n ≤ nmin
      </p>
      <p>(h·nn2min) otherwise
where nmin and h are parameters. In our experiments we
used the average number of ratings as the value for nmin
and 0.3 for h.</p>
      <p>Rating prediction has been computed by using (1), where
CF is the similarity
rui are the the original user ratings, and sij
between items i and j computed using (2) on the augmented
URM.
4.2</p>
    </sec>
    <sec id="sec-9">
      <title>Similarity Injection Knn (simInjKnn)</title>
      <p>Similarity Injection Knn builds a model using item-item
similarities obtained by one collaborative and one
contentbased.</p>
      <p>We first compute item-item similarities siCjF using
collaborative filtering, retaining only the k most similar items.
SimCBF using
contentilarly, we compute item-item similarities sij
based filtering.</p>
      <p>The similarities are later merged into a unique item-item
similarity matrix S, by adopting a two-step process:
(10)
(11)</p>
      <p>
        We evaluated our algorithms using a variant of the
methodology described in [
        <xref ref-type="bibr" rid="ref11 ref22">22, 11</xref>
        ], designed to evaluate the
performance of recommender algorithms with focus on the
newitem problem.
      </p>
      <p>We used the dataset provided by an IPTV provider2. It
contains 25765 binary ratings implicitly collected over a
period of six months given by 15563 users on 794 items (0.0021
sparsity). Content-based features comprise: actors,
directors, category (e.g., movie, series, documentary. . . ), title,
genres, summary and language, for a total of 11291 features
and an average of 18 features per item. The main
characteristics of the available features are summarized in Table
1.</p>
      <p>
        The evaluation of recommender systems is typically
performed by using either error metrics or accuracy metrics
2Ratings refers to the dataset TV2 available at
http://home.dei.polimi.it/cremones/memo/
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Since error metrics - such as MAE (mean absolute
error) and RMSE (root mean square error) - rely on
computing the error between actual and predicted ratings, they
cannot be measured on implicit, binary datasets where this
information is not available [
        <xref ref-type="bibr" rid="ref15 ref20">20, 15</xref>
        ].
      </p>
      <p>
        For such reason we focus on accuracy metrics, that
estimate the fraction of relevant items which are actually
recommended (recall) or the fraction of the recommended items
that are actually relevant (precision). In addition, recent
works [
        <xref ref-type="bibr" rid="ref11 ref27">11, 27</xref>
        ] consider accuracy metrics as more suitable,
with respect to error metrics, for evaluating the top-N
recommendation task [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], i.e., the capability of the
recommender system to suggest very limited lists of items that
are likely to be of interest for the users.
      </p>
      <p>The standard definition of recall - which is typically used
in the settings of information retrieval - is:
recall =
|relevant ∧ retrieved|
|relevant|
(12)</p>
      <p>
        Usually, in the settings of recommender systems, the set of
relevant items is composed by items positively rated (e.g.,
if the rating is 5 out of 5). However, since we are facing
with an implicit, binary dataset, in our evaluation we have
considered - analogously to other works such as [
        <xref ref-type="bibr" rid="ref12 ref13 ref15">15, 12, 13</xref>
        ]
all rated items to be relevant, as we do not have any further
information about the degree of user satisfaction.
5.1
      </p>
    </sec>
    <sec id="sec-10">
      <title>Performance on new items</title>
      <p>
        In order to specifically evaluate the impact of new items on
the quality of the different algorithms we developed a testing
methodology, that we refer to as sliding window, which is an
extension of the evaluation methodology presented in [
        <xref ref-type="bibr" rid="ref11 ref22">22,
11</xref>
        ].
      </p>
      <p>The original approach evaluates the quality of
recommender algorithms by measuring the recall as a function of the
number of items displayed to the user (N ). The test consists
in excluding a certain amount of ratings (test set ) and using
the remaining ratings to train the algorithms (training set ).
All available content-based features are used for training the
content-based and hybrid algorithms. Once an algorithm
has been trained with the ratings in the training set, each
rating rui in the test set is tested as follows:
• we predict the score for all items unrated by user u
• we select the top-N items according to the estimated
score
• if item i appears in the top-N recommendation list, we
have a ‘hit ’, i.e., the system has correctly suggested a
relevant item.</p>
      <p>With respect to (12), the set of relevant items corresponds to
the test set, while the set of relevant, retrieved items
corresponds to the number of hits. Thus, recall can be rewritten
as a function of N , i.e., the number of items displayed to
users:
recall(N ) =</p>
      <p>#hits
|test-set|
(13)
Because of the high dataset sparsity, we have formed the
test set by randomly selecting the 20% of ratings in order to
have a significant number of samples.</p>
      <p>This evaluation methodology is not able to measure the
quality of the recommendations on new item problem.
Therefore we have implemented some modifications.</p>
      <p>Let M denote the number of items in the dataset. We
randomly divide items in two sets, H1 and H2, each one
composed by M/2 items. Test set and training set are
defined as a function of the percentage parameters β as follows:
• we define a set T by randomly selecting 20% of ratings
in the URM.
• the training set is defined as the set of ratings related
to items in H1, excluding all ratings in T .
• we form a set H1+2 composed by M/2 items, 100 −
β% randomly extracted from set H1 and β% randomly
extracted from set H2.
• test set is composed by the ratings in T related to
items in H1+2.</p>
      <p>Figure 2 schematically shows how the different sets are
formed. Blank circles, solid circles, and x’s refer to the ratings,
respectively, in the set T , in the test set, and in the training
set.</p>
      <p>For each value of β we have composed a training and a test
set and we have computed the quality of the recommender
algorithm in terms of recall, as defined in (13), where N
has been set equal to 20. The test is the same as the one
described for the original evaluation methdology. For each
rating rui in the test set: (i) we predict the score for all items
unrated by user u, (ii) we select the top-20 items according
to the estimated score, and (iii) we verify if there has been
a hit, i.e., if item i appears in the top-20.</p>
      <p>Note that only ratings related to half the items are
available for training the algorithms. The parameter β specifies
the percentage of new-items, i.e., the percentage of items
that do not have ratings in the training set and so that
cannot be recommended by standard collaborative filtering. In
our experiments we varied β from 0% to 100%.</p>
      <p>Collaborative filtering is expected to be able to
recommend only items in H2, so the higher β the lower the quality
of the algorithm. When β = 100% we expect the quality of
any collaborative filtering to be 0.</p>
      <p>On the other hand, content-based algorithms are trained
exclusively with content-based features, thus resulting
totally independent from ratings included into the training set.
We expect their quality not to be influenced by β. Finally,
hybrid approaches can be training by the ratings related to
half the items and with all available content-based features.</p>
    </sec>
    <sec id="sec-11">
      <title>RESULTS AND DISCUSSION</title>
      <p>In our test we considered the state-of-the-art recommender
algorithms described in Section 3 and the hybrid approaches
proposed in Section 4. As for the former, we included two
item-based collaborative algorithms – PureSVD and
nonnormalized cosine (NNCosNgbr) – a content-based algorithm
- Latent Semantic Analysis (LSA) – and two hybrid
algorithms – interleaved and simComb. As for the latter, we
included simInjKnn and two variants of filtered feature
augmentation, referred to as FFAg and FFAt. FFAg uses a
filter based on Gini’s impurity measure, while FFAt uses a
filter based on a fixed threshold, that has been set to 1, thus
excluding all the pseudo-ratings lower than 1 (the number
of pseudo-ratings added to the URM was the 27.5% of the
original number of ratings).</p>
      <p>The latent size l of LSA has been set to 300, while the
number of features f in PureSVD is equal to 50. The
neighborhood size k has been set equals to 200 for NNCosNgbr.
As for SimComb, the coefficient α has been set to 0.3 as it
empirically provides the better results. Finally, the
neighborhood size k used for simInjKnn is 100.</p>
      <p>Figure 3 shows the sliding window results, plotting the
recall of the state-of-the-art (a) and the proposed hybrid
(b) algorithms as a function of β, i.e., the percentage of new
items. The recall has been computed assuming N = 20.</p>
      <p>The most perceptible result is the very poor quality of the
content-based algorithm, whose recall does not go over 3%,
much lower than the best collaborative approach -
NNCosNgbr - whose recall is about 25% when recommending only
old items. In addition, we can observe that the three hybrid
algorithms - simComb, FFAt, and simInjKnn - have a recall
higher than collaborative and content-based state-of-the-art
solutions. As for the other two hybrid algorithms: the
interleaved approach confirms to have a poor quality as based
on a very trivial technique, while the lower quality of FFAg
is motivated by the fact that binary ratings do not bring
enough information for the filter to work properly. However,
the results of the sliding window test show that FFAt and
SimInjKnn outperform the state-of-the-art algorithm
SimComb when more than 40% of items is new. In addition, we
can notice that recall drastically falls down when β = 100%,
with exception for the content-based algorithm that is not
influenced by the presence of unrated items.</p>
      <p>As a final observation, let us consider the performance
implications of the proposed hybrid solutions. The Similarity
Injection Knn (SimInjKnn) requires to build two
item-toitem similarity matrices, which is the same effort as required
for the interleaved algorithm. Filtered Feature
Augmentation (FFAg and FFAt), on the other hand, requires to
compute pseudo-ratings for each item. According to the number
of items and users, this can be computationally costly. The
main benefit of Filtered Feature Augmentation is that it is
completely independent from the collaborative and
contentbased algorithms used, as opposed to Similarity Injection
Knn, which strictly requires algorithms able to express
itemitem similarities.</p>
    </sec>
    <sec id="sec-12">
      <title>CONCLUSIONS AND FUTURE WORKS</title>
      <p>The new hybrid algorithms we proposed have been proved
to improve the overall performance of the state-of-the-art
algorithms in suggesting new items, though their performance
is limited by the poor quality of content-based algorithms.</p>
      <p>
        In future work we plan to implement between-subjects
controlled experiments for a subjective evaluation of the
proposed solutions. In fact, recent works have pointed out that
objective evaluation of recommender systems (i.e., based on
error and accuracy metrics) is not always aligned with the
actual user perception of the recommendation quality, as
measured via controlled experiments (e.g., [
        <xref ref-type="bibr" rid="ref27 ref9">9, 27</xref>
        ]).
      </p>
      <p>FFAg
FFAt
simInjKnn
0.3</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <article-title>Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. Knowledge and Data Engineering</article-title>
          , IEEE Transactions on,
          <volume>17</volume>
          (
          <issue>6</issue>
          ):
          <fpage>734</fpage>
          -
          <lpage>749</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Bambini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          .
          <article-title>Recommender Systems Handbook, chapter A Recommender System for an IPTV Service Provider: a Real Large-Scale Production Environment</article-title>
          . to appear, Springer,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Bell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Volinsky</surname>
          </string-name>
          .
          <article-title>The BellKor solution to the Netflix Prize</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Bell</surname>
          </string-name>
          and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          .
          <article-title>Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights</article-title>
          .
          <source>Seventh IEEE International Conference on Data Mining (ICDM</source>
          <year>2007</year>
          ), pages
          <fpage>43</fpage>
          -
          <lpage>52</lpage>
          , Oct.
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Billsus</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          .
          <article-title>User modeling for adaptive news access. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <volume>10</volume>
          :
          <fpage>147</fpage>
          -
          <lpage>180</lpage>
          ,
          <year>February 2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>Hybrid recommender systems: Survey and experiments</article-title>
          . volume
          <volume>12</volume>
          , pages
          <fpage>331</fpage>
          -
          <lpage>370</lpage>
          , Hingham, MA, USA,
          <year>November 2002</year>
          . Kluwer Academic Publishers.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>The adaptive web</article-title>
          .
          <source>chapter Hybrid web recommender systems</source>
          , pages
          <fpage>377</fpage>
          -
          <lpage>408</lpage>
          . Springer-Verlag, Berlin, Heidelberg,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Candillier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Jack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fessant</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Meyer</surname>
          </string-name>
          .
          <article-title>State-of-the-Art Recommender Systems</article-title>
          .
          <source>IGI Global</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pu</surname>
          </string-name>
          .
          <article-title>A user-centric evaluation framework of recommender systems</article-title>
          .
          <source>In In ACM Conference on Recommender Systems (RecSysSˇ10)</source>
          ,
          <source>Workshop on User-Centric Evaluation of Recommender Systems and Their Interfaces (UCERSTISˇ10)</source>
          ., pages
          <fpage>14</fpage>
          -
          <lpage>21</lpage>
          , Barcelona, Spain,
          <year>2010</year>
          . Sept.
          <volume>26</volume>
          -
          <fpage>30</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Claypool</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gokhale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Murnikov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Netes</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Sartin</surname>
          </string-name>
          .
          <article-title>Combining content-based and collaborative filters in an online newspaper</article-title>
          .
          <source>In Proceedings of ACM SIGIR Workshop on Recommender Systems</source>
          ,
          <year>August 1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          .
          <article-title>Performance of recommender algorithms on top-n recommendation tasks</article-title>
          .
          <source>In RecSys</source>
          , pages
          <fpage>39</fpage>
          -
          <lpage>46</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          .
          <article-title>Analysis of cold-start recommendations in iptv systems</article-title>
          .
          <source>In RecSys '09: Proceedings of the 2009 ACM conference on RecommenderSystems</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          . ACM,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          .
          <article-title>Time-evolution of iptv recommender systems</article-title>
          .
          <source>In Proc. of the 8th European Conference on Interactive TV and Video</source>
          , Tempere, Finland,
          <year>June 2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>B. de Ville</surname>
          </string-name>
          .
          <article-title>Decision trees for business intelligence and data mining: using SAS enterprise miner</article-title>
          .
          <source>SAS Publishing</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Deshpande</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <article-title>Item-based top-n recommendation algorithms</article-title>
          .
          <source>ACM Transactions on Information Systems (TOIS)</source>
          ,
          <volume>22</volume>
          (
          <issue>1</issue>
          ):
          <fpage>143</fpage>
          -
          <lpage>177</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>G. W.</given-names>
            <surname>Furnas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Deerwester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. T.</given-names>
            <surname>Dumais</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. K.</given-names>
            <surname>Landauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Harshman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Streeter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Lochbaum</surname>
          </string-name>
          .
          <article-title>Information retrieval using a singular value decomposition model of latent semantic structure</article-title>
          . pages
          <fpage>465</fpage>
          -
          <lpage>480</lpage>
          , New York, NY, USA,
          <year>1988</year>
          . ACM Press.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Ghazanfar</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Prugel-Bennett</surname>
          </string-name>
          .
          <article-title>A scalable, accurate hybrid recommender system</article-title>
          .
          <source>In Proceedings of the 2010 Third International Conference on Knowledge Discovery and Data Mining, WKDD '10</source>
          , pages
          <fpage>94</fpage>
          -
          <lpage>98</lpage>
          , Washington, DC, USA,
          <year>2010</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Terveen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Evaluating collaborative filtering recommender systems</article-title>
          .
          <source>ACM Transactions on Information Systems (TOIS)</source>
          ,
          <volume>22</volume>
          (
          <issue>1</issue>
          ):
          <fpage>5</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>P.</given-names>
            <surname>Husbands</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Simon</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Ding</surname>
          </string-name>
          .
          <article-title>On the use of singular value decomposition for text retrieval</article-title>
          .
          <source>Oct</source>
          .
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>G.</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <article-title>Evaluation of item-based top-n recommendation algorithms</article-title>
          .
          <source>In Proceedings of the tenth international conference on Information and knowledge management</source>
          ,
          <source>CIKM '01</source>
          , pages
          <fpage>247</fpage>
          -
          <lpage>254</lpage>
          , New York, NY, USA,
          <year>2001</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kittler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hatef</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Duin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Matas</surname>
          </string-name>
          .
          <article-title>On combining classifiers</article-title>
          .
          <source>Pattern Analysis and Machine Intelligence</source>
          , IEEE Transactions on,
          <volume>20</volume>
          (
          <issue>3</issue>
          ):
          <fpage>226</fpage>
          -
          <lpage>239</lpage>
          , mar
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          .
          <article-title>Factorization meets the neighborhood: a multifaceted collaborative filtering model</article-title>
          .
          <source>In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          , pages
          <fpage>426</fpage>
          -
          <lpage>434</lpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lops</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gemmis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Semeraro</surname>
          </string-name>
          .
          <article-title>Content-based Recommender Systems: State of the Art and Trends</article-title>
          . In F. Ricci,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Shapira</surname>
          </string-name>
          , and P. B. Kantor, editors,
          <source>Recommender Systems Handbook, chapter 3</source>
          , pages
          <fpage>73</fpage>
          -
          <lpage>105</lpage>
          . Springer US, Boston, MA,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>P.</given-names>
            <surname>Melville</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Mooney</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Nagarajan</surname>
          </string-name>
          .
          <article-title>Content-boosted collaborative filtering for improved recommendations</article-title>
          .
          <source>In Eighteenth national conference on Artificial intelligence</source>
          , pages
          <fpage>187</fpage>
          -
          <lpage>192</lpage>
          , Menlo Park, CA, USA,
          <year>2002</year>
          . American Association for Artificial Intelligence.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Jin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <article-title>Semantically Enhanced Collaborative Filtering on the Web</article-title>
          .
          <source>In Proceedings of the 1st European Web Mining Forum (EWMF2003)</source>
          , pages
          <fpage>57</fpage>
          -
          <lpage>76</lpage>
          , Sept.
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>R.</given-names>
            <surname>Navarro-Prieto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rebaque-Rivas</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>J.</given-names>
            <surname>Hern</surname>
          </string-name>
          <article-title>´andez-Pablo. Recommending content for itv: what the users really want</article-title>
          ?
          <source>In Proceedings of the 8th international interactive conference on Interactive TV&amp;#38;Video</source>
          , EuroITV '
          <volume>10</volume>
          , pages
          <fpage>123</fpage>
          -
          <lpage>126</lpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S. N. A. P. R. T.</given-names>
            <surname>Paolo</surname>
          </string-name>
          <string-name>
            <surname>Cremonesi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Franca</given-names>
            <surname>Garzotto</surname>
          </string-name>
          .
          <article-title>Comparative evaluation of recommender system quality. In CHI extended abstract on Human factors in computing systems</article-title>
          . ACM - to appear,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          .
          <article-title>A framework for collaborative, content-based and demographic filtering</article-title>
          .
          <source>Artif. Intell. Rev.</source>
          ,
          <volume>13</volume>
          :
          <fpage>393</fpage>
          -
          <lpage>408</lpage>
          ,
          <year>December 1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29] G. Salton, editor.
          <source>Automatic text processing. Addison-Wesley Longman Publishing Co., Inc</source>
          ., Boston, MA, USA,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>B.</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , G. Karypis,
          <string-name>
            <given-names>J.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Reidl</surname>
          </string-name>
          .
          <article-title>Item-based collaborative filtering recommendation algorithms</article-title>
          .
          <source>10th Int. Conf. on World Wide Web</source>
          , pages
          <fpage>285</fpage>
          -
          <lpage>295</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>B.</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , G. Karypis,
          <string-name>
            <given-names>J.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Incremental singular value decomposition algorithms for highly scalable recommender systems</article-title>
          .
          <source>5th Int. Conf. on Computer and Information Technology (ICCIT</source>
          <year>2002</year>
          ), pages
          <fpage>399</fpage>
          -
          <lpage>404</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Popescul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ungar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Pennock</surname>
          </string-name>
          .
          <article-title>Generate models for cold-start recommendations</article-title>
          .
          <source>In ACMSIGIR Workshop on RecommenderSystems.</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Popescul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ungar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Pennock</surname>
          </string-name>
          .
          <article-title>Methods and metrics for cold-start recommendations</article-title>
          .
          <source>In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR</source>
          <year>2002</year>
          ), pages
          <fpage>253</fpage>
          -
          <lpage>260</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Cotter</surname>
          </string-name>
          .
          <article-title>A personalised tv listings service for the digital tv age</article-title>
          .
          <source>Knowl.-Based Syst.</source>
          ,
          <volume>13</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>53</fpage>
          -
          <lpage>59</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , S. Zheng, and
          <string-name>
            <given-names>J.</given-names>
            <surname>Yuan</surname>
          </string-name>
          .
          <article-title>A personalized tv guide system compliant with mhp</article-title>
          .
          <source>Consumer Electronics</source>
          , IEEE Transactions on,
          <volume>51</volume>
          (
          <issue>2</issue>
          ):
          <fpage>731</fpage>
          -
          <lpage>737</lpage>
          , May
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zimmerman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kurapati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Buczak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schaffer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gutta</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Martino</surname>
          </string-name>
          .
          <article-title>Chapter 5 tv personalization system design of a tv show recommender engine and interface</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>