<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Recommender Popularity Controls: An Observational Study</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>F. Maxwell Harper</string-name>
          <email>max@umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Minnesota</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>We describe an observational study of a recommender system that provides users with direct control over their personalization. Specifically, we allow users to tune a movie recommender towards more or less popular content. We report on 14 months of usage, which includes 6,846 users who visited the interface at least once. We find, surprisingly, that the popularity of items a user has interacted with historically is a poor predictor of their use of this interface.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        In prior work [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we reported on a lab study of how users interact
with unlabelled controls that modify the popularity or recency of a
movie recommender system. Among several results, we found that
users who adjusted popularity reported much higher satisfaction
with their recommendations. While most users tuned their
recommender towards more popular content, there was a wide range of
preferences among users. Therefore, we built a popularity tuner
interface into the live system to develop a better understanding of
how this feature would be used in practice, and to collect more data
to experiment with modeling users’ preferred settings.
      </p>
    </sec>
    <sec id="sec-2">
      <title>OBSERVATIONAL STUDY</title>
      <p>
        Our platform, MovieLens (http://movielens.org), allows users to
choose among multiple recommendation algorithms [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]; we enable
the popularity interface on the most popular two: item-item
collaborative filtering [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Funk SVD [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Our recommendation
algorithm in each case (described in detail in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) is a linear blend of
predicted star rating with popularity. We use number of ratings in
the last year as our popularity metric. The default setting for both
algorithms is a blend of 95% predicted rating with 5% popularity,
which we chose based on the results of our lab study.
      </p>
      <p>The popularity tuner interface is shown in Figure 1. On each
more/less popular click action, we use a binary search to locate new
weights that will replace 4 of the user’s top 20 recommendations.
For example, a user clicking “more popular” triggers a binary search
that might yield a new blend of 92.1% predicted rating with 7.9%
popularity. We emphasize the changes visually in the interface to
help users understand the result of their actions. To encourage use
of the feature, we added a “configure” button next to the user’s list
of top recommendations at the top of the home page.</p>
      <p>We deployed the feature on April 11, 2016, and collected data
through June 15, 2017. We consider two groups for analysis. First,
for examining overall use of the popularity tuner, we look at all
users who visited the interface one or more times (visited_sample,
N=6846, median logins=11, median ratings=168). Second, for
examining users’ preferred popularity settings, we look at users who
took at least one action (interacted_sample, N=4349, median
logins=14, median ratings=205). These samples are skewed toward
power users as compared with all users that logged in during this
period (N=31371, median logins=3, median ratings=39).</p>
      <p>Over the span of this observational study (or even the span of
a single session), users might change settings multiple times. In
fact, users in the visited_sample opened the tuning interface 24,657
times (3.6 times per user); users in the interacted_sample visited
the page 19,465 times (4.5 times per user). Unless otherwise noted,
user-level analysis examines the user’s final configuration at the
time of analysis. For example, a user might try the tuner interface
three times and explore the system in between each change; our
user-level analysis considers only the most recent configuration.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Descriptive Statistics</title>
      <p>
        The distribution of per-user activity in online systems is often
modeled by a power law distribution [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; use of the tuner interface is
no exception. We find that a few users use the feature heavily, while
the vast majority of users pay it little attention — see Table 1 for an
overview. For instance, 82% of users take 5 or fewer more/less/reset
actions and 78% of sessions are less than a minute in duration. On
the other hand, power users have explored the feature thoroughly
— one user has taken more than 5,500 actions.
      </p>
      <p>Within the interacted_sample, the most common popularity
setting is the default one (1,390 users, 32.0% of sample). That is,
after exploring one or more new settings, many users return to
the default, either by clicking the “reset” button, or by sending the
percentile # actions # sessions session length (sec.)
25th 0 1 3
50th (median) 1 2 5
80th 5 5 67
95th 14 10 251
99th 35 20 1428
Table 1: Descriptive statistics of activity levels by percentile
in the observational study across users in the visited_sample.
# actions and # sessions are user-level variables, while
session length is a session-level variable with possibly multiple
observations per user.</p>
      <p>1400
rs1200
e
su1000
fo 800
r 600
be 400
um 200
n 0
-5
0
5
10</p>
      <p>variable estimate
intercept -1.160000 ***
# logins 0.000012
# ratings 0.000062 ***
predictor==SVD 0.068710 ***
mean pp1y, rated 0.818100 **
mean pp1y, wishlisted 0.064480
mean pp1y, clicked 0.401600 ***
Table 2: Summary of our regression model to predict users’
choices, R2 = 0.04. Statistically significant p-values indicated
as *** (p &lt; 0.001) or ** (p &lt; 0.01).
same number of “more popular” and “less popular” actions. More
users (1,714, 39.4%) end with a setting more popular than default, as
compared with the number that end less popular than default (1,245,
28.6%). See Figure 2 for a visualization of this data, represented as
number of steps from the default.</p>
    </sec>
    <sec id="sec-4">
      <title>2.2 Analysis: Predicting Settings</title>
      <p>It is possible that we can predict users’ preferred popularity
settings from their behavioral data. To explore this idea, we construct
regression models using popularity-related predictor variables. In
particular, we include a metric — “one-year popularity percentile”
(pp1y) — that exactly mirrors the variable used in the
popularity blending algorithm. We measure pp1y of rated, wishlisted, and
clicked items to capture a user’s propensity to interact with more or
less popular content. In addition, we use variables that capture the
user’s number of logins, number of ratings, and chosen prediction
algorithm.
0.30
0.25
lue 0.20
a
v
ing 0.15
d
len 0.10
b
itry 0.05
a
lpuo 0.00
p 0.05
0.100.80 0.85 0.90 0.95 1.00 0.80 0.85 0.90 0.95 1.00 0 200 400 600 800 1000
mean pp1y, rated mean pp1y, clicked # ratings</p>
      <p>Our regression model has several statistically significant
variables, but overall, it is a poor fit with users’ popularity blending
choices (adjusted R-squared=0.04); see Table 2 for a summary and
Figure 3 for a visualization of several variables. The model indicates
that the popularity of a user’s rated and clicked-on movies is
positively correlated with the user’s final configuration value. Also, the
model shows a relationship between a user’s choice of prediction
algorithm and their chosen popularity blending weight (means:
item-item=0.09, funkSVD=0.15). The model shows that number of
ratings is positively correlated with the popularity setting, but with
a tiny efect size; number of logins and popularity of wishlisted
content are not statistically significant.</p>
    </sec>
    <sec id="sec-5">
      <title>3 CONCLUSION</title>
      <p>Thousands of users have tried the popularity tuner feature to
conifgure their movie recommender. They have used the feature in very
diferent ways, and it proves dificult to predict their choices based
on behavioral data. The poor overall fit of our regression model
underscores the potential helpfulness of giving control to users.
While it is possible that our model is too preliminary, or that our
interface does not suficiently elicit users’ ideal preferences, it is also
possible that users have fickle preferences for their recommenders
that are dificult to get right through modeling alone.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Michael</surname>
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Ekstrand</surname>
            ,
            <given-names>Daniel</given-names>
          </string-name>
          <string-name>
            <surname>Kluver</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Maxwell Harper</surname>
          </string-name>
          , and
          <string-name>
            <surname>Joseph</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Konstan</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Letting Users Choose Recommender Algorithms: An Experimental Study</article-title>
          .
          <source>In Proceedings of the 9th ACM Conference on Recommender Systems (RecSys '15)</source>
          . ACM, New York, NY, USA,
          <fpage>11</fpage>
          -
          <lpage>18</lpage>
          . DOI:http://dx.doi.org/10.1145/2792838.2800195
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Simon</given-names>
            <surname>Funk</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Netflix update: Try this at home</article-title>
          .
          <source>(Dec</source>
          .
          <year>2006</year>
          ). http://sifter.org/ ~simon/journal/20061211.html
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>F.</given-names>
            <surname>Maxwell</surname>
          </string-name>
          <string-name>
            <surname>Harper</surname>
          </string-name>
          , Funing Xu, Harmanpreet Kaur, Kyle Condif,
          <string-name>
            <given-names>Shuo</given-names>
            <surname>Chang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Loren</given-names>
            <surname>Terveen</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Putting Users in Control of Their Recommendations</article-title>
          .
          <source>In Proceedings of the 9th ACM Conference on Recommender Systems (RecSys '15)</source>
          . ACM, New York, NY, USA,
          <fpage>3</fpage>
          -
          <lpage>10</lpage>
          . DOI:http://dx.doi.org/10.1145/2792838.2800179
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Badrul</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , George Karypis, Joseph Konstan,
          <string-name>
            <given-names>and John</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Item-based Collaborative Filtering Recommendation Algorithms</article-title>
          .
          <source>In Proceedings of the 10th International Conference on World Wide Web (WWW '01)</source>
          . ACM, New York, NY, USA,
          <fpage>285</fpage>
          -
          <lpage>295</lpage>
          . DOI:http://dx.doi.org/10.1145/371920.372071
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Dennis</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Wilkinson</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Strong Regularities in Online Peer Production</article-title>
          .
          <source>In Proceedings of the 9th ACM Conference on Electronic Commerce (EC '08)</source>
          . ACM, New York, NY, USA,
          <fpage>302</fpage>
          -
          <lpage>309</lpage>
          . DOI:http://dx.doi.org/10.1145/1386790.1386837
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>