<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using Visualizations to Encourage Blind-Spot Exploration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jayachithra Kumar</string-name>
          <email>j.kumar-1@student.tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Visualization, Recommender Systems, Blind Spots, Filter Bubble,</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nava Tintarev</string-name>
          <email>n.tintarev@tudelft.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Delft University of Technology</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Scatterplot</institution>
          ,
          <addr-line>Bar-line chart</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <abstract>
        <p>In this paper, we help users to better understand their consumption profiles by exposing them to their unexplored regions, thereby indirectly nudging them to diverse exploration. We refer to these regions as a user's blind-spots, and we visualize these by enabling comparisons between a user's consumption pattern with that of other users of the system. We compare the efectiveness of two visualizations - a bar-line chart and a scatterplot - for increasing a user's intention to explore new content. The results suggest that users can understand both visualizations. Furthermore, our results confirmed that users with higher understanding of their profile tend to explore their blind-spot categories more. This experiment is a first step towards increasing user's awareness of their choices as well as providing the kind of user control that encourages users to explore new types of items.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <p>• Information systems → Decision support systems; •
Human centered computing → Human computer interaction
(HCI);</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        While personalized recommendations can help people to cope with
the information overload problem, over time, using recommender
systems can decrease the diversity of content that we consume
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], thereby, limiting our exposure to some novel content, views
and opinions contrary to our own. Our current preferences often
reflect our past preferences, and our behaviors may also interact
with online filtering and ranking algorithms to further narrow our
views. This phenomena of algorithmic narrowing, or over-tailoring,
is called ‘filter bubbles’ [
        <xref ref-type="bibr" rid="ref16 ref3 ref6">3, 6, 16</xref>
        ]. However, there may be design
choices for recommender systems that could decrease over-tailoring.
Flaxman et al. found evidence that recent technological changes
both increase and decrease various aspects of over-tailoring [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>This work addresses this possibility by helping users understand
the limitations of their consumption patterns using visualizations.
Specifically, we propose a novel approach for recognizing
‘blindspots’ in user profiles - regions of the preference space that are
under-represented - and describe techniques for revealing these
blind-spots to users. By helping users to recognize these blind-spots,
we also study if this has a demonstrable efect on their
consumption; whether this recognition encourages them to further explore
the recommendation space. In the following sections, we describe
the results of a user experiment to evaluate the eficacy of two
diferent techniques for blind-spot visualization and its efect on
users’ exploration of the recommendation space.</p>
      <p>
        In the next section, we describe related work. This is followed
by a description of the method used to generate the visualizations
used in this study (Section 3). Next in Section 4, we describe a
lab study with 23 participants which investigates the relationship
between understanding the visualizations and exploring blind-spot
regions in a user’s profile. Section 5 outlines the main results. This is
followed by discussion of qualitative findings, and post-hoc analysis
for surprising results in Section 6. We conclude with suggestions
for future work in Section 7
2
This work sits at the intersection between two important
recommender systems themes: 1) the use of visualization to aid
transparency and explanation, and 2) techniques for dealing with filter
bubbles. One important objective of this work is to increase user’s
awareness of their filter-bubble, to improve decision making by
better informing users about their consumption pattern. To help
users understand their own consumption patterns we propose an
approach for visualizing user profiles. This builds on work for
visualizing consumption blind-spots in movie recommender systems
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], and visualizing consumption profiles in music [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>
        When it comes to mitigating filter-bubbles, there are two
common responses in the literature. The first approach is to develop
recommendation algorithms that are more responsive to the risks
inherent in the filter bubble. This can be achieved by focusing on
tuning algorithms to increase beyond accuracy aspects (such as
diversity, serendipity, coverage and novelty), in addition to relevance
of recommendations (c.f., [
        <xref ref-type="bibr" rid="ref1 ref17 ref2 ref20 ref4">1, 2, 4, 17, 20</xref>
        ]), and re-ranking
recommendation lists to include diversity in an optimization function (c.f.,
[
        <xref ref-type="bibr" rid="ref12 ref19">12, 19</xref>
        ]).
      </p>
      <p>
        While improving recommendation diversity can go some way
to coping with the filter bubble, it is far from a complete solution.
For example, it does not increase user awareness of the filter
bubble itself. A second approach helps users to better understand the
available options – the recommendation space – so as to inform
them about the compromises that are inherent in any set of
recommendations, relative to a wider set of items. In this regard, the
work of [
        <xref ref-type="bibr" rid="ref14 ref18">14, 18</xref>
        ] is pertinent, showing how visualization was found
to increase user awareness of the filter bubble, understandability of
the filtering mechanism, and a user’s sense of control.
      </p>
      <p>In this paper, we address the blind-spot issue by showing the
consumption behaviours of users, and highlighting blind-spots that
may exist in their consumption relative to a larger user population.
We further study whether by making users aware of their
blindspots, we may be able to influence them to explore items in the
under-explored parts of their catalogue.
3</p>
    </sec>
    <sec id="sec-3">
      <title>METHOD</title>
      <p>In this section, we provide a brief overview of the stages involved
in the extraction and visualization of consumption pattern. With
our visualization we aim to give users a holistic view of their
filterbubble by enabling them to compare their consumption pattern
(user profile) with the (aggregate) consumption pattern of other
users of the system (‘global’ consumption pattern or ‘global’
proifle). However, in doing so, we do not aim to explain individual
items to users, but rather highlight the important aspects of their
profile as a whole (i.e., by grouping tracks based on genres). That
way visualization could scale better and still provide an accurate
representation of global and user’s preferences.</p>
      <p>In comparing global and user’s preferences, we not only enable
comparisons between diferent categories, but also within the same
category between user and global profiles (i.e., within the same
genre, we highlight the diferences between user’s preferences and
global preferences). To further emphasize significant categories, in
addition to representing a range of categories, we also represent
interaction between these categories (i.e., when a track belongs to
more than one genre). This enables us to highlight a user’s most
familiar categories thereby increasing their trust in the
visualization. In the following sections we describe the design decisions
that went into the extraction of consumption data and creation of
visualizations.</p>
      <p>Figure 1 provides a brief overview of the stages involved in the
extraction and visualization of consumption pattern. Steps 1 &amp; 2
involve feature extraction and data collection respectively. Step 3
involves extracting global and local preferences using frequent
itemset mining algorithm. Once the global and local preferences are
extracted, visualizations are constructed to represent this data (step
4). The following sections describe in detail, the design decisions
that went into each of these stages.
For visualization, we categorize tracks based on their genre tags.
Genres provide a good collective representation of a user’s
preferences compared to other acoustic features (such as tempo, pitch
etc). Besides, users can easily relate to a genre-based categorization,
since it is used in existing recommender systems like Spotify.</p>
      <p>
        In addition to providing genre-level categorization between user
and global profiles, it is also important for the system to be able
to distinguish between items in the same genre, between user and
global profiles. In order to achieve this, a second dimension is
added to the visualization. To select the most representative feature
we looked into the Million Song Dataset (MSD) which provides
a total of 55 features for each track, and we chose the feature
‘Artist hotness’. ‘Artist hotness’ is a value (0 to 1) assigned by MSD
for each artist, which corresponds to how much buzz the artist is
getting right now. This value is computed algorithmically based on
information derived from several undisclosed sources, including
mentions in the web, mentions in music blogs, music reviews, play
counts etc. In comparison to other features, artist hotness is proven
to provide a stable representation of user’s preferences [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
3.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Data extraction</title>
      <p>
        We used the Million Song Dataset [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], which is the largest available
music feature dataset containing audio features, song and artist
meta-data for a million contemporary music tracks. It is also the
only dataset that provides artist hotness value for tracks.
      </p>
      <p>
        To obtain global consumption pattern, we used one of the
complementary datasets of MSD, the ‘Taste Profile Subset’ (TPS) and
merged this with the MSD dataset. TPS dataset provides a list of
tracks listened by a number of users of last.fm, along with the play
count of these tracks. We retained users who listened to at least 20
tracks. The artist hotness values of these tracks were obtained by
merging TPS and MSD. Since MSD does not provide genre
information for tracks, we obtained this information from a third dataset
provided by tagtraum [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>To build a user’s individual profile, we obtained a specific user’s
real time music listening pattern from Spotify. Similar to global
proifle, this entails all the track preferences of the user, the
genre/genrecombinations, and artist hotness values of these tracks. We used
Spotify since it is the only API that provides all these three required
information for research. Besides, Spotify is one of the largest music
service providers, and hence it is relatively easy to find real users
for evaluation.
3.3</p>
    </sec>
    <sec id="sec-5">
      <title>Frequent genre-set extraction</title>
      <p>
        We applied frequent itemset mining algorithm (RElim, [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) in order
to obtain the most frequently listened genres/genre-combinations.
Frequent itemset mining algorithms work by identifying all
common sets of items in a given list, and it is used for discovering
regularities between frequently co-occuring items in large datasets.
      </p>
      <p>We used ‘Recursive Elimination Algorithm’ (RElim) provided
by the ‘pymining’ package of Python. For both global and user’s
profile, this algorithm gives a set of most frequently consumed
genres/genre-combinations and their frequency values (i.e., how
many times the item appears in the profile). By visualizing this
information we believe that we can enable users to compare their
consumption pattern with the global consumption pattern and
subsequently to identify the blind-spots in their profiles.</p>
      <p>RElim was parameterized at a minimum frequency value
(minimum support) of 2. This means that all itemsets that occur less than
two times will be eliminated from the global profile and user profile.
This support value was chosen to ensure a faster computation time
while still preserving significant genres.</p>
      <p>Table 1 shows the top 20 most frequent genre/genre-combinations
along with their (normalized) frequencies, for the global data set.
Certain genres (‘Rock’, ‘Pop’) are highly preferred globally
compared to others. We also notice that certain genre-combinations are
preferred more than other individual genres. For example,
‘Alternative, Rock’ has higher frequency compared to Rap or Metal. For each
of the top-20 genre/genre-combinations, we compute the average
artist hotness values of all the tracks listened in that genre (Table
1). For all the genres, the average artist hotness value lies closer to
the center (0.5) which accounts for the diverse music consumption
of users.
For our visualizations we represent the top-20 most frequent
genresets for user and global profiles. The choice of visualizations was
made based on their ability to represent all the required dimensions
(i.e., genre/genre-combinations, frequency of genres and average
of artist hotness values for each genre, for top 20 genres), to span
across global and user profiles, and to be able to represent all
required data points. We used scatterplot as our main visualization
and we compare the performance of scatterplot with the baseline
bar-line chart. In this section, we describe both these charts.</p>
      <p>3.4.1 Visualization 1: Scaterplot. Scatterplot is the type of chart
in which data is represented as a collection of points, with each
point having the value of its first variable determining its position
along the horizontal (x-) axis, and the second variable determining
its position along the vertical (y-) axis. Traditional scatterplots are
capable of representing only two dimensions, however, with the
inclusion of visual attributes such as color, size and shape, it is
possible to represent up to five dimensions.</p>
      <p>An example scatterplot used in our study is shown in Figure 2.
Here the size of the bubbles represent the frequency of the item sets.
So larger the bubble, higher the frequency of the genre
corresponding to that bubble. To distinguish between genres we use color
hues. The horizontal orientation of a bubble represents its average
artist hotness value and its vertical orientation represents whether
it belongs to the global profile or the user’s profile (‘yours’ label).
We also implemented a hover feature wherein on hovering over a
bubble, the genre corresponding to the bubble gets highlighted in
both global and user profile. This enables easy comparison between
both profiles. Furthermore, on hovering over a bubble, the genre
name, frequency and average artist hotness value corresponding
to the bubble gets displayed. From the given visualization, we can
infer the following:
(1) For the given user, Pop is the most frequently consumed
genre, since it corresponds to the largest bubble under ‘yours’
category of vertical axis.
(2) Pop is also highlighted under the global category, which
means that it is also globally one of the most (but not the
most) frequent genre(s).
(3) The user prefers more popular artists compared to the
average user of the system since the user’s bubbles are generally
aligned more towards the right.</p>
      <p>
        3.4.2 Visualization 2: Bar-line chart. We compare the
performance of scatterplot with the base-line visualization bar-line chart.
Bar-line chart is a combination of bar chart and line chart and it
can represent up to three variables. A bar chart based visualization
was chosen as the base-line for the following reasons:
(1) It is proven to be the most compelling and persuasive means
to convey explanations in recommender systems [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
(2) It is used in existing recommender systems such as
MovieLens1 to represent user’s ratings across genres, and
frequency of ratings (Figure 3).
We performed an online evaluation of our system to compare the
efectiveness of visualizations, and to study changes in user’s
preferences. For ease of explanation, we divide our evaluation process
into two conceptual stages: Stage 1 - where we evaluate user’s
understanding of visualizations, and Stage 2 - where we observe
a user’s music exploration pattern after they are exposed to their
blind-spots. It is important to note here that this classification is
introduced solely for the purpose of better representation of concepts,
and from participant’s perspective the whole evaluation process
is staged as a single experimental session. In the following
sections, we explain the experimental design and research hypotheses
for Stage 1 and Stage 2 in Sections 4.1 and 4.2 respectively. We
then brief about the materials (Section 4.3) and detailed procedures
(Section 4.4) involved in the study.
1https://movielens.org/, retrieved June 2018
      </p>
      <p>Stage 1: To study the understanding of
visualization
4.1.1 Design. For stage 1 of our evaluation, we used a
withinsubjects repeated measures design, where each participant was
presented with both scatterplot and bar-line chart. In order to
minimize order efects we performed counterbalancing by changing the
order of visualization for each participant.</p>
      <p>4.1.2 Independent variable. For each user we show both types
of visualizations (bar-line chart and scatterplot), and study the
efectiveness of each of these visualizations in increasing the
understanding of a user’s consumption pattern and blind-spots. Hence
type of visualization is our independent variable.</p>
      <p>4.1.3</p>
      <p>Dependent variables.
(1) Correctness of understanding: Understandability of a
visualization is measured by asking users to answer questions
about information represented in the visualization. These
questions test a user’s understanding of their consumption
pattern and their blind-spots.
(2) Confidence : In addition to measuring user’s actual
understanding, we also measure the perceived ease of
understanding for both the visualizations. These are self-suggested
conifdence scores provided by the user for each question about
their consumption pattern and blind-spots. It says how
conifdent the users are in the answers they provide.
4.1.4</p>
      <p>Hypotheses.
• H1: Users are able to answer questions about their
consumption pattern more accurately with scatterplot than with
barline chart.
• H2: Users have more confidence in their answers about their
consumption pattern for scatterplot more than bar-line chart.
• H3: Users are able to answer questions about their
blindspots more accurately with scatterplot than with bar-line
chart
• H4: Users have more confidence in their answers about their
blind-spots for scatterplot more than bar-line chart.
4.2</p>
      <p>Stage 2: To study user’s music exploration
4.2.1 Design. For stage 2, we perform a simple correlation
analysis to study the relation between a user’s understanding of
their profile and their music exploration pattern.</p>
      <p>4.2.2 Independent variable. For all users, we measure if their
understanding in their profile has an impact on their exploration of
blind-spot genres. Hence a user’s correctness of understanding
is the independent variable. This value is directly computed for
each user as a dependent variable in Stage 1 (Section 4.1.3).</p>
      <p>4.2.3 Dependent variable. Exploration factor: Exploration
factor is measured for each user by computing the proportion of tracks
the user has explored in their blind-spot category, and it quantifies
a user’s exploration in that category.</p>
      <p>4.2.4</p>
      <p>Hypothesis.
• H5: Users who score higher for their questions about their
blind-spots, explore their blind-spot genres more.
(a) All users: ‘Global’
(b) User’s individual profile: ‘Yours’.
Visualizations were designed using D3.js Javascript visualization
library 2. The online interfaces for web-based survey were developed
using Python Flask framework3.
4.4</p>
    </sec>
    <sec id="sec-6">
      <title>Procedure and Tasks</title>
      <p>Each participant goes through six steps to complete the experiment.
Participants start by providing their basic demographic
information (Step 1) after which, they log in with their Spotify account
(Step 2). From the user’s account, we collect his/her top 50 tracks
using spotify’s API. We then use frequent pattern mining algorithm
(Section 3.3) on the genres of these tracks to compute the user’s top
20 frequent genres/genre-combinations, and their average artist
hotness values.</p>
      <p>In the next two steps, users are presented with each of the two
visualizations (bar-line chart and scatterplot), accompanied by a set
of instructions on how to read the visualization. After a minimum
bufer time of 1 minute to read and understand the visualization,
questionnaires are shown below the visualization for the users to
answer. The questionnaire is designed in such a way that, for each
visualization, they evaluate user’s understanding of the system in
all four aspects - global consumption pattern, user’s consumption
pattern, user’s blind-spots and artist hotness values. More
particularly, we ask users to identify - the top first and second highest
consumed genres (globally and in their profile, i.e., 2x2 = 4
questions), their top first and second highest blind-spots (i.e., genres
with high frequency in global profile but not found in their profile, 2
questions) and the artist hotness values of the all genres chosen for
these questions (6 questions). In order to reduce the learning efect,
we split the above 12 questions, and performed counterbalancing
to assign half of the questions for each chart. For each question,
the user is also asked to provide their confidence in their answers
in a 5-point Likert scale.</p>
      <p>Once users examine both visualizations, in the next step, we
study user’s music exploration pattern. We provide an interface
where users can listen to music from diferent genres and genre
2https://d3js.org, retrieved March 2018
3http://flask.pocoo.org, retrieved March 2018
combinations from their blind-spot and frequent genre categories.
More specifically, users are asked to select one or more genres to
listen to from these categories. Based on their chosen genres, songs
are recommended using Spotify’s recommendation API. Users are
asked to listen to tracks that they find interesting, and if they like
any track they are asked to "add" it to their list. Our interface (Figure
6) was inspired by Spotify’s old exploration interface (Figure 5).
We use color coding to diferentiate user’s frequent and blind-spot
genres (green = frequent, red = blind-spots).</p>
      <p>Once users have listened and rated songs for at least five
genre/genrecombinations, in the final-step users fill-out a post-stage assessment
survey. Here users are provided with a set of questions to test their
overall impression of the visualizations used in the study, with
respect to their perceived - ease of understanding, ease of interaction,
usefulness and interest. The answers are collected in a five-point
likert scale.
5</p>
    </sec>
    <sec id="sec-7">
      <title>RESULTS</title>
      <p>In this section, we summarize the results of our online experiment
with respect to our proposed hypotheses.
5.1</p>
    </sec>
    <sec id="sec-8">
      <title>Participants</title>
      <p>There were a total of 23 participants. 83% of the participants (n =
19) were male and 17% female (n = 4). Participants were between
age-groups 19-35. 20 participants had computer science background
(PhD and MSc). They all had diverse music backgrounds and music
consumption behavior (Figure 7).</p>
    </sec>
    <sec id="sec-9">
      <title>Understandability 1: Genres</title>
      <p>Participants were asked to identify their first and second most
consumed genres. Understandability was measured by how accurately
participants could identify these genres. For each answer, a score
was provided based on its correctness. For example, when
answering about their first most consumed genre, a score of 1 is assigned
if the answer is right, a score of 0.5 is assigned if the participant
provided the name of their second most consumed genre, and a
score of 0 is assigned for all other wrong answers.</p>
      <p>The average scores for all participants for identifying their first
and second most consumed genre, and the artist hotness values of
these genres, is given in table 2. The diference in the mean scores
are not statistically significant (Mann-Whitney U-test at p&lt;0.05).
Thus the results provide no support for hypothesis 1, which stated
that participants would be able to answer questions about their
consumption pattern more accurately with scatterplot than with
bar-line chart.
5.3</p>
    </sec>
    <sec id="sec-10">
      <title>Confidence 1: Genres</title>
      <p>Participants were asked to provide their confidence values in their
answers for identifying their first and second most consumed
genres. The average scores are summarized in Table 3. The trends show
that the participants had higher confidence with bar-line chart for
identifying their first most consumed genre. For their second most
consumed genre, they had higher confidence with scatterplot.
However, the results of statistical tests show that the obtained scores are
significant for identification of artist hotness values of ifrst most
consumed genre (Mann-Whitney U-test, U-value = 29, p&lt;0.05).
Hypothesis 2 predicted that participants will have higher
confidence for answers about their genres with scatterplot more than
bar-line chart. The trend is significant in the reverse direction and
the hypothesis is consequently discarded.
Participants were asked to identify their first and second highest
blind-spots. For each answer, a score of 1 is assigned if the answer is
right, a score of 0.5 is assigned if the participant provided the second
best answer, and a score of 0 is assigned for all other answers.</p>
      <p>The average scores for all participants for identifying their first
and second highest blind-spot and their artist hotness values are
shown in Table 4. The average scores are slightly higher for
scatterplot than for bar-line chart, but the results are not statistically
significant. Hence hypothesis 3, which stated that participants would
be able to answer questions about their consumption pattern more
accurately with scatterplot than with bar-line chart, is not
conifrmed .
Participants were asked to provide their confidence values for their
answers about their first and second highest blind-spots. The
average scores are summarized in Table 5. The trends show that the
participants had higher confidence with scatterplot for
identifying their first most consumed genre. For their second most
consumed genre, they had higher confidence with bar-line chart. The
observed trends are significant for identification of artist hotness
values (Mann-Whitney U-test, U-value = 33 at p&lt;0.05 for artist
hotness of first highest blind-spot, and U-value = 16.5 at p&lt;0.05 for
artist hotness of second highest blind-spot). Hypothesis 4 predicted
that participants will have higher confidence for answers about
their blind-spots with scatterplot more than bar-line chart. The
trend is significant in the both directions, and the hypothesis is not
confirmed .
In the exploration stage, participants were asked to explore music
from their frequent and blind-spot genres/genre-combinations.
Hypothesis 5 states that users who have higher understanding of their
profile explore their blind-spot genres more. For each user, an
exploration factor (EFb s ) was computed to quantify their exploration
in their blind-spot genres:</p>
      <p>EFb s = Nb s * wb s ,
where Nb s is the number of genres listened in blind-spot category,
and wb s is the number of tracks listened in each of these genres. We
compared this exploration factor with the user’s understanding of
their consumption pattern and blind-spots (obtained from their total
scores for all their answers, from Stage 1 evaluation - Section 4.1). A
positive Spearman’s correlation of 0.44 was obtained between user’s
understanding of their profile and their exploration in blind-spot
genres (Significant at p&lt;0.05). Thus hypothesis H5 is confirmed .
5.7</p>
    </sec>
    <sec id="sec-11">
      <title>Post-hoc analysis</title>
      <p>We did a post-hoc analysis to confirm that the positive correlation
obtained between a user’s exploration in blind-spot category and
their understanding of their profile (Section 5.6) is exclusive, and not
observed in frequent and bridge (i.e., frequent + blind-spot
combination) categories. The results of correlation analysis for frequent and
bridge categories are shown in Table 6. The results state that user’s
understanding of their profile has a negative correlation (p&lt;0.05)
with their exploration in frequent category and an insignificant
weak positive correlation with bridge category. This observation
implies that the positive correlation between user’s exploration and
their understanding is exclusive to blind-spot category.
In the first stage of our evaluation (Section 4.1) we aimed to
understand the efectiveness of visualizations at conveying information
about (a) user’s consumption pattern, and (b) user’s blind-spots.</p>
      <p>The correctness scores show that conventional bar-line chart is
better at conveying information that is explicit about user’s
proifles (i.e., information about consumption pattern). For conveying
information about blind-spots, or implicit information, scatterplot
obtained higher scores. But the obtained results were not significant,
and therefore, we did a post-hoc analysis on user’s comments for
each of the visualization. We found that a large number of the users
agreed that bar-line charts were easier to get detailed information
(8 users agreed and no one disagreed), while scatterplot was easier
for comparison of their profile with global profile (9 users agreed
and no one disagreed). This reasoning supports the scores obtained
for both the charts, especially for scatterplot for the identification
of blind-spots, since, the ability of a chart to compare global and
individual profile is significant for blind-spot recognition.</p>
      <p>In Stage 2 of evaluation (Section 4.2), we study the impact of
user’s understanding of their profile, on their intention to explore
blind-spot genres. A positive correlation concluded that users who
are more aware of their profile tend to explore their blind-spot
genres more. Furthermore, the results of post-hoc analysis
(Section 5.7) showed that the observed positive correlation (between
user’s exploration in blind-spot category and their understanding)
is exclusive to blind-spot category, and not observed in frequent
or bridge (frequent + blind-spot combination) categories, thereby
further reinforcing the fact that users with higher understanding
of their profile explore their blind-spot category more.</p>
      <p>Additionally, during exploration, we found that users show
interest in mixing genres from their frequent and blind-spot categories
(i.e., bridge genres), to discover new songs. The total number of
genres that users explored in bridge category is almost as high as
the number of genres explored in purely frequent or blind-spot
genres/genre-combinations (Table 7). This suggests that,
irrespective of their understanding in their profile, users are equally inclined
to combined genres from diferent categories. During exploration
phase, we used diferent color codes to distinguish between
frequent and blind-spot genres. This might have stimulated an urge
among users to combine genres from these two categories.</p>
    </sec>
    <sec id="sec-12">
      <title>6.1 Limitations</title>
      <p>In this section, we delineate the limitations and delimitations of
our system which restrict the scope of our results.</p>
      <p>Firstly, when it comes to the data used in our experiment, the
global consumption data obtained from Million Song Dataset’s taste
profile subset (TPS), is available only until the year 2011. There is
no known way to extract data beyond this time period, and hence
it is quite possible that recent changes in trends are not reflected in
our global profile. Secondly, for comparison of visualizations, we
only compare between the scores of bar-line chart and scatterplot.
However, there could be other visualizations that obtain higher
scores than these two visualizations. Future studies could focus on
exploring better means of representation.</p>
      <p>Finally, when studying the correlation between user’s
exploration and their understanding, our study is restricted to user’s
exploration at that specific point during the experiment. Neither
do we confirm if users continue to explore diverse music, nor do
we consider the impacts of contextual factors such as user’s mood,
time of the day etc. In future work, these factors should be taken
into account.</p>
    </sec>
    <sec id="sec-13">
      <title>7 CONCLUSIONS AND FUTURE WORK</title>
      <p>Recommender systems continue to inform our beliefs and opinions
as they influence the information we consume in the world around
us, ranging from the music we listen and movies we watch, to the
news we read and food we consume. This raises the bar in terms
of the ethics of responsible recommendation, and if recommender
systems are to earn our trust then they must help us understand
why certain suggestions are being made and why others are not.
We have presented a user-centered study to assess the efectiveness
of a visualizations to improve human decision making. The results
suggest that users can understand the two visualizations, and that
these visualizations are efective for helping users to identify their
consumption blind-spots.</p>
      <p>Furthermore, on studying users’ exploration pattern we found
that users who have more understanding of their profile, also
actively explore their blind-spots more. Together, our findings suggest
that it is possible to break a user’s filter-bubble by increasing a user’s
awareness of their choices, and providing user control to explore
new item-sets.</p>
      <p>In our future work, we will learn to detect a user’s exploration
preferences and incorporate this information to refine our
recommendations. Our first step will be to diferentiate between content
that a user is not consuming because they are not aware of it, from
content that the user does not engage with because they are not
interested. We also plan to continue this work in other domains
than music, such as news recommendations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Zeinab</given-names>
            <surname>Abbassi</surname>
          </string-name>
          , Vahab S. Mirrokni, and
          <string-name>
            <given-names>Mayur</given-names>
            <surname>Thakur</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Diversity Maximization Under Matroid Constraints</article-title>
          .
          <source>Technical Report</source>
          . Department of Computer Science, Columbia University.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Gediminas</given-names>
            <surname>Adomavicius and YoungOk Kwon</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>24</volume>
          (
          <year>2011</year>
          ),
          <fpage>896</fpage>
          -
          <lpage>911</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Eytan</given-names>
            <surname>Bakshy</surname>
          </string-name>
          , Solomon Messing, and
          <string-name>
            <surname>Lada</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Adamic</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Exposure to Ideologically Diverse News and Opinion on Facebook</article-title>
          .
          <source>Science</source>
          <volume>348</volume>
          ,
          <issue>6239</issue>
          (
          <year>2015</year>
          ),
          <fpage>1130</fpage>
          -
          <lpage>1132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Derek</given-names>
            <surname>Bridge</surname>
          </string-name>
          and John Paul Kelly.
          <year>2006</year>
          .
          <article-title>Ways of Computing Diverse Collaborative Recommendations</article-title>
          .
          <source>In Adaptive Hypermedia and Adaptive Web-based Systems</source>
          .
          <volume>41</volume>
          -
          <fpage>50</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Borgelt</surname>
            <given-names>C.</given-names>
          </string-name>
          <year>2005</year>
          .
          <article-title>Keeping things simple: finding frequent item sets by recursive elimination</article-title>
          . (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Michael</surname>
            <given-names>D</given-names>
          </string-name>
          <string-name>
            <surname>Conover</surname>
            , Bruno Gonçalves, Alessandro Flammini, and
            <given-names>Filippo</given-names>
          </string-name>
          <string-name>
            <surname>Menczer</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Partisan asymmetries in online political activity</article-title>
          .
          <source>EPJ Data Science</source>
          <volume>1</volume>
          ,
          <issue>1</issue>
          (
          <year>2012</year>
          ),
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Seth</given-names>
            <surname>Flaxman</surname>
          </string-name>
          , Sharad Goel, and
          <string-name>
            <surname>Justin</surname>
            <given-names>M</given-names>
          </string-name>
          <string-name>
            <surname>Rao</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Filter bubbles, echo chambers, and online news consumption</article-title>
          .
          <source>Public Opinion Quarterly</source>
          <volume>80</volume>
          ,
          <issue>S1</issue>
          (
          <year>2016</year>
          ),
          <fpage>298</fpage>
          -
          <lpage>320</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Schreiber</surname>
            <given-names>H.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Improving Genre Annotations for the Million Song Dataset</article-title>
          . (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Jonathan</surname>
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Herlocker</surname>
          </string-name>
          , Joseph A.
          <string-name>
            <surname>Konstan</surname>
            ,
            <given-names>and John</given-names>
          </string-name>
          <string-name>
            <surname>Riedl</surname>
          </string-name>
          .
          <year>2000</year>
          .
          <article-title>Explaining collaborative filtering recommendations</article-title>
          .
          <source>In ACM conference on Computer supported cooperative work</source>
          .
          <volume>241</volume>
          -
          <fpage>250</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Li</surname>
            <given-names>D Hu Y.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Evaluation on Feature Importance for Favorite Song Detection</article-title>
          . (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Yucheng</surname>
            <given-names>Jin</given-names>
          </string-name>
          , Nava Tintarev, and
          <string-name>
            <given-names>Katrien</given-names>
            <surname>Verbert</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Efects of Individual Traits on Diversity-aware Music Recommender User Interfaces</article-title>
          .
          <source>In UMAP.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Jugovac</surname>
          </string-name>
          , Dietmar Jannach, and
          <string-name>
            <given-names>Lukas</given-names>
            <surname>Lerche</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Eficient optimization of multiple recommendation quality factors according to individual user tendencies</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>81</volume>
          (
          <year>2017</year>
          ),
          <fpage>321</fpage>
          -
          <lpage>331</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Ellis DP Lanckriet GR McFee</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertin-Mahieux</surname>
            <given-names>T.</given-names>
          </string-name>
          <year>2012</year>
          .
          <article-title>The million song dataset challenge</article-title>
          . (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Sayooran</given-names>
            <surname>Nagulendra</surname>
          </string-name>
          and
          <string-name>
            <given-names>Julita</given-names>
            <surname>Vassileva</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Understanding and controlling the filter bubble through interactive visualization: a user study</article-title>
          .
          <source>In Proceedings of the 25th ACM conference on Hypertext and social media. ACM</source>
          ,
          <volume>107</volume>
          -
          <fpage>115</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Tien</surname>
            <given-names>T Nguyen</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pik-Mai Hui</surname>
            ,
            <given-names>F Maxwell</given-names>
          </string-name>
          <string-name>
            <surname>Harper</surname>
          </string-name>
          ,
          <source>Loren Terveen, and Joseph A Konstan</source>
          .
          <year>2014</year>
          .
          <article-title>Exploring the filter bubble: the efect of using recommender systems on content diversity</article-title>
          .
          <source>In Proceedings of the 23rd international conference on World wide web. ACM</source>
          ,
          <volume>677</volume>
          -
          <fpage>686</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Eli</given-names>
            <surname>Pariser</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>The filter bubble: What the Internet is hiding from you</article-title>
          .
          <source>Penguin Books.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Barry</given-names>
            <surname>Smyth</surname>
          </string-name>
          and
          <string-name>
            <given-names>Paul</given-names>
            <surname>McClave</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Similarity vs</article-title>
          .
          <source>Diversity. In 4th International Conference on Case-Based Reasoning.</source>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Nava</surname>
            <given-names>Tintarev</given-names>
          </string-name>
          , Shahin Rostami, and
          <string-name>
            <given-names>Barry</given-names>
            <surname>Smyth</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Knowing the unknown: visualising consumption blind-spots in recommender systems</article-title>
          .
          <source>In Proceedings of the 33rd Annual ACM Symposium on Applied Computing, SAC</source>
          <year>2018</year>
          , Pau, France,
          <source>April 09-13</source>
          ,
          <year>2018</year>
          .
          <fpage>1396</fpage>
          -
          <lpage>1399</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Saúl</given-names>
            <surname>Vargas</surname>
          </string-name>
          and
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Rank and relevance in novelty and diversity metrics for recommender systems</article-title>
          .
          <source>In Proceedings of the fifth ACM conference on Recommender systems. ACM</source>
          ,
          <volume>109</volume>
          -
          <fpage>116</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Cai-Nicolas</surname>
            <given-names>Ziegler</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sean M. McNee</surname>
            ,
            <given-names>Joseph A.</given-names>
          </string-name>
          <string-name>
            <surname>Konstan</surname>
            , and
            <given-names>Georg</given-names>
          </string-name>
          <string-name>
            <surname>Lausen</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Improving Recommendation Lists Through Topic Diversification</article-title>
          . In WWW'
          <volume>05</volume>
          .
          <fpage>22</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>