<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ACM
UMAP</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Personalized Next-Track Music Recommendation with Multi-dimensional Long-Term Preference Signals</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Iman Kamehkhosh</string-name>
          <email>iman.kamehkhosh@ tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dietmar Jannach</string-name>
          <email>dietmar.jannach@ tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lukas Lerche</string-name>
          <email>lukas.lerche@ tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Music Recommendation; Personalization; Evaluation</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>TU Dortmund</institution>
          ,
          <addr-line>44227, Dortmund</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <volume>16</volume>
      <abstract>
        <p>The automated generation of playlists given a user's most recent listening history is a common feature of modern music streaming platforms. In the research literature, a number of algorithmic proposals for this \next-track recommendation" problem have been made in recent years. However, nearly all of them are based on the user's most recent listening history, context, or location but do not consider the users' long-term listening preferences or social network. In this work, we explore the value of long-term preferences for personalizing the playlist generation process and evaluate di erent strategies of applying multi-dimensional user-speci c preference signals. The results of an empirical evaluation on ve di erent datasets show that although the short-term listening history should generally govern the next-track selection process, long-term preferences can measurably help to increase the personalization quality.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Due to the massive volume of online music, relatively large
personal music collections, and the dominant proportion of
long tail items in the music domain [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], music recommender
systems have gained popularity over time. Nowadays, music
recommendation is a typical feature of web platforms and
player applications like iTunes, Deezer or Spotify.
      </p>
      <p>Generally, music recommender systems can be applied to
recommend di erent music-related items such as albums,
artists, or concerts. In this work, we focus on a speci c
type of music recommendation: given a history of recent
tracks played by a user, our goal is to recommend a
(personalized) list of tracks to be played next. We refer to such
recommendations as \next-track recommendations", which
can both be used to generate playlist continuations and to
implement virtually endless radio stations given a number
of seed tracks.</p>
      <p>
        In recent years, a number of next-track recommendation
algorithms have been proposed, among them in particular
context-aware or location-aware ones or approaches that use
musical features, meta-data, or social tags [
        <xref ref-type="bibr" rid="ref15 ref25 ref9">9, 15, 25</xref>
        ].
However, even though long-term user preferences intuitively play
a key role in personalizing the recommendations, nearly all
the playlist generation approaches reviewed in the recent
surveys in [
        <xref ref-type="bibr" rid="ref18 ref7">7, 18</xref>
        ] only consider the user's short-term
listening history.
      </p>
      <p>
        In this work, we therefore aim to explore the value of
long-term preferences for personalizing the next-track
recommendation process. More precisely, instead of basing the
recommendations solely on the sequence of seed tracks from
the users' recent listening histories, as done, e.g., in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
or [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], we incorporate multi-dimensional signals extracted
from the users' long-term preferences and their social
network into the recommendation process.
      </p>
      <p>In particular, we look at a user's long-term preferences
to identify and recommend (a) tracks that the user liked in
the past (track repetition), (b) tracks of artists that the user
liked in the past (favorite artists), (c) tracks that are
semantically similar to past liked ones (topic similarity ), and
(d) tracks that are often played together with those that
the user likes (track co-occurrence). In addition, we look for
(e) tracks and artists that the user's social friends liked and
consider them for recommendation. We use then a
multifaceted scoring scheme to incorporate this heterogeneous
information in the recommendation process.</p>
      <p>Our focus is to optimize an accuracy criterion and we
use the track hit rate (recall) to assess to which extent our
algorithms are capable of selecting tracks that would also
be chosen by the users. Our results show that several of the
considered personalization signals (e.g., track co-occurrence
patterns in the users' long-term preferences or favorite track
repetitions) can help to increase the hit rate. Since accuracy
is not the only quality criterion for music recommenders,
we will also discuss the e ect of di erent personalization
strategies on the diversity of the resulting recommendations
and the coherence of the recommendations with the user's
last played tracks.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>PERSONALIZED NEXT-TRACK</title>
    </sec>
    <sec id="sec-3">
      <title>RECOMMENDATION TECHNIQUES</title>
      <p>In this section, we present the details of the proposed
personalization approaches for next-track music
recommendation. A general assumption of our approaches is that in
typical applications the tracks that were most recently played
or most recently added to a playlist should primarily
govern the selection of the immediate next tracks.
Personalization can then be helpful to further adjust the ranking of the
tracks based on the user's long-term preferences or other
user-speci c signals. Recommending tracks of one of the
user's most favorite rock artists can, for example, generally
be a good strategy because many users tend to repeatedly
listen to their favorite artists. If the most recently played
tracks belong however to the genre \rap", recommending
rock hits { even though the user might generally like each
of them { might lead to a poor recommendation quality and
user experience.
2.1</p>
    </sec>
    <sec id="sec-4">
      <title>General Scoring Scheme and Terminology</title>
      <p>In our work, we use a general scoring scheme that
combines a baseline (main) algorithm { which focuses on the
short-term pro le { with personalization components. Given
a short-term listening history h (a sequence of tracks), our
goal is to determine a relevance score for each possible next
track t using di erent personalization signals.</p>
      <p>
        Similar to [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], we use a weighted scoring scheme to
combine the score of the baseline algorithm with additional
personalization scores. The overall relevance score scoreoverall
for a possible next track t given h is computed as
scoreoverall(h; t ) = wbase scorebase(h; t ) +
      </p>
      <p>X
pers2P
wpers scorepers(h; t )
(1)
where P is a set of personalization strategies, each with a
di erent weight wpers, and wbase is the weight of the
baseline algorithm. The scorebase and scorepers are functions to
compute the baseline score and the scores of the individual
personalization components, respectively. The selection of
the scores and their weights both depend on the available
data and the goals that should be achieved, see Section 2.3.6.</p>
      <p>Note that for listening logs, the term short-term history
(h) refers to the most recent (current) listening session of
a user and the long-term history (lth) refers to all listening
sessions of a user before the current session (h). For playlists,
the term short-term history (h) refers to a playlist beginning
and the long-term history (lth) refers to all other shared
playlist by the same user. Since the terms listening session
and playlist both represent sequences of tracks, we will use
them interchangeably in the algorithm descriptions.
2.2</p>
    </sec>
    <sec id="sec-5">
      <title>Short-Term Preference Model:</title>
    </sec>
    <sec id="sec-6">
      <title>Baseline Algorithm</title>
      <p>
        A comparison of various playlister algorithms in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] showed
that in particular for recommendation lists of shorter lengths
a k-Nearest-Neighbor-based (kNN) approach outperformed
other and often more complex algorithms, including Bayesian
Personalized Ranking (BPR) [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], in terms of accuracy for
this task. As we are interested in high-quality next-track
recommendations, we will use a kNN-based approach as the
baseline algorithm as done in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        kNN was also used as a baseline in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Small hit rate
increases were achieved at long list lengths by combining it
with tag-based track information. In [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] however, a very
small value for k = 10 was used. Since our experiments
show that larger values for k (k = 300) lead to signi cantly
higher hit rates, we will use kNN with k = 300 as a baseline.
      </p>
      <p>The kNN-based approach takes the recent listening
history h as an input and looks for other listening sessions in
the training data that contain the same tracks1. The main
assumption is that if there are additional tracks in a
similar past session, chances are good that these tracks suit the
current listening history h, too. We refer to these similar
sessions as \neighbors".</p>
      <p>Technically, given a short-term history h, we rst compute
the binary cosine similarity of h and the other sessions from
the training data. The similarity values are then sorted and
a set Nh of nearest neighbor sessions of h is determined. The
kNN score of a target track t is then computed as the sum
of the similarity values of h and neighbor sessions nbr 2 Nh
which contain t (Equation 2). Note that the indicator
function 1nbr(t ) returns 1 if nbr contains t and 0 otherwise.
scorekNN (h; t ) =</p>
      <p>simcosine(h; nbr) 1nbr(t ) (2)</p>
      <p>X
nbr2Nh
2.3</p>
    </sec>
    <sec id="sec-7">
      <title>Long-Term Preference Models:</title>
    </sec>
    <sec id="sec-8">
      <title>Personalization Approaches</title>
      <p>In the following sections we propose a number of ways to
compute the additional user-speci c relevance scores.
2.3.1</p>
      <sec id="sec-8-1">
        <title>Track Repetition</title>
        <p>
          Listening to one's favorite tracks repeatedly over time is
common in the music domain. To assess the importance
of repetitions, we examined the listening histories of 2,000
Last.fm and Twitter users as follows. Given the list of tracks
of a user's listening history T = [t1; t2; ; tn]; we measure
the proportion of repetitions Repp(u; T ) in the listening
history of a user u as proposed in [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]:
        </p>
        <p>Repp(u; T ) = 1
(3)
jTuniqj
jT j
where jTuniqj is the number of unique tracks in the user's
listening history and jT j is the length of his listening history.</p>
        <p>The results show that, on average, more than 25% of the
tracks that the users listen to are repeated tracks.
Therefore, recommending and playing tracks that the user has
listened to in the past is a simple but promising
personalized strategy. To operationalize this idea we have to answer
the questions whether or not to play known tracks and which
tracks to select.</p>
        <p>
          Using a coarse classi cation scheme, we could consider a
user { during a listening session { to be either in exploration
mode when she would like to discover new songs [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] or
in repetition mode when she prefers to listen to her favorite
tracks. To assess the general value of distinguishing between
the two modes, we use the following simple heuristic to
determine the user's mode and to decide whether or not to
include track repetitions: If a pre-de ned proportion of the
user's recent history (e.g., 50%) consists of tracks that the
user has played in the past, we assume that the user prefers
to listen to known tracks. More elaborate schemes are of
course plausible [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Di erent strategies are also possible to determine which
tracks from the user's long-term history to recommend. In
this paper, we examined the following approaches given a
recent listening history h:
1As mentioned above, we will use the terms \listening
history" or \listening session" in the following descriptions even
though in some of the experiments later on \other shared
playlists" of a user are meant.</p>
        <p>R1: Repeat tracks from the user's history that are generally
popular in the community in terms of play-counts.
CAGH score (Equation 5) and its application on the
longterm history lth of the user:
R2: Repeat tracks which have been listened to by the user
at the same time of the day as the tracks from h.
R3: Repeat tracks which have been performed by the same
artists as in h.</p>
        <p>R4: A weighted combination of the time-based (R2) and
artist-based (R3) strategies.</p>
        <p>To be able to combine these approaches with the baseline
method as described in Section 2.1, various ways of
computing a repetition score scorerep are possible. In our
experiments, we rst build a set Ru of repetition candidates
for each user u using one of the above mentioned strategies
(R1-R4). Given the current listening history h of u, we then
compute a score for a target track t based on the recency
of the past listening event and the number of times t has
been repeated in the user's long-term history.</p>
        <p>
          The experiments on di erent datasets reported in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]
reveal a typical tendency of users to re-consume or repeat more
recently-consumed items. Therefore, as a score, we use the
weighted relative recency of the last time point ts(t ) when
the target track t was played by the same user (Equation 4).
If a candidate track was not played by the user before, the
score is zero. Otherwise, it is the relation between ts(t ) and
the timestamp of the beginning of the current listening
session ts(h) weighted by the number of times the target track
has been repeated in the user's long-term history cntlth(t ).
To further increase the relative importance of the tracks in
the recent history, an exponential decay function can also
be applied as suggested in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>scorerep(h; t ) = cntlth(t )
1Ru (t )</p>
        <p>(4)
ts(t )
ts(h)
2.3.2</p>
      </sec>
      <sec id="sec-8-2">
        <title>Favorite Artists</title>
        <p>Music lovers typically have their favorite artists and online
services like Spotify quite obviously recommend tracks
performed by artists that a user played in the past. In the
proposed \Favorite Artists" personalization approach we adopt
this idea and assign higher preference scores to tracks by a
user's favorite performers. However, instead of only
considering artists that the user actually knows, we also consider
artists that are similar to the known ones.</p>
        <p>
          Technically, we extend the CAGH (Collocated Artists {
Greatest Hits) method from [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], which assigns higher scores
to the most popular tracks of artists that are similar to those
appearing in the recent listening history.
        </p>
        <p>
          scoreCAGH (h; t ) = X simartist(at ; b) cnt(t )
(5)
b2Ah
Ah is the set of artists in current history, cnt(t ) is the
number of occurrences of t in the training data, and at is the
artist of the target track. As a measure of similarity of two
artists simartist(at ; b), we count how often two artists
appear together in the sessions of the training set [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>To personalize the CAGH method, we extend it by also
taking into account artists that are similar to those that
were played in the user's long-term history. The
favoriteartist score scorefavA of a target track t with artist at is
then computed as a weighted combination of the original
scorefavA(h; t ) =</p>
        <p>scoreCAGH (h; t ) +
(1
)</p>
        <sec id="sec-8-2-1">
          <title>X simartist(at ; b) cnt(t )</title>
          <p>(6)
b2Alth
where Alth is the set of artists of the tracks in the long-term
history, at is the artist of the target track, and cnt(t ) as
well as simartist(at ; b) are computed as described above.
The weight factor is used to balance the relative
importance of the two factors.
2.3.3</p>
        </sec>
      </sec>
      <sec id="sec-8-3">
        <title>Topic Similarity</title>
        <p>
          Users of music websites like Last.fm are able to annotate
tracks with tags. Such tags often describe the genre, mood,
artist, or the release year of a track and these social tags
can be used for detecting the topic of a certain playlist [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
Our assumption is that at least some users have long-term
preferences to listen to tracks that relate to a certain topic
or tracks that have the same tags.
        </p>
        <p>
          In [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], it was shown that considering the topical match
of target tracks with the history (using tags from Last.fm)
can be helpful to improve the recommendation accuracy.
Two versions of a \content-based" recommendation method
using averaged TF-IDF (Term Frequency/Inverse Document
Frequency) vectors were proposed. One focused only on the
recent listening history and one also included the long-term
listening preferences.
        </p>
        <p>Given a recent history h, we compute a tag-based
similarity score scoretag as the cosine similarity of the averaged
TF-IDF vector of the recent history and the TF-IDF vector
of the target track t . In our work, we use the extended
version of the scoretag for personalization. We de ne the
topic-similarity score scoretopicSim as a weighted
combination of scoretag and its extended version considering the
long-term history as follows:
scoretopicSim(h; t ) =</p>
        <p>scoretag(h; t ) +
(1
) simcosine(TFlth; t~ )
where TFlth is the averaged TF-IDF vector of all playlists
(listening sessions) of the user's long-term history and t~ is
the vector representation of t . The weight factor is used
to balance the relative importance of the two factors.
2.3.4</p>
      </sec>
      <sec id="sec-8-4">
        <title>Extended Track Co-occurrences</title>
        <p>Similar to the long-term content-based model in Equation
7, we propose to extend the kNN-based scheme (Equation
2) in a way that it considers the user's long-term listening
history in combination with the recent history when
determining the nearest neighbors.</p>
        <p>In particular, for each listening session from the user's
long-term history s 2 lth, we determine a set Ns of similar
sessions (nearest neighbors) as described in Section 2.1. The
extended long-term kNN score kNNlth of a target track t is
then computed as the sum of the similarity values of all the
user's sessions s 2 lth and their respective neighbor sessions
nbr 2 Ns which contains t , i.e., 1nbr(t ) = 1 if nbr contains
t and 0 otherwise.
scorekNNlth (lth; t ) = X</p>
        <p>X</p>
        <p>simcosine(s; nbr) 1nbr(t )
s2lth nbr2Ns
(7)
(8)
wrep
wfavA</p>
        <p>The nal personalization approach explored here leverages
the listening preferences of the social friends of a user. Our
hypothesis is that the preferences and behavior of friends
can have an in uence on a user.</p>
        <p>In one of our experiments described in the next section,
we will therefore utilize the listening logs of a user's Twitter
friends2 and incorporate the favorite tracks of friends into
the next-track selection process. We consider a track as a
favorite, if a user listened to it at least n times.</p>
        <p>To compute the respective personalization score, we rst
build a set FFu of favorite tracks for the friends of u. For
a target track t , the Favorites-of-Friends score scorefof is
then a weighted sum of the occurrences of t in the favorite
tracks of the user's friends cntff (t ). A popularity weight
for each friend pop(f ) can be computed based on, e.g., the
number of the followers of each friend, which gives higher
relevance to the tracks of more \popular" friends.
scorefof (h; t ) =</p>
        <sec id="sec-8-4-1">
          <title>X cntff (t ) pop(f )</title>
          <p>(9)
ff2FFu
2.3.6</p>
        </sec>
      </sec>
      <sec id="sec-8-5">
        <title>Combining the Scores</title>
        <p>
          As mentioned above, we use a weighted scoring scheme to
compute the nal relevance score (Equation 1) as done in
[
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] and assume that the resulting personalization e ect will
lead to higher recommendation accuracy.
        </p>
        <p>Generally, the selection of the scores and their weights
both depend on the data that is available and the goals
that should be achieved. If, for example, discovery is the
main goal, including tracks from social friends might be
suitable. If the goal is to generate a thematic (e.g., mood-based)
playlist, a similarity-based approach seems more helpful.</p>
        <p>In this work, we systematically tested di erent values for
the weight of each score and selected the best ones (listed in
Table 1) according to the accuracy results on the test set.
In addition, to be able to combine the scores with di
erent scales, we apply zero-one normalization by rescaling the
scores to values between 0 and 1. Given a list of values E,
the normalized score for an element ei 2 E is computed as
normalized(ei) =
(10)
ei
Emax</p>
        <p>Emin</p>
        <p>Emin
where Emax and Emin are the maximum and minimum
values of E respectively.</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>EXPERIMENTAL EVALUATION</title>
      <p>We conducted a series of experiments to assess the value
of personalizing the next-track recommendation process and
the e ectiveness of the proposed techniques both in terms
of accuracy as well as artist diversity and coherence.
3.1</p>
    </sec>
    <sec id="sec-10">
      <title>Datasets</title>
      <p>We used three playlist datasets and two datasets that
contain listening logs of several thousand users.</p>
      <p>
        Playlists: The playlist datasets are obtained from three
di erent platforms. The Last.fm dataset was collected using
the public API of the service. The Art-of-the-Mix (AotM)
data was published by [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] and contains playlists by music
enthusiasts. The 8tracks dataset was shared with us by the
2Twitter friends are the users that the speci ed user is
following on Twitter.
8tracks platform. A particularity of the last dataset is that
each playlist can only contain two tracks per artist. To be
able to personalize the recommendations we created
subsamples of these datasets such that there are at least 4 playlists
for each user. We chose this comparably low threshold to
have at least 1,000 playlists in the smallest dataset.
      </p>
      <p>
        Listening logs: The listening log datasets are sampled
from two public datasets. First, we created a subset of
the #nowplaying (NP) dataset which contains music-related
tweets of users on Twitter3. Speci cally, we sampled users
who have posted at least 50 tweets over time. Second, we
used the recent 30Music dataset [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], which contains
listening sessions retrieved from Internet radio stations through
the Last.fm API. Similar to the playlist datasets, we created
subsamples of these datasets in a way that there are 9
listening sessions for each user. Having su cient past sessions
allows us to repeatedly apply the sliding-windows evaluation
procedure as described in the next section.
      </p>
      <p>The listening logs provide the timestamps of each
listening event, which allows us to apply a time-based repetition
scheme as described in Equation 4. Furthermore, the
#nowplaying dataset includes the Twitter IDs of the users. We
used this information to retrieve the friends of the users on
Twitter and explore the e ect of our proposed social score
on this dataset. Table 2 summarizes the statistics of the
used datasets. All datasets used in the experiments except
the non-public one from 8tracks are available online4.
3.2</p>
    </sec>
    <sec id="sec-11">
      <title>Metrics and Measurement Method</title>
      <p>
        We use the accuracy measurement protocol that was also
used in [
        <xref ref-type="bibr" rid="ref15 ref6 ref7">6, 7, 15</xref>
        ]. The data is split into training and test sets
and we then hide the last track of each playlist or listening
session in the test set. The goal is to predict this last hidden
track. We generate recommendation lists of length 100 and
a \hit" is counted when the hidden track was in the top-100
list. Therefore, the hit rate is the fraction of playlists in the
3See http://dbis-nowplaying.uibk.ac.at for more details.
4http://ls13-www.cs.tu-dortmund.de/homepage/ifup2016/
datasets.zip
test set for which the hidden track was found. Besides the
hit rate we also report the Mean Reciprocal Rank (MRR),
which takes the position of the hit into account. Note that
in the literature on the generation of playlists and virtually
endless radio stations top-n lists of this and even much larger
sizes are common [
        <xref ref-type="bibr" rid="ref15 ref6 ref7">6, 7, 15</xref>
        ].
      </p>
      <p>
        We furthermore assess the diversity and coherence of the
resulting recommendations based on the artists and, where
applicable, based on the tags of the tracks. As done in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ],
we use the inverse Intra-List-Similarity (ILS) [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] to quantify
the diversity level. The overlap of artists (tags) in the
history and the next-track recommendations is used as a proxy
to assess the coherence level.
      </p>
      <p>For the playlist experiments, we report the results of a
four-fold cross-validation procedure. For the time-ordered
listening logs, we use a four-fold sliding-window validation,
in which the newest session of each user is used for testing
and the 5 previous sessions for training (windowsize = 5).
After each iteration, the window is moved one session toward
the previous sessions (stepsize = 1).
3.3</p>
    </sec>
    <sec id="sec-12">
      <title>Evaluation Results</title>
      <p>We will rst focus on the e ects of the individual
personalization strategies before we report selected results of
combining di erent scores. We use a signi cance level of
p = 0:05 for all statistical tests (Student's t-tests)
throughout the section. Results that are statistically signi cantly
di erent from the baseline are printed in bold face along
with an up/down arrow which indicates whether the
respective value is higher/lower than the baseline value.
3.3.1</p>
      <sec id="sec-12-1">
        <title>Track Repetition</title>
        <p>Reusing the same track in more than one shared playlist
is comparably uncommon. We therefore focus on the value
of repeated recommendations of known tracks based on the
listening logs, for which we also have timestamp
information. Table 3 shows the results for the di erent repetition
strategies (R1 - R4) proposed in Section 2.3.1.</p>
        <p>To highlight the e ect of making repeated
recommendations, Table 3 shows the evaluation results for those sessions,
for which we assumed that the user is mainly listening to
already known tracks, i.e., when more than 50% of the recent
tracks are repetitions. On average, this was the case for
about 22% of the sessions in the #nowplaying dataset and
about 15% of the sessions in the 30Music dataset.</p>
        <p>The results clearly show that if our heuristic assumes that
the user's listening mode is repetition (and not exploration)
and we then correspondingly increase the scores of already
known tracks, the accuracy of the recommendations can be
signi cantly enhanced. All the proposed strategies for
selecting the candidate tracks for repetition led to an increase
of the accuracy measures compared to the baseline of up to
15%. When applying the measurement to all sessions
(regardless if the user is in repetition or exploration mode) the
e ects are less pronounced but we still observe a statistically
signi cant improvement over the baseline. For the
#nowplaying dataset, applying the hybrid strategy to all sessions
increases the hit rate from 21% to 27%, and for the 30Music
dataset from 17% to 21%. Overall, the hybrid method (R4)
that combines both the time aspect as well as artist
information led to the best accuracy results for both datasets5.
More elaborate schemes, e.g., to detect the user's listening
mode, are of course possible, but in the context of this work
we are more interested in the general usefulness of including
repetitions in the recommendations.</p>
        <p>In regard to diversity (inverse ILS) and coherence
(overlap), repeating popular tracks from the user's long-term
history (R1) leads to more diverse recommendation lists in
terms of artists. In contrast, the artist-based strategy (R3)
for nding the candidate tracks by design decreases the artist
diversity and increases the coherence of the
recommendations with the listening history. Even though the
popularitybased strategy (R1) seems to be e ective in terms of
accuracy as well, it can be risky in a sense that recommended
next tracks have little to do with the last few tracks, which
might lead to a decreased quality perception.
3.3.2</p>
      </sec>
      <sec id="sec-12-2">
        <title>Favorite Artists</title>
        <p>Table 4 shows the accuracy (upper part) and diversity
and coherence results (lower part) for the personalization
strategy which assigns higher scores to tracks by the
favorite artists of a user. Our focus is again on the listening
log datasets, but we also report the results for the playlist
datasets to analyze if the proposed techniques work for this
problem setting as well.</p>
        <p>The obtained results show that including long-term artist
preferences is bene cial in terms of the hit rate in all cases
except for the AotM dataset, where we only observed a small
but not signi cant improvement. This phenomenon can be
explained by the nature of the AotM platfom, where artist
\reuse" by users is very infrequent.</p>
        <p>Again by design, the artist diversity (inverse ILS)
decreases when the favorite artists are played more often. The
favorite-artists scheme also leads to a stronger coherence
(overlap) of the next tracks with the most recently listened
to tracks. Note that the desirable level of diversity and
coherence of the recommendations depends on the domain and
the goals that should be achieved.
3.3.3</p>
      </sec>
      <sec id="sec-12-3">
        <title>Topic Similarity</title>
        <p>Table 5 shows the results when we bias the
recommendations towards tracks that have a \topic" that often appeared
in the user's long-term history. Again, this personalization
approach based on social tags can help to measurably
increase the recommendation accuracy.</p>
        <p>Since we used social tags as the personalization source in
5For the hybridization, we systematically tested di erent
weights and selected the best ones according to accuracy.
For #nowplaying, the ratio of R2 to R3 is 3:7 and for
30Music this ratio is 1:1.
this experiment, we exceptionally report here the tag
diversity (inverse ILS) and coherence (overlap) of the
recommendations in Table 5. The tag diversity by design decreases as
we are looking for tracks with similar tags. At the same time
a better coherence of the next-track recommendations can
be achieved with this approach. As there is no tag
information available for the listening logs, we could not repeat the
measurement for the remaining two datasets. Our
conjecture is that the e ects might be less pronounced for listening
logs, because playlists are often carefully designed around a
certain \theme", which is advantageous for a content-based
technique.
3.3.4</p>
      </sec>
      <sec id="sec-12-4">
        <title>Extended Track Co-occurrences</title>
        <p>In Table 6, we report the results of considering a user's
long-term history to nd similar listening sessions
(neighbors). Considering the users' long-term preferences in this
way again leads to an increase in accuracy. Furthermore,
due to the richer user pro le, the recommended tracks
become more diverse in terms of the artists. The exceptions are
the Last.fm and 30Music datasets. One explanation might
be that many of the shared playlists and the listening logs
from Last.fm contain only one or very few artists. Therefore,
considering the long-term preferences of the users can
further limit the number of artists of the recommended tracks.
Note that the 30Music listening logs are also collected from
Last.fm. In terms of coherence, the recommendations are
often less connected to the current listening context.
.125
.206
.241"
.199#
.273
.100
.090
.379
.564
.611
.132
.218"
.319"
.254"
.302"</p>
        <p>In this approach we consider the recommendation of
favorite tracks of the social friends as a means for
personalization. In the particular measurement reported here, we
considered a track as a favorite if a user listened to it at
least 5 times.</p>
        <p>The results in Table 7 show the positive e ect on accuracy
of including these tracks in the recommendations for the
#nowplaying dataset, which is the only dataset for which
information about social friends was available. The obtained
e ects on the hit rate are small but signi cant. We still
consider the result promising given that for more than 30%
of the users no Twitter friends could be identi ed who also
posted tracks.
3.3.6</p>
      </sec>
      <sec id="sec-12-5">
        <title>Combining the Scores</title>
        <p>So far, we have reported the e ects of applying the
different personalization strategies in isolation. Depending on
the available data, di erent scores can be combined
(Equation 1). We have run a variety of additional experiments
using di erent weights to analyze the e ects of combining
the scores. Similar to the individual strategies, we
systematically tested di erent values for the weight of each score {
this time in combination with all other scores { and selected
the best ones (listed in Table 8) with respect to accuracy.
In this section, we discuss the results of these combinations
reported in Table 9.</p>
        <p>The results for playlist datasets are based on a
combination of favorite artists, topic similarity and long-term kNN
scores. For the #nowplaying listening log dataset, we used a
combination of all scores (except topic similarity) and for the
30Music dataset the combination includes favorite artists,
inv. ILS
overlap
repetition and long-term kNN scores (see Table 8).</p>
        <p>The results indicate that a weighted combination of
multiple scores with the baseline method can further increase the
accuracy. Speci cally, the multi-score combinations
outperform every other single-score combination with the baseline
in terms of hit rate across all ve datasets.</p>
        <p>The multi-score method also has e ects on the diversity
and coherence of the recommendations, which are the
combined result of the individual scoring methods. For instance,
for the Last.fm dataset, all of the single combinations reduce
the artist diversity and as a result, the multi-score
combination also generates recommendations that are not only
less diverse than the baseline but also less diverse than each
of the single combinations. As another example, consider
the coherence of the recommendations for the
#nowplaying dataset. Two of the single combinations (kNN + rep
and kNN + favA) lead to a higher coherence than the kNN
method, whereas the other two combinations (kNN + lth
and kNN + fof ) lead to a lower coherence than the kNN
method. The multi-score combination, which is a
combination of all these four scores with the kNN method, leads to
a higher coherence than the baseline method. The
explanation for this e ect is that the rst two scores with weights
of wrep = 2:0 and wfavA = 1:5 have a higher in uence on
the recommendations than the other two scores with lower
weights (wkNNlth = 0:5 and wfof = 0:3). Therefore, this
multi-score combination { similar to the rst two single
combinations { generates more coherent recommendations.</p>
        <p>Generally, how to set the diversity and coherence level of
the recommendations depends on the goals that should be
achieved. For instance, if a user is in exploration mode and
intends to discover new tracks or artists, a more diverse
rec.048
.055
.086
.038
.019
.140#
.123
.321
.078#
.048
.051
.152#
ommendation list would be desired. Similarly, when a user
is creating a playlist with a speci c theme, e.g., a workout
playlist, she would probably prefer tracks with a coherent
tempo. This can be achieved by selecting and weighting
the individual personalization components in our multi-score
combination approach according to the desired goals.</p>
        <p>Overall, we view our results to be promising as we could
show the bene ts of incorporating various forms of
personalization signals into the recommendation process even though
the available data for each user is partially very sparse and,
in the case of the social tags, relatively noisy.
4.</p>
      </sec>
    </sec>
    <sec id="sec-13">
      <title>RELATED WORK</title>
      <p>
        Next-track recommendation can be considered a speci c
type of music recommendation where the goal is to
generate a playlist as a continuation of a recent listening
experience. A variety of playlist generation strategies have been
proposed over the last decade (see [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]). They base their
recommendations, e.g., on content similarity [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], collaborative
ltering [
        <xref ref-type="bibr" rid="ref13 ref15">13, 15</xref>
        ], Markov models [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], discrete optimization
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and hybrid techniques [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. While personalization is
considered a key feature in general recommender systems
research and also for various types of music recommendation
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], most existing approaches to playlist generation from
the literature only rely on the users' most recent listening
histories and not on their long-term preferences.
      </p>
      <p>
        A few exceptions exist. In [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], the value of considering
repetition patterns is explored to develop a novelty model
for generating personalized music recommendations. In [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ],
a method is proposed that leverages the online friendship
relations and other signals for recommending. In contrast to
these works that focus on one single source for
personalization, we propose an extension of the general faceted weighted
track scoring approach from [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Instead of musical features
and track meta-data in the scoring process used in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], here
we rely on a variety of user-speci c listening patterns and
the user's social network. Adding musical features to the
process is however generally possible.
      </p>
      <p>
        There are other examples of combining music
recommendation algorithms in the literature [
        <xref ref-type="bibr" rid="ref11 ref23 ref30">11, 23, 30</xref>
        ]. In the latter,
for instance, music tracks are recommended by combining
a collaborative- ltering, an artist-based and a
clusteringbased algorithm through rank interpolation. Similar to our
scoring scheme, their goal is to increase the
recommendation accuracy as well as other factors like novelty, diversity,
and serendipity. However, their main focus lies on creating
more serendipitous recommendations and the technique is
not based on exchangeable baseline methods.
      </p>
      <p>
        In our work, we utilize seed tracks for estimating the
desired characteristics of the next-track recommendations.
Our seed tracks stem partially from hand-crafted playlists
shared by users on di erent music platforms as in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and
partially from listening logs collected from Internet radio
stations and online social media as in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Other approaches
exist in which only one single seed track [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] or the start and
the end tracks are speci ed [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Relying only on one single
track is in principle possible with our approach but would
not allow us to reliably guess the underlying theme of the
current playlists or listening sessions. As we are interested
in (endless) next-track recommendations, the provision of a
desired end track is not reasonable for us.
      </p>
      <p>
        In order to assess whether a track ful lls the desired
characteristics of a listening history, various types of information
can be used, e.g., musical features extracted from the audio
signals [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], meta-data features about artist, genre, or the
release year [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], as well as popularity or usage data [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], and
other social web data like tags [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In this paper, we used
the user-speci c signals extracted from the usage data as
input source.
      </p>
      <p>
        To evaluate the quality of our proposed algorithms, we
compare the next-track recommendations of each algorithm
with the respective listening history, i.e., a hand-crafted
playlist or a listening session. The assumption is that the
desired quality level of certain criteria such as diversity can
be determined from these listening logs. To compare the
accuracy of di erent algorithms in Section 2.3, we use the hit
rate measure and the con guration proposed in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], i.e., we
hide the last track in each test listening history. For
measuring the diversity and coherence of recommendations, as done
in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], we use the inverse ILS and the overlap of artists (or
tags) in the history and the recommendations respectively.
Generally, other evaluation approaches are possible as well
as discussed in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], among them user studies [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and log
analysis [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
5.
      </p>
    </sec>
    <sec id="sec-14">
      <title>CONCLUSIONS</title>
      <p>In this work, we have proposed and evaluated a
number of strategies to leverage a variety of signals for
personalizing next-track music recommendations based on the
users' long-term preferences. One general insight of our
work is that even though the selection of the immediate
next tracks should be primarily governed by the most
recently played tracks, personalization based on user-speci c
signals extracted from the long-term preferences can further
improve the quality of the recommendations.</p>
      <p>Technically, in order to extract potentially relevant
userspeci c signals, we analyzed a larger number of \hand-crafted"
and publicly shared playlists as well as long-term listening
logs. Speci cally, we explored the value of track repetition,
favorite artists, topic similarity, track co-occurrence, and
social friends as possible personalization signals. These signals
were then incorporated in a multi-faceted scoring scheme
that combines a baseline algorithm that focuses on the
shortterm interests with one or more long-term personalization
components.</p>
      <p>Despite the noisiness, sparseness and partial
incompleteness of the data, the results show that our proposed
strategies not only help to further increase the recommendation
accuracy but can also help to in uence other quality
dimensions like diversity and coherence depending on the desired
goals. Generally, deciding on (a) the types of auxiliary
information that should be incorporated in the recommendation
process and (b) the relative importance of short-term and
long-term preferences has to be done with the particularities
of the given application domain in mind.</p>
      <p>Conducting user studies to evaluate our approaches with
respect to the perceived quality of the next-track
recommendations and user satisfaction is part of our ongoing work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Aizenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koren</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Somekh</surname>
          </string-name>
          .
          <article-title>Build Your Own Music Recommender by Modeling Internet Radio Streams</article-title>
          .
          <source>In Proc. WWW</source>
          , pages
          <volume>1</volume>
          {
          <fpage>10</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Anderson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tomkins</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Vassilvitskii</surname>
          </string-name>
          .
          <article-title>The Dynamics of Repeat Consumption</article-title>
          .
          <source>In Proc. WWW</source>
          , pages
          <volume>419</volume>
          {
          <fpage>430</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.-J.</given-names>
            <surname>Aucouturier</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Pachet</surname>
          </string-name>
          .
          <article-title>Scaling Up Music Playlist Generation</article-title>
          .
          <source>In Proc. ICME</source>
          , volume
          <volume>1</volume>
          , pages
          <fpage>105</fpage>
          {
          <fpage>108</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Barrington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Oda</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G. R. G.</given-names>
            <surname>Lanckriet</surname>
          </string-name>
          .
          <article-title>Smarter than Genius? Human Evaluation of Music Recommender Systems</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>357</volume>
          {
          <fpage>362</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Baur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Boring</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Butz</surname>
          </string-name>
          . Rush:
          <article-title>Repeated Recommendations on Mobile Devices</article-title>
          .
          <source>In Proc. IUI</source>
          , pages
          <volume>91</volume>
          {
          <fpage>100</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bonnin</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          .
          <article-title>Evaluating the Quality of Playlists Based on Hand-Crafted Samples</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>263</volume>
          {
          <fpage>268</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Bonnin</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          .
          <source>Automated Generation of Music Playlists: Survey and Experiments. ACM Comput. Surv.</source>
          ,
          <volume>47</volume>
          (
          <issue>2</issue>
          ):
          <volume>26</volume>
          :1{
          <fpage>26</fpage>
          :
          <fpage>35</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Bosteels</surname>
          </string-name>
          , E. Pampalk, and
          <string-name>
            <given-names>E. E.</given-names>
            <surname>Kerre</surname>
          </string-name>
          .
          <article-title>Evaluating and Analysing Dynamic Playlist Generation Heuristics Using Radio Logs and Fuzzy Set Theory</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>351</volume>
          {
          <fpage>356</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Braunhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaminskas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          .
          <article-title>Location-aware music recommendation</article-title>
          .
          <source>International Journal of Multimedia Information Retrieval</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <volume>31</volume>
          {
          <fpage>44</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          . Music Recommendation by Uni ed Hypergraph:
          <article-title>Combining Social Media Information and Music Content</article-title>
          .
          <source>In Proc. MM</source>
          , pages
          <volume>391</volume>
          {
          <fpage>400</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Chedrawy</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. S. R.</given-names>
            <surname>Abidi</surname>
          </string-name>
          .
          <article-title>A Web Recommender System for Recommending, Predicting and Personalizing Music Playlists</article-title>
          .
          <source>In Proc. WISE</source>
          , pages
          <volume>335</volume>
          {
          <fpage>342</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Moore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Turnbull</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Joachims</surname>
          </string-name>
          .
          <article-title>Playlist Prediction via Metric Embedding</article-title>
          .
          <source>In Proc. KDD</source>
          , pages
          <volume>714</volume>
          {
          <fpage>722</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Desrosiers</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Karypis</surname>
          </string-name>
          .
          <article-title>A Comprehensive Survey of Neighborhood-based Recommendation Methods</article-title>
          .
          <source>In RSs Handbook</source>
          , pages
          <volume>107</volume>
          {
          <fpage>144</fpage>
          .
          <string-name>
            <surname>Springer</surname>
            <given-names>US</given-names>
          </string-name>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Flexer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schnitzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gasser</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Widmer</surname>
          </string-name>
          .
          <article-title>Playlist Generation Using Start and End Songs</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>173</volume>
          {
          <fpage>178</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hariri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>Context-Aware Music Recommendation Based on Latent Topic Sequential Patterns</article-title>
          .
          <source>In Proc. RecSys</source>
          , pages
          <volume>131</volume>
          {
          <fpage>138</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hu</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ogihara. NextOne Player</surname>
          </string-name>
          :
          <article-title>A Music Recommendation System Based on User Behavior</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>103</volume>
          {
          <fpage>108</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D.</given-names>
            <surname>Jannach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lerche</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Kamehkhosh.</surname>
          </string-name>
          <article-title>Beyond \Hitting the Hits": Generating Coherent Music Playlist Continuations with the Right Tracks</article-title>
          .
          <source>In Proc. RecSys</source>
          , pages
          <volume>187</volume>
          {
          <fpage>194</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaminskas</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Ricci</surname>
          </string-name>
          .
          <article-title>Contextual Music Information Retrieval and Recommendation: State of the Art and Challenges</article-title>
          . Computer Science Review,
          <volume>6</volume>
          (
          <issue>2</issue>
          -3):
          <volume>89</volume>
          {
          <fpage>119</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>K.</given-names>
            <surname>Kapoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Terveen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <surname>P. Schrater. \</surname>
          </string-name>
          <article-title>I like to explore sometimes": Adapting to Dynamic User Novelty Preferences</article-title>
          .
          <source>In Proc. RecSys</source>
          , pages
          <volume>19</volume>
          {
          <fpage>26</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>P. B.</given-names>
            <surname>Lamere</surname>
          </string-name>
          .
          <article-title>I've Got 10 Million Songs in My Pocket: Now What?</article-title>
          <source>In Proc. RecSys</source>
          , pages
          <volume>207</volume>
          {
          <fpage>208</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lehtiniemi</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Seppa</surname>
          </string-name>
          <article-title>nen. Evaluation of Automatic Mobile Playlist Generator</article-title>
          .
          <source>In Proc. MC</source>
          , pages
          <volume>452</volume>
          {
          <fpage>459</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>B.</given-names>
            <surname>Logan</surname>
          </string-name>
          .
          <article-title>Content-Based Playlist Generation: Exploratory Experiments</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>295</volume>
          {
          <fpage>296</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>B.</given-names>
            <surname>McFee</surname>
          </string-name>
          and
          <string-name>
            <given-names>G. R. G.</given-names>
            <surname>Lanckriet.</surname>
          </string-name>
          <article-title>The Natural Language of Playlists</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>537</volume>
          {
          <fpage>542</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>B.</given-names>
            <surname>McFee</surname>
          </string-name>
          and
          <string-name>
            <given-names>G. R. G.</given-names>
            <surname>Lanckriet</surname>
          </string-name>
          .
          <article-title>Hypergraph Models of Playlist Dialects</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>343</volume>
          {
          <fpage>348</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Moore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Joachims</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Turnbull</surname>
          </string-name>
          .
          <article-title>Learning to Embed Songs and Tags for Playlist Prediction</article-title>
          .
          <source>In Proc. ISMIR</source>
          , pages
          <volume>349</volume>
          {
          <fpage>354</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>T.</given-names>
            <surname>Pohle</surname>
          </string-name>
          , E. Pampalk, and
          <string-name>
            <given-names>G.</given-names>
            <surname>Widmer</surname>
          </string-name>
          .
          <article-title>Generating Similarity-Based Playlists using Traveling Salesman Algorithms</article-title>
          .
          <source>In Proc. DAFx</source>
          , pages
          <volume>220</volume>
          {
          <fpage>225</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Rendle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Freudenthaler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Gantner</surname>
          </string-name>
          , and L.
          <string-name>
            <surname>Schmidt-Thieme</surname>
          </string-name>
          .
          <article-title>BPR: Bayesian Personalized Ranking from Implicit Feedback</article-title>
          .
          <source>In Proc. UAI</source>
          , pages
          <volume>452</volume>
          {
          <fpage>461</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>R.</given-names>
            <surname>Turrin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Quadrana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Condorelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pagano</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          . 30Music Listening and
          <string-name>
            <given-names>Playlists</given-names>
            <surname>Dataset</surname>
          </string-name>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hsu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach</article-title>
          .
          <source>ACM Trans. Multimedia Comput. Commun. Appl.</source>
          ,
          <volume>11</volume>
          (
          <issue>1</issue>
          ):7:
          <issue>1</issue>
          {7:
          <fpage>22</fpage>
          ,
          <string-name>
            <surname>Sept</surname>
          </string-name>
          .
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Y. C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. O.</given-names>
            <surname>Seaghdha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Quercia</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Jambor</surname>
          </string-name>
          . Auralist:
          <article-title>Introducing Serendipity into Music Recommendation</article-title>
          .
          <source>In Proc. WSDM</source>
          , pages
          <volume>13</volume>
          {
          <fpage>22</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>C.-N. Ziegler</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          <string-name>
            <surname>McNee</surname>
            ,
            <given-names>J. A.</given-names>
          </string-name>
          <string-name>
            <surname>Konstan</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Lausen. Improving Recommendation Lists Through Topic</surname>
          </string-name>
          <article-title>Diversi cation</article-title>
          .
          <source>In Proc. WWW</source>
          , pages
          <volume>22</volume>
          {
          <fpage>32</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>