<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Importance of Song Context in Music Playlists</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Enabling Recommendations in the Long Tail</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ACM Reference format: Andreu Vall, Markus Schedl, Gerhard Widmer and Massimo adrana, Paolo Cremonesi. 2017. e Importance of Song Context in Music Playlists. In RecSys 2017 Poster Proceedings</institution>
          ,
          <addr-line>Como</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Andreu Vall, Markus Schedl, Gerhard Widmer Department of Computational Perception Johannes Kepler University Linz</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Massimo adrana, Paolo Cremonesi Politecnico di Milano</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>Music recommender systems oen operate in sequential mode by suggesting a collection of songs that constitute a listening session. is task is usually called automated music playlist generation and it has been previously studied in the literature with dierent successful approaches based on, e.g., variations of collaborative ltering or content-based similarity. Some of the proposed playlist models take into consideration the current song and a number of previous songs, i.e., the song context, in order to predict the next song. However, it is not yet clear to what extent knowing this song context improves next-song predictions. To shed light on this question, we conduct a numerical experiment on two datasets of hand-curated music playlists, where we compare playlist models that account for different song context lengths. Our results indicate that knowing the song context seems, at rst, uninformative. However, we explain this eect by a strong bias in the data towards very popular songs and observe that, in fact, songs in the long tail are more accurately predicted when the song context is considered.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
    </sec>
    <sec id="sec-2">
      <title>MODELING MUSIC PLAYLISTS</title>
      <p>We describe the playlist models considered in our experiment. We
generally assume that: two sets of playlists are available, one for
training and one for test; given the current song and a number
of previous songs from a test playlist, a playlist model has to be
able to rank all next-song candidates according to how likely they
are to be the next song in that playlist. Note that we refer to the
current and the previous songs in the playlist as the song context
as it is commonly done in language models, but this should not be
confused with the incorporation of general contextual information
into recommendation systems.
1.1</p>
    </sec>
    <sec id="sec-3">
      <title>Song Popularity</title>
      <p>is is a unigram model that computes the popularity of a song
according to its relative frequency in the training playlists. At
test time, the next-song candidates are ranked by their popularity,
regardless of the current and previous songs in the playlist.
1.2</p>
      <p>Song-based Collaborative Filtering
is is an item-based Collaborative Filtering (CF) model. A song s is
represented by the binary vector ps indicating the playlists to which
it belongs. e similarity of each pair of songs si ; sj in the training
set is computed as the cosine between psi and psj . At test time, the
next-song candidates are ranked according to their similarity to the
current song, but previous songs in the playlist are ignored.
1.3</p>
    </sec>
    <sec id="sec-4">
      <title>Recurrent Neural Networks</title>
      <p>Recurrent Neural Networks (RNNs) are a class of neural network
models particularly suited to processing sequential data. ey have
a hidden state that accounts for the input at each time step while
recurrently incorporating information from previous hidden states.</p>
      <p>
        We adopt the approach proposed in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where an RNN model
with one layer of gated recurrent units is combined with a loss
function designed to optimize the ranking of next-item
recommendations. At test time, given the current and all the previous songs
in the playlist, the RNN outputs a vector of song scores that is used
to rank the next-song candidates.
2
      </p>
    </sec>
    <sec id="sec-5">
      <title>DATASETS</title>
      <p>
        e “AotM-2011” dataset [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is a playlists collection derived from
the Art of the Mix1 database. e “8tracks” dataset is a private
playlists collection compiled from 8tracks,2 an on-line platform
where users can share playlists and listen to other users’ playlists.
      </p>
      <p>We keep only the playlists with at least 3 unique artists and
with a maximum of 2 songs per artist. is is to discard artist- or
album-themed playlists, which may correspond to a less careful
compilation process. We also keep only the playlists with at least
5 songs. Songs occurring in less than 10 playlists are removed to
ensure that the models have sucient observations for each song.</p>
      <p>We randomly assign 80% of the playlists to the training set and
the remaining 20% to the test set. As in any recommendation task
blind to item content, the songs that occur only in test playlists
need to be removed because they can not be modeled at training
time. is aects the nal playlist length and song frequency.</p>
      <p>e ltered AotM-2011 dataset has 17,178 playlists with 7,032
songs by 2,208 artists. e ltered 8tracks dataset has 76,759 playlists
1www.artohemix.org
2hps://8tracks.com
min
med
max
with 15,649 songs by 4,290 artists. Table 1 reports the distribution
of unique songs per playlist, unique artists per playlist and song
popularity in the datasets.</p>
    </sec>
    <sec id="sec-6">
      <title>3 EVALUATION</title>
      <p>e evaluation is based on the prediction of next songs, considering
all the songs in the dataset as next-song candidates. Given a trained
model, the following procedure is repeated over all the test playlists.
We show the model the rst song in a playlist. e model then has
to rank all the next-song candidates according to their likelihood
to be the second song in that playlist. We only keep track of the
rank assigned to the actual second song. We then show the model
the rst and the second actual songs. e model has to rank all the
next-song candidates for the third position, having now a longer
song context. In this way, we progress until the end of the playlist,
always keeping track of the rank assigned to the actual next song.
Finally, the performance of a playlist model is characterized by the
distribution of its predicted ranks for the actual next songs.</p>
      <p>Hyperparameter tuning is performed on a validation split. Along
with the described models, we also consider a random model that
assigns scores to songs uniformly at random, yielding random ranks.</p>
    </sec>
    <sec id="sec-7">
      <title>4 RESULTS</title>
      <p>Considering the ranks achieved for all next-song predictions (le
panels in Figure 1), the RNN and the popularity-based model
perform comparably well, and signicantly beer than the song-based
CF model. is is a surprising result given the simplicity and null
song context of the popularity-based model, but might be explained
by the impact of a small number of very popular songs in both
datasets (Table 1). To investigate this eect, we consider the ranks
achieved when the actual next songs belong to the 10% most
popular songs in the dataset (central panels in Figure 1), and when
the actual next songs belong to the long tail of 90% least popular
songs in the dataset (right panels in Figure 1). As expected, the
popularity-based model performs well on the most popular songs,
but is close to the random model for songs in the long tail. us,
the overall performance of the popularity-based model is similar to
that of the RNN model only because the datasets are biased towards
popular songs. e song-based CF model is not competitive in this
experiment and, especially in the 8tracks dataset, is also aected
by the bias towards popular songs. In contrast, the performance
of the RNN model, which keeps track of the full song context, is
robust to the popularity of the actual next song.
4500
k
n
a
rd3000
e
it
c
d
re1500
p
1500
1</p>
      <sec id="sec-7-1">
        <title>All songs</title>
      </sec>
      <sec id="sec-7-2">
        <title>Popular songs</title>
      </sec>
      <sec id="sec-7-3">
        <title>Long tail songs</title>
        <sec id="sec-7-3-1">
          <title>RNN Pop. CFRand. RNN Pop. CFRand.</title>
          <p>(a) AotM-2011 dataset</p>
        </sec>
        <sec id="sec-7-3-2">
          <title>RNN Pop. CFRand.</title>
          <p>All songs</p>
          <p>Popular songs</p>
          <p>Long tail songs
RNN Pop. CFRand. RNN Pop. CFRand.</p>
          <p>(b) 8tracks dataset</p>
          <p>RNN Pop. CFRand.</p>
          <p>
            In our view, this is a remarkable nding given that the bias
towards popular songs is a characteristic feature of playlist datasets
in the music recommendation domain [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]. Playlist models that
account for a longer song context are therefore beer suited to
next-song prediction for songs in the long tail.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>5 CONCLUSION</title>
      <p>In this work we investigated the importance of the song context for
next-song recommendations in music playlists. Our results indicate
that the bias towards very popular songs masks the importance
of the song context. However, we observe that a playlist model
based on an RNN, which considers the full song context of a playlist,
clearly outperforms simpler models when recommending songs
belonging to the long tail of non popular songs.</p>
    </sec>
    <sec id="sec-9">
      <title>6 ACKNOWLEDGMENTS</title>
      <p>We thank Mahias Dorfer, Bruce Ferwerda, Rainer Kelz, Filip
Korzeniowski, Roc´ıo del R´ıo and David Sears for helpful discussions.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>O</given-names>
            <surname>`scar Celma</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Music recommendation and discovery</article-title>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2] Bala´zs Hidasi et al.
          <year>2016</year>
          .
          <article-title>Session-based recommendations with recurrent neural networks</article-title>
          .
          <source>In Proc. ICLR</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Brian</given-names>
            <surname>McFee</surname>
          </string-name>
          and
          <string-name>
            <given-names>Gert</given-names>
            <surname>Lanckriet</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Hypergraph models of playlist dialects</article-title>
          .
          <source>In Proc. ISMIR</source>
          .
          <volume>343</volume>
          -
          <fpage>348</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>