<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Comparison of Frequent Paern Techniques and a Deep Learning Method for Session-Based Recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Iman Kamehkhosh</string-name>
          <email>iman.kamehkhosh@tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dietmar Jannach</string-name>
          <email>dietmar.jannach@tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malte Ludewig</string-name>
          <email>malte.ludewig@tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop on Temporal Reasoning in Recommender Systems, collocated with ACM Rec-</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Sys'17</institution>
          ,
          <addr-line>Como</addr-line>
          ,
          <country country="IT">Italy.</country>
          <institution>, Copyright©2017 for this paper by its authors. Copying permied for private and</institution>
          ,
          <addr-line>academic purposes., .</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>TU Dortmund</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Making session-based recommendations, i.e., recommending items solely based on the users' last interactions without having access to their long-term preference proles, is a challenging problem in various application elds of recommender systems. Using a coarse classication scheme, the proposed algorithmic approaches to this problem in the research literature can be categorized into frequent paern mining algorithms and approaches that are based on sequence modeling. In the context of methods of the laer class, recent works suggest the application of recurrent neural networks (RNN) for the problem. However, the lack of established algorithmic baselines for session-based recommendation problems makes the assessment of such novel approaches dicult. In this work, we therefore compare a state-of-the-art RNN-based approach with a number of (heuristics-based) frequent paern mining methods both with respect to the accuracy of their recommendations and with respect to their computational complexity. e results obtained for a variety of dierent datasets show that in every single case a comparably simple frequent paern method can be found that outperforms the recent RNN-based method. At the same time, the proposed much more simple methods are also computationally less expensive and can be applied within the narrow time constraints of online recommendation.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <p>•General and reference →Evaluation; •Information systems
→Recommender systems; •Computing methodologies
→Neural networks; Rule learning;</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        Making recommendations solely based on a user’s current session
and most recent interactions is a nontrivial problem for
recommender systems. On an e-commerce website, for instance, when
a visitor is new (or not logged in), there are no long-term user
models that can be applied to determine suitable recommendations
for this user. Furthermore, recent work shows that considering the
user’s short-term intent has oen more eect on the accuracy of the
recommendations than the choice of the method used to build the
long-term user proles [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In general, such types of problems are
common on e-commerce sites, e.g., when returning users do not log
in every time they use the site. e same challenges can, however,
be observed also for other application domains, in particular for
news and media (music and video) recommendation [
        <xref ref-type="bibr" rid="ref21 ref33">21, 33</xref>
        ].
      </p>
      <p>
        e problem of predicting the next actions of users based solely
on their sequence of actions in the current session is referred to
in the literature as session-based recommendation. A number of
algorithmic approaches have been proposed over the years to deal
with the problem. Early academic approaches, for example, rely
on the detection of sequential paerns in the session data of a
larger user community. In principle, even simpler methods can be
applied. Amazon’s “Customers who bought . . . also bought” feature
represents an example that relies on simple co-occurrence paerns
to generate recommendations, in that case in the context of the
very last user interaction (an item view event). A number of later
works then explored the use of Markov models [
        <xref ref-type="bibr" rid="ref30 ref35 ref39">30, 35, 39</xref>
        ], and
most recently, researchers explored the use of recurrent neural
networks (RNN) for the session-based next-item recommendation
problem [
        <xref ref-type="bibr" rid="ref16 ref17 ref38 ref42">16, 17, 38, 42</xref>
        ].
      </p>
      <p>
        Today, RNNs can be considered one of the state-of-the-art
methods for sequence learning tasks. ey have been successfully
explored for various sequence-based prediction problems in the past
[
        <xref ref-type="bibr" rid="ref11 ref18 ref5 ref9">5, 9, 11, 18</xref>
        ] and in a recent work, Hidasi et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] investigated an
RNN variant based on gated recurrent units (GRU) for the
sessionbased recommendations problem. In their work, they benchmarked
their RNN-based method gru4rec with dierent baseline methods
on two datasets. eir results showed that gru4rec is able to
outperform the baseline approaches in terms of accuracy for top-20
recommendation lists.
      </p>
      <p>
        While these results indicate that RNNs can be successfully
applied for the given recommendation task, we argue that the
experimental evaluation in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] does not fully inform us about dierent
aspects of the eectiveness and the practicability of the proposed
method. First, regarding the eectiveness, it is unclear if the
methods to which gru4rec was compared are competitive. Second, as
the evaluation was based on one single training-test split and only
using accuracy measures, further investigations are necessary to
assess, for example, if some algorithms exhibit certain biases, e.g., to
recommend mostly popular items. ird, even if the RNN method is
eective, questions regarding the scalability of the method should
be discussed, in particular as hyper-parameter optimization for the
complex networks can become very challenging in practice.
      </p>
      <p>e goal of this work is to shed light on these questions and in
the remainder of this paper we will report the detailed results of
comparing a state-of-the-art RNN-based method with a number
of computationally more ecient paern mining approaches in
dierent dimensions.</p>
    </sec>
    <sec id="sec-3">
      <title>PREVIOUS WORKS</title>
      <p>
        In session-based recommendation problems, we are given a
sequence of the most recent actions of a user and the goal is to nd
items that are relevant in the context of the user’s specic
shortterm intent. One traditional way to determine recommendations
given a set of recent items of interest is to apply frequent
pattern mining techniques, e.g., based on association rules (AR) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
AR are oen applied for market basket analysis with the goal to
nd sets of items that are bought together with some probability
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. e order of the items or actions in a session is irrelevant
for AR-based approaches. Sequential paerns mining (SP) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
techniques, in contrast, consider the order of the elements in sessions
when identifying frequent paerns. In one of the earlier works,
Mobasher et al. [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] used frequent paern mining methods to
predict a user’s next navigation action. In another work, Yap et al. [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ]
propose a sequential paern-mining-based next-item
recommendation framework, which weights the paerns according to their
estimated relevance for the individual user. In the domain of music
recommendation, Hariri et al. more recently [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] propose to mine
sequential paerns of latent topics based on the tags aached to
the tracks to predict the context of the next song.
      </p>
      <p>
        A dierent way of nding item-to-item correlations is to look
for sessions that are similar to the current one (neighbors), and to
determine frequent item co-occurrence paerns that can be used in
the prediction phase. Such neighborhood-based approaches were
for example applied in the domains of e-commerce and music in
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] or [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. In some cases and application domains, simple
cooccurrence paerns are despite their simplicity quite eective, see,
e.g., [
        <xref ref-type="bibr" rid="ref20 ref40">20, 40</xref>
        ] or [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ].
      </p>
      <p>
        Dierently from such paern- and co-occurrence-based
techniques, a number of recent approaches are based on sequence
modeling using, e.g., Markov models. e main assumption of
Markovmodel-based approaches in the context of session-based
recommendation is that the selection of the next item in a session is
dependent on a limited number of previous actions. Shani et al. [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]
were among the rst who applied rst-order Markov chains (MC)
for session-based recommendation and showed the superiority of
sequential models over non-sequential ones. In the music domain,
McFee and Lanckriet [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] proposed a music playlist generation
algorithm based on MCs that – given a seed song – selects the
next track from uniform and weighted distributions as well as from
k-nearest neighbor graphs. Generally, a main issue when applying
Markov chains in session-based recommendation is that the state
space quickly becomes unmanageable when all possible sequences
of user selections should be considered [
        <xref ref-type="bibr" rid="ref12 ref16">12, 16</xref>
        ].
      </p>
      <p>
        More recent approaches to sequence modeling for session-based
recommendation utilize recurrent neural networks (RNN). RNNs
process sequential data one element at a time and are able to
selectively pass information across sequence steps [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Zhang et al. [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ],
for example, successfully applied RNNs to predict advertisement
clicks based on the users’ browsing behavior in a sponsored search
scenario. For session-based recommendations, Hidasi et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]
investigated a customized RNN variant based on gated recurrent
units (GRU) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to model the users’ transactions within sessions.
ey also tested several ranking loss functions in their solutions.
Later on, in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ] RNN-based approaches were proposed
which leverage additional item features to achieve higher
accuracy. For the problem of news recommendation, Song et al. [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ]
proposed a temporal deep semantic structured model for the
combination of long-term static and short-term temporal user preferences.
ey considered dierent levels of granularity in their model to
process both fast and slow temporal changes in the users’
preferences. In general, neural networks have been used for a number
of recommendation-related tasks in recent years. Oen, such
networks are used to learn embeddings of content features in compact
xed-size latent vectors, e.g., for music, for images, for video data,
for documents, or to represent the user [
        <xref ref-type="bibr" rid="ref13 ref25 ref29 ref3 ref46 ref6 ref7 ref8">3, 6–8, 13, 25, 29, 46</xref>
        ].
ese representations are then integrated, e.g., in content-based
approaches, in variations of latent factor models, or are part of new
methods for computing recommendations [
        <xref ref-type="bibr" rid="ref13 ref27 ref37 ref43 ref45 ref7 ref8">7, 8, 13, 27, 37, 43, 45</xref>
        ].
      </p>
      <p>In the work presented in this paper, we will compare dierent
existing and novel paern-mining-based approaches with a
stateof-the-art RNN-based algorithm.
3
3.1</p>
    </sec>
    <sec id="sec-4">
      <title>EXPERIMENT CONFIGURATIONS Algorithms</title>
      <p>
        3.1.1 RNN Baseline. gru4rec is an RNN-based algorithm that
uses Gated Recurrent Units to deal with the vanishing or exploding
gradient problem proposed in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. In our experiments, we used
the Python implementation that is shared by the authors online.1
3.1.2 Session-based kNN – knn. e knn method searches the k
most similar past sessions (“neighbors”) in the training data based
on the set of items in the current session. Since the process of
determining the neighbor sessions becomes very time-consuming
as the number of sessions increases, we use an special in-memory
index data structure (cache) in our implementation. Technically, in
the training phase, we create a data structure that maps the training
sessions to their set of items and one structure that maps the items
to the sessions in which they appear. To make recommendations for
the current session s, we rst create a union of the sessions in which
the items of s appear. is union will be the set of possible neighbors
of the current session. is is a fast operation as it only involves a
cache lookup and set operations. To further reduce the
computational complexity of the prediction process, we select a subsample
of these possible neighbors using a heuristic. In this work, we took
the m most recent sessions as focusing on recent trends has shown
to be eective for recommendations in e-commerce [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. We then
compute the similarity of these m most recent possible neighbors
and the current session and select the k most similar sessions as
the neighbor sessions of the current session. Again through lookup
and set union operations, we create the set of recommendable items
R that contains items that appear in one of the k sessions. For each
recommendable item i in R, we then compute the knn score as the
sum of the similarity values of s and its neighbor sessions n 2 Ns
which contains i (Equation 1). e indicator function 1n ¹iº returns
1 if n contains i and 0 otherwise, see also [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>scoreknn¹i; sº = Σn 2Ns sim¹s; nº
1n ¹iº
(1)</p>
      <p>
        In our experiments, we tested dierent distance measures to
determine the similarity of sessions. e best results were achieved
when the sessions were encoded as binary vectors of the item space
and when using cosine similarity. In our implementation, the set
operations, similarity computations, and the nal predictions can
be done very eciently as will be discussed later in Section 4.2.2.
Our algorithm has only two parameters, the number of neighbors
k and the number of sampled sessions m. For the large e-commerce
dataset used in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], the best parameters were, for example, achieved
with k = 500 and m = 1000. Note that the kNN method used in
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] is based on item-to-item similarities while our kNN method
aims to identify similar sessions.
      </p>
      <p>3.1.3 kNN Temporal Extension – tknn. e knn method, when
using cosine similarity as a distance measure, does not consider the
temporal sequence of the events in a session. To be able to leverage
the temporal information within the knn technique, we designed
an additional temporal-ltering heuristic for it. e proposed tknn
method uses the same scoring scheme as the knn method
(Equation 1). e only dierence is that, given the current session s, we
consider item i as being recommendable only if it appears in the
neighbor session n directly aer a certain item. In our
implementation, that certain item is the last item of the current session s.
Technically, we therefore use a slightly dierent implementation of
the indicator function of Equation 1: 1n ¹iº = 1 if neighbor session
n contains i and ¹j; iº is a subsequence of n, where j is the last item
of the current session and thus the basis to predict the next item.</p>
      <p>3.1.4 Simple Association Rules – ar. To assess the strength of
simple two-element co-occurrence paerns, we included a method
named ar which can be considered as an association rule technique
with a maximum rule size of two. Technically, we create a rule rp;q
for every two items p and q that appear together in the training
sessions. We determine the weight, wp;q , of each rule simply as the
number of times p and q appear together in past sessions. Given
the current session s, the ar score of a target item i will be then
computed as
scorear¹i; sº = wi; j
1AR ¹ri; j º
(2)
where j is the last item of the current session s for which we want
to predict the successor and AR is the set of rules and their weights
as determined based on the training data. e indicator function
1AR ¹ri; j º = 1 when AR contains ri; j and 0 otherwise.</p>
      <p>3.1.5 Simple Sequential Rules – sr. e sr method is a variant of
ar, which aims to take the order of the events into account. Similar
to the ar method, we create a sequential rule for the co-occurrence
of every two items p and q as rp;q in the training data. is time,
however, we consider the distance between p and q in the session
when computing the weight of the rules. In our implementation,
we use the multiplicative inverse as a weight function and set
wp;q = 1x , where x is the number of items that appear between p
and q in a session. Other heuristics such as a linear or a logarithmic
function can also be used. In case that those two items appear
together in another session in the training data, the weight of the
rule in that session will be added to the current weight. We nally
normalize the weight and divide it by the total number of sessions
that contributed to the weight. Given the current session s, the sr
score of a target item i is then computed as
where j is the last item of session s and SR is the set of sequential
rules. e indicator function 1S R ¹rj;i º = 1 when SR contains rj;i
and 0 otherwise.</p>
      <p>3.1.6 Hybrid Approaches. We made additional experiments with
several hybrids that combine dierent algorithms. At the end, a
weighted combination of the two normalized prediction scores of
the algorithms led to the best results in our experiments.
3.2</p>
    </sec>
    <sec id="sec-5">
      <title>Datasets and Evaluation Protocol</title>
      <p>We performed experiments using datasets from two dierent
domains in which session-based recommendation is relevant, namely
e-commerce and next-track music recommendation. e source
code and the public datasets can be found online.2</p>
      <p>
        3.2.1 E-commerce Datasets. For the e-commerce domain, we
chose the ACM RecSys 2015 Challenge dataset (RSC) as used in
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. e RSC dataset is a collection of sequences of click events
in shopping sessions. e second e-commerce dataset is a public
dataset published for the TMall competition. is dataset contains
shopping logs of the users on the Tmall.com website.
      </p>
      <p>3.2.2 Music Datasets. We used (a) two datasets that contain
listening logs of several thousand users and (b) two datasets that
comprise thousands of manually created playlists.</p>
      <p>
        Listening logs: ese used datasets are (almost one-year-long)
sub-samples of two public datasets. First, we created a subset of
the #nowplaying dataset [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ], which contains music-related tweets
on Twier. Second, we used the recent 30Music dataset [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ], which
contains listening sessions retrieved from Internet radio stations
through the Last.fm API.
      </p>
      <p>
        Playlists: Generally, music playlists are dierent in nature from
listening logs and e-commerce user logs in various ways.
Nonetheless, they are designed to be consumed in a listening session and the
tracks are oen arranged in a specic sequence. e used playlist
datasets come from two dierent music platforms. e
Art-of-theMix dataset (AotM) was published by [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ] and contains playlists
by music enthusiasts. e 8tracks dataset was shared with us by
the 8tracks platform. A particularity of the 8tracks dataset is that
each public playlist can only contain two tracks per artist.
      </p>
      <p>e dataset statistics are shown in Table 1. e total number
of sessions is larger for the e-commerce datasets. However, the
number of unique items in the music datasets, which corresponds
to the number of tracks included in the playlists or the number of
played tracks in the listening sessions, is higher than the number
of items in e-commerce datasets.
scoresr¹i; sº = wj;i
(3)
2hp://ls13-www.cs.tu-dortmund.de/homepage/rectemp2017
e music sessions are on average longer than the e-commerce
sessions.3 e last row of Table 1 shows the average number of
unique items in each session (“Avg. I/S”). Comparing this value
with the average session length (“Avg. E/S”) indicates what we call
the item repetition rate in each dataset. Including the same track
more than once in a playlist is comparably uncommon. Listening
to a track more than once during a listening session is, however,
common. e dierence between the average session length and
the average number of items in each session for the e-commerce
dataset indicates that re-occurring of the same item in a session is
common in the e-commerce domain.</p>
      <p>
        3.2.3 Evaluation Protocol. e general task of the session-based
recommendation techniques in our experiment is to predict the
nextitem view event in a shopping session or to predict the next track
that is played in a listening session or is included in a playlist. To
evaluate the session-based algorithms, we use the same evaluation
scheme as in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. We incrementally add events to the sessions in
the test set and report the average hit rate (HR), which corresponds
to recall in this evaluation seing, and the mean reciprocal rank
(MRR), which takes the position of the hit into account. We tested
list lengths of 1, 2, 3, 5, 7, 10, 15, and 20. While the experiments
in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] are done without cross-validation, we additionally apply a
vefold sliding-window validation protocol as in [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] to minimize
the risk that the obtained results are specic to the single train-test
split. We, therefore, created ve train-test splits for each dataset.
For the listening logs, we used 3 months of training data and the
next 5 days as the test data and randomized splits for the playlists
as they have no timestamps assigned.
      </p>
    </sec>
    <sec id="sec-6">
      <title>4 RESULTS</title>
    </sec>
    <sec id="sec-7">
      <title>4.1 Accuracy Results</title>
      <p>
        Our rst experiment used the exact same setup as described in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ],
i.e., we use only one training-test split when comparing gru4rec
with our methods. As done in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], we trained the algorithms using
6 months of data containing 7,966,257 sessions of 31,637,239 clicks
on 37,483 items and tested them on the sessions of the next day.
      </p>
      <p>
        In the subsequent sections, we then report the results of our
comparison using the sliding-window validation scheme described
above with recommendation list lengths varying from 1 to 20. In all
experiments, we tuned the parameters for the dierent algorithms
using grid search and optimized for HR@20 on validation sets
(subsets of the training sets). gru4rec was only optimized with
100 layers as done in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] due to the computational complexity of
the method. To test for statistical signicance, we use Wilcoxon
signed-rank test with α = 0:05.
      </p>
      <p>
        4.1.1 Results Using the Original Evaluation Setup. Table 2 shows
the results ordered by the hit rate (HR@20) when using the
original setup. We could reproduce the hit rate and MRR results from
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] (using their optimal parameters) for gru4rec(1000,bpr) and
gru4rec(1000,top1), which use 1000 hidden units and the TOP1 and
BPR’s pairwise ranking loss functions, respectively. In Table 2, we
additionally report the results for recommendation list length ten,
which might be more important for dierent application domains.
0.568
0.545
0.543
0.521
0.517
0.517
      </p>
      <p>RSC
3Note that each session in the TMall dataset is dened as the sequence of actions of a
user during one day which results in relatively larger average session length.
4Remember that the sessions of the TMALL dataset cover the events of one day, as the
time stamps in this dataset are given only in the granularity of days.</p>
      <p>MRR@1 MRR@2
SR AR KNN</p>
      <p>4.1.2 E-commerce Datasets. Figure 1 shows the MRR results for
the algorithms on the two e-commerce datasets, RSC and TMall.
For both datasets, we can observe that most of the frequent paern
methods lead to higher or at least similar MRR values as gru4rec.
ere is, however, no clear “winner” across both datasets. e sr
method works best for the RSC dataset. On the TMALL dataset,
the knn method outperforms the others, an eect which might be
caused by the longer list session lengths for this dataset.4 In both
cases, however, the dierence between the winning method and the
best-performing gru4rec conguration is statistically signicant.
is is indicated by a star symbol in Figure 1.</p>
      <p>4.1.3 Listening Logs Datasets. Figure 2 shows the accuracy
performance of the algorithms on two selected listening logs datasets.
#nowplaying
AotM
0.3 SR</p>
      <p>MRR@1 MRR@2 MRR@3 MRR@5 MRR@7 MRR@10 MRR@15 MRR@20
AR KNN TKNN GRU4REC(100,TOP1) GRU4REC(100,BPR)
30Music
MRR@1 MRR@2 MRR@3 MRR@5 MRR@7 MRR@10 MRR@15 MRR@20
AR KNN TKNN GRU4REC(100,TOP1) GRU4REC(100,BPR)
8tracks *
*
*
Similar to the e-commerce datasets, in all measurements, a frequent
paern approach, namely the sr method, outperforms gru4rec.
Here again, for MRR@20, the recommendations of sr are
signicantly more accurate than the recommendations of gru4rec.
Note that on the music datasets, we apply gru4rec(100,top1) and
gru4rec(100,bpr), which use 100 hidden units and the TOP1 and
BPR’s pairwise ranking loss function, respectively.5</p>
      <p>e tknn method – the time-aware extension of knn– works
always signicantly beer than the knn method on the listening
logs datasets. tknn also outperforms both gru4rec congurations
on the #nowplaying dataset for list lengths larger than 1.</p>
      <p>Another observation on the listening logs datasets is that the
sequence-based approaches (sr, tknn and gru4rec) work
significantly beer than methods that do not consider the temporal
information in data (knn and ar).</p>
      <p>4.1.4 Playlists Datasets. Figure 3 shows the MRR results of the
algorithms on the playlists datasets. On both datasets, the temporal
extension of kNN, tknn, leads to the best results across all
recommendation list sizes and signicantly outperforms both variants of
gru4rec. e performance of all other methods, however, seems
to depend on the specics of the datset. e sr method works
well on both datasets. e relative performance of the ar method,
however, depends on the dataset and the list length at which the
measurement is made.</p>
      <p>One interesting observation that we made for the music datasets
is that the relative performance of knn strongly improves in terms
of the hit rate6 when the recommendation list length is increased.
is can, for example, be seen in Figure 4, which shows the hit rate
results for the #nowplaying dataset. e hit rate of knn on the
#nowplaying dataset that is about 3% for list length one increases
5Repeating the experiments with 1000 hidden layers for the gru4rec methods did not
lead to any beer results on the music datasets.
6Generally, the hit rate results for the experiments, which we do not include here for
space reasons, are similar to the MRR results.</p>
      <p>SR</p>
      <p>MRR@1 MRR@2 MRR@3 MRR@5 MRR@7 MRR@10 MRR@15 MRR@20</p>
      <p>AR KNN TKNN GRU4REC(100,TOP1) GRU4REC(100,BPR)
SR</p>
      <p>AR
to 24% for list length 20. At the same time, the hit rate of some of
the other methods only slightly increases, e.g., from 6% to 15%. As a
result, across all four investigated music datasets, knn outperforms
all other algorithms in terms of HR@20. A similar trend can also
be seen for ar, the other non-sequential approach.</p>
      <p>
        4.1.5 Aggregated Ranking of Algorithms. To determine the
ranking of dierent algorithms based on their accuracy results (MRR@20)
across all six datasets, we applied the Borda Count (BC) rank
aggregation strategy [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. e results show that sr and tknn are both
ranked rst (30 points), followed by ar as the second best algorithm
(20 points). e gru4rec method with TOP1 ranking loss is ranked
third (18 points). Finally, knn and gru4rec with BPR ranking loss
are ranked fourth (15 points) and h (13 points), respectively.
      </p>
      <p>4.1.6 Hybrid Approaches. We conducted a variety of additional
experiments with dierent hybridization methods as described in
Section 3.1.6 to analyze the eect of combining the algorithms. In
general, a weighted combination of the two normalized prediction
scores of a neighborhood-based and a sequence-based method led
to the best results in our experiments. For instance, the combination
of knn and sr with a weight ratio of 3 to 7, wh(knn,sr:0.3,0.7),
outperformed all other individual algorithms on the 30Music dataset.
0.285
0.142
0.275
0.298
0.261
Another example is combining the normalized score of knn and
gru4rec(100,top1), which can outperform other algorithms in
terms of HR@20. e dierences between the winning hybrid
approaches (printed in bold face in Table 3) and the best
performing individual methods in each measurement were statistically
signicant. Similar results were also achieved for the other datasets,
which we do not include here for space reasons.
4.2</p>
    </sec>
    <sec id="sec-8">
      <title>Additional Analyses</title>
      <p>
        Since prediction accuracy might not be the only possible relevant
quality criterion in a domain [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], we made a number of additional
analyses as shown in Figure 5.
      </p>
      <p>
        4.2.1 Popularity Bias and Catalog Coverage. As in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], we rst
measured the average popularity of the top-20 recommendations
of the algorithms to assess possible recommendation biases. e
popularity of an item is computed based on its number of
occurrences in the training dataset. Overall, the recommendations of
non-sequential approaches (knn and ar) shows the highest bias
towards popular items. e sequence-based approaches (sr and
gru4rec), in contrast, recommend comparably less popular items.
      </p>
      <p>Additionally, we analyzed the catalog coverage of each algorithm
by counting the number of dierent items that appear in the top-20
recommendation lists of all sessions in the test set. Overall, the
recommendation lists of gru4rec and sr include more dierent items
than the other algorithms. e recommendations of neighborhood
methods, knn and tknn, on the other hand, focus on smaller sets of
items and show a higher concentration bias. is can be explained
by considering the sampling strategy of knn which focuses on a
smaller subset of the sessions, e.g., those of the last few days.</p>
      <p>4.2.2 Computational Complexity and Memory Usage. We
measured the training time as well as the needed memory and time
to generate recommendations for each algorithm. On a desktop
computer with an Intel i7-4790k processor, training gru4rec on
one split of the RSC dataset with almost 4 million sessions and in
its best conguration takes more than 12 hours. is time can be
reduced to 4 hours when calculations are performed by the GPU
(Nvidia GeForce GTX 960).7 e knn method needs about 27
seconds to build the needed in-memory maps, see Section 3.1.2. e
well-performing sr method needs about 48 seconds to determine
the rule weights. A specic advantage of the laer two methods
is that they support incremental updates, i.e., new events can be
immediately incorporated into the algorithms. Creating one
recommendation list with gru4rec needed, on average, about 12 ms.
knn needs about 26 ms for this task and sr only 3 ms.
7Training the model for 6 month of data using the GPU lasts about 8 hours.
#nowplaying</p>
      <p>RSC</p>
      <p>e raw data used for training the algorithms in this specic
experiment (one split of the RSC dataset) occupies about 540 MB
of main memory. e data structures used for training sr and knn
occupy about 50 MB and 3.2 GB, respectively. e model created
by gru4rec needs about 510 MB. Note that memory demand of
gru4rec depends on the algorithm parameters and signicantly
increases with the number of items. For the music and Tmall datasets,
the memory demand of gru4rec exceeded the capacity of our
graphics card. Running gru4rec using the CPU is multiple times
slower than when a graphics card is used.
5</p>
    </sec>
    <sec id="sec-9">
      <title>CONCLUSION AND FUTURE WORKS</title>
      <p>Our work indicates that comparably simple frequent-paern-based
approaches can represent a comparably strong baseline when
evaluating session-based recommendation problems. At the end, we
could nd at least one paern-based approach that was signicantly
beer than a recent RNN-based method. In particular the sr method
was surprisingly eective, despite the fact that both learning and
applying the rules is very fast.</p>
      <p>
        Our results also indicates that the “winning” strategy seems to
strongly depend on the characteristics of the data sets like average
session lengths or repetition rates. Further research is still required
to understand this relationship. In our future work, we will
investigate the performance of additional session-based algorithms. ese
algorithms include both ones that are based on Markov models, e.g.,
Rendle et al.’s factorized Markov chains [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ], as well as recently
proposed improvements to gru4rec, e.g., by Tan et al. [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]. We
expect that continuously improved RNN-based methods will be
able to outperform the frequent paern based baselines used in the
evaluation reported in this paper. ese methods can, however, be
computationally quite expensive. From a practical perspective, one
has therefore to assess depending on the application domain if the
obtained gains in accuracy justify the usage of these complex
models, which cannot be easily updated online and whose predictions
can be dicult to explain.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Rakesh</given-names>
            <surname>Agrawal</surname>
          </string-name>
          , Tomasz Imielin´ski, and
          <string-name>
            <given-names>Arun</given-names>
            <surname>Swami</surname>
          </string-name>
          .
          <year>1993</year>
          .
          <article-title>Mining Association Rules between Sets of Items in Large Databases</article-title>
          . In SIGMOD '
          <volume>93</volume>
          .
          <fpage>207</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Rakesh</given-names>
            <surname>Agrawal</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ramakrishnan</given-names>
            <surname>Srikant</surname>
          </string-name>
          .
          <year>1995</year>
          .
          <article-title>Mining Sequential Paerns</article-title>
          .
          <source>In ICDE '95</source>
          .
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Trapit</given-names>
            <surname>Bansal</surname>
          </string-name>
          , David Belanger, and
          <string-name>
            <surname>Andrew McCallum</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Ask the GRU: Multi-task Learning for Deep Text Recommendations</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>107</fpage>
          -
          <lpage>114</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>[4] Georay Bonnin and Dietmar Jannach</source>
          .
          <year>2014</year>
          .
          <source>Automated Generation of Music Playlists: Survey and Experiments. ACM Computing Surveys</source>
          <volume>47</volume>
          ,
          <issue>2</issue>
          (
          <year>2014</year>
          ),
          <volume>26</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          :
          <fpage>35</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Junyoung</given-names>
            <surname>Chung</surname>
          </string-name>
          , C¸aglar Gu¨l¸cehre, KyungHyun Cho, and
          <string-name>
            <given-names>Yoshua</given-names>
            <surname>Bengio</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling</article-title>
          .
          <source>CoRR abs/1412</source>
          .3555 (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Paul</given-names>
            <surname>Covington</surname>
          </string-name>
          , Jay Adams, and
          <string-name>
            <given-names>Emre</given-names>
            <surname>Sargin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep Neural Networks for YouTube Recommendations</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>191</fpage>
          -
          <lpage>198</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Sander</given-names>
            <surname>Dieleman</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep Learning for Audio-Based Music Recommendation</article-title>
          .
          <source>In DLRS '16 Workshop</source>
          . 1-
          <fpage>1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Ali</given-names>
            <surname>Mamdouh</surname>
          </string-name>
          <string-name>
            <given-names>Elkahky</given-names>
            ,
            <surname>Yang Song</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Xiaodong</given-names>
            <surname>He</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems</article-title>
          . In WWW '
          <volume>15</volume>
          .
          <fpage>278</fpage>
          -
          <lpage>288</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Je</surname>
            <given-names>rey L.</given-names>
          </string-name>
          <string-name>
            <surname>Elman</surname>
          </string-name>
          .
          <year>1990</year>
          .
          <article-title>Finding Structure in Time</article-title>
          .
          <source>Cognitive Science 14</source>
          ,
          <issue>2</issue>
          (
          <year>1990</year>
          ),
          <fpage>179</fpage>
          -
          <lpage>211</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Peter</given-names>
            <surname>Emerson</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>e Original Borda Count</article-title>
          and
          <string-name>
            <given-names>Partial</given-names>
            <surname>Voting</surname>
          </string-name>
          .
          <source>Social Choice and Welfare</source>
          <volume>40</volume>
          ,
          <issue>2</issue>
          (
          <year>2013</year>
          ),
          <fpage>353</fpage>
          -
          <lpage>358</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Alex</given-names>
            <surname>Graves</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Generating Sequences With Recurrent Neural Networks</article-title>
          .
          <source>CoRR abs/1308</source>
          .0850 (
          <year>2013</year>
          ). hp://arxiv.org/abs/1308.0850
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Alex</surname>
            <given-names>Graves</given-names>
          </string-name>
          , Greg Wayne, and
          <string-name>
            <given-names>Ivo</given-names>
            <surname>Danihelka</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Neural Turing Machines</article-title>
          .
          <source>CoRR abs/1410</source>
          .5401 (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Yupeng</surname>
            <given-names>Gu</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Bo</given-names>
            <surname>Zhao</surname>
          </string-name>
          , David
          <string-name>
            <surname>Hardtke</surname>
            ,
            <given-names>and Yizhou</given-names>
          </string-name>
          <string-name>
            <surname>Sun</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Learning Global Term Weights for Content-based Recommender Systems</article-title>
          . In WWW '
          <volume>16</volume>
          .
          <fpage>391</fpage>
          -
          <lpage>400</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Jiawei</given-names>
            <surname>Han</surname>
          </string-name>
          and
          <string-name>
            <given-names>Micheline</given-names>
            <surname>Kamber</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Data Mining: Concepts and Techniques (Second Edition)</article-title>
          . Morgan Kaufmann.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Negar</surname>
            <given-names>Hariri</given-names>
          </string-name>
          , Bamshad Mobasher, and
          <string-name>
            <given-names>Robin</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Context-Aware Music Recommendation Based on Latent Topic Sequential Paerns</article-title>
          . In RecSys '
          <volume>12</volume>
          .
          <fpage>131</fpage>
          -
          <lpage>138</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16] Bala´zs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and
          <string-name>
            <given-names>Domonkos</given-names>
            <surname>Tikk</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Session-based Recommendations with Recurrent Neural Networks</article-title>
          .
          <source>CoRR abs/1511</source>
          .06939 (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17] Bala´zs Hidasi, Massimo adrana, Alexandros Karatzoglou, and
          <string-name>
            <given-names>Domonkos</given-names>
            <surname>Tikk</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Parallel Recurrent Neural Network Architectures for Feature-rich Session-based Recommendations</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>241</fpage>
          -
          <lpage>248</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Sepp</given-names>
            <surname>Hochreiter</surname>
          </string-name>
          and Ju¨rgen Schmidhuber.
          <year>1997</year>
          .
          <article-title>Long Short-Term Memory</article-title>
          .
          <source>Neural Computation</source>
          <volume>9</volume>
          ,
          <issue>8</issue>
          (Nov.
          <year>1997</year>
          ),
          <fpage>1735</fpage>
          -
          <lpage>1780</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          and
          <string-name>
            <given-names>Gedas</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Recommendations with a Purpose</article-title>
          .
          <source>In RecSys '16</source>
          .
          <fpage>7</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Dietmar</surname>
            <given-names>Jannach</given-names>
          </string-name>
          , Lukas Lerche, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Jugovac</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Adaptation and Evaluation of Recommendations for Short-term Shopping Goals</article-title>
          . In RecSys '
          <volume>15</volume>
          .
          <fpage>211</fpage>
          -
          <lpage>218</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Dietmar</surname>
            <given-names>Jannach</given-names>
          </string-name>
          , Lukas Lerche, and
          <string-name>
            <given-names>Iman</given-names>
            <surname>Kamehkhosh</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Beyond “Hiing the Hits”: Generating Coherent Music Playlist Continuations with the Right Tracks</article-title>
          . In RecSys '
          <volume>15</volume>
          .
          <fpage>187</fpage>
          -
          <lpage>194</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Dietmar</surname>
            <given-names>Jannach</given-names>
          </string-name>
          , Lukas Lerche, Iman Kamehkhosh, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Jugovac</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>What recommenders recommend: an analysis of recommendation biases and possible countermeasures. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted Interaction</surname>
          </string-name>
          (
          <year>2015</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>65</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          and
          <string-name>
            <given-names>Malte</given-names>
            <surname>Ludewig</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Determining Characteristics of Successful Recommendations from Log Data - A Case Study</article-title>
          .
          <source>In SAC '17.</source>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          and
          <string-name>
            <given-names>Malte</given-names>
            <surname>Ludewig</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>When Recurrent Neural Networks meet the Neighborhood for Session-Based Recommendation</article-title>
          . In RecSys
          <year>2017</year>
          .
          <article-title>(forthcoming).</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Donghyun</surname>
            <given-names>Kim</given-names>
          </string-name>
          , Chanyoung Park, Jinoh Oh,
          <string-name>
            <given-names>Sungyoung</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Hwanjo</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Convolutional Matrix Factorization for Document Context-Aware Recommendation</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>233</fpage>
          -
          <lpage>240</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Lukas</surname>
            <given-names>Lerche</given-names>
          </string-name>
          , Dietmar Jannach, and
          <string-name>
            <given-names>Malte</given-names>
            <surname>Ludewig</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>On the Value of Reminders within E-Commerce Recommendations</article-title>
          . In UMAP '
          <volume>16</volume>
          .
          <fpage>27</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Sheng</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Jaya</given-names>
            <surname>Kawale</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Yun</given-names>
            <surname>Fu</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Deep Collaborative Filtering via Marginalized Denoising Auto-encoder</article-title>
          .
          <source>In CIKM '15</source>
          .
          <fpage>811</fpage>
          -
          <lpage>820</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Zachary</surname>
            <given-names>Chase</given-names>
          </string-name>
          <string-name>
            <surname>Lipton</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>A Critical Review of Recurrent Neural Networks for Sequence Learning</article-title>
          .
          <source>CoRR abs/1506</source>
          .00019 (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Julian</surname>
            <given-names>McAuley</given-names>
          </string-name>
          , Christopher Targe,
          <string-name>
            <given-names>Qinfeng</given-names>
            <surname>Shi</surname>
          </string-name>
          , and Anton van den Hengel.
          <year>2015</year>
          .
          <article-title>Image-Based Recommendations on Styles and Substitutes</article-title>
          . In SIGIR '
          <volume>15</volume>
          .
          <fpage>43</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>Brian</surname>
            <given-names>McFee</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gert R. G. Lanckriet</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>e Natural Language of Playlists</article-title>
          . In ISMIR '
          <volume>11</volume>
          .
          <fpage>537</fpage>
          -
          <lpage>542</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Brian</surname>
            <given-names>McFee</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gert R. G. Lanckriet</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Hypergraph Models of Playlist Dialects</article-title>
          . In ISMIR '
          <volume>12</volume>
          .
          <fpage>343</fpage>
          -
          <lpage>348</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Bamshad</surname>
            <given-names>Mobasher</given-names>
          </string-name>
          , Honghua Dai, Tao Luo, and
          <string-name>
            <given-names>Miki</given-names>
            <surname>Nakagawa</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Using Sequential and Non-Sequential Paerns in Predictive Web Usage Mining Tasks</article-title>
          . In ICDM '
          <volume>02</volume>
          .
          <fpage>669</fpage>
          -
          <lpage>672</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>Ozlem</surname>
            <given-names>Ozgobek</given-names>
          </string-name>
          , Jon A.
          <string-name>
            <surname>Gulla</surname>
          </string-name>
          , and
          <string-name>
            <surname>Riza</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Erdur</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>A Survey on Challenges and Methods in News Recommendation</article-title>
          . In WEBIST '
          <volume>14</volume>
          .
          <fpage>278</fpage>
          -
          <lpage>285</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34] Steen Rendle, Christoph Freudenthaler, and
          <string-name>
            <surname>Lars</surname>
          </string-name>
          Schmidt-
          <string-name>
            <surname>ieme</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Factorizing Personalized Markov Chains for Next-basket Recommendation</article-title>
          .
          <source>In WWW '10</source>
          .
          <fpage>811</fpage>
          -
          <lpage>820</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <surname>Guy</surname>
            <given-names>Shani</given-names>
          </string-name>
          , David Heckerman, and
          <string-name>
            <surname>Ronen</surname>
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Brafman</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>An MDP-Based Recommender System</article-title>
          . 
          <source>e Journal of Machine Learning Research 6 (Dec</source>
          .
          <year>2005</year>
          ),
          <fpage>1265</fpage>
          -
          <lpage>1295</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <surname>Yang</surname>
            <given-names>Song</given-names>
          </string-name>
          , Ali Mamdouh Elkahky, and
          <string-name>
            <given-names>Xiaodong</given-names>
            <surname>He</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Multi-Rate Deep Learning for Temporal Recommendation</article-title>
          . In SIGIR '
          <volume>16</volume>
          .
          <fpage>909</fpage>
          -
          <lpage>912</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <surname>Florian</surname>
            <given-names>Strub</given-names>
          </string-name>
          , Romaric Gaudel, and Je´re´mie Mary.
          <year>2016</year>
          .
          <article-title>Hybrid Recommender System based on Autoencoders</article-title>
          .
          <source>In DLRS '16 Workshop</source>
          . 11-
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>Yong</given-names>
            <surname>Kiam</surname>
          </string-name>
          <string-name>
            <given-names>Tan</given-names>
            , Xinxing
            <surname>Xu</surname>
          </string-name>
          , and Yong Liu.
          <year>2016</year>
          .
          <article-title>Improved Recurrent Neural Networks for Session-based Recommendations</article-title>
          .
          <source>In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems (DLRS '16)</source>
          . ACM,
          <volume>17</volume>
          -
          <fpage>22</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>Maryam</given-names>
            <surname>Tavakol</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ulf</given-names>
            <surname>Brefeld</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Factored MDPs for Detecting Topics of User Sessions</article-title>
          . In RecSys '
          <volume>14</volume>
          .
          <fpage>33</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <surname>Roberto</surname>
            <given-names>Turrin</given-names>
          </string-name>
          , Andrea Condorelli, Paolo Cremonesi, Roberto Pagano, and Massimo adrana.
          <year>2015</year>
          .
          <article-title>Large Scale Music Recommendation</article-title>
          .
          <source>In LSRS 2015 Workshop</source>
          at ACM RecSys.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <surname>Roberto</surname>
            <given-names>Turrin</given-names>
          </string-name>
          , Massimo adrana, Andrea Condorelli, Roberto Pagano, and
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Cremonesi</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>30Music Listening and Playlists Dataset</article-title>
          .
          <source>In Poster Proceedings RecSys '15.</source>
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <source>[42] Bart lomiej Twardowski</source>
          .
          <year>2016</year>
          .
          <article-title>Modelling Contextual Information in SessionAware Recommender Systems with Neural Networks</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>273</fpage>
          -
          <lpage>276</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <surname>Flavian</surname>
            <given-names>Vasile</given-names>
          </string-name>
          , Elena Smirnova, and
          <string-name>
            <given-names>Alexis</given-names>
            <surname>Conneau</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Meta-Prod2Vec: Product Embeddings Using Side-Information for Recommendation</article-title>
          . In RecSys '
          <volume>16</volume>
          .
          <fpage>225</fpage>
          -
          <lpage>232</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>Koen</given-names>
            <surname>Verstrepen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Bart</given-names>
            <surname>Goethals</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Unifying Nearest Neighbors Collaborative Filtering</article-title>
          . In RecSys '
          <volume>14</volume>
          .
          <fpage>177</fpage>
          -
          <lpage>184</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <surname>Jeroen</surname>
            <given-names>B. P.</given-names>
          </string-name>
          <string-name>
            <surname>Vuurens</surname>
          </string-name>
          , Martha Larson, and
          <string-name>
            <surname>Arjen P. de Vries</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Exploring Deep Space: Learning Personalized Ranking in a Semantic Space</article-title>
          .
          <source>In DLRS '16 Workshop</source>
          . 23-
          <fpage>28</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <surname>Hao</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Naiyan</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <surname>Dit-Yan Yeung</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Collaborative Deep Learning for Recommender Systems</article-title>
          . In KDD '
          <volume>15</volume>
          .
          <fpage>1235</fpage>
          -
          <lpage>1244</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <surname>Ghim-Eng</surname>
            <given-names>Yap</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiao-Li Li</surname>
            ,
            <given-names>and Philip S.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Eective Next-items Recommendation via Personalized Sequential Paern Mining</article-title>
          .
          <source>In DASFAA'12</source>
          . Berlin, Heidelberg,
          <fpage>48</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <surname>Eva</surname>
            <given-names>Zangerle</given-names>
          </string-name>
          , Martin Pichl, Wolfgang Gassler, and Gu¨nther Specht.
          <year>2014</year>
          . #
          <article-title>Nowplaying Music Dataset: Extracting Listening Behavior from Twier</article-title>
          .
          <source>In WISMM '14 Workshop</source>
          at MM '
          <volume>14</volume>
          .
          <fpage>21</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <surname>Yuyu</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Hanjun Dai, Chang Xu, Jun Feng, Taifeng Wang, Jiang Bian,
          <string-name>
            <given-names>Bin</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <surname>Tie-Yan Liu</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Sequential Click Prediction for Sponsored Search with Recurrent Neural Networks</article-title>
          .
          <source>In AAAI '14</source>
          .
          <fpage>1369</fpage>
          -
          <lpage>1375</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>