=Paper=
{{Paper
|id=Vol-1922/paper7
|storemode=property
|title=Accurate and Diverse Recommendation based on Users’ Tendencies toward Temporal Item Popularity
|pdfUrl=https://ceur-ws.org/Vol-1922/paper7.pdf
|volume=Vol-1922
|authors=Koki Nagatani,Masahiro Sato
|dblpUrl=https://dblp.org/rec/conf/recsys/NagataniS17
}}
==Accurate and Diverse Recommendation based on Users’ Tendencies toward Temporal Item Popularity==
Accurate and Diverse Recommendation based on Users’ Tendencies toward Temporal Item Popularity Koki Nagatani Masahiro Sato Fuji Xerox Co., Ltd. Fuji Xerox Co., Ltd. 6-1 Minatomirai, Nishi-ku, Yokohama, Japan 6-1 Minatomirai, Nishi-ku, Yokohama, Japan nagatani.koki@fujixerox.co.jp sato.masahiro@fujixerox.co.jp ABSTRACT To produce personalized recommendations, collaborative filtering Popularity bias is a phenomenon associated with collaborative fil- (CF) is a widely used approach. The CF approach produces items tering algorithms, in which popular items tend to be recommended for a target user using data compiled from observations of users over unpopular items. As the appropriate level of item popular- with similar preferences as the target user [9]. The CF approach ity differs depending on individual users, a user-level modification is categorized into two types: neighborhood-based CF [12, 13] and approach can produce diverse recommendations while improving model-based CF [5, 11]. The standard approach of model-based CF the recommendation accuracy. However, there are two issues with is a matrix factorization (MF)-based approach, which character- conventional user-level approaches. First, these approaches do not izes both items and users by vectors of latent factors inferred from isolate users’ preferences from their tendencies toward item pop- user feedback [5, 11]. In most cases, model-based CF is superior to ularity clearly. Second, they do not consider temporal item pop- neighborhood-based CF in terms of accuracy. ularity, although item popularity changes dynamically over time In the CF approach, it has been noted that popular items tend in reality. In this paper, we propose a novel approach to counter- to be recommended more often [15, 21]. This is known as pop- act the popularity bias, namely, matrix factorization based collab- ularity bias and various solutions have been proposed to tackle orative filtering incorporating individual users’ tendencies toward this problem [3, 4, 7, 10, 21]. These solutions are classified into item popularity. Our model clearly isolates users’ preferences from two types according to the level of modification: global-level and their tendencies toward popularity. In addition, we consider the user-level. Global-level solutions modify their recommendations temporal item popularity and incorporate it into our model. Ex- for all users uniformly by avoiding recommending popular items perimental results using a real-world dataset show that our model [3, 7, 21]. In reality, however, the appropriate level of modification improve both accuracy and diversity compared with a baseline al- differs depending on the user: some users are likely to select popu- gorithm in both static and time-varying models. Moreover, our lar items, while others tend to seek new or niche items. Therefore, model outperforms conventional approaches in terms of accuracy user-level modification approaches, in which the degree of modifi- with the same diversity level. Furthermore, we show that our pro- cation varies according to individual users’ popularity tendencies, posed model recommends items by capturing users’ tendencies to- have been proposed [4, 10]. ward item popularity: it recommends popular items for the user However, there are two issues in conventional user-level ap- who likes popular items, while recommending unpopular items for proaches. First, these approaches do not isolate users’ preferences those who don’t like popular items. from their popularity tendencies clearly. Second, although item popularity changes dynamically over time in reality, these approaches CCS CONCEPTS do not consider temporal item popularity. In general, incorporat- ing temporal item popularity into models improves the recom- • Information systems → Personalization; Recommender sys- mendation accuracy. Moreover, to counteract popularity bias, es- tems; pecially in user-level solutions, incorporating temporal item pop- ularity is important because the reasons of users’ behaviors are KEYWORDS considered different depending on their purchase time even if they popularity bias, temporal information, personalized recommenda- purchase same items. To the best of our knowledge, however, there tion is no approach considering temporal item popularity in the field of counteraction against popularity bias. In this paper, we propose a novel approach to tackle the popular- 1 INTRODUCTION ity bias, namely, MF-based CF incorporating item popularity ori- Recommender systems help users to access the specific informa- entation of individual users. Our model isolates users’ preference tion that they seek from a huge amount of data. Accurate recom- from their tendencies toward item popularity clearly. We also con- mendations lead to an increase in customers’ purchases or con- sider temporal item popularity and incorporate it into our model. sumption; hence, there is a need for more efficient recommender To verify the efficacy of the proposed model, we conducted exper- systems that produce personalized content for individual users. iments using a real-world dataset. The experimental results show that our model improves both accuracy and diversity compared TempRRS ’17, August 2017, Como, Italy with a baseline algorithm in both static and time-changing mod- Copyright © 2017 for this paper by its authors. Copying permitted for private and els. Moreover, our model outperforms conventional approaches in academic purposes. TempRRS ’17, August 2017, Como, Italy Koki Nagatani and Masahiro Sato terms of accuracy with the same diversity level. We also demon- the time on Netflix data. Since then, several models that consider strate that our proposed model recommends items by capturing temporal dynamics using MF [1, 20] or deep learning methods users’ tendencies toward item popularity: it recommends popu- [16, 19] have been proposed. In user-level approaches to popular- lar items to users who like popular items, and unpopular items to ity bias, temporal item popularity needs to be considered to cap- those who do not like popular items. ture individual users’ tendencies toward item popularity. This is We summarize the main contribution of this paper as follows: because the reasons for purchasing items in case of users having • Our model isolates users’ preferences from their tendencies multiple interactions with the same items may be different depend- toward item popularity clearly. ing on the interaction time: some users purchase items because • We consider temporal item popularity in the field of coun- the items are popular, and some users purchase items because the teraction against popularity bias. items match the users’ preferences. To our knowledge, however, • We conduct experiments using a real-world dataset to verify there is no approach that considers temporal aspects in the field of the efficacy of the proposed model. popularity bias. 2 RELATED WORK 3 OUR MODEL 2.1 Popularity Bias In this section, we present our MF-based model that incorporates Popularity bias is a phenomenon of existing recommendation al- individual users’ tendencies toward item popularity. We focus on gorithms in which popular items tend to be recommended over situations where personalized top-N recommendations are pro- unpopular items. To tackle this problem, several approaches have duced based on users’ implicit feedback (e.g. views, clicks, pur- been proposed [3, 4, 7, 10, 21]. These approaches are classified into chases, etc.). two types according to the level of modification: global-level and user-level. 3.1 Modeling Individual Users’ Tendencies Global-level approaches modify their recommendations for all toward Temporal Item Popularity users uniformly [3, 7, 21]. Most methods avoid recommending pop- In MF, both items and users are characterized by vectors of latent ular items by weighting according to item popularity. In global- factors derived from explicit feedback (e.g. ratings) as well as im- level approaches, the evaluation metrics such as diversity and nov- plicit feedback. The basic model of MF with item bias is formulated elty improve at the cost of a decline in accuracy. Generally, the ap- as follows: propriate level of modification differs depending on the user: some users are likely to select popular items, some tend to seek new or x̂ui = bi0 + fuT fi , (1) niche items, and some select an item irrespective of its popularity. However, global-level approaches do not consider such individual where x̂ui is the prediction score of preference of user u toward differences. item i, bi0 is an item-specific bias which represents item popular- User-level approaches consider these differences and then mod- ity, and fu and fi are k-dimensional vectors of latent factors of ify their recommendations depending on the individual user’s ten- user u and item i, respectively. The inner product fuT fi achieves a dencies toward item popularity. Therefore, user-level approaches high value when both user and item vectors are similar. Further- possibly improve both diversity and accuracy simultaneously. The more, item bias bi increases when an item is popular. The predic- conventional user-level approaches proposed in [4, 10] attempt to tion score is determined by their aggregation. re-rank recommendation lists by post sampling based on users’ If item bias bi values are extremely high, the item is recom- past behavior in terms of popularity. However, users’ preferences mended regardless of whether users like it or not. Hence, recom- and their tendencies toward item popularity might be mixed in mendation systems tend to recommend these items, which leads these approaches for two reasons. First, before reranking, the rec- to popularity biased recommendation. A simple solution for this ommendation lists are created by existing CF models. During the problem is to penalize items according to the item popularity. How- creation process, these models mix users’ preference and item pop- ever, preference toward popular or unpopular items varies for each ularity. Second, popularity tendency distributions are created based user. Considering this, the solution is not suitable for users who on users’ past actions. As users’ past actions are mainly derived like popular items. Therefore, the penalization of popularity needs from the users’ preferences and items’ popularity, these aspects are to be changed depending on the users’ popularity tendencies. also included when creating the distribution. Therefore, these ap- Moreover, the users’ popularity tendencies should be consid- proaches do not isolate user preferences from their popularity ten- ered along with the items’ temporal aspects for two reasons. Firstly, dencies clearly. Our solution overcomes the above issue by mod- item popularity changes dynamically over time in the real world eling users’ popularity tendencies directly, as described in Section for various reasons [17]. Secondly, the reasons for purchasing items 3. in case of users having multiple interactions with the same items may be different depending on the interaction time. 2.2 CF with Temporal Aspects Therefore, we develop a model to incorporate both users’ popu- Incorporating temporal aspects into CF has been investigated, par- larity tendencies and items’ temporal popularity, which is formu- ticularly for developing accurate recommendation algorithms. For lated as follows: example, [6] proposed a matrix factorization model that considered x̂ui = (bi0 + bi (t))(1 + дu ) + fuT fi , (2) temporal dynamics and achieved state-of-the-art performance at Accurate and Diverse Recommendation based on Users’ Tendencies toward Temporal Item Popularity TempRRS ’17, August 2017, Como, Italy where дu is the user-specific parameter of popularity tendency and period. The top-N prediction precision is defined as: bi (t) is the time-varying item bias at the period of time t. The pa- 1 ∑ 1 ∑ |Iu pred rameters, bi0 , bi (t), дu , fu , and fi , are learned by optimization. (t) ∩ Iutrue (t)| Precision@N = , The дu value works as the balancing parameter between the |U | |Tu | N u ∈U t ∈Tu item popularity and preference toward the item. When the дu value of a user u is greater than zero, the user prefers popular items to pred where |Iu (t)| = N , U is the set of users in the testing set and unpopular items. High дu values indicated that the user may sim- Tu is the set of the period of time when interactions of user u are ply prefer popular items without regard to his/her item preference. observed. Similarly, the top-N prediction recall is defined as: Conversely, when it is less than minus one, the user prefers unpop- 1 ∑ 1 ∑ |Iu ular items to popular items. pred (t) ∩ Iutrue (t)| As mentioned in Section 2.1, users’ preferences and their ten- Recall@N = . |U | |Tu | |Iutrue (t)| dencies toward item popularity are mixed in conventional user- u ∈U t ∈Tu level approaches. In contrast, our model resolves the confusion by The top-N item coverage applies to all the output that a recom- modeling as in Eq. 2: the first term represents item popularity and mender system produces for a set of users. This metric is also called users’ popularity tendencies, and the second term represents item the top-N aggregate diversity. In our experiment, this metric is de- feature and users’ preference. Therefore, our model captures these fined as: features separately. ∑ ∪ t ∈T | u ∈Ut Ru | Coverage@N = , 3.2 Model Learning |T | Our model formulated in Eq. 2 can be learned by applying exist- where Ut is a set of users whose interactions are observed at a ing optimization methods, such as point-wise and pair-wise op- period of time t and Ru is the recommendation lists for user u, and timization. For example, for point-wise optimization, root mean the length of the lists is N . square error (RMSE), which is used in Biased-MF [5], and alter- nating least squares, which is used in weighted regularize matrix 4.3 Comparison of Methods factorization [2] can be applied to our model. For pair-wise op- To examine the performance of our proposed methods, we com- timization, area under the curve (AUC) in Bayesian personalized pared them with conventional approaches. For the optimization ranking [11], mean reciprocal rank used in collaborative less-is- of our methods and base models of conventional approaches, we more filtering (CLiMF) [14], and weighted approximately ranked selected the Bayesian personalized ranking (BPR) procedure [11], pairwise loss proposed in [18] can be applied to our models. which is one of the state-of-the-art methods for personalized item recommendation. The model of BPR matrix factorization (BPRMF) 4 EXPERIMENTS is formulated in Eq. 1. For the baseline of the conventional methods In this section, we conduct experiments using a real-world dataset that consider temporal aspects, we extend BPRMF incorporating to verify the efficacy of the proposed model. temporal item popularity, which is called BPRMF(t). Personal Popularity Tendency Matching (PPTM) [10] is a greedy 4.1 Dataset re-ranking method that considers an individual’s personal popu- We used the Amazon.com Movies and TVs dataset [8] in our exper- larity tendency (PPT). It balances novelty and user preference by iment. We utilized a subset from 2013, defined the period of time t matching the PPT of a recommendation to that of the users mea- as monthly, and binarized the data treating reviewed items as rel- sured by earth movers distance (EMD), which is a distance metrics evant and non-reviewed items as irrelevant. Due to the sparsity of between two distributions. the dataset, we preprocessed it by retaining the top 10, 000 items Personalized Ranking Adaptation (PRA) [4] is a versatile greedy and discarding data of users having less than 10 interactions. Af- re-ranking method that considers an individual user tendency suit- ter the preprocessing, the total number of users was 4, 997 and the able for multiple optimization goals. In our experiments, the opti- dataset contained 90, 341 interactions for 9, 221 items. mization target is set to EMD. BPRMF(t)-pop is the method proposed by this paper in Eq. 2. BPRMF-pop is the model that removes temporal item popularity 4.2 Evaluation Metrics from Eq. 2. In our experiments, we performed five-fold cross validation and To model PPT, the discrete distribution of the binned popularity aggregated the results. First, we randomly selected 80% of observed values of the items is required. In our experiments, we defined the feedback as a training set to train models, and the remaining 20% item popularity of the recommendations as the number of item oc- as the testing set for the trained models. To measure the perfor- currences in the top-N recommendation lists for the active users. mance, we used three evaluation metrics: the top-N prediction pre- We used a log-scaled popularity histogram for discrete distribu- cision (Precision@N), the top-N prediction recall (Recall@N), and tion. The parameters of all models were tuned so as to maximize pred the top-N item coverage (Coverage@N). We set Iu (t) as the pre- the accuracy metrics. In the case of conventional approaches, it is dicted items of user u over a certain period of time t, and Iutrue (t) known that a higher coverage setting reduces accuracy. Hence, we as the true list in the testing set. Prediction is performed for each selected the parameter value for which the coverage score became period of time t, and each user’s scores are aggregated over each close to that of our model. TempRRS ’17, August 2017, Como, Italy Koki Nagatani and Masahiro Sato Table 1: Precision@10, Recall@10, and Coverage@10 scores Table 2: The bi (t) and preference score of Top-5 Recommen- on Amazon.com Movies and TVs datasets (#factors = 300). dation of BPRMF(t)-pop for a user at time t. Actual user be- havior (relevant recommendation) is bolded. Ranking is cal- Methods Precision@10 Recall@10 Coverage@10 culated based on the training set. "-" means the item is not BPRMF 0.01349 0.07492 0.6464 in the training set. +PPTM (c = 0.1) 0.01348 0.07487 0.6509 (a) Item popularity orientation score дu = 0.69, a user likes popular items. +PRA (X u =5) 0.01274 0.07012 0.7029 BPRMF-pop 0.01359 0.07550 0.6528 TopN t =3 t = 12 items BPRMF(t) 0.01521 0.09314 0.4939 bi (t ) (#rank) Pref. score bi (t ) (#rank) Pref. score +PPTM (c = 1) 0.01504 0.09047 0.5634 1 4.17 (1) 0.06 4.63 (4) 0.041 +PRA (X u =5) 0.01308 0.07764 0.5605 2 3.95 (3) 0.02 4.33 (1) -0.017 BPRMF(t)-pop 0.01603 0.09569 0.5749 3 3.73 (4) 0.02 4.14 (3) -0.017 4 3.62 (14) 0.10 4.10 (6) -0.028 5 3.67 (8) -0.02 4.04 (7) 0.012 (b) Item popularity orientation score дu = −1.19, a user selects items which match user’s preferences. TopN t =2 t =3 items bi (t ) (#rank) Pref. score bi (t ) (#rank) Pref. score 1 0.28 (1458) 2.29 -1.16 (1123) 2.60 2 -0.44 (1458) 1.98 1.70 (169) 2.83 3 -0.72 (-) 1.76 -1.13 (-) 1.69 4 -2.13 (1458) 1.47 1.75 (492) 2.17 (a) дu vs. Precision@10 (b) дu vs. Recall@10 5 0.36 (1458) 1.90 0.25 (492) 1.88 Figure 1: Plots of дu values versus Precision@10 and Re- call@10. Each point is the average of evaluation metrics the examples that our model recommends popular items for the with regards to the average of дu values of 100 users in de- user who likes popular items and vice versa. bi (t) score represents scending order of the дu value. item popularity at the period of time t and actual users’ purchase is shown in bold in Table 2. As can be seen from the user’s pur- 4.4 Experimental Results chase behavior shown in Table 2-(a), the user tends to purchase Table 1 shows the results of the comparison between our method popular items. Our model learned such purchase behavior from and conventional approaches. The number of latent factors was the user’s past purchases, and then evaluated the дu value of the set to 300 and the number of items in a recommendation list to user as 0.69, which means that the user likes popular items. Our 10. In general, time-aware models improve accuracy and reduce model produced popular items for the user, which were ranked coverage compared with static models. Our model improved both in the top 20. On the other hand, the user in Table 2-(b) selected accuracy and diversity compared with the baseline in both static items that match the user’s preference without regard to the items’ and time-changing models. Particularly in case of time-varying popularity. Our model captured the tendency from the user’s past models, our model achieved significant improvement. This indi- purchases and evaluated the user’s дu value as −1.19. Our model cates that considering temporal item popularity is essential to cap- recommended items that match the user’s preference regardless ture users’ tendencies. Our model outperformed conventional ap- of their popularity for the user. The preference scores are all high, proaches in terms of accuracy with the same diversity level. There- while the items’ rankings are various. Therefore, these results indi- fore, our model effectively captures users’ preference and their ten- cate that our model captured the users’ popularity tendencies and dencies toward item popularity. recommended personalized items appropriately. We suppose that the interactions of users with mainstream tastes are easy to predict. As our model isolates users’ preference from 5 CONCLUSIONS their tendencies toward item popularity, we can verify the idea In this paper, we proposed a novel approach for counteracting pop- by analyzing the distribution of accuracy depending on the mag- ularity bias, using MF-based CF incorporating individual users’ nitude of дu values. Figure 1 shows the plots of дu values versus tendencies toward temporal item popularity. Our model isolated two evaluation metrics: Precision@10 and Recall@10; each point users’ preference from popularity tendency clearly, and considered is the average of evaluation metrics with regards to the average of temporal item popularity. The experimental results based on a real- дu values of 100 users in descending order of the дu value. As can world dataset showed the efficacy of our model. be seen from Fig. 1, both Precision@10 and Recall@10 of the users In future work, we plan to further verify the effectiveness of our who have large дu values are high. Therefore, this result supports proposed model by using various datasets in different domains or our assumption. by learning other optimization methods for top-N recommenda- We also investigated the relation between users’ purchase be- tion. Moreover, as well as item popularity, users’ tendencies toward havior, which corresponds to their tendencies toward temporal item popularity may change over time. We plan to investigate this item popularity, and our model’s recommendations. Table 2 shows temporal phenomenon. Accurate and Diverse Recommendation based on Users’ Tendencies toward Temporal Item Popularity TempRRS ’17, August 2017, Como, Italy REFERENCES https://doi.org/10.1145/2507157.2507210 [1] Ruining He, Chen Fang, Zhaowen Wang, and Julian McAuley. 2016. Vista: A [19] Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J. Smola, and How Jing. Visually, Socially, and Temporally-aware Model for Artistic Recommendation. 2017. Recurrent Recommender Networks. In Proceedings of the Tenth ACM In- In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys ’16). ternational Conference on Web Search and Data Mining (WSDM ’17). ACM, New ACM, New York, NY, USA, 309–316. https://doi.org/10.1145/2959100.2959152 York, NY, USA, 495–503. https://doi.org/10.1145/3018661.3018689 [2] Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative Filtering for [20] Hsiang-Fu Yu, Nikhil Rao, and Inderjit S Dhillon. 2016. Temporal Regular- Implicit Feedback Datasets. In Proceedings of the 2008 Eighth IEEE International ized Matrix Factorization for High-dimensional Time Series Prediction. In Ad- Conference on Data Mining (ICDM ’08). IEEE Computer Society, Washington, DC, vances in Neural Information Processing Systems 29, D. D. Lee, M. Sugiyama, U. V. USA, 263–272. https://doi.org/10.1109/ICDM.2008.22 Luxburg, I. Guyon, and R. Garnett (Eds.). Curran Associates, Inc., 847–855. [3] Dietmar Jannach, Lukas Lerche, Iman Kamehkhosh, and Michael Jugovac. 2015. [21] Xiangyu Zhao, Zhendong Niu, and Wei Chen. 2013. Opinion-based Collabora- What Recommenders Recommend: An Analysis of Recommendation Biases and tive Filtering to Solve Popularity Bias in Recommneder Systems. In 24th Interna- Possible Countermeasures. User Modeling and User-Adapted Interaction 25, 5 tional Conference on Database and Expert Systems Applications (DEXA). 426–433. (Dec. 2015), 427–491. https://doi.org/10.1007/s11257-015-9165-3 [4] Michael Jugovac, Dietmar Jannach, and Lukas Lerche. 2017. Efficient Opti- mization of Multiple Recommendation Quality Factors According to Individ- ual User Tendencies. Expert Syst. Appl. 81, C (Sept. 2017), 321–331. https: //doi.org/10.1016/j.eswa.2017.03.055 [5] Yehuda Koren. 2008. Factorization Meets the Neighborhood: A Multifaceted Col- laborative Filtering Model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’08). ACM, New York, NY, USA, 426–434. https://doi.org/10.1145/1401890.1401944 [6] Yehuda Koren. 2009. Collaborative Filtering with Temporal Dynamics. In Pro- ceedings of the 15th ACM SIGKDD International Conference on Knowledge Dis- covery and Data Mining (KDD ’09). ACM, New York, NY, USA, 447–456. https: //doi.org/10.1145/1557019.1557072 [7] Qiuxia Lu, Tianqi Chen, Weinan Zhang, Diyi Yang, and Yong Yu. 2012. Serendipi- tous Personalized Ranking for Top-N Recommendation. In Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intel- ligent Agent Technology - Volume 01 (WI-IAT ’12). IEEE Computer Society, Wash- ington, DC, USA, 258–265. http://dl.acm.org/citation.cfm?id=2457524.2457692 [8] Julian McAuley, Christopher Targett, Qinfeng Shi, and Anton van den Hengel. 2015. Image-Based Recommendations on Styles and Substitutes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’15). ACM, New York, NY, USA, 43–52. https: //doi.org/10.1145/2766462.2767755 [9] Xia Ning, Christian Desrosiers, and George Karypis. 2015. A Comprehensive Survey of Neighborhood-Based Recommendation Methods, In Recommender Systems Handbook. Springer, New York City, New York. [10] Jinoh Oh, Sun Park, Hwanjo Yu, Min Song, and Seung-Taek Park. 2011. Novel Recommendation Based on Personal Popularity Tendency. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining (ICDM ’11). IEEE Com- puter Society, Washington, DC, USA, 507–516. https://doi.org/10.1109/ICDM. 2011.110 [11] Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt- Thieme. 2009. BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial In- telligence (UAI ’09). AUAI Press, Arlington, Virginia, United States, 452–461. http://dl.acm.org/citation.cfm?id=1795114.1795167 [12] Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2000. Analysis of Recommendation Algorithms for e-Commerce. In Proceedings of the 2Nd ACM Conference on Electronic Commerce (EC ’00). ACM, New York, NY, USA, 158–167. https://doi.org/10.1145/352871.352887 [13] Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2001. Item- based Collaborative Filtering Recommendation Algorithms. In Proceedings of the 10th International Conference on World Wide Web (WWW ’01). ACM, New York, NY, USA, 285–295. https://doi.org/10.1145/371920.372071 [14] Yue Shi, Alexandros Karatzoglou, Linas Baltrunas, Martha Larson, Nuria Oliver, and Alan Hanjalic. 2012. CLiMF: Learning to Maximize Reciprocal Rank with Collaborative Less-is-more Filtering. In Proceedings of the Sixth ACM Conference on Recommender Systems (RecSys ’12). ACM, New York, NY, USA, 139–146. https: //doi.org/10.1145/2365952.2365981 [15] Harald Steck. 2010. Training and Testing of Recommender Systems on Data Missing Not at Random. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’10). ACM, New York, NY, USA, 713–722. https://doi.org/10.1145/1835804.1835895 [16] Yong Kiam Tan, Xinxing Xu, and Yong Liu. 2016. Improved Recurrent Neural Networks for Session-based Recommendations. In Proceedings of the 1st Work- shop on Deep Learning for Recommender Systems (DLRS 2016). ACM, New York, NY, USA, 17–22. https://doi.org/10.1145/2988450.2988452 [17] Yusuke Tanaka, Takeshi Kurashima, Yasuhiro Fujiwara, Tomoharu Iwata, and Hiroshi Sawada. 2016. Inferring Latent Triggers of Purchases with Considera- tion of Social Effects and Media Advertisements. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining (WSDM ’16). ACM, New York, NY, USA, 543–552. https://doi.org/10.1145/2835776.2835789 [18] Jason Weston, Hector Yee, and Ron J. Weiss. 2013. Learning to Rank Recommen- dations with the K-order Statistic Loss. In Proceedings of the 7th ACM Confer- ence on Recommender Systems (RecSys ’13). ACM, New York, NY, USA, 245–248.