<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ComplexRec</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Review-Based Cross-Domain Collaborative Filtering: A Neural Framework</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thanh-Nam Doan</string-name>
          <email>tdoan@albany.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shaghayegh Sahebi</string-name>
          <email>ssahebi@albany.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University at Albany - SUNY</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>20</volume>
      <abstract>
        <p>Cross-domain collaborative filtering recommenders exploit data from other domains (e.g., movie ratings) to predict users' interests in a different target domain (e.g., suggesting music). Most current cross-domain recommenders focus on modeling user ratings but pay limited attention to user reviews. Additionally, due to the complexity of these recommender systems, they cannot provide any information to users to support user decisions. To address these challenges, we propose the Deep Hybrid Cross Domain (DHCD) model, a cross-domain neural framework that can simultaneously predict user ratings and provide useful information to strengthen the suggestions and support user decisions across multiple domains. Specifically, DHCD enhances the predicted ratings by jointly modeling two crucial facets of users' product assessment: ratings and reviews. To support decisions, it models and provides natural review-like sentences across domains according to user interests and item features. This model is robust in integrating user rating and review information from more than two domains. Our extensive experiments show that DHCD can significantly outperform advanced baselines in rating prediction and review generation tasks. For rating prediction tasks, it outperforms cross-domain and single-domain collaborative filtering as well as hybrid recommender systems. Furthermore, our review generation experiments suggest an improved perplexity score and transfer of review information in DHCD.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <p>• Information systems → Recommender systems; • Computing methodologies → Neural networks.</p>
      <p>Keywords: Cross-domain collaborative filtering, neural network, hybrid collaborative filtering</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>
        Nowadays, users are overwhelmed by the number of choices online.
Recommender systems are increasingly used as an essential tool
to alleviate this problem. Despite improvements in recommender
systems, many of them still suffer from problems, including
cold-start [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and difficulty in explaining their suggestions [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ].
Moreover, collaborative filtering recommenders [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] cannot use obvious
feature-based relations between users and items. Content-based
approaches cannot capture deeper social or semantic similarities
between users and items, nor can they suggest novel items (outside
the scope of user profile features) to users [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        Two major approaches to address some of these problems are
hybrid [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and cross-domain [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] recommender systems. Hybrid
recommender systems merge content-based and collaborative
filtering approaches to provide higher-quality recommendations. Some
hybrid recommender systems jointly model user ratings and
reviews to introduce a more sophisticated view of user interests and
item features, which leads to improved recommendation results [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        The idea behind cross-domain recommendation systems is to
share useful information across two or more domains to improve
recommendation results [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. They work by transferring
information from one or more source or auxiliary domains to suggest useful
items in a target domain. In particular, when user history in the target
domain (e.g., books) does not provide enough information about
user interests, user preferences in another source domain (e.g.,
movies) can provide useful insights that can lead to more accurate
or novel recommendations¹. In addition to improving
recommendation results, cross-domain recommender algorithms provide a
solution to problems, such as cold-start or user profiling, in
single-domain recommenders.
      </p>
      <p>
        Both hybrid and cross-domain recommender systems have been
shown to be successful in the current literature. However, a combination
of the two has rarely been studied. Additionally, the problem of
providing more information to users to support their decisions in
cross-domain recommender systems has not been studied. Most
current research in cross-domain recommenders focuses on
collaborative filtering cross-domain approaches [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. These approaches
incorporate users’ explicit (e.g., rating) or implicit (e.g., purchase)
feedback in the auxiliary domain to recommend items in the target
domain. Many of these algorithms jointly model multiple domains
by sharing common user latent representations across them.
Collaborative filtering cross-domain recommenders, similar to their
single-domain counterparts, suffer from ignoring content
information. Advanced models built only on users’ rating
or binary feedback make it hard to reason about why a specific
user may be interested in an item. Moreover, these recommender
algorithms lose the explicit user-item similarities by ignoring an
important source of information: user reviews.
      </p>
      <p>To further enhance the performance and transparency of
cross-domain recommendation systems, we propose to combine hybrid
and cross-domain approaches. With this fusion, we can
benefit from the strengths of both hybrid and cross-domain
recommender systems: cross-domain modeling will enhance user latent
features by providing extra information from other domains
(especially in sparser ones), while reviews will bring another dimension
for enriching user and item latent features and offer insights to
increase recommendation transparency. Therefore, merging
the two will enrich content features by using review information
across domains as well as enhance prediction performance.
¹While other definitions of domain exist in the literature, e.g., time-based domains, in
this paper we focus on item domains (e.g., item type or category).</p>
      <p>
        Accordingly, we propose Deep Hybrid Cross Domain (DHCD)
recommender, which models various types of user feedback (both
ratings and reviews) across multiple domains under a neural
network framework. We use neural networks as a natural choice to
model reviews due to their success in natural language processing
and generating natural language sentences [
        <xref ref-type="bibr" rid="ref26 ref5">5, 26</xref>
        ]. In addition to
using reviews for producing better-quality suggestions, DHCD can
generate natural and useful reviews to support user decisions for
suggested cross-domain items. By generating a review that is based
on the specific user’s interests across domains and other reviews,
we can help clarify why a specific item is recommended to a user. Our
model shares information across domains at two levels: by sharing
users’ latent representations and cascading them into reviews’ latent
representations. It can capture non-linear user-item relationships
by having a neural network framework [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Our results and findings
of this research are summarized as follows:
• We propose a neural network framework named Deep
Hybrid Cross Domain (DHCD) model, which unifies ratings and
reviews of users and items across multiple domains.
• To the best of our knowledge, DHCD is the first framework
that is able to automatically generate cross-domain reviews,
which in turn can provide decision support for cross-domain
recommendations.
• We design and implement multiple experiments to evaluate
DHCD’s performance on three real-world datasets. Our
evaluation is performed via two main tasks, rating prediction and
review generation, to answer four research questions.
      </p>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORK</title>
      <p>
        Here, we briefly review the literature on cross-domain
recommendation and neural network-based collaborative filtering.
Cross-Domain Recommendation focuses on learning user
preferences from data across multiple domains [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. There are two
main approaches to cross-domain recommendation: collaborative filtering [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
and content-based methods [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In this work, we focus on
collaborative filtering cross-domain recommendation. Similar to
single-domain collaborative filtering, research on cross-domain
recommendation usually uses matrix factorization. For example, Pan et
al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] propose a cross domain recommendation system based on
matrix factorization by using a coordinate system transfer method.
Elkahky et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] use a deep learning framework to improve the
performance of cross-domain recommendation and also provide a
scalable method to handle large datasets. However, the main
limitation of these methods is that they do not consider item reviews.
      </p>
      <p>
        Xin et al. [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] proposed the first review-based cross-domain
recommender model: a graphical model capturing
user ratings and item reviews across domains; however, reviews are not
used to model user latent features. Later, Song et al. proposed a joint
tensor factorization model to capture both user reviews and implicit
feedback on items to provide cross-domain recommendations [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
However, it does not capture non-linearities across domains, nor
does it model reviews as natural sequences of words. None of the above
works generate reviews.
      </p>
      <p>Figure 1: An overview of the Deep Hybrid Cross Domain
(DHCD) recommendation system, with a rating regression component
(Q-layer networks over shared user and domain-specific item embeddings)
and a review generation component for each domain.</p>
      <p>
        Neural Frameworks for Collaborative Filtering. Due to their
ability to approximate non-linear relations between users and items, neural
networks are rapidly growing in recommendation systems [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
      </p>
      <p>
        He et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] propose a fusion model that combines matrix
factorization and a multi-layer perceptron. Despite the efficiency of their
proposed model, it does not consider reviews and is not extended to
cross-domain recommendation. Collaborative Deep
Learning (CDL) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] overcomes the sparsity of ratings by using auxiliary
information such as reviews. Treating reviews as a set of words, their
model outperforms baselines, but not considering the sequential
nature of words in reviews is a limitation.
      </p>
      <p>
        Review Generation. Ni et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] presented one of the first works
that focuses on generating reviews along with preference prediction.
Ni and McAuley [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] propose a neural network based on an
attention model to assist users in writing reviews of items. However, these
works and others [
        <xref ref-type="bibr" rid="ref1 ref8">1, 8</xref>
        ] do not model the preference between users
and items, nor are they extendable to cross-domain recommenders.
      </p>
    </sec>
    <sec id="sec-4">
      <title>PROPOSED FRAMEWORK</title>
      <p>In this section, we describe the architecture of the Deep Hybrid Cross
Domain (DHCD) recommendation system in detail.</p>
    </sec>
    <sec id="sec-5">
      <title>Architecture</title>
      <p>DHCD predicts user ratings on items and generates user reviews
on them using two main components: the rating regression
component and the review generation component. In the rating regression
component (RRC), user ratings on items of each domain are
modeled as a function of user and item latent representations. For each
user, this component learns a shared latent representation across
all domains. Moreover, the shared user representations act
as a gate to transfer information across domains. The shared
user latent representations, in combination with domain-specific
item latent representations, predict user ratings on items. The
review generation component (RGC) generates user reviews on items
according to user, item, and word latent representations. In this
component, the user and item representations from the rating
regression component work as a guide to learn review word embeddings
per user-item review. This guidance helps share word embedding
information across domains. Figure 1 illustrates the architecture of
our model. In the following, we present our model in more detail.</p>
      <p>Notations: We model the system to include a set of users U and a
set of item domains D. Each of these domains includes a set of items
I^d, d ∈ D. For a user u ∈ U and each item i ∈ I^d, the training
data may include the user’s rating on that item (r^d_ui) and the user’s review
on that item (s^d_ui). Accordingly, we model the training data in domain d
as a set of tuples T^d = {(u, i, s, r) | u ∈ U, i ∈ I^d, s ∈ S^d, r ∈ R^d}.
Given training data in all domains T, our goal is to simultaneously
estimate user u’s missing rating on item i in domain d (r̂^d_ui) and
generate user u’s missing textual review on that item (ŝ^d_ui).</p>
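      <p>To make this notation concrete, the training data can be held in plain
dictionaries keyed by domain. The following is a minimal sketch with
hypothetical toy data; the names and values are illustrative only, not the
paper's actual datasets:</p>

```python
# Toy illustration of the notation: users U, domains D, and per-domain
# training tuples T^d = {(u, i, s, r)}. All names/values are hypothetical.
users = {"u1", "u2"}            # U
domains = {"books", "music"}    # D

# T^d: one list of (user, item, review, rating) tuples per domain d
training_data = {
    "books": [("u1", "b12", "good comic book", 4.0),
              ("u2", "b7",  "dull plot",       2.0)],
    "music": [("u1", "m3",  "nice jazz song",  5.0)],
}

def observed_ratings(domain):
    """Return {(u, i): r^d_ui} for all observed ratings in one domain."""
    return {(u, i): r for (u, i, _s, r) in training_data[domain]}
```

      <p>For example, observed_ratings("music") yields {("u1", "m3"): 5.0},
the ratings the rating regression component is trained to reproduce.</p>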
      <p>Rating Regression Component (RRC):
The main purpose of this component is to form a structure to
infer user and item representations using observed user feedback
on items across all domains. To do this, we model each user u’s
interests as latent factors v_u and item i’s representation (in domain
d) as latent factors v^d_i. Then, user u’s predicted rating on item i, r̂^d_ui,
is calculated as a function g_r(·) of v_u and v^d_i. Formally, we have:
r̂^d_ui = g_r(v_u, v^d_i)    (1)</p>
      <p>
        In many single-domain factorization-based recommender
systems, g_r is modeled as the vector dot product of these latent factors
plus some bias b [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Namely, r̂_ui = v_u^T v^d_i + b. This specification has
some limitations that make it inappropriate for our cross-domain
problem. First, the simple factorization formulation is not fit for a
cross-domain problem, as it does not transfer information across
domains. Also, the predicted ratings in this model are assumed to
be a linear combination of user and item latent factors. However,
recent work suggests that using a non-linear model can enhance the
representation ability of user and items, and lead to more accurate
results [
        <xref ref-type="bibr" rid="ref12 ref5">5, 12</xref>
        ]. More specifically, in cross-domain recommenders,
Xin et al. have shown that user ratings across different domains
can have non-linear relationships with each other [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Finally, the
above formulation requires a shared latent space between users and
items. This assumption can restrict the expressive capacity of
the model, since it (i) limits the user and item latent vectors to have
equal sizes, and (ii) assumes the k-th element of the user latent vector
must only interact with the corresponding element of the item latent
factor. For further information, He et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] provide an example
that illustrates these restrictions.
      </p>
      <p>
        To tackle the non-linearity problem, we model дr using deep
neural networks. Neural networks have been successfully used in
collaborative filtering problems [
        <xref ref-type="bibr" rid="ref2 ref25 ref7">2, 7, 25</xref>
        ] and can inherently model
non-linear relationships [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. To have a cross-domain solution, we
extend the collaborative neural network to include multiple item
domains. As shown in the left side of Figure 1, for each domain d, we
construct a multi-layer perceptron network (H^d) with Q layers. To
share and transfer information across different domains, we model
the user latent factors v_u to be shared across all domains.
Additionally, to avoid having a shared latent space between users and items,
we use concatenation instead of the dot product in g_r. Consequently,
the input x^d_ui to our multi-layer perceptron’s first layer is the
concatenation of the embedding vector v_u of user u and the embedding
vector v^d_i of item i ∈ I^d. Formally, x^d_ui = [v_u ; v^d_i].
      </p>
      <p>Network H^d maps this input x^d_ui to rating r̂^d_ui. We denote the q-th
hidden layer of H^d as h^d_q, which applies a non-linear function
to the output of h^d_{q−1}; i.e., h^d_q = ReLU(W^d_q h^d_{q−1} + b^d_q),
where W^d_q and b^d_q are the parameters of H^d’s q-th layer for domain d
and ReLU(x) = max(0, x). For the first layer, h^d_0 = x^d_ui is the input.
We ensure full connectivity between every two adjacent hidden
layers h^d_q and h^d_{q−1}. We use regression to map the output vector ŷ^d_Q
of the final layer to the prediction value r̂^d_ui, i.e., r̂^d_ui = w^d_y ŷ^d_Q + b^d_y,
where r̂^d_ui is the predicted rating of user u on item i in domain d, and
w^d_y ∈ R^r and b^d_y ∈ R are regression parameters.</p>
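      <p>The forward pass of one domain's network H^d can be sketched in plain
Python. This is a hedged illustration: a real implementation would use a
deep-learning library with learned weights, and all values below are
placeholders:</p>

```python
def relu(v):
    """Element-wise ReLU(x) = max(0, x)."""
    return [max(0.0, x) for x in v]

def affine(W, b, x):
    """W·x + b, with W given as a list of rows."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(W, b)]

def predict_rating(v_u, v_i, layers, w_y, b_y):
    """Forward pass of one domain's network H^d.

    v_u, v_i : user / item latent vectors, concatenated into x^d_ui
    layers   : list of (W_q, b_q) pairs, each applied with ReLU
    w_y, b_y : final regression parameters mapping y_Q to a scalar rating
    """
    h = v_u + v_i                      # x^d_ui = [v_u ; v^d_i]
    for W_q, b_q in layers:
        h = relu(affine(W_q, b_q, h))  # h^d_q = ReLU(W^d_q h^d_{q-1} + b^d_q)
    return sum(w * y for w, y in zip(w_y, h)) + b_y
```

      <p>With one hidden layer and hand-picked weights, predict_rating([1.0], [2.0],
[([[1.0, 1.0], [0.0, 1.0]], [0.0, 0.0])], [1.0, 1.0], 0.5) returns 5.5: the
concatenated input [1, 2] is mapped to hidden state [3, 2] and then regressed.</p>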
      <p>To learn the parameters of RRC, we optimize the following
regression loss function:</p>
      <p>Lr = Õ
Õ</p>
      <p>d d 2
(rui − rˆui )
d ∈D u ∈U,i ∈Id
(2)
where rudi is the observed rating of user u and i in domain d.
3.3</p>
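      <p>Equation 2 translates directly into code. A small sketch follows; the
dictionary layout is an assumption made for illustration:</p>

```python
def rating_loss(observed, predicted):
    """L_r = sum over domains d and observed (u, i) of (r^d_ui - rhat^d_ui)^2.

    observed / predicted: {domain: {(user, item): rating}} dictionaries;
    only pairs present in `observed` contribute to the loss.
    """
    return sum((r - predicted[d][ui]) ** 2
               for d, ratings in observed.items()
               for ui, r in ratings.items())
```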
      <p>Review Generation Component (RGC):
This component models and generates reviews for user-item
pairs in the cross-domain setting. Here, we model user, item, and review
word latent factors to generate natural language sentences.</p>
      <p>
        Recently, recurrent neural networks with components such as
long short-term memory (LSTM) and gated recurrent units (GRU)
have shown high performance in natural language processing-related
tasks such as image captioning and question answering [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Inspired
by their success, we adopt the LSTM as a component of our review
generation process.
      </p>
      <p>As shown in Figure 1, for each domain d, we construct a
separate LSTM model H̄^d that connects to the rating regression
component. Consider s^d_ui, user u’s review on item i in domain d, as a
sequence of words t_j, where j ≤ J_ui (J_ui is the number of words in
this review). Given a text sequence t_1, t_2, ..., t_{J_ui}, the LSTM network
updates its hidden state (h̄^d_j) at step j according to t_j
and the previous step’s hidden state (h̄^d_{j−1}). Subsequently, the network
predicts t_{j+1}, step (j + 1)’s word, using all of its previous words
(t_&lt;j+1). The output layer is connected to a softmax layer. The
general idea of review modeling is expressed by p(t_j | t_&lt;j, Φ^d) = δ(h̄^d_j),
where Φ^d represents the neural network’s parameters for domain
d, and δ(·) is the softmax function. Each hidden state h̄^d_j is modeled
as a function of word t_j and the previous state h̄^d_{j−1}.</p>
      <p>
        The above “vanilla” LSTM can only model sentences from a
corpus and is unable to embed user and item latent features. For
reviews to represent user tastes, we have to make sure to include
user and item features. To enhance the modeling power, we first apply
word2vec [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to the corpus of review texts to learn the embedding
vector of each word. Then, for each word t_j in the review, we
concatenate this embedding vector with the user and item latent vectors to
create a latent input vector, i.e., [word2vec(t_j); v_u; v^d_i]. There
are three main advantages to this representation mechanism: (i)
concatenation with user and item latent vectors ensures that the
user and item information does not vanish over steps, which
can enhance sequence generation; (ii) since user latent vectors
are shared across domains, concatenation with them
works as a means to transfer review information across domains;
and (iii) word2vec is able to learn hidden word characteristics
within the corpus which cannot be inferred from one-hot encoding
vectors or tf-idf [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
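      <p>The per-step input construction can be sketched as follows; the
embedding table stands in for trained word2vec vectors, and all values
are hypothetical:</p>

```python
def review_step_inputs(review_words, embeddings, v_u, v_i):
    """Build the RGC input for each word t_j of a review:
    [word2vec(t_j); v_u; v^d_i].  Because v_u is shared across domains,
    every step carries cross-domain user information."""
    return [embeddings[w] + v_u + v_i for w in review_words]
```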
      <p>
        To learn the parameters of the LSTM network, we optimize the
negative log-likelihood of the review data:
Ls = − Σ_{d∈D} Σ_{(u,i,s,r)∈T^d} Σ_{j≤J_ui} log p(t_j | t_&lt;j, Φ^d)    (3)
The two components of DHCD are trained jointly by minimizing
L = λr·Lr + λs·Ls + λ·(‖Vu‖² + ‖Vi‖²)    (4)
where hyperparameters λr and λs control the trade-off between the
rating regression and review generation tasks, λ is the regularization
term for avoiding overfitting, and Vu and Vi are matrices that stack all
latent factors of users/items in all domains. Φ represents all
parameters of DHCD. The above loss function can be efficiently optimized
in an end-to-end manner using back-propagation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
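      <p>The joint objective can be sketched as below. Note that the exact
regularized terms in Equation 4 are our reconstruction (the original
equation body is garbled in this copy), so treat the regularizer as an
assumption:</p>

```python
def joint_loss(l_r, l_s, v_users, v_items, lam_r=1.0, lam_s=1.0, lam=0.01):
    """L = lam_r*L_r + lam_s*L_s + lam*(||V_u||^2 + ||V_i||^2).

    l_r, l_s: rating-regression and review-generation losses;
    v_users, v_items: lists of latent vectors stacked across all domains.
    The squared-Frobenius regularizer is an assumed reconstruction.
    """
    reg = (sum(x * x for v in v_users for x in v)
           + sum(x * x for v in v_items for x in v))
    return lam_r * l_r + lam_s * l_s + lam * reg
```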
    </sec>
    <sec id="sec-6">
      <title>EXPERIMENTS</title>
      <p>In this section, we evaluate our proposed model against several
baselines to demonstrate the robustness of DHCD.</p>
    </sec>
    <sec id="sec-7">
      <title>Datasets</title>
      <p>
        We consider three category combinations of Amazon datasets [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]:
Book and Digital Music; Book and Office Products; Digital Music
and Office Products. For each cross-domain dataset, we select users
who made purchases and wrote reviews in both categories. In each
review, we filter out words whose frequency is less than 50. Table 1
describes statistics of the datasets.
      </p>
      <p>
        Training/Test Data: For each dataset and each user, we
chronologically split their first 80% of ratings and reviews as training data and
the remaining 20% as test data.
      </p>
      <p>
        Baselines: We compare against the following methods.
• Matrix factorization (MF) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]: It uses user and item ratings
as input. The predicted value is a linear combination of the
interaction between user and item latent features as well as
the user/item/global bias.
• Neural Collaborative Filtering (NCF) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]: With ratings as its
input, this single-domain model combines a neural network
and matrix factorization to capture the non-linear interaction
between users’ and items’ latent factors.
• Collaborative Deep Learning (CDL) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]: Using ratings and
a bag of words of reviews, this single-domain model fuses
a neural network and topic modeling.
• Collaborative Filtering with Generative Concatenative
Networks (CF-GCN) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]: This single-domain hybrid model
unifies both ratings and reviews under a neural framework.
• Cross-domain neural network (CDN) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]: This model utilizes
neural networks for cross-domain recommendation.
However, it does not consider reviews along with ratings of
users and items.
      </p>
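      <p>The per-user chronological 80/20 split described above can be sketched
as follows; the record layout is an assumed example, not the paper's
actual preprocessing code:</p>

```python
def chronological_split(interactions, train_frac=0.8):
    """Per-user chronological split: the earliest `train_frac` of each
    user's records go to training, the rest to testing.

    interactions: {user: [(timestamp, item, rating, review), ...]}
    """
    train, test = {}, {}
    for user, records in interactions.items():
        ordered = sorted(records)   # timestamp first => chronological order
        cut = int(len(ordered) * train_frac)
        train[user] = ordered[:cut]
        test[user] = ordered[cut:]
    return train, test
```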
      <p>We design the experiments in two settings: regular single-domain
and cross-domain. In the regular single-domain setting, we model
user feedback of one domain to recommend items in the same
domain. In the cross-domain setting, although a baseline may
have been designed for one domain, we use user feedback on both
domains to recommend useful items. For the single-domain models,
we add the prefix “cd” to their names to indicate when they are
provided data from both domains. Specifically, both domains’ datasets
are unified and used for training cdMF and cdNCF.</p>
      <p>
        For the rating prediction task, we compare DHCD against all the
baselines. For the review generation task, we compare DHCD with CF-GCN,
since it is the only baseline with review generation capacity.
Moreover, we also use word-based LSTM (W-LSTM) and character-based
LSTM (C-LSTM) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] as baselines for performance comparison.
      </p>
      <p>Significance Testing: Hypothesis testing is used to determine whether the
prediction performance of our model is significantly different from
the baselines. In this test, for each metric, we select the method
whose performance is nearest to our DHCD’s for comparison.</p>
      <p>
        Default Parameter Setting: The number of latent factors for users
and items is set to 20 for all models. For the models using a
multi-layer perceptron (i.e., all models except MF), the number of layers
Q is equal in all domains. The capacities of the multi-layer
perceptron layers are set to 64, 32, 16, and 8. The embedding size of each word
is 50. For models using an LSTM (i.e., DHCD, CF-GCN, C-LSTM,
W-LSTM), the number of LSTM layers is set to 2 and the hidden size
is set to 128. We assume that the rating and review contributions are
equal, so we set λr = λs = 1 and the regularization term λ = 0.01
(see Equation 4). To learn model parameters, we use ADAM [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
with learning rate 10^−4.
      </p>
    </sec>
    <sec id="sec-8">
      <title>Rating Prediction Performance</title>
      <p>
        Performance Measures: We use recall at K (r@K), mean absolute
error (MAE), and root mean square error (RMSE) to measure the
performance of our proposed model and the baselines.
Cross-Domain Results: Table 2 shows our model’s and the baselines’
performance in the cross-domain setting. For r@K, we assume ratings
greater than 3 to be positive. We apply the paired t-test [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] for significance
testing. From the table, we observe that our model significantly
outperforms all baselines on all metrics. For example, the
performance of our model is 7% better than that of the CDL model in
terms of MAE. The performance of CDL is generally better than cdNCF’s,
which indirectly suggests that using reviews can enhance
prediction performance. Similarly, CDN outperforms cdNCF,
which emphasizes the importance of modeling items into domains
in a cross-domain design, instead of simply merging different
domains’ data. Our model, DHCD, unifies reviews and ratings in a
cross-domain design. Hence, it achieves higher performance than
the other baselines.
      </p>
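      <p>The three measures are standard; a minimal sketch of how they can be
computed, with the rating &gt; 3 positivity threshold taken from the text
(inputs are illustrative, not the paper's data):</p>

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error over paired rating lists."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean square error over paired rating lists."""
    return math.sqrt(sum((t - p) ** 2
                         for t, p in zip(y_true, y_pred)) / len(y_true))

def recall_at_k(true_ratings, pred_scores, k, pos_threshold=3.0):
    """r@K: fraction of positively rated items (rating > threshold) that
    appear among the top-K items ranked by predicted score."""
    top_k = set(sorted(pred_scores, key=pred_scores.get, reverse=True)[:k])
    positives = {i for i, r in true_ratings.items() if r > pos_threshold}
    return len(positives & top_k) / len(positives) if positives else 0.0
```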
      <sec id="sec-8-1">
        <title>Table 5: Perplexity comparison between our model and baselines (lower is better)</title>
        <p>[Flattened table fragment; only the Book + Digital Music
column is recoverable: 3.10, 3.12, 3.02, 2.93.]</p>
        <p>Table 4 shows the results of our model in the cold-start setting. Due
to space limitations, we only provide the performance of CDN
for comparison on MAE and RMSE. As shown in the table, DHCD
significantly outperforms CDN in all three datasets. This shows
that using reviews in our model adds extra valuable information
for predicting user ratings, compared to the CDN model that only
uses rating information across the two domains.</p>
      </sec>
      <sec id="sec-8-2">
        <title>Table 4: Prediction Performance in the Cold-start Setting</title>
        <p>Columns: Book + Digital Music; Book + Office Products; Digital
Music + Office Products (MAE, RMSE each).
CDN: 0.767, 0.97; 0.767, 0.986; 0.743, 1.052.
DHCD: 0.751*, 0.94*; 0.75*, 0.922*; 0.725*, 1.031*.
Notation * denotes p &lt; 0.05 in the significance test.</p>
        <p>Single-Domain Results: Table 3 shows the rating prediction
performance of DHCD and the baselines in the single-domain set of experiments.
We only show the results for the Book + Digital Music dataset to save
space, since the experiments on the other two show similar
results. For baselines like MF, NCF, CDL, and CF-GCN, the
training data and test data are from the same domain. For DHCD and
CDN, we use the training data from both domains and report the
performance in each domain separately. Our first observation
from the table is that for baseline models that are
single-domain by design, their performance in a single domain is better
than in the cross-domain setting. For example, the r@10 of MF is 0.03 in the Book
domain, but this method only achieves 0.028 on the combined Book
+ Digital Music domains (Table 2). This indirectly implies that
using heterogeneous data without proper integration can harm the
performance of models. Secondly, in general, our DHCD model
outperforms the baselines in each domain. This implies
that DHCD properly fuses the ratings and reviews
of users and items from different domains under one framework.
Thirdly, in almost all models, but most notably in DHCD,
performance in smaller domains is better than in larger domains.
For example, the Book domain is larger than the Digital Music domain, and
the performance of CF-GCN in the Book domain is worse than
in Digital Music. This suggests that larger domains contain more noise
than smaller ones, and that smaller datasets may benefit
more from information transfer.</p>
        <p>In general, the performance of DHCD is significantly better than
the best baseline in the significance test for both cross-domain and
single-domain settings.</p>
        <p>Rating Prediction in Cold-start: To conduct this experiment, we
keep the same test set, remove users with more than 5 ratings in
both domains from the training set, and use the default parameter settings.</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Review Generation Analysis</title>
      <p>In this section, we use perplexity as a measure of review
generation quality; the lower the perplexity, the better the model:
ppx = exp( −(1/N) Σ_{(u,i)} (1/J_ui) Σ_{c=1}^{J_ui} log p(t_c | t_&lt;c, Φ) )    (5)
where (u, i) is a pair of user and item from the test set, N
is the number of reviews in the test set, J_ui is the number of words
in the review between u and i, and Φ denotes the parameters.</p>
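      <p>Equation 5 can be computed from per-word log-probabilities of the test
reviews; a small sketch (the log-probabilities would come from a trained
model and are hypothetical here):</p>

```python
import math

def perplexity(review_log_probs):
    """ppx = exp( -(1/N) * sum over reviews of the mean per-word
    log-probability log p(t_c | t_<c) ).

    review_log_probs: one list of per-word log-probabilities per review.
    """
    n = len(review_log_probs)
    total = sum(sum(lps) / len(lps) for lps in review_log_probs)
    return math.exp(-total / n)
```

      <p>As a sanity check, a model that assigns probability 0.5 to every word
obtains a perplexity of 2.0.</p>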
      <p>Result: Table 5 shows the perplexity of DHCD and the baselines
C-LSTM, W-LSTM, and CF-GCN. As shown in the table, the perplexity
of our model is lower than that of the baselines on the three datasets.
For example, the perplexity of DHCD is 6% better than W-LSTM’s.
This suggests that the latent representations of users and items
learned by the multi-layer perceptron are able to inform the review
generation process. Moreover, the performance of DHCD is better
than that of CF-GCN, which implies that modeling domains separately
is useful for generating reviews.</p>
    </sec>
    <sec id="sec-10">
      <title>The Effect of Reviews in Training</title>
      <p>In this section, we investigate the impact of reviews on training
our model. To do so, we compare the rating regression training
loss of CDN and our model across epochs (Equation 2). CDN can
be considered a simplified version of our model without reviews.
The faster the training loss converges, the better
the method. The parameters are kept at their default values.</p>
      <p>Experimental Results: Figure 2 shows the training loss of rating
regression for DHCD and CDN over 50 epochs on the three datasets.
From the figure, our first observation is that the training loss
decreases as the number of epochs increases and reaches a stable
value after a certain number of epochs. Secondly, both methods appear
to converge to a fixed point on all three datasets. However, DHCD
converges faster than CDN. For instance, after 10 epochs our method
is close to convergence, while CDN needs 25 epochs to show the same
behavior on the Book + Digital Music dataset. From this result, we
conclude that reviews are indeed helpful for the learning of DHCD.</p>
      <p>[Figure 2: Training loss of rating regression of CDN and our model (DHCD) on the three datasets: Book + Digital Music (B_DM), Book + Office Products (B_OP) and Digital Music + Office Products (DM_OP) through epochs.]</p>
      <p>[Figure 3: RMSE and perplexity of DHCD for varying ratio λr/λs on the three datasets: Book + Digital Music (B_DM), Book + Office Products (B_OP) and Digital Music + Office Products (DM_OP). For both metrics, the lower the better.]</p>
    </sec>
    <sec id="sec-10b">
      <title>The Balance between Rating Prediction and Review Generation</title>
      <p>In DHCD, λr and λs control the trade-off between the rating
prediction and review generation tasks. To study their effects, we
keep λs = 1, train with various values of λr, and then measure the
performance of DHCD on the test set. Two metrics, perplexity and
RMSE, are selected for evaluation.</p>
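      <p>The weighting itself reduces to a two-term objective; the sketch below is illustrative (Equation 2 in the paper defines the actual rating loss):</p>

```python
def joint_loss(rating_loss, review_loss, lambda_r=1.0, lambda_s=1.0):
    """Weighted sum of the two task losses; raising lambda_r relative
    to lambda_s shifts training effort toward rating prediction
    (better RMSE, worse perplexity, as observed in Figure 3)."""
    return lambda_r * rating_loss + lambda_s * review_loss

# With lambda_s fixed at 1, sweep the ratio lambda_r / lambda_s
# over the values used in the experiment.
losses = [joint_loss(0.8, 2.9, lambda_r=r)
          for r in (0.1, 0.5, 0.7, 1.0, 1.5, 2.0, 2.5)]
```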
      <p>Experimental Results: In this experiment, the ratio λr/λs takes
values in {0.1, 0.5, 0.7, 1.0, 1.5, 2.0, 2.5}. The results are plotted
in Figure 3. From the figure, we observe that increasing the ratio
λr/λs leads to better RMSE and worse perplexity, since a larger
ratio means more effort is devoted to rating prediction. Moreover,
this pattern is similar across the three datasets.</p>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSION</title>
      <p>In this paper, we have proposed the Deep Hybrid Cross Domain (DHCD)
recommendation system, which captures the reviews and ratings of
users and items across different domains. Through our extensive
experiments, DHCD outperforms the baselines on rating prediction
and review generation tasks.</p>
      <p>There are several directions in which to extend our work. DHCD
does not yet consider the sequence of users' decisions or the social
effect of users' friends on their decisions. These interesting
directions should be studied in the near future, especially to address
data sparsity issues.</p>
    </sec>
  </body>
  <back>
  </back>
</article>