=Paper= {{Paper |id=Vol-2449/paper4 |storemode=property |title=Review-Based Cross-Domain Collaborative Filtering: A Neural Framework |pdfUrl=https://ceur-ws.org/Vol-2449/paper4.pdf |volume=Vol-2449 |authors=Thanh-Nam Doan,Shaghayegh Sahebi |dblpUrl=https://dblp.org/rec/conf/recsys/DoanS19 }} ==Review-Based Cross-Domain Collaborative Filtering: A Neural Framework== https://ceur-ws.org/Vol-2449/paper4.pdf
               Review-Based Cross-Domain Collaborative Filtering:
                             A Neural Framework
                                                      Thanh-Nam Doan, Shaghayegh Sahebi
                                                                   University at Albany - SUNY
                                                                   {tdoan,ssahebi}@albany.edu

ABSTRACT                                                                                 Two major approaches to address some of these problems are
Cross-domain collaborative filtering recommenders exploit data                        hybrid [18] and cross-domain [4] recommender systems. Hybrid
from other domains (e.g., movie ratings) to predict users’ interests                  recommender systems merge content-based and collaborative filter-
in a different target domain (e.g., suggest music). Most current cross-               ing approaches to provide higher-quality recommendations. Some
domain recommenders focus on modeling user ratings but pay lim-                       hybrid recommender systems jointly model user ratings and re-
ited attention to user reviews. Additionally, due to the complexity of                views to introduce a more sophisticated view to user interests and
these recommender systems, they cannot provide any information                        item features, that leads to improved recommendation results [18].
to users to support user decisions. To address these challenges, we                      The idea behind cross-domain recommendation systems is to
propose Deep Hybrid Cross Domain (DHCD) model, a cross-domain                         share useful information across two or more domains to improve
neural framework, that can simultaneously predict user ratings,                       recommendation results [4]. They work by transferring informa-
and provide useful information to strengthen the suggestions and                      tion from one or more source or auxiliary domains to suggest useful
support user decision across multiple domains. Specifically, DHCD                     items in a target domain. Especially, when user history in the target
enhances the predicted ratings by jointly modeling two crucial                        domain (e.g., books) does not provide enough information about
facets of users’ product assessment: ratings and reviews. To sup-                     user interests, user preferences in another source domain (e.g.,
port decisions, it models and provides natural review-like sentences                  movies) can provide useful insights that can lead to more accurate
across domains according to user interests and item features. This                    or novel recommendations1 . In addition to improving recommen-
model is robust in integrating user rating and review information                     dation results, cross-domain recommender algorithms provide a
from more than two domains. Our extensive experiments show                            solution to problems, such as cold-start or user profiling, in single-
that DHCD can significantly outperform advanced baselines in rat-                     domain recommenders.
ing predictions and review generation tasks. For rating prediction                       Both hybrid and cross-domain recommender systems have shown
tasks, it outperforms cross-domain and single-domain collaborative                    to be successful in the current literature. However, a combination
filtering as well as hybrid recommender systems. Furthermore, our                     of two has been rarely studied. Additionally, the problem of provid-
review generation experiments suggest an improved perplexity                          ing more information to users to support their decisions in cross-
score and transfer of review information in DHCD.                                     domain recommender systems, has not been studied. Most of the
                                                                                      current research in cross-domain recommenders focus on collab-
CCS CONCEPTS                                                                          orative filtering cross-domain approaches [19]. These approaches
                                                                                      incorporate users’ explicit (e.g., rating) or implicit (e.g., purchase)
• Information systems → Recommender systems; • Comput-
                                                                                      feedback in the auxiliary domain to recommend items in the target
ing methodologies → Neural networks.
                                                                                      domain. Many of these algorithms jointly model multiple domains
                                                                                      by sharing common user’s latent representations across them. Col-
KEYWORDS                                                                              laborative filtering cross-domain recommenders, similar to their
Cross-domain Collaborative filtering, neural network, hybrid col-                     single-domain counterparts, suffer from ignoring content informa-
laborative filtering                                                                  tion. Having advanced models, which are built on users’ rating
                                                                                      or binary feedback, complicates the reasoning of why a specific
1    INTRODUCTION                                                                     user may be interested in an item. Moreover, these recommender
                                                                                      algorithms lose the explicit user-item similarities by ignoring an
Nowadays, users are overwhelmed by the number of choices online.
                                                                                      important source of information: user reviews.
Recommender systems are increasingly used as an essential tool,
                                                                                         To further enhance the performance and transparency of cross-
to alleviate this problem. Despite improvements in recommender
                                                                                      domain recommendation systems, we propose to combine hybrid
systems, many of them still suffer from problems, including cold-
                                                                                      and cross-domain approaches together. With this fusion, we can
start [21] and difficulty in explaining their suggestions [26]. More-
                                                                                      benefit from the strength of both hybrid and cross-domain recom-
over, collaborative filtering recommenders [11] cannot use obvious
                                                                                      mender systems: cross-domain modeling will enhance user latent
feature-based relations between users and items. Content-based
                                                                                      features by providing extra information from other domains (es-
approaches cannot capture deeper social or semantic similarities
                                                                                      pecially in sparser ones), reviews will bring another dimension
between users and items, nor they can suggest novel items (outside
                                                                                      for enriching user and item latent features and offer insights to
the scope of user profile features) to users [17].
                                                                                      increase the recommendation transparency. Therefore, merging
ComplexRec 2019, 20 September 2019, Copenhagen, Denmark
                                                                                      1 While other definitions of domain exist in the literature, e.g., time-based domains, in
Copyright ©2019 for this paper by its authors. Use permitted under Creative Commons
License Attribution 4.0 International (CC BY 4.0)..                                   this paper, we focus on item domains (e.g., item type or category).
ComplexRec 2019, 20 September 2019, Copenhagen, Denmark                                                                  Thanh-Nam Doan, Shaghayegh Sahebi


the two will enrich content features by using review information              Domain 1
                                                                                                  Rating Prediction
                                                                                                                                 Good        comic         book         
across domains as well as enhance prediction performance.                                             Layer Q

   Accordingly, we propose Deep Hybrid Cross Domain (DHCD)                                            Layer 2
recommender, which models various types of user feedback (both                                        Layer 1
ratings and reviews) across multiple domains under neural net-
                                                                                                                                   Good         comic          book
work framework. We use neural network as a natural choice to                                                          item
model reviews due to its success in natural language processing                                   Rating Prediction
                                                                                                                                 Nice         jazz         song         
and generating natural language sentences [5, 26]. In addition to                                     Layer Q
                                                                              user
using reviews for producing better-quality suggestions, DHCD can                                      Layer 2
generate natural and useful reviews to support user decisions for                                     Layer 1
suggested cross-domain items. By generating a review that is based
                                                                              Domain 2                                             Nice          jazz          song
on the specific user’s interests across domains and other reviews,                                                    item
we can help clarify why a specific item is recommended to user. Our                  Rating Regression Component                        Review Generation Component
model shares information across domains in two levels by sharing          Figure 1: An overview of Deep Hybrid Cross Domain
users’ latent representations, and cascading it into reviews’ latent      (DHCD) recommendation system.
representations. It can capture non-linear user-item relationships
by having a neural network framework [5]. Our results and findings        Neural Frameworks for Collaborative Filtering. Due to its abil-
of this research are summarized as follows:                               ity to approximate non-linear relation of users and items, neural
                                                                          network is rapidly growing in recommendation systems [25].
    • We propose a neural network framework named Deep Hy-                   He et al. [7] propose a fusion model that combines matrix factor-
      brid Cross Domain (DHCD) model which unifies ratings and            ization and multi-layer perceptron. Despite the efficiency of their
      reviews of users and items across multiple domains.                 proposed model, it does not consider reviews and is not extended to
    • To the best of our knowledge, DHCD is the first framework           cross-domain recommendation system. Collaborative Deep Learn-
      which is able to automatically generate cross-domain reviews        ing (CDL) [23] overcome the sparsity of ratings by using auxiliary
      that in turn can provide decision support for cross domain          information such as reviews. Using reviews as a set of words, their
      recommendations.                                                    model outperforms baselines but not considering the sequential
    • We design and implement multiple experiments to evaluate            nature of words in reviews is a limitation.
      DHCD’s performance in three real-world datasets. Our eval-          Review Generation. Ni et al. [14] presented one of the first works
      uation is performed via two main tasks: rating prediction and       that focuses on generating reviews along with preference prediction.
      review generation tasks, to answer four research questions.         Ni and McAuley [15] propose a neural network based upon atten-
                                                                          tion model to assist users writing reviews of items. However, these
                                                                          works and others [1, 8] do not model the preference between users
2   RELATED WORKS                                                         and items, nor they are extendable to cross-domain recommenders.
Here, we briefly review the literature on cross-domain recommen-
dations and neural network-based collaborating filtering.                 3          PROPOSED FRAMEWORK
Cross-Domain Recommendation focuses on learning user pref-
                                                                          In this section, we describe the architecture of Deep Hybrid Cross
erences from data across multiple domains [4]. There are two fo-
                                                                          Domain (DHCD) recommendation system in detail.
cuses on cross domain recommendation: collaborative filtering [3]
and content-based methods [20]. In this work, we focus on collabo-
rative filtering cross domain recommendations. Similar to single-         3.1           Architecture
domain collaborative filtering, research work on cross domain rec-        DHCD predicts user ratings on items and generates user reviews
ommendation usually use matrix factorization. For example, Pan et         on them using two main components: the rating regression compo-
al. [16] propose a cross domain recommendation system based on            nent and the review generation component. In the rating regression
matrix factorization by using a coordinate system transfer method.        component (RRC), user ratings on items of each domain are mod-
Elkahky et al. [3] use deep learning framework to improve the             eled as a function of user and item latent representations. For each
performance of cross domain recommendation and also provide a             user, this component learns a shared latent representation across
scalable method to handle large datasets. However, not considering        all domains. Moreover, the shared representations of users has a
the reviews of items is the main limitation of these methods.             role as a gate to transfer information across domains. The shared
    Xin et al. [24] proposed the first review-based cross-domain rec-     user latent representations in combination with domain-specific
ommender model. They proposed a graphical model to capture the            latent item representations predict user ratings on items. The re-
user ratings and item reviews across domains but reviews are not          view generation component (RGC) generates user reviews on items
used to model user latent features. Later, Song et al. proposed a joint   according to user, item, and word latent representations. In this
tensor factorization model to capture both user reviews and implicit      component, the user and item representations from rating regres-
feedback on items to provide cross-domain recommendations [22].           sion component work as a guide to learn review word embeddings
However, it does not capture non-linearities across domains, nor it       per user-item review. This guidance helps sharing word embedding
models reviews as natural sequences of words. None of the above           information across domains. Figure 1 illustrates the architecture of
works generate reviews.                                                   our model. In the following, we present our model in more details.
Review-Based Cross-Domain Collaborative Filtering                                 ComplexRec 2019, 20 September 2019, Copenhagen, Denmark

                                                                                                                                                q−1
Notations We model the system to include a set of users U, and a               projecting from the output of hdq−1 ; i.e. hdq = ReLU(Wqd hd           + bqd )
set of item domains D. Each of these domains include a set of items            where Wqd and bqd are parameters of H d ’s q th layer for domain d
I d , d ∈ D. For a user u ∈ U and each item i ∈ I d , the training
                                                   d ) and user’s review       and ReLU(x) = max(0, x). For the first layer, hd0 = xui
                                                                                                                                     d is the input.
data may include user’s rating on that item (rui
               d ). Accordingly, we model training data in domain d
                                                                               We ensure the full connectivity between each two adjacent hidden
on that item (sui                                                                                                                                  d
                                                                               layers hdq and hdq−1 . We use regression to map the output vector ŷQ
as a set of tuples T d = {(u, i, s, r )|u ∈ U, i ∈ I d , s ∈ Sd , r ∈ R d }.                                            d i.e. rˆd = w d ŷ d +b d where
                                                                               of final layer to the prediction value rˆui       ui   y Q       y
Given training data in all domains T , our goal is to simultaneously
                                                                   d ) and       d is the predicted rating value of user u and item i in domain d.
                                                                               rˆui
estimate user u’s missing rating on item i in domain d (ˆru,i
                                                         d ).
generate user u’s missing textual review on that item (ŝu,i                   wyd ∈ Rr and byd ∈ R are regression parameters.
                                                                                    To learn the parameters of RRC, we optimize the following re-
3.2    Rating Regression Component (RRC)                                       gression loss function:

                                                                                                  Lr =                  d      d 2
                                                                                                        Õ       Õ
The main purpose of this component is to form a structure to                                                          (rui − rˆui )            (2)
infer user and item representations using observed user feedback                                         d ∈D u ∈U,i ∈I d
on items across all domains. To do this, we model each user u’s
interests as latent factors vu and item i’s representation (in domain                 d is the observed rating of user u and i in domain d.
                                                                               where rui
d) as latent factors vid . Then, user u’s predicted rating on item i, rˆui
                                                                        d ,
                                                d
is calculated as a function дr (·) of vu and vi . Formally, we have:           3.3    Review Generation Component (RGC)
                              d                                                This component is to model and generate reviews for user-item
                            rˆui = дr (vu , vid )                       (1)
                                                                               pairs in cross-domain setting. Here, we model user, item, and review
   In many single-domain factorization-based recommender sys-                  word latent factors to generate natural language sentences.
tems, дr is modeled as the vector dot product of these latent factors             Recently, recurrent neural networks with components such as
plus some bias b [11]. Namely, rˆui = vuT vid +b. This specification has       long-short term memory (LSTM) and gated recurrent units (GRU)
some limitations that makes it inappropriate for our cross-domain              have showed high performance in natural language processing-
problem. First, the simple factorization formulation is not fit for a          related tasks such as image captioning, Q&A system [5]. Inspired
cross-domain problem, as it does not transfer information across               by their success, we adapt LSTM as a component for our review
domains. Also, the predicted ratings in this model are assumed to              generation process.
be a linear combination of user and item latent factors. However,                 As shown in Figure 1, for each domain d, we construct a sepa-
recent work suggests that using a non-linear model can enhance the             rated LSTM model H̄ d , that can connect to the rating regression
representation ability of user and items, and lead to more accurate            component. Assume sui  d , user u’s review on item i in domain d, as a
results [5, 12]. More specifically, in cross-domain recommenders,              sequence of words t j where j ≤ Jui (Jui is the number of words in
Xin et al. have shown that user ratings across different domains               this review). Given a text sequence t 1 , t 2 , ..., t Jui , the LSTM network
can have non-linear relationships with each other [24]. Finally, the           will update its hidden state parameters (h̄dj ), in step j, according to t j
above formulation requires a shared latent space between users and
                                                                               and previous step’s hidden state (h̄dj−1 ). Subsequently, the network
items. This assumption can restrict the expressiveness capacity of
                                                                               will predict t j+1 , step (j + 1)’s word, using all of its previous words
the model since it (i) limits the user and item latent vectors to have
                                                                               (t