<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Latent Modeling of Unexpectedness for Recommendations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alexander Tuzhilin</string-name>
          <email>atuzhili@stern.nyu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pan Li</string-name>
          <email>pli2@stern.nyu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>New York University</institution>
          ,
          <addr-line>New York, NY</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>Unexpectedness constitutes an important factor for recommender system to improve user satisfaction and avoid filter bubble issues. Previous methods model unexpectedness in the feature space, making them dificult to capture the latent, complex and heterogeneous interactions between users and items. In this paper, we propose to model unexpectedness in the latent space and utilize a latent convex hull structure to provide unexpected recommendations, as illustrated in Figure 1. Extensive experiments on two real-world datasets demonstrate efectiveness of latent unexpectedness over explicit unexpectedness and show that the proposed model significantly outperforms baseline models in terms of unexpectedness measures while achieving the same level of accuracy.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Evaluation Metrics</title>
    </sec>
    <sec id="sec-2">
      <title>RMSE &amp; MAE</title>
      <p>Root Mean Square Error and Mean
Absolute Error measures the diferences
between estimated ratings and actual
ratings.</p>
    </sec>
    <sec id="sec-3">
      <title>Precision@N &amp; Recall@N</title>
      <p>Precision is the fraction of the
recommended items that are of high
utility to the user. Recall is the fraction
of the high utility items that are
recommended to the user.</p>
    </sec>
    <sec id="sec-4">
      <title>Unexpectedness</title>
      <p>Unexpectedness is calculated through
equation (3) following our proposed
definition.</p>
    </sec>
    <sec id="sec-5">
      <title>Serendipity</title>
      <p>
        Serendipity [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] is computed by
Serendipity = RS&amp;P MR&amp;SU sef ul where
RS stands for the recommended items
using the target model, PM stands
for the recommendation items using
a primitive prediction algorithm and
USEFUL stands for the items whose
utility is above certain threshold.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Diversity</title>
      <p>
        Diversity is computed as the
average intra-list distance [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]:
D(R) = Íi ∈R Íj,i ∈R d(i, j)
      </p>
    </sec>
    <sec id="sec-7">
      <title>Coverage</title>
      <p>Coverage is computed as the
percentage of distinctive recommended items
over all distinctive items in the dataset.
in the case when purchase actions are sparse or noisy. Besides, it does not include specific user and
item features, neither does it address the latent and complex relationships between users and items.
Furthermore, the distance metric between discrete items is hard to define in the feature space, while
in the latent space we have well-defined euclidean distances. Thus, we cannot model the unexpected
relations eficiently in the feature space.</p>
      <p>
        Therefore, though researchers have successfully developed algorithms to improve unexpectedness
and other novelty metrics, they always come at a price of losing accuracy performance, as pointed out
in [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. This severely limits the practical use of the unexpected recommendation, especially in business
applications where the goal is to increase commercial sales and enhance user satisfaction. Thus, it is
important to design a novel unexpected recommender to increase novelty of recommendations while
keeping the same level of accuracy performance.
      </p>
      <p>
        In this paper, we propose to define unexpectedness as the distance metric in the latent space (latent
feature and atribute embeddings) rather than feature space (explicit users and items) by utilizing
Heterogeneous Information Network Embeddings (HINE) [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ] to capture the latent, complex
and heterogeneous relations between them. We take the natural closure of convex hull in the latent
space, and calculate unexpectedness as the euclidean distance from new item embedding to the latent
convex hull of expected items of the user. The proposed unexpectedness measure is subsequently
combined with estimated ratings for providing recommendations.
      </p>
      <p>We make the following contributions in this paper. We propose to model unexpectedness in the
latent space by defining the expected set for each user as a latent convex hull generated by item
embeddings. We also conduct extensive experiments on two real-world datasets and show that our
model significantly improves novelty measures without losing any accuracy metrics.</p>
    </sec>
    <sec id="sec-8">
      <title>METHOD</title>
      <p>
        In prior literature [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], unexpectedness is defined as the distance of recommended item from the
closure set of expected items, the later including items that either previously consumed by the user or
closely related to the consumptions. However, this definition does not include the feature information
for users and items, neither does it consider the latent and heterogeneous interactions between them.
Thus, it might not serve as a perfect definition, for we cannot manually codify all explicit associations
and relations between users and items. Also, it is hard to mathematically formalize the expected set
in the feature space, where the distribution of users and items is discrete and unstructured.
      </p>
      <p>
        On the other hand, heterogeneous information network [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] (HIN) along with latent embedding
method [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] provides us with an eficient tool to model users, items and their associated features
simultaneously by linking the associated features with corresponding entities in the network and
subsequently map them into the latent space. Thus in the paper, we propose to combine those two
techniques for latent modeling of unexpectedness.
      </p>
    </sec>
    <sec id="sec-9">
      <title>Baseline Models SPR [9]</title>
      <p>Serendipitous Personalized Ranking
extends traditional personalized ranking
methods by considering item
popularity in AUC optimization, which makes
the ranking sensitive to the popularity
of negative examples.</p>
    </sec>
    <sec id="sec-10">
      <title>Auralist [18]</title>
      <p>Auralist is a personalized
recommendation system that balances between the
desired goals of accuracy and novelty
simultaneously.</p>
      <p>
        HOM-LIN [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
HOM-LIN is the state-of-the-art
unexpected algorithm that utilizes the
combination of estimated ratings and
unexpectedness for recommendation.
DPP [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
The Determinantal Point Process
utilizes a fast greedy MAP inference
approach to generate relevant and diverse
recommendations.
      </p>
    </sec>
    <sec id="sec-11">
      <title>Random</title>
      <p>Random is the baseline model where
we randomly recommend items to
users without considering any
information about the ratings, unexpectedness
and utility.
where C(TVt , TVt+1 ) stands for the transition coeficient between the type of node Vt and the type
of node Vt +1. |Nt +1(Vt )| stands for the number of nodes of type Vt +1 in the neighborhood of Vt . We
apply heterogeneous random walk iteratively to each node and generate the collection of meta-path
sequences for the skip-gram mechanism.</p>
      <p>
        Note that in previous definition [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the idea of ‘’closure set” plays an important role. As a natural
closure in the latent space, latent convex hull of each user is the smallest convex set that contains all
the item embeddings that the user has visited. Under this seting, we assume that the expected set
maintains its convexity in the growing process. For example, if a person enjoy rap music and rock &amp;
roll, she/he shall also have certain expectations on rap-rock music, an innovation combination of those
two genres. Subsequently, we define the unexpectedness as the distance between the embedding
of new item and the latent convex hull generated from the embeddings of all visited items
of the user:
      </p>
      <p>U tilityu,i = (1 − α ) ∗ EstRatinдu,i + α ∗ U nexpu,i</p>
      <p>U nexpu,i = d(i; LC(Ni ))
where Ni = (i1, i2, · · · , in ) contains the embeddings of items that link to the user in the heterogeneous
information network. Specifically, the distance function is well defined as the minimal distance from
the given point to all the boundaries of the latent closure. We visualize this definition in Figure 1.</p>
      <p>Once we set up the definition of unexpectedness, we perform the unexpected recommendation
using hybrid-utility based collaborative filtering methods that linearly combine estimated ratings
(representing accuracy) with unexpectedness (representing novelty)
(1)
(2)
(3)
(4)
To calculate P (ct |v; θ ), we propose to use heterogeneous random walk to generate meta-paths of
multiple types of nodes in the form of V1 −−R→1 V2 −−R→2 V3 · · · Vn wherein R = R1 ◦ R2 ◦ · · · Rn defines the
composite relations between the start and the end of the heterogeneous random walk. The transition
probability within each random walk between two nodes is defined as follows:
p(Vt +1 |Vt ) =
( C(TVt ,TVt+1 ) , (Vt , Vt +1) ∈ E</p>
      <p>|Nt+1(Vt )|
0,
(Vt , Vt +1) &lt; E
Dataset Yelp TA
# Review 5,996,996 878,561
# Business 188,593 576,689
# User 1,518,169 3,945
# Sparsity 0.002% 0.039%
Table 1: Descriptive Statistics for the two</p>
    </sec>
    <sec id="sec-12">
      <title>Datasets.</title>
      <p>The key idea lies in that, instead of recommending the similar items that the users are very familiar with
as the classical recommenders do, we recommend unexpected and useful items to the users that they
might have not thought about, but indeed fit well to their satisfactions. These two adversarial forces of
accuracy and novelty work together to obtain the optimal solution, thus improving recommendation
performance and user satisfaction. In real practice, the unexpected coeficient α is optimized through
Bayesian optimization.</p>
    </sec>
    <sec id="sec-13">
      <title>EXPERIMENTS AND RESULTS</title>
      <p>
        We implement our model on two real-world datasets: the Yelp Challenge Dataset Round 12 [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and
the TripAdvisor Dataset [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The descriptive statistics of these two datasets are listed in Table 1. To
address the cold-start issue, we filter out users and items that appear less than 5 times in the dataset.
We measure the recommendation results using accuracy and novelty metrics listed in page 2.
      </p>
      <p>
        We compare the recommendation performance with and without considering the proposed
unexpectedness by 5-fold cross-validation experiments using five popular collaborative filtering algorithms
including KNN, SVD, CoClustering, NMF and FM [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. We also report results for baseline unexpected
recommendation models. As shown in Table 2 and 3, when considering unexpectedness in the
recommendation process, we obtain significant amount of improvements in terms of Unexpectedness,
Serendipity and Diversity measures, without sacrificing any accuracy performance in RMSE, MAE,
Precision and Recall. Therefore, the proposed definition of unexpectedness enables us to provide
unexpected and useful recommendations at the same time. It is crucial for successful deployments of
unexpected recommendation models in industrial applications. In addition, the proposed approach
outperforms all previous unexpected recommendation models introduced in page 3, which validates
the superiority of modeling unexpectedness in the latent space over feature space. Finally, we point out
that the proposed model achieves greater improvements on Yelp dataset, which contains richer feature
information compared to the TripAdvisor dataset. This observation is in line with our assumption
that feature information maters in the definition of unexpectedness.
      </p>
    </sec>
    <sec id="sec-14">
      <title>CONCLUSION</title>
      <p>In this paper, we propose to model unexpectedness and user expectations in the latent space, which
makes it possible to capture the latent, complex and heterogeneous relations between users and items,
thus significantly improving the performance and practicability of unexpected recommendations.
We empirically demonstrate that the proposed approach consistently and significantly outperforms
baseline models in terms of unexpected measures without sacrificing the performance of accuracy.</p>
      <p>As the future work, we plan to conduct live experiments with real business environment in order to
further evaluate the efectiveness of unexpected recommendations and analyze both qualitative and
quantitative aspects in an online retail seting, especially with the utilization of A/B test.
Data
TA</p>
      <p>Model</p>
      <p>FM
FM+Unexp</p>
      <p>CoCluster
CoCluster+Unexp</p>
      <p>SVD
SVD+Unexp</p>
      <p>NMF
NMF+Unexp</p>
      <p>KNN
KNN+Unexp</p>
      <p>SPR
Auralist
HOM-LIN</p>
      <p>DPP
Random</p>
      <p>FM
FM+Unexp</p>
      <p>CoCluster
CoCluster+Unexp</p>
      <p>SVD
SVD+Unexp</p>
      <p>NMF
NMF+Unexp</p>
      <p>KNN
KNN+Unexp</p>
      <p>SPR
Auralist
HOM-LIN</p>
      <p>DPP
Random</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <year>2019</year>
          . TripAdvisor Dataset. htp://www.cs.cmu.edu/~jiweil/html/hotel-review.
          <source>html.</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <fpage>2019</fpage>
          .
          <article-title>Yelp Challenge Dataset</article-title>
          . htps://www.yelp.com/dataset/challenge.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Panagiotis</given-names>
            <surname>Adamopoulos</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>On discovering non-obvious recommendations: Using unexpectedness and neighborhood selection methods in collaborative filtering systems</article-title>
          .
          <source>In Proceedings of the 7th WSDM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Panagiotis</given-names>
            <surname>Adamopoulos</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>On unexpectedness in recommender systems: Or how to beter expect the unexpected</article-title>
          .
          <source>ACM Transactions on Intelligent Systems and Technology (TIST)</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Li</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Yonghua</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ningxia</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keping Yang</surname>
            , and
            <given-names>Quan</given-names>
          </string-name>
          <string-name>
            <surname>Yuan</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>How Serendipity Improves User Satisfaction with Recommendations? A Large-Scale User Evaluation</article-title>
          .
          <source>In The World Wide Web Conference. ACM</source>
          ,
          <volume>240</volume>
          -
          <fpage>250</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Laming</given-names>
            <surname>Chen</surname>
          </string-name>
          , Guoxin Zhang, and
          <string-name>
            <given-names>Eric</given-names>
            <surname>Zhou</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Fast greedy MAP inference for Determinantal Point Process to improve recommendation diversity</article-title>
          .
          <source>In Advances in Neural Information Processing Systems</source>
          .
          <volume>5622</volume>
          -
          <fpage>5633</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Yuxiao</given-names>
            <surname>Dong</surname>
          </string-name>
          , Nitesh V Chawla,
          <string-name>
            <given-names>and Ananthram</given-names>
            <surname>Swami</surname>
          </string-name>
          .
          <year>2017</year>
          . metapath2vec:
          <article-title>Scalable representation learning for heterogeneous networks</article-title>
          .
          <source>In Proceedings of the 23rd ACM SIGKDD. ACM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Mouzhi</given-names>
            <surname>Ge</surname>
          </string-name>
          , Carla Delgado-Batenfeld, and
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Jannach</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Beyond accuracy: evaluating recommender systems by coverage and serendipity</article-title>
          .
          <source>In Proceedings of the fourth ACM conference on Recommender systems. ACM</source>
          ,
          <volume>257</volume>
          -
          <fpage>260</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Qiuxia</given-names>
            <surname>Lu</surname>
          </string-name>
          , Tianqi Chen, Weinan Zhang, Diyi Yang, and
          <string-name>
            <given-names>Yong</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Serendipitous personalized ranking for top-n recommendation</article-title>
          .
          <source>In 2012 WI-IAT. IEEE.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Tomoko</surname>
            <given-names>Murakami</given-names>
          </string-name>
          , Koichiro Mori, and
          <string-name>
            <given-names>Ryohei</given-names>
            <surname>Orihara</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Metrics for evaluating the serendipity of recommendation lists</article-title>
          .
          <source>In Annual conference of the Japanese society for artificial intelligence</source>
          . Springer,
          <fpage>40</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Tien</surname>
            <given-names>T Nguyen</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pik-Mai Hui</surname>
            ,
            <given-names>F Maxwell</given-names>
          </string-name>
          <string-name>
            <surname>Harper</surname>
          </string-name>
          ,
          <source>Loren Terveen, and Joseph A Konstan</source>
          .
          <year>2014</year>
          .
          <article-title>Exploring the filter bubble: the efect of using recommender systems on content diversity</article-title>
          .
          <source>In Proceedings of the 23rd WWW. ACM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Eli</given-names>
            <surname>Pariser</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>The filter bubble: How the new personalized web is changing what we read and how we think</article-title>
          .
          <source>Penguin.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Francesco</surname>
            <given-names>Ricci</given-names>
          </string-name>
          , Lior Rokach, and
          <string-name>
            <given-names>Bracha</given-names>
            <surname>Shapira</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Recommender systems: introduction and challenges</article-title>
          .
          <source>In Recommender systems handbook. Springer</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Guy</given-names>
            <surname>Shani</surname>
          </string-name>
          and
          <string-name>
            <given-names>Asela</given-names>
            <surname>Gunawardana</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Evaluating recommendation systems</article-title>
          .
          <source>In Recommender systems handbook.</source>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Chuan</surname>
            <given-names>Shi</given-names>
          </string-name>
          , Binbin Hu,
          <string-name>
            <given-names>Xin</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Philip</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Heterogeneous Information Network Embedding for Recommendation</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Yizhou</given-names>
            <surname>Sun</surname>
          </string-name>
          and Jiawei Han.
          <year>2013</year>
          .
          <article-title>Mining heterogeneous information networks: a structural analysis approach</article-title>
          .
          <source>Acm Sigkdd Explorations Newsleter</source>
          <volume>14</volume>
          ,
          <issue>2</issue>
          (
          <year>2013</year>
          ),
          <fpage>20</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Mi</given-names>
            <surname>Zhang</surname>
          </string-name>
          and
          <string-name>
            <given-names>Neil</given-names>
            <surname>Hurley</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Avoiding monotony: improving the diversity of recommendation lists</article-title>
          .
          <source>In Proceedings of the 2008 ACM conference on Recommender systems. ACM</source>
          ,
          <volume>123</volume>
          -
          <fpage>130</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Yuan</given-names>
            <surname>Cao</surname>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , Diarmuid Ó Séaghdha, Daniele Quercia, and
          <string-name>
            <given-names>Tamas</given-names>
            <surname>Jambor</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Auralist: introducing serendipity into music recommendation</article-title>
          .
          <source>In Proceedings of the fifth ACM WSDM . ACM.</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Zainab</surname>
            <given-names>Zolaktaf</given-names>
          </string-name>
          , Reza Babanezhad, and
          <string-name>
            <given-names>Rachel</given-names>
            <surname>Potinger</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <string-name>
            <given-names>A Generic</given-names>
            <surname>Top-N Recommendation Framework</surname>
          </string-name>
          <article-title>For Trading-of Accuracy, Novelty, and</article-title>
          <string-name>
            <surname>Coverage.</surname>
          </string-name>
          <article-title>In 2018 IEEE 34th ICDE</article-title>
          . IEEE.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>