<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.1145/1235</article-id>
      <title-group>
        <article-title>Weighted Random Walks for Meta-Path Expansion in Heterogeneous Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fatemeh Vahedian</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robin Burke</string-name>
          <email>rburke@cs.depaul.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bamshad Mobasher</string-name>
          <email>mobasher@cs.depaul.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Web Intelligence, DePaul University</institution>
          ,
          <addr-line>Chicago, IL 60604</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <abstract>
        <p>In social networks, users and items are joined in a complex web of relations, which can be modeled as heterogeneous information networks. Such networks include a variety of object types and the rich relations among them. Recent research has shown that a hybrid recommendation approach combining components built from extended meta-paths in the network can improve the accuracy of recommendations in such networks. However most of this recent work is focused on unweighted heterogeneous networks, and simplifying relations by ignoring weight (including user ratings) loses important information. We propose a random walk based model to generate meta-paths in weighted heterogeneous network in which the frequency of edge sampling is a function of edge weight, and demonstrate that performance is improved using this method.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Much research has been focused on recommender systems
in heterogeneous networks using extended relations based
on meta-paths, showing that accuracy can be improved over
the use of simple direct relations. This bene t has been
demonstrated with several di erent recommendation
algorithms including multi-relational matrix factorization [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ],
weighted hybrid of low-dimensional components [
        <xref ref-type="bibr" rid="ref1 ref2 ref5 ref6">2, 1, 6, 5</xref>
        ]
and non-negative matrix factorization [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ].
      </p>
      <p>However, these models all assume a uniform preference
associated with all relations in the network. In many
networks, however, users provide rating values that can provide
useful information. The lack of user rating values in
generating meta-paths can be misleading. For example, in a movie
recommender, if user u gave movie m1 the lowest possible
rating of 1, then it would be unproductive to treat this
connection as having the same value as another movie m2 to
which the user gave a higher rating. So, where such
information is available, it is essential to rank di erently the paths
starting from highly rated movies and the ones that are not
as interesting to the user.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>RANDOM WALK SAMPLING</title>
      <p>
        Christo el et al.[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] introduced a random walk sampling
algorithms to calculate the transition probability in a
random walk model to rank items and generate
recommendation model. This model works from an unweighted bipartite
graph which represents the binary relations among user and
items. Building on this approach, we propose a method
for generating meta-paths in a heterogeneous network using
biased random walk sampling. This method has the
advantage of creating greater e ciency in meta-path generation
and allowing for sensitivity to user ratings.
      </p>
      <p>
        The goal of meta-path generation is to create a relation
based on paths through the network. For example, the
extended user movie actor movie user meta-path
enables the system to start with a given user and nd other
users that have watched movies containing actors in common
with the user's movies. The semantics of this operation of
meta-path expansion is that the end result is a set of
destination nodes (in this case, users) weighted by how many of
the expanded meta-paths reach that node. A random-walk
version of this process chooses edges from the next relation
in the meta-path randomly instead of following all possible
paths. This is more e cient than generating all paths and
the number of samples can be chosen to be large enough
to provide a good estimate of what a full expansion would
provide [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In this work, we look speci cally at networks
involving a single \rating" edge from user to item. In other
words, the rst connection from a user is assumed to be to
an item and is assumed to have a weight that represents the
user rating with higher rated items being more preferred.
This construction is common in recommendation contexts
where users' quantitative preferences can be gathered.
      </p>
      <p>
        Random walk meta-path expansion therefore uses the
process shown in Algorithm 1. The algorithm takes as input a
Algorithm 1 Random walk meta-path generation
Require: l [u] // Initialize path with starting node: user
Require: m metapath // Queue of edge types
function Generate(l,m)
if m 6= fg then
me pop(m); // Next edge type
n l[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] //Current node
E GetEdges(n, me) // Get edges of type me
if me = user-item then
      </p>
      <p>hn; j; vi WSample(E) //weighted
else</p>
      <p>hn; j; vi USample(E) //uniform
push(j; l); //Add node j to path</p>
      <p>MPgenerate(l,m)
else</p>
      <p>return l
list with a single user as the start node and returns a single
random walk guided by the meta-path. The functions
USample and WSample, which is omitted for reasons of space,
each returns an edge from the list. In the case of USample
all edges have equal probability. In the case of WSample,
the edge is chosen with probability proportional to ew.</p>
    </sec>
    <sec id="sec-3">
      <title>3. EXPERIMENTS AND RESULTS</title>
      <p>
        In order to test the meta-path algorithm and its impact on
recommendation model we build a movie recommendation
using multi-relational matrix factorization (DMF) [
        <xref ref-type="bibr" rid="ref4 ref8">4, 8</xref>
        ] by
incorporating additional relations built from extended
metapaths. For this paper, we used randomly selected a 33%
subset of the MovieLens 1M dataset 1. We generated four
meta-path relations starting from the user(user ! movie !
actor; user ! movie ! director; user ! movie ! actor !
movie; user ! movie ! director ! movie). The
models were optimized using BPR as the optimization
criterion (BPR-opt), as described in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Figure 1 shows the
results for recall and precision on recommendation lists of
length one through ten for the three recommendation
models. DM F -rw ,the model using extended paths through
random walk, outperforms other generated models. DM F -pc
represents the model using meta-paths generated in a
traditional way, nally DM F shows the result for models using
just direct link of the network.
      </p>
      <p>As anticipated, random sampling for meta-path
generation is much faster than generating the full meta-path
rela1http://grouplens.org/datasets/movielens/
tions. On our test machine, the random walk method takes
less than 5% of the time required by the baseline technique
to generate U M A and U M AM meta-paths. Since
metapath generation is a major portion of the overall learning
time for this system, the random sampling technique would
be strongly preferred even if its accuracy were not better.
4.</p>
    </sec>
    <sec id="sec-4">
      <title>CONCLUSION</title>
      <p>We proposed a weighted sampling method to generate
metapaths in weighted heterogeneous network. The results show
multi-relational matrix factorization recommendation using
those meta-paths can be enhanced comparing to previous
method. Additionally, random walk based meta-path
generation is more e cient while not sacri cing accuracy. As
our future work, we will be exploring other multi-relational
algorithms and hybrid model using weighted sampling and
extend the application of weighted sampling to other types
of weighted edges.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work was supported in part by the National Science
Foundation under Grant No. IIS-1423368
(Multi-dimensional Recommendation in Complex Heterogeneous Networks).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          .
          <article-title>Social web recommendation using metapaths</article-title>
          .
          <source>In RSWeb@RecSys</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Burke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <article-title>Hybrid recommendation in heterogeneous networks</article-title>
          .
          <source>In UMAP 2014</source>
          , pages
          <fpage>49</fpage>
          {
          <fpage>60</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>F.</given-names>
            <surname>Christo el</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Paudel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Newell</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernstein</surname>
          </string-name>
          .
          <article-title>Blockbusters and wall owers: Accurate, diverse, and scalable recommendations with random walks</article-title>
          .
          <source>In Proceedings of the 9th ACM Conference on Recommender Systems, RecSys '15</source>
          , pages
          <fpage>163</fpage>
          {
          <fpage>170</fpage>
          , New York, NY, USA,
          <year>2015</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L. R.</given-names>
            <surname>Drumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Diaz-Aviles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Schmidt-Thieme</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Nejdl</surname>
          </string-name>
          .
          <article-title>Optimizing multi-relational factorization models for multiple target relations</article-title>
          .
          <source>In CIKM 2014</source>
          , pages
          <fpage>191</fpage>
          {
          <fpage>200</fpage>
          , New York, NY, USA,
          <year>2014</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          .
          <article-title>Weighted hybrid recommendation for heterogeneous networks</article-title>
          .
          <source>In RecSys '14</source>
          , pages
          <fpage>429</fpage>
          {
          <fpage>432</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>Predicting component utilities for linear-weighted hybrid recommendation</article-title>
          .
          <source>In RSWeb</source>
          <year>2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Burke</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <article-title>Network-based extension of multi-relational factorization models</article-title>
          .
          <source>In Poster Proceedings of the 9th ACM Conference on Recommender Systems, RecSys</source>
          <year>2015</year>
          , Vienna, Austria,
          <year>September 16</year>
          ,
          <year>2015</year>
          .,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Vahedian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>Burke</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Mobasher</surname>
          </string-name>
          .
          <article-title>Meta-path selection for extended multi-relational matrix factorization</article-title>
          .
          <source>In Proceedings of the Twenty-Ninth International Florida Arti cial Intelligence Research Society Conference, FLAIRS</source>
          <year>2016</year>
          ,
          <string-name>
            <given-names>Key</given-names>
            <surname>Largo</surname>
          </string-name>
          , Florida, May
          <volume>16</volume>
          -18,
          <year>2016</year>
          ., pages
          <volume>566</volume>
          {
          <fpage>571</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sturt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Khandelwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Norick</surname>
          </string-name>
          , and J. Han.
          <article-title>Personalized entity recommendation: A heterogeneous information network approach</article-title>
          .
          <source>In WSDM 2014</source>
          , pages
          <fpage>283</fpage>
          {
          <fpage>292</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sturt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Khandelwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Norick</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Han.</surname>
          </string-name>
          <article-title>Recommendation in heterogeneous information networks with implicit user feedback</article-title>
          .
          <source>In Proceedings of the 7th ACM Conference on Recommender Systems, RecSys '13</source>
          , pages
          <fpage>347</fpage>
          {
          <fpage>350</fpage>
          , New York, NY, USA,
          <year>2013</year>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>