<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>“Where Far Can Be Close”: Finding Distant Neighbors In Recommender Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vikas Kumar</string-name>
          <email>vikas@cs.umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Jarratt</string-name>
          <email>jarratt@cs.umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joseph A. Konstan</string-name>
          <email>konstan@cs.umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brent Hecht</string-name>
          <email>bhecht@cs.umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>GroupLens Research Dept. of Computer Science University of Minnesota</institution>
          ,
          <addr-line>Twin Cities Minneapolis</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <volume>19</volume>
      <issue>2015</issue>
      <fpage>3</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>Location and its corollary, distance, are critical concepts in social computing. Recommender systems that incorporate location have generally assumed that the utility of locationawareness monotonically decreases as entities get farther apart. However, it is well known in geography that places that are distant \as the crow ies" can be more similar and connected than nearby places (e.g., by demographics, experiences, or socioeconomic). We adopt theory and statistical methods from geography to demonstrate that a more nuanced consideration of distance in which \far can be close" { that is, grouping users with their \distant neighbors" { moderately improves both traditional and location-aware recommender systems. We show that the distant neighbors approach leads to small improvements in predictive accuracy and recommender utility of an item-item recommender compared to a \nearby neighbors" approach as well as other baselines. We also highlight an increase in recommender utility for new users with the use of distant neighbors compared to other traditional approaches.</p>
      </abstract>
      <kwd-group>
        <kwd>location-aware recommendations</kwd>
        <kwd>user clustering</kwd>
        <kwd>distant neighbors</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Collaborative ltering starts with the assumption that
past agreement is predictive of future agreement.
Separating recommenders into user and item domains, within which
users' agreement is more predictive, increases the extent to
which that core assumption is true. For this reason, we
do not build recommender systems that mix movies,
baseball cards, and sweaters into a single model. Konstan et
al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] give empirical evidence that partitioning user-user
recommenders by item domain improves predictive
accuracy, showing that recommending within Usenix newsgroups
rather than across them produces the lowest error. Similarly,
partitioning users has resulted in better predictive accuracy
than the full model [
        <xref ref-type="bibr" rid="ref18 ref25">18, 25</xref>
        ].
      </p>
      <p>
        Location-aware recommenders, for example, have used
geographic information as a lter to narrow the user-item
rating space for improved performance. LARS [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] partitions
users into geographic grid squares: an assumption that users
within a contiguous, compact grid square are more alike
than users not in that square. Other location-aware
recommenders [
        <xref ref-type="bibr" rid="ref1 ref28">28, 1</xref>
        ] assume similarity is proportional to
straightline (geodesic) distance.
      </p>
      <p>
        This paper extends that approach by recognizing
that location-aware recommenders can incorporate
nonproximate { distant, non-contiguous, and non-compact {
geographies into their recommendation models. Current
location-aware recommenders are grounded in Tobler's First
Law of Geography [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] (TFL): \everything is related to
everything else, but near things are more related than distant
things". However, Tobler emphasizes that it is a
rule-ofthumb rather than a law, and urges researchers to
consider more sophisticated notions of distance (e.g., population
density-controlled distance, border-aware distance,
socioeconomic distance). Recent work in peer production
communities nds evidence for di erent ways of calculating distance
in regards to the First Law [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. We do this by considering
\ratings preference distance" between locations, combined
with geodesic distance.
      </p>
      <p>We use this non-trivial notion of distance to determine
similar locations (speci cally postal codes: e.g., 19104 in
Philadelphia, Pennsylvania, is most similar to 61801 in
Urbana, Illinois) and partition users based on their associated
location, then build collaborative ltering models for each
partition. We demonstrate that general collaborative
ltering systems { even those that consider items that do not
have a unique spatial footprint, such as movies { can bene t
from an understanding of user geography, and speci cally
an understanding that goes beyond a \closer = more
relevant" assumption. We provide the rst evidence that
similarity of locations (based on ratings) does not monotonically
decrease with geodesic distance, meaning that locations
often have \distant neighbors". We demonstrate that
creating a user space based on location similarity moderately
improves predictive accuracy and recommender utility,
relative to baseline approaches and location-aware recommender
systems that rely just on proximate users.</p>
      <p>At a high level, this paper demonstrates that we
can harness geographic techniques to improve
recommender systems by limiting the recommender
model to groups of users in similar, though not
necessarily adjacent, locations. Intuitively, the argument
we make here for partitioning recommender systems based
on users rests on whether one can nd sets of users who
have signi cantly di erent \views of the world" that make
past agreement less predictive of future agreement. We can
think of this in the context of item-item collaborative
ltering, where we build a model of item-similarity (rating
correlations) of item association (item co-rating or co-purchase).
If items are considered related by one set of users, but not by
others (e.g., if Belgians see french fries associated with
mayonnaise while Americans see them associated with ketchup
but not mayonnaise), then an item model built across these
diverse groups of users may be less e ective at
recommendation than one based on partitions of users. Based on this
intuition, we look at partitions based on location
similarity by forming groups of distant neighbors, and make the
following contributions (compared to an item-item
recommender that uses no partitioning, a geographic partitioning
based only on local neighbors, and a random-partitioning
baseline):
1. We show improvement in prediction accuracy using
partitioning on similar locations (\distant neighbors").
2. In new user cold start, we show improvement in
recommender utility for Top-N lists.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>This paper is motivated by research in two disciplines:
geography and recommender systems. Below we outline
related work in each of these areas.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Tobler’s First Law</title>
      <p>
        Current use of geography in collaborative ltering is
almost always under the assumption that the most valuable
ratings come from nearby people { an assumption grounded
in the First Law of Geography (or \Tobler's First Law of
Geography", 1970) [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. In other words, the recommender
systems community has used distance decay as the
relevant property of location. But as noted by Tobler in 2004
[
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], deviations from pure distance decay, such as population
density, are common. Tobler's 2004 assertion has received
empirical support in the work of Li et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and Hecht
and Moxley [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], which found that the relatedness between
geographic entities in Wikipedia generally decreases with
geodesic distance, but that (as advocated by Tobler himself)
we need to consider more sophisticated notions of distance to
more completely model the relationship between relatedness
and distance. Our distant neighbors approach uses \ratings
preference distance" in combination with geodesic distance:
we show evidence that people do have similarity with
people around them, but that at a certain geographic scale (for
which we examine postal codes), they are just as similar
to a subset of distant people. This paper is the rst, to
our knowledge, to show that an understanding of TFL that
combines ratings preference similarity and geodesic distance
can lead to increased accuracy in recommendations.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Location-aware recommendation</title>
      <p>Location-aware recommender systems consider location in
two ways: (1) \nearby neighbors" ltering systems that
explicitly or implicitly encode a strict form of TFL and (2)
\just another feature" recommenders that treat location as
pro le features but do not apply geographic methods.
2.2.1</p>
      <p>“Nearby neighbors” filtering systems</p>
      <p>
        When recommenders are aware of location (i.e., from
mining GPS tracks [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] or asking a person for her postal code
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]), the systems use that context to personalize
suggestions to those relevant to nearby users or items [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Some
location-aware recommenders [
        <xref ref-type="bibr" rid="ref27 ref9">9, 27</xref>
        ] operate within domains
of items which have speci c spatial footprints [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] (such as
restaurants, people, or homes). For instance, recommenders
meant for mobile queries can calculate an area around a
person where her query remains valid [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ][
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] if, for instance,
she requires updated co ee shop recommendations as she
moves from neighborhood to neighborhood. Some
point-ofinterest recommenders [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] use co-visitation rates (implicitly
geographic).
      </p>
      <p>
        Other recommenders operate in domains of items which
do not have speci c spatial footprints, such as movies or
consumer products. Levandoski et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] tessellate users
into geographic grid cells, which is discussed in detail below.
Similarly, Das et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] apply Voronoi tessellation, a strictly
\nearby neighbors" method.
      </p>
      <p>
        The primary motivation for this work is that of
Levandoski et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], who introduce the \Location-Aware
Recommender System" (LARS). Like our work, LARS is a
partitioning process that occurs before collaborative ltering
model-building. LARS lays a hierarchy of grids over the
surface of the Earth at various levels of granularity, so that
each user belongs to one geographically contiguous and
compact square. LARS then creates an item-item recommender
model for each grid cell, suggesting movies { items without
speci c spatial footprints { based on the ratings of other
people in the same grid cell. Its authors nd an in ection
point where quality of recommendations is maximized: a 4x4
grid where users are partitioned into 16 geographic squares.
Smaller squares have too little data; larger squares include
too many less-similar users. LARS thereby encodes physical
proximity as a proxy for similarity, and is the rst paper
to contribute geographic user partitioning for collaborative
ltering. Our contribution contra LARS is that geographic
user partitions need not be contiguous nor a compact space.
2.2.2
      </p>
      <p>Location as “just another feature”</p>
      <p>
        Still other work in general recommender research treats
location information as \just another feature". Seroussi et
al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] use standard matrix factorization techniques to nd
latent relevant features in a person's preferences, with U.S.
state names drawn from postal codes as possible features.
Similarly, Kucuktunc et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] match postal codes with
census data, inferring demographics to match users in a
question-and-answer site. We see promise in Kucuktunc's
method, which uses postal codes as a way to guess
demographic tags for users. Their method is based on Weber
and Castillo's demographic analysis which is meant to
\depersonalize and de-localize" location information [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], yet
demographics are themselves geographically clustered [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],
resulting in a \distant neighbors" type of approach through
a demographic step. This approach has the bene t of
accounting for locations a recommender has never before
encountered { \location cold start" { since demographics are
often widely available.1
1www.ipums.org
      </p>
    </sec>
    <sec id="sec-5">
      <title>DATASET</title>
      <p>We use movie ratings from the MovieLens rating
community2, whose users have the option of entering their postal
code while signing up for the site. In MovieLens, users can
rate movies on a rating scale from 0.5 to 5.0 in increments of
0.5. For our experimental dataset, we select only those users
who entered a valid US postal code (which is about 22% of
all users). To ensure data density, we further exclude postal
codes that do not have at least 20 MovieLens users who
rated at least 20 movies. We nally randomly sample 5000
users to create a test dataset, resulting in 1779 postal codes
and 1.01 million ratings.3 The rating count is equivalent to
the extensively-used publicly available MovieLens dataset of
1 million ratings.4</p>
    </sec>
    <sec id="sec-6">
      <title>PARTITIONING APPROACHES</title>
      <p>We consider three partitioning approaches { distant
neighbors, local neighbors and random { to partition the user item
rating matrix.
4.1</p>
    </sec>
    <sec id="sec-7">
      <title>Distant neighbors partitioning</title>
      <p>Distant neighbors refers to users in a cluster of postal
codes based on rating similarity. In this section, we explain
how we identify distant neighbors from a user-item rating
dataset. We then use a traditional collaborative ltering
recommender on each of those clusters. Partitioning of the
user-item matrix into distant neighbors clusters occurs in
three stages:
1. Convert the user-item rating matrix to a location-item
rating matrix
2. Determine location-to-location similarity
3. Cluster locations to form groups of users associated
with their respective locations</p>
      <p>
        We create a location-item rating matrix from the
useritem rating matrix by matching postal codes with their
respective user identi ers. We then determine the rating of
a postal code for an item by averaging over the location's
2www.movielens.org
3This dataset is public at http://cs.umn.edu/ vikas
4www.grouplens.org/datasets/movielens
users' ratings for the same item. Based on previously
published work on MovieLens data [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], we use the Bayesian
average [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] with a damping value of 5, to dampen the
effect of ratings of movies rated by only few users in a postal
code.
      </p>
      <p>In the above process, for example, we determine the Toy
Story-15213 postal code (Pittsburgh, Pennsylvania) rating
from the ratings of all users in that postal code who rated
Toy Story.</p>
      <p>We then cluster locations in the location-item rating
matrix using spectral clustering.5 After each process that
generates C clusters of postal codes, we partition the original
user-item rating matrix into C partitions. Recall that each
user belongs to a single postal code and hence only to one
cluster. For a cluster count C = 1, the user-item rating
matrix remains the same as the traditional full model used for
recommendation. The process is illustrated in Figure 1.</p>
      <p>Why Distant neighbors?</p>
      <p>
        To understand how postal codes re ect rating preference
distance compared to geodesic distance, we use
geostatistics. The correlogram, a geostatistical tool, helps us
understand the correlation between geodesic distance and
similarity among places [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. We show this correlation in Figure
2, where on the x-axis we plot pairwise geodesic distance
between postal codes (lag = 100 km) and on the y-axis, the
average rating similarity of postal codes (separated by the
corresponding distance on the x-axis). To determine
similarity we compare the rating vectors for each postal code
using (a) cosine similarity, (b) Spearman's rank correlation,
and (c) Jaccard index. We also evaluate (d) cosine
similarity on postal code's item-popularity vectors instead. As
an example in Figure 2, we see that for postal codes
separated by 1000-1100 km, their average cosine similarity is
approximately 0.28.
      </p>
      <p>Now, going back to the de nition of Tobler's First Law
based on purely geodesic distance leads to the assumption
of the similarity of postal codes monotonically decreasing as
the straight-line distance between them increases. However,
as shown in Figure 2, it is immediately obvious that
Tobler's First Law in its geodesic distance interpretation does
not hold in the movie domain. While nearby postal codes
5We nd that 20 features and 50 iterations for a given cluster
results in better inter-cluster and intra-cluster density.
are more similar than slightly more distant postal codes,
average similarity does not decrease as distance increases. In
fact, there are several points where the average similarity of
locations separated by thousands of kilometers exceeds the
average similarity of nearby locations.</p>
      <p>Thus, recommender systems that only consider geodesic
distance using (often) the correlogram's rst bucket(s)
ignore the higher average similarity displayed by distant postal
code pairs. If these systems increase their geodesic distance
threshold in an attempt to include more users, the
newlyadded nearby locations may actually have lower average
similarity than distant locations.</p>
      <p>We reasoned that perhaps the nationwide popularity of
many movies causes the average similarity of distant postal
codes to increase. To test our hypothesis, we re-evaluate the
correlograms ignoring the most popular 20% of movies and
found the same result.
4.2</p>
    </sec>
    <sec id="sec-8">
      <title>Local neighbors</title>
      <p>For each postal code, we build a \local neighbors"
useritem matrix, which is a subsample of the entire dataset.
This consists of the location's own users, plus users from
the successively nearest locations (geodesic) until a given
number of users is reached (matching with the number of
users in the corresponding distant neighbors clusters). We
use this method to compare our approach to location-aware
recommenders that are based on location proximity instead
of similarity.
4.3</p>
    </sec>
    <sec id="sec-9">
      <title>Random partitioning</title>
      <p>As a baseline, we use random partitions where each
partition consists of a given number of users (matched with
the number of users in the corresponding distant neighbors
cluster), randomly selected from the original user-item
rating matrix.
5.</p>
    </sec>
    <sec id="sec-10">
      <title>BUILDING RECOMMENDATIONS</title>
      <p>
        We use the item-item collaborative ltering approach with
log-likelihood similarity [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] for the prediction and
evaluation of the di erent partitioning approaches on
recommendation. We build models for each partitions (or clusters)
independently i.e. if a partitioning approach creates N
useritem matrices then we build N recommendation models, one
for each matrix. For item recommendations to a user we
select her respective partition (determined by her postal code)
and use the respective model.
      </p>
      <p>Note, it is possible that with many clusters we may have
small user and item proportions per cluster. In such cases,
a recommender may fail to predict or recommend
resulting in lower number of successful predictions thus a ecting
the coverage of the recommender. For example, item-item
collaborative ltering fails to predict rating of an item for
a given user, if that item has no correlation with any other
item that user has rated. Hence, we compare the accuracy of
di erent partitioning approaches with respect to their
coverage. Using fallback predictions, we also compare the
accuracy of each partitioning approach at 100% coverage i.e.
whenever recommender fails to predict we either use (a) the
item mean rating, if item exists in the partition's training
set, or (b) the global mean rating if item does not exist in
the partition's training set.
6.</p>
    </sec>
    <sec id="sec-11">
      <title>EVALUATION</title>
      <p>At a high level, our methodology involves three stages.
First, we generate user-item rating matrix partitions in
several sizes for two approaches, distant neighbors and random.
For local neighbors, we build user-item rating matrices in
several sizes for each postal code, which we call \partitions"
for convenience. Next, for each partition, we build an
independent item-item recommender. Finally, we evaluate
prediction accuracy using RMSE (root mean squared error) and
recommendation accuracy with MAP (mean average
precision). We compare the performance of each approach to that
of an item-item recommender built using the full rating
matrix. We evaluate the techniques on four di erent numbers
of disjoint clusters { 1, 5, 20 and 50 { with user proportion
sizes of 100%, 20%, 5% and 2% respectively.</p>
      <p>
        For RMSE evaluation, we use 90% of ratings for each user
in the training data and keep 10% of their ratings in the
test set. In Top-N evaluation, for each user we build an
evaluation set of relevant items [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]: those rated 4 (of 5)
or higher, following [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We then keep only 80% of that
user's rating in the training data with all other users'
ratings. We repeat this process for a random sample of 10% of
users in the dataset and take the average of MAP for Top-50
recommendations (MAP@50) over this sample of users.
6.1
      </p>
    </sec>
    <sec id="sec-12">
      <title>Results</title>
      <p>The distant neighbors approach with 50 clusters shows
the best RMSE among all other partitioning approaches and
number of clusters, including the full model (\1 partition")
recommender, shown in Table 1. We nd the improvement
against the full model to be statistically signi cant (p &lt;
0.001, per user using the Wilcoxon Rank-Sum test). We
also note that for any given cluster count, distant
neighbors have more accurate predictions than other partitioning
approaches i.e. local neighbors and random partitions;
however, we nd the di erence between them to be statistically
signi cant only for 20 clusters.</p>
      <p>The distant neighbors approach's low coverage of 88.75%
compared to the full model's 99.87% means that we cannot
directly imply better performance. We note that at 20 and
50 partitions, distant neighbors has higher coverage than
local neighbors and random partitioning. Controlling for
100% coverage using fallback predictions, we observe that
distant neighbors at 20 clusters remains statistically signi
cantly more accurate than full-model item-item (p &lt; 0.05),
but the other two partitioning methods are no longer
significantly di erent than the full model.</p>
      <p>For recommendation metrics, distant neighbors shows
signi cantly better MAP@50 for cluster sizes of 20 and 50,
especially over the full model. We hypothesize here that
due to very small user proportions (and therefore items)
per cluster, the recommendations are limited to a speci c
set of items liked by most of the users in that cluster. To
understand this, we calculate the intra-list similarity (or
diversity) between the items, using the full training data, and
nd a much lower diversity using any partitioning method
compared to the full model.</p>
      <p>NEW</p>
    </sec>
    <sec id="sec-13">
      <title>TION</title>
    </sec>
    <sec id="sec-14">
      <title>USER COLD START EVALUA</title>
      <p>
        New users are a crucial part of a recommender system's
success, and this problem forms an important part of
system design decisions. Previous work has used location
information to solve the cold start problem [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], because user
context data can substitute for user rating data. We
analyze the e ect of our location-aware approach by simulating
the new user experience. Here we randomly sample 10% of
users from the training data (that contains 90% of ratings
for each user) and retain only a few ratings for those users
in the training data { 5 ratings per user { and discard other
ratings for that user [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. We evaluate the metrics on the
test data containing only these users.
7.1
      </p>
    </sec>
    <sec id="sec-15">
      <title>Results</title>
      <p>In case of cold start with no fallback predictions, we
observe distant neighbors with 20 clusters to have the
statistically signi cantly best predictive accuracy against the full
model (p &lt; 0.05, signi cance per user). However, we nd
this di erence to be not signi cant against local neighbors
based partitions for the same clusters.</p>
      <p>
        The coverage, like the previous evaluation, remains low
for larger number of clusters. With fallback predictions,
although distant neighbors has better RMSE compared to
local neighbors and random (not signi cant), no partition
method is able to beat the full model RMSE. Moreover, the
RMSE gets better with fallback predictions than the RMSE
without fallback predictions. This suggests that the item
baseline predictor is better than the full model in cold start;
we nd this result consistent with the results from Kluver
et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] on evaluation of item-item for new users.
      </p>
      <p>For recommendation utility, we observe local neighbors to
have better MAP@50 for 20 and 50 clusters compared to
any other partition or the full model. Distant neighbors is
able to do better than other partitions only for 5 clusters. In
contrast, with fallback predictions for all number of clusters,
we observe that distant neighbors is able to produce better
MAP@50 results (0.1972 with 50 clusters) with improvement
over local neighbors (0.14755), random (0.09753) and the full
model (0.0533).
8.</p>
    </sec>
    <sec id="sec-16">
      <title>DISCUSSION</title>
      <p>In this section, we summarize the results in light of the
assumptions and the limitations of our approach. We further
highlight the key takeaways and some anecdotes from our
dataset.</p>
      <p>
        We show that by controlling for coverage, grouping users
based on their location similarity { distant neighbors {
shows signi cant improvement in prediction accuracy over
other partitioning approaches for 20 clusters. In cold start,
though, we notice that partitioning fails to improve
prediction accuracy (RMSE). However, as Kluver et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
states that recommendation utility is more important for
new users, we perhaps regard distant neighbors better mean
average precision a positive result.
      </p>
      <p>
        Also, we note that the Top-N metrics, like MAP, have
popularity bias [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Mean average precision will be higher
for any algorithm that favors popular items. To understand
if such bias exists in our results, we look at popularity of the
recommended items. We nd that the local neighbors
recommender favors relatively more popular items than distant
neighbors, which recommends more popular items than
random, and than the full model. We calculate popularity based
on the number of users who rated the item in the full matrix.
We also determine the average user rating for the items for
the cold start situation to understand the relevance of
recommendation, by taking the mean of ratings by the user
for the relevant items in the recommendation list. We nd
that distant neighbors consistently provides
recommendations that have higher user average ratings: 4.57, compared
to local neighbors (4.25) and random (4.23). We therefore
nd the distant neighbors performance on Top-N metrics to
be a positive result.
      </p>
      <p>We consider these results as a proof of concept. We note
that the magnitude of the improvements is small, even if
statistically signi cant. We recognize that such small decreases
in RMSE are unlikely to, in and of themselves, result in a
signi cant change in user experience. Rather, we hope to see
further development and optimization of geographic
similarity into recommender systems with the promise of further
improvements in performance. We also note that the Top-N
results show even more promise, and deserve future
usercentered evaluation.</p>
      <p>To further interpret our results we revisit our intuition
of user worldviews that can manifest in rating space. By
\worldviews" we mean the agreement among a subset of
users who have signi cantly di erent \views of the world"
compared to other subsets. We see empirical evidence of how
1
5
20
50
1
5
20
50
0.98076
(98.81%)</p>
      <p>
        0
0.96914
(96.34%)
0.03846
college towns are more similar to other college towns than
to other postal codes in their own city, and hypothesize that
people on a particular college campus may share an
understanding of concepts (for example: registration, fraternities,
and late-night pizza) with people in places far apart. For
instance, as seen in Table 3, the postal code 48104
(representing a university campus in Ann Arbor, Michigan) is most
similar to 55414, another university campus in Minneapolis,
Minnesota, about 900 km away. We further hypothesize that
similarity among users is not primarily based on
straightline distance but on a more nuanced distance { cultural or
socioeconomic { that manifests in a community consensus
of agreement on item likeness, in the same way that the
semantic relatedness of spatially-referenced Wikipedia articles
is not fully consistent with geodesic promiximity [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        We also believe that clusters based on location provide
advantages over clusters based directly on users' ratings [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]:
because (a) the number of postal codes is few compared to
the number of users, an important factor when clustering
within a very large rating matrix; (b) the exogenous
information from location can help explain the clusters.
      </p>
    </sec>
    <sec id="sec-17">
      <title>LIMITATIONS</title>
      <p>An important limitation of our work is that our
experiment considered only United States postal codes. The
United States has a large variety of cultures and
demographics spread throughout the country. Di erent countries may
display di erent properties.</p>
      <p>The other limitation is the low user density within postal
codes used for evaluation. We nd 1779 postal codes
associated with 5000 users which means we have on average 3
users per postal code. Recognizing this limitation, we
performed a predictive accuracy experiment on an equivalent
sized dataset by picking postal codes (which has its own
bias of oversampling users from dense locations) and found
RMSE results were consistent.</p>
      <p>
        In addition, our experiment considered only a Bayesian
average to calculate postal codes' item rating midpoints.
Di erent similarity methods, such as Kullback{Leibler
divergence [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] of rating distributions may provide better
understanding of similarity between two places.
      </p>
      <p>While we calculate local neighbors by proximity, LARS
tessellated users into geographic grid squares. One of LARS'
major contributions was spatial data structure optimization
for best storage and recommendation performance. The use
of a spatial database is out of the scope of our paper.</p>
      <p>Finally, MovieLens users enter postal codes at registration
and rarely update that information, even if they move.
10.</p>
    </sec>
    <sec id="sec-18">
      <title>FUTURE WORK</title>
      <p>There is signi cant complexity around what kind of
location-aware partitioning may improve recommendations.
Other methods might include ESRI's Tapestry dataset,
which groups postal codes not on item rating similarity but
with \lifestyle" { that is, demographic and socioeconomic {
weighted tags. For example, the top lifestyle segments for
94043 (Mountain View, California) are Enterprising
Professionals (38%), Trendsetters (14%), and Laptops and Lattes
(10%).</p>
      <p>We are interested in how this process applies to item
domains other than movies, especially for items with speci c
spatial footprints. We intuit that a distant neighbors
approach may work less well considering individual places (say,
a speci c restaurant) but may work just as well considering
place features (say, a cuisine that applies to many
restaurants).</p>
      <p>
        Location-aware recommenders should be able to use the
context of location to explain their suggestions: e.g., that
people in a similar location o ered relevant opinions.
Future work involves exploring the value of location-based
explanations over those focused on other dimensions of
similarity. Herlocker et al. nd that 86% of recommender
system users value additional explanations [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], and
locationbased partitions are explainable. Placenames are
immediately available, and demographic inference (e.g., \another
college town") is possible.
      </p>
      <p>
        Finally, although we chose postal codes as the unit of
spatial granularity in our system, location is de ned as part of
a hierarchy [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and we could have been more or less precise.
In an area of the world with smaller nations and stricter
cultural borders { say, Europe or Central America { we may
nd the applicable scale is at the level of nations and
languages. Spatial granularity may also be inconsistently scaled
within the same dataset: if speaking German is predictive,
it may be at the national level in Europe, regional level in
the United States, and city level in South America.
11.
      </p>
    </sec>
    <sec id="sec-19">
      <title>CONCLUSION</title>
      <p>We demonstrate that we can harness geographic
techniques to improve recommender systems by limiting the
recommender model to groups of users in similar locations. We
show similar locations are determined by a combination of
ratings preference and geodesic distance, an understanding
that goes beyond a \closer = more relevant" assumption. We
provide the rst evidence that rating similarity of locations
does not monotonically decrease with geodesic distance, and
describe a geostatistical method to discover \distant
neighbors". We show that creating a user space based on location
similarity improves predictive accuracy and recommender
utility, including the new user cold start scenario and
controlling for full coverage.</p>
      <p>Over-relying on the opinions of nearby places can lead to
the inclusion of proximate but less similar people, whereas
the power of distant neighbors in recommender systems is
smart partitioning of the underlying dataset to include the
right users among all those far away.</p>
      <p>We urge a reconsideration of geographic relationships in
recommender systems, where the value of geography goes
beyond geodesic distance.
12.</p>
    </sec>
    <sec id="sec-20">
      <title>NOTES</title>
      <p>Our code and datasets, as well as results for additional
metrics, are available at http://cs.umn.edu/ vikas.
13.</p>
    </sec>
    <sec id="sec-21">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work was supported in part by National Science
Foundation (IIS-0808692 and IIS-0964695) and the
University of Minnesota's Social Media and Business
Analytics Collaborative (SOBACO). We also would like to thank
Daniel Kluver (University of Minnesota) and Michael
Ekstrand (Texas State University) for their continuous
feedback and input in the paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Adomavicius</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Tuzhilin</surname>
          </string-name>
          .
          <article-title>Context-aware recommender systems</article-title>
          .
          <source>In Recommender systems handbook</source>
          , pages
          <volume>217</volume>
          {
          <fpage>253</fpage>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zheng</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Mokbel</surname>
          </string-name>
          .
          <article-title>Location-based and preference-aware recommendation using sparse geo-social networking data</article-title>
          .
          <source>In Proceedings of the 20th International Conference on Advances in Geographic Information Systems</source>
          , pages
          <fpage>199</fpage>
          {
          <fpage>208</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wilkie</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Mokbel</surname>
          </string-name>
          .
          <article-title>Recommendations in location-based social networks: a survey</article-title>
          .
          <source>GeoInformatica</source>
          , pages
          <volume>1</volume>
          {
          <fpage>41</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Bill</surname>
          </string-name>
          .
          <article-title>The Big Sort: Why the Clustering of Like-Minded America Is Tearing Us Apart</article-title>
          . New York: Houghton Mi in Company,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>J. Das</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Majumder</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Gupta</surname>
          </string-name>
          .
          <article-title>Voronoi based location aware collaborative ltering</article-title>
          .
          <source>In 3rd National Conference on Emerging Trends and Applications in Computer Science (NCETACS)</source>
          , pages
          <fpage>179</fpage>
          {
          <fpage>183</fpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Grossner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Goodchild</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K. C.</given-names>
            <surname>Clarke. De ning</surname>
          </string-name>
          <article-title>a digital earth system</article-title>
          .
          <source>Transactions in GIS</source>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ):
          <volume>145</volume>
          {
          <fpage>160</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Hecht</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Moxley</surname>
          </string-name>
          .
          <article-title>Terabytes of Tobler: evaluating the rst law in a massive, domain-neutral representation of world knowledge</article-title>
          .
          <source>In Spatial information theory</source>
          , pages
          <volume>88</volume>
          {
          <fpage>105</fpage>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Explaining collaborative ltering recommendations</article-title>
          .
          <source>In Proceedings of the 2000 ACM conference on computer supported cooperative work</source>
          , pages
          <volume>241</volume>
          {
          <fpage>250</fpage>
          . ACM,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Horozov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Narasimhan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Vasudevan</surname>
          </string-name>
          .
          <article-title>Using location for personalized poi recommendations in mobile environments</article-title>
          .
          <source>In International Symposium on Applications and the Internet (SAINT)</source>
          . IEEE,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Kluver</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          .
          <article-title>Evaluating recommender behavior for new users</article-title>
          .
          <source>In Proceedings of the 8th ACM conference on recommender systems</source>
          , pages
          <volume>121</volume>
          {
          <fpage>128</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. N.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Maltz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. R.</given-names>
            <surname>Gordon</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Grouplens: applying collaborative ltering to usenet news</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>40</volume>
          (
          <issue>3</issue>
          ):
          <volume>77</volume>
          {
          <fpage>87</fpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>O.</given-names>
            <surname>Kucuktunc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Cambazoglu</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Weber</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Ferhatosmanoglu</surname>
          </string-name>
          .
          <article-title>A large-scale sentiment analysis for Yahoo! answers</article-title>
          .
          <source>In Proceedings of the fth ACM international conference on web search and data mining</source>
          , pages
          <volume>633</volume>
          {
          <fpage>642</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kullback</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Leibler</surname>
          </string-name>
          .
          <article-title>On information and su ciency</article-title>
          .
          <source>The annals of mathematical statistics</source>
          , pages
          <volume>79</volume>
          {
          <fpage>86</fpage>
          ,
          <year>1951</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Levandoski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sarwat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Eldawy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Mokbel</surname>
          </string-name>
          .
          <article-title>Lars: A location-aware recommender system</article-title>
          .
          <source>In 28th International Conference on Data Engineering (ICDE)</source>
          , pages
          <fpage>450</fpage>
          {
          <fpage>461</fpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          , W. Liu, and W.-Y. Ma.
          <article-title>Mining user similarity based on location history</article-title>
          .
          <source>In Proceedings of the 16th ACM SIGSPATIAL international conference on advances in geographic information systems, page 34. ACM</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>T. J.-J. Li</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Sen</surname>
            , and
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Hecht</surname>
          </string-name>
          .
          <article-title>Leveraging advances in natural language processing to better understand Tobler's rst law of geography</article-title>
          .
          <source>In Proceedings of the 2014 SIGSPATIAL Conference. ACM</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Oden</surname>
          </string-name>
          .
          <article-title>Assessing the signi cance of a spatial correlationogram</article-title>
          .
          <source>Geographical Analysis</source>
          ,
          <volume>16</volume>
          (
          <issue>1</issue>
          ):1{
          <fpage>16</fpage>
          ,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>A. M. Rashid</surname>
            ,
            <given-names>S. K.</given-names>
          </string-name>
          <string-name>
            <surname>Lam</surname>
            , G. Karypis, and
            <given-names>J. Riedl.</given-names>
          </string-name>
          <article-title>ClustKNN: a highly scalable hybrid model- &amp; memory-based CF algorithm</article-title>
          .
          <source>Proceeding of WebKDD</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>B.</given-names>
            <surname>Sarwar</surname>
          </string-name>
          , G. Karypis,
          <string-name>
            <given-names>J.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Item-based collaborative ltering recommendation algorithms</article-title>
          .
          <source>In Proceedings of the 10th international conference on World Wide Web</source>
          , pages
          <volume>285</volume>
          {
          <fpage>295</fpage>
          . ACM,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Seroussi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bohnert</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Zukerman.</surname>
          </string-name>
          <article-title>Personalised rating prediction for new users using latent factor models</article-title>
          .
          <source>In Proceedings of the 22nd ACM conference on hypertext and hypermedia</source>
          , pages
          <volume>47</volume>
          {
          <fpage>56</fpage>
          . ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>W.</given-names>
            <surname>Tobler</surname>
          </string-name>
          .
          <article-title>On the rst law of geography: A reply</article-title>
          .
          <source>Annals of the Association of American Geographers</source>
          ,
          <volume>94</volume>
          (
          <issue>2</issue>
          ):
          <volume>304</volume>
          {
          <fpage>310</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>W. R.</given-names>
            <surname>Tobler</surname>
          </string-name>
          .
          <article-title>A computer movie simulating urban growth in the detroit region</article-title>
          .
          <source>Economic geography</source>
          , pages
          <volume>234</volume>
          {
          <fpage>240</fpage>
          ,
          <year>1970</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vargas</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Castells</surname>
          </string-name>
          .
          <article-title>Rank and relevance in novelty and diversity metrics for recommender systems</article-title>
          .
          <source>In Proceedings of the 5th ACM conference on recommender systems</source>
          ,
          <source>RecSys '11</source>
          , pages
          <fpage>109</fpage>
          {
          <fpage>116</fpage>
          . ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>I.</given-names>
            <surname>Weber</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          .
          <article-title>The demographics of web search</article-title>
          .
          <source>In Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval</source>
          , pages
          <volume>523</volume>
          {
          <fpage>530</fpage>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>B.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Cai</surname>
          </string-name>
          .
          <article-title>An exploration of improving collaborative recommender systems via user-item subgroups</article-title>
          .
          <source>In Proceedings of the 21st international conference on World Wide Web</source>
          , pages
          <volume>21</volume>
          {
          <fpage>30</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yang</surname>
          </string-name>
          and
          <string-name>
            <surname>Z. Zhang.</surname>
          </string-name>
          <article-title>Combining prestige and relevance ranking for personalized recommendation</article-title>
          .
          <source>In Proceedings of the 22nd ACM international conference on information &amp; knowledge management</source>
          , pages
          <year>1877</year>
          {
          <year>1880</year>
          . ACM,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          and
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>A survey of point-of-interest recommendation in location-based social networks</article-title>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Papadias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tao</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>Location-based spatial queries</article-title>
          .
          <source>In Proceedings of the ACM SIGMOD international conference on management of data</source>
          , pages
          <volume>443</volume>
          {
          <fpage>454</fpage>
          . ACM,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>