<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Identifying salient topics for personalized place similarity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Benjamin Adams</string-name>
          <email>b.adams@auckland.ac.nz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Raubal</string-name>
          <email>R@Locate14</email>
          <email>mraubal@ethz.ch</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre for eResearch, The University of Auckland</institution>
          ,
          <country country="NZ">New Zealand</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute for Cartography and Geoinformation</institution>
          ,
          <addr-line>ETH Z u ̈rich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The ability to find similar places is an important component to geographic information retrieval applications as varied as travel recommendation services, marketing analysis tools, and socio-ecological research. Using generative topic modelling on a large collection of place descriptions, we can represent places as distributions over thematic topics, and quantitatively measure similarity for places modelled with these topic signatures. However, existing similarity measures are context independent; in cognitive science research there exists evidence that when people perform similarity judgments they will weigh properties differently depending on personal context. In this paper we present a novel method to re-weight the topics that are broadly associated with a place, based on users' interests inferred from sample place similarity rankings. We evaluate the method by training topics associated with texts about places, and perform a user study that compares user-provided similar places to those provided by automatically personalised place rankings. The results demonstrate improved correspondence between user rankings and automated rankings when personalised weights are applied.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        as mixtures of topics derived from probabilistic topic modelling on a corpus of place descriptions. These descriptions can
take many forms, including travel entries, encyclopaedia articles, newspaper articles, and social media postings. These data
sources are plentiful and all provide insight into the experiences of people in various places and, in particular, attributes
that are commonly described for those places. Topics are trained using latent Dirichlet allocation (LDA) topic model
inferencing on a set of these crowdsourced documents
        <xref ref-type="bibr" rid="ref4">(Blei et al., 2003)</xref>
        . Topic model training applies Bayesian inference
to arrive at a representation of each place as a probability vector over a finite set of topics. Once we have a topic distribution
for each place, we can compare their similarities in terms of the relative entropy of these topic distributions. This measure
captures the similarity of the places as derived from an aggregation of descriptions from many people. In other words,
the similarity of two places is defined as similarity of the prototypical or “average” representations of the places. There is
psychological evidence that for many kinds of categories a prototypical instance of the category can be represented as an
average from a set of exemplars in this manner
        <xref ref-type="bibr" rid="ref38">(Rosch, 1978)</xref>
        .
      </p>
      <p>
        However, there is also evidence that when performing similarity (or dissimilarity) judgements on a set of stimuli,
people will assign higher salience to some properties than others
        <xref ref-type="bibr" rid="ref29">(Medin et al., 1993)</xref>
        . In conceptual space theory, it is
proposed that these salience differences can be modelled as weights on the quality dimensions on which these similarity
measurements are based
        <xref ref-type="bibr" rid="ref10 ref34">(Ga¨rdenfors, 2000; Raubal, 2004)</xref>
        . When people judge the similarity of two places, they choose
a set of salient properties on which to compare the two places. This set of properties will be smaller than the total set of
possible properties on which one could possibly compare the two places
        <xref ref-type="bibr" rid="ref45">(Tversky, 1977)</xref>
        . In addition, the set of properties
that are salient will vary from person to person and depend on the places being compared. The challenge, therefore, is to
identify which weighting to use in a particular context. This feature selection problem becomes one of further reducing the
dimensionality of the topic space to a subset of topics that are relevant to a user in a specific context.
      </p>
      <p>In this paper we present a methodology to 1) generate a prototypical mixture of topics associated with places based
on crowdsourced natural language descriptions, 2) identify the topics of interest to an individual user, and 3) provide a
personalised ranking of similar places to a source place based on the inferred interests of the user. We propose a method to
automatically re-weight the topics using a sample similarity ranking for a small subset of the data.</p>
      <p>In the following section, we provide background on computing place, personalised search and ranking, and topic models.
Section 3 describes our method to identify salient topics. Following that, Section 4 presents the results of a user evaluation
of the method. Finally, we conclude and point to future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>2.1</p>
      <p>Place
This section provides background material on computing place, personalised search and ranking, place recommendation,
and topic modelling algorithms used in this paper.</p>
      <p>
        Several researchers have noted that place is a subjective, experiential phenomenon that is socially constructed
        <xref ref-type="bibr" rid="ref35 ref44">(Relph, 1976;
Tuan, 1977)</xref>
        . Simply put several of the characteristics of places that are salient to people cannot be captured by a spatial
footprint and structured data, such as population or median income. The difficulty in developing place representations
that take into account common-sense understandings has meant that, to a great degree, place has been represented in a
limited manner in geographic information systems (GISs). However, there has been a renewed interest in recent years
in modelling place more holistically as it becomes clear that traditional models are insufficient for the needs of many
technologies
        <xref ref-type="bibr" rid="ref19 ref48">(Jordan et al., 1998; Winter et al., 2009)</xref>
        . An increasingly vast amount of data is available online that describe
places and types of places in natural language. They come in a variety of forms: e.g., travelogues, encyclopaedia entries,
microblogs, literary works, social media sites, etc. These documents provide us with an extraordinary opportunity to
understand places not in terms of quantitative, tabular data, but rather in terms of how people write about them.
2.2
      </p>
      <p>
        Personalised search and ranking
Personalised search and ranking on the World Wide Web has been an ongoing research area for several years
        <xref ref-type="bibr" rid="ref30 ref32 ref43">(Pitkow et al.,
2002; Teevan et al., 2005; Micarelli et al., 2007)</xref>
        . Pitkow et al.
        <xref ref-type="bibr" rid="ref32">(Pitkow et al., 2002)</xref>
        have identified two main strategies for
personalising search: 1) query augmentation, which uses additional contextual information to refine the query; and 2) result
processing, which filters the search results based on a user model. In addition to algorithmic approaches to personalisation,
user interface design can play a large role in how well search results can be personalised
        <xref ref-type="bibr" rid="ref43">(Teevan et al., 2005)</xref>
        .
      </p>
      <p>
        Much of the personalisation research relies on using previous search behaviour to develop a model of the user’s interests
and using this model to personalise search results on the web
        <xref ref-type="bibr" rid="ref43">(Teevan et al., 2005)</xref>
        . Developing a user profile based on
implicit information has several advantages over asking the user for explicit feedback, because not only does gathering
explicit feedback impose an added burden on the user but also users can provide inaccurate information
        <xref ref-type="bibr" rid="ref40">(Speretta and
Gauch, 2005)</xref>
        . In mobile search, user-based personalisation can be augmented by location-based personalisation and these
factors can interrelate to model how a user’s interests change based on location
        <xref ref-type="bibr" rid="ref6">(Bouidghaghen et al., 2011)</xref>
        . Formal
ontologies can be used to align user interests that are inferred from search history with well-defined concepts described in
a semantic hierarchy
        <xref ref-type="bibr" rid="ref7">(Daoud et al., 2007)</xref>
        .
      </p>
      <p>
        Once there is a model that associates user categories with specific interests, it is possible to use different methods of
characterising user similarity to provide personalised results for categories of users
        <xref ref-type="bibr" rid="ref2 ref28">(McKenzie et al., 2013)</xref>
        . Guy et al.
        <xref ref-type="bibr" rid="ref13">(Guy et al., 2010)</xref>
        have explored comparing user similarity in social media along 3 different axes of people, things, and
places.
2.3
      </p>
      <p>
        Semantic similarity in GIScience
The importance of context in semantic similarity measurement is a well-studied research topic in geographic information
science
        <xref ref-type="bibr" rid="ref11 ref17 ref34 ref37">(Raubal, 2004; Rodr´ıguez and Egenhofer, 2004; Janowicz et al., 2011)</xref>
        . Commonly, context is modelled by
applying weights to the attributes that used to describe geospatial features; however, finding the appropriate weights is a
challenge (
        <xref ref-type="bibr" rid="ref24">Keßler, 2012</xref>
        ). Semantic referencing is an algorithm schema that uses the idea of control similarities provided
by a user to semi-automatically calibrate factor weights for similarity measurement of geographic entities
        <xref ref-type="bibr" rid="ref16">(Janowicz et al.,
2010)</xref>
        . The methods presented here are a form of semantic referencing.
      </p>
      <p>
        The problem addressed in this paper (personalised similarity of places) is distinct from previous work on personalised
web ranking, since we focus on the specific problem of finding similar places. Although individuals have different
impressions of places, due to the nature of places being manifested in the physical world, there are broad regularities in the
types of activities and features that people write about a given place. Because of these regularities, a crowdsourced
representation of a place will have broad applicability. Thus, the search for places is not an information retrieval task on an
individual document or artefact but rather a similarity search for places, represented as aggregations derived from multiple
documents. Personalised place similarity remains a relatively unexplored research area outside of smaller-scale
investigations in tourism and sense-of-place research
        <xref ref-type="bibr" rid="ref20 ref21 ref31">(Moore and Graefe, 1994; Kaltenborn, 1998; Jorgensen and Stedman, 2006)</xref>
        and sociological investigations of place attachment
        <xref ref-type="bibr" rid="ref41">(Stedman, 2002)</xref>
        .
      </p>
      <p>As with prior user behaviour for web personalisation, examples of similar places provide a mechanism to infer the
interests of the user without requiring him or her to explicitly enter interests. Furthermore, the sample ranking of similar
places – and which is provided directly by the user in this evaluation – can also be gathered from social data.
2.4</p>
      <p>
        Topic models
Probabilistic topic models, such as latent Dirichlet allocation (LDA), provide methods to characterise the documents in a
natural language corpus as multinomial mixtures of topics
        <xref ref-type="bibr" rid="ref4">(Blei et al., 2003)</xref>
        . Each topic is further modelled as a discrete
probability distribution over all the words in a corpus. The top most probable words for a topic are usually semantically
interpretable by humans. E.g., a topic with the top terms “wine, red, drink, cheese” can be interpreted as being about the
activity of wine tasting
        <xref ref-type="bibr" rid="ref2 ref28 ref46">(Adams and McKenzie, 2013)</xref>
        .
      </p>
      <p>
        LDA models the generation of each document in the corpus as an iterative process; first, a topic is chosen based on the
topic distribution for the document; second, a word is randomly selected from that topic. This is repeated for each term in
the document. Since each word is drawn randomly and is not based on the previous word, word order makes no difference
in the LDA model. Assuming this generative process for how the documents were created, the words of an existing set
of documents become the observed variables in the model and Bayesian inference is used to infer the most-likely topics
to have generated the data. This inference can be performed programmatically in an approximate manner using a markov
chain Monte Carlo Gibbs sampling algorithm
        <xref ref-type="bibr" rid="ref11 ref37">(Griffiths and Steyvers, 2004)</xref>
        .
      </p>
      <p>
        Apart from discovering the latent topics in a corpus in an unsupervised manner, this dimensionality reduction allows
one to compare the thematic similarity of two documents without requiring that the documents share exact terms. A natural
way to measure the similarity of documents is the calculate the Kullback-Leibler (KL) divergence between the two topic
distributions, P and Q (see Equation 1)
        <xref ref-type="bibr" rid="ref42">(Steyvers and Griffiths, 2007)</xref>
        .
(1)
(2)
DKL(P k Q) =
      </p>
      <p>X ln
i
where M = (P + Q)=2. These methods of measuring similarity ignore that the importance of individual topics will vary
from user to user.</p>
      <p>
        Several extensions to LDA have been developed to characterise the mixture of topics associated with a location or place
        <xref ref-type="bibr" rid="ref14 ref15 ref39 ref47 ref49 ref9">(Wang et al., 2007; Eisenstein et al., 2010; Hao et al., 2010; Sizov, 2010; Yin et al., 2011; Hong et al., 2012)</xref>
        . In these
models the generative process for creating a document is extended and some additional form of geographic evidence (either
location or place name) is used to condition topics based on that evidence. Thus, a distribution of topics can be associated
not only with a document but also a place or location. The technique employed here for evaluation is to represent the topic
mixture for a place as a weighted average of the topic probability distributions of all the descriptions for the same place
        <xref ref-type="bibr" rid="ref1">(Adams and Janowicz, 2012)</xref>
        . Since the methods described in this paper for weighting topics are post-hoc operations (i.e.
assigned after the topic model inferencing), they are broadly applicable to these other flavours of topic models.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3 Identifying salient topics</title>
      <p>In this section, we present a method for personalising the weights on the LDA topics, given a source place s and a set of N
target places ft1; t2; : : : ; tN g, and a user specified ordering for those target places based on their similarity to the source
place. For each place there is a discrete probability distribution for topics, and for any given individual topic the strengths
of that topic for both source and target places can be compared. For example, let the following be a sample user ranking of
three cities in terms of similarity to New York City:</p>
      <sec id="sec-3-1">
        <title>1. Chicago</title>
        <p>2. Los Angeles
3. Houston</p>
        <p>
          The set of salient topics ZS is defined as a subset of all topics, such that z 2 ZS if and only if the Kendall’s rank
correlation coefficient between the user-provided ordering and the topic strengths for topic z is positive
          <xref ref-type="bibr" rid="ref23">(Kendall, 1938)</xref>
          .
The Kendall’s measures rank correlation as a function of the number of concordant and discordant pairs in a set of joint
observations. A pair of observations (i; j) of two variables (x; y) is concordant if xi &lt; yi ^ xj &lt; yj or xi &gt; yi ^ xj &gt; yj .
They are discordant if xi &lt; yi ^ xj &gt; yj or xi &gt; yi ^ xj &lt; yj . Letting C be the number of concordant pairs and D be the
number of discordant pairs in a sample and n be the number of observations, Kendall’s is defined in Equation 3.
        </p>
        <p>Thus, the relative weight ( i) for each topic is zero for negative correlations. For positive correlations the weight is the
correlation normalised such that all positive correlations sum to 1. See Table 2 for an example.</p>
        <p>Kendall’s -B is an extension to Kendall’s to deal with situations where there are tied rankings. This will occur with
topics when one or more documents have a value of 0:0 for the topic strength. When a model is trained using a large
user ordering</p>
        <p>Chicago
Los Angeles</p>
        <p>Houston</p>
        <p>Kendall’s
Relative weight</p>
        <p>topic 1</p>
        <p>Chicago (0.0)
Los Angeles (0.22)</p>
        <p>Houston (0.6)
1.0
0.75</p>
        <p>topic 2
Los Angeles (0.22)</p>
        <p>Chicago (0.4)
Houston (0.5)
0.33
0.25</p>
        <p>topic 3
Los Angeles (0.0)</p>
        <p>Houston (0.1)
Chicago (0.4)
-0.67
0.0
number of topics this will occur often. Kendall’s -B (Equation 4) handles tied values by changing the denominator of the
equation.</p>
        <p>B =
p(n0
nc</p>
        <p>nd
n1)(n0
n2)
;
where n0 = n(n 1)=2; n1 = Pi ti(ti 1)=2; n2 = Pj uj (uj 1)=2; nc = number of concordant pairs; nd = number
of discordant pairs; ti = number of tied values in the ith group of ties for the first quantity; and uj = number of tied values
in the jth group of ties for the second quantity.
3.1</p>
        <p>Using Kendall’s</p>
        <p>as product
For each salient topic, zi, an associated salience weight, wi, is assigned to the topic equal to the Kendall’s correlation.
Given a set of salience weights w1; w2; : : : wn we can define relative weights for each topic 1; 2; : : : n, where i =
Pw1niw . Let be this vector of relative weights, a weighted Kullback-Liebler divergence A is now defined in Equation 5.
(4)
(5)
DKL( ; P k Q) = X iP (i) ln
i</p>
        <p>The divergence is weighted not only on the probabilities of the topic variables but also with the salience weight based
on the ordering, and it is thus a weighted average of the logarithm difference between the probabilities. From this new
divergence function, the weighted Jensen Shannon divergence A is defined in Equation 6.</p>
        <p>1 1</p>
        <p>J SD( ; P k Q) = 2 DKL( ; P k M ) + 2 DKL( ; Q k M ); (6)
where M is defined as in the normal JS divergence.</p>
        <p>This weighted measure can be interpreted as a form of weighted sum model in a multi-criteria decision analysis.The
salience weights drawn from a sample ranking can be viewed as a personalised general measure of topic saliency for
comparing all places. Table 3 shows a comparison of Jensen Shannon divergence and weighted Jensen Shannon divergence
A calculations given the user ranking specified above. Note, in this example the weighted ordering matches the user
specified ordering; however, it is not guaranteed to match in all cases, since the salience weights are based on a comparison
of rankings, not actual similarity values of the probabilities.
3.2</p>
        <p>Using Kendall’s to alter topic weights
An alternative method re-weights the topic probabilities associated with each place and uses the traditional JS divergence
measure to obtain a context-dependent similarity. As with the previous method only positive correlations are used.
Letting z be the topic vector for a place, Algorithm 1 describes the steps for re-weighting topics. Each topic probability, zi,
is multiplied by the weight for that topic, i, and then the new topic vector is normalized to sum to 1 (see Algorithm 2).
Algorithm 1 Pseudocode for re-weighting topics from user provided ranking.</p>
        <p>sum 0.0
for all zi do
if wi &gt; 0 then
zi0 wi zi
sum sum +zi0
else</p>
        <p>zi0 0:0
end if
end for
for all zi0 do</p>
        <p>zi0 zi0= sum
end for</p>
        <p>This new topic distribution for a place has a very intuitive interpretation. The re-weighted probability of topic i, zi0, is a
function of the prototypical probability of the topic, zi, for the place, conditioned on the probability that this specific user
is concordant with the average user for this topic. This re-weighted distribution has the advantage that it maintains the
desired mathematical properties when comparing similarities using KL and JS divergences (i.e., a minimum divergence of
0 and the square root of the JS divergence is a metric). In Table 3 the Jensen Shannon result using this method is called the
weighted Jensen Shannon B measure.</p>
        <p>city</p>
      </sec>
      <sec id="sec-3-2">
        <title>Chicago</title>
        <p>Los Angeles</p>
        <p>Houston</p>
        <p>
          JS
In this section, we present the results of a human participants test to evaluate the efficacy of using sample rankings to prime
salience weights on individual topic dimensions. LDA topic models were trained on two corpora of georeferenced place
descriptions: 1) a set of 200,000 georeferenced Wikipedia articles and 2) a set of 275,000 travel blog entries.1 Prior to
performing topic modelling, standard stemming and cleaning of the documents was performed to remove html tags and
other noise in the text. The travel blog entries were matched to places by the authors according to a fixed geographic
hierarchy, e.g., Orlando, Florida, United States. Each georeferenced Wikipedia article was matched to a named place by
intersecting the location associated with the article (based on coordinates template in the article) with the spatial footprint
of the named places. Thus, each article is linked to one place. It is possible that the association between text and a place
can be refined further using more sophisticated techniques; however, this still remains a open research problem and not the
focus of the current work
          <xref ref-type="bibr" rid="ref46">(Vasardani et al., 2013)</xref>
          .
        </p>
        <p>The topic signature for a place was calculated by taking all the documents with a relation to the place and averaging
over their topic vectors. Thus, topics that have a relatively high probability in a large number of documents associated with
a place will be associated with the place (e.g., in travel blog entries a theme park topic will have high value for Orlando,
FL, but a skiing topic will not).
4.1</p>
        <p>
          Design
The user study was designed using the Amazon Mechanical Turk system to gather several place similarity rankings.2 A
qualification test was presented to the Mechanical Turk users to help ensure high-quality (non-spam) results
          <xref ref-type="bibr" rid="ref25">(Kittur et al.,
2008)</xref>
          . This test was a simple filter that gathers user information and asks a set of simple questions about geographic
knowledge. The primary goal of the survey was to determine the degree to which a user-provided ranking on a set of cities
can be used to inform the system on the “interests” of the user more generally, and thus tailor search results.
        </p>
        <p>The study focused on comparing 30 U.S. cities rather than other types of places in order to maximise participant
familiarity. Although initial variants of the test were done using cities from around the world, it was difficult to find
users who had broad familiarity with multiple cities from around the world. Therefore, the tests were limited to residents
of the U.S., and a good sample of participants with familiarity of target cities could be gathered. Table 4 shows all the cities
that were compared.</p>
        <p>Mechanical Turk participants were asked to rank order 7 cities in terms of similarity to another city. Users were also
asked to enter how the places are similar as well as describe their familiarity with each of the 8 cities. Table 5 shows two
sample responses for cities that are ranked in similarity to Los Angeles.</p>
        <p>In total 96 participants provided 5 rankings each, leading to 480 rankings total. Out of the 480 rankings, in 135 cases
(28.1%) the participant was ‘not’ familiar with the main city being compared, in 205 cases (42.7%) the participant was
1downloaded from http://www.travelblog.org
2http://www.mturk.com
Algorithm 2 Pseudocode for re-weighting topics from user provided ranking when not all topics are informed by the
ranking.</p>
        <p>sum 0.0
sumOthers 0.0
for all zi do
if zi 2 ZNI then
zi0 zi
sumOthers sumOthers +zi
else if wi &gt; 0 then
zi0 wi zi
sum sum +zi0
else</p>
        <p>zi0 0:0
end if
end for
for all zi0 do
if zi0 2= ZNI then</p>
        <p>zi0 zi0 (1 sumOthers )= sum
end if
end for</p>
      </sec>
      <sec id="sec-3-3">
        <title>Atlanta</title>
        <p>Austin
Baltimore
Boston
Charlotte
Chicago
Cleveland
Dallas
Denver
Detroit</p>
      </sec>
      <sec id="sec-3-4">
        <title>Houston</title>
        <p>Kansas City
Las Vegas
Los Angeles
Miami</p>
      </sec>
      <sec id="sec-3-5">
        <title>Minn.-St. Paul</title>
        <p>Nashville
New Orleans
New York City
Philadelphia</p>
      </sec>
      <sec id="sec-3-6">
        <title>Phoenix</title>
        <p>Pittsburgh
Portland, OR
Salt Lake City
San Diego</p>
      </sec>
      <sec id="sec-3-7">
        <title>San Francisco San Jose Seattle St. Louis</title>
        <p>Washington D.C.
‘somewhat’ familiar, and in 140 cases (29.2%) the participant was ‘very’ familiar. In 50 cases (10.4%) the participant had
lived in (or near) the city and in 290 cases (60.4%) the participant had visited the city.
4.2</p>
        <p>
          Spearman’s footrule distance
Spearman’s footrule distance is a measure of correspondence (or disarray) between two rankings, similar to Spearman’s
and Kendall’s . Let Sn be the set of all permutations of the set of n integers f1; : : : ; ng. From
          <xref ref-type="bibr" rid="ref8">(Diaconis and Graham,
1977)</xref>
          , the Spearman’s footrule distance, Dn is defined as in Equation 7.
        </p>
        <p>n
Dn( n; n) = X j n(i)
i=1
n(i)j ;
(7)
where n and n are elements of Sn, i.e., different permutations. Spearman’s footrule is used here rather than Spearman’s
rho, because we want to be able to consider the average footrule distance between several automated rankings and user
provided rankings, which is not valid for correlations.
4.3</p>
        <p>Average Spearman’s footrule results
In Figure 1 a chart is shown of the average Spearman’s footrule distance between the human participant rankings and
automated similarity calculations. Looking over all responses, the clearest outcome was that for source places with which
the participant is not familiar; generally, a participant’s assessment is primarily made based on distance rather than other
properties. When the participant is somewhat or very familiar with the source place there is an increase in the average
Spearman’s footrule distance between distance-based rankings.</p>
        <p>Looking at Figure 1 it is abundantly clear that similarity ranking based on all travel blog topics or all Wikipedia articles
does not correspond in any definitive way with user-provided rankings. In part this is because there is no “average person”
– there is very little correspondence between different participants about the appropriate ranking of similar cities. In
Miami
Minneapolis-St. Paul
Dallas
Austin
Salt Lake City
Cleveland
Portland, OR</p>
      </sec>
      <sec id="sec-3-8">
        <title>Miami</title>
        <p>Dallas
Austin
Portland, OR
Minneapolis-St. Paul
Salt Lake City
Cleveland
addition, individuals have a limited set of properties that they use to compare places as opposed to the large number of
different topics found in large text corpora. This is not a problem with the methodology, however, because the task that the
system is to perform is not necessarily to match individual human reasoning but rather to allow exposure of patterns and
similarities in place description topics that are otherwise opaque to an individual user (or human participant).
4.4</p>
        <p>Automatic topic weighting evaluation
Despite this lack of correspondence, we can use the participant studies to evaluate the degree to which it is valid to assume
a user has a set of ‘interests’ (i.e., a salient subset of topics) that are common across different place comparisons and
whether we can identify those interests in terms of topics automatically from one or more sample rankings. For example,
looking back at Table 5, these two participants consider very different properties when comparing the cities for similarity.
Note, however that these participant-provided properties do not necessarily reflect all the properties used by the participant
to discriminate between places – e.g., it does not explain the particular ordering of Austin, Portland, and Minneapolis-St.
Paul by participant 2, because the same property (“music and performing arts”) is given for all of them.</p>
        <p>In order to do this evaluation we test whether on average automatic topic re-weighting using the techniques in the
previous section result in smaller Spearman’s footrule distances for the other rankings by the same participant. In other
words, we evaluate the degree to which a single ranking can be extrapolated more generally to other rankings of cities.
One sample ranking from the participant is used to calculate weights on the topics and new city rankings are calculated
using the weighted JS divergence for each of the other city comparisons done by the same participant. For each ranking the
Spearman’s footrule distance is calculated between the new weighted ranking and the participant-provided ranking. This
process is then repeated for each sample ranking by exchanging which one sets the weights. From this cross-validation an
average footrule distance can be calculated for the weighted ranking for a participant, and this can be further averaged over
all participants. Figure 1 shows this average footrule distance over all participants when using the weighted JS divergence
B technique. The weighted JS divergence A has a similar result, though the overall average footrule distance tends to be
slightly higher.</p>
        <p>The average Spearman’s footrule distance shows a marked decrease over the unweighted results for Wikipedia and travel
blogs that were shown in Figure 1, which indicates that this method does help to identify general topics of interest for a
person. One very interesting result is that the matching to the participant rankings is distinctly better when the participant
is not very familiar with the source city being ranked against. One explanation for this result is that people who are very
familiar with a place will have very specific impressions of the place and thus properties that are salient for that place do
not transfer over from other places. Whereas when people are not familiar with a place there is a stock set of “background
interests” that they use to help compare places. This indicates that automatic topic weighting will be particularly useful
when done as part of a system designed to enable users to learn and explore knowledge about geographic places with which
they are not already very familiar.
Weighted  Wiki  topics  
Weighted  T.B.  topics  </p>
        <p>Distance  </p>
        <p>
          Popula;on  
Wikipedia  topics  
Travel  blog  topics  
Very  
Somewhat  
Not  
0  
From these results, the automatic weighting of topics based on a sample ranking can legitimately be generalised to other
rankings. An interesting alternative would be to extract interests of a user implicitly from social network or other data and
apply those to topic weights, but that remains beyond the scope of the current research
          <xref ref-type="bibr" rid="ref22 ref36">(Kelly and Teevan, 2003; Ricci
et al., 2011)</xref>
          . On a case-by-case basis we also examined whether the reasons participants give for similarity correspond to
high shared values on specific LDA topics, but apart from anecdotal evidence, it was difficult to do a quantitative evaluation
on these data, because the mapping of participant-provided reasons and topics is highly subjective.
        </p>
        <p>Table 6 shows the topics in Wikipedia (from 1200 topics) and travel blogs (from 1500 topics) for Los Angeles that
increase the most (by ratio) based on re-weightings from the two sample participant rankings shown in Table 5. What is
remarkable about the topics shown in Table 6 is that despite the apparent difference between the automatically-determined
most-salient topics and the participant-provided reasons, the automatic weighting does significantly increase the
concordance between the system rankings and the participant rankings. One explanation is that the topics that are weighted with
high salience represent background properties that factor in the participants’ conceptualisations of city categories but which
are not at the forefront of their conscious comparisons of individual cities.
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>People judge the similarity or difference of places based on different contextual factors, such as their personal interests.
Geographic information systems that take advantage of these factors and can provide places that are similar to places
known to a user have many potential applications, including travel recommendation services, marketing analysis tools,
and socio-ecological research tools. In this paper we presented a new method to automatically identify the topics that are
salient to a user when performing similarity judgment. Topics can be any set of features associated with a place, such that
a place is represented as a probability vector of topic values. These topic values form a topic signature that is generally
associated with a place, and we demonstrated how probabilistic topic modelling can be used to generate such probability
vectors for places.
Topic 827
Topic 1242
Topic 904</p>
      <sec id="sec-4-1">
        <title>Topic 761 Topic 834 Topic 890</title>
      </sec>
      <sec id="sec-4-2">
        <title>Topic 909 Topic 793 Topic 1228</title>
      </sec>
      <sec id="sec-4-3">
        <title>Topic 417 Topic 164 Topic 847</title>
      </sec>
      <sec id="sec-4-4">
        <title>Similar to Los Angeles ranking 1 – Travel blog</title>
        <p>game,basebal,team,play,watch,sport,hockey,fan,stadium,player
restaur,order,food,tabl,meal,menu,eat,waiter,serv,dinner
money,pay,cost,expens,price,onli,cheap,pound,buy,free</p>
      </sec>
      <sec id="sec-4-5">
        <title>Similar to Los Angeles ranking 1 – Wikipedia</title>
        <p>book,work,publish,life,wrote,histori,year,mani,author,writer
cathol,bishop,dioces,roman cathol,roman,john,father,priest
golf,cour,club,golf club,hole,countri club,countri,locat,golf cours</p>
      </sec>
      <sec id="sec-4-6">
        <title>Similar to Los Angeles ranking 2 – Travel blog</title>
        <p>mile,road,stop,highway,gas,motel,state,sign,rout,trip
extrem,complet,entir,exact,time,veri,actual,howev,made,ani
histor,site,build,visit,histori,tour,histor site,area,museum,mani</p>
      </sec>
      <sec id="sec-4-7">
        <title>Similar to Los Angeles ranking 2 – Wikipedia side,east,east side,west,north,south,locat,north side,west side polic,offic,polic offic,polic station,polic depart,depart,enforc,forc turkish,ottoman,turkey,greek,byzantin,ottoman empir,sultan</title>
        <p>Similarity calculations based on probability distributions are commonly context-neutral and only measure the relative
entropy or JS divergence of the distributions. We presented a novel approach to use a small, user-provided sample ranking
of similar places to automatically re-weight topic weights. This new re-weighting can be used to serve up personalised
similarity values between places based on the topics that are of most interest to a user. Since this re-weighting of topic
values is done after the initial training used to discover topics associated with places, it is fully compatible with a variety
of different methods to create semantic signatures associated with a place, not solely topic modelling as used here.</p>
        <p>We evaluated our method with a Mechanical Turk user study and showed that using a small sample ranking of similar
places (i.e., a set of control similarities) results in a larger correspondence between automated place similarity rankings
and personal users’ rankings. Personalised place recommendation and similarity search remains a relatively unexplored
research area. A follow-up study to better understand user motivation in performing place similarity will be a valuable next
step. Future work will also involve exploring social media data and other sources such as search history to automatically
provide sample similar places for a given user.</p>
        <p>Proceedings of the National Academy of
Sci</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>K. Janowicz</surname>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>On the geo-indicativeness of non-georeferenced text</article-title>
          . In J. G. Breslin,
          <string-name>
            <given-names>N. B.</given-names>
            <surname>Ellison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Shanahan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tufekci</surname>
          </string-name>
          (Eds.), ICWSM, pp.
          <fpage>375</fpage>
          -
          <lpage>378</lpage>
          . The AAAI Press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and G.
          <string-name>
            <surname>McKenzie</surname>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Inferring thematic places from spatially referenced natural language descriptions</article-title>
          . In D. Sui,
          <string-name>
            <given-names>S.</given-names>
            <surname>Elwood</surname>
          </string-name>
          , and M. Goodchild (Eds.),
          <source>Crowdsourcing Geographic Knowledge</source>
          , pp.
          <fpage>201</fpage>
          -
          <lpage>221</lpage>
          . Springer Netherlands.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Banerjee</surname>
            ,
            <given-names>S. S.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Dholakia</surname>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Mobile advertising: does location-based advertising work</article-title>
          ?
          <source>International Journal of Mobile Marketing</source>
          <volume>3</volume>
          (
          <issue>2</issue>
          ),
          <fpage>68</fpage>
          -
          <lpage>75</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          , and
          <string-name>
            <surname>M. I. Jordan</surname>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Latent Dirichlet allocation</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          <volume>3</volume>
          ,
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Bodenhamer</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>The potential of spatial humanities</article-title>
          . In D. J.
          <string-name>
            <surname>Bodenhamer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Corrigan</surname>
          </string-name>
          , and T. M. Harris (Eds.),
          <source>The Spatial Humanities: GIS and the Future of Humanities Scholarship</source>
          , pp.
          <fpage>14</fpage>
          -
          <lpage>30</lpage>
          . Indiana University Press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Bouidghaghen</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tamine</surname>
          </string-name>
          , and M.
          <string-name>
            <surname>Boughanem</surname>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Personalizing mobile web search for location sensitive queries</article-title>
          . In A. B.
          <string-name>
            <surname>Zaslavsky</surname>
            ,
            <given-names>P. K.</given-names>
          </string-name>
          <string-name>
            <surname>Chrysanthis</surname>
            ,
            <given-names>D. L.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Chakraborty</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Kalogeraki</surname>
            ,
            <given-names>M. F.</given-names>
          </string-name>
          <string-name>
            <surname>Mokbel</surname>
          </string-name>
          , and C.-Y. Chow (Eds.),
          <source>Mobile Data Management (1)</source>
          , pp.
          <fpage>110</fpage>
          -
          <lpage>118</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Daoud</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tamine</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Boughanem</surname>
          </string-name>
          , and B.
          <string-name>
            <surname>Chebaro</surname>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Learning implicit user interests using ontology and search history for personalization</article-title>
          .
          <source>In Proceedings of the 2007 international conference on Web information systems engineering</source>
          , WISE'
          <fpage>07</fpage>
          , Berlin, Heidelberg, pp.
          <fpage>325</fpage>
          -
          <lpage>336</lpage>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Diaconis</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and R. L.
          <string-name>
            <surname>Graham</surname>
          </string-name>
          (
          <year>1977</year>
          ).
          <article-title>Spearman's footrule as a measure of disarray</article-title>
          .
          <source>Journal of the Royal Statistical Society. Series B (Methodological)</source>
          <volume>39</volume>
          (
          <issue>2</issue>
          ),
          <fpage>262</fpage>
          -
          <lpage>268</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Eisenstein</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>B. O</given-names>
            <surname>'Connor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and E. P.</given-names>
            <surname>Xing</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>A latent variable model for geographic lexical variation</article-title>
          .
          <source>In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing</source>
          , EMNLP '10,
          <string-name>
            <surname>Stroudsburg</surname>
          </string-name>
          , PA, USA, pp.
          <fpage>1277</fpage>
          -
          <lpage>1287</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Ga</surname>
            ¨rdenfors,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Conceptual Spaces: The Geometry of Thought. A Bradford Book</article-title>
          . MIT Press.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Griffiths</surname>
            ,
            <given-names>T. L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>M. Steyvers</surname>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Finding scientific topics</article-title>
          .
          <source>ences 101(Suppl. 1)</source>
          ,
          <fpage>5228</fpage>
          -
          <lpage>5235</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Grossner</surname>
            ,
            <given-names>K. E.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Goodchild</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          K. C.
          <string-name>
            <surname>Clarke</surname>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Defining a digital earth system</article-title>
          .
          <source>Transactions in GIS 12(1)</source>
          ,
          <fpage>145</fpage>
          -
          <lpage>160</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Guy</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jacovi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Perer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. Ronen</given-names>
            , and E.
            <surname>Uziel</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Same places, same things, same people?: mining user similarity on social media</article-title>
          .
          <source>In Proceedings of the 2010 ACM conference on Computer supported cooperative work</source>
          ,
          <source>CSCW '10</source>
          , New York, NY, USA, pp.
          <fpage>41</fpage>
          -
          <lpage>50</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-M. Yang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Pang</surname>
          </string-name>
          , and L. Z.
          <volume>0001</volume>
          (
          <year>2010</year>
          ).
          <article-title>Equip tourists with knowledge mined from travelogues</article-title>
          . In M. Rappa,
          <string-name>
            <given-names>P.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Freire</surname>
          </string-name>
          , and S. Chakrabarti (Eds.), WWW, pp.
          <fpage>401</fpage>
          -
          <lpage>410</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Hong</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ahmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gurumurthy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Smola</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Tsioutsiouliklis</surname>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Discovering geographical topics in the twitter stream</article-title>
          . In A.
          <string-name>
            <surname>Mille</surname>
            ,
            <given-names>F. L.</given-names>
          </string-name>
          <string-name>
            <surname>Gandon</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Misselis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Rabinovich</surname>
          </string-name>
          , and S. Staab (Eds.), WWW, pp.
          <fpage>769</fpage>
          -
          <lpage>778</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Adams</surname>
          </string-name>
          , and M.
          <string-name>
            <surname>Raubal</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Semantic referencing - determining context weights for similarity measurement</article-title>
          . In S. I. Fabrikant,
          <string-name>
            <given-names>T.</given-names>
            <surname>Reichenbacher</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. J. van Kreveld</surname>
          </string-name>
          , and C. Schlieder (Eds.),
          <source>GIScience</source>
          , Volume
          <volume>6292</volume>
          of Lecture Notes in Computer Science, pp.
          <fpage>70</fpage>
          -
          <lpage>84</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Raubal</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Kuhn</surname>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>The semantics of similarity in geographic information retrieval</article-title>
          .
          <source>Journal of Spatial Information Science (2)</source>
          ,
          <fpage>29</fpage>
          -
          <lpage>57</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>C. B.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Purves</surname>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Geographical information retrieval</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          <volume>22</volume>
          (
          <issue>3</issue>
          ),
          <fpage>219</fpage>
          -
          <lpage>228</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Raubal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Gartrell</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Egenhofer</surname>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>An affordance-based model of place in GIS</article-title>
          . In T. Poiker and
          <string-name>
            <given-names>N.</given-names>
            <surname>Chrisman</surname>
          </string-name>
          (Eds.),
          <source>Proceedings of the 8th International Symposium on Spatial Data Handling (SDH'98)</source>
          , Vancouver, Canada, pp.
          <fpage>98</fpage>
          -
          <lpage>109</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Jorgensen</surname>
            ,
            <given-names>B. S.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Stedman</surname>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>A comparative analysis of predictors of sense of place dimensions: Attachment to, dependence on, and identification with lakeshore properties</article-title>
          .
          <source>Journal of Environmental Management</source>
          <volume>79</volume>
          ,
          <fpage>316</fpage>
          -
          <lpage>327</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Kaltenborn</surname>
            ,
            <given-names>B. P.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>Effects of sense of place on responses to environmental impacts: A study among residents in svalbard in the Norwegian high Arctic</article-title>
          .
          <source>Applied Geography</source>
          <volume>2</volume>
          (
          <issue>18</issue>
          ),
          <fpage>169</fpage>
          -
          <lpage>189</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J</given-names>
            .
            <surname>Teevan</surname>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Implicit feedback for inferring user preference: a bibliography</article-title>
          .
          <source>SIGIR Forum</source>
          <volume>37</volume>
          ,
          <fpage>18</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Kendall</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          (
          <year>1938</year>
          ).
          <article-title>A new measure of rank correlation</article-title>
          .
          <source>Biometrika</source>
          <volume>30</volume>
          (
          <issue>1</issue>
          /2), pp.
          <fpage>81</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Keßler</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>What is the difference? a cognitive dissimilarity measure for information retrieval result sets</article-title>
          .
          <source>Knowl. Inf. Syst</source>
          .
          <volume>30</volume>
          (
          <issue>2</issue>
          ),
          <fpage>319</fpage>
          -
          <lpage>340</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Kittur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>E. H.</given-names>
            <surname>Chi</surname>
          </string-name>
          , and B.
          <string-name>
            <surname>Suh</surname>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Crowdsourcing user studies with mechanical turk</article-title>
          .
          <source>In CHI '08:</source>
          <article-title>Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems</article-title>
          , New York, NY, USA, pp.
          <fpage>453</fpage>
          -
          <lpage>456</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Kumar</surname>
            , R. and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Vassilvitskii</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Generalized distances between rankings</article-title>
          . In M. Rappa,
          <string-name>
            <given-names>P.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Freire</surname>
          </string-name>
          , and S. Chakrabarti (Eds.), WWW, pp.
          <fpage>571</fpage>
          -
          <lpage>580</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Larson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1996</year>
          ).
          <article-title>Geographic information retrieval and spatial browsing</article-title>
          . In L. C. Smith and
          <string-name>
            <given-names>M.</given-names>
            <surname>Gluck</surname>
          </string-name>
          (Eds.),
          <source>Geographic Information Systems and Libraries: Patrons, Maps, and Spatial Information</source>
          , pp.
          <fpage>81</fpage>
          -
          <lpage>124</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>McKenzie</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Adams</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>A thematic approach to user similarity built on geosocial check-ins</article-title>
          . In D. Vandenbroucke,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bucher</surname>
          </string-name>
          , and J.
          <string-name>
            <surname>Crompvoets</surname>
          </string-name>
          (Eds.),
          <source>Geographic Information Science at the Heart of Europe, Lecture Notes in Geoinformation and Cartography</source>
          , pp.
          <fpage>39</fpage>
          -
          <lpage>53</lpage>
          . Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Medin</surname>
            ,
            <given-names>D. L.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Goldstone</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Gentner</surname>
          </string-name>
          (
          <year>1993</year>
          ).
          <article-title>Respects for similarity</article-title>
          .
          <source>Psychological Review</source>
          <volume>100</volume>
          (
          <issue>2</issue>
          ),
          <fpage>254</fpage>
          -
          <lpage>278</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Micarelli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gasparetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sciarrone</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Gauch</surname>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>The adaptive web</article-title>
          .
          <source>Chapter Personalized search on the world wide web</source>
          , pp.
          <fpage>195</fpage>
          -
          <lpage>230</lpage>
          . Berlin, Heidelberg: Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Moore</surname>
            ,
            <given-names>R. L.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Graefe</surname>
          </string-name>
          (
          <year>1994</year>
          ).
          <article-title>Attachments to recreation settings: The case of rail-trail users</article-title>
          .
          <source>Leisure Sciences</source>
          <volume>16</volume>
          ,
          <fpage>17</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Pitkow</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , H. Schu¨tze, T. Cass,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cooley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Turnbull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Edmonds</surname>
          </string-name>
          , E. Adar, and
          <string-name>
            <given-names>T.</given-names>
            <surname>Breuel</surname>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Personalized search</article-title>
          .
          <source>Commun. ACM</source>
          <volume>45</volume>
          (
          <issue>9</issue>
          ),
          <fpage>50</fpage>
          -
          <lpage>55</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <surname>Ranganathan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>R. H.</given-names>
            <surname>Campbell</surname>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Advertising in a pervasive computing environment</article-title>
          . In M. S. Viveros,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lei</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Wolfson</surname>
          </string-name>
          (Eds.), Workshop Mobile Commerce, pp.
          <fpage>10</fpage>
          -
          <lpage>14</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Raubal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Formalizing conceptual spaces</article-title>
          . In A. C. Varzi and L.
          <string-name>
            <surname>Vieu</surname>
          </string-name>
          (Eds.),
          <source>Formal Ontology in Information Systems, Proceedings of the Third International Conference (FOIS</source>
          <year>2004</year>
          ), pp.
          <fpage>153</fpage>
          -
          <lpage>164</lpage>
          . IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <surname>Relph</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>1976</year>
          ).
          <article-title>Place and Placelessness</article-title>
          . Pion.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Ricci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rokach</surname>
          </string-name>
          , and B.
          <string-name>
            <surname>Shapira</surname>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Introduction to recommender systems handbook</article-title>
          . In F. Ricci,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Shapira</surname>
          </string-name>
          , and P. B.
          <string-name>
            <surname>Kantor</surname>
          </string-name>
          (Eds.),
          <source>Recommender Systems Handbook</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>35</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <article-title>Rodr´ıguez, M. A</article-title>
          . and
          <string-name>
            <surname>M. J. Egenhofer</surname>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Comparing geospatial entity classes: an asymmetric and context-dependent similarity measure</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          <volume>18</volume>
          (
          <issue>3</issue>
          ),
          <fpage>229</fpage>
          -
          <lpage>256</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <string-name>
            <surname>Rosch</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          (
          <year>1978</year>
          ).
          <article-title>Principles of categorization</article-title>
          . In E. Rosch and
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Lloyd</surname>
          </string-name>
          (Eds.),
          <source>Cognition and Categorization</source>
          , pp.
          <fpage>27</fpage>
          -
          <lpage>48</lpage>
          . Hillsdale (NJ), USA: Lawrence Erlbaum Associates.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          <string-name>
            <surname>Sizov</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Geofolk: latent spatial semantics in web 2.0 social media</article-title>
          . In B. D.
          <string-name>
            <surname>Davison</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Suel</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Craswell</surname>
          </string-name>
          , and B.
          <string-name>
            <surname>Liu</surname>
          </string-name>
          (Eds.), WSDM, pp.
          <fpage>281</fpage>
          -
          <lpage>290</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          <string-name>
            <surname>Speretta</surname>
            , M. and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Gauch</surname>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Personalized search based on user search histories</article-title>
          .
          <source>In Web Intelligence</source>
          ,
          <year>2005</year>
          . Proceedings. The 2005 IEEE/WIC/ACM International Conference on, pp.
          <fpage>622</fpage>
          -
          <lpage>628</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <string-name>
            <surname>Stedman</surname>
            ,
            <given-names>R. C.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Toward a social psychology of place : Predicting behavior from place-based cognitions, attitude, and identity</article-title>
          .
          <source>Environment and Behavior</source>
          <volume>34</volume>
          (
          <issue>5</issue>
          ),
          <fpage>561</fpage>
          -
          <lpage>581</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <string-name>
            <surname>Steyvers</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Griffiths</surname>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Probabilistic topic models</article-title>
          . In T. Landauer,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mcnamara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dennis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Kintsch</surname>
          </string-name>
          (Eds.),
          <article-title>Handbook of Latent Semantic Analysis</article-title>
          . Lawrence Erlbaum Associates.
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          <string-name>
            <surname>Teevan</surname>
            , J.,
            <given-names>S. T.</given-names>
          </string-name>
          <string-name>
            <surname>Dumais</surname>
          </string-name>
          , and E.
          <string-name>
            <surname>Horvitz</surname>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Personalizing search via automated analysis of interests and activities</article-title>
          .
          <source>In SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          , New York, NY, USA, pp.
          <fpage>449</fpage>
          -
          <lpage>456</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          <string-name>
            <surname>Tuan</surname>
            ,
            <given-names>Y.-F.</given-names>
          </string-name>
          (
          <year>1977</year>
          ).
          <article-title>Space and Place: the Perspective of Experience</article-title>
          . The Regents of the University of Minnesota.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          <string-name>
            <surname>Tversky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>1977</year>
          ).
          <article-title>Features of similarity</article-title>
          .
          <source>Psychological Review</source>
          <volume>84</volume>
          (
          <issue>4</issue>
          ),
          <fpage>327</fpage>
          -
          <lpage>352</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          <string-name>
            <surname>Vasardani</surname>
            , M.,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Winter</surname>
            , and
            <given-names>K.-F.</given-names>
          </string-name>
          <string-name>
            <surname>Richter</surname>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Locating place names from place descriptions</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          <volume>27</volume>
          (
          <issue>12</issue>
          ),
          <fpage>2509</fpage>
          -
          <lpage>2532</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          , and W.-Y. Ma (
          <year>2007</year>
          ).
          <article-title>Mining geographic knowledge using location aware topic model</article-title>
          . In R. Purves and
          <string-name>
            <given-names>C.</given-names>
            <surname>Jones</surname>
          </string-name>
          (Eds.), GIR, pp.
          <fpage>65</fpage>
          -
          <lpage>70</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          <string-name>
            <surname>Winter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Kuhn</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kru</surname>
          </string-name>
          <article-title>¨ger (</article-title>
          <year>2009</year>
          ).
          <article-title>Guest editorial: Does place have a place in geographic information science?</article-title>
          <source>Spatial Cognition and Computation</source>
          <volume>9</volume>
          ,
          <fpage>171</fpage>
          -
          <lpage>173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          <string-name>
            <surname>Yin</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cao</surname>
          </string-name>
          , J. Han,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhai</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. S.</given-names>
            <surname>Huang</surname>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Geographical topic discovery and comparison</article-title>
          . In S. Srinivasan,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ramamritham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Ravindra</surname>
          </string-name>
          , E. Bertino, and R. Kumar (Eds.), WWW, pp.
          <fpage>247</fpage>
          -
          <lpage>256</lpage>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>