<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Location-aware online learning for top-k hashtag recommendation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Róbert Pálovics</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Péter Szalai</string-name>
          <email>pszalai@ilab.sztaki.hu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Levente Kocsis</string-name>
          <email>kocsis@ilab.sztaki.hu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Júlia Pap</string-name>
          <email>papjuli@ilab.sztaki.hu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Erzsébet Frigó</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>András A. Benczúr</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Eötvös University Budapest</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute for Computer Science and Control, Hungarian Academy of Sciences</institution>
          ,
          <addr-line>MTA SZTAKI</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Technical University Budapest</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Szeged</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <volume>19</volume>
      <issue>2015</issue>
      <abstract>
        <p>In this paper we investigate the problem of recommending Twitter hashtags for users with known GPS location, learning online from the stream of geo-tagged tweets. Our method learns the relevance of regions in a geographical hierarchy, combined with the local popularity of the hashtag. Unlike in typical collaborative ltering settings, trends and geolocation turns out to be more important than personalized user preferences. We evaluate in a time-aware setting, where evaluation is cumbersome by traditional measures, since we have di erent top recommendations at di erent times. We describe a time-aware framework based on individual item discounted gain.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        We investigate the problem of recommending Twitter
hashtags for users, based on the temporal geolocation
information of both the users and the hashtags. Our aim is to
recommend new hashtags, i.e. hashtags that the user has not
used before. The recommendations are obtained by
learning online from the stream of geo-tagged tweets. In our task
the novel element is that neither users nor items (hashtags)
are bound to one single location. Hashtags may in fact
relate to certain locations as well as be popular worldwide.
Earlier results on recommendation in location-based social
networks surveyed in e.g. [
        <xref ref-type="bibr" rid="ref1 ref13">1, 13</xref>
        ] combine spatial ratings for
non-spatial items, nonspatial ratings for spatial items, and
spatial ratings for spatial items [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Our new results address
the problem of the fuzzy relation of users and hashtags with
locations.
      </p>
      <p>
        Since hashtag usage is highly volatile, the problem calls for
an online method. Whenever a user sends a geotagged tweet
with a hashtag he or she has not used earlier, we consider the
event as a trigger for recommendation. We measure the
accuracy of our methods in the online evaluation framework of
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] based on discounted cumulative gain (DCG) computed
individually for each event and averaged over time.
      </p>
      <p>
        We nd that location and timing are the key factors with
little contribution from personalized user interest. The
locality of Twitter hashtag adoption in both spatial and
temporal sense is observed among others by Kamath et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
They state that \hashtags are a global phenomenon [. . . ]
but distance between locations is a strong constraint on the
adoption [. . . ] and follow a spray-and-di use pattern".
      </p>
      <p>
        We use a four-month collection of 400 million geotagged
Twitter messages detailed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. We discard the text of
the tweet messages and keep only the hashtags, the
timestamp and the GPS coordinates. In our experiments we
focus on the user location and the new hashtags that
appear in the message. As we have no information on which
tweets are read by the users but we know the new hashtags
they tweeted, we use the hashtag publishing information to
measure user topic adoption. We consider a hashtag newly
adopted if we have not observed the given user-hashtag pair
before in the dataset. To guarantee at least one month for
each user-hashtag pair without activity, we simply skip the
rst month of the stream of new hashtag usages.
      </p>
      <p>
        Some content may have obvious connection to certain
locations but others can have more widespread interest on
di erent levels such as language, continent, or even
worldwide. Dealing with this, our models rely on the hierarchy of
regions from a global or continent-wide level down to a
village or city district to attribute the momentary popularity
of a hashtag to levels of locations. This hierarchical property
of locations is surveyed also in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. We use the open
hierarchical database of Global Administrative Areas (GADM,
http://gadm.org). We mention that the metadata of tweets
may contain not only GPS coordinates but also a place
attribute that can contain the name and type of the place.
However, we found the place attribute often ambiguous and
less reliable.
      </p>
      <p>We use two models to recommend hashtags at a given time
and location, one based on the estimated probability of the
hashtag appearance based on its recency, and another based
on its temporal popularity. In both cases, our new method
learns the importance of each node in the GADM tree. The
nal prediction arises as the weighted combination of the
hashtag probabilities along the path of the GADM tree from
the leaf location of the user up to the root.</p>
      <p>
        As baseline method, we use online matrix factorization
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Surprisingly, it turns out that matrix factorization
performs much weaker than the distance based methods and
contributes relatively little to the nal prediction. This
observation justi es the importance of the temporal and
geographic context of Twitter messages.
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        Most of previous publications on geographic recommender
systems work with check-in data, where each of the items has
a prede ned static location. For brevity, we do not survey
these here. Hashtag recommendations are addressed in two
recent papers: Chen et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] give methods for e ciently
maintaining a sliding window for time aware
recommendation, and Diaz et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] introduce methods to compute
matrix factorization online. These results are orthogonal to our
exploitation of the location information.
      </p>
      <p>
        Spatial statistics of hashtag adoption are analyzed by
Kamath et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Cheng et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] give methods to geolocalize
tweets based on content. Mocanu et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] use a data set
similar to ours to analyze geographical properties like
homogeneity and seasonal patterns of language usage at scales
ranging from country-level to city neighborhoods. Similar
to our use of the Global Administrative Areas,
regions-ofinterests partitioning is examined in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] by applying k-means
clustering to establish natural regions over Twitter data.
None of these papers exploit the results in recommender
systems.
      </p>
      <p>
        No other results use external data to de ne the hierarchy
of locations for recommendation tasks. Similar to our result,
in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], GADM is used over the same Twitter data set, but
only for visualization purposes.
      </p>
    </sec>
    <sec id="sec-3">
      <title>ONLINE RECOMMENDATION AND</title>
    </sec>
    <sec id="sec-4">
      <title>EVALUATION</title>
      <p>
        We use the online recommendation framework described
in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], in which model training and evaluation happen
simultaneously, iterating over the dataset only once, in
chronological order. Whenever we see a new tweet, we assume that
the user becomes active and reveals its location to the
recommender system. Next, we recommend hashtags of potential
interest for the user. The recommendation is online, hence
it depends on the context at the exact time instance of the
tweet. If a user u tweets with hashtag h at time t in
location `, our models give a score r^(u; h0; `; t) for each hashtag
h0 seen so far, and recommend to u the k hashtags with the
largest values from those that u has not used before.
      </p>
      <p>The data is implicit: the events imply only that the user is
interested in a hashtag. In most of our models, we need
negative instances as well for training. We use all hashtag usages
as positive training instances and generate negative training
instances by selecting negRate random hashtags uniformly
at the time when a user rst used a hashtag. We tested the
negRate parameter between 1 and 300.</p>
      <p>
        We use the quality metric of [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] that we adopt to
hashtag recommendation. If h is the new hashtag in the message
and the rank of h returned by the recommender system is
rank(h), then the discounted cumulative gain, DCG@k of
this event is log2(ran1k(h) + 1) if rank(h) k, and 0
otherwise. The overall evaluation of a model is the average
cumulative DCG@k.
      </p>
    </sec>
    <sec id="sec-5">
      <title>TWITTER AND GEOGRAPHICAL DATA</title>
      <p>
        Dobos et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] collected the dataset using the Twitter
open API by requesting geotagged tweets. We used the data
between February 1 and May 30, 2012 with February for
training and observing distributions only, hence the online
learning period lasts three months.
      </p>
      <p>Most of the hashtags in the database are quite rare, thus
we use only the hashtags that appear more than 5 times.
This way we exclude about 90% of the hashtags, but most of
the hashtag timeline remains. We also exclude the hashtags
that appear in the rst month of the collection to
recommend newly spreading hashtags for the users. The
properties of the nal cleansed dataset are summarized in Table 1.</p>
      <p>We collected all 214,230 nodes from the GADM database,
from which 190,315 are leaves. The depth of the tree is 6,
and includes 5 levels from the GADM tree plus
continentcountry relations. The hashtag time series data covered
30,450 leaves from the tree.
4.
4.1</p>
    </sec>
    <sec id="sec-6">
      <title>MODELING</title>
    </sec>
    <sec id="sec-7">
      <title>Recommendation by location hierarchy</title>
      <p>In our recommendation model we use the location ` at
time t of the user. Here ` notes the leaf of the tree that is
closest to the current GPS location of the user. First, we
get the path in the tree from the root node to location `,
Path(`). Next, for a given hashtag, assume we have a
recommendation method that yields scores s(h; n; t) for nodes
n along Path(`). We will give two such methods in the next
subsections. In order to aggregate the individual
recommendation at each GADM tree node, we propose the formula
r^(u; h; `; t) =</p>
      <p>wn s(h; n; t);</p>
      <p>X
n2Path(`)
where wn values are node speci c weights. The weights wn
are independent of the hashtags and characterize the area
n only. We learn the weights by online gradient descent by
optimizing for RMSE. If we consider all positive instances
and generate negative ones as described in Section 2, we will
have su ciently many implicit data to update the weights
online as we read the sequence of events.</p>
      <p>In our experiments we also investigate models where we
set all wn values constant, i.e. we do not learn the weights,
but simply sum all s(h; n; t) values along Path(`).
4.2</p>
    </sec>
    <sec id="sec-8">
      <title>Temporal popularity</title>
      <p>Given a prede ned time discretization that we test
between a minute and a day, for each location in the tree, we
compute the number of occurrences of the hashtag in the
time interval at the given location. As it follows power-law
distribution, we use the logarithm of the temporal
popularity values as node scores: s(h; n; t) = log(pop(h; n; t)), where
pop(h; n; t) denotes the number of occurrences of hashtag h
in node n in the time interval ending at time t.
4.3</p>
    </sec>
    <sec id="sec-9">
      <title>Hashtag recency</title>
      <p>
        Our next method estimates the chance of the appearance
of a hashtag by considering its most recent usage. The
advantage of this method is that it is more sensitive to changes
in trends. While it may more aggressively over t to single
events, overall it performs similar to and combines very well
with the popularity based method. In Fig. 1, we investigate
the distribution of the time elapsed between the same
hashtag appearing in tweets. This inter-event time distribution
follows power law, in accordance with several earlier
observations [
        <xref ref-type="bibr" rid="ref14 ref2 ref8">2, 14, 8</xref>
        ], P ( = t) = ( 1) t , whence we easily
get
t
t (1 )
:
(1)
P (t &lt;
t +
t j
&gt; t) = 1
1 +
For location sensitive prediction we maintain the last
appearance of each hashtag for every node in the geolocation
tree. We compute the estimate of (1) in each node by using
the global measured value = 1:2.
4.4
      </p>
    </sec>
    <sec id="sec-10">
      <title>Online matrix factorization</title>
      <p>
        We apply stochastic gradient descent factorization for the
user-hashtag matrix as in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Batch stochastic gradient
descent iterates several times over the training set until
convergence. Online recommenders seem to be more restricted
than those that may iterate over the data set several times.
However, online matrix factorization proved to be superior
to batch in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], since it gives much more emphasis on recent
events.
      </p>
    </sec>
    <sec id="sec-11">
      <title>EXPERIMENTS</title>
      <p>In our graphs we show the average cumulative DCG for
the rst three weeks, by when all of our methods reach stable
performance. Here the cumulative average corresponds to
cumulative time average. We set k = 100 to compare our
methods in detail. All methods show a slight performance
degradation, which is due to the fact that the number of
possible hashtags to recommend increase in time.
5.1</p>
    </sec>
    <sec id="sec-12">
      <title>Online matrix factorization</title>
      <p>
        We used the online version of stochastic gradient descent
(SGD) matrix factorization algorithm of [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. We applied
the mean square error with user and item regularization
terms of weight regRate as our objective function. Since
our data is implicit and contains only positive interactions,
we generated negative samples. Every time a user rst posts
about a hashtag, we generate for her negRate hashtags that
she have not used in her past as described in Section 2. In
all cases, we set negRate=99, learning rate lRate=0.4, and
regRate=0.01.
      </p>
      <p>As we show next, our tree based methods and their
baseline variants to exploit geographical information resulted in
better performance in our experiments.</p>
      <p>0.4</p>
    </sec>
    <sec id="sec-13">
      <title>Popularity and recency based methods</title>
      <p>The temporal popularity based method using the GADM
tree of Section 4.2 achieved best results by setting the time
frame around 2 hours. In the recency based model of
Section 4.3, the parameter t had relatively little e ect, we set
t = 12h. We compared the popularity and recency based
methods separately in Figs. 2 and 3, resp., by using di
erent levels of the GADM tree and turning recency and node
weight learning on and o .</p>
      <p>In the gures, \world" denotes the methods that use global
values only and do not take user geolocation into account,
while \leaves" and \countries" use the corresponding level of
the tree. Note that country level popularity and recency
performed very well while leaves worked only with recency
not popularity. Note that using countries but no temporal
information at all performs the poorest.</p>
      <p>Best performance is obtained when using the whole tree
for recommendation by adding all recency values along the
path corresponding to the current user location in the tree,
marginally improving the country based results. However,
by applying the gradient method to learn node speci c weights
as in Section 4.1, we could achieve signi cantly better results
for recency but not for popularity. By using the recency
based tree learning algorithm, we were able to focus on the
active and representative part of the tree. We achieved our
best results with lRate=0.0001 and negRate=4.
5.3</p>
    </sec>
    <sec id="sec-14">
      <title>Online combination</title>
      <p>
        In our nal experiments we compared and combined our
strongest methods. In Figure 4 we plotted the average
cumulative DCG@100 as the function of time for our best
models. Surprisingly, the tree based methods strongly
outperform online matrix factorization, while the best popularity
based model overtook the best recency based method a little
bit in the long run. Next, we considered the strongest one,
the tree based popularity without node weight learning, to
improve it by combining it with the best factor model and
recency recommender. We used the SGD based double layer
combination method introduced in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] with mean squared
error as objective function. In Figure 5 we show the results
of the combination. The popularity model can be improved
by using the best recency method that uses the tree with
      </p>
      <p>0
0.4
000.38
1
0.24
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
factor
pop
recency</p>
      <p>popularity
popularity + recency
popularity + recency + factor
2
learned node weights. We could further improve our results
by including the factor model in our recommendation. In
Table 2 we collected the overall performance of our best
methods and their combinations.</p>
    </sec>
    <sec id="sec-15">
      <title>CONCLUSION AND FUTURE WORK</title>
      <p>We gave online, location based learning methods for
Twitter hashtag recommendation. Since hashtags are not directly
bound to a location, may be geographically spread, and vary
in popularity at di erent times, we designed methods that
0.206
0.355
0.359
0.374
(4.1% )
0.381
(6.1% )
exploit the time and location context. Surprisingly, user
personalization has little contribution to recommendation
quality, hence our best methods apply in the user cold start
setting as well.
7.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wilkie</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Mokbel</surname>
          </string-name>
          .
          <article-title>A survey on recommendations in location-based social networks</article-title>
          .
          <source>ACM Trans. Intelligent Systems and Technology</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.-L.</given-names>
            <surname>Barabasi</surname>
          </string-name>
          .
          <article-title>The origin of bursts and heavy tails in human dynamics</article-title>
          .
          <source>Nature</source>
          ,
          <volume>435</volume>
          (
          <issue>7039</issue>
          ):
          <volume>207</volume>
          {
          <fpage>211</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yao</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Cui</surname>
          </string-name>
          . Terec:
          <article-title>A temporal recommender system over tweet stream</article-title>
          .
          <source>Proceedings of the VLDB Endowment</source>
          ,
          <volume>6</volume>
          (
          <issue>12</issue>
          ):
          <volume>1254</volume>
          {
          <fpage>1257</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Cheng</surname>
          </string-name>
          , J. Caverlee, and
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>You are where you tweet: a content-based approach to geo-locating twitter users</article-title>
          .
          <source>In Proc. CIKM</source>
          , pp.
          <volume>759</volume>
          {
          <fpage>768</fpage>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Diaz-Aviles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Drumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Schmidt-Thieme</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Nejdl</surname>
          </string-name>
          .
          <article-title>Real-time top-n recommendation in social streams</article-title>
          .
          <source>In Proc. RecSys</source>
          , pp.
          <volume>59</volume>
          {
          <fpage>66</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Dobos</surname>
          </string-name>
          , J. Szule, T. Bodnar,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hanyecz</surname>
          </string-name>
          , T. Seb}ok,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kondor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Kallus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Steger</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Csabai</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Vattay.</surname>
          </string-name>
          <article-title>A multi-terabyte relational database for geo-tagged social network data</article-title>
          .
          <source>In Cognitive Infocommunications (CogInfoCom)</source>
          ,
          <year>2013</year>
          IEEE 4th International Conference on, pp.
          <volume>289</volume>
          {
          <fpage>294</fpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K. Y.</given-names>
            <surname>Kamath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Caverlee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Cheng</surname>
          </string-name>
          .
          <article-title>Spatio-temporal dynamics of online memes: a study of geo-tagged tweets</article-title>
          .
          <source>In Proc. WWW</source>
          , pp.
          <volume>667</volume>
          {
          <issue>678</issue>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kivela</surname>
          </string-name>
           and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Porter</surname>
          </string-name>
          .
          <article-title>Estimating inter-event time distributions from nite observation periods in communication networks</article-title>
          .
          <source>arXiv preprint arXiv:1412.8388</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Lee</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Sumiya</surname>
          </string-name>
          .
          <article-title>Measuring geographical regularities of crowd behaviors for twitter-based geo-social event detection</article-title>
          .
          <source>In Proceedings of the 2nd ACM SIGSPATIAL international workshop on location based social networks</source>
          , pp.
          <volume>1</volume>
          {
          <fpage>10</fpage>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Levandoski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sarwat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Eldawy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Mokbel</surname>
          </string-name>
          .
          <article-title>Lars: A location-aware recommender system</article-title>
          .
          <source>In Proc. ICDE</source>
          , pp.
          <volume>450</volume>
          {
          <fpage>461</fpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mocanu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Baronchelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Perra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Goncalves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A.</given-names>
            <surname>Vespignani</surname>
          </string-name>
          .
          <article-title>The twitter of babel: Mapping world languages through microblogging platforms</article-title>
          .
          <source>PloS One</source>
          ,
          <volume>8</volume>
          (
          <issue>4</issue>
          ):e61981,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Palovics</surname>
          </string-name>
          and
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Benczur</surname>
          </string-name>
          .
          <article-title>Temporal in uence over the last.fm social network</article-title>
          .
          <source>Social Netw. Analys. Mining</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ):
          <fpage>4</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>P.</given-names>
            <surname>Symeonidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ntempos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Manolopoulos</surname>
          </string-name>
          .
          <article-title>Location-based social networks</article-title>
          .
          <source>In Recommender Systems for Location-based Social Networks</source>
          , pp.
          <volume>35</volume>
          {
          <fpage>48</fpage>
          . Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vazquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Racz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lukacs</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.-L.</given-names>
            <surname>Barabasi</surname>
          </string-name>
          .
          <article-title>Impact of non-poissonian activity patterns on spreading processes</article-title>
          .
          <source>Physical review letters</source>
          ,
          <volume>98</volume>
          (
          <issue>15</issue>
          ):
          <fpage>158702</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>