<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Volatile Classification of Point of Interests based on Social Activity Streams</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A. E. Cano</string-name>
          <email>A.Cano@dcs.shef.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Varga</string-name>
          <email>A.Varga@dcs.shef.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>F. Ciravegna</string-name>
          <email>F.Ciravegna@dcs.shef.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>OAK Group, Dept. of Computer Science, The University of Sheffield</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Location sharing services(LSS) like Foursquare, Gowalla and Facebook Places gather information from millions of users who leave trails in locations (i.e. chekins) in the form of micro-posts. These footprints provide a unique opportunity to explore the way in which users engage and perceive a point of interest (POI). A POI is as a human construct which describes information about locations (e.g restaurants, cities). In this work we investigate whether the collective perception of a POI can be used as a real-time dataset from which POI's transient features can be extracted. We introduce a graph-based model for profiling geographical areas based on social awareness streams. Based on this model we define a set of measures that can characterise a location-based social awareness stream as well as act as indicators of volatile events occurring at a POI. We applied the model and measures on a dataset consisting of a collection of tweets generated at the city of Sheffield and registered over three week-ends. The model and measures introduced in this paper are relevant for design of future location-based services, real-time emergency-response models, as well as traffic forecasting. Our empirical findings demonstrate that social awareness streams not only can act as an event-sensor but also can enrich the profile of a location-entity.</p>
      </abstract>
      <kwd-group>
        <kwd>Points of Interest</kwd>
        <kwd>social awareness streams</kwd>
        <kwd>social data mining</kwd>
        <kwd>citizen sensing</kwd>
        <kwd>emerging semantics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Recent studies in user profiling have proposed the use of social activity streams for
modelling users’ interest, activities and behaviour [11][
        <xref ref-type="bibr" rid="ref1">1</xref>
        ][
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. These studies explore a
user’s comments in windows of time for revealing hidden features; which can aid in
profiling the user in real-time. Although people-entities have started to be modelled in
real-time, little has been done in modelling other entities involved in the environment
in which a user is immersed. One example of these entities is Location.
      </p>
      <p>In terms of location-awareness, a Point of Interest (POI) has been so far modelled
as a set of static data (e.g. name, address, geo-coordinates) and classified according to
the type of services it provides. Nonetheless, there are diverse latent (or hidden)
features which can describe volatile and temporal aspects of it. For example, in normal
conditions London, UK can be classified as a city labelled as: Urban, Tourism, Fashion.
However during the London riots(Aug 2011), the collective opinions gathered through
social activity streams (i.e. Twitter) regarding this city, started profiling this place with
the following tags: looting,unrest,police. These tags clearly provide a temporal
reclassification of this venue labelling it as for example: Political, Uprising, Violence.</p>
      <p>In this paper, we investigate whether the supplement of situational knowledge
extracted from social activity streams can be used to infer higher level contextual
information, which can induce a transient representation of a venue. Given the real-time and
volatile nature of events happening at a venue, providing an accurate classification of
these events involve different challenges including the variation of the vocabulary and
classes in which an event could be classified in time.</p>
      <p>The contributions of this paper are as follows:
◦ GeoLattice Awareness Streams: We introduce a graph-based model for profiling
geographical areas based on social awareness streams.
◦ Approach to derive a transient semantic classification of a POI: We present a novel
approach for dynamically classifying POI based on location-based social footprints
and DBPedia structured data. We define a set of measures that can characterise a
location-based social awareness stream as well as act as indicators of volatile events
occurring at a POI.
◦ Empirical Study: We applied this methodology in a dataset consisting of a collection
of tweets generated at the city of Sheffield and registered over three week-ends.</p>
      <p>The model and measures introduced in this paper are relevant for design of future
location-based services, real-time emergency-response models, as well as traffic
forecasting.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Little work has been done in classifying POIs based on location-based social activity
streams. However, there are several research directions closely related to POI
classification. Analysing the contextual meanings of places has long attracted attention by
researchers in fields like social interaction, environmental psychology, ubiquitous
computing and spatial data mining. Researchers on social interaction and environmental
psychology have documented the way in which mobile users tend to provide
information about location when they are asked about their current activity [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][12]. Schegloff
[10] noted that during a conversation, attention is exhibited to: 1)
‘where-we-know-weare’; 2) ‘who-we-know-we-are’; 3) ‘what-we-are-doing-at-this-point-in-conversation’;
from which a ‘this situation’ can be translated in some ‘this conversation, at this place,
with these members, at this point in its course’. This contextual knowledge has been
used to infer a users’ situational features including a person’s level of availability or
interruptibility.
      </p>
      <p>
        The role of geography and location in online social networks has recently attracted
increasing attention. Experimental work done on location awareness has shown that
location sharing services (LSS) (e.g. Foursquare) are used to express not only users’
whereabouts but also their moods, lifestyle and events [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In their work, Barkhuus et al.
allowed users to tag areas and build a repartee in a group. They pointed out four different
types of location labels that participants used in their study, including: 1) geographic
references, 2) personal meaningful place, 3) activity-related labels, and 4) hybrid labels.
      </p>
      <p>
        Cheng et al.[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] modelled the spatial distribution of words in Twitter’s user-generated
content for predicting user’s location. Following a top-down approach they propose a
probabilistic framework for estimating a Twitter user’s city-level location based on the
content of the user’s tweets even on the abscence of any geospatial cues. Although their
approach is content-based and can automatically indetify words in tweets with a strong
geo-scope, they don’t provide a topical categorisation of a given geo-scope.
      </p>
      <p>Further work from Cheng et al [13] study mobility patterns of users in location
sharing services (LSS), they correlate social status, geographic and economic factors
with mobility and perform a sentiment-based analysis of post for deriving unboserved
context between people and locations.</p>
      <p>
        Lin et al [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] derive a taxonomy of different place naming methods, showing that a
person’s perceived familiarity with a place and the entropy of that place (i.e. the variety
of people who visit it) strongly influence the way people refer to it when interacting with
others. Based on this taxonomy, they present a machine learning model for predicting
the place naming method people choose. Ireson and Ciravegna [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] study toponym
resolution (i.e. the allocation of specific geolocation to target location terms) using Flickr
data. They construct an SVM classifier for predicting location labels associated to a
location term. Their model makes use of information context features including geo-tag
media, users’ contacts’related tags.
      </p>
      <p>
        Regarding place descriptions based on location sharing services (LSS), Hightower
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] redefines a place as an evolving set of both communal and personal labels for
potentially overlapping geometric volumes. He highlights that a meaningful place can capture
the venue’s demographic, environmental, historic, personal or commercial significance.
      </p>
      <p>Our work is in line with Hightower’s definition of a place, however rather than study
location-sharing practices we aim to study how location-based generated content can be
modelled for discoverying topics or categories that classify a place on time.
3</p>
    </sec>
    <sec id="sec-3">
      <title>GeoLattice Awareness Stream</title>
      <p>Following the Tweetonomy model suggested by Wagner and Strohmaier[11], we
introduce a formalisation for describing the comments related to a geographical region in
time; we refer to it as GeoLattice Awareness Streams.</p>
      <p>The W3C POI Working Group 1 defines a POI as a human construct which describes
information about locations. According to their definition, a POI is not limited to a set
of coordinates and an identifier but also can include a more complex structure like for
example a three dimensional model o a building, opening and closing hours etc.</p>
      <p>As mentioned in the previous section, location sharing services provide a
classification of their points of interest according to the type of service they provide (e.g. Food,
Nightlife Spots), however these categories are static and do not reveal any information
about the type of events occurring in a given venue. The key idea of our approach is to
enrich a POI by associating transient categories emerging from social activity streams
regarding this POI.</p>
      <p>Definition 1. A GeoLattice Awareness Stream can be defined as a sequence of tuples
S := (P oiq1, Cq2, Rq3, Y, f t) where
• Poi, M, R are finite sets whose elements are called Points of Interest, Messages and</p>
      <p>Resources;
• Each of these sets is qualified by q1, q2 and q3 respectively (explained below);
• The qualifier q1 for a Point of Interest (poi) includes for example name,
geographical-bounding area, and geo-coordinates.
• The qualifier q2 for a message m considers for example the message’s source
(e.g Facebook, Twitter) and it’s geo-coordinates.
• The qualifier q3 for a resource r considers: Rcat (category),Rk (keywords), Rh
(hashtags).
• Y is the ternary relation Y ⊆ Poi × M × R representing a hypergraph with ternary
edges. The hypergraph of a GeoLatice Awareness Stream Y is defined as a tripartite
graph H (Y) = $V, E% where the vertices are V = Poi ∪ M ∪ R, and the edges are:
E = {{poi, m, r} | (poi, m, r) ∈ Y }.
• ft is a function that assigns a temporal marker to each Y; f t : Y → T .</p>
      <p>Given a GeoLattice awareness stream S, a POI awareness stream can be defined as
the sequence of tuples from S where:</p>
      <p>S(Poi!) = (Poi, M, R, Y!, f t) , and Y! = {(poi, m, r) | poi ∈ Poi! ∨ ∃poi! ∈
Poi!, m˜ ∈ M, r ∈ R : (poi!, m˜, r) ∈ Y} i.e., a POI Awareness Stream is the aggregation
of all messages which are related to a certain set of points of interest poi ∈ Poi ! and
all resources and further points of interest related with these messages.
4
4.1</p>
    </sec>
    <sec id="sec-4">
      <title>Transient Semantic Classification of a POI</title>
      <sec id="sec-4-1">
        <title>Problem Statement</title>
        <p>Comments extracted from social activity streams can be described as semi-public,
naturallanguage messages produced by different users and characterised by their brevity. Given
these characteristics and the variation in the vocabulary appearing on a POI awareness
stream comments, finding relevant categories that can accurately qualify a comment is
a challenging task.</p>
        <p>Definition 2. We define a temporal classification of a Point of Interest as the
aggregation of Rcat category resources qualifying messages contained in a specific window
of time denoted by [ts, te]. An S(Poi!)[ts, te] is defined as S(Poi!) where f t : Y →
T , ts ≤ f t ≤ te.</p>
        <p>Given the above definition, our task consists on obtaining category resources R cat
which can classify a poi within a window of time [ts, te]. In this section, we introduce
a strategy for categorising points of interest.</p>
        <p>The POI categorisation within a window of time could enable reactive services (e.g.
targeting advertisements to users based on a users location and the POI categorisation,
emergency-response).
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Entity-Based Discovery of Transient Categories</title>
        <p>Our intuition is to use the categorisation of the messages’ resources generated from
a Point of Interest awareness stream (S(Poi!)) taken in windows of time ([ts, te]), to
induce a categorisation function. Figure 1 presents an overview of our approach.</p>
        <p>Retrieve Messages
from a POI Awareness
Stream for a window
of time [ts-te]</p>
        <p>Message Enrichment Given a message from a POI awareness stream S(Poi!), we
perform a lightweight message enrichment by using Zemanta 2, and OpenCalais 3. These
services perform entity-extraction on the input message identifying resources which
can be qualified as: Ro (organisations – entities recognised as an organisation), R p
(people –entities recognised as a person), R l (location – entities recognised as a location)
and Rli (links resources). These services also provide DBPedia concepts relevant to the
message. Consider the example in Figure 2, where the extracted entities and DBPedia
concepts for a Twitter message are shown.</p>
        <p>http://dbpedia.org/page/Junaio
AR workshop - Creating mobile channels with the Junaio mobile AR app
@ubistudio: Ubiquitous Media Studio #1 (Palo Alto) http://bit.ly/cGSvlC
Facility City Link
http://dbpedia.org/page/Palo_Alto,_California
Semantic Categorisation In order to semantically categorise a POI stream’s message
(m), we search for DBPedia concepts which are relevant to the extracted entity-based
resources, and aggregate these concepts to those already suggested by the message
enrichment services. Given a resource (r) we extract DBPedia categories and broader
categories from the DBPedia Linked Data Graph (D) using the following construct:
2 Zemanta, http://www.zemanta.com/
3 OpenCalais, http://www.opencalais.com/</p>
        <p>Rcat(r) = {xcat ∪ xbroaderCat|</p>
        <p>&lt; r, dcterms:subject, xcat &gt;
∧ &lt; xcat, skos:broader, xbroaderCat &gt;∈ D }
Induce Category Function After applying the semantic categorisation technique to
all messages belonging to a POI stream taken from a window of time [t s, te], we need
to weight them in order to identify the relevant categories.</p>
        <p>In order to do so, we utilise the resource category stream (S(R c!at)) of a POI stream
(S(P oi!)), which is the collection of all category resources classifying the POI stream’s
messages. For characterising the POI stream (S(P oi!)) based on the category resources
we propose two metrics:
1. Category Entropy of a Stream, which indicates the topical diversity of the stream.</p>
        <p>We defined the category entropy in terms of the POI stream’s vocabulary as :
CE(c) = − ! P (w|c) ∗ log(P (w|c))
w∈Rk
(2)
where w is a word in the POI stream’s vocabulary (S(R k!)), and c is a category
in the POI stream’s categories (S(Rc!at)). Low category entropy levels reveal that
a stream is dominated by few categories, while a high category balance reveals a
higher topical diversity. In normal conditions (i.e. no special events happening),
we would expect for example to obtain a low category entropy levels for a POI
stream referring to a Restaurant, since the messages would be classified within a
limited set of categories related to Food. While for a POI stream referring to a city
in normal conditions (no particular events happening), we would expect to observe
higher category entropy levels since the topical diversity would be higher.
However if normal conditions are broken, and unexpected (or volatile) events start
to happen, we would expect to observe an increment in the category entropy levels
of Restaurant POI stream, and a decrement in the category entropy levels of a City
POI stream. The category entropy acts in this way as an indicator of volatile events.
2. Mutual Information (MI), measures the information that two discrete random
variables share. In this work we consider the following:
◦ Categories-Hashtags (MI)</p>
        <p>I(C; H) =
!</p>
        <p>! p(c, h) ∗ log
c∈Rcat h∈Rh
p(c, h)
p(c)p(h)
where c is a category in the POI stream’s categories (S(R c!at)) and h is a hashtag
in the POI stream’s hashtags (S(Rh!)) and p(c,h) is the joint probability
distribution function of C and H, with marginals p(c) and p(h).
◦ Categories-Keywords (MI)</p>
        <p>I(C; K) =
!</p>
        <p>! p(c, w) ∗ log
c∈Rcat w∈Rk
p(c, w)
p(c)p(w)
where c is a category in the POI stream’s categories (S(R c!at)) and w is a word
in the POI stream’s keywords (S(Rk!)).
◦ Hashtags-Keywords (MI)</p>
        <p>I(H; K) = !</p>
        <p>! p(h, w) ∗ log
h∈Rh w∈Rw
p(h, w)
p(h)p(w)
(3)
(4)
(5)
where h is a hashtag in the POI stream’s hashtags (S(Rh!)).</p>
        <p>The higher the mutual information, the more one random variable is relevant to the
other.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiments</title>
      <p>In this section we discuss our approach for evaluating the accuracy of the strategies
proposed in Section 4 by using the formalisation introduced in Section 3. In order to
identify a transient categorisation of a point of interest we decided to investigate a POI
stream S(P oi!) in windows of time of one week-end.
5.1</p>
      <sec id="sec-5-1">
        <title>Dataset</title>
        <p>The corpus used for our study consists of Twitter messages taken over three week-ends
in the city of Sheffield. Since we aim to study patterns emerging from volatile events we
registered a week-end in normal conditions (i.e. no events happening) from 2011-06-10
to 2011-06-13 as control and two more week-ends in which especial events occurred.
The especial events were the Sheffield Food Festival (from 2011-07-08 to 2011-07-11)
and the Sheffield Tramlines Music Festival (from 2011-07-22 to 2011-07-25). The data
was collected using the Twitter Streaming API4 with the public firehose and filtering by
geographical area (using Sheffield’s bounding geo-coordinates).</p>
        <p>For each week-end dataset we removed stop words and applied the approach
presented in Section 4.2, extracting hasthags, keywords and entity resources as well as
DBPedia categories for these resources. The statistics for each stream is summarised in
Table 2.
First we analyse the most frequent hashtags in the three datasets presented in Table 4.
Although trends in hashtags are useful for detecting changes in a stream, hashtags tend
to present high ambiguity, and a frequent use of abbreviations. These are some of the
reasons why hashtags are not enough to provide a categorisation by themselves.</p>
        <p>We calculated the categories’ entropies for each of the three datasets’ categories.
The categories entropy distributions are shown in Figure 3. We can observe that the
stream taken from Sheffield in normal conditions (labelled as “Week End” in the graph)
presents denser regions in higher entropy levels.
4 https://dev.twitter.com/</p>
        <p>Since lower category entropy levels provide a better information gain, we pick a
category entropy threshold from which to pick categories. For these data sets and
following Figure 3 we picked -9 as a threshold obtaining: 255 categories for the common
week-end, 28 categories for food festival, and 562 categories for Tramlines. Table 5
shows the top 21 categories for each stream.</p>
        <p>It is important to notice that we are not biasing the results by picking a priori hashtags
relevant to the week-end events, but rather the categories emerge from category entropy
analysis. From Table 5, very disparate categories appeared for the week-end in normal
conditions (“common”), while for the Food Festival week-end we find categories which
appear to be related either to external events or future events (Music Festivals), as well
as categories related to a current event (Food companies of the United Kingdom).
Incidentally for the food festival week-end we found two sets of semantically coherent
categories, the first (categories from 13-17) matches an external event related to the
2012 Olympic tickets sales, while the second (categories 18-23) appears to be closely
relevant to an event involving Spanish football. We can observe that the categories
obtained for the Tramlines Music Festival are more semantically coherent compared to
the other two week-ends. This could be due to a higher relevance of the tramlines event
compared to other events occurring at the same time in the city or externally.</p>
        <p>Although some of the categories emerging from the category entropy analysis give
an insight of endemic events, there are also other categories which provide information
of events occurring externally. Hence, a Point of Interest considered as a
LocationEntity presents the “meformer” and “informer” patterns observed by Naaman et al.
[9] in Person-Entity activity streams. In this case the “Meformer” pattern refers to a
self focus of a Location-Entity, presenting information about endemic events, while the
“Informer” pattern refers to an information sharing of external events, not necessarily
related to this Location-Entity.</p>
        <p>In order to provide a context in which the category is being used, we use the mutual
information between categories and hashtags (see Equation 3), from which we obtain a
set of hashtags that can be used to further derived related keywords (see Equation 5)</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and Future Work</title>
      <p>The identification of category resources R cat from a POI awareness stream Ga(P!) can
be considered as a multi-class, multi-label classification task. This becomes challenging
when no assumptions can be made a priori on the type of classes that will classify
future events. Our approach semantically enriches the information of the social stream
by providing a DBPedia based categorisation.</p>
      <p>We have presented a formalisation for describing geographically bounded social
awareness streams, we have also provided an approach for deriving transient
categorisations of points of interest. We have applied our methodology on a data set and we
have presented an empirical analysis of our results.</p>
      <p>Future work includes a quantitative evaluation of this methodology by using larger
datasets in which events have been identified a priori, and against which we can evaluate
the emerging categories resulting from our approach.</p>
      <p>Questions still remain on how we could determine a semantic coherence metric,
which could induce broader category clusters. A semantic cluster of these categories can
provide a better insight to the kind of events to which they refer to. Take for example
the categories found for the Tramlines event, although we know these categories are
related to music, we still haven’t inferred the broader category “Music Festival”.
Acknowledgements A.E. Cano is funded by CONACyT, grant 175203.Andrea Varga
is funded by the SAMULET project, co-funded by TSB and Rolls-Royce plc/
9. M. Naaman, J. Boase, and C.-H. Lai. Is it really about me?: message content in social
awareness streams. In CSCW ’08: Proc., 2010 ACM conference on Computer supported
cooperative work, pages 189–192, 2010.
10. E. Schegloff. Notes on a conversational practice: formulating place. in Studies in Social</p>
      <p>Interaction Ed D Sudnow (Free Press), 1972.
11. C. Wagner and M. Strohmaier. The wisdom in tweetonomies: Acquiring latent conceptual
structures from social awareness streams. In Proc. of the Semantic Search 2010 Workshop
(SemSearch2010), april 2010.
12. A. Weilenmann. ”i can’t talk now, i’m in a fitting room”: formulating availability and location
in mobile-phone conversations. Environment and Planning A, 35(9):1589–1605, 2003.
13. K. L. D. Z. S. Zhiyuan Cheng, James Caverlee. Exploring millions of footprints in location
sharing services. In 5th International Conference on Weblogs and Social Media (ICWSM),
ICWSM ’11. ACM, 2011.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>F.</given-names>
            <surname>Abel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Gao</surname>
          </string-name>
          , G. Houben, and
          <string-name>
            <given-names>K.</given-names>
            <surname>Tao</surname>
          </string-name>
          .
          <article-title>Semantic enrichment of twitter posts for user profile construction on the social web</article-title>
          .
          <source>In In proceedings of Extended Semantic Web Conference</source>
          <year>2011</year>
          , May
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>L.</given-names>
            <surname>Barkhuus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Brown</surname>
          </string-name>
          , M. Bell,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sherwood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hall</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Chalmers</surname>
          </string-name>
          .
          <article-title>From awareness to repartee: sharing location within social groups</article-title>
          .
          <source>In CHI '08:</source>
          <article-title>Proceeding of the twenty-sixth annual SIGCHI conference on Human factors in computing systems</article-title>
          , pages
          <fpage>497</fpage>
          -
          <lpage>506</lpage>
          , New York, NY, USA,
          <year>2008</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>A.</given-names>
            <surname>Cano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tucker</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Ciravegna</surname>
          </string-name>
          .
          <article-title>Capturing entity-based semantics emerging from personal awareness streams</article-title>
          .
          <source>In Proceedings of the Workshop on Making Sense of Microposts (MSM2011)</source>
          , May
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Z. Cheng, J. Caverlee, and
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>You are where you tweet: a content-based approach to geo-locating twitter users</article-title>
          .
          <source>In Proceedings of the 19th ACM international conference on Information and knowledge management</source>
          ,
          <source>CIKM '10</source>
          , pages
          <fpage>759</fpage>
          -
          <lpage>768</lpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>J.</given-names>
            <surname>Hightower</surname>
          </string-name>
          .
          <article-title>From position to place</article-title>
          . pages
          <fpage>10</fpage>
          -
          <lpage>12</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>N.</given-names>
            <surname>Ireson</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Ciravegna</surname>
          </string-name>
          .
          <article-title>Toponym resolution in social media</article-title>
          .
          <source>In Proc., 9th International Semantic Web Conference. ISWC</source>
          <year>2010</year>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>E.</given-names>
            <surname>Laurier. Why People Say Where They Are During Mobile Phone Calls</surname>
          </string-name>
          .
          <source>Environment and Planning D: Society and Space</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>J.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. I.</given-names>
            <surname>Hong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Sadeh</surname>
          </string-name>
          .
          <article-title>Modeling people's place naming preferences in location sharing</article-title>
          .
          <source>In Proceedings of the 12th ACM international conference on Ubiquitous computing, Ubicomp '10</source>
          , pages
          <fpage>75</fpage>
          -
          <lpage>84</lpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>