<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Predicting Land Use of Italian Cities using Structural Semantic Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gianni Barlacchi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bruno Lepri</string-name>
          <email>lepri@fbk.eu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Moschitti</string-name>
          <email>amoschittig@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Engineering and Computer Science, University of Trento</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Fondazione Bruno Kessler</institution>
          ,
          <addr-line>Trento</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Qatar Computing Research Institute</institution>
          ,
          <addr-line>HBKU</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>TIM Semantics and Knowledge Innovation Lab</institution>
          ,
          <addr-line>Trento</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. We propose a hierarchical semantic representation of urban areas extracted from a social network to classify the most predominant land use, which is a very common task in urban computing. We encode geo-social data from LocationBased Social Networks with standard feature vectors and a conceptual tree structure that we call Geo-Tree. We use the latter in kernel machines, which can thus perform accurate classification, exploiting hierarchical substructure of concepts as features. Our comparative study on three datasets extracted from Milan, Rome and Naples shows that Tree Kernels applied to Geo-Trees are very effective improving the state of the art.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Italiano. In questo lavoro, proponiamo un
nuovo modello semantico per la
rappresentazione di aree urbane utilizzando dati
da social media. In particolare,
modelliamo tale informazione con una struttura
ad albero che abbiamo chiamato
GeoTree. Questa viene utilizzata, in
combinazione con un vettore di feature
classico, nelle kernel machine per fare
classificazione della destinazione di uso delle
aree urbane. Abbiamo valutato il nostro
approccio su tre grandi metropoli italiane
quali Milano, Roma e Napoli. I risultati
mostrano come i Geo-Tree, applicati ai
Tree Kernel, riescono a raggiungere
risultati di molto superiori ad altri modelli
attualmente stato dell’arte.</p>
    </sec>
    <sec id="sec-2">
      <title>1 Introduction</title>
      <p>
        The growing availability of data from cities
        <xref ref-type="bibr" rid="ref2 ref3">(Barlacchi et al., 2015a)</xref>
        (e.g., traffic flow, human
mobility and geographical data) opens new
opportunities for predicting and thus optimizing human
activities. For example, the automatic analysis of
land use enables the possibility of better
administrating a city in terms of resources and provided
services. However, such analysis requires specific
information, which is often not available for
privacy concerns. In this paper we follow the
approach proposed in
        <xref ref-type="bibr" rid="ref4">(Barlacchi et al., 2017)</xref>
        and
we use public textual descriptions of urban
areas to design a novel machine learning
representation. We represent urban areas as: (i) a
bagof-concepts (BOC), e.g., the terms Arts and
Entertainment, College and University, Event, Food
extracted from the Foursquare description of the
area; and (ii) the same concepts above organized in
a tree, which reflects the hierarchical organization
of Foursquare activities. We combine BOC
vectors with Tree Kernels (TKs)
        <xref ref-type="bibr" rid="ref6 ref9">(Collins and Duffy,
2002; Moschitti, 2006)</xref>
        applied to concept trees
(Geo-Tree) and use them in Support Vector
Machines (SVMs). The Geo-Tree allows the model
to learn complex structural and semantic patterns
from the hierarchical conceptualization of an area.
We show that TKs not only can capture
semantic information from natural language text, e.g., as
shown for semantic role labeling
        <xref ref-type="bibr" rid="ref8">(Moschitti et al.,
2008)</xref>
        and question answering
        <xref ref-type="bibr" rid="ref10 ref2 ref3 ref7">(Severyn and
Moschitti, 2013; Barlacchi et al., 2015b)</xref>
        , but they can
also learn from the hierarchy above to perform
semantic inference, such as deciding which is the
major activity of a land.
      </p>
      <p>We carried out a study on land use prediction
of three Italian cities: Milan, Rome and Naples
as follows: (i) we divided each city in squares of
200x200 meters; (ii) then, we classify the most
predominant land use class (e.g., High Density
Urban Fabric or Open Space and Outdoor), assigned
by the city administration. The results show that
GeoTKs achieve an impressive improvement over
state-of-the-art classification approaches based on
BOC., i.e., 21.2%, 13.6% and 54.3% of relative
improvement in Macro-F1 over Milan, Rome and</p>
      <sec id="sec-2-1">
        <title>Naples datasets, respectively.</title>
        <p>2</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related Work</title>
      <p>
        Previous work has modeled land use classification
by means of different sources of information. For
example, Yuan et al. (2012) built a framework that,
using human mobility patterns derived from
taxicab trajectories and Point Of Interests (POIs),
classifies the functionality of an area for the city of
Beijing. Assem et al. (2016) proposed a
spatiotemporal approach based on three different
clustering algorithms to model the change of
functionality of a city’s region over time. They extracted
features from Foursquare’s POIs and check-in
activities of Manhattan.
        <xref ref-type="bibr" rid="ref12">Yao et al. (2017)</xref>
        built
sequences of POI concepts reflecting their spatial
distance. Then, they applied Word2Vec
        <xref ref-type="bibr" rid="ref7">(Mikolov
et al., 2013)</xref>
        to these sequences to derive vectors
representing each area, which was used to train
a land use classifier. In general, most previous
work applies extensive feature engineering, which
is typically costly as it requires to fully understand
the target domain. Our approach alleviates this
problem with automatic feature engineering
applied to an abstract land representation.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>Land Description Data</title>
      <p>Geospatial city areas are described with the
popular shape file format, where each shape is a
collection of points geo-localized using their
coordinates. The latter are provided with the well-known
Coordinate Reference System (CRS) WGS84,
adopted for the common latitude/longitude
geolocation. We use (i) shape files provided by Urban
Atlas1, a website providing data for large urban
areas (more than 100; 000 inhabitants) and (ii) POIs
from Foursquare2.</p>
      <sec id="sec-4-1">
        <title>3.1 Land Use</title>
        <p>Cities are divided in small areas associated with
a main land use. In total, there are 17
different land use classes defined from the open dataset
Urban Atlas 3. We focused on those related to
city centers, discarding those less interesting from
a social viewpoint, i.e., associated with rural
areas such as forests, agricultural, semi-natural and
wetland areas and mineral extraction and dump
sites. Thus, we selected the following categories:
1https://www.eea.europa.eu/data-and-maps/data/urbanatlas
2https://foursquare.com/
3https://www.eea.europa.eu/data-and-maps/data/urbanatlas#tab-additional-information
(i) High Density Urban Fabric, (ii) Medium
Density Urban Fabric, (iii) Low Density Urban
Fabric, (iv) Industrial, commercial, public, military
and private units, (v) Open Space &amp; Recreation,
(vi) Transportation. We collapsed Medium and
Low Density Urban Fabric into one single
category, ML-Density Urban Fabric as they only have
few samples. Land use distribution is very
finegrained, making its classification based on POI
information very difficult. A trade-off between
classification accuracy and the desired area
granularity consists in segmenting the regions in squared
cells. As each cell can contain more than one land
use label, we consider the predominant label as its
primary use.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.2 Point-Of-Interest</title>
        <p>A POI is usually characterized by a location (i.e.,
latitude and longitude), textual information (e.g.,
a description of the activity in that place) and
a hierarchical categorization that provides
different levels of detail about the activity of the place
(e.g., Food, Asian Restaurant, Chinese
Restaurant). We used POIs extracted from Foursquare, a
geolocation-based social network supported with
web search facilities for places and a
recommendation system. In particular, we extracted 46,731,
43,389 and 7,219 POIs from Milan, Rome and
Naples4, respectively. We focused on the ten
macro-categories of such POIs5, each one
specialized in maximum four levels of detail.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Structural Models</title>
      <p>In most machine learning algorithms data
examples are transformed in feature vectors, which
in turn are used in dot products to carry out
both learning and classification. Kernel Machines
(KMs) allow for replacing the dot product with
kernel functions, which directly compute it on the
examples, i.e., they avoid the transformation of
examples in vectors. The main advantage of KMs is
a much lower computational complexity as it does
not directly depend on the feature space size.</p>
      <sec id="sec-5-1">
        <title>4.1 Point-of-interests Features</title>
        <p>The most straightforward way to represent an area
by means of Foursquare data is the use its POIs.
Every venue is hierarchically categorized (e.g.,
Professional and Other Places ! Medical Center
! Doctor’s office) and the categories are used to
produce an aggregated representation of the area.
4For some reasons Foursquare is less popular in Naples
5https://developer.foursquare.com/categorytree
We define a feature vector for a grid cell by
counting the macro-level category (e.g., Food) in all the
POIs that we found in that cell.</p>
      </sec>
      <sec id="sec-5-2">
        <title>4.2 Geographical Tree Kernel</title>
        <p>Foursquare has its own hierarchy of categories,
which is used to characterize each location and
activity (e.g., restaurants or shops) in the database.
Thus, each Foursquare POI is associated with a
hierarchical path, which semantically describes the
type of location/activity (e.g., for Chinese
Restaurant, we have the path Food ! Asian
Restaurant ! Chinese Restaurant). The path is much
more informative than just the target POI name,
as it provides feature combinations following the
structure and the node proximity information, e.g.,
Food &amp; Asian Restaurant or Asian Restaurant
&amp; Chinese Restaurant are valid features whereas
Food &amp; Chinese Restaurant is not.</p>
        <p>Geo-Tree: we propose a new tree structure, i.e.,
Geo-Tree, whose nodes and edges among them are
subsets of the Foursquare hierarchy (FH). A
GeoTree of a grid cell is constituted by a new root node
connecting the subtrees of FH rooted in concepts
present in the cell. In other words, we connect all
the paths of FH starting from grid concepts. Figure
1 shows an example of the FH paths of a cell and
the resulting Geo-Tree.</p>
        <p>
          This way, the nodes of the first level, i.e.,
the root children, correspond to the most general
FH categories, e.g., Arts &amp; Entertainment, Event,
Food, etc., the second level of our tree
corresponds to the second level of the hierarchical tree
of Foursquare, and so on. The terminal nodes are
the finest-grained descriptions in terms of category
about the area, e.g., College Baseball Diamond
or Southwestern French Restaurant. For
example, Fig. 2 illustrates the semantic structure of a
grid cell obtained by combining all the categories’
chains of each venue.
GeoTK: given a Geo-Tree, we can encode all
its substructures in kernel machines using TKs.
In particular, we used the Syntactic Tree Kernels
(STKb) with Bag-Of-Words and the Partial Tree
Kernel (PTK)
          <xref ref-type="bibr" rid="ref9">(Moschitti, 2006)</xref>
          . Our TKs by
construction do not consider the frequency6 of the
POIs present in a given grid cell.
        </p>
        <p>BOC kernel: to complement GeoTK, we
represent a cell also creating a BOC representation,
namely we count the macro-level category (e.g.,
Food) in all the POIs that we found in any cell
grid. This way, we generate feature vectors by
counting the number of each activity under each
macro-category. In order to take into consideration
the popularity of the area, we included (i) the total
sum of unique users that did at least one check-in
in the cell, and (ii) the total sum of check-in done
in the cell. Note that, given an area, the number of
unique users provides an idea on how many
people visited it, while the number of check-in can be
used to represent its popularity.</p>
        <p>Kernel combination: finally, given two
geographical areas, xa and xb, we define a kernel
combining Geo-Tree and BOC as: K(xa; xb) =
T K(ta; tb) + KV (va; vb), where T K is any
6It is possible to add the frequency in the kernel
computation but for our study we preferred to have a completely
different representation from previous typical frequency-based
approaches.
structural kernel function applied to tree
representations, ta and tb of the geographical areas and
KV is a kernel applied to the feature vectors, va
and vb, extracted from xa and xb using any data
source available (e.g., text, social media, mobile
phone and census data).</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5 Experiments and Results</title>
      <p>
        We performed our experiments on the data from
Milan, Rome and Naples. We used a grid of
200x200meters as it is indicated as the best size
from other similar previous work on land use
classification
        <xref ref-type="bibr" rid="ref11 ref14 ref4">(Toole et al., 2012; Zhan et al.,
2014; Barlacchi et al., 2017)</xref>
        . We applied a
pre-processing step in order to filter out cells for
which land use classification cannot be performed.
In particular, for Milan and Rome, we selected
the central point of the shape and we included
those cells that have their centroid in the radius
of 15 and 8 kilometers, respectively. For Naples,
we kept all the cells due to the smaller size of the
city. Then, for all the three cities, we removed the
cells that (i) cover areas without a specified land
use (e.g., the cells in the sea) and (ii) do not have
POIs (e.g., the countryside cells). After this step,
we obtained a grid with 2,581, 5,657 and 1,314
cells for Milan, Rome and Naples, respectively.
We created, separately for each city, the training
and test set randomly sampling 80% vs. 20% of
the cells. We labelled the dataset following the
same category aggregation strategy proposed by
Zhan et al. (2014), who assigned the predominant
land use class to each grid cell.
      </p>
      <p>
        To train our models, we applied
SVM-LightTK7, which enables the use of structural kernels
        <xref ref-type="bibr" rid="ref9">(Moschitti, 2006)</xref>
        in SVM-Light8. In particular,
due to the nature of the task, we used the Python
wrapper around SVM-Light-TK to perform
multiclass classification9. We experimented with
linear, polynomial and radial basis function kernels
applied to standard feature vectors. We measured
the performance of our classifier by averaging
Precision, Recall and F1 over all land use categories.
      </p>
      <sec id="sec-6-1">
        <title>5.1 Results for Land Use Classification</title>
        <p>
          We trained multi-class classifiers using
common learning algorithm such XGboost
          <xref ref-type="bibr" rid="ref5">(Chen and
Guestrin, 2016)</xref>
          , and SVM using linear,
poly
        </p>
        <sec id="sec-6-1-1">
          <title>7http://disi.unitn.it/moschitti/Tree-Kernel.htm</title>
          <p>8http://svmlight.joachims.org/
9https://github.com/aseveryn/SVMTK-MulticlassClassifier
nomial and radial basis function kernels, named
SVM-fLin, Poly, Rbfg, respectively, and our
structural semantic models, indicated with STKb
and PTK. We also combined kernels with a
simple summation, e.g., PTK+Lin indicates an SVM
using such kernel combination.</p>
          <p>Table 1 shows the average of F1, Precision and
Recall over the different categories. The model
baseline is obtained by always classifying an
example with the label High Density Urban Fabric,
which is the most frequent. Due to space
constraint, we only reported six models, namely: the
baseline, XGBoost and the top four kernel models.</p>
          <p>We note that: (i) GeoTK always outperforms
XGBoost and the baseline, demonstrating the
superiority of our novel approach. This is an
interesting finding as XGboost is the current state of the
art for land use classification. (ii) STKb combined
with feature vector always produces the best
results, improving the F1-score over XGBoost up to
6.3, 3.8 and 11.9 absolute points for Milan, Rome
and Naples, respectively. (iii) Kernel
combinations always provide the best results.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6 Conclusions</title>
      <p>In this paper, we have introduced Geo-Trees, a
novel semantic representation based on a
hierarchical classification of POIs, to better exploit
geosocial data to the classification of the primary land
use of an urban area. This is an important task
as it gives the urban planners and policy makers
the possibility to better administrate and renew a
city in terms of infrastructures, resources and
services. More in detail, we have built our
classifiers with combinations of a kernel over BOC and
TKs applied to Geo-Trees, thus exploiting
hierarchical substructure of concepts as features. Our
comparative study on three large Italian cities,
Milan, Rome and Naples shows that our models can
relatively improve the state of the art up to 11.9
absolute points in F1-score.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>
        This work has been partially supported by the EC
project CogNet, 671625
        <xref ref-type="bibr" rid="ref14">(H2020-ICT-2014-2,
Research and Innovation action)</xref>
        .
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Haytham</given-names>
            <surname>Assem</surname>
          </string-name>
          , Lei Xu, Teodora Sandra Buda, and
          <string-name>
            <surname>Declan O'Sullivan</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Spatio-temporal clustering approach for detecting functional regions in cities</article-title>
          .
          <source>In ICTAI</source>
          , pages
          <fpage>370</fpage>
          -
          <lpage>377</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Gianni</given-names>
            <surname>Barlacchi</surname>
          </string-name>
          , Marco De Nadai, Roberto Larcher, Antonio Casella, Cristiana Chitic, Giovanni Torrisi, Fabrizio Antonelli, Alessandro Vespignani, Alex Pentland, and
          <string-name>
            <given-names>Bruno</given-names>
            <surname>Lepri</surname>
          </string-name>
          .
          <year>2015a</year>
          .
          <article-title>A multi-source dataset of urban life in the city of milan and the province of trentino</article-title>
          .
          <source>Scientific data</source>
          ,
          <volume>2</volume>
          :
          <fpage>150055</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Gianni</given-names>
            <surname>Barlacchi</surname>
          </string-name>
          , Massimo Nicosia, and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Moschitti</surname>
          </string-name>
          . 2015b.
          <article-title>Sacry: Syntax-based automatic crossword puzzle resolution system</article-title>
          .
          <source>ACL-IJCNLP</source>
          <year>2015</year>
          , page 79.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>G</given-names>
            <surname>Barlacchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B</given-names>
            <surname>Lepri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A</given-names>
            <surname>Moschitti</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Structural semantic models for automatic analysis of land use</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Tianqi</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Carlos</given-names>
            <surname>Guestrin</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Xgboost: A scalable tree boosting system</article-title>
          .
          <source>In KDD</source>
          , pages
          <fpage>785</fpage>
          -
          <lpage>794</lpage>
          , New York, NY, USA. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Collins</surname>
          </string-name>
          and
          <string-name>
            <given-names>Nigel</given-names>
            <surname>Duffy</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron</article-title>
          .
          <source>In ACL.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Kai Chen, Greg Corrado, and
          <string-name>
            <given-names>Jeffrey</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
          <source>CoRR, abs/1301</source>
          .3781.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Moschitti</surname>
          </string-name>
          , Daniele Pighin, and
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Basili</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Tree kernels for semantic role labeling</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>34</volume>
          (
          <issue>2</issue>
          ):
          <fpage>193</fpage>
          -
          <lpage>224</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Moschitti</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Efficient convolution kernels for dependency and constituent syntactic trees</article-title>
          .
          <source>In ECML</source>
          , pages
          <fpage>318</fpage>
          -
          <lpage>329</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Aliaksei</given-names>
            <surname>Severyn</surname>
          </string-name>
          and
          <string-name>
            <given-names>Alessandro</given-names>
            <surname>Moschitti</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Automatic feature engineering for answer selection and extraction</article-title>
          .
          <source>In EMNLP</source>
          , volume
          <volume>13</volume>
          , pages
          <fpage>458</fpage>
          -
          <lpage>467</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Jameson L Toole</surname>
            ,
            <given-names>Michael</given-names>
          </string-name>
          <string-name>
            <surname>Ulm</surname>
          </string-name>
          ,
          <string-name>
            <surname>Marta C Gonza</surname>
          </string-name>
          <article-title>´lez, and</article-title>
          <string-name>
            <given-names>Dietmar</given-names>
            <surname>Bauer</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Inferring land use from mobile phone activity</article-title>
          .
          <source>In SIGKDD International Workshop on Urban Computing</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Yao</surname>
            <given-names>Yao</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Xia</given-names>
            <surname>Li</surname>
          </string-name>
          , Xiaoping Liu, Penghua Liu, Zhaotang Liang, Jinbao Zhang, and
          <string-name>
            <given-names>Ke</given-names>
            <surname>Mai</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Sensing spatial distribution of urban land use by integrating points-of-interest and google word2vec model</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          ,
          <volume>31</volume>
          (
          <issue>4</issue>
          ):
          <fpage>825</fpage>
          -
          <lpage>848</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Jing</given-names>
            <surname>Yuan</surname>
          </string-name>
          , Yu Zheng, and
          <string-name>
            <given-names>Xing</given-names>
            <surname>Xie</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Discovering regions of different functions in a city using human mobility and pois</article-title>
          .
          <source>In KDD</source>
          , pages
          <fpage>186</fpage>
          -
          <lpage>194</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Xianyuan</given-names>
            <surname>Zhan</surname>
          </string-name>
          , Satish V Ukkusuri,
          <string-name>
            <given-names>and Feng</given-names>
            <surname>Zhu</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Inferring urban land use using large-scale social media check-in data</article-title>
          .
          <source>Networks and Spatial Economics</source>
          ,
          <volume>14</volume>
          (
          <issue>3-4</issue>
          ):
          <fpage>647</fpage>
          -
          <lpage>667</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>