<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Characterizing Multi-Source Data for Efective Urban Mobility Modelling: The Case of New York City</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tonny Rutayisire</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jun Liu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Samuel Moore</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carlos Balsas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Architecture &amp; Built environment, Ulster University</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computing, Ulster University</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>To explore a more holistic view of urban human mobility, a few researchers have previously attempted to build models driven by a fusion of multi-source datasets. Yet, owing to limited understanding of the intrinsic characteristics and relationships underlying multi-source datasets, most of such interventions naively merge these datasets, producing sub-optimal prediction models. To address this issue, we propose a three-step methodological framework to capture urban mobility from an integrative point of view. The novelty of our framework is three-fold: (i) a systematic characterization of the multi-source data to leverage hidden relationships when integrating the data; (ii) integrating data with contextual information of the urban environments to improve explainability; and (iii) conforming the model building to incremental learning paradigm to adapt to changing patterns of mobility data. Using the New York City (NYC) case study, extensive analyses show salient relationships within datasets which could form a basis for optimal data fusion and subsquently improving the mobility modelling pipeline in line with the proposed three-step methodological framework.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;human mobility</kwd>
        <kwd>data fusion</kwd>
        <kwd>multi-view modelling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Human mobility patterns in urban settings are indicative of various phenomena such as travel demand,
trafic congestion, resource allocation, CO2 emission, and infectious disease spread. Therefore,
understanding such patterns is crucial to applications such as transportation management, urban planning,
resource optimization, and epidemiology among others. In recent years, the pervasive use of sensing
technologies has produced an enormous amount of human mobility data that researchers have used
to answer critical questions pertaining to when, why, and how people move from one place to the
other [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. A wide range of data-driven models and techniques have been proposed to capture urban
human mobility, based on a variety of dataset: taxi trip records [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], call detail records (CDRs) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
social media [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Despite the large number of studies on urban human mobility, majority of existing
techniques have typically been built on single-source empirical data in isolation from other mobility
lfows. Human mobility, especially in urban settings is multi-faceted, mainly due to diverse travel modes
and diferent spatial/temporal scales. It inevitably introduces a bias, against uninvolved flows, when the
capture of urban-scale mobility is single source data-driven. To explore a more holistic view of urban
mobility dynamics, a few researchers have attempted to build models and techniques driven by a fusion
of multi-source datasets. Zhang et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposed coMobile, a multi-view learning framework based
on integration of transit view and cellphone view. The approach outperforms 2 single-view models
WHERE and TRANSIT by 51% and 58% in terms of Mean Average Percetage error (MAPE), respectively.
Jiang et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] also utilized taxi GPS trajectories, smart card transaction data of subway and bus from
Beijing to model human mobility in space. It is reasonable to assume that the fusion of these mobility
datasets addresses the biases to some degree and provides a representative view of a broad spectrum of
the population. Yet, owing to limited understanding of the intrinsic characteristics and relationships of
these multi-source datasets, a considerable number of such interventions naively merge these datasets,
which end up introducing hidden drawbacks to the data fusion and subsequently producing sub-optimal
prediction models. For instance, in scenarios where space is a limitation, merging all multi-source
datasets can become an issue, yet correlations within diferent datasest can be leveraged to use part
of the dataset to infer the rest. Moreover, simplistic merging of datasets could actually worsen the
sampling bias problem where datasets with low levels of representativeness contribute equally or more,
to the data fusion. To this end, we propose a three-step methodological framework to capture urban
mobility from an integrative point of view: (i) In the first step, we leverage intrinsic characteristics
of multi-source data to learn, select, and integrate the most optimal vectors of the data for efective
urban mobility modelling; (ii) we then integrate the modelling process with prior knowledge contexts
of the domain to minimize the amount of data requirement, manage the uncertainties, and achieve
model interpretability; (iii) and in the third step, we conform the modelling process to incremental
learning paradigm where the model learns and updates by integrating new data streams as and when
they become available. For multi-view learning, we argue that the efectiveness of the model can be
enhanced when the above-mentioned aspects are incorporated together in the modeling process.
The rest of the paper is organized as follow: Section 2 summarizes the challenges and our motivation for
this research. In Section 3, we introduce our generic methodological framework for integrative human
mobility modeling. Section 4 introduces our case study area, the mobility datasets, and the approaches
we have used to characterize the multi-source datasets. The results and their discussions are presented
in Section 5, while the conclusion and future work are provided in Section 6.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Motivation</title>
      <p>
        Urban mobility is multi-faceted in nature, mainly due to diferent modes of transport. As such, we
have recently seen a growing shift from single source to multi-source data-based models for human
mobility modelling in urban settings [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] , [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. In reality, single data sources provide a limited
perspective on human mobility. For instance, taxi trip data may not represent the people who cannot
aford taxi and choose to use other means of transport such as bus, ride-sharing, and subway. On the
other hand, data from bike-sharing apps is not as representative during the winter season, and or across
unfavourable topography. Figure 1 shows the diferences in trip counts for Yellow taxi &amp; Green taxi
across NYC taxi zones. Even with the same mode of transport (but owing to diferent urban contexts),
we observe a huge divergence in the flow of mobility produced by yellow and green taxi, hitting as high
as 380,000 trips in some zones. Hence, whereas fusion of multiple data sources may provide a more
representative understanding of urban mobility patterns, simply adding together a series of features
extracted from multi-source datasets lays a sub-optimal foundation for multi-view mobility modeling.
It can’t simply be assumed that all multi-source datasets contribute equally to the mobility flow aspect
being learned. In [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] , authors highlight that the representativeness of each source largely depends on
the demographics of the service users in relation to the demographics of the local population, and this
dynamic is yet to be fully investigated. An efective approach to multi-view mobility modeling has to
capture the true spatial-temporal context of the datasets, and their respective qoutas for the fusion. We
argue that leveraging intrinsic characteristics of the datasets, coupled with existing domain knowledge
could result in an appropriate weighting mechanism when multi-source mobility datasets are being
fused. Therefore, our motivation stems from the need for a novel framework to efectively integrate and
learn multi-source data dynamically with domain knowledge for optimal multi-view mobility modeling.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Integrative Methodological Framework</title>
      <p>To optimize multi-view mobility modeling, we propose an integrative three-step methodological
framework, as shown in Figure 2. The framework combines multi-source mobility data for a more
representative picture of travel behaviours, then integrates it with existing domain-knowledge to minimize data
requirement while enhancing explainability, and finally conforms the mobility modeling to incremental
learning, adapting to ever-changing urban mobility conditions. Each component is briefly overviewed
in the following subsequent sections.</p>
      <sec id="sec-3-1">
        <title>3.1. Multi-Source Data Fusion</title>
        <p>
          In the context of multi-view learning, data fusion captures the comprehensive view of urban mobility
patterns, and also difuses the limitations that come along with relying solely on single-source data.
However, most existing works make general assumption of the multi-source data when it is being
fused. For example in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] , Zhang et al. (2014) used the transport data to simply complement modeling
primarily based on cellphone data, and later in [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], Zhang et al. (2017) adjusted and treated the two kinds
of data sources equally, as two independent views. This is bound to integrate the data in proportions
that don’t reflect the true representation of each view. Increasingly, authors in this field [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ],
acknowledge that data fusion requires a full understanding of each dataset and their eficient utilization,
which is currently still limited. In our proposed framework, the characteristic context of multi-source
data is leveraged to workout an efective weighting scheme upon which data fusion is done. This will
ensure spatial-temporal patterns are realistically and dynamically captured.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Data – Knowledge Integration</title>
        <p>
          It is widely known that data-driven models are only as good as the quantity and quality of data they are
trained on. Given issues relating to incompleteness, inconsistencies, sparsity and uncertainity, it is often
dificult to acquire large amounts of quality mobility data. However, integrating empirical data with
domain-specific knowledge often validates the data, enhancing model reliability and interpretability,
which is still an open challenge in human mobility research. To the best of our knowledge, very few
mobility modeling approaches exist in literature today, which incorporate domain-specific knowledge.
In MobTCast [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], the influence of semantic, social, and geographical contexts is incomporated with
historical data to tackle the sparsity problem, while predicting POIs. Likewise in [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], a framework
was proposed to leverage a knowledge graph to accommodate the influence of users, locations and
semantics, in next location recommendation. Meanwhile in [17], authors proposed an MaaS framework
to combine multi-source data with an understanding of travelers’ contexts to present them personalized
and explainable services based on their preferences. Challenges withstanding, their framework aims
to achieve the possibility of providing the right information to the right user with understandable
explanations. The common principal about these methods is the integration of empirical data with one
or more kinds of contextual understanding of the urban environment. Whereas existing eforts have
focused on the influence of somewhat stable-static contexts, our integrative framework goes a step
further to incorporate situational contexts which is rather a dynamically changing context.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Incremental Learning</title>
        <p>Urban mobility is highly dynamic, with context and patterns that typically change from time to time.
Conventional data-driven models, though they have shown considerable performance, they often
struggle to keep at per with evolving urban environments, where mobility patterns keep on changing
due to changing policy, transportation systems, road networks, weather conditions, and events. As
an emerging approach, Incremental Learning (IL) which enhances adaptability of models by learning
continuously from new incoming data streams, is poised to address these challenges sufered by static
models. In our framework, the knowledge-enhanced mobility model will be tuned to continuously
update on only new incoming data, as and when it becomes available, while retaining previously learned
knowledge.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Case Study: Urban Mobility in New York City</title>
      <p>In the previous section, we give an overview of the proposed integrative three-step methodological
framework. Note however that in the present work, the focus is to preliminarily use the case of New
York City (NYC) to analyze the underlying characteristics of multi-source data, setting up an optimal
foundation for efective data fusion and subsquent multi-view mobility modeling in the line of the
proposed framework.</p>
      <sec id="sec-4-1">
        <title>4.1. Study Area</title>
        <p>
          In this work, we characterize and compare mobility flows extracted from NYC’s taxi and Citi bike trip
records. NYC is the largest city in the United States, known for its fast pace, dynamism and cultural
diversity. It continues to be a global hub for finance, culture, commerce, and innovation. It covers a
total area of 784 km2 with an approximate population of 8.5 million people as of 2023. The city is
composed of five boroughs: Manhattan, Brooklyn, Queens, Bronx, and Staten Island, each with its own
unique character, and it is demarcated into 263 zones, as shown in Figure 3. The city is covered with an
extensive and a well-integrated transport system that is comprised of public transit, commuter rails,
taxis, and bicycles. A number of researchers have utilized NYC as a case study to investigate diferent
aspects of urban mobility. For example, in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] F. Miao et al. designed and evaluated the performance of
the data-driven vehicle balancing framework on four years’ long taxi trip data from NYC, in [18, 19]
authors narrowed in on mobility data from NYC in 2019 and 2020 to analyze the impact of COVID-19
on people’s mobility, whereas in [20] Dong et al. utilized anonymous mobile phone location and crash
report data in NYC to study the association of human mobility and road crashes. We also chose NYC as
a case study for this work for three main reasons; (1) open data policy/portal of NYC, (2) shape files for
NYC demarcations, and (3) NYC mobility survey report data.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Datasets</title>
        <p>In this section, the three NYC datasets used in this study, are summarized. The Yellow taxi and Green
taxi datasets are provided by the Taxi &amp; Limousine Commission of NYC, while the bike-sharing dataset
is provided by Citi bike NYC. For all datasets, each record is a distinctive trip extracted, with an origin,
destination, duration, distance and fare among other trip attributes. After pre-processing of the datasets,
a total of 10,047,135 trips were successfully extracted from the Yellow taxi, 1,080,844 trips from the
Green taxi, and 1,000,000 trips from the NYC Citi bikes. To ensure temporal compatibility, the collection
period was the same for all the three datasets, running from April 1, 2017 to April 30, 2017.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Approach</title>
        <sec id="sec-4-3-1">
          <title>4.3.1. Trip extraction</title>
          <p>We characterize multi-source datasets based on three meta-metrics: (1) correlation; and (2)
representativeness. We argue that examining and leveraging these three meta-metrics will form a basis for optimal
data fusion and subsquently improving the multi-view mobility modelling pipeline.
Firstly, we extract trips – which is the basic unit of mobility for this study. With all the three datasets
having pickup and drop-of attributes, extracting origins and destinations (OD) is simple and
straightforward. However, to reduce the size and address errors, irrelevant attributes and errant records were
intuitively eliminated right away before any form of feature engineering was done. For example, all
records with trip duration &lt; 5 minutes and/or trip distance &gt; 30 miles, were automatically dropped for
all datasets.</p>
        </sec>
        <sec id="sec-4-3-2">
          <title>4.3.2. Correlation analysis</title>
          <p>We examine mobility (travel demand) to evaluate whether there exist temporal correlation between
Yellow taxi, Green taxi, and Citi bikes trip patterns in NYC. Owing to significant diference in scale among
the datasets , we observe in Figure 4 that both Green taxi and Citi bike data temporally lag the Yellow taxi
data, but there exist similar mobility trends worth further investigation. To provide fair and meaningful
comparison, we normalize the data to capture the hourly variations as a proportion of total mobility
lfow for each dataset in Figure 5. We then use three diferent measures (Cosine-similarity, Pearson
Correlation, and Spearman Rank Correlation), to explore similarities in their temporal distribution.
These measures have been widely used in previous studies to quantify the similarity between two
vectors in a multi-dimensional space, depending on the scale and context of the data. In our context, we
opted for them because they quantify similarity in patterns regardless the diference in scale.</p>
        </sec>
        <sec id="sec-4-3-3">
          <title>4.3.3. Representation analysis</title>
          <p>Taxi and Bike-sharing mode choices provide NYC residents with certain levels of convenience, timeliness,
and flexibility. However, they co-exist and complement other modes in an integrated transport system.
It is thus crucial to examine if travel demand extracted from the three datasets reflect their actual
respective mobility activity in NYC.</p>
          <p>We perform a spatial coverage analysis to ensure that all significant zones are truly represented. For
each dataset, we compute travel demand as the proportion of total outgoing trip counts at the zone
level. We then use a choropleth map to visualize and compare the spatial distribution of these values,
for the three datasets. We then use oficial statistics from the 2017 citywide mobility survey of NYC, to
benchmark patterns derived from our datasets. Particularly we compare the trip proportions extracted
from the datasets to the relative trip proportions extracted from the 2017 citywide mobility survey of
NYC. And consequently we perform statistical tests on some aspects of the observed trips in comparison
with surveyed trips.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results &amp; Discussions</title>
      <p>This section summarizes the results obtained along with detailed discussions.</p>
      <sec id="sec-5-1">
        <title>5.1. Correlation</title>
        <p>Figure 5 depicts the temporal distribution of normalized travel demand as extracted from Yellow taxi,
Green taxi, and Cite bike trip records. Considerable relationships can be observed among the three
datasets within the time dimension. For instance, we can see a similar drop in demand from midnight
up until 5 a.m., across all the datasets. We also see a shoot-up in demand at 8 a.m. and another one at 6
p.m. To probe these relationships further, Table 1 summarizes results of three well-known measures of
similarity against pairs of the three datasets.</p>
        <p>From the table 1, we can see that travel demand across all pairs of datasets generally has a positive
correlation in the temporal space. Particularly, travel demand patterns from Yellow taxi &amp; Green taxi
exhibit a very strong alignment across time, for all the three measures utilized, whereas the correlation
between green taxi &amp; cite bikes is relatively weak but still considerably above average.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Representativeness</title>
        <p>To validate this representativeness, we statistically compare our trip data with the 2017 citywide
mobility survey of NYC. Figure 7 illustrates variations in the mode composition as extracted from the
trip data and survey data. We observe that among the three modes, Green taxi exhibits a strong fit,
when trip data is compared against survey data. This is further collaborated by the CDF distribution
of their respective trip durations as shown in Figure 8. We can observe that though Yellow taxi and
Citi bikes both have considerable similar distribution (p-value=&gt;0.05), Green taxi once again exhibit
remarkable values of Kolmogorov-Smirnov (KS) test, which demonstrating a strong representativeness
of the Green taxi records against the actual mode split as by the 2017 city-wide mobility survey of NYC.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions, Limitations &amp; Future Work</title>
      <p>In this study, we proposed a three-step methodological framework for modelling urban mobility from
an integration of multi-source data. The preliminary step of our framework involves the systematic
characterization of multi-source data as a foundation for a weight-based data fusion. Using NYC as a
case study, we characterized three mobility datasets based on two meta-metrics: (1) correlation; and (2)
representativeness. Our findings revealed that there exist strong correlation between datasets, which
can be leveraged in data fusion. For instance, we see a strong temporal correlation between Yellow
taxi &amp; Green taxi, in terms of relative travel demand, which could be used to infer missing data in one
dataset using the other. It could also be used in weighted average fusion of the data, where weights are
based on the correlation. Also revealed via our findings is the level of representativeness between the
respective mobility datasets and the actual urban scenarios. For example we see in our case study how
the Green taxi has the most alignment with actual sample data surveyed from the population in the
2017 city wide mobility survey of NYC. This could also be leveraged in the weight scheme to ensure
the most representative and reliable dataset contributes most to the data fusion. It should be noted
that this work does not attempt to examine all the three steps of the proposed framework, but rather
to give its overview, and the characterization of the multi-source data to be used as ingredients in the
subsequent steps of the framework. One of the limitations of this work however, results from the fact
that currently, we only focus on data sources from NYC. If matching multi-source data from other mega
cities like Shenzhen and Beijing was readily available, a comparison of mobility trends from diferent
urban cities against the two meta-metrics would further validate the characterization of multi-source
data. Another limitation stems from the age of data used (2017), given major mobility shifts that have
occurred with the coming Covid-19 pandemic. While very recent mobility data is available at the NYC
open data portal, for the interest of measuring representativeness of the data, we wanted to avoid a
time diference of data collections between trip records data and the mobility survey data, given that
the city-wide mobility survey data available is of 2017. In our future work, while acknowledging and
working around the above limitations, we plan to leverage the findings in this preliminary work, to
design an efective weighting scheme for the data fusion. We shall then improvise efective ways to
incorporate contextual knowledge in the fused data, and finally employ incremental learning to build
an eficient mobility prediction model in line with the proposed methodological framework for diferent
application contexts.
[17] E. Rajabi, S. Nowaczyk, S. Pashami, M. Bergquist, G. S. Ebby, S. Wajid, A knowledge-based ai
framework for mobility as a service, Sustainability 15 (2023) 2717.
[18] X. Hao, R. Jiang, J. Deng, X. Song, The impact of covid-19 on human mobility: A case study on new
york, in: 2022 IEEE International Conference on Big Data (Big Data), IEEE, 2022, pp. 4365–4374.
[19] A. A. Rajput, Q. Li, X. Gao, A. Mostafavi, Revealing critical characteristics of mobility patterns
in new york city during the onset of covid-19 pandemic, Frontiers in Built Environment 7 (2022)
654409.
[20] N. Dong, J. Zhang, X. Liu, P. Xu, Y. Wu, H. Wu, Association of human mobility with road crashes
for pandemic-ready safer mobility: A new york city case study, Accident Analysis &amp; Prevention
165 (2022) 106478.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. P. L.</given-names>
            <surname>Lau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yuen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Tuncer</surname>
          </string-name>
          , E. Wilhelm,
          <article-title>Understanding urban human mobility through crowdsensed data</article-title>
          ,
          <source>IEEE Communications Magazine</source>
          <volume>56</volume>
          (
          <year>2018</year>
          )
          <fpage>52</fpage>
          -
          <lpage>59</lpage>
          . doi:
          <volume>10</volume>
          .1109/
          <string-name>
            <surname>MCOM</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <volume>1700569</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Miao</surname>
          </string-name>
          , S. Han,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Hendawi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Khalefa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Stankovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. J.</given-names>
            <surname>Pappas</surname>
          </string-name>
          ,
          <article-title>Data-driven distributionally robust vehicle balancing using dynamic region partitions</article-title>
          ,
          <source>in: Proceedings of the 8th International Conference on Cyber-Physical Systems</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>261</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H.</given-names>
            <surname>Rong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Shafiq</surname>
          </string-name>
          , A. Liu,
          <article-title>The rich and the poor: A markov decision process approach to optimizing taxi driver revenue eficiency</article-title>
          ,
          <source>in: Proceedings of the 25th ACM international on conference on information and knowledge management</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>2329</fpage>
          -
          <lpage>2334</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Veloso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Alves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bento</surname>
          </string-name>
          ,
          <article-title>Using cdr data to understand post-pandemic mobility patterns</article-title>
          ,
          <source>in: EPIA Conference on Artificial Intelligence</source>
          , Springer,
          <year>2023</year>
          , pp.
          <fpage>438</fpage>
          -
          <lpage>449</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tsai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Williams</surname>
          </string-name>
          ,
          <article-title>Using call detail records to determine mobility patterns of diferent socio-demographic groups in the western area of sierra leone during early covid-19 crisis</article-title>
          ,
          <source>Environment and Planning B: Urban Analytics and City Science</source>
          <volume>50</volume>
          (
          <year>2023</year>
          )
          <fpage>1298</fpage>
          -
          <lpage>1312</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Spatiotemporal patterns of human mobility and its association with land use types during covid-</article-title>
          19 in new york city,
          <source>ISPRS International Journal of Geo-Information</source>
          <volume>10</volume>
          (
          <year>2021</year>
          )
          <fpage>344</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J. C.</given-names>
            <surname>Wo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Rogers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Berg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Koylu</surname>
          </string-name>
          ,
          <article-title>Recreating human mobility patterns through the lens of social media: using twitter to model the social ecology of crime</article-title>
          ,
          <source>Crime &amp; Delinquency</source>
          <volume>70</volume>
          (
          <year>2024</year>
          )
          <fpage>1943</fpage>
          -
          <lpage>1970</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Real-time human mobility modeling with multi-view learning</article-title>
          ,
          <source>ACM Transactions on Intelligent Systems and Technology (TIST) 9</source>
          (
          <issue>2017</issue>
          )
          <fpage>1</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>Human mobility in space from three modes of public transportation</article-title>
          ,
          <source>Physica A: Statistical Mechanics and its Applications</source>
          <volume>483</volume>
          (
          <year>2017</year>
          )
          <fpage>227</fpage>
          -
          <lpage>238</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>National-scale trafic model calibration in real time with multi-source incomplete data</article-title>
          ,
          <source>ACM Transactions on Cyber-Physical Systems</source>
          <volume>3</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>The characteristics of multi-source mobility datasets and how they reveal the luxury nature of social distancing in the us during the covid-19 pandemic</article-title>
          ,
          <source>International Journal of Digital Earth</source>
          <volume>14</volume>
          (
          <year>2021</year>
          )
          <fpage>424</fpage>
          -
          <lpage>442</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.-Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Modeling real-time human mobility based on mobile phone and transportation data fusion, Transportation research part C: emerging technologies 96 (</article-title>
          <year>2018</year>
          )
          <fpage>251</fpage>
          -
          <lpage>269</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , J. Huang,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , C. Xu,
          <string-name>
            <given-names>T.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <article-title>Exploring human mobility with multi-source data at extremely large metropolitan scales</article-title>
          ,
          <source>in: Proceedings of the 20th annual international conference on Mobile computing and networking</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>201</fpage>
          -
          <lpage>212</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Kong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xia</surname>
          </string-name>
          , L. Sun,
          <article-title>Urban human mobility: Data-driven modeling and prediction</article-title>
          ,
          <source>ACM SIGKDD explorations newsletter 21</source>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>H.</given-names>
            <surname>Xue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Salim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Oliver</surname>
          </string-name>
          , Mobtcast:
          <article-title>Leveraging auxiliary trajectory forecasting for human mobility prediction</article-title>
          ,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>34</volume>
          (
          <year>2021</year>
          )
          <fpage>30380</fpage>
          -
          <lpage>30391</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-L.</given-names>
            <surname>Theng</surname>
          </string-name>
          ,
          <article-title>An attentional recurrent neural network for personalized next location recommendation</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on artificial intelligence</source>
          , volume
          <volume>34</volume>
          ,
          <year>2020</year>
          , pp.
          <fpage>83</fpage>
          -
          <lpage>90</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>