<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Learning Mobility Profiles: an Application to a Personalised Weather Warning System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maximilian Leodolter</string-name>
          <email>maximilian.leodolter@ait.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christian Rudloff</string-name>
          <email>christian.rudloff@ait.ac.at</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>AIT Austrian Institute of Technology GmbH, Mobility Departement</institution>
          ,
          <addr-line>Giefinggasse 2, 1210 Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>AIT Austrian Institute of Technology GmbH, Mobility Departement</institution>
          ,
          <addr-line>Giefinggasse 2, 1210 Vienna</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <fpage>110</fpage>
      <lpage>113</lpage>
      <abstract>
        <p>Learning mobility pro les of citizens can play a crucial role in many applications, including tra c demand estimation, urban planning or personalized advertising. In this paper we demonstrate a framework for building and constantly readjusting mobility pro les using smart phone data coupled with manual user input and personalised discrete choice models. The methods are applied as weather warning service supporting the daily mode choice decisions of users of the system by supplying personalised information based on their mobility pro le and current weather conditions. Since it is well known that weather conditions in uence the tra c demand and the modal split of transport modes, the framework can also further the understanding of mobility patterns and their variability due to weather or tra c events.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Discrete choice model</kwd>
        <kwd>mobility behaviour</kwd>
        <kwd>pro le learning</kwd>
        <kwd>weather warning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Learning mobility pro les of citizens can play a crucial role
in many applications, including tra c demand estimation,
urban planning or personalised advertising. Progress has
been made both in the area of mobility pro les as well as
the estimation of travel demand. For Mobility pro les this
is done by applying data analysis techniques to data sources
like mobile phone data or data collected using smart phones
(GPS, accelerometer data). In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] mobility pro les and
patterns are learned from mobile phone log data to estimate
location time distributions for the users. While the
advantage of using mobile phone data is that large samples of users
can be reached, it is di cult to attach important
information like mode choice to the pro les. First steps to wards
enriching such mobility pro les with activity data are taken
e.g. in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Some works exist on improving the quality of
information of mobility pro les created from mobile phone
data using GPS data [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        Another approach to reach mobility pro les is that of
applying mobility diaries. While the sample sizes are
considerably smaller than in the case of mobile phone data the rise
of smart phone availability makes the collection and
enrichment of the data through mode detection easier and enables
the collection of larger samples. In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] such a system for
collection and data analysis is described in some detail. While
the collection of mobility diaries becomes simpler as
technology improves, large scale mobility surveys are currently still
collected mostly through interviews. As a result, while this
data is useful to model mobility choice behaviour, for
example under the in uence of weather [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], it is usually collected
for only one day for each participant and hence presents a
snapshot of peoples' behaviour.
      </p>
      <p>To learn more about the long term behaviour of people data
needs to be collected more long term and new methodologies
for the analysis of the data need to be developed. In this
paper we demonstrate some missing steps to get from
collected long term data to mobility pro les and personalised
mode choice models. This framework can be applied to
build and constantly readjust mobility pro les of users with
data collected with smart phones (GPS-data, accelerometer
data) coupled with some manual user input. Finally, this
framework is applied as personalised weather warning
service. The system supplies personalised information based
on the mobility pro le enhanced by individualized discrete
choice models and current weather conditions to the user to
support their daily mode choice decisions. Since it is well
known that weather conditions in uence the tra c demand
and the modal split of transport modes, the framework can
also further the understanding of mobility patterns and their
variability due to weather or tra c events.</p>
      <p>The paper will rst present the data collection and the
learning algorithms for the mobility pro les in section 2 before
describing the personalisation of the mode choice models. In
section 3 the nal weather warning system will be presented.
The nal section will give some conclusions and an outlook
to future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. LEARNING THE PROFILE</title>
      <p>
        As a basis for the learning process of the mobility pro le,
GPS and other cell phone data (e.g. accelerometer data, cell
position) were collected on the users daily trips. For each
recorded trip there is a data preprocessing step before the
main learning process. This cleans the data and detects the
transport mode by transforming the collected GPS,
acceleration and cell phone signal data, following the methods of
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The following section 2.1 explains the actual data
collection part on the user side. The learning methodology will
be demonstrated in 2.2. The methodology is such that
initial mobility pro les are calculated once at least four trips
are collected and are subsequently re ned with new data
whenever trips are recorded.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2.1 Data Collection</title>
      <p>Within a three week long experiment users were asked to
record their daily trips with a smart phone application, which
was developed especially for this very experiment. The
application was easy to handle and designed to require
minimum user interaction. Four steps had to be done per trip:
(1) start the trip by opening the application and starting
the recording, (2) state the travel purposes, (3) end the trip
by opening the application again and nishing the
recording, (4) adjust the detected modes instantaneously on the
device. In between steps three and four trips were
automatically cut into uni modal stages and the mode was detected
as described above.</p>
    </sec>
    <sec id="sec-4">
      <title>2.2 Learning Algorithm</title>
      <p>The pro le learning algorithm consists of three steps; (1)
identify points of routine (POR), (2) identify routine trips
(RT) connecting the PORs, and (3) learning about the user's
travel behaviour by modelling their choice situations (see
section 2.3). On a regular basis the user pro les are re ned
by applying the learning algorithm including newly collected
records.</p>
      <sec id="sec-4-1">
        <title>2.2.1 Point of Routine, POR</title>
        <p>For every trip the GPS positions are recorded, so the
origin and destination can be de ned as the rst and last GPS
points of a trip. Consolidating these by an agglomerative
hierarchical cluster method the PORs are de ned as the
clusters' centers. Since the main emphasis is detecting travel
patterns and routine trips, only clusters containing a
minimum number of four points are considered as POR, whereas
smaller ones may build another one as soon as more trips
start or end in the same region. The size of a cluster was
de ned by the clustering height, that is the agglomeration
was stopped at a height of 500 meters.</p>
        <p>
          The cluster analysis was performed in R, applying agnes
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] with the Euclidean metric and average distance concept.
Due to the fact that users state the travel purpose these
can be assigned to PORs and, for example, the home and
work place could be located. Assuming a user starts her rst
trip per day at home, the algorithm used the location of the
home POR for adjusting the weather warning (see section
3).
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>2.2.2 Routine Trip, RT</title>
        <p>Once a pro le includes at least two PORs, the
connecting trips are aggregated to formulate routine trips. Each
RT contains information about the usual travel time, main
transport means and earliest and latest departure time for
two PORs. A trip's main transport mean is de ned as the
one chosen for the biggest distance. In this work the
following main transport means are considered: walk, bicycle,
public transport and car. Consequently for one RT up to
four alternatives can be learned. Knowing the available
alternatives for a RT is crucial for formulating:
the choice set in choice model estimation (see section
2.3), and
the weather warning service (see section 3).</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>2.3 Personalised Choice Model</title>
      <p>
        For the algorithm of personalising mode choice models we
follow the methodology of [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], applying the idea of individual
level parameters [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] it was shown that the prediction
quality of the personalised models clearly bene ts from a
base model for a large sample population. Hence, a mixed
logit approach was applied to a combined data set. The
data consisted of a large travel diary data set for the
Vienna region (see [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for details of the data set and data
preprocessing) combined with the data collected by the App.
The App data was added whenever a user travelled on a RT
with at least two possible alternative modes for that trip.
For trip i of user n, a utility to use mode m is given by
      </p>
      <p>Uni(m) = Xnim + nim where Xnim is a vector of decision
variables for using mode m for that trip, N ( ; ) are
the normally distributed parameters from the mixed logit
model with mean and covariance Matrix and is a
extreme value distributed random error.</p>
      <p>To get to the personalised models the choice situations of
the user are used by
1. draw a sample R of size R from N ( ; ), where R is
a large number (R = 25000 in our implementation)
2. calculate the personalised parameters
as</p>
      <p>R
= X (r)
r=1</p>
      <p>P (ymjUni(m); (r))
PR</p>
      <p>s=1 P (ymjUni(m); (s))
where ym is one if m is the chosen mode and zero
otherwise and P (ymjUni(m); (b)) = Pekxepx(Up(nUi(nmi()k))) , where
the sum in the denominator is over all routes observed
up to this point for the user.</p>
      <p>An example of the changes in parameter values for one user
can be seen in Figure 2. The user is an avid bike rider. As a
result the alternative speci c constant for bike rises quickly
to a value between 2:5 and 3. This raises the likelihood that
the model predicts the mode bike for that user. One can
also see that at low temperatures without rain the person is
more likely to walk than the average user and that biking
becomes more likely for higher temperatures without rain.
For some parameter values it can be seen that they stay
at constant values until a trip of that category is observed
before there are changes in that parameter value.</p>
    </sec>
    <sec id="sec-6">
      <title>3. WEATHER WARNING SERVICE</title>
      <p>For applying the learning algorithm in a real world
experiment users were provided with a smart phone application
that collects and transmits trip data and receives
personalised weather messages. The main purpose was to provide
the users with information whenever it would be helpful for
their decision process, rather than sending redundant
messages, so that the user stayed motivated to continue using
the app and recording trips. Finally, this would sharpen the
mobility pro le and result in more relevant messages to
prevent users from experiencing unpleasant weather situations
for their chosen modes, i.e. prevent them from choosing a
mode that they would not have chosen with perfect
information. Based on the personalised choice model the user's
utilities for di erent transport modes were calculated, both for
the current weather forecast and optimal weather conditions
(no rain and 15-25 degrees C). For each RT only utilities of
modes are compared, that the user has already recorded
for this RT. In case of expected behaviour change the user
was informed about the weather forecast and a favoured
transport mode choice, otherwise a generic weather message
was sent. Figure 3 depicts a personalised message, both on
the lock screen of the smart phone and within the
application. The experiment showed that the learning algorithm
performed as expected. It learned the users' PORs and RTs
as demanded and adjusted the personalised mode choice
parameters continuously. Ten users took part and recorded
on average 40 trips each. Two users left the experiment in
an early stage, so that no PORs could be recognised. For
each of the remaining eight users the home location could
be identi ed. Further the algorithm found 6 education
related PORs, 3 work PORs and one for leisure purpose. For 6
users the algorithm could learn on average 3.2 RTs, whereas
for each of these one to four alternatives (walk, bicycle, car,
public transport) with di erent main transport means were
identi ed.</p>
    </sec>
    <sec id="sec-7">
      <title>4. CONCLUSIONS AND OUTLOOK</title>
      <p>In this project algorithms were developed that create and
update a user mobility pro le, estimate personalised
modechoice models and nally provide users with weather and
mode information supporting their daily mobility decisions.
The system was tested in a eld test showing promising
results. Due to the stable nice weather during the experiment
time span of March 2015 as well as the limitations of the
underlying mobility survey data-set that was collected in late
spring, learning the weather sensitivity of the users'
mobility behaviour was challenging. However the individual level
parameters approach learning about sensitivities to weather
events can be incorporated into the models quickly after
observing such an event. Furthermore due to the short test
period, the mobility pro les lack some grade of detail.
Further steps will deal with developing the algorithm to send
weather messages dependent on the weekday. Therefor a
longer data collection and learning period is necessary.
Consequently for sharpening the pro le also older trips could be
skipped for keeping the input information for pro ling up
to date. For a more precise evaluation method user
interaction would be required. This could either be achieved by
a special survey or directly integrated in the smart phone
application.</p>
      <p>Furthermore, to improve the system, automated start and
stopping of the data collection would limit the reliance on
the user's willingness to record trips and would improve
proling and would better enable the next step of suggesting
alternative routes rather than just alternative modes. Lastly,
the integration of other data like tra c events would
improve the system further.</p>
    </sec>
    <sec id="sec-8">
      <title>5. ACKNOWLEDGMENTS</title>
      <p>This research was partially supported by the research project
Wetter-PROVET. The project was funded through the FFG
- Austrian Research Promotion Agency program Ways2Go
by the Austrian Ministry for Transport, Innovation and
Technology (BMVIT). We also thank the project partners
Fluidtime Data Services GmbH and UBIMET GmbH.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Bayir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Demirbas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Eagle</surname>
          </string-name>
          .
          <article-title>Mobility pro ler: A framework for discovering mobility pro les of cell phone users</article-title>
          .
          <source>Pervasive and Mobile Computing</source>
          ,
          <volume>6</volume>
          (
          <issue>4</issue>
          ):
          <volume>435</volume>
          {
          <fpage>454</fpage>
          ,
          <year>2010</year>
          .
          <article-title>Human Behavior in Ubiquitous Environments: Modeling of Human Mobility Patterns</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Campigotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rudlo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leodolter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Bauer</surname>
          </string-name>
          .
          <article-title>Personalized and situation-aware multimodal route recommendations: the favour algorithm</article-title>
          .
          <source>IEEE Transactions on ITS, sumbmitted</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Krennwallner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Prandtstetter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rudlo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Schneider</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Straub</surname>
          </string-name>
          .
          <article-title>Semantically enriched multi-modal routing</article-title>
          .
          <source>International Journal of Intelligent Transportation Systems Research</source>
          , pages
          <volume>1</volume>
          {
          <fpage>16</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Maechler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rousseeuw</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Struyf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hubert</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Hornik</surname>
          </string-name>
          .
          <source>cluster: Cluster Analysis Basics and Extensions</source>
          ,
          <year>2015</year>
          .
          <article-title>R package version 2.0.1 | For new features, see the 'Changelog' le (in the package source</article-title>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Nitsche</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Widhalm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Breuss</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.</surname>
          </string-name>
          <article-title>Brandle, and</article-title>
          <string-name>
            <given-names>P.</given-names>
            <surname>Maurer</surname>
          </string-name>
          .
          <article-title>Supporting large-scale travel surveys with smartphones a^AS A practical approach</article-title>
          . Transportation Research Part C: Emerging Technologies,
          <volume>43</volume>
          , Part
          <volume>2</volume>
          (
          <issue>0</issue>
          ):
          <volume>212</volume>
          {
          <fpage>221</fpage>
          ,
          <year>2014</year>
          .
          <article-title>Special Issue with Selected Papers from Transport Research Arena</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rudlo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leodolter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Auer</surname>
          </string-name>
          , W. Brog, and
          <string-name>
            <given-names>K.</given-names>
            <surname>Kehnscherper</surname>
          </string-name>
          .
          <article-title>In uence of weather on transport demand: A case study from the vienna region</article-title>
          .
          <source>Transportation Research Record: Journal of the Transportation Research Board</source>
          , To appear.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Toole</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ulm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Gonzalez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Bauer</surname>
          </string-name>
          .
          <article-title>Inferring land use from mobile phone activity</article-title>
          .
          <source>In Proceedings of the ACM SIGKDD International Workshop on Urban Computing, UrbComp '12</source>
          , pages
          <issue>1</issue>
          {
          <fpage>8</fpage>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Train</surname>
          </string-name>
          .
          <article-title>Discrete Choice Models with simulation</article-title>
          . Cambridge University Press, Cambridge, UK,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hunter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Bayen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Schechtner</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Gonzalez</surname>
          </string-name>
          .
          <source>Understanding Road Usage Patterns in Urban Areas. Sci. Rep</source>
          .,
          <volume>2</volume>
          , Dec.
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>