<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Spatial Knowledge and Information Canada</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1365-8816</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1080/13658810902835404</article-id>
      <title-group>
        <article-title>Correlation of Public Transit Accessibility Measures with Actual Ridership</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>SARAH BREE</string-name>
          <email>sarah.bree@usask.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>EHAB DIAB</string-name>
          <email>ehab.diab@usask.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>SCOTT BELL</string-name>
          <email>scott.bell@usask.ca</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Geography and Planning, University of Saskatchewan</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>7</volume>
      <issue>5</issue>
      <abstract>
        <p>Transit accessibility measures are important tools used by planners to understand the effects of changes to the public transit system. However, it is not clear how existing accessibility measures (models) described in the literature correlate with actual public transit ridership data. Public transit systems vary dramatically according to the regions they serve, and no single model has been identified that accurately measures accessibility across the spectrum. This paper evaluates several transit system accessibility models by correlating the accessibility metric they produce with actual ridership data, using the City of Saskatoon as a case study. The results show that frequency based models result in higher correlation than coverage based models and a distance decay function based on the distance from demand location to service location further increases the correlation. This paper provides transportation planners a better understanding of the correlation between different transit accessibility measures and actual transit ridership.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Saskatoon is continuously improving its
transit system, including the planned
introduction of a Bus Rapid Transit (BRT)
system with an estimated cost of between 90
and 150 million dollars
        <xref ref-type="bibr" rid="ref1">(City of Saskatoon,
2018)</xref>
        . Successful implementation of such
an infrastructure project requires that costs
and impacts be accurately estimated and
reported to planners, decision makers, and
citizens
        <xref ref-type="bibr" rid="ref2 ref3">(Ding 2018; Kim 2018)</xref>
        . One
method to assess planned changes to transit
systems is to develop service metric models.
These models estimate how the service
metrics change when model inputs change
(i.e. transit system configuration).
      </p>
      <p>
        Accessibility is a key measure of public
transit system performance. It refers to the
ease with which locations can be accessed
from other locations
        <xref ref-type="bibr" rid="ref10">(Morris, Dumble, &amp;
Wigan, 1979)</xref>
        . Several researchers examined
the concept of accessibility. For example,
        <xref ref-type="bibr" rid="ref12">Thill and Kim (2005)</xref>
        and
        <xref ref-type="bibr" rid="ref5">Lei (2010)</xref>
        proposed several options to calculate
accessibility based on distance to service
using gravity functions.
      </p>
      <p>
        <xref ref-type="bibr" rid="ref7">Luo and Wang (2003)</xref>
        proposed the Two
Step Floating Catchment Area model
(2SFCA) to estimate geographical
accessibility of medical services. Their
model considered supply of surrounding
services at a particular demand location,
and the total demand on the services by
surrounding locations. Subsequently,
        <xref ref-type="bibr" rid="ref9">McGrail and Humphreys (2009)</xref>
        examined
the use of the 2SFCA model in rural
Victoria, Australia, and Dai (2010)
examined the use of the 2SFCA model for
estimating access to health care in Detroit,
Michigan.
      </p>
      <p>
        <xref ref-type="bibr" rid="ref8">Luo and Qi (2009)</xref>
        proposed an enhanced
2SFCA (E2SFCA) model that applies a
distance-decay to both steps of the original
2SFCA model. They proposed discrete
weightings that change in a stepwise fashion
at defined distances. Langford et. al. (2012)
proposed a transit-enhanced E2SFCA model
for estimating geographical access into
transit systems, which is described in more
detail later in this paper. Recently, Walk
Score (Seattle WA) introduced Transit
Score to quantify local accessibility to
transit
        <xref ref-type="bibr" rid="ref13">(Walk Score, 2018)</xref>
        .
      </p>
      <p>This paper evaluates several transit system
accessibility models by correlating the
accessibility metric they produce with actual
public transit ridership data for Saskatoon.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Data and Methods</title>
      <sec id="sec-2-1">
        <title>Saskatoon Transit’s General Transit Feed</title>
        <p>Specification (GTFS) data was accessed on
June 1, 2018. This dataset includes the
location of every stop, route, departure
direction, and the time of every weekly
departure. At that time, Saskatoon Transit
operated 41 bus routes serving 1465 stops
(Figure 1), with 261,868 weekly departures.</p>
        <p>
          Population data and transit ridership data
per Dissemination Area (DA) was obtained
from the latest Statistics Canada Data
Census Population CSV per Dissemination
Area report (2016). The percentage of
riders in each DA is illustrated in Figure 2.
A transit system serves a geographical area.
The smallest geographical sub-areas for
which statistics such as population and
transit ridership are available are DAs. The
most recent data from Statistics Canada
(2016) defines 362 DAs for Saskatoon.
However, the DAs are not uniform: they
range in size from 0.022 km2 to 40.121 km2.
Some DAs are convex with externally
located centroids. This is a typical
Modifiable Areal Unit Problem (MAUP)
issue, in which results can be skewed
depending on the boundaries that are drawn
to aggregate the data
          <xref ref-type="bibr" rid="ref11">(Openshaw, 1983)</xref>
          .
To overcome the MAUP issue, a grid of
100m by 100m cells was overlayed on the
bounding box containing all Saskatoon DAs.
and intersections computed for each DA. In
many cases the grid cells were bisected by
DA boundaries. That is, while there are
many grid cells within DAs, the grid cells
along the DA boundaries are usually clipped
into smaller, non-square shapes. This
operation resulted in 21,807 grid cells, each
with an internal centroid.
        </p>
        <p>Next, 400m network-constrained buffers
were calculated around each bus stop.
Buffers that intersected a grid cell were
considered within the grid cell's catchment
area. Accessibility measures were then
computed for each grid cell. The average
value of for the grid cells within a DA was
then
used
to
compute
an
measure for the DA as a whole.
accessibility</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2 Service Models</title>
        <p>models.
model types:
Transit access depends on the level of
service. In this paper, a unit of service is
defined as a departure, on a route, in a
specific direction. In the analysis, the term
route
implies
a
route-direction
combination. Service parameters can
be
evaluated in
different ways by
different</p>
        <p>In this paper we consider five
1. Stop Model. This model counts the
total
number
of stops
within a
demand location.
2. Coverage Model. This model counts
the total number of routes serving all
stops within a demand location. The
service
provided
by each
stop is
determined
by
the
number of
different routes served.</p>
      </sec>
      <sec id="sec-2-3">
        <title>3. Frequency</title>
      </sec>
      <sec id="sec-2-4">
        <title>Model.</title>
        <p>This
model
counts the departures from all stops
within a demand location over some
time interval (e.g., 1 hour).</p>
      </sec>
      <sec id="sec-2-5">
        <title>4. Filtered</title>
      </sec>
      <sec id="sec-2-6">
        <title>Coverage</title>
        <p>Model. In this
model,</p>
        <p>departures on the same
route from stops farther away from
the demand location are filtered (i.e.,
not considered).
5. Filtered Frequency Model. In this
model departures on the same route
from
stops farther away from
the
demand location are filtered (i.e.,
not considered).</p>
        <sec id="sec-2-6-1">
          <title>Each one of the</title>
          <p>model types described
above involves service locations at different
distances from the demand location.
In
each case, a weighting
Wjk based on the
distance between</p>
          <p>
            service location j (bus
stop) and demand location
k (grid cell
centroid) can
be
applied to the
score
contributed by each stop.
            <xref ref-type="bibr" rid="ref4">Langford (2012)</xref>
            suggested a Butterworth filter given by:
  = 1⁄√1 +  ( 
⁄ 

)
with x=1, n=6 and dpass=250m. For this
paper, each of the model types described
above was run
          </p>
          <p>with and without distance
decay
weighting.</p>
          <p>Distance
decay
was
calculated with dpass=250m and the network
distance djk from the service location j to
the demand location k.
2.3</p>
        </sec>
      </sec>
      <sec id="sec-2-7">
        <title>Floating (E2SFCA)</title>
      </sec>
      <sec id="sec-2-8">
        <title>Transit</title>
      </sec>
      <sec id="sec-2-9">
        <title>Enhanced Two</title>
      </sec>
      <sec id="sec-2-10">
        <title>Catchment</title>
      </sec>
      <sec id="sec-2-11">
        <title>Area</title>
      </sec>
      <sec id="sec-2-12">
        <title>Step Model</title>
        <p>Therefore the service provided by service
location j to demand location k is denoted
Using the Filtered Frequency Model
distance
decay
weighting,
the
accessibility measure at location k is given
  =</p>
        <p>∑    
 ∈{  }
where Bjk denotes the set of filtered service
locations j that fall within demand location
k's catchment buffer. The E2SFCA
model
differs from the Filtered Frequency Model
by using a service-to-demand ratio Rjk in
place
of the
service</p>
        <p>Sjk
such
that:
  =</p>
        <p>∑    
 ∈{  }
  =   ⁄ 
where:
Dj is the demand at service location j and is
the sum of the weighed populations P of
locations k that fall within location j’s
catchment buffer:
  =</p>
        <p>∑    
 ∈{  }
In addition to the five model types described
in Section 2.2, this paper also considered
Langford’s Transit Enhanced 2SFCA model
described above. However, as shown in the
results (Section 3), Langford’s model
performed poorly. Based on that poor
performance, another model, dubbed the
E2SFCA-2 model, was also considered.
In the E2SFCA-2 model, the demand (i.e.,
the population surrounding the service
location) was treated as a potential supply of
transit riders and was used to increase the
service as shown in equation below.</p>
        <p />
        <p>=   √</p>
        <sec id="sec-2-12-1">
          <title>2.4 Walk Score’s Transit Score</title>
          <p>
            Walk Score's Transit Score is a filtered
frequency model that uses departures per
week as its service metric
            <xref ref-type="bibr" rid="ref13">(Walk Score,
2018)</xref>
            . All departures on a route are ignored
except for those from the stop located
closest to the demand location. The score is
computed for demand locations on a 500
foot grid. A distance decay, as shown in
Figure 3, is then applied to the service
scores. Distances are computed using the
road network. Once computed, a log of the
score is taken (Figure 4).
          </p>
        </sec>
      </sec>
      <sec id="sec-2-13">
        <title>2.5 Correlation with Transit Ridership</title>
        <p>The DA data from Statistics Canada includes
transit ridership estimates. Because of
widely varying DA populations, transit
ridership as a percentage of DA population
was computed and used for correlation with
the accessibility measures. The percentage
of transit users by DA ranges from 0% in
many smaller sized and less populated DAs
to a maximum of 46.8%. To protect
individual’s privacy, the transit ridership
estimates are intentionally coarse. The data
reports a population of 246,376 with 24,980
transit users. A Pearson's correlation
coefficient was computed to quantify the
relationship between ridership percentage
and each accessibility measure for
Saskatoon at the DA level.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>Each of the five models identified in Section
2.2 was run twice: once with dpass=250 m
and once with no distance decay. Z-scores
were calculated to allow visual comparison
between figures. Figures 4 to 10 show the
results with distance decay applied. Table 1
shows the correlation results for all models.
As seen in the table, the best performance
was obtained with the E2SFCA-2 (figure 10).
A Pearson's r-value of 0.431 (the best
obtained result) is not an especially strong
correlation. However, it should be noted
that there is a relationship between transit
service supply and demand. While the
demand for transit is influenced by its
supply, transit supply itself is adjusted by
transit agencies in response to demand
changes over time (to provide an efficient
service). In other words, demand and transit
accessibility measures should be correlated.
In every case, a model with distance decay
resulted in a better correlation. This
confirms that distance to a bus stop is an
important factor in accessibility. However,
filtering the service when computing service
levels has a mixed result. It improved the
performance of the Frequency Model but
decreased the performance of the Coverage
Model.</p>
      <p>As expected, the E2SFCA model performed
poorly (Table 1), indicating no correlation
between the model and actual transit
ridership. Therefore, the E2SFCA was rerun
with a modification that is labeled
E2SFCA2 in Table 1 and it resulted in the best
performance of all the models considered.
The choice of √ was arbitrary and future
work is required to determine how to best
consider population demand in scenarios
such as urban transit systems.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>This paper examined accessibility models by
correlating their scores with actual ridership
data. Accessibility scores were computed
for 362 DAs in Saskatoon and correlated
with ridership as a percentage of the DA
population using Pearson's r. In all cases,
incorporating distance decay resulted in
improved model performance. Service
frequency, as a service metric, performed
better than coverage, and a filtered
frequency model, in which all but the closest
departure locations were discarded,
improved performance even more. The
E2SFCA algorithm resulted in the worst
performance when treating population as a
demand that reduced accessibility.
Conversely, that model performed best
when modified to treat population as a
supply.</p>
      <p>It should be noted that many factors beyond
physical accessibility have an impact on
transit ridership. Therefore, generating
statistical models to isolate such impacts is
the next step of this research. Although
transit system accessibility may be good in
low population locations (e.g., centers of
work, study, and employment), the transit
ridership data collected by Statistics Canada
links the users to their home DAs.
Therefore, perhaps using origin and
destination data could be recommended for
future research.</p>
      <p>Based on preliminary results, investing in
higher frequency service rather than
expanded coverage might result in greater
transit ridership gains per dollar spent.
Saskatoon’s proposed BRT system
prioritizes frequency over coverage.</p>
      <p>Future work based on the results of this
paper include treating population as a
supply rather than a demand. Factors such
as the catchment buffer size and distance
decay parameters could also be varied. A
model tuned to maximize performance for a
particular transit system could be used to
predict ridership changes in that system
when the system is reconfigured, such as
Saskatoon’s proposed BRT configuration.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>The authors would like to thank the Interact
Team (https://teaminteract.ca) for
inspiration and funding; and Michael Bree
for help with Python.
Dai,</p>
      <p>Dajun. 2010. “Black Residential
Segregation, Disparities in Spatial Access to
Health Care Facilities, and Late-Stage
Breast Cancer Diagnosis in Metropolitan
Detroit”. Health and Place 16 (5). ISSN:
1353-8292.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>City of Saskatoon</surname>
            , and
            <given-names>Robert</given-names>
          </string-name>
          <string-name>
            <surname>Dudiak</surname>
          </string-name>
          .
          <year>2018</year>
          . “Transit Plan”
          <source>Technical Report. City of Saskatoon</source>
          . https://www.saskatoon.ca/engage/transitplan
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Ding</surname>
            , Jishiyu, Yi Zhang, and
            <given-names>Li</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>“A New Accessibility Measure of Bus Transit Networks”</article-title>
          .
          <source>IET Intelligent Transport Systems</source>
          <volume>12</volume>
          (
          <issue>7</issue>
          ):
          <fpage>682</fpage>
          -
          <lpage>688</lpage>
          . ISSN:
          <fpage>1751</fpage>
          -
          <lpage>956X</lpage>
          . DOI:
          <volume>10</volume>
          .1049/iet-its.
          <year>2017</year>
          .0286
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Kim</surname>
            , Hyun, and
            <given-names>Yena</given-names>
          </string-name>
          <string-name>
            <surname>Song</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>“An Integrated Measure of Accessibility and Reliability of Mass Transit Systems”</article-title>
          .
          <source>Transportation</source>
          <volume>45</volume>
          , no.
          <issue>4</issue>
          (
          <issue>July</issue>
          ):
          <fpage>1075</fpage>
          -
          <lpage>1100</lpage>
          . ISSN:
          <fpage>1572</fpage>
          -
          <lpage>9435</lpage>
          . DOI:
          <volume>10</volume>
          .1007/s11116-018- 9866-7 https://doi.org/10.1007/s11116-018-9866-7
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Langford</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fry</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Higgs</surname>
          </string-name>
          .
          <year>2012</year>
          . “
          <article-title>Measuring Transit System Accessibility using a Modified Two-Step Floating Catchment Technique”</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          <volume>26</volume>
          (
          <issue>2</issue>
          ):
          <fpage>193</fpage>
          -
          <lpage>214</lpage>
          . DOI:
          <volume>10</volume>
          .1080/13658816.
          <year>2011</year>
          .
          <volume>574140</volume>
          https://doi.org/10.1080/13658816.
          <year>2011</year>
          .
          <volume>574</volume>
          140
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Lei</surname>
            ,
            <given-names>T. L.</given-names>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Church</surname>
          </string-name>
          .
          <year>2010</year>
          . “
          <article-title>Mapping Transit-Based Access: Integrating GIS,</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Litman</surname>
          </string-name>
          , Todd.
          <year>2012</year>
          . “
          <article-title>Evaluating Public Transit Benefits and Costs: Best Practices Guidebook”</article-title>
          . http://library.usask.ca/scripts/remote?URL =http://www.deslibris.ca/ID/230688
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2003</year>
          . “
          <article-title>Measures of Spatial Accessibility to Health Care in a GIS Environment: Synthesis and a case study in the Chicago region”</article-title>
          .
          <source>Environment and Planning B: Planning and Design</source>
          <volume>30</volume>
          (
          <issue>6</issue>
          ):
          <fpage>865</fpage>
          -
          <lpage>884</lpage>
          . ISSN:
          <volume>02658135</volume>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Luo</surname>
            , Wei, and
            <given-names>Yi</given-names>
          </string-name>
          <string-name>
            <surname>Qi</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>“An Enhanced TwoStep Floating Catchment Area (E2SFCA) Method for Measuring Spatial Accessibility to Primary Care Physicians”</article-title>
          .
          <source>Health and Place</source>
          <volume>15</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1100</fpage>
          -
          <lpage>1107</lpage>
          . issn:
          <fpage>1353</fpage>
          -
          <lpage>8292</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>McGrail</surname>
          </string-name>
          ,
          <string-name>
            <surname>Matthew</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>and John S.</given-names>
            <surname>Humphreys</surname>
          </string-name>
          .
          <year>2009</year>
          . “
          <article-title>Measuring Spatial Accessibility to Primary Care in Rural Areas: Improving the Effectiveness of the Two-Step Floating Catchment Area Method”</article-title>
          .
          <source>Applied Geography</source>
          <volume>29</volume>
          (
          <issue>4</issue>
          ):
          <fpage>533</fpage>
          -
          <lpage>541</lpage>
          . ISSN:
          <fpage>0143</fpage>
          -
          <lpage>6228</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Morris</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumble</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Wigan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>1979</year>
          . “
          <article-title>Accessibility indicators for transport planning”</article-title>
          .
          <source>Transportation Research Part A: General</source>
          ,
          <volume>13</volume>
          (
          <issue>2</issue>
          ),
          <fpage>91</fpage>
          -
          <lpage>109</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Openshaw</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <year>1983</year>
          .
          <article-title>The Modifiable Areal Unit Problem</article-title>
          .
          <source>Norwick: Geo Books. ISBN 0860941345. OCLC 12052482</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Thill</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jean-Claude</surname>
            , and
            <given-names>Marim</given-names>
          </string-name>
          <string-name>
            <surname>Kim</surname>
          </string-name>
          .
          <year>2005</year>
          . “
          <article-title>Trip Making, Induced Travel Demand, and Accessibility”</article-title>
          .
          <source>Journal of Geographical Systems</source>
          <volume>7</volume>
          (
          <issue>2</issue>
          ):
          <fpage>229</fpage>
          -
          <lpage>248</lpage>
          . ISSN:
          <fpage>1435</fpage>
          -
          <lpage>5930</lpage>
          .DOI:
          <volume>10</volume>
          .1007/s10109-005-0158-3
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Walkscore</surname>
          </string-name>
          .
          <year>2018</year>
          . “
          <article-title>Walkscore's Transit Score Methodology”</article-title>
          . https://www.walkscore.com/transit-scoremethodology.shtml.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>