<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GetThere: A Rural Passenger Information System Utilising Linked Data &amp; Citizen Sensing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>David Corsar</string-name>
          <email>dcorsar@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Edwards</string-name>
          <email>p.edwards@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chris Baillie</string-name>
          <email>c.baillie@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Milan Markovic</string-name>
          <email>m.markovic@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Konstantinos Papangelis</string-name>
          <email>k.papangelis@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>John Nelson</string-name>
          <email>j.d.nelson@abdn.</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>dot.rural Digital Economy Hub, University of Aberdeen</institution>
          ,
          <addr-line>Aberdeen</addr-line>
          ,
          <country>UK ac.uk</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This demo paper describes a real-time passenger information system based on citizen sensing and linked data. Real-time passenger information (RTPI) systems provide details about public transport, allowing passengers to plan and make decisions regarding their journeys. Typical requirements for RTPI systems include: 1) listing available public transport services; 2) providing timetable (schedule) information for those services; 3) providing (real-time) vehicle locations; and 4) providing details of disruptions. However, few RTPI systems exist in rural areas for a variety of reasons, including a lack of infrastructure for obtaining and providing real-time information [7]. As part of the Informed Rural Passenger project1, we are developing GetThere, an RTPI system for rural areas. The GetThere system consists of a smartphone app, supported by a semantic infrastructure that integrates data from multiple sources (including users). This system has been deployed in the Scottish Borders, UK in partnership with First Group. This demonstration2 will show a typical use of the GetThere app to view timetabled and real-time vehicle locations for a selected route, contribute vehicle locations while making a journey, report a disruption event, and assess the quality of real-time locations with and without the presence of disruption. The demo will utilises the datasets and services shown in Fig. 1.</p>
      </abstract>
      <kwd-group>
        <kwd>provenance</kwd>
        <kwd>data quality</kwd>
        <kwd>citizen sensing</kwd>
        <kwd>linked data</kwd>
        <kwd>semantic infrastructure</kwd>
        <kwd>transport</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Introduction
GetThere is supported by a semantic information ecosystem (Fig. 1) itself
underpinned by a series of ontologies. Semantic web and linked data technologies are
1 http://www.dotrural.ac.uk/irp
2 A video of the demo is available at http://www.gettherebus.com/iswcdemo/
used for data representation and storage within the ecosystem as they provide
an e ective approach for large scale data integration [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Further, accessing and
storing data via SPARQL endpoints allows storage to be handled by
technologies appropriate for the characteristics of individual datasets; for example, using
RDF streams or a database with a R2RML wrapper for high throughput data.
      </p>
      <p>Android Mobile Application
User</p>
      <p>Sensor Location Observation</p>
      <p>Disruption</p>
      <p>Quality Provenance
Infrastructure</p>
      <p>User Observation</p>
      <p>Disruption</p>
      <p>External
Datasets</p>
      <p>NaPTAN
Clients
Web Services Timetable
Internal
Datasets Timetable
Ontologies</p>
      <p>Details of public transport services and timetables are stored in the Timetable
dataset3 and represented by the Transit ontology4. This dataset is used by the
Timetable Service to provide details of available transport services, timetable,
and vehicle location information to the GetThere app. The Infrastructure dataset
provides details of the road networks used by public transport vehicles. This data
is extracted from openstreetmap.org, and is represented using the Infrastructure
ontology5 (which de nes bus route maps) and the LinkedGeoData6 ontology.
NaPTAN7 provides details of bus stops, including their IDs and locations.</p>
      <p>
        The User ontology8 and dataset describe user pro les using SIOC9 and
FOAF10, a description of each user's msobile device(s), along with details of
public transport journeys made while using the GetThere app. The Observation
dataset uses the Transport Sensors ontology11 which extends the W3C
Semantic Sensor Network (SSN) ontology12 to describe observations (e.g. of vehicle
occupancy level, vehicle location) obtained from users of the GetThere app. The
Sensor service provides an API for storing and retrieving sensor and observation
3 Timetable information is received in the ATCO-CIF format (http://www.
travelinedata.org.uk/CIF/atco-cif-spec.pdf); the RDF conversion program is
available at https://github.com/dcorsar/ecosystem.timetable.
4 http://vocab.org/transit/terms/
5 http://www.dotrural.ac.uk/irp/uploads/ontologies/infrastructure.owl
6 http://linkedgeodata.org
7 http://data.gov.uk/dataset/naptan
8 http://www.dotrural.ac.uk/irp/uploads/ontologies/user.owl
9 http://rdfs.org/sioc/spec/
10 http://xmlns.com/foaf/spec/
11 http://www.dotrural.ac.uk/irp/uploads/ontologies/sensors.owl
12 http://www.w3.org/2005/Incubator/ssn/ssnx/ssn
descriptions expressed using the SSN ontology; the Location Observation
service handles real-time locations provided by app users. The Travel Disruption
ontology describes di erent types of disruption, based on an analysis of existing
travel disruption information sources [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Disruption reports from app users are
managed and stored by the Disruption service and dataset.
      </p>
      <p>
        Given the open nature of this data, issues such as data quality and trust
naturally arise [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Examples range from malicious users and inaccurate devices
to out-of-date information (e.g. timetables). As part of addressing these issues,
the ecosystem features a service that can evaluate data quality. The quality
ontology (Qual-O13), and its associated quality assessment service are discussed
in detail elsewhere [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Brie y, the service is con gured with a set of quality
metrics encoded as SPARQL rules expressed against the relevant ontologies.
These guide a SPIN reasoner [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to perform quality assessment, producing quality
scores which can be utilised by other services to lter low quality data.
      </p>
      <p>Our current quality metrics are focused on real-time locations, and have been
developed following several deployments of the system. They include: Timeliness
- timely observations are under 1 minute old; Accuracy - accurate observations
have a GPS error margin of less than 25 metres; Relevance - relevant observations
are no further than 500 metres from the expected route of travel; Availability
observations with a high availability score have no more than a 1 minute delay
between being created on the device and published by the ecosystem.</p>
      <p>
        The provenance service uses the W3C Prov-O ontology14 to maintain a record
of the entities, agents, and activities involved in producing data within the
ecosystem. Uses of provenance include: associating users with location
observations generated by their mobile device, which can support detection of
potentially malicious users; and recording dataset provenance to ensure the latest
timetable information is provided to users [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
2.1
      </p>
      <p>The GetThere Smartphone App
The ecosystem has been designed to support a range of applications through
the creation of relevant application services. At present we have used the
ecosystem to support the GetThere RTPI system, which is provided via an Android
smartphone app (see Fig. 2). The app invokes the web services, which execute
SPARQL queries against relevant datasets, process the results, and send a
response to the app. Users are presented with a list of available bus routes; after
selecting a route (and direction, either inbound or outbound), vehicle locations
are displayed. These locations include both estimates based on the timetable and
real-time locations obtained from other users on that route (Fig. 2, left
screenshot). Bus stops along the route are also shown. The user can access timetable
information for the previous and next arrivals at a particular stop. When the
user boards the bus, they have the option of pressing a button to have their
13 http://sensornet.abdn.ac.uk/onts/Qual-O.ttl
14 http://www.w3.org/TR/prov-o/
location uploaded to the ecosystem every minute. The uploaded location is then
used as the vehicle's real-time location provided to other users.</p>
      <p>Users can view quality assessment results for a real-time vehicle location
by tapping its icon. We are working with users to determine an appropriate
visualisation of quality results. Currently each assessed dimension is shown with
a colour-coded bar representing its quality score (Fig. 2, centre screenshot).
Acknowledgements The research described here is supported by the award
made by the RCUK Digital Economy programme to the dot.rural Digital
Economy Hub; award reference: EP/G066051/1</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>C.</given-names>
            <surname>Baillie</surname>
          </string-name>
          , E. Edwards,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pignotti</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Corsar</surname>
          </string-name>
          .
          <article-title>Short paper: Assessing the quality of semantic sensor data</article-title>
          .
          <source>In Proc. of The 6th International Workshop on Semantic Sensor Networks</source>
          , page to appear,
          <year>October 2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>D.</given-names>
            <surname>Corsar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Edwards</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Velaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nelson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Pan</surname>
          </string-name>
          .
          <article-title>Exploring provenance in a linked data ecosystem</article-title>
          . In P. Groth and J. Frew, editors,
          <source>Provenance and Annotation of Data and Processes</source>
          , volume
          <volume>7525</volume>
          <source>of LNCS</source>
          , pages
          <volume>226</volume>
          {
          <fpage>228</fpage>
          . Springer Berlin Heidelberg,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>C.</given-names>
            <surname>Furber</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Hepp</surname>
          </string-name>
          .
          <article-title>Swiqa - a semantic web information quality assessment framework</article-title>
          .
          <source>In 19th European Conference on Information Systems</source>
          , pages
          <fpage>922</fpage>
          {
          <fpage>933</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>V.</given-names>
            <surname>Lopez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kotoulas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sbodio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stephenson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gkoulalas-Divanis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Aonghusa. Queriocity</surname>
          </string-name>
          :
          <article-title>A linked data platform for urban information management</article-title>
          .
          <source>In The Semantic Web { ISWC</source>
          <year>2012</year>
          , volume
          <volume>7650</volume>
          <source>of LNCS</source>
          , pages
          <volume>148</volume>
          {
          <fpage>163</fpage>
          . Springer Berlin Heidelberg,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>M.</given-names>
            <surname>Markovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Edwards</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Corsar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Pan</surname>
          </string-name>
          .
          <article-title>Demo: Managing the provenance of crowdsourced disruption reports</article-title>
          .
          <source>In Provenance and Annotation of Data and Processes</source>
          , volume
          <volume>7525</volume>
          <source>of LNCS</source>
          , pages
          <volume>209</volume>
          {
          <fpage>213</fpage>
          . Springer Berlin Heidelberg,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>S. D.</given-names>
            <surname>Ramchurn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. D.</given-names>
            <surname>Huynh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N. R.</given-names>
            <surname>Jennings</surname>
          </string-name>
          .
          <article-title>Trust in multiagent systems</article-title>
          .
          <source>The Knowledge Engineering Review</source>
          ,
          <volume>19</volume>
          (
          <issue>1</issue>
          ):1{
          <fpage>25</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>N. R.</given-names>
            <surname>Velaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Beecroft</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Nelson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Corsar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Edwards</surname>
          </string-name>
          .
          <article-title>Transport poverty meets the digital divide: accessibility and connectivity in rural communities</article-title>
          .
          <source>Journal of Transport Geography</source>
          ,
          <volume>21</volume>
          (
          <issue>0</issue>
          ):
          <volume>102</volume>
          {
          <fpage>112</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>