=Paper=
{{Paper
|id=None
|storemode=property
|title=GetThere: A Rural Passenger Information System Utilising Linked Data & Citizen Sensing
|pdfUrl=https://ceur-ws.org/Vol-1035/iswc2013_demo_22.pdf
|volume=Vol-1035
|dblpUrl=https://dblp.org/rec/conf/semweb/CorsarEBMPN13
}}
==GetThere: A Rural Passenger Information System Utilising Linked Data & Citizen Sensing==
GetThere: A Rural Passenger Information System Utilising Linked Data & Citizen Sensing David Corsar, Peter Edwards, Chris Baillie, Milan Markovic, Konstantinos Papangelis, and John Nelson dot.rural Digital Economy Hub, University of Aberdeen, Aberdeen, UK {dcorsar,p.edwards,c.baillie,m.markovic,k.papangelis,j.d.nelson}@abdn. ac.uk http://www.dotrural.ac.uk Abstract. This demo paper describes a real-time passenger information system based on citizen sensing and linked data. Keywords: provenance, data quality, citizen sensing, linked data, se- mantic infrastructure, transport 1 Introduction Real-time passenger information (RTPI) systems provide details about public transport, allowing passengers to plan and make decisions regarding their jour- neys. Typical requirements for RTPI systems include: 1) listing available public transport services; 2) providing timetable (schedule) information for those ser- vices; 3) providing (real-time) vehicle locations; and 4) providing details of dis- ruptions. However, few RTPI systems exist in rural areas for a variety of reasons, including a lack of infrastructure for obtaining and providing real-time informa- tion [7]. As part of the Informed Rural Passenger project1 , we are developing GetThere, an RTPI system for rural areas. The GetThere system consists of a smartphone app, supported by a semantic infrastructure that integrates data from multiple sources (including users). This system has been deployed in the Scottish Borders, UK in partnership with First Group. This demonstration2 will show a typical use of the GetThere app to view timetabled and real-time vehicle locations for a selected route, contribute vehi- cle locations while making a journey, report a disruption event, and assess the quality of real-time locations with and without the presence of disruption. The demo will utilises the datasets and services shown in Fig. 1. 2 Information Ecosystem GetThere is supported by a semantic information ecosystem (Fig. 1) itself under- pinned by a series of ontologies. Semantic web and linked data technologies are 1 http://www.dotrural.ac.uk/irp 2 A video of the demo is available at http://www.gettherebus.com/iswcdemo/ used for data representation and storage within the ecosystem as they provide an effective approach for large scale data integration [4]. Further, accessing and storing data via SPARQL endpoints allows storage to be handled by technolo- gies appropriate for the characteristics of individual datasets; for example, using RDF streams or a database with a R2RML wrapper for high throughput data. Clients Android Mobile Application Web Services Timetable User Sensor Location Observation Disruption Quality Provenance Internal External Datasets Timetable Infrastructure User Observation Disruption Datasets NaPTAN Ontologies Transport Travel Transit Infrastructure LinkedGeoData User FOAF SIOC SSN Qual-O Prov-O Sensors Disruption Fig. 1. Real-time passenger information ecosystem. Details of public transport services and timetables are stored in the Timetable dataset3 and represented by the Transit ontology4 . This dataset is used by the Timetable Service to provide details of available transport services, timetable, and vehicle location information to the GetThere app. The Infrastructure dataset provides details of the road networks used by public transport vehicles. This data is extracted from openstreetmap.org, and is represented using the Infrastructure ontology5 (which defines bus route maps) and the LinkedGeoData6 ontology. NaPTAN7 provides details of bus stops, including their IDs and locations. The User ontology8 and dataset describe user profiles using SIOC9 and FOAF10 , a description of each user’s msobile device(s), along with details of public transport journeys made while using the GetThere app. The Observation dataset uses the Transport Sensors ontology11 which extends the W3C Seman- tic Sensor Network (SSN) ontology12 to describe observations (e.g. of vehicle occupancy level, vehicle location) obtained from users of the GetThere app. The Sensor service provides an API for storing and retrieving sensor and observation 3 Timetable information is received in the ATCO-CIF format (http://www. travelinedata.org.uk/CIF/atco-cif-spec.pdf); the RDF conversion program is available at https://github.com/dcorsar/ecosystem.timetable. 4 http://vocab.org/transit/terms/ 5 http://www.dotrural.ac.uk/irp/uploads/ontologies/infrastructure.owl 6 http://linkedgeodata.org 7 http://data.gov.uk/dataset/naptan 8 http://www.dotrural.ac.uk/irp/uploads/ontologies/user.owl 9 http://rdfs.org/sioc/spec/ 10 http://xmlns.com/foaf/spec/ 11 http://www.dotrural.ac.uk/irp/uploads/ontologies/sensors.owl 12 http://www.w3.org/2005/Incubator/ssn/ssnx/ssn descriptions expressed using the SSN ontology; the Location Observation ser- vice handles real-time locations provided by app users. The Travel Disruption ontology describes different types of disruption, based on an analysis of existing travel disruption information sources [5]. Disruption reports from app users are managed and stored by the Disruption service and dataset. Given the open nature of this data, issues such as data quality and trust naturally arise [6]. Examples range from malicious users and inaccurate devices to out-of-date information (e.g. timetables). As part of addressing these issues, the ecosystem features a service that can evaluate data quality. The quality ontology (Qual-O13 ), and its associated quality assessment service are discussed in detail elsewhere [1]. Briefly, the service is configured with a set of quality metrics encoded as SPARQL rules expressed against the relevant ontologies. These guide a SPIN reasoner [3] to perform quality assessment, producing quality scores which can be utilised by other services to filter low quality data. Our current quality metrics are focused on real-time locations, and have been developed following several deployments of the system. They include: Timeliness - timely observations are under 1 minute old; Accuracy - accurate observations have a GPS error margin of less than 25 metres; Relevance - relevant observations are no further than 500 metres from the expected route of travel; Availability - observations with a high availability score have no more than a 1 minute delay between being created on the device and published by the ecosystem. The provenance service uses the W3C Prov-O ontology14 to maintain a record of the entities, agents, and activities involved in producing data within the ecosystem. Uses of provenance include: associating users with location obser- vations generated by their mobile device, which can support detection of po- tentially malicious users; and recording dataset provenance to ensure the latest timetable information is provided to users [2]. 2.1 The GetThere Smartphone App The ecosystem has been designed to support a range of applications through the creation of relevant application services. At present we have used the ecosys- tem to support the GetThere RTPI system, which is provided via an Android smartphone app (see Fig. 2). The app invokes the web services, which execute SPARQL queries against relevant datasets, process the results, and send a re- sponse to the app. Users are presented with a list of available bus routes; after selecting a route (and direction, either inbound or outbound), vehicle locations are displayed. These locations include both estimates based on the timetable and real-time locations obtained from other users on that route (Fig. 2, left screen- shot). Bus stops along the route are also shown. The user can access timetable information for the previous and next arrivals at a particular stop. When the user boards the bus, they have the option of pressing a button to have their 13 http://sensornet.abdn.ac.uk/onts/Qual-O.ttl 14 http://www.w3.org/TR/prov-o/ Fig. 2. Screenshots of the GetThere app showing (left to right): vehicle locations; the results of invoking the quality assessment service; and creating a disruption report. location uploaded to the ecosystem every minute. The uploaded location is then used as the vehicle’s real-time location provided to other users. Users can view quality assessment results for a real-time vehicle location by tapping its icon. We are working with users to determine an appropriate visualisation of quality results. Currently each assessed dimension is shown with a colour-coded bar representing its quality score (Fig. 2, centre screenshot). Acknowledgements The research described here is supported by the award made by the RCUK Digital Economy programme to the dot.rural Digital Econ- omy Hub; award reference: EP/G066051/1 References 1. C. Baillie, E. Edwards, P. Pignotti, and D. Corsar. Short paper: Assessing the quality of semantic sensor data. In Proc. of The 6th International Workshop on Semantic Sensor Networks, page to appear, October 2013. 2. D. Corsar, P. Edwards, N. Velaga, J. Nelson, and J. Pan. Exploring provenance in a linked data ecosystem. In P. Groth and J. Frew, editors, Provenance and Annotation of Data and Processes, volume 7525 of LNCS, pages 226–228. Springer Berlin Heidelberg, 2012. 3. C. Furber and M. Hepp. Swiqa - a semantic web information quality assessment framework. In 19th European Conference on Information Systems, pages 922–933, 2011. 4. V. Lopez, S. Kotoulas, M. Sbodio, M. Stephenson, A. Gkoulalas-Divanis, and P. Aonghusa. Queriocity: A linked data platform for urban information manage- ment. In The Semantic Web – ISWC 2012, volume 7650 of LNCS, pages 148–163. Springer Berlin Heidelberg, 2012. 5. M. Markovic, P. Edwards, D. Corsar, and J. Pan. Demo: Managing the provenance of crowdsourced disruption reports. In Provenance and Annotation of Data and Processes, volume 7525 of LNCS, pages 209–213. Springer Berlin Heidelberg, 2012. 6. S. D. Ramchurn, T. D. Huynh, and N. R. Jennings. Trust in multiagent systems. The Knowledge Engineering Review, 19(1):1–25, 2004. 7. N. R. Velaga, M. Beecroft, J. D. Nelson, D. Corsar, and P. Edwards. Transport poverty meets the digital divide: accessibility and connectivity in rural communities. Journal of Transport Geography, 21(0):102 – 112, 2012.