<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Decentralized Publication and Consumption of Transfer Footpaths</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Harm</forename><surname>Delva</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics and Information Systems</orgName>
								<orgName type="laboratory">IDLab</orgName>
								<orgName type="institution">Ghent University -imec</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Julián</forename><surname>Andrés</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics and Information Systems</orgName>
								<orgName type="laboratory">IDLab</orgName>
								<orgName type="institution">Ghent University -imec</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Rojas</forename><surname>Meléndez</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics and Information Systems</orgName>
								<orgName type="laboratory">IDLab</orgName>
								<orgName type="institution">Ghent University -imec</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pieter</forename><surname>Colpaert</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics and Information Systems</orgName>
								<orgName type="laboratory">IDLab</orgName>
								<orgName type="institution">Ghent University -imec</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ruben</forename><surname>Verborgh</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Electronics and Information Systems</orgName>
								<orgName type="laboratory">IDLab</orgName>
								<orgName type="institution">Ghent University -imec</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Decentralized Publication and Consumption of Transfer Footpaths</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F9835B94BEC237785F2466F56D287DAE</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T19:06+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Users expect route planners that combine all modes of transportation to propose good journeys to their destination. These route planners use data from several sources such as road networks and schedule-based public transit. We focus on the link between the two; specifically, the walking distances between stops. Research in this field so far has found that computing these paths dynamically is too slow, but that computing all of them results in a quadratically scaling number of edges which is prohibitively expensive in practice. The common solution is to cluster the stops into small unconnected graphs, but this restricts the amount of walking and has a significant impact on the travel times. Moreover, clustering operates on a closed-world assumption, which makes it impractical to add additional public transit services. A decentralized publishing strategy that fixes these issues should thus (i) scale gracefully with the number of stops; (ii) support unrestricted walking; (iii) make it easy to add new services and (iv) support splitting the work among several actors. We introduce a publishing strategy that is based on the Delaunay triangulation of public transit stops, where every triangle edge corresponds to a single footpath that is precomputed. This guarantees that all stops are reachable from one another, while the number of precomputed paths increases linearly with the number of stops. Each public transit service can be processed separately, and combining several operators' can be done with a minimal amount of work. Approximating the walking distance with a path along the triangle edges overestimates the actual distance by 20% on average. Our results show that our approach is a middle-ground between completeness and practicality. It consistently overestimates the walking distances, but this seems workable since overestimating the time needed to catch a connection is arguably better than recommending an impossible journey. The estimates could still be improved by combining the great-circle distance with our approximations. Alternatively, different triangulations could be combined to create a more complete graph.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Fueled by the need for more sustainable transportation, users can pick between more forms of transport than ever before -and they expect that route planners can combine these to find the ideal journey. Those route planners integrate various kinds of data such as road networks and schedule-based public transit. In this article, we focus on the link between the two, specifically on how to walk between stops. The walking time from one stop to another is commonly referred to as a footpath <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. Computing these paths dynamically is often too slow in practice <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, yet computing them beforehand quickly becomes prohibitively expensive in practice <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b0">1,</ref><ref type="bibr" target="#b1">2]</ref>. A small country such as Belgium already contains 100,000 public transit stops, yielding 10 billion pairs of stops. These scaling issues are often avoided by only evaluating algorithms on small unconnected footpath graphs. However, this restricts the amount of walking in a journey and has a significant impact on travel times <ref type="bibr" target="#b3">[4]</ref>. On top of that, determining which stops belong to the same graph ultimately relies on a closedworld assumption; which makes it impractical to add additional public transit services. Our goal is to find a method to compute and publish these footpaths in a scalable and decentralized manner. This implies that we need not publish all the paths, but ideally the ones that are published should allow for unrestricted walking. In practice this means that every stop in the footpath graph must be reachable from every other stop if there is a path between those stops in the real world. Various actors should be able to add information to this footpath graph, and adding a new service should not require changes in existing information. These actors would need a common frame of reference to combine their efforts, which can be provided by Semantic Web technologies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related work</head><p>The publication of transfer footpaths is closely related to the publication of road networks. The Routable Tiles <ref type="bibr" target="#b5">[6]</ref> dataset was recently proposed as a scalable and costefficient way to republish the OpenStreetMap roads using Linked Data Fragments <ref type="bibr" target="#b6">[7]</ref>. The data is fragmented into hierarchical tiles, which are interconnected with hypermedia controls for automatic data discovery. This allows clients to find and download the data they need without having to ingest en entire data dump, while being significantly cheaper than a full-fledged querying API. Determining which stops have to be connected with each other can be seen as a problem in the field of computational geometry. One of the classical problems in this field is the creation of a point set triangulation. A point set triangulation of a set of points can be defined as a maximal set of non-crossing edges between the points of . Any triangulation of points has exactly edges where is the number of points on the convex hull. Triangulations are not unique however, and certain triangulations are NP-hard to compute <ref type="bibr" target="#b7">[8]</ref>. One particularly interesting triangulation, which can be computed in just time, is the Delaunay triangulation <ref type="bibr" target="#b8">[9]</ref>. This triangulation tries to maximize the minimum angle of all the triangles, effectively avoiding long and narrow triangles.</p><formula xml:id="formula_0">P P n 3n − h − 3 h O(n log n)</formula><p>The Delaunay graph, which is simply the graph representation of the triangulation, contains several other interesting graphs such as the nearest neighbor graph and the minimal spanning tree of <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11]</ref>. They are also known to be good approximations of complete graphs: the length of the shortest path between two points along Delaunay edges is at most times the Euclidean distance between them <ref type="bibr" target="#b11">[12]</ref>. We use a Delaunay triangulation of the public transit stop locations to approximate the full footpath graph, based on the insights from the related work section. The triangle edges represent which stops will be directly connected through a footpath.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Method</head><p>Our representation of a footpath differs slightly from the conventional one; we use walking distances instead of walking times to avoid making assumptions about walking speeds. A road network route planner is used to compute the walking distances between the stops. Delaunay triangulations can be generalized to any metric space, but the walking distance is not a mathematical metric because it could, in theory, lack the required symmetry property. Even on foot, you cannot always retrace your steps -either due to legal restrictions or due to physical limitations. And, equally importantly, this would bring us back to the case where an impractical number of path lengths have to be precomputed. This is why we the triangulation uses the Euclidean distance between the stops' WGS84 coordinates instead. Other metrics such as the great-circle distance can be used, but none of them have any relation to the walking distance so we opted for the most simple metric to implement. This also means that the theoretical guarantees of the Delaunay triangulation only hold for the Euclidean distance because that's the metric was used for its construction.</p><p>The paths between stops of a single operator can be calculated independently from the rest. Combining the graphs of two operators is done by comparing their individual triangulations to the triangulation of their combined stops -and computing the missing footpaths. This process is illustrated in Fig. <ref type="figure" target="#fig_0">1</ref> where the graph of a city's public transit network is merged with the one from the region around it. The combined graph is the union of each operators' graph with the additional edges that connect the two graphs. Operators typically have their own service area, which means that the vast majority of paths will be between stops of the same operator, thus making it possible for anyone to combine several operators' graphs with minimal work. The results can be shared and reused, which stands in stark contrast to the conventional approaches. The paths can be published similar to the Routable Tiles dataset so that sharing them does not come at a significant extra cost.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Results</head><p>triangulation of an additional regional operator; (iii) the edges that are needed to attach the new triangulation to the existing one and (iv) the end result.</p><p>We aim to verify two claims about our method: (i) the graph we construct is almost as good as the complete graph for route planning and (ii) footpath graphs can be merged with minimal effort. To verify the first claim we have used a graph of paths between bus stops to approximate the distances between railway stations. Edges between train stations and bus stops have also been added to make the railway stations reachable. Fig <ref type="figure">2</ref> shows that the approximation overestimates the actual distance by about 20% on average. This seems reasonable for route planners -it is better to underestimate the time needed to catch a connection than to recommend an impossible route. 2a: Linear regression plot comparing the actual distances to the approximations. The relative approximation error decreases as the distance increases. Note that the approximations are strictly overestimations due to the triangle inequality.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 2:</head><p>The relative difference between the actual and the approximated distance has a higher variance for shorter paths. Every data point in these plots is a path in between railway stations where the distance is approximated using a graph of bus stops and the footpaths between bus stops and railway stations.</p><p>To verify that combining the graphs from two operators does not require excessive work, we consider both the usual and the worst case scenario, respectively being mostly disjoint service areas and entirely overlapping service areas. We have taken the public transit operators from the Belgian regions of Flanders and Wallonia for the usual case and we revisit the case from Fig. <ref type="figure" target="#fig_0">1</ref> for the worst case, where the Brussels public transit network is added to the one from Flanders. The results in Table <ref type="table" target="#tab_0">1</ref> show that in the usual scenario merging two operators is relatively straight-forward as most work has already been done. The worst case scenario shows that it is possible that merging two operators can still require a significant amount of work, although the vast majority has already been done. Our implementation contains some additional experiments and is available on https:/ / github.com/hdelva/planner.js/tree/footpaths as a frozen branch of an experimental serverless route planner.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion and future work</head><p>This paper introduced a decentralized publishing method for transit footpaths based on Delaunay triangulations. We have shown that different actors can add information to the footpath graph, splitting the work between them. The amount of paths scales linearly with the amount of public transit stops, and every stop is reachable from every other. This last property could prove useful in its own right, as it allows for unrestricted walking unlike conventional methods. However, this comes at the cost of precision; the walking distance between stops is consistently slightly overestimated which could cause route planners to discard the most optimal journeys. Future work could focus on comparing the results from a route planner with a complete footpath graph to the one as we describe in this paper. If the difference in quality is negligible our methods could prove useful even for centralized services.</p><p>The approximations may still be improved with different kinds of point set triangulations and several triangulations could even be combined to create a more complete graph. Alternatively, combining the haversine distance and the currently approximated distance seems promising as well -although this could lead to underestimations. Another open question is how scalable this approach is when more modes of transportation are required. It seems undesirable to create separate graphs for bicycles, electric bikes, wheel chairs, etc. but they might be able to share information.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 :</head><label>1</label><figDesc>Fig. 1: Illustration of how two footpath graphs are merged. The subfigures from top to bottom, left to right: (i) an existing triangulation of a larger network; (ii) the</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>The number of paths that need to be computed to connect two operators.</figDesc><table><row><cell>Case</cell><cell>Paths of first</cell><cell>Paths of second</cell><cell>Additional required</cell></row><row><cell></cell><cell>operator</cell><cell>operator</cell><cell>paths</cell></row><row><cell>Flanders + Brussels</cell><cell>107,171</cell><cell>7,969</cell><cell>2,508</cell></row><row><cell>Flanders +</cell><cell>107,171</cell><cell>94,730</cell><cell>4,020</cell></row><row><cell>Wallonia</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Round-based public transit routing</title>
		<author>
			<persName><forename type="first">D</forename><surname>Delling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pajor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">F</forename><surname>Werneck</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transportation Science</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="page" from="591" to="604" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Connection scan algorithm</title>
		<author>
			<persName><forename type="first">J</forename><surname>Dibbelt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pajor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Strasser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wagner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Experimental Algorithmics (JEA)</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="page" from="1" to="7" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Computing multimodal journeys in practice</title>
		<author>
			<persName><forename type="first">D</forename><surname>Delling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dibbelt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pajor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wagner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">F</forename><surname>Werneck</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Symposium on Experimental Algorithms</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="260" to="271" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Public transit routing with unrestricted walking</title>
		<author>
			<persName><forename type="first">D</forename><surname>Wagner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zündorf</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">17th Workshop on Algorithmic Approaches for Transportation Modelling, Optimization, and Systems (ATMOS 2017</title>
				<imprint>
			<publisher>Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Fast Routing in Very Large Public Transportation Networks Using Transfer Patterns</title>
		<author>
			<persName><forename type="first">H</forename><surname>Bast</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Carlsson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Eigenwillig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Geisberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Harrelson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Raychev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Viger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Algorithms -ESA 2010, 18th Annual European Symposium. Proceedings, Part I</title>
				<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="290" to="301" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Republishing Open Street Map&apos;s roads as Linked Routable Tiles</title>
		<author>
			<persName><forename type="first">P</forename><surname>Colpaert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Abelshausen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Rojas Meléndez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Delva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Verborgh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th ESWC: Posters and Demos</title>
				<meeting>the 16th ESWC: Posters and Demos</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Triple Pattern Fragments: a Low-cost Knowledge Graph Interface for the Web</title>
		<author>
			<persName><forename type="first">R</forename><surname>Verborgh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vander Sande</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Hartig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Van Herwegen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>De Vocht</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>De Meester</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Haesendonck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Colpaert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Web Semantics</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="issue">38</biblScope>
			<biblScope unit="page" from="184" to="206" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Minimum-weight triangulation is NP-hard</title>
		<author>
			<persName><forename type="first">W</forename><surname>Mulzer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Rote</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the ACM (JACM)</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="page">11</biblScope>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Improving worst-case optimal Delaunay triangulation algorithms</title>
		<author>
			<persName><forename type="first">G</forename><surname>Leach</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">4th Canadian Conference on Computational Geometry</title>
				<imprint>
			<publisher>Citeseer</publisher>
			<date type="published" when="1992">1992</date>
			<biblScope unit="page">15</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Relative neighborhood graphs and their relatives</title>
		<author>
			<persName><forename type="first">J</forename><surname>Jaromczyk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">T</forename><surname>Toussaint</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the IEEE</title>
				<meeting>the IEEE</meeting>
		<imprint>
			<date type="published" when="1992">1992</date>
			<biblScope unit="volume">80</biblScope>
			<biblScope unit="page" from="1502" to="1517" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Properties of Gabriel graphs relevant to geographic variation research and the clustering of points in the plane</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">W</forename><surname>Matula</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">R</forename><surname>Sokal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Geographical analysis</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="205" to="222" />
			<date type="published" when="1980">1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Classes of graphs which approximate the complete Euclidean graph</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Keil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">A</forename><surname>Gutwin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Discrete &amp; Computational Geometry</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="13" to="28" />
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
