<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Computer-assisted city touring for explorers</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gabriel Spadon</string-name>
          <email>spadon@usp.br</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jose F. Rodrigues-Jr</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Sao Paulo</institution>
          ,
          <addr-line>Sao Carlos</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The basic purpose of a map is to trace shortest paths between two locations in a city. However, this is not always what a user needs. Consider a tourist in an unknown city, he/she might want to trace routes to visit multiple landmarks while passing through the main streets of the city, possibly more than once through the same street. Such functionality is not yet available in online map services, which are prone to provide shortest paths that connect all landmarks. Rather, is of common interest a tour that puts together the most central streets (topologically speaking), minimizes the trajectory and, at the same time, passes through such landmarks. To cope with this problem, it is possible to investigate techniques of Center-Piece Subgraph, Absorbing Random Walk Centrality and Spanning Edge-Betweenness; such techniques can be used to nd induced subgraphs that optimize centrality measures for a set of referential nodes or edges, i.e. landmarks or streets. The results shall be in the form of optimized algorithms, and how to integrate them into online systems. Studies in this line can succeed if they can guarantee timely scalability at the same time that they provide algorithms that produce tours (1) considering all the known destinations; (2) including the main streets of a city; and, (3) ensuring the shortest routes.</p>
      </abstract>
      <kwd-group>
        <kwd>Complex Network</kwd>
        <kwd>Urban Structure</kwd>
        <kwd>City Tour</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Digital maps are becoming available to everyone, anywhere, through computers,
smartphones, car assistants, and electronic devices in general. The most common
use of such maps is to trace routes, or shortest paths, among di erent locations.
However, this basic use is not always what one needs. In the case of a tourist
visiting an unknown city, for instance, the user is not looking for the fastest
and/or shortest path. Rather, he/she is looking for a tour that visits multiple
landmarks (i.e. points of interest) while passing, possibly more than once,
through the main streets of the city. That is, the user not only has several
destinations, he/she desires a route that favors hotspots of the city in what is
usually called city-tour. For instance, imagine a tourist is on the east side of the
Central Park in New York and wants to go all the way down to the World Trade
Center; the fastest and shortest path is to go through the monotonous 7th avenue.
For a tourist, di erently, the best pick might be to go through Broadway, despite
its tra c and higher distance. The idea then is to nd the tour that puts together
City-Tour
the most central streets (topologically speaking), minimizes the trajectory and,
at the same time, passes through points of interest. Di erently, current systems
are more likely to provide the user with shortest paths that connect all the
points of interest. In other cases, the user does not even have destinations to
visit; he/she wants to cruise through the city, discovering it iteratively. Such
functionalities are not yet available in online map services and can be explored
through complex-network tools by using a methodological process similar to the
one presented in Figure 1.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Research Goals</title>
      <p>
        The goal is to use graph-processing techniques to compute touristic routes with,
or without, multiple destinations satisfying performance constraints. Given a
city, or a region of a city, such routes might come as the answer to two queries:
(1) to compute a multiple-destinations tour; or, (2) to compute a tour, even
if the set of destinations is empty. The second query might sound odd, but it
is not an unusual case, especially for explorers in unknown cities. Notice that,
such queries depend on the investigation of techniques based on random-walk
with restart [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], usually used to nd hub nodes of a network, but that, by
means of edge processing, might be used to nd the most signi cant edges
related to a set of nodes [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The queries also depend on techniques that identify
the edges with the highest centrality, which, in the context of maps, must be
concomitantly related to shortest paths so to re ect good routes | Hannah et
al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] performed a study that explores these properties, but not considering the
city-tour constraints.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Related Works</title>
      <p>
        Besides the aforementioned work, others touched the issue of processing digital
maps. Scellato et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] investigated how it is possible to extract the backbone
of a city using Spanning Trees based on Edge-Betweenness and Information
Centrality; they proposed a method that allows extending the comprehension of
the most important routes that a ect the city ow, retails, land-use separation,
and that impact upon collective behavior. However, the authors used a greedy
algorithm, which fails in circumstances when the topology is not adequate to the
greedy strategy, for instance, when the lengths of the streets strongly diverge in
di erent regions. Additionally, Delling et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] have focused on algorithms to
solve problems from the commercial domain; this is the case when one needs
to choose the best placement for a new store. Dibbelt, Pajor, and Wagner [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
explored a multi-modal common-route planning problem. These studies consider
the possibility of using heterogeneous transportation to go from one point to the
other (common-route problem), favoring only the criterion of shortest paths.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Suggested Methodology</title>
      <p>
        Nowadays, digital maps can be extracted from many di erent sources, but
some of them are not freely available. OpenStreetMap1 is a collaborative
street-mapping community and an open-source option for digital maps. Over
OpenStreetMap, it is possible to investigate techniques related to the questions
placed in Section 2. Such research questions can be explored by means of three
techniques, as follows:
Center-Piece Subgraph (CEPS). The Center-Piece Subgraph technique
summarizes a graph, producing an induced topology that connects a subset of
referential nodes provided as input. It follows by using random-walks with restart
and cost functions to measure the adequacy of the edges that will be part of the
resulting topology [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The summarized graph is validated by analyzing the
goodness of the graph nodes [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], such that, heuristically, the best simpli cation
is the one that contains the essential elements according to a goodness criterion
(as de ned by Equation 1), where r(q; j) is the goodness score for a given node
j considering a query set Q.
      </p>
      <p>g(CP ) =</p>
      <p>X
j 2 nodes(CP )
r(Q; j)
(1)</p>
      <p>
        The goodness criterion minimizes the sum of weights to connect all the edges
in the subset while inducing a subgraph; however, it does not consider edge
properties, leading to a subgraph with semantics related to a route in a map.
The metric was not designed to provide a minimum set of edges, as necessary in
multiple-destination routes, but only with as many edges as desired by the user.
Example. Let us consider an unknown city represented as a graph G(V; E)
which has a couple of touristic attractions T and let T 0 be the subset of those
that he/she wants to visit or, alternatively, avoid. The idea then is to extract a
subgraph G0 that confers the user a reduced network in which he/she can travel
1 www.openstreetmap.org
back and forth to visit the desired destinations. The output of CEPS, in this
particular case, shall be a subgraph indicating the easiest and most interesting
way to travel from a source place ti to a target one ti+1, where f(ti; ti+1) 2 T 0g.
Absorbing Random Walk Centrality (ARwC). The Absorbing Random
Walk Centrality technique is capable of evaluating the centrality of a subgraph
considering a set of query nodes Q [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; ARwC considers the number of steps
needed to absorb a walker that starts from a query node q 2 Q and walks to
all the other nodes C in the graph, denoted as acqQ(C). This metric is based
on the k-Arw-Centrality optimization, which is an NP-hard problem; to solve
that problem, ARwC uses a greedy approach that provides solutions with good
approximation guarantees. Equation 2 presents the metric, whose result is the
centrality of a set of nodes C with regard to a set of query nodes Q.
acQ(C) = X 1
q2Q jQj
acqQ(C)
Example. Consider an unknown city with some touristic attractions represented
as a graph. It is possible to assess the centrality of these attraction points inside
the subset T 0 using ARwC, which will provide a set of values fCtrw 8 t 2 T 0g
that represent the importance of an attraction with respect to the others.
Consequently, the result of this process is a hierarchical city view, which can: (1)
represent how critical a node is among the others; (2) quantify how drastically
he/she needs to avoid critical nodes to improve his/her mobility in a city;
and, (3) classify the touristic attraction which will receive more or fewer visits
considering its positioning.
      </p>
      <p>Spanning Edge-Betweenness (SEB). A Spanning Tree (ST) is a graph
structure that contains all the nodes connected by a subset of non-cyclic edges.
It can be produced with weighted or non-weighted edges. In the context of a
map, an accurately computed ST (not necessarily the minimum) generates a
Backbone representation of a city, which conforms to the problem of computing
a tour without a set of destinations. To this end, SEB is a centrality metric
computed by considering all the STs of a graph; it works by quantifying the
number of times that each edge pertains to an ST. The result of this metric is a
hierarchical formation of streets according to their importance.</p>
      <p>
        SEB was rstly de ned over undirected and weighted graphs to improve the
analysis of phylogenetic trees [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]; it has been shown to be a powerful tool able
to evaluate the most relevant edges of a graph that, if removed, might disrupt
the network structure [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]; notice that, the metric is not only to be used in
the analysis of phylogenetic trees but also over massive graphs with millions of
nodes [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Equation 3 formally de nes SEB, where G is the number of STs for
a graph G, and G(e) is the number of di erent STs where edge e occurs.
      </p>
      <p>G(e) =</p>
      <p>G(e)
(2)
Example. Suppose that it is desired to detect the most common and interesting
route that connects the nodes of an unknown city. That is, one wants not only the
streets with the minimum length, but also wants those that, given the network
topology, render better routes more frequently. This process can be achieved by
using SEB whenever it is possible to infer weights to the set of network edges.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Expected Results</title>
      <p>The results shall be in the form of optimized algorithms, and how to integrate
them into online systems, including how to extract and prepare the network
(maps are not necessarily represented as graphs), how to provide input to these
algorithms, and how to interpret the outputs. The results shall be validated
through extensive testing over a signi cant number of representative cities.
Studies in this line can succeed if they can process queries over a map within
seconds, and if the algorithms produce tours that (1) consider all the destinations
(if known); (2) include the main streets of a city (higher centralities); and,
(3) minimize the length of the paths (good routes). It is possible to verify
those conditions in large scale by using brute-force algorithms or multi-objective
optimization techniques; and, in small scales, by considering case studies of
known cities and their tours. Research following these lines shall pave the way for
future map processing, turning tours into a novel concept on electronic-trajectory
computing. The results have the potential to be reported in international
conferences and journals of Data Mining, Transportation Systems, and Physics.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Further Research Lines</title>
      <p>The possibilities of research and investigation are not limited to the calculation
of city-tours. It is possible, for instance, to analyze di erent sets of cities by
extracting feature vectors that describe the complex-networks of their tours.
This is because feature vectors can describe cities by detailing their topology and
structural peculiarities, which would enable further analysis based on clustering
detection, similarity search, multidimensional projection, and fractal analysis.
All of which have the potential to enhance the understanding both about cities
and tourist behaviors in the face of di erent types of landmarks that can be
found in a given city. More speci cally, by investigating feature vectors through
these tools, it is possible: (i) to analyze cities according to their similarities and
di erences; (ii) to determine characteristics shared between cities or groups of
cities; and, (iii) to disclose routing problems that are exclusive to peculiar cities.
In fact, all these activities help in the designing of the urban space, they also
help in its improvement and comprehension because similar cities might share
the valuable characteristics; meanwhile, exceptional cities might indicate routing
problems that can be further studied through a di erent set of tools.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>We would like to thank the Brazilian agencies CNPq, FAPESP, and CAPES
that fully supported this research.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Bast</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Delling</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>Muller-</article-title>
          <string-name>
            <surname>Hannemann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pajor</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sanders</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Werneck</surname>
            ,
            <given-names>R.F.</given-names>
          </string-name>
          :
          <article-title>Route planning in transportation networks</article-title>
          . In: Kliemann,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Sanders</surname>
          </string-name>
          , P. (eds.) Algorithm Engineering, pp.
          <volume>19</volume>
          {
          <fpage>80</fpage>
          . Springer International Publishing (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Delling</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pajor</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Werneck</surname>
            ,
            <given-names>R.F.</given-names>
          </string-name>
          :
          <article-title>Customizable Route Planning in Road Networks</article-title>
          . Transportation Science (may
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Dibbelt</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pajor</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>User-Constrained Multi-Modal Route Planning</article-title>
          .
          <source>In: 2012 Proceedings of the Fourteenth Workshop on Algorithm Engineering and Experiments (ALENEX)</source>
          , pp.
          <volume>118</volume>
          {
          <fpage>129</fpage>
          .
          <string-name>
            <surname>Society</surname>
          </string-name>
          for Industrial &amp; Applied
          <string-name>
            <surname>Mathematics</surname>
          </string-name>
          (SIAM) (jan
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Mavroforakis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Lebron</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koutis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Terzi</surname>
          </string-name>
          , E.:
          <article-title>Spanning edge centrality</article-title>
          .
          <source>In: Proceedings of the 24th International Conference on World Wide Web - WWW'15</source>
          . pp.
          <volume>732</volume>
          {
          <fpage>742</fpage>
          . ACM Press (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Mavroforakis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mathioudakis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gionis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Absorbing random-walk centrality: Theory and algorithms</article-title>
          .
          <source>In: 2015 IEEE International Conference on Data Mining</source>
          . pp.
          <volume>901</volume>
          {
          <fpage>906</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (nov
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Rodrigues-Jr</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>J.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Traina</surname>
            ,
            <given-names>A.J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Traina</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faloutsos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Large Graph Analysis in the GMine System</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ),
          <volume>106</volume>
          {118 (jan
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Rodrigues-Jr</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Traina</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faloutsos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leskovec</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>GMine: A System for Scalable, Interactive Graph Visualization and Mining</article-title>
          . In: VLDB. ACM, Seul, South Corea (jun
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Scellato</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cardillo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Latora</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Porta</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The backbone of a city</article-title>
          .
          <source>The European Physical Journal B - Condensed Matter and Complex Systems</source>
          <volume>50</volume>
          (
          <issue>1-2</issue>
          ),
          <volume>221</volume>
          {225 (feb
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Teixeira</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monteiro</surname>
            ,
            <given-names>P.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carrico</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramirez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Francisco</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          :
          <article-title>Spanning edge betweenness</article-title>
          . In: Workshop on Mining and
          <article-title>Learning with Graphs</article-title>
          . vol.
          <volume>24</volume>
          , pp.
          <volume>27</volume>
          {
          <issue>31</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Teixeira</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>F.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Francisco</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          :
          <article-title>Spanning Edge Betweenness in Practice</article-title>
          .
          <source>In: Studies in Computational Intelligence</source>
          , pp.
          <volume>3</volume>
          {
          <fpage>10</fpage>
          . Springer Nature (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faloutsos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Center-piece subgraphs</article-title>
          .
          <source>In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD'06</source>
          . pp.
          <volume>404</volume>
          {
          <fpage>413</fpage>
          . KDD '06, ACM Press (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faloutsos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>yu</surname>
            <given-names>Pan</given-names>
          </string-name>
          , J.:
          <article-title>Fast random walk with restart and its applications</article-title>
          .
          <source>In: Sixth International Conference on Data Mining (ICDM'06)</source>
          . pp.
          <volume>613</volume>
          {
          <fpage>622</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (dec
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>