<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Geographical Service: a compass for the Web of Data.</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Gianluca Correndo</institution>
          ,
          <addr-line>Manuel Salvadores, Yang Yang, Nicholas Gibbins, Nigel Shadbolt Intelligence, Agents</addr-line>
          ,
          <institution>Multimedia (IAM) Group School of Electronics and Computer Science Southampton</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <volume>27</volume>
      <issue>2010</issue>
      <abstract>
        <p>This paper describes a Linked Data service that supports the navigation and retrieval of geographical entities for the UK territory. Geographical entities, in the extent of this paper, are linked data resources that describe objects that have a geographical extension. The service presented in this paper allows the querying of resources that contain or are contained by a given entity URI. The recent publication of UK Public Sector Information (PSI) data sets has brought to the attention of the community the redundant presence of location based context. At the same time it stresses the inadequacy of current Linked Data services for exploiting the semantics of such contextual dimensions for easing entity retrieval and browsing. We present an approach for a geography based service that helps in querying qualitative spatial relations for the UK geography (proper containment so far). We also provide an exploitation scenario based on a backlinking service and PSI Open Linked Data, published within the EnAKTing project.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>geographical reasoning</kwd>
        <kwd>Web of Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Categories and Subject Descriptors</title>
      <p>H.3.4 [Systems and Software]: Distributed systems; H.5.4
[Web]: Navigation; H.3.5 [Online Information Services]:
Web-based services</p>
      <sec id="sec-1-1">
        <title>Linked Data, geographical services</title>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>1. INTRODUCTION</title>
      <p>
        The Linked Data Initiative represents the rst
collaborative e ort to create a Web of Data (WoD henceforth) of
considerable scale, providing few, simple guidelines for
publishing content using well established standards [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Such
guidelines and standards are leading the way to a new paradigm of
interaction between government and citizens in the UK. In
order to pursue better access for citizens to information held
by local as well as national public organisations, the UK
government has recently launched1 a public initiative for
pub1Public access to the site http://data.gov.uk has been
granted the 19th of January, 2010.
lishing Public Sector Information (PSI), adopting Linked
Data tenets as future best practices. Data sets recently
delivered to the public include: government expenses, NHS
trusts' performances, public transportation, and a whole set
of statistics about crime, mortality, census, environment,
school and social indicators. Some of the data sets
mentioned have been published already in Linked Data format,
others have been translated within the EnAKTing project,
and many others are waiting to be freed in the LOD cloud.
      </p>
      <p>Such a proli c in ow of Linked Data poses new questions
and challenges to the community of researchers and
developers: how is it possible to integrate such di erent information
into a meaningful schema? How is it possible to exploit the
little semantics that goes a long way? How do we
choreograph the publishing activity of separate organizations from
the public sector? A common trait of PSI seems to be its
locality: local and national public organisations are in fact
mainly concerned with the collection of data about their
territory, and the distribution of their resources.</p>
      <p>
        In the WoD vision, links between resources from di
erent publishers are particularly important since they are the
ones that allow new data to be discovered and integrated
into the current discourse. It is frequently the case that
di erent URIs are used to refer to the same things,
motivating the use of co-reference services for the resolution of
instance equivalences. Knowledge of this type of
relationship increases the potential for reuse since information from
previously unknown sources is now accessible, and makes the
problem of co-reference resolution of primary importance [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
In any case, we can expect more and more of this linking
data to be made available as the number of Linked Data
publishers increases.
      </p>
      <p>The publication of an authoritative geography of the UK,
(its regions, counties, districts and their connections) by
Ordnance Survey (the national mapping agency for Great
Britain, OS henceforth) as Linked Data, has opened
interesting scenarios for exploiting semantics in contextualising
the information sources published on data.gov.uk. The
geographical dimensions in PSI data sets are already
represented, but their semantics may be lost if they are not
exploited for creating new collections of data, browsing related
resources, and making connections.</p>
      <p>In this paper, we present a service for querying spatial
relationships for the UK (extensible to other countries when
authoritative knowledge bases are available). We start in
Section 2 where the available knowledge bases are described
along with an introduction of the qualitative spatial
reasoning supported. Section 3 provides a rationale for the
developing of such service in support of Linked Data
browsing and retrieval. In Section 4 the implementation of the
geographical service and its APIs are described. The paper
then concludes with a description of an evaluation of the
presented service using public sector information from the
UK government in Section 5 and some concluding notes in
Section 6.</p>
    </sec>
    <sec id="sec-3">
      <title>BACKGROUND</title>
      <p>
        The World Wide Web and the WoD can both be
understood as hypertext systems, where the general purpose of
the hypertext system is for information discovery by
navigation. Providing reasoning over hyperlinks for the purpose
of navigation can bene t information discovery. In 1990,
Nanard brought the concept of \semantic network" from
Arti cial Intelligence [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] into the hypertext eld by creating
a Conceptual Hypertext System [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], in which a hyperlink
can be reasoned by using a domain model classi cation. In
the above system, typed links and typed chunks are used
to de ne relationship between types in order to incorporate
knowledge into a hypertext. This domain model classi
cation is used to classify the documents and documents that
share metadata, and which are deemed to be similar in some
way. The Conceptual Open Hypermedia Service (COHSE)
project [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] later took this approach forward by providing
ontological reasoning based on links of services to bridge the
navigation gap between the Web and Linked Data, where
the link services provided a mapping between concepts and
the lexical labels on the web page.
      </p>
      <p>
        Many of the PSI data sets published so far can be plotted
within a spatial and temporal dimension, in other words, all
data can be linked together by its spatial and temporal
indexes. Within this context, the need to provide services
to reason the spatial and temporal aspects of the linked
data is of key importance. This is unsurprising, the
spatial and temporal reasoning have always been considered
to be an important part of common-sense reasoning in
Arti cial Intelligence. In this section, we will mainly focus
on qualitative spatial representation and reasoning. There
are two major approaches to qualitative spatial
representation - point based and region based [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Region based
approaches, such as Topology [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] which describe
relationships between spatial regions are more intuitive than point
based approaches. The commonly known approaches for
formalizing topological properties of spatial regions are based
on work from Whitehead [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and Clarke [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] who
axiomatized mereotopologies (a theory that combines mereology
and topology) using a single primitive relation and binary
connectivity relationships. By using these primitive
relations, other relations can be de ned. The Region
Connection Calculus (RCC8) proposed by Randell, Cui and
Cohn[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] de nes a set of jointly exhaustive and pairwise
disjoint relations DC, EC, PO, EQ, TPP, NTPP, TPPi an
NTPPi, as illustrated in Figure 1, and is the most
wellknown approach in the domain. Since the RCC Calculus is
expressed in rst-order predicate calculus, a wide range of
theorem provers can be used for reasoning. For instance,
Given a xed vocabulary of relations, Ri, given R1(x,y) and
R2(y,z), one can answer questions about the possible
relations (from the set Ri) that can hold between x and z by
looking up the composition table [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Although general
1storder theorem proving is too ine cient to be useful for many
purposes [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], it is relatively simple to implement and
particularly useful in our case for reasoning are the geographic
location relationships.
      </p>
      <p>Within the Linked Data context, there are several
services providing resolvable URIs for geographic locations.
Geo-names2 for example, is a community based service that
provides geographical representation of geographical entities
covering all countries worldwide and manages eight million
URIs for geographical resources. As a further example, the
national mapping agency of Great Britain, Ordnance
Survey, maintains a continuously updated database of the
topography of Great Britain3 and is responsible for surveying
the boundaries of the administrative areas.</p>
      <p>
        In this paper, we exploited the Administrative
Geography ontology provided by Ordnance Survey as an
authoritative knowledge base for querying the UK geographical
structure [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Such ontology explicitly represents the
mereological relationships within the administrative hierarchy, as
well as topologically representing the boundary information
between administrative units at the same hierarchical level.
The following depicts the class hierarchy created in the
administrative ontology from Ordnance Survey:
      </p>
      <p>CivilAdministrativeArea
{ EuropeanRegion
{ Country
{ UnitaryAuthority
{ MetropolitanDistrict
{ GreaterLondonAuthority
{ LondonBorough
{ District
{ CivilParish
{ Community</p>
      <sec id="sec-3-1">
        <title>Country</title>
        <p>The topological relations adopted by this ontology were
taken from the RCC8 and correspond to the properties
NTPPi, TPPi, EC and EQ respectively. The topology of
administrative geography of Great Britain contains no overlapping
regions, therefore, the PO relation was not required. Later
version of the ontology reported overlapping entities as well.
The property of spatial containment used in the OS
ontology (equivalent to the NTTP(i) and TTP(i) relations in
Figure 1), implies a mereological relationship. For instance, if
Hampshire spatially contains Fareham, then Fareham is a
part of Hampshire.</p>
        <sec id="sec-3-1-1">
          <title>2http://www.geonames.org last accessed 10/02/2010</title>
          <p>3With the exception of Northern Ireland that is covered by
a di erent agency, the Land and Property Services Northern
Ireland.</p>
          <p>
            Dereferenceable URIs adopted by the Linked Data
community inherit the same properties of hyperlinks in the Web
hypertext system, which is (among others) uni-directionality.
The problem of such kind of links is that it is not possible
to navigate back to the original resource by using
dereferenciation mechanism only. This problem becomes even more
relevant when URIs from previous authoritative data sets
are reused in order to provide context and meaning to new
data. It is in fact possible to browse from the new data to
the old one, but not the other way around. The back-linking
service4 we have implemented for UK public sector
information supports the discovery of back-links between datasets.
The bene t of a back-linking service is that it enables users
to discover, from a single dataset, other datasets which
reference back to it, creating therefore data linkage
opportunities between datasets, increasing the recall of valuable data
sources, and doubling the network e ect [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] that increases
even more when co-reference systems are employed.
          </p>
          <p>
            In this paper, we will mainly focus on exploring the
possibility of exploiting semantics from authoritative knowledge
bases to provide support for consuming Linked Data
resources. The service provided will allow users to retrieve
contained (and container) entity URIs from popular data
sets by exploiting a co-reference service. Moreover, a
backlinking service which we previously created in the
EnAKTing project5, will allow us to retrieve the information
resources that addressed such URIs. Far from trying to
provide a general purpose reasoner for geographical entities, the
aim of the service described in the following sections is to
exploit the semantically rich knowledge base for UK
geography in order to ease users' navigation through the published
PSI data sets. Similar capabilities were already provided by
DBpedia Mobile [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ], an application that retrieved DBpedia
entries mashed up on a map based on users' geographical
coordinates. The results provided by our service although
are based on a spatial subdivision of the territory,
subdivision that is already used by public sector organizations to
classify their data (e.g. crime statistics are based on a police
based subdivision of the territory, while MPs activities are
related to the constituency they were voted in).
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>MOTIVATION</title>
      <p>
        The Linked Data principles [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] promote a Web of Data
whose architecture is inherently decentralised, relying on
data already published (when available) in order to give
semantics and context to new data. The growth the WoD
has experienced over recent years relies on the simplicity
of publishing and linking data. However, up to now a
semantically coherent orchestration of data publishing is still
a mirage. Nevertheless, relying purely on data linkage for
the discovery and browsing of linked data resources would
lead to a serious knot to untie in the near future. The use
of ontologies and powerful ontology languages in publishing
Linked Data will be an e ort that must be justi ed against a
scenario where such explicit semantics are rarely exploited.
      </p>
      <p>In publishing UK Public Sector Information (UK PSI),
we have identi ed an issue concerning data accessibility and
navigability that addresses in particular the missing
exploitation of semantics (in this case about qualitative spatial
description of geographical entities). In this paper we present</p>
      <sec id="sec-4-1">
        <title>4http://backlinks.psi.enakting.org</title>
      </sec>
      <sec id="sec-4-2">
        <title>5http://enakting.org</title>
        <p>a solution to overcome such issue that soundly enhance data
retrieval and browsing when geographical dimensions are
involved.</p>
        <p>
          The issue is about the usage of geographical entities for
contextualising local information (i.e. information that are
related to a particular geographical location, for example
the population of a region, the MPs of a constituency, or
various statistical data based on territory). In publishing
this kind of information, we provided alignments of our data
(at least for the geographical dimensions represented in the
data) to authoritative knowledge bases using co-reference
systems [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. The problem we have to deal with originates
with the fact that, since the public sector information
published was originated by di erent sectors of UK government,
the kind of spatial classi cations used were highly
heterogeneous, ranging from local parishes to counties and up to
European regions (e.g. South East of England). The di
erent granularities used to classify the data means, in Linked
Data terms, that related information sources link to di
erent URIs. Some data may be in fact relevant for
constituencies, while others may use a di erent granularity (by county
for example), and the URI of a county is obviously di
erent from the set of URIs of all its constituencies. Available
knowledge bases about the geographical or administrative
subdivision of a territory can be exploited to cover such gap
in data granularity.
        </p>
        <p>http://mortality.psi.enakting.org
mortality:ds_1_299_1
mortality:ds_1_299_1
mortality:ds_1_299_1
scovo:dimension</p>
        <p>mortality:Hampshire
http://crime.psi.enakting.org</p>
        <p>scovo:dimension
crime:ds1_37_1
crime:ds1_37_1
crime:ds1_37_1
crime:ds1_37_1
crime:ds1_37_1
http://parliament.psi.enakting.org</p>
        <p>dc:coverage
parliament:member/10395
dc:coverage
dc:coverage
parliament:member/101
parliament:member/11884
.
.</p>
        <p>Taking as an example the PSI data sets published
recently6, we adopted the Ordnance Survey administrative
ontology in order to provide context to our data items (i.e.
SCOVO items instances7 and local governmental data). The
SCOVO ontology allows us to describe statistical data as a
collection of Items where each item describes a statistical
value (i.e. a single cell in a multidimensional table) along
with all the dimensions that characterise it. In the case of
UK PSI statistics, many data sets collected were related to
geographical regions (counties, districts, etc.)</p>
        <p>In this case, users who wished to discover useful
information about their own region (e.g. the County of Hampshire,
top Figure 2) would start their searching activity by
browsing one of its available URIs. The OS URI for such
geographical entity would be os:70000000000177658, but any
equivalent URI provided by a co-reference system will
provide the same results as will be described in the following.
Using a backlinking service for resolving the entities
linking to the given URI for Hampshire, we are able to retrieve
links to mortality statistics (mortality:ds1_299_[1...3]9)
and crime statistics (crime:ds1_37_[1...11]10). In Figure
2 those URIs are contained in boxes labelled as \accessible",
meaning that those URIs are retrievable following back
already existent arcs. Those SCOVO data sets' items address
in fact Hampshire county as one of their dimensions. What
is missing is the further data collected that reports valuable
information about regions contained in Hampshire. In
particular, within the EnAKTing project, we published linked
data about the singular constituencies too. In detail we
published, for each of constituency, an historical record of the
MP in charge for that constituency, his/her voting records
and expenses. In Figure 2 those resources are contained in
dotted boxes labelled as \inaccessible", meaning that they
cannot be retrieved with the existent knowledge.</p>
        <p>Example URIs for such inaccessible resources are11:
parliament:cons-637 rdfs:label "Winchester"
parliament:cons-203 rdfs:label "Eastleigh"
parliament:cons-228 rdfs:label "Fareham"</p>
        <p>The URIs for, respectively: Winchester, Eastleigh, and
Fareham, are therefore not retrieved by the resolution of
the Hampshire URI (obviously) or by the additional service
provided from the backlinking service.</p>
        <p>Despite the fact that an entity is still semantically di
erent from the parts that compose it, the information relevant
for all its constituting parts can still be relevant for the
entity as a whole. Without covering such geographical gap it
is not possible to access all the relevant sets of information,
provide them to the user or process them in some way in
order to summarise their content.</p>
        <p>The aim of this research is to exploit authoritative
knowledge bases in order to cover such gaps, allowing therefore
citizens to retrieve information resources relevant to their
region of interest. Moreover, there are many data sources
that describes geographical resources, and all of those are
already partially aligned. The integration of di erent
knowledge bases could lead to the possible exploitation of such
alignments in order to bridge data sets and reuse the
available knowledge in more than one context.
4.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>GEOGRAPHICAL SERVICE FOR UK</title>
      <p>To support the user's experience in browsing and
discovery of new resources in the WoD, we have developed a
geographical service for querying the UK territory structure.
The decision to restrict the service to the UK territory is
mainly due to the fact that the service is mainly used in
order to support the discovery of UK PSI resources.
Knowledge about geographical containment is exploited here to
link information that is contextually related because of their
spatial dimension.</p>
      <p>For this use case we have implemented a service for
querying the topological structure of UK (from the broader entity
to the more particular and the other way around) that can
be easily integrated into a web of linked data. The service,
accessible at http://geoservice.psi.enakting.org is
designed in order to be easily integrated both into web
applications and in linked data resources and it follows few basic
principles:
Lightweight Service : The service should be easy to use
and resolve a speci c problem. A geographical
service is a component of the WoD that supports
discovery when geographical entities are involved, it is not a
general purpose reasoning engine.</p>
      <p>Linked Data Compatible : The geographical service
should be used as a resolvable URI like any other resource,
in order to be used in linked data content as a
useful provider of relevant URIs. Moreover the service
should provide the results in a number of di erent
formats that will be decided using content negotiation
and HTTP 303 redirection.</p>
      <p>Co-reference Support : The service should exploit the
already available knowledge about instance equivalence
provided by co-reference systems12 in order to return
results useful in a number of di erent data sets.
4.1</p>
    </sec>
    <sec id="sec-6">
      <title>Data collection and normalisation</title>
      <p>OS provides an ontology13 and an RDF dump about
spatial relations between UK regions. The triples from OS
have been parsed and only the relation of physical
containments have been retained, normalised and completed with
the inverse relations in a separate knowledge base. The
service presented here, for the sake of simplicity and e
ciency14, manages only the NTPP, and the relative inverse,
the NTPPi relations. The knowledge extracted from the OS
data set has been then normalised in terms of an internal
ontology that represent qualitative spatial relations.</p>
      <p>The normalisation step has been introduced in order to
allow the service to integrate further geographical
hierarchies in the future (e.g. geonames provides containment of
6http://browser.psi.enakting.org
7http://purl.org/NET/scovo
8PREFIX os:&lt;http://data.ordnancesurvey.co.uk&gt; 12Like http://sameas.org
9PREFIX mortality:&lt;http://mortality.psi.enakting.org/ 13http://www.ordnancesurvey.co.uk/oswebsite/
id/&gt; ontology/SpatialRelations/v0.2/SpatialRelations.
10PREFIX crime:&lt;http://crime.psi.enakting.org/id/&gt; owl
11PREFIX parliament:&lt;http://parliament.psi.enakting.org/ 14In this way the data to manage has been drastically reduced
id/&gt; in order to provide a very focused service.
dbpedia:Hampshire
geographical features). The future integration of
qualitative spatial knowledge bases is devised in order to extend
the service outside the borders of UK and for providing an
assessment of co-references between geographical entities.</p>
      <p>A simple example of how the normalised triples from OS
ontology are used in coupling with a co-reference service for
bridging the navigational gap for di erent data sets is
depicted in Figure 3; in the gure it is possible to see that a
single statement from OS describing the fact that the County
of Hampshire contains Fareham and Winchester15:
os:7...17765 os:contains os:7...25157.
os:7...17765 os:contains os:7...25128.</p>
      <p>has been translated into an internal representation
containing both relations: part, and part of; like the following:
os:7...17765 geoservice:part os:7...25157.
os:7...25157 geoservice:part_of os:7...17765 .
os:7...17765 geoservice:part os:7...25128.
os:7...25128 geoservice:part_of os:7...17765 .</p>
      <p>The containment relations so normalised (see central part
of Figure 3) are then internally stored in the system and
queried for serving users requests.</p>
      <p>
        The normalised containment relations are integrated with
the information provided by the co-reference system that
allows to bridge di erent data sources both in the input
phase (i.e. where the input URI must be translated in
the OS equivalent, see top part of Figure 3) and the
output phase (i.e. when the results must be translated into
15OS URIs are shortened, the trail of '0' are replaced by '. . . '.
a target data set provided by the user, see bottom part of
Figure 3). The co-reference service used in this paper is
the http://sameas.org service from Glaser et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The
relevant bundles have been retrieved from the service and
cached for performance. It is important to note that, in
order to chose the wanted quality of service, one could opt for
using one co-reference service instead of another. The
functionality provided is transparent from the provenance of the
co-reference bundles.
      </p>
      <p>Exploiting co-reference services and OS ontology, it is
therefore possible to infer containment relation between
resources from di erent data sets. For example:
dbpedia:Hampshire owl:sameAs os:7...17765
AND
os:7...17765 geoservice:part os:7...25128
AND
os:7...25128 owl:sameAs dbpedia:Winchester
=)
dbpedia:Hampshire geoservice:part dbpedia:Winchester
4.2</p>
    </sec>
    <sec id="sec-7">
      <title>RESTful API</title>
      <p>The service is accessed via HTTP GET requests and
provide two essential information: the list of entities contained
the input URI, and the list of entities that contains the
input URI. The interface is then accessible via the following
URIs:
http://geoservice.psi.enakting.org/{command}/
{dictionary}/{format}/{URI}</p>
      <p>In the above API description, the parameters are enclosed
in brackets and their meaning is the following:
dbpedia:Hampshire
geoservice
5.</p>
      <p>
        1.
command: can be either contains or container: in the
rst case it returns the URIs of the entities contained
by the input URI; in the second case it returns the
URIs of the entities that contains the input URI.
dictionary: can be one of the followings (dbpedia, os,
statistics, geonames, enakting, opencyc,
openlylocal, or none) and instructs the service to use the
co-reference system in order to retrieve the equivalent
URIs in the respective data sets (i.e. DBpedia [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
Ordnance Survey [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], UK National Statistics16,
Geonames17, PSI enAKTing18, OpenCYC [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], Openly
Local project19). The value none is used for not applying
any lter. In this case the URIs returned will be the
ones from the Ordnance Survey plus the ones returned
from the co-reference service.
format: the format parameter is optional and can be one
of the followings (rdf, text, ttl, or json). The value
of the format parameter decide then the format of
the returned content: RDF/XML for rdf ; list of URIs
separated by new lines for text; RDF/Turtle for
turtle; and nally JSON20 for json. If the parameter
is not given the right content is decided using the
303 HTTP redirection. Even for the content requests
Accept:text/html done using the browser, the client
is redirected to the HTML page of the service
initialised with the input URI.
      </p>
      <p>URI: is the URI of the input entity to query using the
service. The service uses a co-reference system in order to
nd the equivalent URI for the Ordnance Survey and
the Geonames data set. This means that the user can
16http://statistics.data.gov.uk last accessed 10/02/10
17http://geonames.org last accessed 10/02/10
18http://browser.psi.enakting.org last accessed
10/02/10
19Community devoted to provide linked data access for
local government data, see http://openlylocal.com last
accessed 10/02/10
20http://json.org last accessed 10/02/10
use one of the data set of preference (e.g. DBpedia or
Geonames) and ask for contained, or container,
entities in one of the desired target data set (e.g. again
DBpedia, Geonames, or enAKTing published
information).</p>
      <p>The service returns a list of URIs if the content type is
text or json. The RDF content, for both rdf and turtle,
describes the containment relations between the input URI
and the resulting resources. In both cases the returned URIs
are translated into the desired address space.</p>
      <p>The procedure followed by the service, and an overall
architecture, is depicted in Figure 4, and can be describe as
follows:
1. user generated request (HTTP GET request)
2. normalisation of the input URI to OS
3. computation of the property closure (i.e. part or
partof ) over the normalised URI
4. optional phase of translation and ltering of the
resulting URIs to the target URI space
5. formatted content, as per user request, returned to the
user (HTTP Response)</p>
      <p>As an example, consider the case of a software client
who needs to know all the geographical entities contained
in the Hampshire, the request can adopt as an input one
of many available URIs describing the Hampshire county,
a popular choice could be the DBpedia URI (i.e. URI =
http://dbpedia.org/resource/Hampshire). The agent can
then explicit the desired target data set, for example the
DBpedia data set itself (i.e. dictionary = dbpedia), and
instruct the server to return the JSON format of the document
(i.e. HTTP header contains Accept:application/json).
The HTTP request will be then the following:</p>
      <p>GET /contains/dbpedia/http://dbpedia.org/</p>
      <p>resource/Hampshire
Host: geoservice.psi.enakting.org</p>
      <p>Accept: application/json
And the service will return a response redirecting the client
to the right URL:
HTTP/1.1 302 Found
Location: http://geoservice.psi.enakting.org/
contains/dbpedia/json/http://dbpedia.org/
resource/Hampshire
That, once resolved, will nally return the desired content,
a JSON array of strings that represents the URI of the
DBpedia resource describing entities contained in Hampshire:
HTTP/1.1 200 OK
Content-Type: application/json
["http://dbpedia.org/resource/North_East
_Hampshire_%28UK_Parliament_constituency%29",
"http://dbpedia.org/resource/East_Hampshire
_%28UK_Parliament_constituency%29", ...</p>
      <p>The client agent can obviously immediately refer to the
right URL and retrieve the content in the right format straight
away. A useful way to exploit such service can be seen when
data sets other than OS one are queried. Not every data
set in fact provides a clear semantic representation about
mereological relations. This is due to the fact that the focus
of many data set is to provide information about a
particular region: encyclopaedic information from DBpedia,
statistics information from the UK National Statistics,
geographical features from Geonames21, conceptual description from
Open CYC, local government information from Openly
Local, and UK PSI from EnAKTing.</p>
      <p>Using the service presented in this paper is easy to
exploit the OS administrative ontology in order to retrieve
geographically relevant information regardless from the
starting data set. As an example, let us consider the case where
a user may want to retrieve information about local
government of its own city, for example about Southampton, UK.
The easiest thing to do is to start from a recognizable URI
such as the DBpedia ones:
http://dbpedia.org/resource/Southampton</p>
      <p>From this URI the user can retrieve general information
about the city, even the names about some of the city
leaders. No further information is available on the Southampton
DBpedia page about local government information. Asking
the geographical service to return the contained entities from
the Openly Local site we can then retrieve more resources:</p>
      <p>Those URIs are the ones published for each one of the
wards present in the city of Southampton and provides not
only the names of the local councillors but also some other
statistics about the ward (i.e. demographics and religious
statistics). Moreover, asking again the service for the
DBpedia URIs we are able to retrieve the followings:
21Geonames provides a containment relation that does not
however re ect any administrative subdivision
http://dbpedia.org/resource/Southampton_Test_
%28UK_Parliament_constituency%29
http://dbpedia.org/resource/Southampton_Itchen_
%28UK_Parliament_constituency%29</p>
      <p>From those URIs we are then able to check then the
identities of the MPs in charge (in the Southampton page from
DBpedia their are mentioned both as leaders of the city
whereas an MP is actually in charge only to its constituency
where s/he has been elected. Asking then for the URIs from
the data sets provided by the EnAKTing project we would
be able to retrieve the followings:
http://parliament.psi.enakting.org/id/cons-536
http://parliament.psi.enakting.org/id/cons-535</p>
      <p>Following such links the user would be able then to
retrieve other information about the MPs from each
constituency (even retrieving an historical record of them) and further
information about their political activity.
5.</p>
    </sec>
    <sec id="sec-8">
      <title>EVALUATION</title>
      <p>We have evaluated our geographical service from two
different perspectives. The rst one looks at the direct
bene t that our backlinking service for Public Sector
Information22 would gain from expanding its navigability through
geographic containments (see Section 5.1). The second
evaluation is more analytic and looks at the new knowledge
generated as part of the translation process from an
authoritative geographic closure to the covered vocabularies (see
Section 5.2) .
5.1</p>
    </sec>
    <sec id="sec-9">
      <title>Backlinking Service Integration</title>
      <p>This section studies the navigability improvement that
our backlinking service for the PSI in the UK would
experiment by plugging the containments from a wide range
of vocabularies.The PSI Backlinking Service provides an
access point to retrieve backlinks from Foreign URIs. Foreign
URIs make data discovery di cult because it is not possible
to navigate the RDF documents of the WoD bidirectionally.
http://backlinks.psi.enakting.org provides an API to
retrieve collections of backlinks for a given URI. The study
of the covered knowledge bases23 in the UK PSI Backlinking
Service made explicit that one of the most highly connected
data sets in the PSI WoD are the ones representing some
type of geographic information.</p>
      <p>In this evaluation we have used the Backlinking Service
as a client of the Geographical Service in order to expand
the backlinks that we can get from geographic resources. We
have kept the decentralization nature of the Backlinking and
Geographical services and basically the Backlinking Service
performs HTTP requests to get the geographic containments
(see Figure 5). When the geography extension is enabled
the backlinking service gets the list of contained entities for
the input URI and returns the backlinks connected to any
URI part of containments. The request to the Geography
Service is performed using \contains" as command JSON as
format and \none" as dictionary. The selected dictionary is
\none" because the Backlinking Service doesn't know before
22http://backlinks.psi.enakting.org last accessed
10/02/10
23http://backlinks.psi.enakting.org#KBs last accessed
10/02/10
HTTP GET http://backlinks.psi.enakting.org/resource/URI?geo=enabled
http://backlinks.psi.enakting.org</p>
      <p>(RESTFulAPI)
HTTP GET http://geoservice.psi.enakting.org/contains/none/json/URI
http://geoservice.psi.enakting.org
(RESTFulAPI)
Co-reference
http://sameAs.org
for URI' in geoPartonomyBundle:</p>
      <p>BackLinks += GetBackLinks(URI')</p>
      <p>Backlinks
Knowledge Base
(4store)
hand what type of URIs will be the source of backlinks for
a certain geographic region. So as to improve the coverage
we aim to get all the possible containments from all the
dictionaries supported in the geographical service.</p>
      <p>There is a natural outcome from this integration and it can
be shown using how the systems works when asking for
backlinks connected the URI dbpedia:Hampshire. Prior to the use
of the geographical extension a request to retrieve backlinks
for dbpedia:Hampshire would just give back 14 URIs related
the UK region of dbpedia:Hampshire or any equivalent URI
part of the same co-reference bundle in sameAs.org (see
Figure 6). This same request when the geographical service is
integrated returns the following additional backlinks:
6 010 resources that represent schools from http://
education.data.gov.uk. These RDF documents
represents the totality of education entities in the region
of Hampshire.
42 mortality statistical resources from http://mortali
ty.psi.enakting.org. This statistics are segmented
by geography and gender.
981 CO2 emission measurements from http://co2emis
sion.psi.enakting.org. These resources represent
the CO2 emissions for the region of Hampshire
between 2005 and 2007.
300 resources with information of energy consumption
from http://energy.psi.enakting.org. This data
sets publishes the energy consumption in the UK in
respect to fuel in the road network between 2005 and
2007. These results represent all the RDF documents
linked to geographical regions contained in Hampshire.
4 788 population census information segmented by age
and sex from http://population.psi.enakting.org.
224 parliamentary identities from http://parliament.
psi.enakting.org. These represent mandates for
different members of the UK Parliament and House of
Commons.</p>
      <p>Figure 6 shows the output of the backlinking service with
and without geographical extensions in the Backlinking
Service. All the resources enumerated above are not speci cally
linked to dbpedia:Hampshire or equivalent URIs but to
geographic containments of it in at least one of the data sets
covered by the Geographical Service.</p>
      <p>This scenario has shown one of possible scenarios where
the exploitation of explicit semantic can improved the
accessibility of the resources in the Web of Data. In esence
the backlinking service is improving its graph connectivity
by being aware of the new layer of Linked Data that the
Geographical Service publishes via its RESTFul API. This
case study also shows how di erent Linked Data RESTFul
services (such as co-reference, backlinking and geographical
services) can cooperate in a layer built on top of current
Web of Data to improve its navigability.
5.2</p>
    </sec>
    <sec id="sec-10">
      <title>Vocabulary Closure Coverage</title>
      <p>The geographical service can be seen as an extra layer of
linked data based on an initial geographic closure provided
by Ordnance Survey and its extensions to other data sets via
co-references. This extra layer of linked data is obviously an
added value to the Web of Data. This section analyses the
interlinking improvement between the data sets by means of
number of triples produced by the Geographical Service.</p>
      <p>Table 1 represents the amount of triples generated by our
service in terms of number of triples that contain where the
predicate is geoservice:part or geoservice:part of. This
table shows the numbers of triples linking every pair of data
set in the system. For instance our Geographical Service has
produced 30995 geographic containments between dbpedia
and mortality.psi.enakting.org.</p>
      <p>Of particular interest are the results from the geonames
data set. In fact, the number of containment relations within
such dataset is quite small compared to the number of
containment relations provided by geonames itself (a rough
estimate done by the authors counts about 9K relations). Such
additional source of spatial knowledge open a scenario where
the two knowledge bases can be compared and integrated for
providing a better recall for the service. An important
aspect to take into account in such a scenario would be the
quality of the results computed by the integration.</p>
      <p>The data seed that triggered this new knowledge is the OS
to OS containments, 60M of statements. The total number
of triples generated are 223M and these are partially
interlinking every pair of data sets. Partially because the
completeness of every pair of datasets' closure relies on the
accuracy of the co-reference bundles extracted from sameAs.org.
As the number of co-references from sameAs.org grows and
improves its accuracy the Geographical Service will re ect
those changes automatically. This side e ects is one of key
aspects of the Web of Data and its decentralized nature.
6.</p>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSIONS</title>
      <p>We have presented in this paper a service that helps users
in browsing geographical resources from di erent datasets
(dbpedia, geonames, data.gov.uk. psi.enakting.org, . . . ) by
exploiting an authoritative ontology for the UK territory
(Ordnance Survey). One of the novel aspects of this research
is the use of a co-reference system (http://sameas.org) to
extend the containments from one geographic data set to
others where such containments are not so rich or
complete. Moreover, the added value of integrating such
geographical service with a backlinking service has been shown
with respect to demonstrate a possible exploitation scenario
on Public Sector Information. Due to the particular
naBacklinks Geographical Service Integration Disabled
http://backlinks.psi.enakting.org/resource/doc/http://dbpedia.org/resource/Hampshire</p>
      <p>Backlinks Geographical Service Integration Enabled
ture of the knowledge provided (i.e. closure of
geographical containment properties), there is the possibility of
overwhelming the user with information when asking about top
level features (e.g. England). In order to cope with this
eventuality, the service will be provided soon with the
capability to limit the results by depth. Therefore, when
asked about all the entities contained in the top level
feature England at the rst level of depth, the service will
return only: North East, North West, South East, Eastern,
South West, East Midlands, West Midlands, Yorkshire &amp;
the Humber, Scotland, Wales, London (di erent from the
City of London).</p>
      <p>Another important aspect not tackled in this work, and
subject of future research, is the temporal extent of
administrative divisions. The version of administrative geography
of UK will change shortly and has changed frequently during
the years (e.g. the number and borders of constituencies are
reviewed every 10 or 15 years). New entities can be de ned,
old ones can be abolished or change status. For example
Southampton, once part of Hampshire, became a Unitary
Authority on the 1st of April 1997. Since then,
Southampton has been administratively detached from the county of
Hampshire (i.e. not contained any more), although being
still part of it as a ceremonial county. Versioning of
information resources is an hot topic in Linked Data community
and it is even more important when publishing Public
Sector Information, whose content and validity must be put
into context.</p>
      <p>The research work reported here tackles an important
aspect of Linked Data, the exploitation of explicit
semantic content for enhancing resource retrieval and
browsability. The choice to tackle geographical knowledge rather
than some other data facet is mainly due to the analysis
of the available data sources, their structure and the
available knowledge exploitable for a better integration of the
available information.</p>
      <p>The use of co-reference systems allowed us to exploit the
knowledge created in one organization (Ordnance Survey in
this case) in di erent, and potentially novel, data
collections, overlapping a qualitative spatial dimension that was
not present before. Such reuse of knowledge is potentially
innovative but poses many questions about the management
of the quality of the knowledge and the entity alignments
used. The presence, integration, and comparison of di erent
geographical knowledge bases can be bene cial for the
maintenance and discovery of entity alignments of good quality.</p>
      <p>Another interesting aspect related to the use of co-reference
services integrated with an additional knowledge source is
the ability to exploit the data semantics in order to change
the navigability of the datasets. Such change in the
navigability is clear when new arcs are provided within the same
data set (e.g. between dbpedia resource where they were not
linked before) or between resources belonging to di erent
data sets (see Table 1 for a complete account of the data
sets connected).</p>
    </sec>
    <sec id="sec-12">
      <title>ACKNOWLEDGEMENTS</title>
      <p>This work was supported by the EnAKTing project funded
by the Engineering and Physical Sciences Research Council
under contract EP/G008493/1.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          , G. Kobilarov,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ives</surname>
          </string-name>
          .
          <article-title>Dbpedia: A nucleus for a web of open data</article-title>
          .
          <source>in 6th International Semantic Web Conference</source>
          , Busan, Korea, pages
          <volume>11</volume>
          {
          <fpage>15</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Becker</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          .
          <article-title>DBpedia mobile: A location-enabled linked data browser</article-title>
          .
          <source>In 1st Workshop about Linked Data on the Web (LDOW2008)</source>
          ,
          <year>April 2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          .
          <article-title>Design issues: Linked data</article-title>
          . http://www.w3.org/DesignIssues/LinkedData.html,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Carr</surname>
          </string-name>
          , W. Hall,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bechhofer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          .
          <article-title>Conceptual linking: ontology-based open hypermedia</article-title>
          .
          <source>In WWW '01: Proceedings of the 10th international conference on World Wide Web</source>
          , pages
          <volume>334</volume>
          {
          <fpage>342</fpage>
          , New York, NY, USA,
          <year>2001</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B. L.</given-names>
            <surname>Clarke</surname>
          </string-name>
          .
          <article-title>A calculus of individuals based on \connection"</article-title>
          .
          <source>Notre Dame J. Formal Logic</source>
          ,
          <volume>22</volume>
          (
          <issue>3</issue>
          ):
          <volume>204</volume>
          {
          <fpage>218</fpage>
          ,
          <year>1981</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Cohn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bennett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gooday</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Gotts</surname>
          </string-name>
          .
          <article-title>Qualitative spatial representation and reasoning with the region connection calculus</article-title>
          .
          <source>Geoinformatica</source>
          ,
          <volume>1</volume>
          (
          <issue>3</issue>
          ):
          <volume>275</volume>
          {
          <fpage>316</fpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Egenhofer</surname>
          </string-name>
          .
          <article-title>A formal de nition of binary topological relationships</article-title>
          .
          <source>In 3rd International Conference, on Foundations of Data Organization and Algorithms (FODO)</source>
          , pages
          <fpage>457</fpage>
          {
          <fpage>472</fpage>
          , New York, NY, USA,
          <year>1989</year>
          . Springer-Verlag New York, Inc.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Freksa</surname>
          </string-name>
          .
          <article-title>Temporal reasoning based on semi-intervals</article-title>
          .
          <source>Artif</source>
          . Intell.,
          <volume>54</volume>
          (
          <issue>1-2</issue>
          ):
          <volume>199</volume>
          {
          <fpage>227</fpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Glaser</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          <article-title>Ja ri, and I. Millard. Managing co-reference on the semantic web</article-title>
          .
          <source>In WWW2009 Workshop: Linked Data on the Web (LDOW2009)</source>
          ,
          <year>April 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Goodwin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Dolbear</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Hart</surname>
          </string-name>
          .
          <article-title>Geographical linked data: The administrative geography of great britain on the semantic web</article-title>
          .
          <source>Transaction in GIS</source>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ):
          <volume>19</volume>
          {
          <fpage>30</fpage>
          ,
          <year>February 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Grzegorczyk</surname>
          </string-name>
          .
          <article-title>Undecidability of some topological theories</article-title>
          .
          <source>Fundamenta Mathematicae</source>
          ,
          <volume>38</volume>
          :
          <fpage>137</fpage>
          {
          <fpage>152</fpage>
          ,
          <year>1951</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D. B.</given-names>
            <surname>Lenat</surname>
          </string-name>
          .
          <article-title>Cyc: a large-scale investment in knowledge infrastructure</article-title>
          .
          <source>Commun. ACM</source>
          ,
          <volume>38</volume>
          (
          <issue>11</issue>
          ):
          <volume>33</volume>
          {
          <fpage>38</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Nanard</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Nanard</surname>
          </string-name>
          .
          <article-title>Using structured types to incorporate knowledge in hypertext</article-title>
          .
          <source>In HYPERTEXT '91: Proceedings of the third annual ACM conference on Hypertext</source>
          , pages
          <volume>329</volume>
          {
          <fpage>343</fpage>
          , New York, NY, USA,
          <year>1991</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Randell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Cohn</surname>
          </string-name>
          .
          <article-title>A spatial logic based on regions and connection</article-title>
          . In B. Nebel, W. Swartout, and C. Rich, editors,
          <source>Principles of Knowledge Representation and Reasoning</source>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Rohlfs</surname>
          </string-name>
          .
          <article-title>A theory of interdependent demand for a communications service</article-title>
          .
          <source>The Bell Journal of Economics and Management Science</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ):
          <volume>16</volume>
          {
          <fpage>37</fpage>
          ,
          <year>1974</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.</given-names>
            <surname>Sowa</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Borgida</surname>
          </string-name>
          .
          <article-title>Principles of semantic networks : explorations in the representation of knowledge</article-title>
          . Morgan Kaufmann,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Whitehead</surname>
          </string-name>
          . Process and Reality. The MacMillan Company, New York, NY, USA,
          <year>1929</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>