<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Opportunities and Challenges for using Linked Data in INSPIRE</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sven Schade</string-name>
          <email>sven.schade@jrc.ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Lutz</string-name>
          <email>michael.lutz@jrc.ec.europa.eu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>European Commission - Joined Research Centre</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this statement of interest, we investigate opportunities to use a Linked Data approach in INSPIRE (Infrastructure for Spatial Information in Europe). We briefly present two typical use cases related to INSPIRE and identity the involved components, as well as their workflow. In a subsequent step, we analyze the potential of using Linked Data for each use case. We conclude with a discussion on possible future analysis and implementation work. These specifically include the development of link types relevant for the presented INSPIRE use cases.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>Spatial Data Infrastructure</kwd>
        <kwd>INSPIRE</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>•
•
•
the feature types defined in the INSPIRE data specifications. Similar registers
are planned for other artifacts, such as code lists.</p>
      <p>INSPIRE-compliant metadata [10] on data sets which can be in any of the 23
official languages of the EU.</p>
      <p>GEneral Multilingual Environmental Thesaurus (GEMET)4 which is
mandated in the metadata Regulation as a controlled vocabulary for a keyword
describing the spatial data theme.</p>
      <p>INSPIRE network services [11], which enable the access to data and metadata
in the infrastructure.</p>
      <p>The remainder of the document is structured as follows. Possible use cases are
outlined in the next section. They illustrate possible requirements for the use of Linked
Data in the context of INSPIRE. Especially, different links types are requested.
Related issues will be discussed in the last section of this paper. Here, we also include
research questions, and provide pointers to related work.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Two INSPIRE Use Cases</title>
      <p>We selected two representative use cases from INSPIRE; data discovery (described in
more detail in [12]) and cross-theme queries. Both will be briefly described in the
following, including the involved INSPIRE components and basic workflows.
Discovery. A user wants to find a specific data set using a client to an INSPIRE
discovery service. He starts entering keyword (in a search language), for example
‘coast’, which is auto-completed by the client based on a multi-lingual controlled
vocabulary (mlCV), such as GEMET. The client expands the query based on the
relationships in the mlCV in order to find additional search terms (‘cliff’, ‘cove’, and
‘seashore’), and (optionally) some of these terms are selected by the user. Based on
selected terms, a catalogue query is generated, in which the terms’ translations in a
number of target languages are compared with the keyword elements in the metadata.</p>
      <p>Alternatively to GEMET, the FCD can be used. In this scenario, the user enters a
concept from the FCD (for example ‘shore’), and the query expansion is based on the
relationships within the FCD, or between concepts in the FCD and concepts in other
thesauri (GEMET etc.). In the former case, in order to find a match, the metadata
record for a given data set would need to include the information which INSPIRE
feature types are contained in the data set.</p>
      <p>Once data sets that match the query have been identified a corresponding download
or view network service has to be discovered. If this service is directly linked (as a
coupled resource) from the metadata of the data set, the user can ‘browse’ directly to
it (in contrast to the ‘query’ steps illustrated above). Yet the user needs to know the
name of the desired feature type or layer in order to be able to submit a valid service
request.</p>
      <sec id="sec-2-1">
        <title>3 https://inspire-registry.jrc.ec.europa.eu/ 4 http://www.eionet.europa.eu/gemet/</title>
        <p>Cross-theme queries. The INSPIRE data models contain a number of cross-theme
relationships, for example that between an address (represented as points) and the
transport link (for example a road) that forms part of the address5. Thus, there can be
relationships between objects in two (or more) spatial data sets, possibly being made
available using different INSPIRE download services. Like in this example, these
relationships cannot in all cases be derived from spatial characteristics (an address
near a cross road might not contain the road it is closest to).</p>
        <p>In the use case, a user wants to query INSPIRE data sets based on the links that
exist between them. For example, he might be interested in finding all addresses that are
associated with a specific road. For such a query, the relationships between the spatial
objects must be available and query-able, potentially across several INSPIRE
download services.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Discussion and Preliminary Conclusion</title>
      <p>When reconsidering the three main principles of Linked Data, these are already partly
provided for in INSPIRE. Earlier investigations illustrated that linking could be
equally achieved using common geospatial/SDI standards [13]; especially as the Open
Geospatial Consortium (OGC)6 just supports the http URI scheme and the OGC
naming authority provides their dictionaries in http, GML [14], and RDF encoding. In
INSPIRE, URIs as reference points could be derived from the inspireId attributes of
spatial objects or from the relevant INSPIRE registers (for example for feature types,
feature concepts or data sets). Currently, GML is specified as the default encoding for
all existing INSPIRE data specifications. However, since GML and RDF are
isomorphic [13], the data could be provided in RDF as well. The decision of using a GML or
RDF representation of geospatial data depends on the intended use. Both are
optimized for different purposes. While RDF allows for sophisticated querying and
reasoning, numerous GIS clients process GML. In general, equipping geospatial web
services with content negotiation would be beneficial in general.</p>
      <p>This suggests that implementing INSPIRE requirements using a Linked Data
approach is feasible. However, a number of open questions still remain.
Added benefit. From a general perspective, linking INSPIRE resources to existing
(third-party) data clouds, such as Linked Geodata7, is desirable. But, on which level
could we benefit from a Linked Data implementation? Certainly Linked Data would
have the benefit of enabling the use of existing Linked Data tools and the
development of rich clients. It may also be useful in reporting (SEIS8 provides an example),
where the semantics of links are well known, but is this true for SDI in general? Do
we want to introduce MIME types for each INSPIRE theme? Where would INSPIRE
5 See https://inspire-twg.jrc.ec.europa.eu/inspire-model/index.html?goto=2:3:1:7266 for the
cross-theme relationships of the theme Addresses.
6 http://www.opengeospatial.org/
7 http://linkedgeodata.org/
8 http://ec.europa.eu/environment/seis/
benefit from ontologies on top of plain RDF, which may offer easy navigation and
inference of new knowledge?
Browsing vs. Querying. As illustrated in the use cases, there are two prominent
interaction pattern with resources in INSPIRE and ways to make use of the links
between them: The ‘browse’ pattern, where a user subsequently discovers additional
facts by following links, and the ‘query’ pattern, where the user specifies clearly
(using the query) what he is interested in. While the ‘browse’ pattern is directly
supported by Linked Data, the ‘query’ pattern is at the basis of existing OGC service
specifications. It needs to be investigated how the benefits of both approaches can be
combined to achieve maximum benefit, especially for cross-resource queries.
Links. In order to implement the use cases, a number of specific link types would be
required. For the query expansion in the discovery scenario, the SKOS9 properties for
relating concepts to concepts (in the same or different SKOS) and for relating
concepts to labels can be used. But, is the SKOS vocabulary expressive enough? Which
relations would be useful in addition? Other – INSPIRE-specific – link types (for
example between metadata records and keywords, or between spatial objects
belonging to specific feature types) could be derived from the existing legal texts and
corresponding guidelines. However, there are other links which have not yet been defined
in any of the INSPIRE documents, for example the link between a metadata record on
a data set and the feature type(s) or layer(s) the data set provides. Should we develop
INSPIRE ontologies (in the Semantic Web sense)?</p>
      <p>How do we maintain links, especially in the metadata records and catalogues? How
can link ‘vocabularies’ be shared across communities? How do we provide
mechanisms to translate between such vocabularies? Especially this last point may benefit
from research activities, such as ‘ontology localization’ [15], but how exactly?
Metadata. With Linked Data, the distinction between data and metadata becomes less
pronounced. In principle, every linked information item could be considered as
metadata on the data which links to it. Thus, when applying Linked Data principles to
INSPIRE, the role of metadata (records) and discovery services as the central
repository for metadata would need to be investigated. This includes elaborations on the use
of linked metadata for data sets, data objects, layers, and services, as well as inclusion
of metadata links into delivered data sets and maps.</p>
      <p>URIs and Registers. Governance and management of identifiers is an important issue
to be addressed. When National SDIs join the Linked Data Initiative and they
generate their own URIs, will we face issues duplication? Interesting example for a strategy
on URI management can be found in [16] and [17].</p>
      <p>As discussed above, INSPIRE registers could provide some of the required URIs,
for example for feature types or code list values. Currently the INSPIRE registry is
geared towards human consumption. In order to enable their usage in a Linked Data
architecture, they should also offer information in a machine-process-able form.</p>
      <sec id="sec-3-1">
        <title>9 http://www.w3.org/TR/skos-reference/</title>
        <p>In summary, Linked Data seems to connect the Semantic Web and geospatial, and
especially SDI community closer to each other. Benefits for users seem obvious; they
can build mash-ups and applications on top of INSPIRE data more easily. For this
purpose, proper links between potential INSPIRE ontologies and data with existing
geospatial data sources may be considered It still has to be clarified if this would be
beneficial for the (legally binding) INSPIRE data sets and services. We are currently
investigating most of these questions in research projects (GENESIS10, EuroGEOSS11,
and GEO AIP312). In addition we are active in the Ecoterm initiative, which aims at
advancing provision and use of environmental terminologies. Complementary to these
‘vertical’ activities, we aim to broaden our investigations in Linked Data use cases in
INPSIRE to ensure ‘horizontal’ coverage of opportunities and challenges.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[9] [10] [11] [12] [13] [14] [15] [16]</source>
          [17]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Heath</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          , “
          <article-title>Linked Data: Principles and State of the Art</article-title>
          ,”
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          , “
          <article-title>The Emerging Web of Linked Data,” IEEE Intelligent Systems</article-title>
          , vol.
          <volume>24</volume>
          ,
          <year>2009</year>
          , pp.
          <fpage>87</fpage>
          -
          <lpage>92</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>W3C/IETF URI Planning Interest</surname>
            <given-names>Group</given-names>
          </string-name>
          , “URIs, URLs, and
          <source>URNs: Clarifications and Recommendations 1</source>
          .
          <fpage>0</fpage>
          .”
          <string-name>
            <given-names>O.</given-names>
            <surname>Lassila</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.R.</given-names>
            <surname>Swick</surname>
          </string-name>
          ,
          <article-title>Resource Description Framework (RDF) Model</article-title>
          and
          <string-name>
            <given-names>Syntax</given-names>
            <surname>Specification</surname>
          </string-name>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>K.</given-names>
            <surname>Holtman</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Mutz</surname>
          </string-name>
          , “RFC 2295 - Transparent Content Negotiation in HTTP”,
          <source>Mar</source>
          .
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Hodge</surname>
          </string-name>
          ,
          <source>Report on the Outcome of the ECOTERM V Workshop</source>
          ,
          <string-name>
            <given-names>U.N.</given-names>
            <surname>Food</surname>
          </string-name>
          and Agriculture Organization, Rome, Italy:
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Onsrud</surname>
          </string-name>
          ,
          <article-title>Research and theory in advancing spatial data infrastructure concepts</article-title>
          ,
          <source>Redlands</source>
          Calif.: ESRI Press,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Schade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Granell</surname>
          </string-name>
          , and
          <string-name>
            <surname>L</surname>
          </string-name>
          , Díaz, “
          <article-title>Augmenting SDI with Linked Data”</article-title>
          ,
          <source>GIScience 2010 Workshop - Linked Spatiotemporal Data</source>
          ,
          <year>2010</year>
          , accepted for publication.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>INSPIRE</surname>
          </string-name>
          , INSPIRE Scoping Paper,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>INSPIRE</surname>
          </string-name>
          , INSPIRE Metadata Regulation,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>INSPIRE</surname>
          </string-name>
          ,
          <source>Network Services Architecture (Version 3.0)</source>
          .,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Lutz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ostländer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Kechagioglou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Cao</surname>
          </string-name>
          , “
          <article-title>Challenges for Metadata Creation and Discovery in a multilingual SDI -</article-title>
          <string-name>
            <surname>Facing</surname>
            <given-names>INSPIRE</given-names>
          </string-name>
          ,” Stresa, Italy:
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Schade</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          , “
          <article-title>Linked Data in SDI or How GML Is Not about Trees</article-title>
          ,” Guimarães, Portugal:
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>OGC</surname>
          </string-name>
          ,
          <string-name>
            <surname>Geography Markup Language (GML) Encoding</surname>
            <given-names>Standard</given-names>
          </string-name>
          <source>v3.2</source>
          .1,
          <string-name>
            <given-names>Open</given-names>
            <surname>Geospatial</surname>
          </string-name>
          Consortium Inc.,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Espinoza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Montiel-Ponsoda</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Gómez-Pérez</surname>
          </string-name>
          , “Ontology Localization”,
          <source>5th Fifth International Conference on Knowledge Capture (KCAP)</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Davidson</surname>
          </string-name>
          ,
          <article-title>Designing URI Sets for the UK Public Sector</article-title>
          ,
          <source>UK Chief Technology Officer Council</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>D.</given-names>
            <surname>Ayers</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Vlkel</surname>
          </string-name>
          , “
          <article-title>Cool URIs for the Semantic Web”</article-title>
          , Interest Group Note 20080331,
          <issue>W3C</issue>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>