<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>GeoTriples: a Tool for Publishing Geospatial Data as RDF Graphs Using R2RML Mappings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kostis Kyzirakos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ioannis Vlachopoulos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dimitrianos Savva</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Manegold</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manolis Koubarakis</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centrum Wiskunde &amp; Informatica</institution>
          ,
          <addr-line>Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics and Telecommunications National and Kapodistrian University of Athens</institution>
          ,
          <country country="GR">Greece</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>A plethora of Earth Observation data that is becoming available at no charge in Europe and the US recently re ects the strong push for more open Earth Observation data. Linked data is a paradigm which studies how one can make data available on the Web, and interconnect it with other data with the aim of making the value of the resulting \Web of data" greater than the sum of its parts. Open Earth Observation data that are currently made available by space agencies such as ESA and NASA are not following the linked data paradigm. Therefore, Earth Observation data and other kinds of geospatial data that are necessary for a user to satisfy her information needs can only be found in di erent data silos, where each silo may contain only part of the needed data. Publishing the content of these silos as RDF graphs, enables the development of data analytics applications with great environmental and nancial value. In this paper we present the tool GeoTriples that allows for the transformation of Earth Observation data and geospatial data into RDF graphs. GeoTriples goes beyond the state of the art by extending the R2RML mapping language to be able to deal with the speci cities of geospatial data. GeoTriples is a semi-automated tool that allows the publication of geospatial information into an RDF graph using the state of the art vocabularies like GeoSPARQL and stSPARQL, but at the same time it is not tightly coupled to a speci c vocabulary.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Geospatial Data</kwd>
        <kwd>data publishing</kwd>
        <kwd>GeoSPARQL</kwd>
        <kwd>stSPARQL</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Lots of Earth Observation (EO) data has become available at no charge in
Europe and the US recently and there is a strong push for more open EO data.
Linked data is a paradigm which studies how one can make data available on the
Web, and interconnect it with other data with the aim of making the value of the
resulting \Web of data" greater than the sum of its parts. In the last few years,</p>
      <p>This work has been funded in part by the FP7 project LEO (611141).
linked geospatial data has received attention as researchers and practitioners
have started tapping the wealth of geospatial information available on the Web.
As a result, the linked open data (LOD) cloud has been rapidly populated with
geospatial data. For example, Great Britain's national mapping agency,
Ordnance Survey, has been the rst national mapping agency that has made various
kinds of geospatial data from Great Britain available as open linked data.</p>
      <p>Recently, the geospatial semantic web has also started to be populated with
EO products (e.g., CORINE Land Cover and Urban Atlas published by project
TELEIOS3). In general, open EO data that are currently made available by space
agencies such as ESA and NASA are not following the linked data paradigm.
Therefore, EO data and other kinds of geospatial data that are necessary for
a user to satisfy her information needs can only be found in di erent data
silos, where each silo may contain only part of the needed data. Publishing the
content of these silos as RDF graphs, enables the development of data analytics
applications with great environmental and nancial value. With the recent
emphasis on open government data in many countries, the development of useful
Web applications utilizing EO data and geospatial data in general is just a few
SPARQL queries away.</p>
      <p>Geospatial data in general and EO data in particular, can come in vector
or raster form and are usually accompanied by metadata. Vector data, available
in formats such as ESRI shape les, KML, and GeoJSON documents, can be
accessed either directly or via Web Services such as the OGC Web Feature
Service or the query language of a geospatial DBMS. Raster data, available in
formats such as GeoTIFF, Network Common Data Form (netCDF), Hierarchical
Data Format (HDF), can be accessed either directly or via Web Services such as
the OGC Web Coverage Processing Service (WCS) or the query language of an
array DBMS, e.g., the array-query language SciQL4. Metadata about EO data
are encoded in various formats ranging from custom XML schemas to domain
speci c standards like the OGC GML Application schema for EO products and
the OGC Metadata Pro le of Observations and Measurements.</p>
      <p>
        Automating the process of publishing linked geospatial data has not been
addressed yet. For example, in the wild re monitoring and management application
that we developed in TELEIOS, custom Python scripts were used for publishing
all necessary data as linked data. For this reason, we designed and implemented
the tool GeoTriples in the context of the EU FP7 project LEO5. In this paper
we present the tool GeoTriples that allows for the transformation of EO data
and geospatial data in various formats, such as data stored in spatially-enabled
relational databases and raw les, into RDF graphs. GeoTriples goes beyond
the state of the art by extending the R2RML mapping language to be able to
deal with the speci cities of geospatial data. GeoTriples is a semi-automated
tool that allows for the publication of geospatial information into an RDF graph
using the state of the art vocabularies like GeoSPARQL [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], but at the same
3http://www.earthobservatory.eu/
4http://www.sciql.org/
5http://www.linkedeodata.eu/
time it is not tightly coupled to a speci c vocabulary. The publishing process
comprises three steps. First, GeoTriples generates automatically R2RML
mappings for publishing data that reside in spatially-enabled databases and raw les
(e.g., ESRI shape les). Afterwards, the user may edit these mappings according
to her needs (e.g., utilize a di erent vocabulary) and nally GeoTriples processes
these mappings for producing an RDF graph.
      </p>
      <p>The organization of the paper is as follows. Section 2 discusses related work.
In Section 3 we present the architecture of GeoTriples and our extensions to the
R2RML language for the geospatial domain. We discuss how GeoTriples
generates automatically R2RML mappings and how they are subsequently proccessed
for transforming a geospatial data source into an RDF graph. In Section 4 we
present an example where we demonstrate how GeoTriples is currently being
used for realizing the precision farming application that is being developed in
LEO. Finally, in Section 5 we concludes our work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        In this section we will present related work on methodologies and tools targeting
the transformation of existing data sources into RDF graphs. Much work has
been done recently on mapping relational data into RDF graphs. An extensive
discussion on mapping relational data into RDF can be found in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In this
section we will present in detail two prevailing approaches for mapping relational
data into RDF: the direct mapping approach and the R2RML mapping language.
Then we discuss some relevant tools that implement these approaches and we
nish our discussion with tools that convert geospatial information into RDF
graphs.
      </p>
      <p>
        Direct Mapping of Relational Data to RDF. A straightforward
mechanism for mapping relational data to the RDF data model is the Direct Mapping
of Relational Data to RDF 6 approach that became a W3C recommendation in
2012. According to this approach, relational tables are mapped to classes
dened by an RDF vocabulary, while the attributes of each table are mapped to
RDF properties that represent the relation between subject and object resources.
Identi ers, class names, properties, and instances are generated automatically
following the respective labels of the input data. For example, given the table
Address, the class &lt;Address&gt; will be generated, and all tuples will be instances
of this class. According to this approach, the generation of RDF data is dictated
by the schema of the relational database. This mechanism was initially de ned
in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is an implementation of such a mechanism.
      </p>
      <p>The Mapping Language R2RML. A language for expressing customized
mappings from relational databases to RDF graphs is the R2RML mapping
language7 that became W3C recommendation in 2012. R2RML mappings provide
the user with the ability to transform existing relational data into the RDF data
model, following a structure and a target vocabulary that is chosen by the user.
6http://www.w3.org/TR/2012/REC-rdb-direct-mapping-20120927/
7http://www.w3.org/TR/r2rml/
R2RML mappings refer to logical tables to retrieve data from an input database.
A logical table can be a relational table that is explicitly stored in the database,
an SQL view or a valid SQL select query. A triples map is de ned for each
logical table that will be exported into RDF. A triples map is a rule that de nes
how each tuple of the logical table will be mapped to a set of RDF triples. A
triples map consists of a subject map and one or more predicate-object maps.
A subject map is a rule that de nes how to generate the URI that will be the
subject of each generated RDF triple. Usually, the primary key of the relation
is used for this purpose. A predicate-object map consists of predicate maps and
object maps. A predicate map de nes the RDF property to be used to relate
the subject and the object of the generated triple. An object map de nes how
to generate the object of the triple, the value of which originates from the value
of the attribute of the speci ed logical table.</p>
      <p>R2RML processors are applications that take as input a relational database
and R2RML mappings and produce RDF graphs. Some R2RML processors
provide a virtual SPARQL endpoint that transparently translate SPARQL queries
to SQL queries by taking into account the given R2RML mappings.
Alternatively, an R2RML processor can choose to export all relational data as an RDF
dump according to the given mappings and then o er SPARQL or a linked data
interface over the produced data.</p>
      <p>
        Let us now present some tools that are able to translate raw data sources
into RDF following the direct mapping approach, R2RML or a custom mapping
language. OpenLink Virtuoso [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Morph8 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], Ultrawrap9 and D2RQ platform
[
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ] are capable of processing R2RML mappings. They support most of the
features of R2RML and can process R2RML mappings both for publishing input
data as an RDF graph and for querying the input data by translating a SPARQL
query into an SQL query. Triplify [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a popular tool that follows a light-weight
approach for mapping HTTP-URI requests onto relational database queries, and
translating the result into RDF statements.
      </p>
      <p>However, little attention has been paid to the problem of publishing
geospatial information in the Semantic Web. Most datasets that contain such
information are either generated manually or by semi-automated processes.</p>
      <p>LinkedGeoData10 project focuses on publishing OpenStreetMap11 data as
linked data. Sparqlify12, developed in the context of this project, is employed for
this task, using a proprietary mapping language. It creates mappings of spatial
datatypes into RDF but until now, it does not discuss on how to deal with
non-relational data.</p>
      <p>8https://github.com/jpcik/morph
9http://capsenta.com/ultrawrap/
10http://linkedgeodata.org/
11http://www.openstreetmap.org/
12http://sparqlify.org/</p>
      <p>The tool Geometry2RDF13 was the rst tool that allows for the conversion
of geospatial information that resides in a spatially-enabled relational database
into an RDF graph. It takes as input data stored in a spatially enabled DBMS
like Oracle Spatial or MySQL, and utilizes the libraries Jena and GeoTools to
produce an RDF graph. Geometry2RDF follows the direct mapping approach,
and allows the user to con gure the properties that connect a URI to the
serialization of a geometry and allows for the conversion of the coordinates to
the desired coordinate reference system. Geometry2RDF follows the direct
mapping approach that is not expressive enough to deal with the speci cities of the
geospatial domain. For example, since this work has preceded the proposal of
GeoSPARQL and stSPARQL, it does not produce RDF graphs according to
these vocabularies. For this purpose, the tool TripleGeo14 was recently
developed. TripleGeo is based on Geometry2RDF and allows the generation of RDF
graphs that follow the GeoSPARQL vocabulary. However, this tool has the same
shortcomings with Geometry2RDF since the mapping process of the input data
into an RDF graph that follows the GeoSPARQL vocabulary is hard-coded in
the implementation. Thus, signi cant e ort is required for modifying such tools
in order to support other vocabularies.</p>
      <p>
        A di erent approach was followed by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] where the authors present how
R2RML can be combined with a spatially enabled relational database in
order to transform geospatial information into RDF. For the manipulation of the
geometric information prior to its transformation into RDF, the authors
create several logical tables that are based on ad-hoc SQL queries that perform
the appropriate pre-processing (e.g., requesting the serialization of a geometry
according to the WKT standard). This approach demonstrates the power of
utilizing a general-purpose mapping language like R2RML, which is in contrast to
other approaches discussed earlier in this section. However, in this work, no
automated method for publishing geospatial datasets into RDF is discussed, and
dealing with di erent types of data sources was out of scope.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>The Tool GeoTriples</title>
      <p>In this section we will present in detail the tool GeoTriples that we developed for
transforming geospatial data sources into RDF. GeoTriples15 is an open-source
tool that is distributed freely according to the Mozilla Public License v2.0. In
this section we will present the architecture of GeoTriples and we will discuss
our implementation choices. We will describe in detail how GeoTriples generates
automatically R2RML mappings for publishing data that reside in
spatiallyenabled databases and raw les (e.g., ESRI shape les), and how it can process
such mappings for producing an RDF graph that follows the GeoSPARQL or
any other vocabulary.</p>
      <p>13http://mayor2.dia.fi.upm.es/oeg-upm/index.php/en/technologies/
151-geometry2rdf
14https://github.com/GeoKnow/TripleGeo
15http://sourceforge.net/projects/geotriples/</p>
      <p>Integrated</p>
      <p>Access and Processing
SQL</p>
      <p>SciQL
stSPARQL
GeoSPARQL</p>
      <p>Linked
Open Data</p>
      <p>Cloud
Raw Data</p>
      <p>Web Services
Shape
Files
KML
The abstract architecture of GeoTriples, from a data perspective is shown in
Figure 1. GeoTriples takes as input data that are stored in a spatially-enabled
database, data that reside in raw les (e.g., ESRI shape les), or the results that
derive from processing the aforementioned data (e.g., the result of a SciQL query
over raster or array data). Regarding the system architecture at a lower level,
GeoTriples uses a connector for each type of input data in order to transparently
access and process the input data. As Figure 2 depicts, GeoTriples comprises
two main components: the mapping generator and the R2RML processor. The
mapping generator takes as input a data source and creates automatically an
R2RML mapping that transforms it into an RDF graph. The generated mapping
is enriched with subject and predicate-object maps, in order to take into account
the speci ties of geospatial data and cater for all transformations that are needed
to produce an RDF graph that is compliant with the GeoSPARQL vocabulary.
To accomplish this task, we extend R2RML mappings to allow the representation
of a transformation function over input data. Afterwards, the user may edit the
generated R2RML mapping document to comply with her requirements (e.g.,
use a di erent vocabulary).</p>
      <p>
        Implementation Choices. One of the main choices we had to make during the
design of GeoTriples was to choose which RDB2RDF framework to extend. We
chose to extend the D2RQ platform which seemed to be the most mature system
for publishing relational data into RDF. It provides a mechanism for generating
and processing R2RML mappings for a variety of relational databases that are
accessible via JDBC, like MonetDB, PostgreSQL and Oracle. D2RQ provides
preliminary support for translating SPARQL queries into SQL queries given
some R2RML mappings. However, given the recent results of the system Morph
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], we will consider utilizing Morph as the R2RML processor of GeoTriples in
the future.
      </p>
      <p>KML
tr
o
c
e
n
n
o
C</p>
      <p>Mapping
Generator
D2RQ Engine</p>
      <p>R2RML
Processor
D2RQ Engine</p>
      <p>R2RML
Mapping
Document</p>
      <p>Transforming Geospatial Data into RDF graphs using</p>
      <p>GeoTriples
In this section we will present the main functionality of GeoTriples. First we
will show the extended R2RML mappings that are produced automatically by
GeoTriples that take into account the spatial dimension of the input data, and
then we will show how these mappings are processed subsequently by GeoTriples
in order to generate an RDF graph.</p>
      <p>
        Automatic Generation of R2RML Mappings. Much work has been done
recently on extending RDF to represent and query geospatial information. The
most mature results of this work are the data model stRDF and the query
language stSPARQL [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and the OGC standard GeoSPARQL. GeoSPARQL
is an OGC standard for the representation and querying of geospatial linked
data. GeoSPARQL de nes much of what is required for such a query
language by providing vocabulary (classes, properties, and functions) that can be
used in RDF graphs and SPARQL queries to represent and query geospatial
data. The top level classes de ned in GeoSPARQL are geo:SpatialObject
that has as instances everything that can have a spatial representation and
geo:Feature that represents all features and is the superclass of all classes of
features that the users might want to de ne. To represent geometric objects,
the class geo:Geometry is introduced by GeoSPARQL. Additional vocabulary
is also de ned by GeoSPARQL for asserting and querying information about
geometries.
      </p>
      <p>Given a spatially-enabled database or a raw le that contains geometric
information, GeoTriples generates an R2RML mapping document. Let us take for
example the Natura 2000 dataset of Germany that contains information about
protected areas in Germany and is distributed in the form of an ESRI shape le.
More details on this dataset can be found in Section 4. Conceptually, each
geometric object stored in the ESRI shape le should be an instance of the class
geo:Geometry, and all non-geometric attributes that characterize each
geometry are thematic attributes of the corresponding feature. Following this modeling
approach, we generate an instance of the class geo:Feature and an instance of
the class geo:Geometry for each geometric object stored in the ESRI shape le.
Geometric and non-geometric attributes that appear at the ESRI shape le are
assigned accordingly to the appropriate instances. Part of the R2RML mapping
that is generated automatically by GeoTriples to represent the features stored
in the Natura 2000 ESRI shape le is the following:
_:natura
rr:logicalTable [ rr:tableName "`natura`"; ];
rr:subjectMap [
rr:class geo:Feature;
rr:template "http://data.example.com/natura/Feature/id/{`gid`}"; ];
rr:predicateObjectMap [
rr:predicate nato:has_SITECODE;
rr:objectMap [ rr:datatype xsd:string;</p>
      <p>rr:column "`SITECODE`"; ];
];
rr:predicateObjectMap [
rr:predicate geo:hasGeometry ;
rr:objectMap [
rr:parentTriplesMap _:natura_geometry;
rr:joinCondition [
rr:child "gid";
rr:parent "gid"; ]; ]; ].</p>
      <p>This mapping document contains a triples map that describes how instances
of the class geo:Feature are constructed. All thematic information that is stored
in the ESRI shape le is assigned to the generated feature and a link between the
feature and its geometry is also generated. However, the notion of a primary key
is not de ned for ESRI shape les. Each ESRI shape le though is accompanied by
a relational table in dBASE format that stores infromation about the geometries,
so we de ne as a unique identi er for each geometric object the respective row
identi er in the dBASE table. In the example above we represent this information
as an extra attribute with name 'gid'.</p>
      <p>Part of the R2RML mapping that is generated automatically by GeoTriples
to represent the geometries stored in the Natura 2000 ESRI shape le is the
following:
_:naturaGeometry
rr:logicalTable [ rr:tableName "`natura`"; ];
rr:subjectMap [
rr:class geo:Geometry;
rr:template "http://data.example.com/natura/Geometry/id/{`gid`}"; ];
rr:predicateObjectMap [
rr:predicate geo:dimension;
rr:objectMap [
rrx:transformation [
rrx:function geof:dimension;
rrx:argumentMap (</p>
      <p>[rr:column "`Geom`"] ); ] ]; ].</p>
      <p>This mapping document contains a triples map that describes how instances
of the class geo:Geometry are constructed. All geometric information that is
stored in the ESRI shape le is assigned to the generated geometry. In addition,
object maps de ne that geospatial functions like geof:dimension are applied
to the serialization of each geometric object in order to produce the values that
will appear at the object part of the corresponding triple.</p>
      <p>Notice that in the above example we extended the de nition of an object map
by allowing it to be the RDF term obtained by applying a transformation on the
source data. Each transformation de nes the SPARQL built-in function or the
SPARQL extension function to be invoked, using as an argument the sequence
of RDF terms that are produced by the respective term maps.</p>
      <p>Processing of R2RML mappings for producing RDF graphs. R2RML
mappings usually consist of two triples maps; one for handling thematic
information and one for geospatial information. The triples map that handles thematic
information de nes a logical table that contains the thematic attributes and a
unique identi er for the generated instances. The latter could be either the
primary key of the table in case the input data is a relational database or a row
number in the dBASE table of an ESRI shape le. Combined with a URI
template, the unique identi er is used to produce the URI that serve as subjects of
the produced triples. The predicate object maps are also processed accordingly
in order to de ne RDF properties the value of which originate from the value of
the column of the thematic logical table.</p>
      <p>The triples map that handles the geospatial information of the input data
source, de nes a logical table with unique identi er similar to the thematic one.
However, according to the type of the data source, the de nition of this logical
table may vary. For instance, in the case of a relational database, it is de ned
by providing an appropriate SQL query that uses spatial functions provided by
spatially enabled relational backend (e.g. utilize the function ST_Dimension).
If the input source is an ESRI shape le, then GeoTriples will perform such
transformations on the y by evaluating the SPARQL extension function using
the JTS Topology Suite.
4</p>
      <p>Linked geospatial data in the precision farming
application of LEO
Let us now give an example that demonstrates how GeoTriples is being used in
LEO for the development of a precision farming application. The aim is to
develop a precision farming application that combines traditional geospatial data
with linked geospatial data for enhancing the quality of precision farming
activities. Precision farming is a concept based on observing the heterogeneity within
an agricultural eld and providing information about cultivating each part of
the eld according to the characteristics of the soil. Precision farming aims to
solve numerous problems for farmers such as the minimization of the
environmental pollution by fertilizers. For dealing with this issue, the farmers have to
comply with many legal and technical guidelines that require the combination
of information that resides in diverse information sources. In this section we
present how linked geospatial data can form the knowledge base for providing
solutions for this problem. We published the following datasets as RDF graphs
using GeoTriples in order to use them in the precision farming application.</p>
      <p>Talking Fields16 is a precision farming project aiming to increase the e
ciency of agricultural production via precision farming by means of geo-information
services integrating space and ground-based assets. Currently, TalkingFields
produces products for improved soil probing using satellite-based zone maps, and
provide services for monitoring crop development through provision of biomass
maps and yield estimates.</p>
      <p>Natura 200017 is an ecological network designated under the Birds
Directive and the Habitats Directive which form the cornerstone of Europe's nature
conservation policy. National authorities submit a standard data form that
describes each site and its ecology in order to be characterized as a Natura site.</p>
      <p>OpenStreetMaps (OSM)18 is a collaborative project for publishing free
maps of the world. OSM maintains a community-driven global editable map
that gathers map data in a crowdsourcing fashion. LinkedGeoData (LGD)19, is
a project focused on publishing OSM data as linked data.</p>
      <p>
        LGD data is the only dataset that is already published as linked geospatial
data. For the rest datasets, we design an appropriate ontology that follows the
GeoSPARQL vocabulary. Afterwards, we use GeoTriples in order to produce
the R2RML mappings that dictate the process of generating the desired RDF
output. Finally, we use the R2RML processor of GeoTriples for translating the
input data into RDF graphs. The produced data are then stored into an
appropriate geospatial RDF store. We chose to utilize our own geospatial RDF store
Strabon20 [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>Let us now see an example query that can provide a precision farming
application with information about the parts of agricultural elds that are either
close to a river or are located within a protected area. This information allows
the precision farming application to take into account legal restrictions regarding
distance requirements when preparing the prescription maps that the farmers
will utilize afterwards.</p>
      <p>16http://www.talkingfields.de/
17http://ec.europa.eu/environment/nature/natura2000/access_data/index_
en.htm
18http://www.openstreetmap.org
19http://linkedgeodata.org
20http://www.strabon.di.uoa.gr/</p>
      <p>Example. Select all parts of agricultural elds that are close to a river or are
located inside a protected area.</p>
      <p>SELECT distinct ?field_name ?river_name ?cell_wkt
WHERE {{?river rdf:type osmo:River ;
osmo:hasName ?river_name ;
geo:hasGeometry ?river_geo .
?river_geo geo:asWKT ?river_wkt .
?field rdf:type tf:Field ;
tfo:hasFieldName ?field_name ;
tf:hasRasterCell ?cell .
?cell geo:hasGeometry ?cell_geo ;
?cell_geo geo:asWKT ?cell_wkt .</p>
      <p>FILTER(geof:distance(?river_wkt ,</p>
      <p>?cell_wkt, uom:meter) &lt; 100) .
} UNION {
?field
rdf:type tf:Field ;
tfo:hasFieldName ?field_name ;
tf:hasRasterCell ?cell .
?cell geo:hasGeometry ?cell_geo ;
?cell_geo geo:asWKT ?cell_wkt .
?nat rdf:type nat:NaturaArea ;</p>
      <p>geo:hasGeometry ?nat_geo .
?nat_geo geo:asWKT ?nat_wkt .</p>
      <p>FILTER(geof:contains(?nat_wkt, ?cell_wkt)}}</p>
      <p>This above query uses the geo:distance function whose arguments are the
serializations of two geometries and a URI that identi es a unit of measurement.
The result of this metric function is the minimum distance between these two
geometric objects. This lter identi es the parts of a eld that are close to a
river, so the precision farming application should adjust the fertilizer/pesticide
usage accordingly. The second part of the query identi es the parts of a eld that
resides inside a protected, thus the precision farming application should adjust
the fertilizer/pesticide usage accordingly. Figure 3 depicts protected areas, rivers,
agricultural elds and the parts of the agricultural elds that are close to a river.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In this paper we presented the tool GeoTriples that transforms geospatial data
that reside in a spatially-enabled DBMS or raw les into RDF graphs, by
extending the R2RML mapping language. This allows GeoTriples to transform
geospatial data into RDF graphs without being tightly coupled to a speci c
vocabulary. We plan to extend GeoTriples to support numerous types of vector
data formats such as KML, as well as raster formats like GeoTIFF and netCDF.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dietzold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hellmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Aumueller</surname>
          </string-name>
          . Triplify:
          <article-title>Lightweight Linked Data Publication from Relational Databases</article-title>
          .
          <source>In Proceedings of the 18th International Conference on World Wide Web, WWW '09</source>
          , pages
          <fpage>621</fpage>
          {
          <fpage>630</fpage>
          , New York, NY, USA,
          <year>2009</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>T.</surname>
          </string-name>
          Berners-Lee.
          <article-title>Relational Databases on the Semantic Web</article-title>
          . http://www.w3.org/DesignIssues/RDB-RDF.html.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Seaborne</surname>
          </string-name>
          .
          <article-title>D2RQ-treating non-RDF databases as virtual RDF graphs</article-title>
          .
          <source>In Proceedings of the 3rd international semantic web conference (ISWC2004)</source>
          , volume
          <year>2004</year>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>K.</given-names>
            <surname>Chentout</surname>
          </string-name>
          and
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Vaisman</surname>
          </string-name>
          .
          <article-title>Adding Spatial Support to R2RML Mappings</article-title>
          .
          <source>In OTM Workshops</source>
          , volume
          <volume>8186</volume>
          of Lecture Notes in Computer Science. Springer,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Garbers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Maresch</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Becker</surname>
          </string-name>
          .
          <article-title>The D2RQ Platform</article-title>
          . http://d2rq.org/.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>O.</given-names>
            <surname>Erling</surname>
          </string-name>
          and
          <string-name>
            <surname>I. Mikhailov. RDF</surname>
          </string-name>
          <article-title>Support in the Virtuoso DBMS</article-title>
          . In S. Auer,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Muller, and</article-title>
          <string-name>
            <given-names>A. V.</given-names>
            <surname>Zhdanova</surname>
          </string-name>
          , editors,
          <source>CSSW</source>
          , volume
          <volume>113</volume>
          <source>of LNI</source>
          , pages
          <volume>59</volume>
          {
          <fpage>68</fpage>
          .
          <string-name>
            <surname>GI</surname>
          </string-name>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>K.</given-names>
            <surname>Kyzirakos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Karpathiotakis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Koubarakis</surname>
          </string-name>
          .
          <article-title>Strabon: A semantic geospatial dbms</article-title>
          . In P.
          <string-name>
            <surname>Cudre-Mauroux</surname>
            , J. He in, E. Sirin,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Tudorache</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Hauswirth</surname>
            ,
            <given-names>J. X.</given-names>
          </string-name>
          <string-name>
            <surname>Parreira</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Schreiber</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Bernstein</surname>
          </string-name>
          , and E. Blomqvist, editors,
          <source>International Semantic Web Conference (1)</source>
          , volume
          <volume>7649</volume>
          of Lecture Notes in Computer Science, pages
          <volume>295</volume>
          {
          <fpage>311</fpage>
          . Springer,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>K.</given-names>
            <surname>Kyzirakos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Manegold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Vlachopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Savva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Koubarakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Nikolaou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Giannakopoulou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Smeros</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Valentin</surname>
          </string-name>
          .
          <article-title>Data models and languages for mapping EO data to RDF. Deliverable D2.1, LEO - FP7 ICT Project</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. OGC.
          <article-title>GeoSPARQL - A geographic query language for RDF data</article-title>
          ,
          <year>November 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>F.</given-names>
            <surname>Priyatna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Sequeda</surname>
          </string-name>
          .
          <article-title>Formalisation and experiences of R2RMLbased SPARQL to SQL query translation using Morph</article-title>
          . In C.-W. Chung,
          <string-name>
            <given-names>A. Z.</given-names>
            <surname>Broder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shim</surname>
          </string-name>
          , and T. Suel, editors,
          <source>WWW</source>
          , pages
          <volume>479</volume>
          {
          <fpage>490</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>D.</given-names>
            <surname>Steer</surname>
          </string-name>
          . SquirrelRDF,
          <year>2006</year>
          . http://jena.sourceforge.net/SquirrelRDF/.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>