<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Linked Data for Common Agriculture Policy: Enabling Semantic Querying over Sentinel-2 and LiDAR Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dina Sukhobok</string-name>
          <email>dina.sukhobok@sintef.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hector Sanchez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jesus Estrada</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dumitru Roman</string-name>
          <email>dumitru.roman@sintef.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>SINTEF</institution>
          ,
          <addr-line>Forskningsveien 1a, 0373 Oslo</addr-line>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>TRAGSA, Calle Conde de Pen~alver</institution>
          ,
          <addr-line>84 - 2 planta, 28006 Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The amount of open and free satellite earth observation data combined with available data from other sectors (e.g. biodiversity, landscape elements, cadaster data) has the potential to enhance decisionmaking processes in various domains. An example of such a domain is agriculture, where the ability to objectively and automatically identify di erent types of agricultural features (e.g., irrigation patterns and landscape elements) can lead to more e ective agriculture management. In this paper we show the possibility to publish and integrate multi-sectoral data from several sources into an existing data-intensive service targeting better and fairer Common Agriculture Policy (CAP) funds assignments to farmers and land owners. We show an end-to-end approach for integrating multi-sectoral data and publishing the result as Linked Data with the help of the DataGraft platform. To demonstrate the use of the resulted dataset, we developed a visualization system prototype showing various information about agricultural parcel features.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>Common Agricultural Policy (CAP)</kwd>
        <kwd>data integration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>One important decision-making problem in Europe is the Common Agricultural
Policy (CAP) funds assignment to farmers and land owners. For the last 50 years,
CAP has been the European Union's most important common policy, which has
historically required a signi cant part of the budget. The main objective of the
policy is to support the provisioning of a stable supply of safe food produced
in a sustainable way at a ordable prices for consumers. At the same time CAP
aims to ensure a decent standard of living for farmers and agricultural workers,
sets requirements for environmental protection and food safety3.</p>
      <p>
        The Tragsa Group, which is the part of the group of companies administered
by the Spanish state-owned holding company Sociedad Estatal de
Participaciones Industriales (SEPI), has developed an extension of the existing service of
3 https://ec.europa.eu/agriculture/cap-overview_en
Spanish CAP funds assignments { the Common Agriculture Policy Assignment
Service (CAPAS)[
        <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
        ]. Currently, the assignment of funds is based on several
parameters provided by human operators, whose work is mainly concerned with
ortophotos, satellite and aerial images, and interested parties such as the land
owners. The CAPAS service aims to extend and improve the existing service with
the purpose of achieving a better and more objective assignment of funds as well
as to increase the transparency of the data and enabling its open use through
the reliance on Linked Data technologies. The main advantage of using Linked
Data technologies is the possibility to make data explicitly and publicly available
in a machine-readable and human-understandable format. The improvement is
based on the integration of new datasets, currently underused and the
replacement of human-generated data (subjective) for more objective data collected and
integrated from di erent cross-sectorial sources in an automated way. The main
users of the developed service are the Spanish Environment and Agriculture
Ministry, autonomous communities, Treasury Ministry, and Cadastre agencies.
Secondary users are banks, other public administrations and farmers.
      </p>
      <p>In this paper, we demonstrate the applicability of Linked Data technologies
to integrate relevant multi-sectoral data from several sources into an existing
CAP service. The resulting dataset can enable fairer fund speci cation, better
environmental management and cadastral assessment. In addition, we present a
CAPAS GIS viewer as a general showcase for various information about
agricultural parcel features.
2</p>
      <p>Linked Open Data from Sentinel-2 and LiDAR data
Data Sources. Various datasets were used in CAPAS service to produce an
enhanced assessment of suitability for agricultural grant requests. The aim of
CAPAS is to improve the e ciency of the existing Spanish CAP database, which
is the foundation of the funds assignment service. This database is also known as
LPIS or Land Parcel Identi cation System. The additional datasets come from
two main external sources. One source is Sentinel-24 satellite data,5 obtaining
the following layers:
{ Simple products generated by a single image of one speci c date, including
true color images, false color images and NDVI images, useful to identify the
type of crop in a parcel.
{ Complex products produced annually through several images taken at
different dates.</p>
      <p>Another source includes data from LiDAR6 ights within the Spanish
National Plan of Aerial Ortho-photography, obtaining the following layers:
4 https://sentinel.esa.int/web/sentinel/missions/sentinel-2
5 Sentinel-2 data are openly accessible at Sentinels Scienti c Data Hub (https://
scihub.copernicus.eu/) as raster images (JPEG2000).
6 https://en.wikipedia.org/wiki/Lidar
{ Landscape elements layer, de ned as areas of natural vegetation within the
agricultural parcels, including isolated trees, copses and hedgerows.
{ Ecological value layer, de ned as the total value of a speci c parcel,
calculating its protected surface under Natura 2000 protected sites speci cations
and the surface of the inventoried landscape elements.</p>
      <p>For LiDAR data, we make use of the LiDAR les provided by the Spanish
National Geographic Institute (IGN). After collecting data from external sources,
we applied the developed data processing algorithms (e.g., algorithm for
identi cation of crops, algorithm for detection of landscape elements7) in order to
simplify the huge amount of data. The processed data have been made available
as ESRI Shape Files8.</p>
      <p>
        Ontology description. Within scope of the CAPAS service we developed
the proDataMarket9[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] ontology for representing data relevant for service. The
ontology has been divided into several sub-ontologies, where each sub-ontology
contains concepts and properties speci c for the particular domain. This modular
approach also helped to handle the complexity of the model and made it easier
to maintain. The sub-ontologies used for publishing the CAPAS service related
data include Land Parcel Identi cation System (LPIS), Protected Sites, Sentinel
data, Landscape Elements, and Assessment.
      </p>
      <p>
        Linked data generation and publication. The publication of CAPAS
service data as Linked Data was performed with the help of DataGraft [
        <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
        ]10{ a
cloud-based platform for data cleaning and Linked Data generation. DataGraft
provides facilities such as raw data cleaning and preparation (most often from
tabular formats), mapping to standard Linked Data ontologies, and generating
a semantic RDF graph. Data cleaning and preparation activities for the CAPAS
service data included: generating and assigning unique identi ers to identify
unequivocally entities; data type casting; geospatial data conversion from the
Universal Transverse Mercator (UTM) system to the World Geodetic System
1984 (WGS84).
      </p>
      <p>After the input data les were cleaned, they were mapped to the concepts
de ned by the proDataMarket ontology and made available through a SPARQL
endpoint11. At this stage, data from two pilot areas was used in the service. The
DataGraft capabilities of reusing and extending existing data transformations
allowed inclusion of new data to the endpoint in a convenient way.
3</p>
      <p>Demonstration Outline
During the demonstration, we will introduce both the enabler for publishing
CAPAS service data as a Linked Data { the DataGraft platform, and a visualization
7 https://blog.prodatamarket.eu/wp-content/uploads/2017/04/paper_
submitted2.pdf
8 https://www.esri.com/library/whitepapers/pdfs/shapefile.pdf
9 http://vocabs.datagraft.net/proDataMarket/
10 https://datagraft.io/
11 https://rdf.datagraft.net/4037585987/db/repositories/capas-2
system prototype { the CAPAS GIS viewer12. The scenario demonstrated will
cover uploading raw CAPAS data in the DataGraft platform, using a prepared
transformation for data cleaning and Linked Data generation and the
demonstration of the visualization system. The CAPAS GIS viewer allows users to
con gure visualizations and browse data for two pilot areas from existing
Spanish CAP database as well as processed data obtained from Sentinel-2 and LiDAR
(see Figure 1).
Acknowledgements This work is partly funded by the EC H2020 project
proDataMarket (Grant number: 644497).
12 http://capas.prodatamarket.eu</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Navarro</surname>
          </string-name>
          ,
          <string-name>
            <surname>Mariano</surname>
          </string-name>
          , et al.
          <article-title>"CAPAS: A Service for Improving the Assignments of Common Agriculture Policy Funds to Farmers and Land Owners." Challenge+ DC@ RuleML</article-title>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Estrada</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jesus</surname>
          </string-name>
          , et al.
          <article-title>Enabling the Use of Sentinel-2 and LiDAR Data for Common Agriculture Policy Funds Assignment</article-title>
          .
          <source>ISPRS Int. J. Geo-Inf</source>
          .
          <year>2017</year>
          ,
          <volume>6</volume>
          ,
          <fpage>255</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Roman</surname>
          </string-name>
          ,
          <string-name>
            <surname>Dumitru</surname>
          </string-name>
          , et al.
          <article-title>"Datagraft: Simplifying open data publishing." ESWC (Satellite Events)</article-title>
          <year>2016</year>
          :
          <fpage>101</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Roman</surname>
          </string-name>
          ,
          <string-name>
            <surname>Dumitru</surname>
          </string-name>
          , et al.
          <article-title>"DataGraft: One-stop-shop for open data management." To appear in the Semantic Web Journal (SWJ) Interoperability, Usability, Applicability (published and printed by IOS Press</article-title>
          , ISSN:
          <fpage>1570</fpage>
          -
          <lpage>0844</lpage>
          ),
          <year>2017</year>
          , DOI: 10.3233/SW170263.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <surname>Ling</surname>
          </string-name>
          , et al.
          <article-title>The proDataMarket Ontology for Publishing and Integrating Crossdomain Real Property Data. To appear in the journal "Territorio Italia. Land Administration, Cadastre and Real Estate"</article-title>
          . n.2/
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>