<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Short Paper: An Ontology Design Pattern for Spatial Data Quality Characterization in the Semantic Sensor Web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Auriol Degbelo</string-name>
          <email>degbelo@uni-muenster.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Geoinformatics, University of Muenster</institution>
          ,
          <addr-line>Muenster</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Quality is an important aspect of data discovery in the Semantic Sensor Web. This work extends current endeavors to make the Sensor Web more semantic by introducing an ontology design pattern which facilitates the modeling of aspects of spatial data quality. The implementation of a software program over two scenarios demonstrates the usefulness of the ontology design pattern for the Semantic Sensor Web.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>Quality and components of spatial data quality</title>
      <p>
        Two major aspects of quality characterization in the SSW can be distinguished:
quality aspects that have to do with the characteristics of the data sources,
and those that have to do with the creation of applications based on these data
sources. These two categories of quality aspects were already summarily
mentioned in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The current work discusses only quality aspects of data sources. In
particular, it focuses on spatial data quality, in view of the fact that the SSW
reposes on standards developed by the Open Geospatial Consortium. The
following de nition (re ecting a data consumer perspective) to the term is suggested.
It is adapted from [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>Quality is the degree to which a data or service ful lls the needs of a
consumer. It is a function of intangible properties (of the data) considered
pertinent to the satisfaction of the consumer's needs.</p>
      <p>
        The intangible properties considered pertinent to the satisfaction of the
consumer's needs are also called components of data quality. As regards spatial
data, quality components vary from author to author. For instance, ISO 19113
includes completeness, logical consistency, positional accuracy, temporal
accuracy and attribute accuracy (see [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]); Paradis and Beard [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] includes accuracy,
resolution, consistency and lineage. This section will not review all the
components of spatial data quality. Instead, for the purposes of the illustration (see
section 4), the quality component `resolution' is chosen. Resolution is de ned
here as the amount of detail in the dataset. Spatial resolution of raster data can
be measured using the size of the raster cells; for spatial resolution of vector
data, several measures are possible (see [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] for the discussion).
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>ODP for spatial data quality characterization</title>
      <p>
        Gangemi and Presutti [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] suggest four ways of creating content ontology design
patterns (CPs), namely: (i) reengineering from other data models, (ii)
specialization/composition of other CPs, (iii) extraction from reference ontologies, and (iv)
creation by combining extraction, specialization, generalization, and expansion.
      </p>
      <p>The pattern presented in this section is obtained by extraction from the
Stimulus Sensor Observation (SSO) ontology design pattern 4. Two classes are left
out: the stimulus and the sensor. They are not included because for a quality
assessment operation, there is no need to describe the stimulus, nor is there a
need to describe a sensor that performs a measurement. Instead, it su ces that
the data consumer describes the procedure used for quality assessment.</p>
      <p>
        The documentation of the pattern uses the elds suggested in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. A small
difference though, is that the eld Diagram points to a conceptual map depicting
the ODP aligned to the foundational ontology Dolce Ultra Light (DUL), instead
of a UML class diagram as initially suggested in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The pattern is encoded in
the Web Ontology Language (OWL) using Protege 5.
4 or the Semantic Sensor Network (SSN) Ontology.
5 http://protege.stanford.edu/.
      </p>
      <p>Name: ontology design pattern for spatial data quality characterization
Intent: to describe spatial data quality components of sensor observations
Competency questions: (i) what is the value of the spatial data quality
component for this sensor observation? (ii) what is the quality criterion used to
assess the quality of the sensor observation?
Elements: The pattern has 5 elements: Data, DataQualityCriterion,
DataQualityComponent, DataQualityObservation and DataQualityResult. A data is
the output of an observation process involving a sensor, a stimulus, a sensed
property and a feature. It is equivalent to `Observation' as de ned in the
SSN ontology. A DataQualityComponent is any property of the data that
the consumer would like to approximate. Examples of spatial data quality
components were mentioned in section 2. A DataQualityCriterion is a
criterion de ned by the data consumer to get information about the quality of
the data. A DataQualityObservation is an operation by which a data quality
value is assigned to a data quality component using a data quality criterion.</p>
      <p>The outcome of a data quality observation is a DataQualityResult.
Diagram: see Figure 1
Consequences: Bene ts: (i) Reasoning and inference of spatial data quality
component values for existing sensor observations (ii) Detection of
inconsistencies during the integration of observations with di erent quality levels
(iii) Detection of inconsistencies during the integration of observations for
which di erent quality criterion have been used to assess the spatial data
quality. Trade-o s: The pattern does not give a quality value like `high
quality' or `low quality' as an end result. Instead, it helps to infer the value of a
spatial data component (e.g. resolution = 20m) and it is left up to the data
consumer to decide whether `resolution = 20m' means high or low quality
for the task at hand.</p>
      <p>Known uses: see examples of uses in section 4
Extracted from: the SSO ontology design pattern / the SSN ontology
Building block: http://wsmls.googlecode.com/svn/trunk/global/</p>
      <p>Patterns/Quality/DataQuality/dataqualitymodule.owl
4</p>
    </sec>
    <sec id="sec-4">
      <title>Validation of the ODP</title>
      <p>
        A software program was developed to test the usefulness of the ODP. This
software program serves as a proof that (i) the pattern can be used to perform
inference of quality component values and (ii) the pattern can be used to warn
against the integration of datasets for which di erent quality criteria are used
for quality assessment. This method for validation falls into the category
`Empirical validation' introduced in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Regarding the technologies, Java was used
as programming language, inference rules were written using the Semantic Web
Rule Language (SWRL), Pellet was used as reasoner, the ODP was accessed
using the OWL API, and Jena was used to perform SPARQL queries over the
ontology. Ontology instances were added to the ODP in order to answer the
two questions posed in ScenarioA and ScenarioB. The Java code is available
at http://ifgi.uni-muenster.de/~degbelo/Resources/SSN2012Degbelo or
upon request to the author.
      </p>
      <p>
        ScenarioA - Inference of data quality value. A decision-maker has at his/her
disposal di erent OGC services delivering data about an observed property. He/she
would like to answer the question - what is the value of the spatial resolution
for an observation o ering? 6 - in order to compare them and select the most
appropriate for his/her task. A quality criterion for the observation o ering is
the sampling density 7 (i.e. number of sensors per square meters).
Comment: Information about the spatial resolution of an observation o ering
can be deduced from the GetCapabilities le of an OGC service. Using the ODP
introduced earlier for this scenario, it is only required to parse the
GetCapabilities le and assert (or store) the spatial extent of the observation o ering as
well as its number of sensors. The spatial resolution can then be inferred using a
SWRL rule (see details in the Java code). Inference in turn is useful to address
one of the drawbacks of adding semantic annotations to sensor nodes in sensor
networks. In e ect, [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] pointed out that adding semantics to sensor nodes in a
6 The question is an application of the competency question (i) from section 3 to the
spatial data quality component `Resolution'.
7 This is only one way of assessing the resolution of an observation o ering. Resolution
as de ned in the SSN ontology can be inferred from the characteristics of the sensor.
sensor networks implies more data to be exchanged, which in turn leads to an
increase of sensor nodes' power consumption. Therefore, the less the amount of
semantic data to store, the better.
      </p>
      <p>ScenarioB - Detection of inconsistencies. This scenario is adapted from the
Oil Spill scenario 8 of the European project ENVISION 9. The project aims at
developing an infrastructure for environmental web services with ontologies.</p>
      <p>Scenario: accidental oil releases to the sea may have severe consequences on
both natural resources and human enterprises. For oil spill decision making, it is
essential to be able to predict the fate and e ects of the spilled oil. Fate prediction
requires data on the spill (location, time, amount, oil type), the environmental
conditions (wind, current), and geography (sea depths, coast line). A
decisionmaker has di erent datasets for oil spill prediction at his/her disposal and would
like to combine them.</p>
      <p>Comment: For the purposes of the illustration, it is assumed that - within
this scenario - the di erent types of data for oil spill prediction are available in
vector format. It is also assumed that the decision-maker has done a preliminary
look-over where all the datasets available were found to have a similar spatial
resolution (say 100meters).</p>
      <p>Given that there are various ways of de ning the spatial resolution for vector
data, an additional question to answer is: what criterion is used to assess the
spatial resolution of the observation o ering? 10 in order to ensure that
heterogeneous datasets which have the same data quality value are e ectively compatible
with respect to their resolution. This check is possible with the ODP proposed
through a simple SPARQL query (see details in the Java code).
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Knowledge about the quality of sensor observations is an important aspect of the
discovery of resources in the Semantic Sensor Web. This work has suggested an
ontology design pattern (ODP) to characterize the quality of sensor observations.
The ODP is relevant for the annotation of sensor observations with spatial data
quality components. It can be used to infer spatial data quality component
values for existing sensor observations and warn against the integration of sensor
observations assessed with di erent quality criteria. The ODP was aligned to the
foundational ontology Dolce Ultra Light and validated through the development
of a software program.
8 See a detailed presentation of the oil spill scenario at http://envision.brgm-rec.</p>
      <p>fr/OS-UseCase.aspx (last accessed: August 30, 2012).
9 See a presentation of the project at http://www.envision-project.eu.
10 This question is an application of the competency question (ii) from section 3 to the
spatial data quality component `Resolution'.
The work is funded by the German Academic Exchange Service (DAAD A/10
/98506), and the European Commission through the ENVISION Project
(FP7249170). It is conducted within the International Research Training Group on
Semantic Integration of Geospatial Information (DFG GRK 1498). The ideas
presented were shaped through discussions with Patrick Maue, Alejandro Llaves,
Werner Kuhn, Alkyoni Baglatzi, Henry Michels and Marcell Roth.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Barnaghi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meissner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presser</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moessner</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Sense and sens'ability: Semantic data modelling for sensor networks</article-title>
          . In: Cunningham,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Cunningham</surname>
          </string-name>
          , M. (eds.)
          <article-title>Proceedings of the ICT-MobileSummit 2009</article-title>
          . Santander,
          <string-name>
            <surname>Spain</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Botts</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Bringing the sensor web together</article-title>
          .
          <source>Geosciences</source>
          <volume>6</volume>
          ,
          <issue>46</issue>
          {
          <fpage>53</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Broring,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Echterho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Jirka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Simonis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Everding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Stasch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Liang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Lemmens</surname>
          </string-name>
          , R.:
          <article-title>New generation sensor web enablement</article-title>
          .
          <source>Sensors</source>
          <volume>11</volume>
          (
          <issue>3</issue>
          ),
          <volume>2652</volume>
          {
          <fpage>2699</fpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Compton</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barnaghi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bermudez</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garc</surname>
            a-Castro,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corcho</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cox</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Graybeal</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hauswirth</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herzog</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelsey</surname>
            ,
            <given-names>W.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Le Phuoc</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lefort</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leggieri</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neuhaus</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Passant</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , K.:
          <article-title>The SSN ontology of the W3C semantic sensor network incubator group</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Corcho</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garc</surname>
            a-Castro,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Five challenges for the semantic sensor web</article-title>
          .
          <source>Semantic Web</source>
          <volume>1</volume>
          (
          <issue>1</issue>
          ),
          <volume>121</volume>
          {
          <fpage>125</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Ontology design patterns</article-title>
          . In: Staab,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Studer</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          . (eds.) Handbook on ontologies, pp.
          <volume>221</volume>
          {
          <fpage>243</fpage>
          . Springer Berlin Heidelberg (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Goodchild</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          :
          <article-title>Scale in GIS: an overview</article-title>
          .
          <source>Geomorphology</source>
          <volume>130</volume>
          (
          <issue>1-2</issue>
          ), 5{
          <issue>9</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Hammar</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sandkuhl</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>The state of ontology pattern research: A systematic review of ISWC, ESWC and ASWC 2005-2009</article-title>
          . In: Blomqvist,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Chaudhri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Corcho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Presutti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Sandkuhl</surname>
          </string-name>
          ,
          <string-name>
            <surname>K</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the 2nd International Workshop on Ontology Patterns - WOP2010</source>
          .
          <article-title>CEUR-WS</article-title>
          .org, Shanghai, China (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Compton</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Stimulus-Sensor-Observation ontology design pattern and its integration into the semantic sensor network ontology</article-title>
          . In: Taylor,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Ayyagari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>De Roure</surname>
          </string-name>
          , D. (eds.)
          <source>The 3rd International workshop on Semantic Sensor Networks. CEUR-WS.org</source>
          , Shanghai, China (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kumi-Boateng</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yakubu</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Assessing the quality of spatial data</article-title>
          .
          <source>European Journal of Scienti c Research</source>
          <volume>43</volume>
          (
          <issue>4</issue>
          ),
          <volume>507</volume>
          {
          <fpage>515</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Paradis</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beard</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Visualization of spatial data quality for the decision maker: a data quality lter</article-title>
          .
          <source>URISA Journal</source>
          <volume>6</volume>
          (
          <issue>2</issue>
          ),
          <volume>25</volume>
          {
          <fpage>34</fpage>
          (
          <year>1994</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahoo</surname>
            ,
            <given-names>S.S.:</given-names>
          </string-name>
          <article-title>Semantic sensor web</article-title>
          .
          <source>IEEE Internet Computing</source>
          <volume>12</volume>
          (
          <issue>4</issue>
          ),
          <volume>78</volume>
          {
          <fpage>83</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Veregin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Data quality parameters</article-title>
          . In: Longley,
          <string-name>
            <given-names>P.A.</given-names>
            ,
            <surname>Maguire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.J.</given-names>
            ,
            <surname>Goodchild</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.F.</given-names>
            ,
            <surname>Rhind</surname>
          </string-name>
          , D.W. (eds.)
          <article-title>Geographical information systems: principles and technical issues</article-title>
          ,
          <source>chap. 12</source>
          , pp.
          <volume>177</volume>
          {
          <fpage>189</fpage>
          . John Wiley and Sons, New York, 2nd edn. (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>