<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Visualizing Uncertainty In Environmental Work- ows And Sensor Streams</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Karthikeyan Bollu Ganesh</string-name>
          <email>karthikeyan@uni-muenster.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Patrick Maue</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Geoinformatics (IFGI), University of Muenster</institution>
          ,
          <addr-line>D-48151 Muenster</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Environmental data and models are uncertain by nature. The lack of knowledge about, for example, the magnitude of potential measurement errors may lead to unforeseen consequences. This makes it di cult to assess the data's or model's usefulness for critical applications. We present an approach for the visualization of uncertainty coming from in-situ environmental sensors. The visualization component is part of a Web-enabled environmental modelling platform which also supports the speci cation of processing work ows. The concept of uncertainty and means for its encoding as part of the environmental data are introduced. The individual components in the processing work ows propagate and update the uncertainty information. We also explain how the uncertainty in the original sensor data has been identi ed, and how the visualization component has been implemented.</p>
      </abstract>
      <kwd-group>
        <kwd>Uncertainty</kwd>
        <kwd>Uncertainty Visualization</kwd>
        <kwd>Environmental Workows</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The on-going implementation of the European INSPIRE directive facilitated the
access to geographic information on the Web. Embedded in Spatial Data
Infrastructures (SDI), geospatial Web services provide means to access, process, and
visualize spatial data. Interoperability between di erent Web services is to a
certain degree ensured by the standards set by the Open Geospatial Consortium.
The standards also enable the direct integration into local geospatial
applications. But there has been only limited uptake by the environmental modelling
community. The concept of a Shared Environmental Information System (SEIS)
has been recently introduced to address this issue [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The published SEIS
principles aim to facilitate access to environmental information for public authorities
and the general public. This calls for new ways of how to publish environmental
services, and how to present the resulting information to also non-ICT skilled
end-users. Investigating the requirements for environmental services
infrastructures, including the execution of environmental services via standard runtime
engines for service work- ows and the Web-enabled visualization through
environmental portals, is subject of the research project ENVISION1 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
ENVISION aims to provide an environmental services infrastructure with ontologies
which investigates the distributed execution, semantic discovery and annotation
of environmental services. Results from our research regarding the encoding,
propagation, and visualization of uncertainty are presented in this paper.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Environmental Models As Work ows</title>
      <p>Environmental information is traditionally resulting from environmental
computer models. Simple computer models could perform atomic computations,
such as interpolating the elevation within the DEM (Digital Elevation Model).
To solve complex problems such as predicting the impact of an oil spill on the
local wildlife, the chaining of individual simple models might be required. In a
rst step, a weather forecasting service encapsulated as an OGC Web Processing
Service takes real-time weather data from an OGC Sensor Observation Service
to predict future weather conditions. The result is taken as input for the oil
drift model, which computes the dispersion and weathering of an oil slick for
the forecasting period. As nal step, this distribution may be used to assess the
impact of the oil slick on the local wildlife (e.g. using ESI2 maps). Each of these
models are implemented as WPS, the coupling with other Spatial Data Services
(SDS) serving the input is done with the Business Process Modelling Language
(BPEL). Standard BPEL engines can then execute these work ows.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Uncertainty In Environmental Data</title>
      <p>
        Results from the work ows help domain experts in their decision-making tasks,
e.g. for assessing response measures for an oil spill. Oil spills could be fought
by distributing detergents, skimming, or even burning. Which method to apply
depends on the results of the oil spill model. Taking the wrong decisions here
could potentially have devastating e ects on the environment; having correct
and highly certain results is therefore crucial. Much of the data that forms the
input for the computer models contains some sort of uncertainty. Environmental
information is uncertain by nature. "You are uncertain, to varying degrees, about
everything in the future; much of the past is hidden from you; and there is a
lot of the present about which you do not have full information. Uncertainty is
everywhere and you cannot escape from it" [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
3.1
      </p>
      <sec id="sec-3-1">
        <title>Uncertainty in Work Flows</title>
        <p>
          Uncertainty is an expression of con dence about our knowledge [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Uncertainty
results from imprecise measurements of environmental phenomena. It is
processed using geospatial algorithms, which generalise, infer, or merge data to
        </p>
        <sec id="sec-3-1-1">
          <title>1 More information available at http://www.envision-project.eu 2 ESI stands for "Environmental Sensitivity Index"</title>
          <p>
            generate new data. Uncertainty is part - and therefore an important aspect
of GI throughout the complete services work ow, starting with the creation of
the data until its visualisation on a map [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ]. The following two types of
uncertainty should be explicitly quanti ed to enable users to assess the quality of the
end product of environmental service chains. Input errors result from imprecise
measurements, either due to human error or insu cient sensing technology. The
human factor or wrongly calibrated sensors are hard to quantify, while errors
coming from sensors not sensitive enough can be known beforehand. Model
errors emerge during the processing of the data, e.g. the interpolation of unknown
values from sparse input data. The uncertainty of the resulting product depends
on both, input and model error. That requires a solution to add uncertainty
information to the data, and to keep and update uncertainty information while
processing the underlying data.
          </p>
          <p>
            Processing uncertain environmental data propagates the uncertainty often
unpredictably [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ]. Environmental models may introduce their own model errors,
which in uence the original uncertainty of the input observational data. The
introduced error depends on the selected algorithms. This work is focussing on
the visualization of uncertainty coming from work ows of environmental data
and processing services. Discussing how to compute the uncertainty coming from
the models is out of scope of this paper. This is addressed by the research
currently performed in the UncertWeb3 project; here we have to assume that the
processing components in the work ow are able to propagate and update the
uncertainty parameters. Finding adequate visualization techniques for
uncertainty is an active research topic. Examples for such techniques are portraying
uncertainty in graphs by showing normal distribution and con dence intervals,
colour models, or time-series charts. The following implementation presents a
tool showing the uncertainty in environmental data as charts.
4
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Implementation</title>
      <p>The implementation of this project is divided into the following sub-tasks: (1)
Identi cation of existence data quality parameters, (2) extending the standard
data format for sensor data with uncertainty information, and (3) visualizing
uncertainty in a graph.
4.1</p>
      <sec id="sec-4-1">
        <title>Identi cation Of Existing Uncertainty Parameters</title>
        <p>Information about data quality can be either qualitative (e.g. adding
information about the data provider to address issues such as trust) or, in most cases,
quantitative (e.g. completeness, accuracy, scale, and more). Data quality has
been acknowledged to play an important role for geographic information, and
OGC and ISO published standards for representing data quality parameters.
This work is focussing on the accuracy of sensor measurements. A piezometric</p>
        <sec id="sec-4-1-1">
          <title>3 More information available at: http://www.uncertweb.org/</title>
          <p>sensor system is used for ENVISION to monitor underground water levels. The
sensed data can be accessed through an OGC SOS4 interface. The accuracy
of the measured data from these piezometers is found to be around 1% of the
measured value5.
4.2</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Uncertainty Extensions For O&amp;M 2.0</title>
        <p>
          Sensed data always comes with some sort of error. From a conceptual
perspective all data should be considered to be uncertain. Even though most data lacks
information about uncertainty, some data sets may have descriptive
information about it in its metadata (following the ISO 19013 standard for data quality
metadata). This global de nition of uncertainty is many cases insu cient for
the visualization and assessment of the data set's usefulness for critical
application. The Uncertainty Markup Language (UncertML) has been introduced as
extension to the OGC Geography Markup Language to address this issue. It
is focussed on an XML encoding for the transport and storage of uncertainty
information [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. UncertML includes means to express simple summary statistics
(e.g., mean and variance) as well as complex representations such as parametric,
multivariate distributions at each point of a regular grid [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. Uncertainty can be
encoded either in form of (1) statistics, e.g. values for probability or the
quantile, (2) distributions, e.g. a normal distribution, or as (3) realisations. UncertML
2.0 relies on the OGC Observation &amp; Measurement (O&amp;M) standard to encode
uncertainty [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. The following listing for the piezometer observations encodes
Uncertainty as a Gaussian distribution. The mean value is to be understood
to be the actually sensed value, while the variance re ects the precision of the
sensor measurement.
&lt;om:resultQuality&gt;
&lt;gmd:DQ_QuantitativeAttributeAccuracy&gt;
&lt;gmd:result&gt;
&lt;gmd:DQ_UncertaintyResult&gt;
&lt;gmd:value&gt;
&lt;un:GaussianDistribution&gt;
&lt;un:mean&gt;0.89 0.87 &lt;/un:mean&gt;
&lt;un:variance&gt;0.01 0.01&lt;/un:variance&gt;
&lt;/un:GaussianDistribution&gt;
&lt;/gmd:value&gt;
&lt;/gmd:DQ_UncertaintyResult&gt;
&lt;/gmd:result&gt;
&lt;/gmd:DQ_QuantitativeAttributeAccuracy&gt;
&lt;/om:resultQuality&gt;
Listing 1 - Example of uncertainty-enabled O&amp;M 2.0
4 http://swe.brgm.fr/pleiade-core-service-ades-om2-0.0.1-recette/REST/sos?
        </p>
        <p>Request parameters and examples are available at: http://sosades.brgm.fr/
5 As reported by the service providers BRGM, the French geological survey</p>
      </sec>
      <sec id="sec-4-3">
        <title>Visualization</title>
        <p>The implementation adopts the visualization through the charts. The component
expects the element Gaussian Distribution and computes the according
uncertainty intervals. A time-series chart displays the maximum, minimum value of
the uncertainty and the mean value of the incoming observations. The following
screen-shots includes the chart viewer on the left side, and the map showing the
according sensor positions on the right side. They belong to a set of components
developed in the ENVISION project. The individual modules are implemented
as Portlets (compliant to the Java Portlet Speci cation 2866), which can be best
described as pluggable user interface components for the Web. By simply
selecting one of the sensors displayed in the map, the according time series can
be visualized in the chart. The red line represents the actually observed values
(the mean), while the blue lines represent the according boundaries of the
uncertainty intervals. The chart is based on a JavaScript library7 which supports
rich interaction with the graph. The user can hover over the chart to see the
individual values at di erent points in time.
The focus of this work is on the visualization of uncertainty resulting from
measurement errors and the processing of the environmental data. It relies on the
encoding of the uncertainty using the UncertML standard. How to come up with
6 More information available at: http://www.jcp.org/en/jsr/detail?id=286
7 More information available at: http://www.highcharts.com/
this uncertainty information - and how to propagate uncertainty in geospatial
work ows - has not been subject of this research. This is investigated in the
research project UncertWeb. Hence, future work will focus on integrating the other
features supported by UncertML in the visualization components. This includes
research on the usability, i.e. how can we best communicate uncertainty to
support the end-users in the decision making process. In ENVISION the execution
of the geospatial work- ow is handled by a distributed execution infrastructure.
It includes techniques for the optimization of the work- ows, and the ad-hoc
adaptation of execution paths according to certain context parameters. Future
work will also investigate how uncertainty information may contribute to this
adaptation process.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work has been funded by the European research project ENVISION
(FP7249170, see http://www.envision-project.eu)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cox</surname>
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Observations and Measurements</article-title>
          .
          <source>Technical report 05-087</source>
          , OGC. (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>European</given-names>
            <surname>Commission</surname>
          </string-name>
          .
          <article-title>COM 46 nal: Towards a Shared Environmental Information System (SEIS). Communication from the Commission</article-title>
          .
          <article-title>(</article-title>
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gr</surname>
            <given-names>nmo</given-names>
          </string-name>
          , R.,
          <string-name>
            <surname>Roman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Llaves</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Maue</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <source>ENVISION D3</source>
          .
          <article-title>1: MaaS Composition Portal - Architecture speci cation</article-title>
          . Available at http://www. envision-project.
          <source>eu</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Heuvelink</surname>
            ,
            <given-names>G. B. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brown</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Loon</surname>
            ,
            <given-names>E. E. van.</given-names>
          </string-name>
          :
          <article-title>A probabilistic framework for representing and simulating uncertain environmental variables</article-title>
          .
          <source>International Journal of Geographical Information Science</source>
          ,
          <volume>21</volume>
          (
          <issue>5</issue>
          ),
          <fpage>497</fpage>
          -
          <lpage>513</lpage>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lindley</surname>
            ,
            <given-names>D. V.</given-names>
          </string-name>
          : Understanding Uncertainty (p.
          <fpage>272</fpage>
          ). Wiley-Interscience (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Roman D.</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Schade</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Berre</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Bodsberg</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Langlois</surname>
          </string-name>
          .:
          <article-title>Environmental Services Infrastructure with Ontologies - A Decision Support Framework</article-title>
          .
          <source>In Proceedings of EnviroInfo Conference</source>
          <year>2009</year>
          , Berlin, Germany. (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cornford</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bastin</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ingram</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>UncertML : an XML schema for exchanging uncertainty</article-title>
          .
          <source>Proceedings of GISRUK</source>
          , Manchester, UK. (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cornford</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bastin</surname>
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Pebesma</surname>
            <given-names>E.</given-names>
          </string-name>
          :
          <source>OGC Discussion Paper</source>
          <volume>08</volume>
          - 122r:
          <article-title>Uncertainty Markup Language (UncertML)</article-title>
          . Available at: http://portal. opengeospatial.org/files/?artifact_id=
          <volume>33234</volume>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>