<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">The uncertainty enabled model web (UncertWeb)</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Edzer</forename><surname>Pebesma</surname></persName>
							<email>edzer.pebesma@uni-muenster.de</email>
							<affiliation key="aff0">
								<orgName type="department">Institute for geoinformatics</orgName>
								<orgName type="institution">University of Münster</orgName>
								<address>
									<addrLine>Weseler Strasse 253</addrLine>
									<postCode>48151</postCode>
									<settlement>Münster</settlement>
									<region>DE</region>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dan</forename><surname>Cornford</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Computer Science and NCRG</orgName>
								<orgName type="institution">Aston University</orgName>
								<address>
									<postCode>B4 7ET</postCode>
									<settlement>Birmingham</settlement>
									<country key="GB">UK</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stefano</forename><surname>Nativi</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">CNR and University of Firenze</orgName>
								<address>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Christoph</forename><surname>Stasch</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Institute for geoinformatics</orgName>
								<orgName type="institution">University of Münster</orgName>
								<address>
									<addrLine>Weseler Strasse 253</addrLine>
									<postCode>48151</postCode>
									<settlement>Münster</settlement>
									<region>DE</region>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">The uncertainty enabled model web (UncertWeb)</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">49FD1DC501CF7C231A4D4B59247FACC4</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T06:14+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>interoperability</term>
					<term>uncertainty</term>
					<term>OGC</term>
					<term>workflows</term>
					<term>geostatistics</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>UncertWeb is a European research project running from 2010-2013 that will realize the uncertainty enabled model web. The assumption is that data services, in order to be useful, need to provide information about the accuracy or uncertainty of the data in a machinereadable form. Models taking these data as imput should understand this and propagate errors through model computations, and quantify and communicate errors or uncertainties generated by the model approximations. The project will develop technology to realize this and provide demonstration case studies.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Environmental models are often complex. They need to represent space and time in a discrete and referenced way, and typically need to cope with combinations of points, lines, polygons, or regular grids, and potentially irregular points in time (regular or irregular) or time intervals. For example, an air quality model may accept sources (emissions) from areas (agricultural sources), lines (representing traffic along streets) and points (e.g. construction sites or factories). It may discretize space as a regular grid in the horizontal direction and use vertical layers of different thickness in order to solve the continuity equation that describes the change over time of the concentration of a component resulting from transport, chemistry, dry and wet deposition, and emissions. As an input it may already require wind velocity fields generated by a numerical weather prediction model, or a Monte Carlo sample (ensemble) of these that summarize our limited knowledge about the atmosphere.</p><p>According to <ref type="bibr" target="#b0">[1]</ref>,</p><p>The Model Web is a concept for a dynamic network of computer models that, together, can answer more questions than the individual models operating alone [...]. It is based on a philosophy that encourages modellers to provide access to their models and model outputs via standard "web services" [...], which makes it easier for the models to exchange information.</p><p>Model inputs may be outputs from other models, but may also be sensor data, e.g. temperature sensor readings. To function well, the model web should provide standard web interfaces to allow both access to model resources and data resources. The seemingly sharp distinction between sensor readings and model outputs might be not that sharp: sensors need to do some processing to translate an electronic, analogue signal into a binary representation of a number reflecting temperature in some unit, and models cannot distinguish between measured or modelled values when they read them as input.</p><p>A major motivation to convert models and data sources into web services (or encapsulate models in web service interfaces) that are standardized, is that it allows an easier coupling of model and data resources, and allows the exchange of resources to evaluate robustness to particular choices for model representations or data resources. When access to model and data resources is open, a further motivation is that it democratizes access to scientific data and computation, and enhances the possibility to reproduce scientific research, and hence the transperency of scientific activity.</p><p>In addition to openness about the procedures used to reach a given result, a second pilar to obtaining transparency of scientific research is openness about the limits of knowledge obtained. Measurements have limited accuracy. Models are approximations to reality. Models using measurements result in values that are not identical to the "reality" modelled. The most common way to represent the degree of knowledge of a given quantity is by representing quantities through probability theory <ref type="foot" target="#foot_0">4</ref> .</p><p>The UncertWeb project (2010-2013) will realize instances of the model web, and set it up in such a way that individual components of it (i) can understand inputs that are encoded as uncertain quantities, for example described using probability distributions rather than fixed values, (ii) can propagate errors through the model computations, and (iii) can output errors generated as a result of model approximation. At the time of writing this paper the project has barely started, but a number of technological challenges that will be faced, as well as the case studies that will be addressed, will be given.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Chaining environmental model web services</head><p>Web services need to advertise what they assume as inputs and outputs, in order for machines to be able to check that they meet the assumptions made. The industry standard for this is to us the SOAP/WSDL protocol. The Open Geospatial Consortium (OGC, [2]), a standardisation body dedicated to geospatial data handling, has made assumptions about OGC-compliant web services that do not match the SOAP protocol. Using OGC Web Services (OWS) has the advantage that software in the geospatial domain understands them (read: clients exist that can work with their output), but has the disadvantage that standard interactive, visual tools for web service chaining such as Kepler, Taverna and BPEL enginges <ref type="bibr">[4,</ref><ref type="bibr">3,</ref><ref type="bibr" target="#b1">6]</ref> for workflow orchestration do not work without a lot of additional effort. Within UncertWeb we are exploring a number of ways of integrating OWS with the SOAP/WSDL world.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">O&amp;M, WPS, profiling</head><p>Model webs, consisting of environmental models that are used for nowcasting typically integrate current sensor data in the workflow. Sensor data can be provided over the OGC Sensor Observation Service, which yields either a SensorML document or a document conforming to the Observations and Measurements (short: O&amp;M) conceptual model. Because observations and measurements can, in principle, be anything from a temperature reading to a video stream to a text message telling "It's hot in here", and time and space descriptions can have many forms, this standard has to cope with a potentially huge number of possible data representations. This makes it hard to predict whether an arbitrary O&amp;M compliant document will be accepted by a particular service. To limit the scope of possibilities, so called profiles for particular applications or application domains, can be developed as a workaround. This is important to allow developers to support only the aspects of O&amp;M that are required in the given application context. Attemping to support the complete conceptual model would be beyond the scope of any existing systems.</p><p>An OGC standard for a service that could execute a model is the Web Processing Service (WPS). This service puts no constraints to the type of process that can be executed, nor what the input should and output will look like. Again, the advice is to develop a profile to describe the constraints on this in a machine-readable way.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">UncertML</head><p>Error propagagion, and in particular error propagation by Monte Carlo simulation is not complicated in nature-instead of running a model once it is simply run, say, 1000 times with inputs sampled from the distribution that reflects the uncertainty of this particular input. The tricky bit is to organize the set of inputs and outputs when they have complicated structure (e.g. are a time series of maps), and to derive relevant information from them.</p><p>One layer of information needed is that of semantics of the uncertainty: which names do we give particular properties, such as the 10th realisation in a Monte Carlo sample, the mean and variance, or the 95-percentile. UncertML <ref type="bibr" target="#b2">[7]</ref> is the starting point for a markup language to encode uncertain information. Uncer-tWeb will improve and test this first version.In particular within UncertWeb, UncertML will be separated into a simple conceptual model, a controlled vocabulary and series of encodings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Discovery</head><p>UncertWeb discovery functionalities will consider environmental models descriptions expressed as specific metadata model profiles. These will include the extra information needed to discover and chain models according to significant use cases. The proposed solutions will consider the existing and ongoing specifications from relevant initiatives (e.g. GEOSS, GMES, INSPIRE, etc.). The profiles will be based on well-accepted international standards, where appropriate.</p><p>As far as discovery and access are concerned, presently UncertWeb is investigating the following challenges: a Uncertainty support for Scientific Data Types, with particular reference to the next data model and encoding languages: Geography Mark-up Language (GML), Obeservation&amp;Measurement (O&amp;M), and Common-Data-Model (CDM) / netCDF / ncML; b An extended SOA approach addressing interoperability for uncertainty-enabled sharing of environmental resources. A SOA brokering approach was successfully experimented in several GEO/GEOSS initiatives and FP7 projects (e.g. GEO AIP, EuroGEOSS project, etc.) to reduce/solve interoperability issues on discovery, access, and semantics functionalities. c Support of uncertainty in discovery services; this concerns: (i) possible uncertainty queryables, e.g. relative/average error on a data coverage; (ii) uncertainty accommodation in standard metadata models, e.g. ISO 19115, IN-SPIRE Metadata profile, etc.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Service chaining</head><p>As for chaining and publication methodology and services, presently UncertWeb is defining the standard baseline for chaining, including standard services (e.g. OGC services and ISO models) as well as communities of practice specification and special interoperability arrangements, e.g. from the biodiversity community.</p><p>Besides, in keeping with the Model Web principles, UncertWeb is investigating a suitable approach for chaining uncertainty-enabled services, starting with data and processing services. Different possible solutions (i.e. legacy frameworks such as Kepler/Taverna/JOpera [4,3,5]; standard environments such as BPEL <ref type="bibr" target="#b1">[6]</ref>; Composition-as-a-Service, and Mash-ups that could include Google maps) are compared evaluating their infrastructure complexity, entry level barrier, flexibility, and portability.</p><p>An extended SoA approach, applied to the data service domain, promises to facilitate workflows chaining and accomplish brokered data access for not directly interoperable nodes. In fact, this approach could also enable the transformation/mediation of uncertainty related metadata description and uncertain scientific data-types.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5">Methods and tools for uncertainty management</head><p>Generic technology for building uncertainty enabled model webs is only partially available, and UncertWeb will need to develop some more. Among these are improved Monte Carlo methods, needed to reduce the number of model runs required when model runs are computationally expensive. Other developments involve expert elicitation, a method where experts opinions are used, and combined to obtain quantitative uncertainty measures for quantities for which this is hard or impossible to obtain experimentally. Spatial, temporal or spatiotemporal aggregation or disaggregation is often needed to obtain information at a finer or coarser spatial resolution, and has consequences for uncertainty: the average value for a country is easier to estimate than the value for each point. The value, limitations and requirements to Monte Carlo simulation methods for error propagation will also be studied in the context of complex models with high-dimensional parameter spaces. Finally, visualisation of uncertain information <ref type="bibr" target="#b5">[10]</ref> is an active area of research, and is of particular importance to communicate information to end users, who could be domain experts or members of the public. Usability testing is foreseen to evaluate different alternatives.</p><p>3 Case studies The system developed in this case study (Fig 1 <ref type="figure">)</ref> will enable land cover data to be efficiently exchanged and validated, prior to being used to measure (mainly anthropogenic) changes to protected zones. The model server that will be put in place to compute and serve interoperable habitat information will benefit from validated information, and the uncertainty framework developed in UncertWeb will further allow us to quantify uncertainties in habitat changes. The coupling in a Service Oriented Architecture (SOA) environment of this habitat model server with climate change model servers will provide powerful new mechanisms to policy makers and the scientific community for performing multi-scale analyses of biodiversity-related problems and for forecasting ecological change. The prototype systems developed in biodiversity modelling work package (Fig. <ref type="figure" target="#fig_1">2</ref>) within UncertWeb will enable policy makers to integrate computer models that simulate land-use and its change under different economic and climatic scenarios. An important advance is the ability to propagate uncertainty through the model Web: instead of focusing on highly constrained scenarios (as in the FP6 LUMOCAP project, http://www.riks.nl/projects/LUMOCAP), we can consider the impacts of uncertainty at the policy level. The model Web will also allow us to consider other facets of the land-use problem, and we could open up the possibility of using some more detailed environmental models (for examples, see <ref type="bibr" target="#b6">[11]</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Biodiversity modelling</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Food chain modelling</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Air quality modelling</head><p>The application of uncertainty-enabled forecasts for air quality (Fig. <ref type="figure" target="#fig_2">3</ref>) is a logical and inevitable extension of numerical weather prediction ensemble forecasts. Whilst current developments mainly focus on the regional scale, the concept is just as applicable, and indeed more useful, for local authorities on the urban scale. It is expected that in the near future such ensemble forecasts will be commonly available, and so it is essential to develop a communications system suitable for dealing with such information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Human activity modelling</head><p>Activity-based modelling of individual movements (Fig. <ref type="figure" target="#fig_3">4</ref>) is an area that is just entering the operational phase, because of its complexity, demands on input data, and runtime requirements. The quantification of the prediction errors which result from model approximation and input uncertainties is a novel application  area, but is required in order to acknowledge and manage the limited reliability of outputs from realistic models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5">Integrating air quality and human activity modelling</head><p>Successful integration of the two case studies on air quality forecasting and individual activity modelling (Fig. <ref type="figure" target="#fig_4">5</ref>) will be an important proof of concept for the uncertainty-enabled model Web. The application of human activity modelling is the first of its kind in Europe, but its predictions are subject to uncertainty. Using these uncertain predictions to feed an air quality model is a challenging and innovative approach, and will allow assessment of the input (predicted emissions from traffic) as well as assessment of exposure based on realistic allocation of individuals in space and time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Discussion</head><p>UncertWeb is an ambitious program that has the potential to demonstrate that environmental model webs work, and can sensibly cope with uncertainty in a quantitative way. A number of technological issues to realize this will have to be tackled, solved or circumvented, in four very diverse case studies a number of cutting edge challenges, including that of linking physical atmospheric modelling (air quality) with social, agent-based human activity modelling while taking into account the major sources of uncertainty. In particular we aim to address the usability of the resulting system, providing tools to enable people to assess, manage and visualise uncertainties in chains of web services.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Work flow for biodiversity modelling</figDesc><graphic coords="5,134.77,358.68,345.81,132.88" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Work flow for food chain modelling</figDesc><graphic coords="6,134.77,157.72,345.83,127.03" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Work flow for air quality modelling</figDesc><graphic coords="7,134.77,295.78,345.82,97.19" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. Work flow for human activity modelling</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. 5 .</head><label>5</label><figDesc>Fig. 5. Work flow for integrating air quality and human activity modelling</figDesc><graphic coords="8,134.77,115.83,345.81,212.66" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_0">Probability theory is traditionally used to represent aleatory uncertainty (variability) and epistemic uncertainty (lack of knowledge). An alternative that models concepts as vague (sometimes called ontological uncertainty) rather than crisp in a quantitative fashion is to use fuzzy set theory, although some argue that this can be addressed within a subjective Bayesian perspective.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work has been funded by the European Commission, under the Seventh Framework Programme, by Contract No. 248488 with the DG INFSO. The views expressed herein are those of the authors and are not necessarily those of the European Commission. More information on UncertWeb and UncertML can be found on http://www.uncertweb.org/ and http://www.uncertml.org/, respectively.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Looking Forward: Applying an Ecological Model Web to assess impacts of climate change</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">N</forename><surname>Geller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Melton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biodiversity</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">3&amp;4</biblScope>
			<biblScope unit="page" from="79" to="83" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Building scientific workflow with Taverna and BPEL: A comparative study in caGRID</title>
		<author>
			<persName><forename type="first">W</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Missier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Madduri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Foster</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings. 4th International workshop on Engineering Service-Oriented applications (WESOA)</title>
				<meeting>4th International workshop on Engineering Service-Oriented applications (WESOA)</meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page">118129</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Uncertainty Markup Language (UncertML)</title>
		<author>
			<persName><forename type="first">M</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Cornford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bastin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Pebesma</surname></persName>
		</author>
		<ptr target="http://portal.opengeospatial.org/files/?artifact_id=33234" />
	</analytic>
	<monogr>
		<title level="m">Open Geospatial Consortium</title>
				<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="122" to="123" />
		</imprint>
	</monogr>
	<note>Discussion Paper 08-</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="http://portal.opengeospatial.org/files/?artifact_id=26667" />
		<title level="m">Sensor Observation Service</title>
				<imprint>
			<date type="published" when="2007">2007a</date>
		</imprint>
		<respStmt>
			<orgName>OGC</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">OpenGIS Web Processing Service</title>
		<ptr target="http://portal.opengeospatial.org/files/?artifact_id=24151" />
		<imprint>
			<date type="published" when="2007">2007b</date>
		</imprint>
		<respStmt>
			<orgName>OGC</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Visualising uncertain spatial and spatio-temporal data under different scenarios: an air quality example</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">J</forename><surname>Pebesma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Jong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Briggs</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of GIS</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="page">515527</biblScope>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Modelling environmental change in Europe: towards a model inventory (SEIS/Forward)</title>
		<author>
			<persName><surname>Eea</surname></persName>
		</author>
		<idno>No 11/2008</idno>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
	<note type="report_type">EEA Technical report</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
