=Paper=
{{Paper
|id=None
|storemode=property
|title=Supporting Content Provision in Environmental Information Infrastructures
|pdfUrl=https://ceur-ws.org/Vol-679/paper5.pdf
|volume=Vol-679
|dblpUrl=https://dblp.org/rec/conf/enviroinfo/SchadeD10
}}
==Supporting Content Provision in Environmental Information Infrastructures==
Supporting Content Provision in Environmental
Information Infrastructures
Sven Schade1 and Laura Díaz2
1
Institute for Environment and Sustainability
European Commission, Joint Research Centre
Ispra, Italy
2
Institute of New Imaging Technologies
University Jaume I
Castellón, Spain
sven.schade@jrc.ec.europa.eu, laura.diaz@uji.es
Abstract. Information Infrastructures for managing and providing
environmental resources are requested by numerous initiatives on regional,
national, and international scales. While much research focuses on the
discovery and consumption of provided content, environmental data and model
provision is hardly addressed. Each time an expert creates new information or
develops a novel scientific algorithm it is delegated to an expert in information
and communication technology to make this content available in a given
Environmental Information Infrastructure (EII). From our point of view, this
workflow is a bottleneck in sharing environmental content that impedes the
efficient maintenance of EII. Accordingly, we have extended the classical three-
layered EII architecture with a middleware for assisted content publication and
deployment. A first implementation for data publishing is in place, while
investigations on the publication of environmental models are ongoing. In this
position paper, we briefly present the status of our work and discuss
possibilities for publishing environmental models. We point to related activities
and outline our future plans. We hope that our contribution will help to increase
content availability in EIIs in standard basis and thereby will aid content
discovery and model composition.
Keywords: INSPIRE, GEOSS, SDI, Environmental Information
Infrastructures, standard services, model web, publishing, deployment.
Introduction
Due to rising challenges of climate change and natural hazards, environmental content
sharing became a central need in environmental sciences [1]. Since many considered
phenomena, such as wild fires, floods or change in biodiversity, cross administrative
borders initiatives for establishing Environmental Information Infrastructures (EII) on
different scales emerged over the past years. Those include Infrastructure for Spatial
Information in Europe (INSPIRE) and Global Monitoring for Environment and
2 Sven Schade and Laura Díaz
Security (GMES) on European level, and Global Earth Observation System of
Systems (GEOSS) worldwide. An analysis of these three and their interplay has been
published recently [2]. In a nutshell, efforts on following INSPIRE implementing
rules in GMES are ongoing, while both can be seen as part of European contribution
to GEOSS. INSPIRE and GMES also contribute to the Shared Environmental
Information System (SEIS) initiative. The concepts behind SEIS focus mainly on
reporting but they are still evolving.
So far, EIIs assume that only Information and Communication Technology (ICT)
experts can provide content as services. This is because current EIIs are based on
Service Oriented Architectures (SOA) where services are implemented according to
international agreements and standards. The deployment mechanisms imply to
understand these standard service specifications and their implementations. ICT
experts became the only mediator between the environmental experts, who create the
content, and the infrastructure for content sharing [3] [4].
In order to improve the given situation, we suggested extending existing
architectures with components and mechanisms, which assist EII users in content
deployment [4]. The GEOSS Service Factory (GSF) realizes this proposal. In the next
section, we describe the GSF, and point to related work. As we intent to extend GSF
with deployment capabilities for (environmental) models, we discuss possibilities and
a future stepwise development. We conclude this position paper by summarizing our
findings and by outlining our future development plan.
Recent GSF Developments and Related Work
We enable content deployment by adding a (fourth) layer to the classical EII
architecture (Figure 1). This middleware layer (GSF, dashed lines in the figure), acts
as a mediator to provide content ‘as a Service’ to an EII, which is compliant with
INSPIRE and GEOSS. With the GSF, applications became able to push newly created
content into EIIs using common formats. Appropriate access services are selected
based on content types: Metadata, Data, Model, and Warning.
Applications Applications
Workflow Engine Application Logic Service Connector Workflow Engine Application Logic Service Connector
Warning
Warning Services as a
Service
Model
Geospatial Networking Services as a
Processing Services
Discovery Download View Processing Service
Data
Data and Metadata Services as a
Discovery View Download Service
Deploy
(GEOSS) Service Factory as a
Service
Geospatial Content Geospatial Content
Metadata Data Models Metadata Data Models Warnings
Figure 1: Extending the classical EII architecture (left) with GSF (right).
Supporting Content Provision in Environmental Information Infrastructures 3
It might be noticed that the service layer was extended by Warning Services in order
to address upcoming requirements for event notification. Among other information
resources, warnings are created in the application layer, for example by spatial
decision support systems that integrate available models and execute them on the
available data sets. Warning Services are used for managing all warnings that have
been published within the EII and distribute according notifications to the EII users.
These notifications may follow push or pull based approaches.
In order to provide GSF as a (web) service itself. We decided to use a common
geospatial standard, the Web Processing Service (WPS) [5]. WPS specification allows
encapsulating all kinds of functionality, and it has been proven as mature to expose
processing functionality in EII [6], therefore it looks appropriate to describe the GSF
interface to provide content publication capability. The added value is that due to the
fact that the use of WPS is increasing as well as the number of implementations both
for service and client side. To access to GSF from any application any generic WPS
client can be used [new7]1.
We designed the GSF using the Abstract Factory pattern from software
engineering [8]. The pattern provides a central entry point for content
creation/deployment (the Factory), which encapsulated task delegation to specialized
deployment components. In this way, the current implementation is easy to extend. As
an abstract factory, the GSF holds a group of concrete factories; each of them dealing
with a distinct service type and deploying content via transactional service interfaces.
A proof of concept is provided for data deployment in the context of the European
Forest Fire Information System EFFIS [9]. Here, GSF is able to provide a unique
entry point to deploy vector data (shape files), raster data (GeoTIFF) and even user
contributed content (KML) in a View Service and in a Download Service existing in
EFFIS. GSF is also able to register basic metadata in a Discovery Service. So far,
GSF is not able to deploy processing content such as environmental models, like fire
risk calculations or procedures for burned area assessment.
Much research on model deployment has been carried out, including the Model
Web concept [10], work on model decomposition [6], and outcomes of the projects
that are listed on the web page of this workshop (envip 2010). Still we require
scenarios of using GSF for model deployment and have to specify a development
plan.
Model Deployment with GFS
At this stage, we remark the following options for model deployment (order indicates
complexity, from simple to most difficult implementation):
1 We have used a generic HTTP client for testing, and a self developed java client [9] but many
projects such as 52North and uDIG already provide graphical user interfaces for WPS
execution.
4 Sven Schade and Laura Díaz
(1) Deploy conceptual model descriptions to a repository. First, we concentrate on
the description of scientific models, which create environmental information.
For example, the processing steps, required to generate a burned area map, may
be described and deployed as standard encoding of a workflow language. We
suggest using the Business Process Modeling Notation (BPMN) [11] for this
purpose. Such information helps to understand the model to generate
environmental data. Discussions on model improvements may be triggered.
However, BPMN does not provide information on model semantics. We
propose using a vocabulary, which is shared within the EII community, for this
purpose. It should be available from a shared registry and should be used for
labeling the various BPMN elements.
(2) Deploy executable files to repository. Following the initial ideas of the Model
Web [10] and in line with GEOSS, models may be provided as executables (as
*.exe, *.jar, etc) in a repository or Web Accessible Folder (WAF) following
GEOSS terminology. Users looking for models may just be provided with a
simple list of available files and their formats. For example, a package for
statistical calculations of burned area characteristics can be offered stand alone.
Execution will still require download and invocation in a suited environment,
but at least models become sharable.
In order to make such an implementation usable, we face a model description
problem. We require metadata for model evaluation and use. To make a model
executable, users will require information about: input and output parameters,
required operation systems, versions, and libraries. All may be affected by
licensing issues. Additionally, supported interfaces should be described in a
common manner. Again, descriptions of model semantics are required.
Approaches, such as Web Service Modelling Ontology (WSMO) [12], try to
address such issues for web services, but can this be projected to multiple types
of executables?
Complementary to the above, we face the practical problem of obsolescence,
i.e. required basic technology or software may simply be outdated and not
available anymore. Open archives try to tackle these problems of long-term
data preservation [13]. Nevertheless, it has to be ensured that all needs to
execute the specific environmental modeling algorithms are covered.
(3) Deploy executable model descriptions to a Processing Service. WPS has been
proven as a technology useful to expose and share processing capabilities in the
EII domain [6]. Existing WPS software like 52North implements transactional
capabilities [14], to be added to extend the upcoming version 2.0 of WPS.
Among other new functionalities WPS 2.0 is considering to deploy new
processes in running instances. In other words WPS will support the concept of
Composition as a Service (CaaS) [15].
We consider deploying executable process descriptions (using, for instance,
Business Process Execution Language (BPEL)) as a next deployment step. In
this case, chainable service instances have to be available within the EII and
the model has to be provided as an executable process description. For
example, assuming all required data is available as Download Service and each
processing task as a WPS, the complete burned areas calculation workflow
could be made available. A Processing Factory within the GSF would deploy
Supporting Content Provision in Environmental Information Infrastructures 5
that script and the composed model will be directly available as a distributed
Processing Service described with WPS interface. The BPEL deployment
functionality of the 52north implementation of WPS [14] may serve as a
starting point. However, the use of BPEL limits possible service compositions
to components, which operate via SOAP/WSDL.
(4) Deploy existing software to a Processing Service. The final step, to expose
scientific models fully, is to migrate binary-encoded model components as
Processing Services [6]. In order to assist the domain expert and to automate
this process as much as possible, we require sophisticated deployment
mechanisms and a methodology for workflow modeling with domain experts.
Challenges, such as deployment of software developed in diverse programming
languages (FORTRAN, Java, etc) must be overcome. Similar issues hold for
the operating platforms (windows, unix) in which the models have been
developed. Distributed computing, and in particular the ‘mobile code
approach’ [16], in which executable algorithms are sent across a network and
executed at distinct nodes, may provide solutions. The relations to grid
computing, cloud computing, and virtualization require further exploration.
We believe that the overall solution can only be semi-automatic. For
example, only distinct parts of EFFIS can be provided as decoupled processes,
due to software packages and dependencies. As for all other options, model
semantics still have to be defined in some form of metadata.
Conclusions and Future Work
We argued that content deployment to EIIs faces complex deployment issues and a
central bottleneck in environmental information sharing. The GEOSS Service Factory
(GSF) was proposed as a solution. Although the GSF concept is supported by a proof
of concept implementation for providing environmental data, detailed elaborations for
model deployment remain challenging. We pointed to related research and projects.
On this basis, we argued for four possible approaches for model provision. As these
complement each other, we plan to address them sequentially. Implementations will
be guided by the EFFIS example. The GSF will help to increase content availability in
EIIs and thereby will aid information discovery and model composition.
Offering GSF as a service provides means to secure deployment. According
implementations may be considered in future. This notably differs from a ‘secure’
execution of models on external machines. The latter is out of the scope of our work.
Acknowledgements
The presented work was partially founded by EUROGEOSS (FP7-ENV-2008-1-
226487). The authors thank their colleagues from the Spatial Data Infrastructure Unit
of the Joint Research Centre for numerous discussions that shaped the GSF principle.
Four anonymous reviewers provided constructive comments for improving an earlier
version of this document.
6 Sven Schade and Laura Díaz
References
[1] M.F. Goodchild, Geographic information science: the grand challenges. In J.P. Wilson and A.S.
Fotheringham, editors, The Handbook of Geographic Information Science. Malden, MA: Blackwell,,
pp. 596–608, 2008.
[2] P. Smits, S. Cox, F. Fierens, J. Schulze Althoff and A. Biancalana, GIGAS Business Model and
Exploitation Plan. GIGAS D 1.2b Annex 1, 2010.
[3] L. Díaz, Improving resource availability for Geospatial Information Infrastructures. PhD Thesis.
University Jaume I of Castellón, 2010.
[4] L. Díaz, C. Granell, M. Gould, and J. Huerta. Managing user generated information in geospatial
cyberinfrastructures. Future Generation Computer Systems. Future Generation Computer Systems, in
press..
[5] P. Schut (ed). OGC Web Processing Service (WPS) version 1.0.0. OGC Standard Document, Open
Geospatial Consortium, 2007.
[6] C. Granell, L. Díaz, and M. Gould, Service-oriented applications for environmental models: Reusable
geospatial services. Environmental Modelling and Software, vol 25, issue 2, pp. 182-198, 2010.
[7] T. Foerster and B. Schäffer, B, A Client for Distributed Geo-processing on the Web. Lecture Notes in
Computer Science (LNCS), vol. 4857, 7th International Symposium on Web and Wireless GIS
(W2GIS 2007), 252-263, 2007.
[8] E. Gamma, R. Helm, R. Johnson, and J. Vlissides, Design Patterns. Addison-Wesley, 1995.
[9] http://effis.jrc.ec.europa.eu/, last acessed 22nd of September 2010.
[10] G. Geller and F. Melton, Looking forward: Applying an ecological model web to assess impacts of
climate change, Biodiversity 9, no. 3&4, 2008.
[11] OMG, Business Process Model and Notation (BPMN), Version 1.2. Object Management Group
Standard, 2009.
[12] D. Roman, U. Keller, H. Lausen, J. de Bruijn, R. Lara, M. Stollberg, A. Polleres, C. Feier, C. Bussler,
and D. Fensel, Web Service Modeling Ontology, Applied Ontology, 1(1): 77 - 106, 2005.
[13] CCSDS, Reference Model for an Open Archival Information System (OAIS). Consultative
Committee for Space Data Systems – Blue Book, 2002.
[14] B. Schäffer, Towards a transactional Web Processing Service (WPS-T). In Proceedings of the 6th
Geographic Information Days, IfGIprints Nr. 32, Institut für Geoinformatik, Münster, 2008.
[15] M.B. Blake, W. Tan, and F. Rosenberg, Composition as a Service. IEEE Internet Computing, vol. 14,
no. 1, pp. 78-82, 2010.
[16] A. Fuggetta, G.P. Picco, and G. Vigna, Understanding Code Mobility. IEEE Transactions on Software
Engineering, 24(5), pp. 342-361, 1998.