<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Provenance Registration and Discovery using Geospatial Catalogue Service</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peng Yue</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jianya Gong</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liping Di</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lianlian He</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yaxing Wei</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mapping</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Remote Sensing</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wuhan University</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luoyu Road</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wuhan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>China</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Center for Spatial Information Science</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Systems (CSISS)</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>George Mason University</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>4400 University Drive</institution>
          ,
          <addr-line>MS 6E1, Fairfax, VA 22030</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Mathematics, Hubei University of Education</institution>
          ,
          <addr-line>Nanhuan Road 1, Wuhan, Hubei, China, 430205</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Environmental Sciences Division, Oak Ridge National Laboratory</institution>
          ,
          <addr-line>Oak Ridge, TN 37831-6407</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>- A geospatial catalogue service allows geospatial users to discover appropriate geospatial data and services in a Web-based distributed environment. Metadata for geospatial data and services is organized structurally in catalogue services. Provenance for geospatial data products, as a kind of metadata describing the derivation history of data products, can be managed in a same way as other kinds of metadata using metadata catalogue services, thus keeping consistency and interoperability with existing metadata catalogue services. Meanwhile, Semantic Web technologies have shown considerable promises for more effective connection, discovery, and integration of provenance information. This paper addresses how geospatial catalogue services can be enriched with semantic provenance. Semantic relationships defined in provenance ontologies are registered in an OGC standard-compliant CSW service by extending ebRIM elements. The work illustrates that such a semantically-enriched CSW can assist in the discovery of data, service, and knowledge level of geospatial provenance.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Keywords: Data Provenance, Lineage, GIS, CSW, ebRIM,
Geospatial Web Service</p>
    </sec>
    <sec id="sec-2">
      <title>I. INTRODUCTION</title>
      <p>
        The advancement of Earth observing technologies has
significantly increased the capability for collecting geospatial
data. The National Aeronautics and Space Administration
(NASA)’s Earth Observing System (EOS) alone is generating
1000 terabytes annually [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Significant efforts have been
devoted to make full use of the data and derive useful
information from the raw data. The Open Geospatial
Consortium (OGC)’s Web Service technologies such as the
Web Feature Service (WFS), Web Map Service (WMS), and
Web Processing Service (WPS) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] have been widely used in
geospatial domain to facilitate the open discovery of, access
to, and processing of distributed geospatial data. A geospatial
catalogue service allows geospatial users to discover
appropriate geospatial data and services in a Web-based
distributed environment. Metadata for geospatial data and
services is organized structurally in catalogue services. The
OGC’s Catalogue Services for the Web (CSW) is a domain
consensus regarding an open, standard interface for geospatial
catalogue service [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>Provenance for geospatial data products records the
derivation history of the data products. In a service-oriented
information infrastructure, geoprocessing steps in deriving a
data product are usually implemented by chaining multiple
geoprocessing services together. To derive useful data
products from large volumes of raw data, the integration of
geoprocessing services become more and more frequent.
Provenance provides important context information to help
end users make decisions about the quality of the derived data
products. Semantic Web technologies provide ways to
connect Web resources together and allow semantics of Web
resources to be machine-understandable, thus enabling more
effective discovery, automation, integration, and reuse of
resources. Semantic provenance, provenance information
represented using Semantic Web technologies, therefore, can
provide more informed understanding and effective usage of
provenance information.</p>
      <p>
        In the geospatial domain, provenance information has been
regarded as part of metadata describing data quality
information in the International Organization for
Standardization (ISO) 19115 geospatial information—
metadata standard. Similar to other kinds of geospatial
metadata managed using metadata catalogue services,
provenance information can be registered and discovered in
the metadata catalogue services to keep consistency and
interoperability with legacy geographic information system
(GIS) applications. The registration of provenance
information in the catalogue services requires the
specification of the registration information model. OGC has
recommended the ebXML Registry Information Model
(ebRIM) for registration of geospatial information, the
socalled ebRIM profile of CSW [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, the existing
standard does not address the registration of provenance
information.
      </p>
      <p>This paper explores the use of OGC CSW for registration
and query of semantic provenance. To make use of semantics
for provenance discovery in CSW, semantic relationships
defined in provenance ontologies are registered in an OGC
standard-compliant CSW service by extending ebRIM
elements. The work illustrates that such semantically-enriched
CSW can assist in the discovery of data, service, and
knowledge level of geospatial provenance. The rest of the
paper is organized as follows. Section 2 introduces the
semantic representation of provenance for geospatial data
products. Section 3 describes the ebRIM-based information
model in CSW, and Section 4 presents the registration of
semantic provenance. Section 5 describes the provenance
discovery using semantically-enriched CSW. The work is
compared with related work in Section 6, and conclusions and
pointers to future work are given in Section 7.</p>
      <p>II. SEMANTIC PROVENANCE FOR GEOSPATIAL DATA</p>
      <p>PRODUCTS</p>
      <p>
        In the context of this paper, we focus on the provenance in
a service-oriented environment in which geospatial data
products are generated by executing geoprocessing service
chains. In the general information domain, service chaining is
a hot research topic in the Web Service area and can be called
service composition. Approaches for service composition
generally follow a three-phase procedure [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5-7</xref>
        ]: (1) process
modeling, which generates an abstract process model
consisting of the control flow and data flow among process
nodes; (2) process model instantiation, where the abstract
process model is instantiated into an executable service chain;
and (3) workflow execution, where the chaining result is
executed in the workflow engine to generate the required data
product. The information involved in the three phases,
therefore, can contribute to the provenance of the data
products.
      </p>
      <p>A three-level view of semantic provenance is adopted for
the geospatial data products generated based on the
threephase procedure of service composition (Fig. 1). The first
level is the knowledge level provenance, which contains
process model ontologies as a knowledge base to support
generation of complex process models. The process model
ontologies are formulated by linking geospatial domain
DataType, ServiceType, and workflow ontologies together.
Examples of process model ontologies are atomic and
composite process models for geoprocessing services
described using the process model ontologies in the Web
Ontology Language (OWL) Service Ontology (OWL-S). The
second level is the service level provenance, which includes
the individual services and service chains. Both can be
represented using the service ontologies in OWL-S. And the
final level is the data level provenance, which contains the
provenance information generated during the execution.
Examples of provenance in this level include source,
intermediate, and final data products, atomic service
executions, and service chain executions.</p>
      <p>The ontologies for the knowledge level provenance and
service level provenance use the geospatial domain ontologies
and OWL-S ontologies. The data level provenance includes
classes and relationships for data products required or
generated by execution (ProvenanceGeoDataType class),
value bindings between parameters and their values
(ParamValueBinding class), specific executions of services
(AtomicServiceExecution class) and service chains
(CompositeServiceExecution class). Example ontologies in
OWL can be viewed online at
http://www.laits.gmu.edu/geo/nga/landslideprovenance.html .</p>
      <p>The three-level view of geospatial provenance corresponds
to the three phases of automatic service composition. The
knowledge level provenance records the process model
knowledge used to derive geospatial data products in the
process modeling phase. Using provenance at this level, users
can check the correctness of the process model and try a
different model when necessary. The service level provenance
describes concrete service chains that can be executed to
generate the geospatial data products. Using this information,
it is possible for users to re-select services based on the
performance of services. The data level provenance helps
users to find dependencies among physically-existed data
products and supports analysis applications such as error
source identification and propagation.</p>
    </sec>
    <sec id="sec-3">
      <title>III. CSW-EBRIM PROFILE</title>
      <p>
        CSW specification provides a framework for the
implementation of application profiles. The core elements in
an OGC catalogue service are the information model, the
query language, and the interface [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The information model
describes information structures and semantics of information
resources. Therefore, the information model of catalogue
services should address the content, syntax, and semantics of
geospatial resources. The ebRIM standard has been defined
by the Organization for the Advancement of Structured
Information Standards (OASIS) and selected by OGC as the
information model for specifying how catalogue content is
structured and interrelated.
      </p>
      <p>Fig. 2 shows the ebRIM-based catalogue information
model. The core metadata class is the RegistryObject. Most
other metadata classes in the information model are derived
from this class. An instance of RegistryObject may have a set
of zero or more Slot instances that serve as extensible
attributes for this RegistryObject instance. An Association
instance represents an association between a source
RegistryObject and a target RegistryObject. Each association
has an associationType attribute that identifies the type of that
association. A Classification instance classifies a
RegistryObject instance by referring to a node defined within
a ClassificationScheme instance. A ClassificationScheme
instance in the ebRIM model defines a tree structure made up
of nodes that can be used to describe a taxonomy.</p>
      <p>
        The ebRIM provides a general and standard metadata
registration information model. However, it needs to be
extended with some extension elements to meet common
requirements in the geospatial domain. Under the guidelines
of the ebRIM profile for CSW, the CSW implementation1,
developed and maintained by Laboratory for Advanced
Information Technology and Standards (LAITS) from George
Mason University [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], has extended ebRIM using
international geographic standards: ISO 19115 Geographic
Information — Metadata (including part 2: Extensions for
imagery and gridded data) and ISO 19119 Geographic
Information — Services.
      </p>
      <p>The ebRIM is extended with ISO 19115 and ISO 19119 in
two ways. The first is by importing new classes into the
ebRIM class tree, deriving new metadata classes from existing
ebRIM classes. The new Dataset class is used to describe
geographic datasets. Many new attributes are added to the
Dataset class based on ISO 19115 and its part 2. The second
way to extend ebRIM is to use Slots to extend an existing
class. The Service class included in ebRIM can be used to
describe geographic services, but the available attributes in
1 Online services are available at http://geobrain.laits.gmu.edu/ .
the class Service are not sufficient to describe geospatial Web
services. New attributes derived from ISO 19119 are added to
the Service class through Slots.</p>
    </sec>
    <sec id="sec-4">
      <title>IV. SEMANTIC PROVENANCE REGISTRATION</title>
      <p>
        The registration of semantic provenance in the CSW takes
advantages of extensibility points in ebRIM. Such
extensibility points include new kinds of classes, associations,
classifications, and additional slots to record OWL classes,
properties and related axioms. Some efforts have already
addressed the registration of OWL-based ontologies in ebRIM
[
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref9">9-12</xref>
        ]. In this study, we focus on the application and
extension of ebRIM in the provenance registration. In
particular, the paper explores how to register the OWL-based
semantic provenance in the ebRIM-based catalogue
information model to support the provenance discovery.
      </p>
      <p>
        For the knowledge level and service level provenance, we
adopt the previous approach on registration of OWL/OWL-S
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. A new type of ExtrinsicObject, named ProcessModel, is
created in the ebRIM model to describe process models.
Geospatial DataType and ServiceType ontologies are
recorded using two new ClassificationScheme instances,
which can be used to classify the ProcessModel and Dataset
instances. The Service class in the ebRIM model can be used
to describe both services and service chains, since a service
chain as a whole can act as a service. The semantics for inputs,
outputs, preconditions and effects (i.e. IOPE semantics) are
recorded by using slots.
      </p>
      <p>For the data level provenance, a new type of
ExtrinsicObject, ServiceExecution, which can support the
registration of both atomic and service chain execution, is
created. ProvenanceGeoDataType in OWL is mapped to the
existing class Dataset. Individuals of ParamValueBinding in
provenance ontologies are recorded using the slots of the
ServiceExecution. The relationships among
AtomicServiceExecution, CompositeServiceExecution, and
ProvenanceGeoDataType in provenance ontologies are
registered using associations in the ebRIM.</p>
      <p>Fig. 3 shows an execution of slope computation service,
which generates terrain slope data from the digital elevation
model (DEM) data. The knowledge level provenance is
recorded by using instances of ProcessModel whose slots
specifies the input Geospatial DataType (Terrain Elevation)
and output Geospatial DataType (Terrain Slope). The service
level provenance is recorded using instances of Service.
DescribedBy association connects a service with its process
model. Some individual geospatial services have their own
metadata constraints on the input data and this can be
recorded using slots. For example, the slope computation
service in Fig. 3 specifies that the input terrain elevation data
should be in the GeoTIFF data format with the EPSG:4326
geographic coordinate reference system. Data level
provenance includes the registration of ServiceExecution and
Dataset. A ServiceExecution is linked to the service executed
using the HasService association. The Terrain slope dataset
generated by the specific ServiceExecution is described using
the ProducedBy association. More kinds of associations can
be registered such as the HasGeoDataTypeAncestor
relationship between datasets.</p>
    </sec>
    <sec id="sec-5">
      <title>V. PROVENANCE DISCOVERY Based on the semantic content registered in the CSW, three types of provenance discoveries are achieved using CSW queries:</title>
      <p>A. Discovery for data level provenance</p>
      <p>The discovery is based on provenance associations at the
data level. Examples of CSW queries includes: collecting
descendant or ancestor datasets to a specific dataset; finding
service executions to generate a specific dataset; retrieving
parameters and values involved when conducing a specific
service execution.</p>
      <p>B. Discovery for service level provenance</p>
      <p>One discovery is to locate services or service chains used to
generate a specific geospatial data product. The query is based
on the HasService association between service executions and
services. Additional discovery includes query on the
preconditions of a specific service. The results from this query
can help check preconditions of the service to find whether
input data is semantically valid. For example, does the input
DEM data have a valid spatial projection?
C. Discovery for knowledge level provenance</p>
      <p>This is to discover process model knowledge used to derive
geospatial data products. The CSW query uses DescribedBy
association as a search condition. The process model, when
obtained, can be rechecked and compared with alternative
process models. Another query strategy is to add
semantically-matched ServiceTypes in the search condition to
find alternate process models for decision support. The
semantic match is performed based on the subsumption
reasoning in description logic.</p>
      <p>&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;csw:GetRecords …&gt;
&lt;csw:Query typeNames="ServiceExecution Association
Dataset ClassificationNode"&gt;
&lt;csw:ElementSetName&gt;full&lt;/csw:ElementSetName&gt;
&lt;csw:ElementName&gt;/ServiceExecution/&lt;/csw:ElementName
&gt;
&lt;csw:Constraint version="1.0.0"&gt;&lt;ogc:Filter&gt;&lt;ogc:And&gt;
&lt;!--temporal condition--&gt;…
&lt;!--spatial condition--&gt;…
&lt;!—ontological concept--&gt;
&lt;ogc:PropertyIsEqualTo&gt;&lt;ogc:PropertyName&gt;/Dataset/@i
d&lt;/ogc:PropertyName&gt;</p>
      <p>&lt;ogc:PropertyName&gt;/Classification/@classifiedObject&lt;/
ogc:PropertyName&gt;&lt;/ogc:PropertyIsEqualTo&gt;</p>
      <p>&lt;ogc:PropertyIsEqualTo&gt;&lt;ogc:PropertyName&gt;/Classificati
on/@classificationScheme&lt;/ogc:PropertyName&gt;</p>
      <p>&lt;ogc:PropertyName&gt;/ClassificationScheme/@id&lt;/ogc:Pr
opertyName&gt;&lt;/ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyName&gt;/ClassificationScheme/Description/
LocalizedString/@value&lt;/ogc:PropertyName&gt;
&lt;ogc:Literal&gt;geospatial data type ontology&lt;/ogc:Literal&gt;
&lt;/ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyIsEqualTo&gt;&lt;ogc:PropertyName&gt;/Classificati
on/@classificationNode&lt;/ogc:PropertyName&gt;</p>
      <p>&lt;ogc:PropertyName&gt;/ClassificationNode/@id&lt;/ogc:Prop
ertyName&gt;&lt;/ogc:PropertyIsEqualTo&gt;</p>
      <p>&lt;ogc:PropertyIsEqualTo&gt;&lt;ogc:PropertyName&gt;/Classificati
onNode/@code&lt;/ogc:PropertyName&gt;
&lt;ogc:Literal&gt;ETM_NDVI&lt;/ogc:Literal&gt;
&lt;/ogc:PropertyIsEqualTo&gt;
&lt;!--provenance association--&gt;
&lt;ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyName&gt;/Dataset/@id&lt;/ogc:PropertyName&gt;
&lt;ogc:PropertyName&gt;/Association/@sourceObject&lt;/ogc:Pr
opertyName&gt;&lt;/ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyIsEqualTo&gt;
&lt;ogc:PropertyName&gt;/ServiceExecution/@id&lt;/ogc:Propert
yName&gt;</p>
      <p>&lt;ogc:PropertyName&gt;/Association/@targetObject&lt;/ogc:Pro
pertyName&gt;&lt;/ogc:PropertyIsEqualTo&gt;...
&lt;/ogc:And&gt;&lt;/ogc:Filter&gt;&lt;/csw:Constraint&gt;&lt;/csw:Query&gt;
&lt;/csw:GetRecords&gt;</p>
      <p>All queries are realized through CSW standard query
operations. The query language is implemented using the
OGC Filter specification. It supports comparison operators
and spatial operators. An example provenance query is shown
in Fig. 4. A Web client, e.g. HTML form, can submit queries
using the GetRecords operation based on the request-response
model of the HTTP protocol.</p>
    </sec>
    <sec id="sec-6">
      <title>VI. RELATED WORK</title>
      <p>
        A substantial research on provenance issue has been
conducted in the general information domain. Traditional data
provenance issue focuses on the database systems [
        <xref ref-type="bibr" rid="ref14 ref15 ref16">14-16</xref>
        ].
With the advancement of service-oriented infrastructure in
recent years, provenance for scientific workflows or service
chains becomes an active research field [
        <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
        ]. The
international workshop on data derivation and provenance and
its follow-up workshops, namely International Provenance
and Annotation Workshop (IPAW), have been held five times
and resulted in the “provenance challenge” activities. Within
GIS domain, how to incorporate provenance support in
geospatial services is still a challenge. The use of OGC CSW
for serving geospatial provenance is compliant with existing
service standards in geospatial domain can allows easy
integration with legacy GIS applications.
      </p>
      <p>
        Some efforts have been devoted to the use of Semantic
Web technologies for representing and querying data
provenance information [
        <xref ref-type="bibr" rid="ref19 ref20 ref21 ref22">19-22</xref>
        ]. Our approach differs from
their approaches in that we use existing registry services for
management of provenance. The registration of ontologies in
ebRIM can support semantics-enhanced discovery of
information resources in registries [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref9">9-12</xref>
        ]. The work here
extends this approach in the provenance research area and
proposes the registration of semantic provenance in the
ebRIM model.
      </p>
      <p>
        Provenance investigation in GIS can be traced back to
Lanter’s [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] work on data lineage metadata. Frew et al. [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]
provide lineage support for remote sensing data processing in
a script-based environment. Wang et al. [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] proposed a
provenance-aware architecture to record the lineage of spatial
data. Tilmes and Fleig [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] discuss some general concerns of
provenance tracking for Earth science data processing systems.
Plale et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] described architectural considerations to
support provenance collection and management in
geosciences. Yue et al. [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] propose provenance capture in
geospatial service composition when instantiating a
geoprocessing model into an executable service chain. How
provenance can be integrated into existing service-oriented
GIS applications has not been addressed in the literature. In
addition, the arrangement of provenance in the CSW-ebRIM
profile facilitates the query of data, service, and knowledge
level of provenance by exploring the associations among
provenance, data, services, and chains.
      </p>
    </sec>
    <sec id="sec-7">
      <title>VII. CONCLUSION AND FUTURE WORK</title>
      <p>The ontology approach for provenance representation
provides a common vocabulary for provenance information
and defines explicitly the meaning of the terms and the
relations between them. Registration of provenance ontologies
in CSW allows users to take advantage of that benefit in
registries. This paper describes how semantic provenance can
be registered into the ebRIM-based CSW. Such a
semantically-enriched CSW provides support in discovery of
data, service, and knowledge level of geospatial provenance.
Future work includes developing user-friendly tools to
facilitate provenance registration and visualization of query
results, exploring the lifetime management of provenance
information, and developing provenance-aware applications
to demonstration advantages and usage of provenance.</p>
    </sec>
    <sec id="sec-8">
      <title>ACKNOWLEDGEMENT</title>
      <p>We are grateful to the anonymous reviewers for their
comments. This work was funded jointly by Project 40801153
supported by NSFC, 863 Program of China (2007AA120501),
LIESMARS and SKLSE (Wuhan University) Special
Research Fundings.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Clery</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Voss</surname>
          </string-name>
          , “
          <article-title>All for one and one for all</article-title>
          ,” Science,
          <volume>308</volume>
          (
          <issue>5723</issue>
          ), p.
          <fpage>809</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>OGC</surname>
          </string-name>
          , Open Geospatial Consortium, www.opengeospatial.org, [Accessed 16 May,
          <year>2010</year>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Nebert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Whiteside</surname>
          </string-name>
          , and P. Vretanos (eds),
          <source>OpenGIS® Catalog Services Specification, Version 2.0</source>
          .2, OGC 07-006r1, Open GIS Consortium Inc. 218 pp,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Martell</surname>
          </string-name>
          (ed),
          <source>CSW-ebRIM Registry Service-Part 1: ebRIM profile of CSW, Version 1.0</source>
          .0, OGC 07-110r2, Open Geospatial Consortium, Inc., 57 pp,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Koehler</surname>
          </string-name>
          , “
          <article-title>Web service composition - current solutions and open problems</article-title>
          ,” in
          <source>: Proceedings of International Conference on Automated Planning and Scheduling (ICAPS) 2003 Workshop on Planning for Web Services</source>
          , Trento, Italy, pp.
          <fpage>28</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Rao</surname>
          </string-name>
          and
          <string-name>
            <given-names>X.</given-names>
            <surname>Su</surname>
          </string-name>
          , “
          <article-title>A survey of automated web service composition methods,” in:</article-title>
          <source>Proceedings of First International Workshop on Semantic Web Services and Web Process Composition (SWSWPC</source>
          <year>2004</year>
          ), San Diego, California, USA, pp.
          <fpage>43</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Peer</surname>
          </string-name>
          ,
          <article-title>Web service composition as AI planning - a survey</article-title>
          ,
          <source>Technical report</source>
          , University of St.Gallen, Switzerland, 63 pp,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Di</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Liao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Bai</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <year>2005</year>
          . “
          <article-title>The Design and Implementation of a Gridenabled Catalogue Service,” 25th Anniversary IGARSS 2005</article-title>
          ,
          <article-title>July 25-29</article-title>
          , COEX, Seoul, Korea. pp.
          <fpage>4224</fpage>
          -
          <lpage>4227</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dogac</surname>
          </string-name>
          (ed.),
          <article-title>ebXML Registry Profile for Web Ontology Language (OWL), Version 1</article-title>
          .5,
          <string-name>
            <surname>regrep-</surname>
          </string-name>
          owlprofile-v1.
          <fpage>5</fpage>
          -cd01,
          <article-title>Organization for the Advancement of Structured Information Standards (OASIS</article-title>
          ). 76 pp,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dogac</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kabak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.B.</given-names>
            <surname>Laleci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Mattocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Najmi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Pollock</surname>
          </string-name>
          , “
          <article-title>Enhancing ebXML registries to make them OWL aware,” Distributed and Parallel</article-title>
          <source>Databases Journal</source>
          , Springer-Verlag,
          <volume>18</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>9</fpage>
          -
          <lpage>36</lpage>
          ,
          <year>July 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>W.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>He</surname>
          </string-name>
          , and W. Liu, “
          <article-title>Design and realization of ebXML registry classification model based on ontology</article-title>
          .”
          <source>In: Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'05)</source>
          ,
          <year>2005</year>
          , pp.
          <fpage>809</fpage>
          -
          <lpage>814</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bechini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tomasi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Viotto</surname>
          </string-name>
          , “
          <article-title>Enabling ontology-based document classification and management in ebXML registries,”</article-title>
          <source>In: Proceedings of the 2008 ACM symposium on Applied computing, Fortaleza</source>
          , Ceara, Brazil,
          <year>2008</year>
          , pp.
          <fpage>1145</fpage>
          -
          <lpage>1150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>P.</given-names>
            <surname>Yue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Di</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>He</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wei</surname>
          </string-name>
          , “
          <article-title>Integrating semantic web technologies and geospatial catalog services for geospatial information discovery and</article-title>
          processing in cyberinfrastructure,” GeoInformatica.
          <year>2009</year>
          . DOI:
          <volume>10</volume>
          .1007/s10707-009-0096-1.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Woodruff</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Stonebraker</surname>
          </string-name>
          , “
          <article-title>Supporting finegrained data lineage in a database visualization environment,”</article-title>
          <source>In: Proceedings of International Conference on Data Engineering (ICDE)</source>
          ,
          <year>1997</year>
          , pp.
          <fpage>91</fpage>
          -
          <lpage>102</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Widom</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Wiener</surname>
          </string-name>
          , “
          <article-title>Tracing the lineage of view data in a warehousing environment</article-title>
          ,
          <source>” ACM Transactions on Database Systems</source>
          ,
          <volume>25</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>179</fpage>
          -
          <lpage>227</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Buneman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Khanna</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W. C.</given-names>
            <surname>Tan</surname>
          </string-name>
          , “
          <article-title>Why and where: a characterization of data provenance,”</article-title>
          <source>In: Proceedings of International Conference on Database Theory (ICDT)</source>
          ,
          <year>2001</year>
          , pp.
          <fpage>316</fpage>
          -
          <lpage>330</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Simmhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Plale</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Gannon</surname>
          </string-name>
          , “
          <article-title>A survey of data provenance in e-science,” SIGMOD Record</article-title>
          , vol.
          <volume>34</volume>
          , pp.
          <fpage>31</fpage>
          -
          <lpage>36</lpage>
          ,
          <year>2005</year>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Miles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Groth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Branco</surname>
          </string-name>
          , and L. Moreau, “
          <article-title>The requirements of using provenance in e-Science experiments</article-title>
          ,
          <source>” Journal of Grid Computing</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>25</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Goble</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Greenwood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wroe</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Stevens</surname>
          </string-name>
          , “
          <article-title>Annotating, linking and browsing provenance logs for e-Science,”</article-title>
          <source>In: Proceedings of the Workshop on Semantic Web Technologies for Searching and Retrieving Scientific Data, FL., USA</source>
          , 6 pp,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J.</given-names>
            <surname>Golbeck</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          , “
          <article-title>A semantic web approach to the provenance challenge,” The First Provenance Challenge (this issue)</article-title>
          ,
          <source>Concurrency and Computation: Practice and Experience</source>
          ,
          <volume>20</volume>
          (
          <issue>5</issue>
          ), pp.
          <fpage>431</fpage>
          -
          <lpage>439</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.S.</given-names>
            <surname>Sahoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sheth</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Henson</surname>
          </string-name>
          , “
          <article-title>Semantic provenance for eScience: managing the deluge of scientific data</article-title>
          ,
          <source>” IEEE Internet Computing</source>
          ,
          <volume>12</volume>
          (
          <issue>4</issue>
          ), pp.
          <fpage>46</fpage>
          -
          <lpage>54</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zednik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. P.</surname>
          </string-name>
          da Silva, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Chang</surname>
          </string-name>
          , “
          <article-title>Semantic provenance for science data products: application to image data processing,”</article-title>
          <source>in: Proceedings of the First International Workshop on the Role of Semantic Web in Provenance Management (SWPM</source>
          <year>2009</year>
          ), Washington DC, USA, CEUR-WS, vol.
          <volume>526</volume>
          ,
          <string-name>
            <surname>October</surname>
            <given-names>25</given-names>
          </string-name>
          <year>2009</year>
          , 7 pp.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Lanter</surname>
          </string-name>
          , “
          <article-title>Design of a lineage-based meta-data base for GIS,” Cartography and Geographic Information Systems</article-title>
          , vol.
          <volume>18</volume>
          , No.
          <issue>4</issue>
          , pp.
          <fpage>255</fpage>
          -
          <lpage>261</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>J.</given-names>
            <surname>Frew</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Metzger</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Slaughter</surname>
          </string-name>
          , “
          <article-title>Automatic capture and reconstruction of computational provenance</article-title>
          ,
          <source>” Concurrency and Computation: Practice and Experience</source>
          .
          <volume>20</volume>
          (
          <issue>5</issue>
          ). John Wiley &amp; Sons, Ltd. pp.
          <fpage>485</fpage>
          -
          <lpage>496</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Padmanabhan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Myers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Tang</surname>
          </string-name>
          , and Y. Liu, “
          <article-title>Towards provenance-aware geographic information systems,”</article-title>
          <source>In: Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems (ACM GIS</source>
          <year>2008</year>
          ).
          <year>4pp</year>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>C.</given-names>
            <surname>Tilmes</surname>
          </string-name>
          and
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Fleig</surname>
          </string-name>
          , “
          <article-title>Provenance tracking in an earth science data processing system</article-title>
          ,” In: Proceedings of Second International Provenance and Annotation Workshop (IPAW),
          <source>LNCS 5272</source>
          , pp.
          <fpage>221</fpage>
          -
          <lpage>228</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>B.</given-names>
            <surname>Plale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Herath</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          , “
          <article-title>Data provenance for preservation of digital geoscience data,” Geological Society of America (GSA)</article-title>
          ,
          <source>Memoir</source>
          Volume
          <volume>12</volume>
          ,
          <year>14pp</year>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>P.</given-names>
            <surname>Yue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gong</surname>
          </string-name>
          , and L. Di, “
          <article-title>Augmenting Geospatial Data Provenance through Metadata Tracking in Geospatial Service Chaining,”</article-title>
          <source>Computers &amp; Geosciences</source>
          , vol.
          <volume>36</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>270</fpage>
          -
          <lpage>281</lpage>
          ,
          <year>2010</year>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>