<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Work ow-Centric Research Ob jects: First Class Citizens in Scholarly Discourse</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Khalid Belhajjame</string-name>
          <email>khalidb@cs.man.ac.uk</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oscar Corcho</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Daniel Garijo</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jun Zhao</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Missier</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Newman</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Raul Palma</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sean Bechhofer</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Esteban Garc a Cuesta</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jose Manuel Gomez-Perez</string-name>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Graham Klyne</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kevin Page</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Roos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jose Enrique Ruiz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stian Soiland-Reyes</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lourdes Verdes-Montenegro</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David De Roure</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carole A. Goble</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Instituto de Astrof sica de Andaluc a</institution>
          ,
          <addr-line>CSIC</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Leiden University Medical Centre</institution>
          ,
          <country country="NL">Netherlands</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Newcastle University</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Ontology Engineering Group, Universidad Politecnica de Madrid</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Poznan Supercomputing and Networking Center</institution>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>University of Oxford</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>iSOCO</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>A work ow-centric research object bundles a work ow, the provenance of the results obtained by its enactment, other digital objects that are relevant for the experiment (papers, datasets, etc.), and annotations that semantically describe all these objects. In this paper, we propose a model to specify work ow-centric research objects, and show how the model can be grounded using semantic technologies and existing vocabularies, in particular the Object Reuse and Exchange (ORE) model and the Annotation Ontology (AO). We describe the life-cycle of a research object, which resembles the life-cycle of a scienti c experiment.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Scienti c work ows are used to describe series of structured activities and
computations that arise in scienti c problem-solving, providing scientists from
virtually any discipline with a means to specify and enact their experiments [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
From a computational perspective, such experiments (work ows) can be de ned
as directed acyclic graphs where the nodes correspond to analysis operations,
which can be supplied locally or by third party web services, and where the
edges specify the ow of data between those operations.
      </p>
      <p>
        Besides being useful to describe and execute computations, work ows also
allow encoding of scienti c methods and know-how. Hence they are valuable
objects from a scholarly point of view, for several reasons: (i) to allow assessment
of the reproducability of results; (ii) to be reused by the same or by a di
erent scientist; (iii) to be repurposed for other goals than those for which it was
originally built; (iv) to validate the method that led to a new scienti c insight;
(v) to serve as live-tutorials, exposing how to take advantage of existing data
infrastructure, etc. This follows a trend that can be observed in disciplines such
as Biology and Astronomy, with other types of objects, such as databases,
increasingly becoming part of the research outcomes of an individual or a group,
and hence also being shared, cited, reused, versioned, etc. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
      </p>
      <p>However, the use of work ow speci cations on their own does not
guarantee to support reusability, shareability, reproducibility, or better understanding
of scienti c methods. Work ow environment tools evolve across the years, or
they may even disappear. The services and tools used by the work ow may
change or evolve too. Finally, the data used by the work ow may be updated
or no longer available. To overcome these issues, additional information may be
needed. This includes annotations to describe the operations performed by the
work ow; annotations to provide details like authors, versions, citations, etc.;
links to other resources, such as the provenance of the results obtained by
executing the work ow, datasets used as input, etc.. Such additional annotations
enable a comprehensive view of the experiment, and encourage inspection of
the di erent elements of that experiment, providing the scientist with a picture
of the strengths and weaknesses of the digital experiment in relation to decay,
adaptability, stability, etc.</p>
      <p>
        These richly annotation objects are what we call work ow-centric research
objects. The notion of Research Object has been introduced in previous work
[
        <xref ref-type="bibr" rid="ref1 ref19 ref20">20, 19, 1</xref>
        ] { here we focus on Research Objects that encapsulate scienti c
workows (hence work ow-centric). In particular, we build on earlier work on
myExperiment packs, which are bundles that contain elements such as work ows,
documents and presentations [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Other related work is presented in Section
2. In this paper we extend that work making the following contributions: we
present a model for specifying work ow-centric research objects (Section 3), and
show how it is grounded using semantic technologies; and we characterise and
de ne their lifecycle, illustrating how they evolve over time to be augmented
with provenance of the work ow results and semantic annotations (Section 4).
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>In certain disciplines (e.g., life sciences), scienti c communication channels like
journals encourage or mandate authors of submitted papers to include
information about the methods used to reach the conclusions claimed in the paper.
This has the aim of promoting reproducibility and reuse of the scienti c results
reported on those papers. For example, most 'wet lab' life science journal papers
must contain a `materials and methods' section that describes the details about
the experiments that the authors conducted. These journals typically have strict
rules about how to formulate these sections, but from a computational point
of view it is weakly structured; hence they are still hard for other scientists to
discover and reuse.</p>
      <p>
        The practice of conveying computational methods in a standardised and
highly structured way has had less time to evolve in many areas of science.
Some journals are also encouraging authors to make available the data and
software that have been used and produced, that is, to make data and processes used
part of the published work [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. For example, Bioinformatics1 considers software
availability as an important prerequisite to the acceptance of the paper. And
the NASA ADS (Astrophysics Data System)2 is linking and referencing papers,
references to the journal, data behind the plots used in the papers, catalogues
of objects used (as URL references), software used (as URL references to the
Astrophysics Source Code Library), instrument used to gather the
observed/input data, and the proposal submitted to ask for observation time. These are
important steps forward to promote sharing and reuse. However, software and
data availability may not be su cient to check the reproducibility of results, as
described in the introduction.
      </p>
      <p>
        As stated in the introduction, our model is built on earlier work on
myExperiment packs [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], which aggregate elements such as work ows, documents
and datasets together, following Web 2.0 and Linked Data principles [
        <xref ref-type="bibr" rid="ref17 ref18">18, 17</xref>
        ].
The myExperiment ontology [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], which forms the basis for our research object
model, has been designed such that it can be easily aligned with existing
ontologies. For instance, their elements can be assigned annotations comparable to
those de ned by Open Annotation Collaboration (OAC).
      </p>
      <p>
        One important aspect of our work is that we make use of abstract
workow templates as a means to annotate work ow templates, facilitating work ow
speci cation (as done by Gil et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Ludascher et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). Scientists
describe a work ow by identifying abstract tasks and specifying scienti c analyses
using semantic concepts from an underlying domain ontology. The speci ed
abstract work ow is then mapped to a concrete work ow using mappings that
specify for each task the underlying service operations that can be used for its
implementations.
      </p>
      <p>
        Our work is complementary to the above proposals in the sense that, in
addition to semantic annotations of work ows, we exploit provenance of work ow
results to describe work ow templates. In this context, similar proposals are
CrowdLab [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which provides users with the means for publishing data as well
as work ows and the provenance of their results to promote the reproducibility
of such results, Janus [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and OPMW [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Here we leverage semantic
technologies and underline the importance of annotations, which we hope will yield a
wide adoption of research objects among scientists. Besides, we allow connecting
more elements to the work ow: alternative material, alternative web services,
bibliography, the proposal that led to the work ow/experiment, etc.
      </p>
      <p>A clear demand from domains such as bioinformatics and astronomy is the
ability to understand a work ow, for which elements outside of the work ow are
often needed.
3</p>
    </sec>
    <sec id="sec-3">
      <title>A Model for Work ow-Centric Research Objects</title>
      <p>Our work ow-centric research object model aims at providing support for the
description of the scienti c processes described in the previous section in a machine
1 http://bioinformatics.oxfordjournals.org/
2 http://labs.adsabs.harvard.edu/
processable format, together with the datasets involved, the results obtained, and
their provenance information. The research object will be also accompanied with
annotations, which will promote the discover-ability, and therefore the
reusability of the processes (work ows), as well as enabling third parties to assess the
validity and reproducibility of the results.</p>
      <p>Figure 1 illustrates a coarse-grained view of a work ow-centric research
object, which aggregates a number of resources, namely:
{ a work ow template, which de nes the work ow;
{ work ow runs obtained by enacting the work ow template
{ other artifacts which can be of di erent kinds, e.g., a paper that describes
the research, datasets used in the experiments, etc.;
{ annotations describing the aforementioned elements and their relationships.
Fig. 1: Work ow-centric research object as an aggregation of resources. CHECK IF IT
IS OK</p>
      <p>
        Figure 2 provides a more detailed view of the resources that compose
workow templates and work ow runs. A work ow template is a graph in which the
nodes are processes and the edges represent data links that connect the output
of a given process to the input of another process, specifying that the artifacts
produced by the former are used to feed the latter. A process is used to describe a
class of actions that when enacted give rise to process runs. The process speci es
the software component (e.g., web service) responsible for undertaking the
action. Note that some work ow systems may specify in addition to the data ow,
the control ow, which speci es temporal dependencies and conditional ows
between processes. We chose to con ne the work ow research object model to
data-driven work ows, as in Taverna [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], Triana [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the process run Network
Director supplied by Kepler [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], Galaxy3, Wings [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], etc.
      </p>
      <p>Figure 3-b illustrates an example of a work ow template that is composed
of two processes. Such a work ow describes an in-silico bioinformatics experiment
that is used to identify gene pathways. Speci cally, the work ow is composed of
two processes: given a protein accession, the GetKeggGeneId process is used to
3 http://galaxy.psu.edu/
retrieve the corresponding gene ID. The gene ID retrieved is then used to feed
the GetKeggPathway process, which returns the corresponding pathways. Note
that we also support work ow instances, which are work ow templates with
the inputs bound to data values. We also distinguish between standard input
parameters and con guration input parameters. Con guration input parameters
are used to set the algorithm, the underlying sources used by the processes that
compose a work ow template and so on. In addition, the processes that compose
a work ow template are not always bound to a software component, rather they
can be performed manually in which case they are associated with a human
agent.</p>
      <p>
        A work ow template can be instantiated and enacted using a work ow
engine, e.g., Taverna. This gives rise to a work ow run that speci es the process
runs that were obtained by executing the processes that constitute the
workow template in question. For example, when the action speci ed by the process
is undertaken by a web service, the process run obtained by enacting such a
process represents a web service call. A process run may take as input some
existing artifacts, speci ed by the used association, and output some new artifacts,
speci ed by the wasGeneratedBy association. Artifact is a general concept that
represents an immutable piece of state, which may have a physical embodiment
in a physical object, or a digital representation in a computer system [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In
the context of work ow-centric research objects, the focus is on artifacts that
are digital representations in a computer system. It is worth mentioning that the
notion of process run and artifact that we use are aligned with major provenance
models such as the Open Provenance Model (OPM) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and PROV-DM4.
      </p>
      <p>Figure 3-c illustrates an example of a work ow run that is obtained
by enacting the work ow template together with the provenance of the
results produced by the work ow run, which are depicted in Figure 3-b.
Get4 http://www.w3.org/TR/2011/WD-prov-dm-20111018</p>
      <p>GeneIdRun, and GetGenePathwayRun are process runs that were obtained
by enacting the GetGeneId and GetGenePathway processes, respectively.
GetGeneIdRun took as input the protein accession up:11005 and generated the
gene id syf:Synpcc7942 0655, the process run GetGenePathwayRun then used
syf:Synpcc7942 0655 to generate the pathway path:syf00195.</p>
      <p>It is important to highlight that scientists can annotate the elements of a
work ow-centric research object (along with the research object itself). They
can specify the title of a research object, its purpose, its version, ownership,
citations, etc. A more accurate form of annotation can be used to describe the
elements of a research object by linking them to concepts from domain ontologies.
In particular, this kind of annotation can be used to e ectively browse and query
work ow templates.</p>
      <p>
        Finally, work ow templates can be annotated in an abstract work ow
template, which is a graph of abstract processes that are connected by data
links. The abstract processes and their input and output parameters are labeled
with concepts from underlying domain ontologies, e.g., [
        <xref ref-type="bibr" rid="ref21 ref22">21, 22</xref>
        ], which specify
the tasks performed by the steps and the semantic domains of their parameters,
respectively. An abstract worklfow template awf, which is used to annotate a
given work ow template wf, has the same data ow topology as wf. The abstract
processes that compose awf annotate the processes in wf, and the parameter
domains in awf specify the semantic domains of the process parameters in wf. As
an example, Figure 3-a illustrates an abstract work ow template that
semantically describes the work ow template depicted in Figure 3-b. ProteinAcc to Gene
and Gene to Pathway are two concepts that specify the tasks of the processes
GetKeggGeneId and GetKeggPathway, respectively, whereas ProteinAccession,
GeneId and Pathway are concepts that specify the domain of the input and
output parameters of such processes.
3.1
      </p>
      <p>Grounding Work ow-centric Research Objects Using Semantic
Technologies
Work ow-centric research objects are encoded using RDF5, according to a set
of ontologies that we have made available6.</p>
      <p>Following myExperiment packs, research objects use the Object Exchange
and Reuse (ORE) model7, to represent aggregation. ORE de nes standards for
the description and exchange of aggregations of Web resources. Using ORE, a
work ow-centric research object is de ned as a resource that aggregates other
resources, i.e., work ow(s), provenance, other objects and annotations. For
example, the RDF turtle snippet illustrated below speci es that a research object
identi ed by :wro aggregates a work ow template :pathway wf sp, a work ow
run :pathway wf run, and an annotation :wf annot.</p>
      <p>Example of a research object de ned as an ORE aggregation
: wro a : WorkflowResearchObject , ore : Aggregation ;
ore : aggregates : pathway wf sp ,
: pathway wf run ,
: w f a n n o t .
: p a t h w a y w f s p a : WorkflowTemplate .
: p a t h w a y w f r u n a : WorkflowRun .
: w f a n n o t a ao : Annotation .</p>
      <p>We also use the Annotation Ontology (AO)8, which provides a common
model for annotating resources. This di ers from myExperiment packs, which
use a vocabulary that is mapped to Open Annotation Collaboration (OAC)910.
Several types of annotations are supported by the Annotation Ontology, e.g.,
comments, textual annotations (classic tags) and semantic annotations which
relate elements of the research objects to concepts from underlying domain
ontologies. As an example, the RDF turtle snippet below shows how the abstract
work ow template illustrated in Figure 3-a can be speci ed using a named graph
:pathway abs wf graph. It also shows how, using Annotation Ontology, such
5 http://www.w3.org/RDF
67 hhttttpp:/://w/wwww.wwf4e.voepr-epnroajerccth.oirvge/ws.iokir/gd/isoplraey//d1o.c0s//Rtoesce.ahrcthm+Olbject+Vocabulary+Specification
8 http://code.google.com/p/annotation-ontology
9 www.openannotation.org
10 Note that work is currently underway to align the two annotation vocabularies:
http://www.w3.org/community/openannotation/
an abstract work ow template can be used to annotate the work ow template
:pathway wf sp, which is depicted in Figure 3-b. Speci cally, a resource
representing the annotation, :wf annot, is created to link the work ow template
which is subject to annotation, :pathway wf sp, to the named graph specifying
the corresponding abstract work ow template, :pathway abs wf graph.
Example illustrating how a work ow template can be annotated using AO</p>
    </sec>
    <sec id="sec-4">
      <title>The Lifecycle of a Work ow-Centric Research Object</title>
      <p>We will now illustrate research object lifecycle through a small example that
shows how all the resources contained in a research object are bundled as the
scienti c experiment progresses. This example lifecycle is summarized
graphically in Figure 4.</p>
      <p>A research object normally starts its life as an empty Live Research
Object, with a rst design of the experiments to be performed (which determines
what work ows and resources will be added, by either retrieving them from
an existing platform or creating them from scratch). Then the research object
is lled incrementally by aggregating such work ows that are being created,
reused or re-purposed, datasets, documents, etc. Any of these components can
be changed at any point in time, removed, etc.</p>
      <p>In our scenario, we observe several points in time when this Live Research
Object gets copied and kept into a Research Object snapshot, which aims
to re ect the status of the research object at a given point in time. Such a
snapshot may be useful to release the current version of the research outcome of an
experiment, submit it to be peer reviewed or to be published (with the
appropriate access control mechanisms), share it with supervisors or collaborators, or
for acknowledgement and citation purposes.</p>
      <p>A snapshot may also contain a paper describing the research object in general
and the experiment in particular, depending on the policies of the corresponding
scienti c communication channel, e.g., workshop, conference or journal. Such
snapshots have their own identi ers, and may even be preserved, since it may be
useful to be able to track the evolution of the research object over time, so as to
allow, for example, retrieval of a previous state of the research object, reporting
to funding agencies the evolution of the research conducted, etc.</p>
      <p>At some point in time, the research object may get published and archived, in
what we know as an Archived Research Object, with a permanent identi er.
Such a version of our research object may be the result of copying completely
our Live Research Object, or it may be the result of some ltering or curation
process where only some parts of the information available in the aggregation
are actually published for others to reuse. As illustrated in Figure 4, a user can
use an existing Archived Research Object as a starting point to his or her
research, e.g., to repurpose it or its parts, in which case a new Live Research
Object is created based on the existing Archived Research Object.</p>
      <p>This is only one of the many potential scenarios that could be foreseen for
the lifecycle of a work ow-centric research object and we are currently de ning
di erent storyboards for their evolution. One important aspect to highlight is
the fact that during its whole lifecycle, the research object is aggregating new
objects. The annotation process during the lifecycle of experimentation allows the
generation of su cient metadata about the research objects to support
preservation and sharing. Therefore, when a scientists decides to preserve it most of
the annotations that will be needed for that preservation process will be already
available inside the research object.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Further Work</title>
      <p>Scienti c work ows are used by scientists not only as computational units that
encode scienti c methods that can be shared among scientists, but also to specify
their experiments. In this paper we presented a research object model to capture
all the needed information and data including the methods (work ows) and other
elements: namely annotations, datasets, provenance of the work ow results, etc.</p>
      <p>We showed how this model has been implemented using semantic technologies
reusing existing vocabularies, so that scientists are now able to query and publish
their experiments according to existing standards. As a result, experiments may
be more interoperable, since they are recorded with the same general model
to describe them; they can be reused more easily; and decay can be better
handled by representing the information of the templates and the traces in an
environment/execution independent manner.</p>
      <p>The work reported in this paper is preliminary. Our ongoing work includes
the design of an architecture for the management of work ow-centric research
objects, based on the model presented in this paper, which is being
implemented and made available in the Wf4Ever sandbox
(http://sandbox.wf4everproject.eu/). We are also currently validating the model presented in this paper
by creating research objects for existing work ows that are stored within the
myExperiment repository. In doing so, we are examining issues that have to do
with the decay of work ow, mechanisms for querying research objects, and
scalability. As well as the technical challenges, we are aware that there are social
challenges that need to be overcome to encourage scientists to adopt research
object as a unit for publication, discovery and reuse of scienti c
communications. In this respect, we started collaborating with scientists from the European
projects BioVeL (Biodiversity Virtual e-Laboratory)11 and SCAPE (SCAlable
Preservation Environments12).</p>
      <p>Acknowledgements
The research reported in this paper is supported by the Wf4Ever project
(http://www.wf4ever-project.org), Project 270129 funded under EU FP7 Digital
Libraries and Digital Preservation (ICT-2009.4.1).
11 http://www.biovel.eu
12 http://www.scape-project.eu</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>S.</given-names>
            <surname>Bechhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Buchan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. D.</given-names>
            <surname>Roure</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Missier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ainsworth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bhagat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Couch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cruickshank</surname>
          </string-name>
          , and Et Al.
          <article-title>Why linked data is not enough for scientists</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>David</given-names>
            <surname>Churches</surname>
          </string-name>
          and Et Al.
          <article-title>Programming scienti c and distributed work ow with triana services</article-title>
          .
          <source>Concurrency and Computation: Practice and Experience</source>
          ,
          <volume>18</volume>
          (
          <issue>10</issue>
          ):
          <volume>1021</volume>
          {
          <fpage>1037</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Ewa</given-names>
            <surname>Deelman</surname>
          </string-name>
          and Et Al.
          <article-title>Work ows and e-science: An overview of work ow system features and capabilities</article-title>
          .
          <source>FGCS</source>
          ,
          <volume>25</volume>
          (
          <issue>5</issue>
          ):
          <volume>528</volume>
          {
          <fpage>540</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Lei</given-names>
            <surname>Dou</surname>
          </string-name>
          and Et Al.
          <article-title>Scienti c work ow design 2.0: Demonstrating streaming data collections in kepler</article-title>
          .
          <source>In ICDE</source>
          , pages
          <volume>1296</volume>
          {
          <fpage>1299</fpage>
          . IEEE Computer Society,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Garijo</surname>
          </string-name>
          and
          <string-name>
            <given-names>Yolanda</given-names>
            <surname>Gil</surname>
          </string-name>
          .
          <article-title>A new approach for publishing work ows: Abstractions, standards, and linked data</article-title>
          .
          <source>In Proceedings of the Sixth Workshop on Work ows in Support of Large-Scale Science (WORKS'11)</source>
          ,
          <source>held in conjunction with SC</source>
          <year>2011</year>
          , Seattle, Washington,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Yolanda</given-names>
            <surname>Gil</surname>
          </string-name>
          and Et Al.
          <article-title>Mind your metadata: Exploiting semantics for con guration, adaptation, and provenance in scienti c work ows</article-title>
          .
          <source>In International Semantic Web Conference (2)</source>
          , pages
          <fpage>65</fpage>
          {
          <fpage>80</fpage>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Yolanda</given-names>
            <surname>Gil</surname>
          </string-name>
          , Varun Ratnakar, Jihie Kim, Pedro Antonio Gonzalez-Calero, Paul Groth, Joshua Moody, and Ewa Deelman.
          <article-title>Wings: Intelligent work ow-based design of computational experiments</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>26</volume>
          (
          <issue>1</issue>
          ),
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Darrel</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Ince</surname>
          </string-name>
          , Leslie Hatton, and John Graham-Cumming.
          <article-title>The case for open computer programs</article-title>
          .
          <source>Nature</source>
          ,
          <volume>482</volume>
          (
          <issue>7386</issue>
          ):
          <volume>485</volume>
          {
          <fpage>488</fpage>
          , 02
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. Bertram Ludascher, Ilkay Altintas, and
          <string-name>
            <given-names>Amarnath</given-names>
            <surname>Gupta</surname>
          </string-name>
          .
          <article-title>Compiling abstract scienti c work ows into web service work ows</article-title>
          .
          <source>In SSDBM</source>
          , pages
          <volume>251</volume>
          {
          <fpage>254</fpage>
          . IEEE Computer Society,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Phillip</surname>
            <given-names>Mates</given-names>
          </string-name>
          , Emanuele Santos, Juliana Freire, and
          <string-name>
            <surname>Claudio</surname>
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Silva</surname>
          </string-name>
          . Crowdlabs:
          <article-title>Social analysis and visualization for the sciences</article-title>
          .
          <source>In SSDBM</source>
          , pages
          <volume>555</volume>
          {
          <fpage>564</fpage>
          . Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Jill</surname>
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Mesirov</surname>
          </string-name>
          . Accessible reproducible research. Science,
          <volume>327</volume>
          (
          <issue>5964</issue>
          ):
          <volume>415</volume>
          {
          <fpage>416</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Paolo</surname>
            <given-names>Missier</given-names>
          </string-name>
          , Satya S Sahoo, Jun Zhao,
          <string-name>
            <given-names>Carole</given-names>
            <surname>Goble</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Amit</given-names>
            <surname>Sheth</surname>
          </string-name>
          .
          <article-title>Janus: from work ows to semantic provenance and linked open data</article-title>
          .
          <source>Life Sciences</source>
          ,
          <volume>6378</volume>
          (i):
          <volume>129</volume>
          {
          <fpage>141</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Luc</given-names>
            <surname>Moreau</surname>
          </string-name>
          and Et Al.
          <article-title>The open provenance model core speci cation (v1.1). Future Generation Comp</article-title>
          . Syst.,
          <volume>27</volume>
          (
          <issue>6</issue>
          ):
          <volume>743</volume>
          {
          <fpage>756</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>David</given-names>
            <surname>Newman</surname>
          </string-name>
          .
          <article-title>The Building and Application of a Semantic Platform for an e-Research Society</article-title>
          .
          <source>PhD thesis</source>
          , UNIVERSITY OF SOUTHAMPTON,
          <year>2011</year>
          . Submitted on
          <year>October 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. David Newman,
          <article-title>Sean bechhofer</article-title>
          , and David De Roure.
          <article-title>myexperiment: An ontology for e-research</article-title>
          . In Workshop on Semantic Web Applications in Scienti c Discourse in conjunction with the
          <source>International Semantic Web Conference</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Thomas</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Oinn</surname>
          </string-name>
          and Et Al.
          <article-title>Taverna: lessons in creating a work ow environment for the life sciences</article-title>
          .
          <source>Concurrency and Computation: Practice and Experience</source>
          ,
          <volume>18</volume>
          (
          <issue>10</issue>
          ):
          <volume>1067</volume>
          {
          <fpage>1100</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kevin R. Page</surname>
          </string-name>
          , David De Roure, and Et Al.
          <article-title>Rest and linked data: a match made for domain driven development</article-title>
          ? In 2nd International Workshop on RESTful Design (
          <article-title>WS-REST 2011) held in conjunction with</article-title>
          <source>WWW</source>
          <year>2011</year>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>David De Roure</surname>
          </string-name>
          and Et Al.
          <article-title>The evolution of myexperiment</article-title>
          .
          <source>In e-Science</source>
          <year>2010</year>
          . IEEE,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19. David De Roure, Sean Bechhofer, Carole A.
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>and David R.</given-names>
          </string-name>
          <string-name>
            <surname>Newman</surname>
          </string-name>
          .
          <article-title>Scienti c social objects: The social objects and multidimensional network of the myexperiment website</article-title>
          . In SocialCom/PASSAT. IEEE,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. David De Roure, Khalid Belhajjame, and Et Al.
          <article-title>Towards the preservation of scienti c work ows</article-title>
          .
          <source>In Procs. of the 8th International Conference on Preservation of Digital Objects (iPRES</source>
          <year>2011</year>
          ). ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Vuong Xuan Tran and
          <string-name>
            <given-names>Hidekazu</given-names>
            <surname>Tsuji</surname>
          </string-name>
          .
          <article-title>Owl-t: A task ontology language for automatic service composition</article-title>
          .
          <source>In ICWS</source>
          , pages
          <volume>1164</volume>
          {
          <fpage>1167</fpage>
          . IEEE Computer Society,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Chris</surname>
            <given-names>Wroe</given-names>
          </string-name>
          , Robert Stevens, Carole A.
          <string-name>
            <surname>Goble</surname>
            , Angus Roberts, and
            <given-names>R. Mark</given-names>
          </string-name>
          <string-name>
            <surname>Greenwood</surname>
          </string-name>
          .
          <article-title>A suite of daml+oil ontologies to describe bioinformatics web services and data</article-title>
          .
          <source>Int. J. Cooperative Inf. Syst.</source>
          ,
          <volume>12</volume>
          (
          <issue>2</issue>
          ):
          <volume>197</volume>
          {
          <fpage>224</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>