<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Supporting Process Development in Bio-jETI by Model Checking and Synthesis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna-Lena Lamprecht</string-name>
          <email>anna-lena.lamprecht@cs.tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tiziana Margaria</string-name>
          <email>margaria@cs.uni-potsdam.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bernhard Ste en</string-name>
          <email>steffen@cs.tu-dortmund.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Chair for Programming Systems, Dortmund University of Technology</institution>
          ,
          <addr-line>44227 Dortmund</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Chair for Service and Software Engineering, Potsdam University</institution>
          ,
          <addr-line>14482 Potsdam</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Bio-jETI is a platform for the intuitive graphical design and execution of bioinformatics work ows composed from heterogeneous remote services. In this paper we use a simple phylogenetic analysis process to show how formal approaches like model checking and process synthesis can be applied to further support the work ow development in Bio-jETI. To unfold their full potential these methods need a comprehensive knowledge base about the domain, containing semantic information about the single services as well as ontological classi cations of the used terms. We outline how to systematically integrate these semantic web concepts into our framework and discuss the implications on checking and synthesis.</p>
      </abstract>
      <kwd-group>
        <kwd>bioinformatics work ows</kwd>
        <kwd>process veri cation</kwd>
        <kwd>compliance</kwd>
        <kwd>model checking</kwd>
        <kwd>work ow synthesis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>More than in other domains the heterogeneous services world in bioinformatics
demands for a good methodology to classify and relate resources in a both human
and machine accessible manner. The development of semantically annotated
services and comprehensive service and data ontologies in the domain is already
very promising. Yet software is needed that fully utilizes the available semantic
information in order to provide helpful tools to the in silico researcher.</p>
      <p>
        In the last two years we introduced Bio-jETI [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] as an environment for service
integration, service orchestration, process execution, and process deployment in
the bioinformatics domain. Technically, Bio-jETI builds on the jABC modeling
framework [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] that has been used so far in several di erent domains of science,
education, and industry. We have, however, not yet exploited some techniques
that are available in the jABC framework and would enable Bio-jETI to support
the development of processes in terms of service semantics.
      </p>
      <p>
        For instance, the jABC provides model checking techniques [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] that can be
used for reasoning about properties of process models, and work ow synthesis
methodology [4{6] that allows for the automatic creation of work ows according
to formal speci cations. Considering a simple phylogenetic analysis work ow,
we show an example for problem detection via model checking (section 2.1) and
for subsequent service insertion by means of synthesis (section 2.2).
      </p>
      <p>It is essential for our methods to have proper semantic information available.
On the one hand, we need predicates characterizing the single services, i.e. their
function and their input/output behaviour. On the other hand, taxonomies or
ontologies are required which provide the domain knowledge against which the
services (their predicates) are classi ed. We discuss the open issues and future
work regarding the integration of semantic information about web services, and
implications on model checking and synthesis in section 3. Finally, section 4
draws a short conclusion.
2</p>
      <p>
        Example: Developing a Phylogenetic Analysis Work ow
Using the jABC technology, Bio-jETI provides a framework for the graphical
orchestration of bioinformatics processes. Process models, called Service Logic
Graphs (SLGs) are constructed graphically by placing process building blocks,
called Service Independent Building Blocks (SIBs), on a canvas and connecting
them according to the ow of control. These SLGs are directly executable by an
interpreter component, and they can be transformed into stand-alone
applications via the GeneSys code generation framework [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], or compiled and deployed
as a new web service using jETI [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Furthermore, they are accessible to other
techniques that exploit the structure of the control- ow process models, such as
model checking [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and work ow synthesis [4{6].
      </p>
      <p>For our example we assume a small SIB collection as presented in gure 1,
comprising various remote and local services. It contains data retrieval services
provided by the European Bioinformatics Institute, sequence analysis algorithms
o ered by the Bielefeld Bioinformatics Server, homology search services hosted
by the DNA Data Bank of Japan, and some tools of the EMBOSS suite, as well as
some locally available services, such as a specialized visualizer for phylogenetic
trees or a work ow component realizing user interaction.</p>
      <p>Utilizing these SIBs, a work ow developer (a biologist, for example) can
dene the following simple process (see gure 2): show an input dialog where the
user enters a query sequence, then run a BLAST (Basic Local Alignment Search
Tool) query with this sequence to nd homologous sequences, and nally
visualize the phylogenetic tree for this set of sequences with ATV (A Tree Viewer).</p>
      <p>While an experienced bioinformatician might be aware of the problem within
this process immediately (because of his familiarity with the involved tools),
many users will only realize at runtime that there must be some undesired
behaviour, since the execution comes to an unexpected halt. In the following we
illustrate how to automatically detect and x such issues already at design time
using model checking and model synthesis.
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Using Model Checking to Detect Problems</title>
      <p>In this phylogenetic analysis process two services use data: BLAST uses the blast
query entered in the input dialog, and ATV at the very end uses the result of the
preceding steps. We can now, e.g., check the type correctness of these uses:
If a service uses a data item x of type y, x must have been de ned before with
precisely this type, without having been overwritten since.</p>
      <p>This property can be expressed in a temporal logic and given to a model
checker. While the model checking detects no problems for the rst use, it reveals
a property violation for the second, as can be seen in gure 3: the rightmost SIB
is marked by a red frame, indicating that the property is violated at that node.
The cause is that the process does not provide the appropriate input type for
the tree visualizer.</p>
      <p>Once detected, there are di erent ways to x the problem. One possible
approach is to search for a sequence of additional services that resolve the type
mismatch and insert them into the process. This data mediation sub-work ow is
usually linear. It can consist of type conversions that simply adapt the involved
data, or also of real computational services when they can not be related too
easily. The next section shows how to use process synthesis to nd an adequate
sequence of services automatically.
2.2</p>
    </sec>
    <sec id="sec-3">
      <title>Synthesis</title>
      <p>The knowledge base needed for the process synthesis consists of input and output
information for each service as well as service and type taxonomies that classify
the services and types, respectively.</p>
      <p>We assume a simple type taxonomy for our example, which has a generic
type type at the root and re nes the data types downwards. The basic input
and output information for the services is de ned in terms of the concrete data
types, i.e. the leaves of the taxonomy tree. The corresponding information for
our exemplary SIB collection is shown in gure 4.</p>
      <p>Having all required knowledge available, the synthesis algorithm must now
be provided with a speci cation for the desired service sequence. For instance,</p>
      <p>Take a BLAST result as input and nally produce a phylogenetic tree.
is such a speci cation that can be formulated in a temporal logic.</p>
      <p>Computing the shortest tool combination that satis es the speci cation, the
synthesis returns a sequence of three services that can be used to complete
the erroneous process ( gure 5, bottom): a BlastParser extracts the database
IDs of the homologous sequences from the BLAST result, the corresponding
sequences are fetched from a database using WSDBFetch, and emma (an interface
to the ClustalW multiple sequence alignment algorithm) is invoked to obtain an
alignment and the corresponding phylogenetic tree.
3</p>
      <sec id="sec-3-1">
        <title>Discussion and Perspectives</title>
        <p>The previous section demonstrated the model checking and synthesis
methodology that is currently available in the jABC framework. We are now tailoring
these techniques and underlying development methodologies to the
bioinformatics application domain. This work spans three dimensions: domain modeling
(section 3.1), model checking (section 3.2), and model synthesis (section 3.3).
3.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Domain modeling</title>
      <p>This dimension is the heart of making information technology available to
biologists, as it enables them to formulate their problems in their own language
terms - on the basis of adequately designed ontologies.</p>
      <p>This raises the issue where the domain knowledge ideally comes from. It is,
of course, possible for each user to de ne custom service and type taxonomies,
allowing for exactly the generalization and re nement that is required for the
special case. However, as the tools and algorithms that are used are mostly
third-party services, it is desirable to retrieve domain information from
public knowledge repositories as well. Therefore we plan to incorporate knowledge
from di erent publicly available ontologies and integrate it into service and type
taxonomies that can be used by our synthesis methodology.</p>
      <p>
        Signi cative examples for a relevant and popular knowledge bases of
bioinformatics data types and services are the constantly evolving namespace, object
and service ontologies in BioMoby [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Originating from the early 2000s, the 1.0
BioMoby Semantic Web Service speci cation, however, does not adhere to the
ontological standards that have been developed for the Semantic Web in the last
years. Thus we follow with interest and hope to bene t from the development
of the S(emantic)-Moby framework and also the SSWAP (Simple Semantic Web
Architecture and Protocol) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] project, which aims at providing this knowledge
using standard RDF/OWL technology and plans, for instance, to provide the
BioMoby domain knowledge accordingly.
      </p>
      <p>It is, of course, also necessary that the services themselves are equipped with
meta-information in terms of these ontologies. Again, we are looking at BioMoby
with interest: numerous institutions have registered their web services at Moby
Central, describing functionality and data types in pre-de ned structures
using a common terminology. Although BioMoby does not yet use standardized
description formalisms like WSDL-S, it is already clear that there is semantic
information available that we can use as predicates for automatic service
classication.</p>
      <p>
        Furthermore it will be interesting to consider the incorporation of more
content-oriented ontologies like the Gene Ontology [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] or the OBO (Open
Biomedical Ontologies) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] into our process development framework. This would
enable the software to not only support the process development on a technical
level, but also in terms of the underlying biological and experimental questions.
Additional sources of information, like the provenance ontologies of [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], could
also be exploited by our synthesis and veri cation methods.
3.2
      </p>
    </sec>
    <sec id="sec-5">
      <title>Model Checking</title>
      <p>This dimension is meant to systematically and automatically provide biologists
with the required IT knowledge in a seamless way, similar to a spell checker
which hints at orthographical mistakes - perhaps already indicating a proposal
for correction. Immediate concrete examples of detectable issues are (2.1):
{ mismatching data types: a certain service is not able to work on the data
format provided by its predecessor.
{ missing resources: a process step is missing, where a required resource is
fetched/produced.</p>
      <p>However, this is only a rst step. Based on adequate domain modeling, made
explicit via ontologies/taxonomies, model checking can capture semantic
properties to guarantee not only the executability of the biological analysis process
but also a good deal of its purpose, and rules of best practice, like:
{ all experimental data will eventually be stored in the project repository,
{ unexpected analysis results will always lead to an alert, or
{ chargeable services will not be called before permission is given by the user.</p>
      <p>
        On a more technical side, model checking allows us to apply the mature
process analysis methodology that has been established in programming language
compilers in the last decades and has shown to be realizable via model checking
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Similar to the built-in code checks that most Integrated (Software)
Development Environments provide, this would help Bio-jETI users to avoid the
most common mistakes at process design time. In addition the list of veri ed
properties is extendable by the user, and can thus be easily adapted to speci c
requirements of the application domain.
3.3
      </p>
    </sec>
    <sec id="sec-6">
      <title>Model Synthesis</title>
      <p>This dimension can be seen as a step beyond model checking: The biologist does
not have to care about data types at all - the synthesis automatically makes the
match by inserting required transformation programs (see section 2.2). This is
similar to a spell checker which automatically corrects the text, thus freeing the
writer from dealing with orthography at all.3
3 In our model-based framework, things are well-founded, without the uncertainties of
natural language. So please do not be put o by this example because of annoying
experiences with spell checkers!</p>
      <p>The potential of this technology goes even further: ultimately, the biologist
will be able to specify her requests in a very sparse way, e.g. by just giving the
essential corner stones, and the synthesis will complete this request to a running
process. For instance, consider the following process description:
Having a single (genetic) sequence, I want to nd similar sequences and get a
hypothesis about their evolutionary relationship.</p>
      <p>Typically there are many processes that solve such a request. In the example
under consideration each solution would consist of a di erent set of similarity
search services and tools for estimating phylogenies, as well as the required
conversions, data retrievals etc. in between. Thus our synthesis algorithm provides
the choice of producing a default solution according to a prede ned heuristics,
or proposes sets of alternative solutions for the biologist to select.
4</p>
      <sec id="sec-6-1">
        <title>Conclusion</title>
        <p>The Semantic Web is currently one of the most ambitious projects in computer
science. Collective e orts have already lead to a basis of standards for service
descriptions and meta-information. It is now mainly the service provider's task to
extend the already available technology according to more speci c requirements
of their application domain and build up a semantics knowledge base.</p>
        <p>
          The challenge for user-side software is to abstract from the underlying
Semantic Web technology again and provide the achievements in an intuitive fashion.
Some ontology-aware service browsers and clients are already available in the
scope of the BioMoby project. With the MOBY-S Web Service Browser it is, for
instance, possible to search for a matching next service, while in addition the
sequence of actually executed tools is recorded and stored as a Taverna
workow [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. However, these solutions demand quite some technical understanding
from the user. This hampers the uptake by a larger biological community.
        </p>
        <p>Our approach aims at lowering the required technical knowledge according
to the "easy for the many, di cult for the few" paradigm. After an adequate
domain modeling, including the de nition of the semantic rules to be checked by
the model checker or to be exploited during model synthesis, biologists should
ultimatively be able to pro tably and e ciently work with a world-wide
distributed collection of tools and data, using their own domain language.</p>
        <p>
          This goal di erentiates us from other related projects like Taverna [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ],
Kepler [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] or Triana [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], which address a bioinformatics user, and not the biologist
herself. We believe that our control-oriented approach has a much better
potential to address non-IT personnel: it allows them to continue to think in 'Dos' and
'Don'ts', and steps and sequences of action in their own terms at their level of
domain knowledge. In contrast, the three data ow-oriented tools above require
their users to change the perspective to a resource point of view, which, in fact,
requires implicit (technical) knowlegde in order to pro tably use them.
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Margaria</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kubczak</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ste</surname>
            <given-names>en</given-names>
          </string-name>
          , B.:
          <article-title>Bio-jETI: a service integration, design, and provisioning platform for orchestrated bioinformatics processes</article-title>
          .
          <source>BMC Bioinformatics 9 Suppl</source>
          <volume>4</volume>
          (
          <year>2008</year>
          )
          <article-title>S12 PMID: 18460173.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Ste en,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Margaria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Nagel</surname>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          , Jorges,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Kubczak</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Model-Driven Development with the jABC</article-title>
          .
          <source>In: Hardware and Software, Veri cation and Testing</source>
          . (
          <year>2007</year>
          )
          <volume>92</volume>
          {
          <fpage>108</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bakera</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Margaria</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Renner</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ste</surname>
            <given-names>en</given-names>
          </string-name>
          , B.:
          <article-title>Veri cation, Diagnosis and Adaptation: Tool-supported enhancement of the model-driven veri cation process</article-title>
          . In: Revue des Nouvelles Technologies de l'
          <source>Information (RNTI-SM-1)</source>
          .
          <source>(December</source>
          <year>2007</year>
          )
          <volume>85</volume>
          {
          <fpage>98</fpage>
          (Journal version to appear
          <source>in ISSE).</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Ste en,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Margaria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Beeck</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Automatic synthesis of linear process models from temporal constraints: An incremental approach</article-title>
          .
          <source>In ACM/SIGPLAN Int. Workshop on Automated Analysis of Software (AAS'97)</source>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Margaria</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ste</surname>
            <given-names>en</given-names>
          </string-name>
          , B.:
          <article-title>Backtracking-Free Design Planning by Automatic Synthesis in METAFrame</article-title>
          . In: Fundamental Approaches to Software Engineering. (
          <year>1998</year>
          )
          <fpage>188</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Margaria</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ste</surname>
            <given-names>en</given-names>
          </string-name>
          , B.:
          <article-title>LTL Guided Planning: Revisiting Automatic Tool Composition in ETI</article-title>
          .
          <source>In: Proceedings of the 31st IEEE Software Engineering Workshop</source>
          , IEEE Computer Society (
          <year>2007</year>
          )
          <volume>214</volume>
          {
          <fpage>226</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Jorges,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Margaria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Ste</surname>
          </string-name>
          <string-name>
            <surname>en</surname>
          </string-name>
          , B.:
          <article-title>Genesys: Service-Oriented Construction of Property Conform Code Generators. Innovations in System and Software Engineering -</article-title>
          a
          <source>NASA Journal</source>
          (to appear)
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Margaria</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Ste en, B.:
          <article-title>jETI: A Tool for Remote Tool Integration</article-title>
          . In:
          <article-title>Tools and Algorithms for the Construction and Analysis of Systems</article-title>
          . Volume
          <volume>3440</volume>
          /2005 of LNCS., Springer Berlin/Heidelberg (
          <year>2005</year>
          )
          <volume>557</volume>
          {
          <fpage>562</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Wilkinson</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Links</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>BioMOBY: an open source biological web services proposal</article-title>
          .
          <source>Brie ngs in Bioinformatics</source>
          <volume>3</volume>
          (
          <issue>4</issue>
          ) (
          <year>December 2002</year>
          )
          <volume>331</volume>
          {41 PMID:
          <fpage>12511062</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Gessler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : SSWAP {
          <article-title>Simple Semantic Web Architecture and Protocol</article-title>
          . http: //sswap.info/docs/SSWAP.pdf
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ball</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blake</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          , et al.:
          <article-title>Gene ontology: tool for the uni cation of biology</article-title>
          .
          <source>Nature Genetics</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ) (May
          <year>2000</year>
          )
          <volume>25</volume>
          {9 PMID:
          <fpage>10802651</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , others, C.R.:
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat Biotech</source>
          <volume>25</volume>
          (
          <issue>11</issue>
          ) (
          <year>November 2007</year>
          )
          <volume>1251</volume>
          {
          <fpage>1255</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Sahoo</surname>
            ,
            <given-names>S.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Semantic Provenance for eScience: Managing the Deluge of Scienti c Data</article-title>
          .
          <source>IEEE Internet Computing</source>
          <volume>12</volume>
          (
          <issue>4</issue>
          ) (
          <year>2008</year>
          )
          <volume>46</volume>
          {
          <fpage>54</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Ste en, B.:
          <article-title>Data Flow Analysis as Model Checking</article-title>
          . In: Theoretical Aspects of Computer Software, International Conference TACS '
          <volume>91</volume>
          ,
          <string-name>
            <surname>Sendai</surname>
          </string-name>
          , Japan,
          <source>September 24-27</source>
          ,
          <year>1991</year>
          , Proceedings, London, UK, Springer-Verlag (
          <year>1991</year>
          )
          <volume>346</volume>
          {
          <fpage>365</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Dibernardo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pottinger</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilkinson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Semi-automatic web service composition for the life sciences using the BioMoby semantic web framework</article-title>
          .
          <source>Journal of Biomedical Informatics (March</source>
          <year>2008</year>
          ) PMID:
          <fpage>18373957</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Oinn</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Addis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferris</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>Taverna: a tool for the composition and enactment of bioinformatics work ows</article-title>
          .
          <source>Bioinformatics</source>
          <volume>20</volume>
          (
          <issue>17</issue>
          ) (
          <year>2004</year>
          )
          <volume>3045</volume>
          {
          <fpage>3054</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Altintas</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berkley</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaeger</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , et al.:
          <article-title>Kepler: An Extensible System for Design and Execution of Scienti c Work ows</article-title>
          .
          <source>In SSDBM</source>
          (
          <year>2004</year>
          )
          <volume>21</volume>
          |
          <fpage>23</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , I.,
          <string-name>
            <surname>Shields</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harrison</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The Triana Work ow Environment: Architecture and Applications</article-title>
          . In: Work ows for e-Science. Springer, New York, Secaucus, NJ, USA (
          <year>2007</year>
          )
          <volume>320</volume>
          |
          <fpage>339</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>