<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Applying Capability Modelling in the Genomics Diagnosis Domain: Lessons Learned</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Francisco Valverde</string-name>
          <email>fvalverde@pros.upv.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Jose Villanueva</string-name>
          <email>mvillanueva@pros.upv.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universitat Politècnica de València</institution>
          ,
          <addr-line>Camino de Vera S/N 46022, Valencia</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Because of the evolution of sequencing technologies, Bioinformatics Workflow Management Systems (BWMS) are a popular software for geneticists to describe workflows for analysing genomic data. Although these systems improve development productivity, they are far from being widely accepted by this community. The lack of rigorous conceptual modelling-practices explains the complexity to adapt this genetic analysis software to context changes. In order to face this adaptation issue, we propose using the capability notion as a modelling primitive for providing a sound conceptual background. This paper analyses, from a capability-driven perspective, how daily practices in a bioinformatics SME could be represented as capabilities. From this real scenario, we state current capabilities and explain how they can be supported using current BWMS. As a lessons learned, we discuss how the introductions of capability-driven development could improve their daily work</p>
      </abstract>
      <kwd-group>
        <kwd>Capability-driven development</kwd>
        <kwd>Workflow systems</kwd>
        <kwd>Conceptual modelling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Current DNA analyses require a complex computer procedure to perform quality
results. As some authors claim [
        <xref ref-type="bibr" rid="ref1 ref2">1-2</xref>
        ], although a lot of commercial and
opensource analysis software is available, it is difficult to find a solution that supports
all geneticist’s requirements. Therefore geneticists have no option but to develop
their own custom software. The most common development approach is to
assemble and reuse open-source software components (mapping algorithms, data
processing utilities, visualizers, file parsers, etc.) using a scripting-oriented language.
Due to this practice, there is a huge amount of isolated scripts, frameworks and
command-line tools that perform the same functionality with slight differences.
      </p>
      <p>
        An interesting approach for improving this scenario is the use of bioinformatics
workflow management systems (BWMS) to describe their tailored software as
workflows made up of software components that manipulate genetic data. Current
BWMS are, to some extent, end-user oriented because they provide some
guidance for creating experiments, such as visual notations or wizards. However, when
they try to use them, BMWS lack of suitable domain abstractions [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In addition,
changes usually require addressing underlying technological issues. For that
reasons, they find BMWS still very complex and they are reluctant to learn
programming [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to tailor them to their requirements.
      </p>
      <p>
        From an Information Systems perspective, we have detected a serious problem
that affects the expected quality of those tools: the lack of sound Conceptual
Modeling practices. In previous works, we have presented a holistic Conceptual
Schema for the Human Genome (CSHG) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], some applications for genetic analysis
based on this CSHG [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and the first attempts to provide an ontological
background based on UFO [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        In this paper we explore how conceptual modelling practices can improve the
design and development of genetic analyses. In particular, our goal is to explore
how the notion of capability can help to better understand and better design those
genetic analyses. Capability-driven development (CDD) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is a novel approach
for addressing evolving scenarios like the one we present in this work. From the
business perspective, a capability is defined as the ability and capacity that enables
an enterprise to achieve a business goal in a certain operational context. From the
technical perspective, capability delivery requires dynamic utilization of resources
and services in dynamically changing contexts.
      </p>
      <p>
        The main contribution of this work is to analyse CDD in a domain such
challenging and evolving as Genomics. We want to evaluate that applying the
capability notion, functional semantics associated to BWMS are more precise, design
complexity is reduced and productivity is increased. Our first step is to identify
which capabilities geneticists demand from BWMS: following the principles of
Action Research [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], we observed geneticists from IMEGEN (Instituto de
Medicina Genomica) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], a Spanish bioinformatics SME, when using their current
genetic analysis software to carry out disease diagnoses. From the lessons learned in
this real use case, a set of capabilities are identified and modelled using three
widely known open-source BWMS.
      </p>
      <p>The rest of the paper is structured as follows: Section 2 describes several
approaches that address bioinformatics software development using BWMS. Section
3 explains the IMEGEN geneticists’ scenario for genetic disease diagnosis.
Section 4 specifies a concrete set of four capabilities detected in the IMEGEN use
case. Section 5 evaluates how these four different capabilities are supported by
three popular BWMS. Section 6 discusses how CDD can improve the current
context, and section 7 states the conclusions and the future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Related Work</title>
      <p>
        Several authors have also noticed geneticists’ difficulties when they use software
tools to accomplish their work. On the one hand, Lacroix and Menager [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
carried out an evaluation of BWMS and noticed several points of improvement
regarding extensibility, functionality, usability, understandability, scalability, and
efficiency. Also, Barker and Hemert [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] performed a comparison between
business and scientific workflow technologies and conducted a survey about different
BWMS. Both identified several issues about usability, sustainability and tool
adoption, however, both evaluations were performed in 2005 and 2007
respectively, and conclusions may not be longer accurate because tools have evolved since
then. Looking critically at these works, there is a lack of a rigorous, clear
conceptual modelling background.
      </p>
      <p>
        Cohen, et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] analysed different real workflows from the micro-array data
analysis domain created by scientists using current BWMS. Authors discuss why
scientists are not yet prepared to use BWMS, and they describe several
technological challenges that current BWMS should address regarding reusability,
adaptability and usability. Additionally, McPhillips et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] also believe that
environments must provide to geneticists some assistance for the design and
implementation of workflows. They enumerated several properties that should be satisfied,
such as, clarity, well-formedness, predictability, recordability, reportability,
reusability, scientific data modelling and automatic optimization. Again, we realized
that the point of view of the analysis is basically solution space-oriented
(productoriented), instead of problem space-oriented (conceptual model-driven).
      </p>
      <p>Trying to overcome this lack of conceptual modelling-based approaches, our
work contributes to genetic analysis design by: i) proposing the use of the
capability notion as basic modelling unit; ii) identifying several capabilities directly
associated with geneticists’ requirements; and iii) assessing how BWMS support those
capabilities. Specifically, we describe four dimensions or properties that are not
considered by the aforementioned approaches: the goal KPIs, the context, the
capacity and the ability. With this, we introduce a modelling perspective with a
more suitable abstract representation of the domain.</p>
    </sec>
    <sec id="sec-3">
      <title>3 Illustrative scenario: The genetic disease diagnosis process</title>
      <p>
        IMEGEN [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is a SME specialized in the diagnosis of hundreds of genetic
diseases and several diagnosis techniques. A generic scenario from their genetic
diagnosis process is made up of the following steps:
1. Sequencing Phase: The patient’s DNA sample is introduced in the sequencing
machine with a set of reactives and, as a result, a set of small sequences is
obtained.
2. Sequence Treatment Phase: Geneticists build and check the correctness of the
DNA sequence from the set of pieces obtained by sequencing machines. To do
this, they must import the set of sequences into a software tool.
3. Alignment Phase: The complete sequence is compared against a reference
sequence in order to obtain the differences between them. To do this, they must
import the reference sequence and the consensus sequence obtained from the
previous phase into a software tool.
4. Knowledge Phase: Each difference is characterized as a genetic variation. If a
variation has been previously described, additional information related to a
genetic disease should be retrieved. To do this, they must search each variation in
different online databases.
5. Report Phase: All genetic variations and their associated information are
gathered in a report. To do this, they use a spreadsheet to represent the information
gathered.
      </p>
      <p>
        IMEGEN’s geneticists adopted a software solution to semi-automate this
manual procedure. But with the called Next Generation Sequencing (NGS)
technologies [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] this solution was out to date; although conceptually the process remained
the same.
      </p>
      <p>As a solution to this handicap, current BWMS aim to provide geneticists, with
a platform to create their own workflows, and evolve them accordingly. The use
of these environments benefits geneticists because: 1) Their interests, such as
financial and temporal requirements, depend only on themselves instead of IT
companies; 2) They are able to create exactly what they expect from a software
product; and 3) They control the created product in case of any change is required.</p>
      <p>However, we detected a common problem: the lack of modelling practices. For
instance, the need to compare a sample sequence against a reference sequence is
always the same problem, even if the evolution of technology changes
continuously. This change is represented in several BWMS with the introduction of a new
alignment algorithm as a new modelling primitive. Mixing the stable, conceptual
part of the problem, with the more volatile, technology-oriented concrete solution,
is being a serious problem shared among different BWMS.</p>
      <p>The benefits of using BWMS and models to describe the semantics of a process
are clear in this sort of situations, in which neither semantics nor the problem
model has changed. Thus, evolution is described as an update model projection
into concrete software technologies, such as toolkits, algorithms and information
systems, which would be changed according to the situation. The contribution of
this work is to use the CDD methodology to describe such processes to be
implemented by a BWMS.</p>
    </sec>
    <sec id="sec-4">
      <title>4 Capabilities in a genomic analysis scenario</title>
      <p>
        Using the experience gained after the study of the IMEGEN scenario, we
identified a list of mandatory capabilities to be supported in their work. According to
the metamodel for capability specification introduced in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], capabilities are
described in terms of five main properties:
• Goal: Desired state of affairs that needs to be attained.
• Goal KPI: Key performance indicator (KPI) for monitoring the achievement
of a goal.
• Context: The context encompasses the information characterizing the
situation in which a business capability should be provided.
• Capacity: Availability of resources, e.g. money, time, personnel, tools, for
delivering the capability
• Ability: Level of available competence, were competence is understood as
talent, intelligence and disposition, of a subject or enterprise to accomplish a
goal.
      </p>
      <p>We use these five properties for describing and characterizing each detected
capability. Additionally, for each capability an example is shown to illustrate how the
capability is deployed in real practice. Next, the capabilities are described:</p>
      <sec id="sec-4-1">
        <title>4.1 Access to genomic public data sources (C1)</title>
        <p>The genetics domain is such a young field that the best way to achieve new
genetic discoveries is sharing as much genetic knowledge as possible. As a result,
geneticists feed from data stored in different public repositories spread around the
web. To use this data, geneticists specify queries to create datasets from several
public repositories. As an example, we can consider this query: “Specify a query
to retrieve a reference sequence from a gene, whose identifier is “BRCA1” from
the NCBI repository”
• Goal: For a giving genetic disease, retrieve all the information publicly
available on the repositories list.
• Goal KPI: Increase the % of detected variations with relevant information for
diagnosis.
• Context: Public databases are accessible using different sources: restful APIs,
relational databases and HTML pages. It is common the inclusion of relevant
new databases.
• Capacity: a private database management system for storing indexes about
previously detected variations and internal data for analysis.
• Ability: knowledge about data sources provides relevant and trustful
information from the genomic diagnosis point of view.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2 Integration with external services (C2):</title>
        <p>Bioinformatics community spends a lot of resources in the development of
algorithms, toolkits, and web services specialized in the automation of different
genetic tasks. Geneticists design their experiments by combining some of these services
into a workflow. Hence, geneticists must be provided with a mechanism to design
the integration and configuration of external services. Example: Integrate a
custom algorithm that translates a DNA sequence into a RNA sequence.
• Goal: Include into the genetic analysis external services that provide data
processing functionalities.
• Goal KPI: Increase the number of services currently integrated into the design
environment.
• Context: External services are available as web services (using SOAP or Rest
interfaces) and as command-line utilities. New and updated services are
published regularly according to novel research.
• Capacity: Computer server for the deployment and integration of external
services.
• Ability: Knowledge about the genetic tasks to be performed and the functional
requirements.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3 Conceptual description and management of new genetic data (C3):</title>
        <p>One of the main problems of the genetic domain is the lack of widely accepted
conceptual models to express genetic data. Geneticists work with different formats
that are basically raw text files following a tabular structure. It is common that
new formats arise with new sequencing technologies. Hence, geneticists require a
mechanism for managing new genetic data representations and extract the relevant
information. Example: Use an entity “Gene” and use its attribute “Gene Name”
as a parameter of a genetic task instead of column 3 of the file Gene.fasta.
•
•
•
•
•</p>
        <p>Goal: Support the management of the common data formats from sequencing
technologies
Goal KPI: Increase the number of supported data formats
Context: Each sequencing technology could potentially provide a specific data
format as there is no standard. Data is stored as flat text files that must be
firstly parsed to be used by BWMS.</p>
        <p>Capacity: Text based processing tools or scripts to transform data files into
structured data, i.e inside a database.</p>
        <p>Ability: Software programmers with the skills to implement translator
modules and text parsers.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4 Reporting (C4)</title>
        <p>During the execution of their experiments, geneticists need to store all the data
transformations and results that are obtained in every step. This information is
essential to check the correct execution of the experiment and also, in order to
reconstruct the experiment under the same conditions. Hence, geneticists must be
able to specify the creation of result reports. Example: Specify a report that
contains the date and the number of Genes obtained by a given query performed
through the NCBI database.</p>
        <p>• Goal: Create a report with the relevant data from a genetic disease
diagnosis.
• Goal KPI: Increase the number of results correctly described in the
report.
• Context: Report information changes according to the disease to be
diagnosed and the customer profile.
• Capacity: Reporting software for generating doc or pdf files from the
results of the genetic analysis.
• Ability: Software programmers with the skills to implement the report
templates and the import mechanisms</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5 Evaluation of capabilities supported by BWMS</title>
      <p>From a conceptual modelling perspective, the next step is to show that the
modelling of these four capabilities can improve the understanding and the
subsequent implementation of genetic analysis. This modelling exercise also provides a
useful framework to conceptually compare which BWMS is more suitable for
deploying the capability.</p>
      <p>Following this reasoning line, we have deployed the presented capabilities into
three of the most reference open-source BWMS. Commercial tools have been
discarded in this analysis, because they cannot be properly evaluated using trial or
evaluation versions. We claim that the discussion reported in this section could be
replicated without any significant constraint using other tools that provide a
similar functionality. For the evaluation, we have used the genetic disease diagnosis
process described in section 3, which was specified as a “main” capability. Using
that scenario, we have configured the specific BWMS and deployed a workflow
that supports each capability. Table 1 summarizes the support degree of each
capability in three levels as: 1) “supported (S)”; 2) “partially supported (PS)”; or 3)
“not supported (N)”. Next, the results are detailed for each tool:</p>
      <p>C1: Access to Public Data Sources</p>
      <p>C2: Integration with External Services
C3: Conceptual description of genetic data</p>
      <p>C4: Reporting</p>
      <sec id="sec-5-1">
        <title>5.1 Taverna</title>
        <p>
          Taverna is an open-source environment for the design, edition and execution of
scientific workflows created by the MyGrid Team [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Its main objective is to aid
with the definition of in-silico experiments through the integration of web services
specially focused on the biological domain. Taverna integrates functionality
through myExperiment, a social network to share scientific workflows, and the
Biocatalogue [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], a curated catalogue of web services for the life sciences.
        </p>
        <p>Taverna supports C2, because different services specifications, such as WSDL,
BeanShell or Biomoby, are supported, and local tools can be integrated using a
command line application. It partially supports C1 because it provides a set of
services for retrieving data from different biological data sources, but it is only
possible to use predefined queries. Additionally, the expressivity of the workflow
language only supports the definition of specific software tools, such as
“runMiraProgram” or “runBlastn2seq” instead of high-level bioinformatic tasks, such
as “assembly task” or “local alignment task”.</p>
        <p>
          It also partially supports C3, because conceptual descriptions of data can be
defined using the MOBY-S ontology [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. However, this ontology is difficult to
search, as it contains duplicate entries, poor descriptions and entities not related
with the biological domain. Regarding C4, it is partially supported because it is
possible to display data and to store results in text files, but custom reports cannot
be specified.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2 Galaxy</title>
        <p>
          Galaxy is an open-source web-based environment for the execution of biological
services created by the Galaxy Team [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Its main purpose is to aid geneticists
with their data intensive biological research through the definition of web
interfaces for biological data retrieval and services execution. With this purpose, it
provides different interfaces that access to UCSC [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] and Ensembl [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] databases
and toolkits.
        </p>
        <p>Galaxy partially supports C1, as users can only retrieve data through predefined
queries provided by the environment’ interfaces, for example the web search
interface provided by the USCS database. Also C2 is partially supported, because there
is a wide array of popular bioinformatics services provided by default.
Additionally, Galaxy supports the integration of new services using programming scripts, but
this approach is not suitable for geneticists that lack of programming knowledge.
Regarding C4, Galaxy outputs data using a mechanism based on HTML
templates. However, the usability and expressivity provided with this solution is not
enough to consider this capability fully-supported.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3 eBioFlow</title>
        <p>
          eBioFlow is an open-source workflow management system to design and execute
biological workflows developed in the academic environment as a proof of
concept of a series of PhD dissertations [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. Its main purpose is to improve other
BWMS proposals focusing on the support of data provenance, the usability of the
user interface and the execution of workflows step by step.
        </p>
        <p>It partially supports C1, as it provides a set of services that execute a specific
query against a specific data source, but it is not possible to retrieve specific data
properties. Also, C3 is partially supported, as there is a service for creating data
structures according to a taxonomy of entities related with the biological and the
bioinformatics domain. However, the specification of each data structure is
ambiguous, relationships among structures cannot be detected and there is a
confusing mixture between biological properties and computer properties. Neither C2
nor C4 are supported by this BWMS.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6 Discussion</title>
      <p>Using a conceptual modelling approach based on the notion of capability, we have
simplified the deployment of the aforementioned process. One main improvement
is that using capabilities specification, workflows are easy to understand to
geneticists. They state that capabilities specification are a nice and organized
documentation of their process.</p>
      <p>Regarding the analysis, the main conclusion is that Taverna is the most complete
environment, as it takes into account the aforementioned capabilities and partially
supports all of them. However, specific domain knowledge and entities must be
introduced to improve the end-user satisfaction. On the contrary, Galaxy provides
less capabilities and its main advantage is that data retrieval and local data
uploading is easier. Galaxy is specifically designed for the bioinformatics domain, it is
more geneticists friendly and has a simpler workflow notation than Taverna. Its
main disadvantage is that the workflow language is not as expressive as the
provided by Taverna. Finally, eBioFlow provides a good workflow language in terms
of expressivity and a user-friendly interface. This tool also includes some
interesting features such as different workflow design perspectives, automatic creation of
workflow entities from data types or partial workflow execution. But, it lacks of
the advanced bioinformatics functionality that Taverna and Galaxy provide.</p>
      <p>From this preliminary analysis, we validated that the previous issues (before
applying the CDD methodology) were a consequence of modelling workflows
using software-oriented concepts and the lack of documented specification.
Current BWMS provide software components oriented to the solution domain, i.e,
programming frameworks and command-line tools to implement specific
bioinformatics functionality. In order to overcome this situation, the application of
CDD is useful to improve the process understanding by end-users. Reasoning in
terms of the well-supported concepts of goal, goal KPIs, context, capacity and
ability, geneticists can design the expected capabilities instead of designing just
workflows. This idea has been discussed with IMEGEN geneticists in order to
come up with a better evolution approach. Specifically, we have found a set of
improvements regarding the analysed capabilities:
• Integration with public data sources: Current environments only support the
execution of predefined queries against a single data source at a time. We have
proposed the definition of concept models, a modelling construct proposed by
CDD, specifically designed to gather all the concepts regarding the information
to be retrieved. Hence, geneticists will construct queries using this conceptual
model instead of programming for specific APIs or database schemas.
Applying a model-to-code transformation approach is also feasible to transform a
query against this model to several specific SQL queries that extract data from
each biological data source. As a drawback, it will be necessary the inclusion of
a mechanism that combines the query responses and expresses the resulting
datasets according to the conceptual model.
• Data import from genetic flat files: Some environments manage genetic data
by means of text files, which can have different formats. For extracting
information, users have to indicate which fields inside that file contain the data.
Other BWMS provide a conceptualization of genetic data using taxonomies or
biological ontologies. However, usually, these conceptualizations are
ambiguous or little intuitive, and geneticists are unable to select the correct entity. We
have proposed that every piece of data available in the BWMS must be
managed as a domain entity. This entity, with a set of properties, represents
precisely and unambiguously a concept of the biological or the bioinformatics domain.
Following the approach of the previous points, an entity will be instantiated
when data is retrieved in the BWMS and it will be available to become an input
or output of a task of the workflow.
• Reporting: Report generation is constrained to the execution of software
components that generate a specific report. We have proposed the definition of a
model-oriented mechanism that uses the conceptual model to create
personalized reports, where all the domain data available in the environment can be
selected to be reflected in the report.</p>
      <p>
        While several issues have been identified in current BWMS, the main problem
is the steep learning curve. For geneticists, it is a highly time-consuming task to
fully understand and master all the features provided by a BWMS. As a solution,
we believe that geneticists should use a business environment where all the
technological and low-level details are hidden. This approach addresses the current
situation because CDD offers [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] “an approach to business and IT development …
to produce solutions that are fit for changing business contexts, while taking the
advantage of emerging technology solutions”.
      </p>
      <p>The application of CDD is not a generic approach to address any
bioinformatics task, but it is useful in highly evolving processes like the diagnosis of genetic
diseases. Using a CDD environment, geneticists and software analysts will be able
to define their diagnosis process in an agile way as a set of bioinformatics
processes and services. Additionally, process models and patterns defined as a
capability can guide the generation of a workflow specification for a BWMS.</p>
    </sec>
    <sec id="sec-7">
      <title>7 Conclusions and Future Work</title>
      <p>This work discusses the software issues that geneticists must face daily to
accomplish their genetic analyses, and the benefits of using the CDD methodology. In
order to address the unresolved features, it is discussed how CDD can be useful.
The work shows how capabilities can help to put conceptual modelling in practice,
in order to come up with a solution that improves current practices. Hence, we
encourage the definition of capabilities specifically oriented to geneticists, whose
benefits are: 1) they can focus on the experiments specification; 2) they can
abstract technological details and programming issues as domain issues; 3) they can
avoid the necessity to learn different BWMS specific management and features;
and 4) they have flexibility to change and to evolve the processes that represent
their research. As a result of this preliminary study, we state that applying a CDD
approach IMEGEN genetic analyses are improved. The next steps are to go further
in the specification of additional capabilities and to address the deployment of the
presented capabilities.</p>
      <p>Acknowledgments Thanks to the Instituto de Medicina Genómica (IMEGEN,
http://www.imegen.es) for its support and the provision of information. This work has been
partially supported by the EU-FP7 funded project no: 611351 CaaS - Capability as a Service in
Digital Enterprises.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>McKenna</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanna</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banks</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sivachenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cibulskis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kernytsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garimella</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Altshuler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Gabriel,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Daly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>DePristo</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.A.</surname>
          </string-name>
          :
          <article-title>The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data</article-title>
          .
          <source>Genome Research</source>
          <volume>20</volume>
          ,
          <fpage>1297</fpage>
          -
          <lpage>1303</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bartlett</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toms</surname>
            ,
            <given-names>E.G.</given-names>
          </string-name>
          :
          <article-title>Developing a protocol for bioinformatics analysis: An integrated information behavior and task analysis approach</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          <volume>56</volume>
          ,
          <fpage>469</fpage>
          -
          <lpage>482</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Ludascher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weske</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McPhillips</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bowers</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Scientific Workflows: Business as Usual? In: Dayal,
          <string-name>
            <given-names>U.</given-names>
            ,
            <surname>Eder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Koehler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Reijers</surname>
          </string-name>
          , H. (eds.) Business Process Management, vol.
          <volume>5701</volume>
          , pp.
          <fpage>31</fpage>
          -
          <lpage>47</lpage>
          . Springer Berlin / Heidelberg (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Barker</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          ., van Hemert,
          <string-name>
            <surname>J.</surname>
          </string-name>
          :
          <article-title>Scientific Workflow: A Survey and Research Directions</article-title>
          . In: Wyrzykowski,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Dongarra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Karczewski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Wasniewski</surname>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.)
          <source>Parallel Processing and Applied Mathematics</source>
          , vol.
          <volume>4967</volume>
          , pp.
          <fpage>746</fpage>
          -
          <lpage>753</lpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>McPhillips</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bowers</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zinn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , Lud\
          <article-title>"a, s</article-title>
          .B.:
          <article-title>Scientific workflow design for mere mortals</article-title>
          .
          <source>Future Gener. Comput. Syst</source>
          .
          <volume>25</volume>
          ,
          <fpage>541</fpage>
          -
          <lpage>551</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Pastor</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <article-title>Conceptual Modeling Meets the Human Genome</article-title>
          . ER,
          <year>2008</year>
          ,
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Villanueva</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valverde</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levín</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pastor Lopez</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Diagen: A Model-Driven Framework for Integrating Bioinformatic Tools</article-title>
          . In: Nurcan,
          <string-name>
            <surname>S</surname>
          </string-name>
          . (ed.)
          <source>IS Olympics: Information Systems in a Diverse World</source>
          , vol.
          <volume>107</volume>
          , pp.
          <fpage>49</fpage>
          -
          <lpage>63</lpage>
          . Springer Berlin Heidelberg (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Ferrandis</surname>
            ,
            <given-names>A. M. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Pastor</surname>
            ,
            <given-names>O. Ló.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Guizzardi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <article-title>Applying the Principles of an Ontology-Based Approach to a Conceptual Schema of Human Genome</article-title>
          . ER,
          <year>2013</year>
          ,
          <fpage>471</fpage>
          -
          <lpage>478</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Zdravkovic</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stirna</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henkel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grabis</surname>
          </string-name>
          , J.:
          <article-title>Modeling Business Capabilities and Context Dependent Delivery by Cloud Services</article-title>
          . In: Salinesi,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Norrie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Pastor</surname>
          </string-name>
          , Ó. (eds.)
          <source>Advanced Information Systems Engineering</source>
          , vol.
          <volume>7908</volume>
          , pp.
          <fpage>369</fpage>
          -
          <lpage>383</lpage>
          . Springer Berlin Heidel
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Wieringa</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Morali</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <article-title>Technical Action Research as a Validation Method in Information Systems Design Science</article-title>
          . DESRIST,
          <year>2012</year>
          ,
          <fpage>220</fpage>
          -
          <lpage>238</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>IMEGEN: Instituto de Medicina Genómica</surname>
          </string-name>
          . “http://www.imegen.
          <source>es”</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Lacroix</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ménager</surname>
          </string-name>
          , H.:
          <article-title>Evaluating workflow management systems for bioinformatics</article-title>
          . (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Cohen-Boulakia</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leser</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          : Search, adapt, and
          <article-title>reuse: the future of scientific workflows</article-title>
          .
          <source>SIGMOD Rec</source>
          .
          <volume>40</volume>
          ,
          <fpage>6</fpage>
          -
          <lpage>16</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Shendure</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ji</surname>
          </string-name>
          , H.:
          <article-title>Next-generation DNA sequencing</article-title>
          .
          <source>Nat Biotech</source>
          <volume>26</volume>
          ,
          <fpage>1135</fpage>
          -
          <lpage>1145</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Hull</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolstencroft</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pocock</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oinn</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Taverna: A tool for building and running workflows of services</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>34</volume>
          (
          <issue>Web Server Issue</issue>
          ),
          <fpage>W729</fpage>
          -
          <lpage>W732</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>De Roure</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhagat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cruickshank</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goderis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Michaelides</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : myExperiment: Defining the Social Virtual Research Environment. pp.
          <fpage>182</fpage>
          -
          <lpage>189</lpage>
          %&amp;.
          <source>(Year)</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Bhagat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tanoh</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nzuobontane</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laurent</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orlowski</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roos</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolstencroft</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aleksejevs</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pettifer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          :
          <article-title>BioCatalogue: a universal catalogue of web services for the life sciences</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>38</volume>
          ,
          <fpage>W689</fpage>
          -
          <lpage>W694</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Wilkinson</surname>
            ,
            <given-names>M.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Links</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>BioMOBY: An open source biological web services proposal</article-title>
          .
          <source>Briefings in Bioinformatics 3</source>
          ,
          <fpage>331</fpage>
          -
          <lpage>341</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Goecks</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nekrutenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Team</surname>
          </string-name>
          , T.G.:
          <article-title>Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences</article-title>
          .
          <source>Genome Biology</source>
          <volume>11</volume>
          ,
          <issue>R86</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Karolchik</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baertsch</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diekhans</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Furey</surname>
            ,
            <given-names>T.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinrichs</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Y.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roskin</surname>
            ,
            <given-names>K.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sugnet</surname>
            ,
            <given-names>C.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weber</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haussler</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kent</surname>
            ,
            <given-names>W.J.:</given-names>
          </string-name>
          <article-title>The UCSC Genome Browser Database</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>31</volume>
          ,
          <fpage>51</fpage>
          -
          <lpage>54</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Hubbard</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barker</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birney</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cameron</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cox</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuff</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curwen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Down</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Durbin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eyras</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gilbert</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hammond</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huminiecki</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kasprzyk</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehvaslaiho</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lijnzaad</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melsopp</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mongin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pettett</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pocock</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Potter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rust</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Searle</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Slater</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spooner</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stabenau</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stalker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stupka</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ureta-Vidal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vastrik</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clamp</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Ensembl genome database project</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>30</volume>
          ,
          <fpage>38</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Wassink</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ooms</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neerincx</surname>
          </string-name>
          , P., van der Veer, G.,
          <string-name>
            <surname>Rauwerda</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leunissen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breit</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nijholt</surname>
          </string-name>
          , A.,
          <string-name>
            <surname>van der Vet</surname>
          </string-name>
          , P.: e-BioFlow:
          <article-title>Improving Practical Use of Workflow Systems in Bioinformatics</article-title>
          . In: Khuri,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Lhotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Pisanti</surname>
          </string-name>
          , N. (eds.) Information Technology in Bio- and
          <string-name>
            <surname>Medical Informatics</surname>
            ,
            <given-names>ITBAM</given-names>
          </string-name>
          <year>2010</year>
          , vol.
          <volume>6266</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          . Springer Berlin / Heidelberg (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>