<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The CombineArchiveWeb application { A web based tool to handle les associated with modelling results</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martin Scharm?</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Florian Wendland</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Peters</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Markus Wol en</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tom Theile</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dagmar Waltemath</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Systems Biology and Bioinformatics, University of Rostock</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sharing in silico experiments is essential for the advance of research in computational biology. Consequently, the COMBINE archive was designed as a digital container format. It eases the management of les related to a modelling result, fosters collaboration, and ultimately enables the exchange of reproducible simulation studies. However, manual handling of COMBINE archives is tedious and error prone. We therefore developed the CombineArchiveWeb application to support scientists in promoting and publishing their research by means of creating, exploring, modifying, and sharing archives. All les are equipped with meta data and can be distributed over the Web through shareable workspaces.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Computational modelling is an indispensable tool in the life sciences. As a
consequence, standardisation and exchange of models has also become essential.
However, the steadily increasing size and complexity of models and derived
data poses the challenge of sharing reproducible results. Today's results typically
consist of multiple model les with semantic annotations, simulation descriptions
linked to terms in a simulation algorithm ontology, structure information, result
data sets with semantic annotations, and reference publications [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Only if all
relevant les are provided, modelling results can be reliably reproduced and
sharing becomes feasible.
      </p>
      <p>
        To solve this issue, the COMBINE community [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] proposes the COMBINE
archive format [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. A COMBINE archive is a container that bundles all les
related to a project into a single le. Typically, the COMBINE archive also
comprises les with meta data such as people attributions and details about les
inside the archive. The meta data is encoded in XML/RDF. For example, VCards
refer to contributors and DCTERMS capture creation and modi cation dates.
The meta data can be used to de ne relations among the les contained in the
archive. It can also be evaluated to query and compare COMBINE archives, or
to quickly get an overview of the content without necessarily loading the single
les. An example is given in Figure 1. The les included in an archive are listed
? To whom correspondence should be addressed
and their meta data are displayed in a human readable format. In this example,
the archive contains information about the results from Liebal et. al [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]: a model
description in SBML format, a simulation description in SED-ML format, the
reference publication in PDF format and two data tables. Viewing and modifying
meta data is easily possible through the forms on the right hand side.
      </p>
      <p>In summary, the COMBINE archive is a solid format for sharing reproducible
models and in silico experiments with collaborators and public databases.</p>
      <p>However, manual handling of COMBINE archives is tedious and error prone.
Therefore, we developed a CombineArchive Toolkit (http://sems.uni-rostock.
de/cat). It consists of a library, a desktop application, and a web based interface.</p>
      <p>
        The CombineArchiveLibrary1 is written in Java and implements the
COMBINE archive speci cation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. It o ers all necessary methods to handle COMBINE
archives, including:
      </p>
      <p>Extracting single les or the whole archive
Browsing through the archive
Adding and removing les
Renaming and reorganising les
Attaching and retrieving meta information</p>
      <p>
        The CombineArchiveLibrary is for tool developers. It has already been
integrated with software such as the Functional Curation Project of Chaste [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>We also integrated the CombineArchiveLibrary in the
CombineArchiveDesktop application2. It is intended as a functional browser for COMBINE archives.
One of the application's key features is the user-friendly handling of included
meta data. The meta data is presented in a human readable way and can easily
be modi ed. Since the desktop application only runs locally, your les do not
leave your machine. This is especially important when data protection or
intellectual property claims are of concern. The CombineArchiveDesktop application
is implemented in Java and shipped as an executable Java Archive (JAR) le.
Thus, it is portable and runs on Windows, Linux, and MacOS.</p>
      <p>The second software tool is the CombineArchiveWeb application. It is
described in the following.</p>
    </sec>
    <sec id="sec-2">
      <title>The CombineArchiveWeb application</title>
      <p>
        The CombineArchiveWeb application3 also uses the CombineArchiveLibrary as a
code base. It enables any researcher to work with the COMBINE archive format
on the internet. Additionally, the application o ers RESTful services for use by
other client applications. The web interface connects to open model repositories,
such as the CellML model repository [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], to easily retrieve models. User with
special interest in privacy do not need to alienate their research. Since the
1 sems.uni-rostock.de/trac/combinearchive
2 sems.uni-rostock.de/trac/combinearchive-gui
3 webcat.sems.uni-rostock.de
CombineArchiveWeb application is openly available and easy to install, anyone
could host their own instances on private servers. In order to prohibit abuse,
the maintainer of an installation can easily con gure quotas for the users. For
example, it is possible to limit the maximum le size, the maximum age of an
untouched archive, the number of archives per workspace, the number of les in
an archive, just to name a few options. Since the CombineArchiveWeb application
is not intended as a database for long-term storage of simulation studies old
archives are deleted regularly.
      </p>
      <p>Researchers who wish to share their results through the CombineArchiveWeb
application can simply upload all relevant les belonging to a piece of modelling
work. This automatically creates a workspace. Workspaces are an ideal tool to
manage les. They foster collaborations and prevent inconsistencies in versions
of les. Uploaded les are immediately bundled in a COMBINE archive. Such
virtual experiments can be downloaded at any time and from any location. The
created workspace can also be shared with collaborators to work from di erent
physical locations. The CombineArchiveWeb application supports researchers in
exploring and reproducing scienti c results. For example, authors of a publication
may decide to provide their code as a COMBINE archive. When consumers open
the archive, the CombineArchiveWeb application automatically reads the les
and their meta data. It builds the links between the various les and presents
the contents in a human readable format (see again Figure 1).</p>
    </sec>
    <sec id="sec-3">
      <title>Summary</title>
      <p>The CombineArchive Toolkit implements the latest COMBINE archive speci
cation and, thus, breathes life into the idea of sharing all les necessary to reproduce
an in silico experiment. We present a web based, graphical user interface for
interacting with COMBINE archives. The CombineArchiveWeb application grants
users intuitive access to COMBINE archives. Our tools and code are openly
available through our project website at sems.uni-rostock.de/cat. We invite you
to use our software, to provide feedback and suggestions, and to contribute to
the further development of the CombineArchive Toolkit.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Waltemath</surname>
          </string-name>
          et al.:
          <article-title>Possibilities for Integrating Model-related Data in Computational Biology</article-title>
          .
          <source>In: Proceedings of the 9th International Conference on Data Integration in the Life Sciences. CEUR Workshops</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Le</given-names>
            <surname>Novere</surname>
          </string-name>
          et al.:
          <article-title>Meeting report from the rst meetings of the Computational Modeling in Biology Network (COMBINE)</article-title>
          .
          <source>Standards in genomic sciences</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bergmann</surname>
          </string-name>
          et al.:
          <article-title>COMBINE archive: One File To Share Them All</article-title>
          . arXiv,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Liebal</surname>
          </string-name>
          et al.:
          <article-title>Proteolysis of beta-galactosidase following SigmaB activation in Bacillus subtilis</article-title>
          .
          <source>Molecular BioSystems</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bergmann</surname>
          </string-name>
          et al.:
          <source>COMBINE Archive Speci cation</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Cooper</surname>
          </string-name>
          et al.:
          <article-title>High-throughput functional curation of cellular electrophysiology models</article-title>
          .
          <source>Progress in Biophysics and Molecular Biology</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Li</surname>
          </string-name>
          et al.:
          <article-title>BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models</article-title>
          .
          <source>BMC Systems Biology</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lloyd</surname>
          </string-name>
          et al.:
          <article-title>The CellML Model Repository</article-title>
          . Bioinformatics,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>