<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ChEBI for systems biology and metabolic modelling</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Janna Hastings</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Neil Swainston</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Venkatesh Muthukrishnan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Namrata Kale</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adriano Dekker</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gareth Owen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Steve Turner</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pedro Mendes</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christoph Steinbeck</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Cheminformatics and Metabolism, EMBL -- European Bioinformatics Institute (EMBL-EBI)</institution>
          ,
          <addr-line>Hinxton</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Manchester Institute of Biotechnology, School of Computer Science, University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>ChEBI (http://www.ebi.ac.uk/chebi) is a curated database and ontology of biologically relevant small molecules. It is widely used as a reference for chemicals in the context of biological data such as protein interactions, pathways, and models (Hastings et al., 2013). As of the last release (May 2015), ChEBI contains 44,263 fully curated entries, each of which is classified within one of the sub-ontologies: chemical entities (classified according to structural features), roles (classified according to biological or chemical mode of action or use in application), and subatomic particles.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Systems biology brings together a wide range of
information about cells, genes and proteins, as well as the small
molecules that act on and within these biological structures.
It gives a holistic perspective aiming to track and eventually
simulate the entire functioning of biological systems. One
aspect of systems biology is metabolic modelling, which
aims to develop metabolic reconstructions. At the
wholegenome scale, these are all-encompassing interlinked maps
of all known metabolic reaction pathways for a given
organism
        <xref ref-type="bibr" rid="ref6">(Thiele et al., 2013)</xref>
        . Chemical data from ChEBI, such
as molecular formula, chemical structure and ontology
relationships, can fruitfully be used in the model building and
refining process to improve model accuracy and enhance the
representation of metabolism
        <xref ref-type="bibr" rid="ref5">(Swainston et al., 2011)</xref>
        .
Within this context, efforts are currently underway to
improve ChEBI for systems biology and metabolic modelling.
The enhancements include the addition of a library,
libChEBI, for comprehensive programmatic access to ChEBI
data, which will be widely applicable but with a particular
focus on metabolic modelling. It will include the facility to
determine relationships between molecules, such as
stereochemistry, tautometrism and redox pairings, to calculate
important physicochemical properties, such as pKa and the
Gibbs free energy of formation, and to harness these
facilities in support of developing, merging and expanding
metabolic models. The library will be open source, available in
several programming languages including Java and Python.
ChEBI will be providing a facility for bulk submission of
novel compounds which will be automatically classified
within the ontology. We will also be undertaking curation of
the known metabolomes (i.e. all the known metabolites)
across four major species (human, mouse, E. coli and yeast).
Finally, we will be introducing into the ChEBI public
website novel visualisations of relevance to the systems biology
community, such as chemicals in the context of pathways
        <xref ref-type="bibr" rid="ref1 ref3 ref5">(powered by Reactome, Croft et al., 2011, and
MetaboLights, Haug et al., 2013)</xref>
        and models
        <xref ref-type="bibr" rid="ref4">(powered by
BioModels, Li et al., 2010)</xref>
        .
      </p>
    </sec>
    <sec id="sec-2">
      <title>ACKNOWLEDGEMENTS</title>
      <p>ChEBI is funded by the BBSRC, grant agreement number
BB/K019783/1 within the “Bioinformatics and biological
resources” fund.</p>
      <p>Copyright c 2015 for this paper by its authors. Copying permitted for private and academic purposes</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Croft</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>O'Kelly</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haw</surname>
            <given-names>R</given-names>
          </string-name>
          , et al. (
          <year>2011</year>
          )
          <article-title>Reactome: a database of reactions, pathways and biological processes</article-title>
          .
          <source>Nucleic Acids Res</source>
          .
          <volume>39</volume>
          :
          <fpage>D691</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Hastings J.</given-names>
            ,
            <surname>de Matos</surname>
          </string-name>
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Dekker</surname>
          </string-name>
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Ennis</surname>
          </string-name>
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Harsha</surname>
          </string-name>
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Kale</surname>
          </string-name>
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Muthukrishnan</surname>
          </string-name>
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Owen</surname>
          </string-name>
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Turner</surname>
          </string-name>
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Williams</surname>
          </string-name>
          <string-name>
            <given-names>M.</given-names>
            and
            <surname>Steinbeck</surname>
          </string-name>
          <string-name>
            <surname>C.</surname>
          </string-name>
          (
          <year>2013</year>
          )
          <article-title>The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013</article-title>
          .
          <source>Nucleic Acids Res</source>
          .
          <volume>41</volume>
          :
          <fpage>D456</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Haug</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salek</surname>
            ,
            <given-names>R. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Conesa</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          et al. (
          <year>2013</year>
          )
          <article-title>MetaboLights-an open-access general-purpose repository for metabolomics studies and associated metadata</article-title>
          .
          <source>Nucl. Acids Res</source>
          .
          <volume>41</volume>
          :
          <fpage>D781</fpage>
          -
          <lpage>D786</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Donizelli</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dharuri H</surname>
          </string-name>
          . et al. (
          <year>2010</year>
          )
          <article-title>BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models</article-title>
          .
          <source>BMC Systems Biology</source>
          <volume>4</volume>
          :
          <fpage>92</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Swainston</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendes</surname>
            <given-names>P</given-names>
          </string-name>
          , et al. (
          <year>2011</year>
          )
          <article-title>The SuBliMinaL Toolbox: automating steps in the reconstruction of metabolic networks</article-title>
          .
          <source>Journal of Integrative Bioinformatics</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ):
          <fpage>186</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Thiele</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swainston</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fleming</surname>
          </string-name>
          , R.M.T, et al. (
          <year>2013</year>
          )
          <article-title>A communitydriven global reconstruction of human metabolism</article-title>
          .
          <source>Nature Biotechnology</source>
          <volume>31</volume>
          ,
          <fpage>419</fpage>
          -
          <lpage>425</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>