<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Taking a view on bio-ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Simon Jupp</string-name>
          <email>jupp@ebi.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrew Gibson</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James Malone</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Helen Parkinson</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert Stevens</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Biosystems Data Analysis, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam</institution>
          ,
          <addr-line>Science Park 904, 1098 XH, Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Functional Genomics Group, European Bioinformatics Institute, Wellcome Trust Genome Campus</institution>
          ,
          <addr-line>Hinxton, Cambridge CB10 1SD</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Netherlands Consortium for Systems Biology, University of Amsterdam</institution>
          ,
          <addr-line>PO Box 94215, 1090 GE, Amsterdam</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Computer Science, The University of Manchester</institution>
          ,
          <addr-line>Oxford Road, Manchester, UK M13 9PL</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present a technique for separating knowledge representation from application specific views that are currently often conflated within bio-ontologies. Many ontologies contain information for two tasks; one to represent the knowledge of some field of interest and another to support an application through providing views over ontologies that present the terms in a useful way for an application. We analyse this phenomenon in some bio-ontologies and suggest this separation of layers as a solution. We leave dedicated ontology languages like OWL and OBO to represent the knowledge of a field of interest, and use a more lightweight vocabulary, namely SKOS, to capture application specific views. We use this technique to encode a number of views inside the Experimental Factor Ontology. Each of these views serves a special purpose to different user communities; however, it does ensure the underlying ontology can remain for the annotation and integration of biological data. OWL and SKOS together provide a powerful, standards based, mechanism to reconstitute annotated biological data for many different application domains.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 INTRODUCTION</title>
      <p>
        Bio-ontologies have become important in the life sciences through
their provision of identifiers for biomedical concepts that are defined
and managed by community processes
        <xref ref-type="bibr" rid="ref13">(Smith et al., 2007)</xref>
        . More
effective integration, analysis and mining of information from life
science datasets is being routinely and experimentally achieved
as a result of their annotation with these concepts
        <xref ref-type="bibr" rid="ref1 ref10 ref14 ref7">(The Gene
Ontology Consortium, 2010; Camon et al., 2004; Noy et al., 2009;
Kapushesky et al., 2011)</xref>
        . Authoritative collections of concepts
have been produced such as those describing, to name a few,
the characteristics of gene products in GO
        <xref ref-type="bibr" rid="ref3">(Consortium, 2000)</xref>
        ,
chemicals in ChEBI
        <xref ref-type="bibr" rid="ref4">(Degtyarenko et al., 2008)</xref>
        or species in the
NCBI taxonomy
        <xref ref-type="bibr" rid="ref5">(Federhen, 2011)</xref>
        . As the value of annotation
with ontology identifiers has been recognised, the number of
bioontologies with different scopes has increased, as well as the number
of concepts described by existing vocabularies
        <xref ref-type="bibr" rid="ref2">(Castro et al., 2010)</xref>
        .
Many bio-ontologies are being used to annotate biomedical data
for use in different query and browsing tools. The extent of the
annotation varies; some gene product data are just annotated with
GO concepts
        <xref ref-type="bibr" rid="ref1">(Camon et al., 2004)</xref>
        whereas other datasets, such as
those submitted to ArrayExpress
        <xref ref-type="bibr" rid="ref11">(Parkinson et al., 2011)</xref>
        , require
annotations that span many fields of interest, from descriptions of
the experiment to the attributes of the differentially expressed genes.
      </p>
      <p>A significant use of bio-ontologies and annotations is to support
the users of applications that wish to view, browse and search
what can be complex and high-dimensional data. The key parts
of bio-ontologies that support this form of use are the hierarchical
structures formed by ‘is a’ and often ‘part of’ relationships between
the concepts, as well as their labels and definitions. Applications
use these components to drive a presentation that allows users to
inspect annotated data using criteria (concepts) with which they are
most familiar. The labels within the ontology support the interaction
between the user and the interface; a simple ontology-driven
autocomplete function in a search box can transform an opaque dataset
into a useable life science application. The hierarchical structure
of bio-ontologies enables query expansion—when querying with
x we also retrieve x and all the children of x as described in the
ontology (e.g. medline / mesh). Bio-ontologies often include other
non-hierarchical relationships between concepts that are specific to
their scope that can also help applications to guide users to data or
content that is semantically related to their query.</p>
      <p>Applications that use bio-ontologies and data annotations to
drive their presentation are faced with knowledge representation
and presentation issues. First, when faced with a large ontology,
the proportion of concepts that are relevant to (the users of)
an application will be low enough that it would detract from
the usability of the application if all of the concepts in the
vocabulary were exposed. Thus applications need a mechanism to
indicate whether or not a particular concept should be available.
Second, applications will often use concepts that cover multiple
biological aspects of a dataset, intersecting parts of several existing
vocabularies. In order to collect, manage and use the concepts
relevant to an application, developers would often like to have
an ontology that only contains those concepts in which they are
interested, or else have a mechanism by which they can annotate
the original ontology so that their application only processes the
relevant content. In this paper we refer to these kinds of collections
of concepts as a ‘view’.</p>
      <p>
        We consider a view to be analogous to an ontology module, in that
the goal is to reuse a subset of concepts from an existing ontology
in a particular setting
        <xref ref-type="bibr" rid="ref12">(Pathak et al., 2009)</xref>
        . Formal ontology
modularisation is a process whereby the logical entailments of a
set of axiomatically defined classes remains the same in both the
original ontology and in the module. In this paper a view is a more
lightweight collection of concepts from one or more ontologies
where the identifiers and annotation components are useful to the
application for navigation and query expansion, but where the
logical entailments of the original ontology, whilst being preserved
in the original ontology, are not required by the application.
      </p>
      <p>The challenge for the developer is how to represent this
information in line with existing standards and methods. Ideally,
such annotations and mechanisms of annotation would also be
available outside of a particular application.</p>
      <p>
        In this paper we present an example of the above challenges
as faced by the iKUP Browser, a web application for querying
multi-omics data held in the Kidney and Urinary Pathway
Knowledgebase
        <xref ref-type="bibr" rid="ref6">(Jupp et al., 2011)</xref>
        . We then present a technique that
separates the requirements of a bio-ontology as a representation of
knowledge in a domain from the requirements of presentation of that
data in an application setting, that is inline with current standards
for bio-ontology representation. We employ features recently
introduced into the OWL2 specification, in combination with a
W3C specification for representing thesauri, controlled vocabularies
and subject heading systems, the Simple Knowledge Organization
System (SKOS). This method is applied to extract SKOS based
views from the Experimental Factor Ontology (EFO)
        <xref ref-type="bibr" rid="ref9">(Malone et al.,
2010)</xref>
        , which is used to curate transcriptomics data and supports
several applications.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>USE CASE</title>
      <p>
        The development of a kidney and urinary pathway knowledge base
clearly demonstrates the need for a separation of the applications
view of the ontology from the underlying knowledge representation.
The Kidney and Urinary Pathway Knowledge Base (KUPKB) is a
Semantic Web application being used by biologists in the study of
kidney disease
        <xref ref-type="bibr" rid="ref8">(Klein et al., 2012)</xref>
        . A Kidney and Urinary pathway
ontology (KUPO) provides the underlying schema and model for
the data held in the KUPKB. KUPO is an application ontology
that brings together subsets of other ontologies to annotate
multiomic high-throughput data from experiments on kidney disease.
KUPO re-uses concepts from existing ontologies wherever possible,
importing concepts from various ontologies, including the cell
(CTO), gene (GO), mouse anatomy (MAO), human disease (HDO),
phenotype (PATO) and experimental factor (EFO). In bringing
concepts together, the KUPO adds new OWL axioms that further
strengthen the descriptions of various classes. For example, cell
types are described in terms of their parts using part of relationships
to the MAO, and cellular functions are related to concepts from the
GO using the capable of relationship. An OWL reasoner is used to
classify the KUPO class hierarchy, that is then used to drive queries
in the KUPKB.
      </p>
      <p>The KUPO and the ontologies it imports provides a rich
underlying model for the KUPKB, where OWL semantics can be
exploited for powerful query answering. In order to ask complex
queries of the KUPKB, technical knowledge of the underlying
ontologies, and Semantic Web query languages like SPARQL are
needed. This kind of interaction is reserved for experts in the field,
thus excluding a wide range of potential end users. To address
this the iKUP browser (http://www.kupkb.org) was built to provide
a user friendly interface to the KUPKB data. The iKUP browser
uses the underlying ontologies to integrate and query the data.
The ontological axioms allow for query expansion and the class
hierarchies are exploited to provide faceted browsing of search
results. However, the ontologies and their imports are not suitably
organised for presentation in a user friendly interface such as the
iKUP for several reasons:</p>
      <p>Classes towards the top of the hierarchy are often sufficiently
abstract that they have little meaning to the user. For example,
upper level classes in the CTO such as ’cell in vivo’ and
’experimentally modified cell’ were not useful to iKUP users.
Some ontologies provide too many levels of granularity. For
example, the NCI taxonomy contains twelve intermediate
categories between humans (taxied:9606) and mammals
(taxid:40674), which are unnecessary to view in iKUP where
users only require a simple classification of species.</p>
      <p>Multiple concepts in the ontology may be suitable root
concepts for navigational purposes. For example, cell type,
disease, species, function are useful root concepts for
navigation, but may not necessarily be default roots within an
ontological context.</p>
      <p>Concepts that a user might expect to be organised
hierarchically do not have true parent/child relationships
in the ontology. For example, displaying partonomy and
developmental relationship as part of the class hierarchy.</p>
      <p>In the case of iKUP, each of these issues were handled in the
user interface code. User feedback was used to determine which
concepts from the ontology should be presented. This approach
proved difficult to maintain and also means the view logic is hidden
in the iKUP code and cannot be exposed in other tools, such as
Prote´ge´, outside of the iKUP applications.</p>
      <p>
        To address some of these issues, the Open Biomedical Ontology
(OBO) community have adopted an annotation mechanism which
they use to generate subsets or ‘slims’ to mark up concepts in
the ontology that belong to a particular view. The Gene Ontology
provides slims of the ontology that are designed for specific
communities or applications. The GO slims give a broad overview
of the ontology content without the detail of the fine grained
concepts. They are useful in applications such as over expression
analysis
        <xref ref-type="bibr" rid="ref15">(Yi et al., 2007)</xref>
        . When converting OBO into OWL the
annotation property (oboInOwl:subset) is used to indicate that
a concept belongs to a particular slim. The OBO slim pattern
benefits from being simple and has been adopted in some notable
bio-ontologies, however, as annotations, they lack any kind of
real semantics so ontology development tools and applications
need specialist knowledge before they can be exploited. OWL
annotations provide an obvious mechanism for encoding view
information and tools like OntoDog 1 have been developed to assist
users in extracting views from an existing ontology based on it’s
annotations. However, there remains no common design patterns
and a real lack of generic tooling to support the creation and
maintenance of these views.
2.1
      </p>
      <sec id="sec-2-1">
        <title>View requirements</title>
        <p>We have in the KUKPB a scenario where two tasks can be
easily conflated: representing entities ontologicallly and also adding
information that aids presentation, navigation and searching within
the application setting. This suggests separating out these two needs
into separate ontology layers and user layers. We have languages
1 http://ontodog.hegroup.org
for the ontology layer, such as OWL and OBO format, that are
widely used to author ontologies. However, a standard method for
capturing this user layer is needed if we want to share these views
between applications. This standard must meet the following basic
requirements:
1. To identify one or more views within an ontology;
2. Assign concepts in the ontology to one or more of the internal
views;
3. Assert semantic relationships between concepts in the ontology
that provide alternative navigational paths around the ontology;
4. Assert anchors in the ontology that indicate root concepts for
presentation.</p>
        <p>We propose that the W3C Simple Knowledge Organisation
System (SKOS) provides a minimal model for capturing these
views within an OWL ontology. Unlike OWL, the emphasis of
SKOS is not so much on the formal (logical or ontological)
representation of the information, but instead provides a schema
in which concepts can be organized in a lightweight fashion for
concept schemes, cataloguing, indexing and information retrieval
tasks. SKOS models concepts as instances of one OWL class
- skos:Concept. By modelling concepts as instances rather than
classes, SKOS shifts the knowledge representation strategy to
a different meta-level. SKOS provides hierarchical properties in
broader and narrower, as well as the non-hierarchical related
property. These semantic relationship lack any formal definition,
however, their semantics are sufficiently defined by the W3C in
order for applications to make assumptions on how they should be
interpreted computationally. One of the major advantages of SKOS
is that it provides a significant amount of support for describing
the annotation components of an ontology, including labels,
definitions and multiple languages, major desirable components
of most biomedical ontologies. Although SKOS concepts cannot
be (logically) defined as extensively as OWL classes, they can be
(usefully) described just as well for most user-facing applications.</p>
        <p>One of the major improvements of OWL2 was the removal of the
constraint that a named OWL entity must be assigned one ’role’ in
an ontology. This ’punning’ strategy means that it is now permitted
to specify, for example, that an entity is both an OWL class and
also an individual - thereby introducing a basic meta-modelling
capability into the Semantic Web suite of specifications. Figure 1
illustrates how we can use meta-modelling to make assertions
between classes in our ontology. In the way that we described OWL
and SKOS above, it initially seems that they are mutually exclusive,
different interpretations of the same thing. Infact we can exploit
OWL2 punning to integrate both types of representation into one.
We can keep the formal axiomatic view as an OWL ontology and
by punning specify a set of SKOS concepts that happen to have the
same name. In doing this, we can now start to refer to the SKOS
individual representations of concepts in a vocabulary as if they
were items in a dataset rather than defined entities as part of a formal
theory.</p>
        <p>SKOS also provides one other key modelling component when
we are considering views: the concept scheme. A concept scheme
is an entity to which SKOS concepts can be mapped with the
skos:inScheme property. It can be used as a way of grouping
together a set of concepts, and the annotation of the concept scheme
can be used to add a description of why those concepts are part
of that concept scheme. We can define anchors within our views
using the skos:topConceptOf property. skos:topConceptOf can
be used to assert that a particular concept should be viewed at the
root of a particular concept scheme.</p>
        <p>This ability to do some meta-modelling in OWL allows us to use
the SKOS vocabulary to make additional assertions on our ontology.
SKOS can be used to index concepts in our ontology, and to define
alternate navigational hierarchies around the ontology. These SKOS
”views” are not intended as a replacement for the OWL, but rather an
extension to the underlying knowledge representation that supports
the application setting. Capturing this information using a standard
vocabulary like SKOS means we can begin to exploit generic SKOS
tooling to support how the views are created, maintained and used.
SKOS can satisfy our previous requirements as follows:
1. For requirement one, we use the skos:ConceptScheme to
represent a particular view within an ontology
2. For requirement two, for each class that is to be included
in the view from the ontology, we add them as an instance
of skos:Concept. These concepts can then be associated
to a particular view, i.e. the skos:ConceptScheme via the
skos:inScheme property.
3. For requirement three we use a combination of SKOS broader,
narrow and related properties to provide the appropriate
structure to the view.
4. For requirement four we use the skos:hasTopConcept
property to assert that particular concepts are anchors, or root
concepts in a particular view.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>THE EXPERIMENTAL FACTOR ONTOLOGY</title>
      <p>
        To demonstrate the technique we attempt to extract a subset of
views that are encoded with the Experimental Factor Ontology
(EFO). The EFO is an application ontology developed to describe
experimental variables used in transcriptomics data
        <xref ref-type="bibr" rid="ref9">(Malone
et al., 2010)</xref>
        . EFO brings parts of disparate bio-ontologies
together to provide an application ontology for both annotating
the data, and data exploration through tools such as the Gene
Expression Atlas
        <xref ref-type="bibr" rid="ref7">(Kapushesky et al., 2011)</xref>
        and the ArrayExpress
Archive
        <xref ref-type="bibr" rid="ref11">(Parkinson et al., 2011)</xref>
        . Data are initially annotated with
ontology classes which enable more powerful searching, such as
synonym expansion and traversing hierarchies based on an ontology
view. The latter however, has presented challenges in the way EFO
has been developed.
      </p>
      <p>In constructing the EFO application ontology several expedient
representational compromises were made. EFO includes two
annotation properties that are used as ‘flags’ to indicate that a class
should be either hidden from view in the application or that it should
be used as an anchor, i.e. a starting point at which to begin browsing
the hierarchy. Hidden flags are often used on classes such as upper
level ontology classes such as those from the basic formal ontology
(BFO), which are alien to a biologist user. Anchor flags are used
on classes such as cell line and disease, these indicate common
starting points of interest to users navigating the ontology in an
application scenario.</p>
      <p>New applications adopting the approach of EFO, such as the
Genome Wide Association Study (GWAS) browser 2 and the
European Nucleotide Archive (ENA) 3 each require subsets of
classes within EFO, but viewed in bespoke ways to suit their
application needs. We have, therefore, a situation where two
tasks have been conflated in the EFO: representing entities in
transcriptomics experiments and also adding information that aids
presentation, navigation and searching within the application
setting. In order to avoid duplicating and minting of new terms to
serve specific application requirements, these additional views are
embedded within the ontology using both logical OWL axioms and
specialised annotation properties. To demonstrate the applicability
of SKOS we extracted three separate views from the EFO OWL file
and represented them in SKOS. We then show how these views can
be visualised alongside the original EFO ontology using a generic
SKOS tool called SKOSEd.
3.1</p>
      <sec id="sec-3-1">
        <title>Generating EFO SKOS</title>
        <p>We converted the various views in EFO to SKOS using bespoke
scripts that extract the views based on existing annotations and
convert them into SKOS concept schemes. These scripts are written
with the Java OWL API (version 3) and SKOS API (version 3).
These views are available to download from BioPortal under the
EFO ”views” section. These views can be views in any valid SKOS
aware application. The Prote´ge´ 4.1 4 SKOSEd plugin 5 was used to
view and evaluate the SKOS conversions.</p>
        <p>For each view we follow the basic pattern:</p>
        <p>Classify EFO with the HermiT 6 reasoner</p>
        <sec id="sec-3-1-1">
          <title>Create a SKOS concept scheme for the view</title>
          <p>All classes that are flagged as organisational classes are
discarded
All classes flagged as part of a the current view are converted
to SKOS Concepts and added to the Concept Scheme. As the
same URI is used for Class and Concept, all annotations such
as labels are preserved.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>2 http://www.genome.gov/gwastudies/</title>
        </sec>
        <sec id="sec-3-1-3">
          <title>3 http://www.ebi.ac.uk/ena</title>
        </sec>
        <sec id="sec-3-1-4">
          <title>4 http://protege.stanford.edu</title>
        </sec>
        <sec id="sec-3-1-5">
          <title>5 http://code.google.com/p/skoseditor</title>
        </sec>
        <sec id="sec-3-1-6">
          <title>6 http://www.hermit-reasoner.com/</title>
          <p>All superclasses of the flagged classes up to hidden
organisational classes are added as skos:broader assertion
(corresponding skos:narrower assertions are also added
Any classes flagged as branch or anchor classes are asserted
as top concepts using skos:hasTopConcept in the current
concept scheme
Any flagged properties are mapped to SKOS semantic
relationship. If a property is mapped to a skos relationship
then class restrictions along the mapped property get translated
to the appropriate skos relationship e.g. If part of is mapped
to skos:broader then (subClassOf (X, part of some Y)
becomes (X skos:broader Y).</p>
          <p>The first view extracted is the EFO basic view. This view is
currently used to serve both the ArrayExpress and gene expression
atlas query expansion and results summary view applications. In
this conversion all classes apart from organisation classes were
converted for the view. Additionally, the part of relationship was
mapped to skos:broader in order to incorporate the partonomy
views into the concept hierarchy.</p>
          <p>The second view represents a view generated for the GWAS
catalogue terms. By capturing our view in a standard language
like SKOS we can begin to exploit existing SKOS aware tools to
visualise the GWAS view in EFO for the first time. Figure 2 shows
a portion of the GWAS viewed in the generic SKOSEd extension
for Prote´ge´, no special configuration was required for this view to
be exposed. The terms present in this view are selected by GWAS
annotators for the annotation of studies submitted to the GWAS
catalogue. These terms are currently used to populate a drop down
list, but will soon form part of a more sophisticated information
retrieval system. Only classes flagged as ”gwas” and their parent
classes were converted to SKOS concepts; all organisation classes
were ignored.</p>
          <p>The third view represents a subset of terms from the European
Nucleotide Archive (ENA) 7. These terms are used to annotate
submission to the ENA, these annotations are used to describe
submitted datasets which are used by other databases, such as
ArrayExpress. The ENA currently only requires few terms and have
little in the way of hierarchy. They have their own categories for
terms, these categories have no natural place within EFO. In the
case of ENA we define some additional concepts within our view
in order to categorise some of the terms from EFO for the ENA
application.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>DISCUSSION</title>
      <p>Bio-ontologies are primarily used to represent domain knowledge
from areas of interest to the community, however, the application of
such ontologies to data and data providing services is of increasing
importance. In the case of applications, we need to separate the
concerns of knowledge representation and user presentation—a
classic software engineering approach. We leave the ontology as
an ontology (in OWL or OBO format) and capture application
knowledge in a SKOS representation, a simple transformation
which is suited to the needs of an application or for local problem
solving. Such a separation also means we can have different
application specific user layers for the same knowledge layer or</p>
      <sec id="sec-4-1">
        <title>7 http://www.ebi.ac.uk/ena</title>
        <p>ontology, without undermining ongoing work to make domain
ontologies interoperable.</p>
        <p>The problems encountered developing applications around the
KUPO and EFO highlight a scenario that will emerge many times
over as more bioinformatics tools move to exploiting ontologies in
user facing applications. Whilst other similar patterns may emerge,
the approach outlined in the paper demonstrates how aligning to
a standard vocabulary language like SKOS allows us to exploit
existing infrastructure. The views extracted from EFO allow the
application developers to visualise the application views in ways
that were not previously possible in standard ontology editing
environments. SKOS provides one means to share these views
across communities and applications, and is an attractive solution
for the scenarios outlined in this paper.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION</title>
      <p>The SKOS vocabulary has some adoption in the life science
ontologies—in particular the labelling and mapping properties.
However, the notion of concept schemes and semantic relationships
have been less well adopted, and these are components that
fulfil our requirements. By taking standard approaches we allow
existing tools that consume SKOS access to the terminological
information of bio-ontologies. There is now a need for better
tool support to enable life scientists to work with SKOS more
easily. This paper demonstrates how separating the concerns of
knowledge representation and user presentation into layers and
adopting standards such as SKOS offers new possibilities for data
sharing and re-use.</p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGEMENTS</title>
      <p>AG is supported by the BioRange programme of The Netherlands
Bioinformatics Centre (NBIC; http://www.nbic.nl), supported by a
BSIK grant through The Netherlands Genomics Initiative (NGI) and
the research programme of the Netherlands Consortium for Systems
Biology (NCSB), which is part of the Netherlands Genomics
Initiative/Netherlands Organization for Scientific Research. We
acknowledge funds from EMBL (JM, HP) and The National
Center for Biomedical Ontology, one of the National Centers for
Biomedical Computing supported by the NHGRI, the NHLBI, and
the NIH Common Fund under grant U54-HG004028 (SJ).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Camon</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Magrane</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barrell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dimmer</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maslen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Binns</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harte</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Apweiler</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>The Gene Ontology Annotation (GOA) database: sharing knowledge in Uniprot with Gene Ontology</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>32</volume>
          (
          <issue>suppl 1</issue>
          ),
          <fpage>D262</fpage>
          -
          <lpage>D266</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Castro</surname>
            ,
            <given-names>A. G.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            <surname>'Neill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Garca-Castro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. J.</given-names>
            ,
            <surname>Lord</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. W.</given-names>
            ,
            <surname>Stevens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            , Corcho, ., and
            <surname>Gibson</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Developing ontologies within decentralised settings</article-title>
          . In H. Chen,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          K.-H. Cheung, editors,
          <source>Semantic e-Science</source>
          , volume
          <volume>11</volume>
          <source>of Annals of Information Systems</source>
          , pages
          <fpage>99</fpage>
          -
          <lpage>139</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Consortium</surname>
            ,
            <given-names>T. G. O.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Gene Ontology: Tool for the Unification of Biology</article-title>
          .
          <source>Nature Genetics</source>
          ,
          <volume>25</volume>
          ,
          <fpage>25</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Degtyarenko</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Matos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ennis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zbinden</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McNaught</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alcntara</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darsow</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guedj</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>ChEBI: a database and ontology for chemical entities of biological interest</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>36</volume>
          (
          <issue>suppl 1</issue>
          ),
          <fpage>D344</fpage>
          -
          <lpage>D350</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Federhen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>The NCBI taxonomy database</article-title>
          .
          <source>Nucleic Acids Research</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Jupp</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schanstra</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Developing a kidney and urinary pathway knowledge base</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          ,
          <volume>2</volume>
          (
          <issue>Suppl 2</issue>
          ),
          <fpage>S7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Kapushesky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adamusiak</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burdett</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Culhane</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farne</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Filippov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holloway</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klebanov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kryvych</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurbatova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurnosov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malone</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melnichuk</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petryszak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pultsin</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rustici</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tikhonov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Travillian</surname>
            ,
            <given-names>R. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zorin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Brazma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Gene Expression Atlas update - a value-added database of microarray and sequencing-based functional genomics experiments</article-title>
          .
          <source>Nucleic Acids Research</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jupp</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moulos</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Buffin-Meyer,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Casemayou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chaaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Charonis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Bascands</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Stevens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            , and
            <surname>Schanstra</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>A novel web application to access multi-omics data on kidney disease</article-title>
          .
          <source>Faseb J</source>
          , In press.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Malone</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holloway</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adamusiak</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kapushesky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolesnikov</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhukova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brazma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Modeling sample variables with an Experimental Factor Ontology</article-title>
          . Bioinformatics,
          <volume>26</volume>
          (
          <issue>8</issue>
          ),
          <fpage>1112</fpage>
          -
          <lpage>1118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Noy</surname>
          </string-name>
          , . . F.,
          <string-name>
            <surname>Shah</surname>
          </string-name>
          , . . H., Whetzel, . . L.,
          <string-name>
            <surname>Dai</surname>
          </string-name>
          , . ., Dorf, . ., Griffith, . ., Jonquet, . .,
          <string-name>
            <surname>Rubin</surname>
          </string-name>
          , . . L.,
          <string-name>
            <surname>Storey</surname>
            , . .-
            <given-names>A.</given-names>
            , Chute, . . G., and Musen, . . A.
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>BioPortal: ontologies and integrated data resources at the click of a mouse</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>37</volume>
          (
          <issue>suppl 2</issue>
          ),
          <fpage>W170</fpage>
          -
          <lpage>W173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarkans</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolesnikov</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abeygunawardena</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burdett</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dylag</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Emam</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farne</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holloway</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurbatova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lukk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malone</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pilicheva</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rustici</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adamusiak</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brandizi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sklyar</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Brazma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Arrayexpress update - an archive of microarray and high-throughput sequencing-based functional genomics experiments</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>39</volume>
          (
          <issue>suppl 1</issue>
          ),
          <fpage>D1002</fpage>
          -
          <lpage>D1004</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Pathak</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Johnson, T. M., and
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C. G.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Survey of modular ontology techniques and their applications in the biomedical domain</article-title>
          .
          <source>Integr. Comput</source>
          .-Aided
          <string-name>
            <surname>Eng</surname>
          </string-name>
          .,
          <volume>16</volume>
          ,
          <fpage>225</fpage>
          -
          <lpage>242</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>L. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eilbeck</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ireland</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mungall</surname>
            ,
            <given-names>C. J.</given-names>
          </string-name>
          ,
          <source>The OBI Consortium</source>
          , Leontis,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>RoccaSerra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Ruttenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Sansone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-A.</given-names>
            ,
            <surname>Scheuermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. H.</given-names>
            ,
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Whetzel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. L.</given-names>
            , and
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat Biotechnol</source>
          ,
          <volume>25</volume>
          (
          <issue>11</issue>
          ),
          <fpage>1251</fpage>
          -
          <lpage>1255</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>The</given-names>
            <surname>Gene Ontology Consortium</surname>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>The Gene Ontology in 2010: extensions and refinements</article-title>
          .
          <source>Nucleic Acids Research</source>
          ,
          <volume>38</volume>
          ,
          <fpage>D331</fpage>
          -
          <lpage>D335</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Yi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sze</surname>
          </string-name>
          , S.-H., and
          <string-name>
            <surname>Thon</surname>
            ,
            <given-names>M. R.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Identifying clusters of functionally related genes in genomes</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>23</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1053</fpage>
          -
          <lpage>1060</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>