<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>http://
bioinformatics.oxfordjournals.org/content/20/10/1491.abstract
7. Gieger</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1007/978-3-540-68234-9\_66</article-id>
      <title-group>
        <article-title>Modelling threshold phenomena in OWL: Metabolite concentrations as evidence for disorders</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Janna Hastings</string-name>
          <email>hastings@ebi.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ludger Jansen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christoph Steinbeck</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Schulz</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Chemoinformatics and Metabolism, European Bioinformatics Institute</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Philosophy, RWTH Aachen University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Philosophy, University of Rostock</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Institute of Medical Informatics, Statistics and Documentation, Medical University of Graz</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Swiss Centre for A ective Science, University of Geneva</institution>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2009</year>
      </pub-date>
      <volume>5021</volume>
      <abstract>
        <p>While genomic and proteomic information describe the overall cellular machinery available to an organism, the metabolic pro le of an individual at a given time provides a canvas as to the current physiological state. Concentration levels of relevant metabolites vary under di erent conditions, in particular, in the presence or absence of di erent disorders. Metabolite concentrations thus mediate an important link between chemistry and biology, contributing to a systems-wide understanding of biological processes and pathways. However, there are a number of challenges in the ontological representation of such information. Firstly, concentration information is numeric and ranges over continuous values, while ontologies consist of discrete classes. Secondly, ontologies usually model only what is certain, and their logical formalisms are adapted to reasoning from certain axioms to logical deductions, however, the link between chemicals and diseases via concentration levels, like many threshold phenomena, is both uncertain and vague. In this paper we evaluate the representation of this knowledge using a combination of concrete domains and probabilistic reasoning. We parse concentration values from HMDB and create an ontology able to distinguish normal from abnormal concentrations and able to evaluate a probabilistic risk category for the presence of an associated disorder.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Metabolomics is the study of the small molecule products of metabolic processes
present in living organisms, called metabolites. Concentration levels of di erent
metabolites in the uids of the body provide evidence for which processes have
taken place, and thereby can reliably indicate disorders [17], as well as providing
additional support for functional genomics expression studies [7].</p>
      <p>The ChEBI ontology is an ontology of chemical entities and their roles in
biological contexts, presently containing around 25,000 classes. `Metabolite' is
included in ChEBI as a role which chemical entities take in biological contexts.
ChEBI does not currently provide information on the di ering concentration
levels of metabolites, nor on their association with disorders. This information
is provided by metabolome databases such as the Human Metabolome Database
[18], but these resources are not organised into an ontology, with the
disadvantage that they do not allow for automated reasoning and semantic
computational processing. It is therefore crucial to provide an ontological view on this
metabolomics data, especially in the context of the ChEBI project.</p>
      <p>
        Concentration information has historically been di cult to represent
ontologically, for three reasons:
1. Until recently, OWL did not provide support for de ning classes based on
data value ranges (in Description Logics (DLs) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], these are known as
concrete domains). This functionality is included in OWL 2 DL.
2. The link between concentrations and disorders is not certain, that is, it
is information about what may be associated with a given disorder rather
than what is always an indicator for the disorder. Logical reasoning cannot
directly draw inferences from such associations.
3. The threshold between normal concentration levels and the levels associated
with disorders is vague, that is, there is no hard numeric cuto between
normal concentration levels and disordered concentration levels.
      </p>
      <p>In this paper, we present an approach to representing and reasoning with
metabolite concentration levels associated with disorders, using OWL 2 data
ranges and probabilistic DL reasoning [13] as implemented in Pronto [12], a
probabilistic extension of Pellet [15]. We draw the metabolite data from the
Human Metabolome Database [18].</p>
      <p>Our implementation is guided by the following questions:
1. Can we di erentiate normal from abnormal metabolite concentrations?
2. What is the likelihood that a patient has a given disorder, considering
speci ed values for his/her concentrations of di erent metabolites in bio uids?
3. Can we accumulate the evidence (i.e. increase the likelihood) for the presence
of a given disorder if there are multiple metabolite concentration values
pointing towards it?
1
1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <sec id="sec-2-1">
        <title>ChEBI, metabolomics data and the HMDB</title>
        <p>
          ChEBI is an OBO Foundry [16] ontology for the structural features and biological
roles of biologically interesting chemicals [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Many of the biologically interesting
chemicals are metabolites, which are are found in ChEBI together with their
structural chemical classi cation and their biological roles including `metabolite'.
Roles are associated with chemicals using the has role relationship. However,
there is currently no information formally captured in the ontology as to the
context in which a chemical has a particular role.
        </p>
        <p>The identi cation and annotation of the metabolites found in the human
organism together with associated contextual information such as the
disorders linked to di erent metabolic pro les, is being undertaken by the Human
Metabolome Project [17], from which has arisen the Human Metabolome
Database (HMDB) [18]. HMDB contains physicochemical, spectral, clinical,
biochemical and genomic information for all known human metabolites. Each metabolite
contains an extensive collection of information in text elds and images including
measured concentration values taken from human samples of di erent bio uids
(such as blood, urine, cerebrospinal uid), from persons of di erent ages and
with di erent underlying conditions.</p>
        <p>In this paper, we focus in particular on metabolites for which HMDB contains
both a normal and an abnormal (associated with some disease) concentration
level for an adult subject. The di erence between the normal and abnormal
concentration values indicates a threshold between these scenarios, such that we
would be able to infer the likelihood of a sample concentration being from a
disordered organism by virtue of the numeric value being closer to the known
disordered concentration than to the known normal concentration.
1.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Reasoning with data ranges in OWL 2</title>
        <p>OWL properties are separated between those that range over objects
(descendents of owl:Thing) and those that range over data values for di erent types of
data, such as integers or strings.</p>
        <p>In OWL 2 [8], data restrictions can be used to de ne classes by referring to
an operator and a range of values of a data property, such as strings or integers.
For example (in Manchester syntax [11]),</p>
        <p>Adult subClassOf Human and hasAgeInY ears some int[&gt;= 18]
speci es that Adult s are those Humans that have ages greater than or equal to
18 years.</p>
        <p>
          In the Description Logics underlying the OWL language, such data ranges
are called concrete domains [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Concrete domains are de ned with respect to a
domain over which values can range, and a set of allowed predicates that operate
on that domain. In our example above, the domain over which the hasAgeInYears
data property ranges is the domain of non-negative integers, N0, and the
predicates which operates on that domain (in OWL, allowed predicates correspond
to XSD facets) include ` ', '=', `&gt;'.
        </p>
        <p>For the concentration values being represented in our metabolite
concentration ontology, the domain is non-negative real numbers (R), which we represent
for sake of the necessary precision as XSD doubles (i.e. 64 bit oating point
numbers), and the predicates we use are and .
1.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Probabilistic Description Logics</title>
        <p>While standard Description Logics are designed to represent information that is
certain, as in chemistry it is certain that all members of the class carboxylic acids
contain at least one carboxy group, a recent DL extension allows the association
of probabilistic uncertainty with DL axioms [13].</p>
        <p>Probabilistic DL-based ontologies extend classical DLs with probabilistic
knowledge about classes and properties (known as terminological
probabilistic knowledge) as well as about individuals (known as assertional probabilistic
knowledge). Terminological probabilistic knowledge expresses knowledge about
randomly chosen individuals belonging to classes, that is, generic members of
the class, while assertional probabilistic knowledge is about speci c named
individuals in the knowledge base [13].</p>
        <p>Probabilistic DLs extend traditional DLs with the ability to quantitatively
model and reason with partially overlapping classes (specifying the degree to
which two classes overlap), and to associate with each axiom in the ontology
a probability value which represents the degree of reliability or certainty of the
axiom. It is the latter capability that we will make use of. Probabilistic knowledge
consists of conditional constraints [13].</p>
        <p>
          De nition 1. A conditional constraint is an expression of the form ( j )[l,
u], where and are classes in the ontology, and l and u are real numbers in
the range [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]. Informally, ( j )[l, u] encodes that is a subclass of with
probability between l and u.
        </p>
        <p>For example, we may wish to express the knowledge that if a certain patient
has a measured metabolite concentration within a certain range ( ), then the
probability of them having a certain disorder ( ) is in the range [0.75, 0.85].
2
2.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Creating the ontology</title>
      <sec id="sec-3-1">
        <title>Data extraction and threshold calculation</title>
        <p>The HMDB database was programmatically parsed from the downloadable
metabocards export. Metabolites for which there was both a normal and an abnormal
concentration in the same bio uid, were extracted. The normal and the
abnormal concentrations were then used to generate a threshold condition which was
half-way between the normal and the abnormal, and which was directed in the
direction of the abnormal (either greater than or less than the threshold
depending on which side the abnormal concentration fell).</p>
        <p>For example, a pair of sample values for metabolite D-glucose in blood were
4440 uM for a normal adult and 7000 uM for an adult with the disorder Diabetes
Mellitus Type 2. In this case we create a threshold at 5700 uM, having abnormal
concentrations greater than the threshold.</p>
        <p>
          Note that the threshold being set half-way between normal and abnormal is
an arti cially introduced constraint for the purpose of this paper. Identifying true
thresholds between normal and abnormal concentration levels is of course a much
more complex procedure requiring large numbers of samples and sophisticated
techniques for eliminating noise in the underlying data [
          <xref ref-type="bibr" rid="ref3">6, 3</xref>
          ]. However, for our
purposes in evaluating the representation of such information in OWL, we can
safely ignore this additional complexity.
2.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Populating the OWL ontology with data</title>
        <p>The OWL ontology was created using the OWL API [10] and reasoned over
with a slightly modi ed form of Pronto6 [12]. The full generated ontology,
illustrated in Figure 1, includes data for 48 metabolites associated with 39 di erent
disorders7.</p>
        <p>Note that the ontology shows uid samples as part of organisms, although
this is a simpli cation since uid samples are in actual fact typically no longer
part of an organism, and concentration values may depend on how the sample
was extracted and processed.</p>
        <p>Calculated threshold values were added to the ontology as classes de ned with
data ranges. For example, we fully de ne the class concentration of D-glucose in
Blood associated with Diabetes mellitus type 2 as:
`concentration of D-glucose in Blood associated with Diabetes mellitus type 2'
equivalentTo ( `concentration in blood'
and (hasMetabolite some `portion of D-glucose')
and (hasConcentrationValue some double[&gt;= 5700.0]) )</p>
        <p>In addition to the simpli cation involved in setting the threshold half way
between the normal and abnormal concentrations, there is a deeper underlying
problem with this threshold model. Even if we included an accurate threshold
between normal and abnormal, this threshold represents at the class level what
is generally true across many individuals, but obscures the underlying individual
variance in phenotype and metabolism which might a ect the actual threshold
for each individual. Furthermore, it represents normal and abnormal as a binary
phenomenon whereas in reality there is a continuum between the normal and
the abnormal [14]. Thus, we cannot create a straightforward DL relationship
6 Version 0.2, upgraded to the latest version of the OWL API and Pellet, since data
ranges were not available in the implemented OWL 1.1. version.
7 The ontology (META.owl) and software (META.zip) are available for download from
http://www.ebi.ac.uk/~hastings/concentrations/.
between a given metabolite concentration and a disorder, since, according to the
current model and the underlying DL semantics, each concentration instance
would then be associated with at least one disorder instance. It is to address
this gap that we propose the use of probabilistic DL.
2.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Adding probabilistic constraints</title>
        <p>The challenge is to be able to infer, based on measured metabolite concentration
values, the likelihood of presence of a disorder. We will call this the risk of having
the disorder, given the concentration value of the metabolite. We create classes
for the categories of low, medium and high risk of having the given disorder.
Note that the variation of risk with concentration value can be thought of, as
a simplifying assumption, as a continuously valued function ranging over all
possible concentration values8. However, as Pronto constraints take the form
of intervals associated with classes (or instances), to create a nite number
of OWL classes and associate probability intervals to them, it is necessary to
discretize the probability function into xed ranges. We will do this as illustrated
in Figure 2.</p>
        <p>For example, we fully de ne the class person with low risk of having diabetes
based on their blood glucose level as:
`person with high risk of having Diabetes mellitus type 2 based on Blood sample of D-glucose'
equivalentTo (organism
and hasPart some (bloodSample
and bearerOf some (concentration
and hasMetabolite some `portion of D-glucose'
and hasConcentrationValue some double[&gt;=6840.0] ) ) )
8 In general, associative relationships between symptoms/signs and disorders are more
complex, involving two parameters: (1) the probability of the disorder given the
sign/symptom; and (2) the probability of the sign/symptom given the disorder [9].
We include only the rst.</p>
        <p>We create the relevant low, medium and high risk categories for each uid
type and disorder for the metabolites D-glucose and acetoacetic acid. Although
it is possible to use our software to create such classes for every metabolite in
the ontology, we have selected this subset to reduce overhead for reasoning.</p>
        <p>Finally we create the conditional constraints that associate the given risk
categories for the associated disorder, with a certain probability. We have
arbitrarily selected the following probability ranges for the given risk categories:
Low risk: [0.00;0.24]; Medium risk: [0.25;0.54]; High risk: [0.55;1.00]. As required
by Pronto, conditional constraints are added to the ontology as an annotation
on a subClassOf axiom; these axioms are then removed from the main ontology
and added to the probabilistic knowledge base by the Pronto pre-processor.</p>
        <p>For example, we add the constraint:
`person with high risk of having Diabetes mellitus type 2 based on Blood sample of D-glucose'
subClassOf `person with Diabetes mellitus type 2'</p>
        <p>pronto:certainty "0.55;1.00"
3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results of reasoning</title>
      <p>We test the reasoning capability of the generated ontology corresponding to
the three questions listed in the introduction. As a simple probe, we create
three individuals with di erent metabolite concentration measurements. Table 1
describes the individuals and their blood concentration values.
The rst question tests reasoning with numeric thresholds for inferences about
conditional properties. Here, the conditional property is the suspected presence
of a disorder in the organism whose uid sample has been measured for the
particular metabolite concentration. Testing this is straightforward: the individual
Sally in the ontology is associated with a blood glucose concentration value
of 10000 uM. After classifying using Pellet, Sally's concentration is correctly
classi ed as abnormal.
Answering the second question involves the use of the probabilistic constraints.
To allow interplay between the results of reasoning with the data ranges
expressed in the ontology and the probabilistic reasoning, we executed a two-step
process: rstly, the classical reasoning was performed, and then the inferred
class memberships were asserted back into the probabilistic ontology before
performing the probabilistic reasoning. This allows us to ask Pronto to answer the
question: entail that an individual (e.g. Harry) has the disease Diabetes mellitus
Type 2. In response, Pronto provides a probability range and an explanation,
which refers to the probabilistic constraints used in generating the conclusion.
The results are illustrated in Table 2.</p>
      <p>The results for Harry and Sally are a straightforward result of the risk
categories associated with the classes for which their membership is inferred.
However, that of Barry is more complex since he has multiple concentration
values implicating the disease.
3.3</p>
      <sec id="sec-4-1">
        <title>Reasoning with multiple probabilistic constraints in combination</title>
        <p>Pronto uses linear resolution to determine the probability range entailed by a
set of constraints [12]. There are two scenarios: when multiple constraints can be
resolved (into a probabilistic interval entailment), and when they con ict. Since
Barry has a blood D-glucose concentration in the medium risk range and a blood
acetoacetic acid concentration in the high risk range, and the two ranges do not
con ict, the above result for Barry indicates Pronto's strategy in the absence of
a con ict, resembling a union of the two underlying data ranges.</p>
        <p>When multiple constraints con ict, Pronto prefers more speci c statements to
less speci c. We evaluated this behaviour by changing the medium risk constraint
to overlap with the high risk constraint, setting the upper bound for medium
to 0.55 instead of 0.54. In this case, Pronto concludes that the probability for
Barry having diabetes is [0.55;0.55] { the most speci c (narrowest) resolution. If
the medium risk ranges to 0.6, Pronto entails Barry the range [0.55;0.6]. Thus,
it seems that the behaviour on con ict (at least for the two-axiom scenario we
test here) resembles an intersection of the two underlying data ranges.</p>
        <p>While it remains a task for future work to examine the reasoning behaviour
under more complex scenarios, neither of these results is an optimal
representation of the intuitive requirement driven by the use case: it would be better
if the probabilistic combination of di erent types of evidence for the same
conclusion increased the certainty of the conclusion. However, Pronto does allow
for overriding inherited constraints in more speci c subclasses. Thus, we can
specify a new risk subclass for Barry's combined risk categories, and associate
this with the disease with a new probability range (e.g. [0.54;0.85]). However,
this approach is in general somewhat cumbersome as it would require adding
many more classes and constraints to the knowledge base { for all interesting
combinations of risk factors.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>Metabolomics is the eld which bridges between chemical data and biological
data by investigating the chemical markers for biological processes, and therefore
for their underlying disorders [18]. Accurately modelling the associations between
metabolites and disorders goes beyond traditional OWL modelling constructs.
We have evaluated a probabilistic representation strategy using Pronto. While
probabilistic ontologies have been used to model, e.g. breast cancer risk factors
[12], they have to our knowledge not previously been applied to chemical{disease
associations, nor used in combination with concrete domains. Our prototype has
illustrated the general applicability of the approach, but a more intuitive and
exible solution for reasoning with combined probability constraints would be
mandatory for a real application based on this scenario.</p>
      <p>
        Future work will involve the investigation of alternative probabilistic DL
approaches, such as those which use an underlying Bayesian model [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and
ultimately address the extension of this prototype towards a full implementation
linking ChEBI metabolites to diseases.
      </p>
      <sec id="sec-5-1">
        <title>Acknowledgements</title>
        <p>This work was partly supported by the Deutsche Forschungsgemeinschaft (DFG)
grant JA 1904/2-1, SCHU 2515/1-1 GoodOD (Good Ontology Design) and by
the BBSRC, grant agreement number BB/G022747/1.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nardi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F.</given-names>
          </string-name>
          :
          <article-title>The Description Logic Handbook: Theory, Implementation, and</article-title>
          <string-name>
            <surname>Applications</surname>
          </string-name>
          ,
          <source>2nd Edition</source>
          . Cambridge University Press,
          <volume>2</volume>
          <fpage>edn</fpage>
          .
          <source>(Sep</source>
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Description logics with aggregates and concrete domains</article-title>
          .
          <source>Information Systems</source>
          <volume>28</volume>
          (
          <issue>8</issue>
          ),
          <volume>979</volume>
          {1004 (Dec
          <year>2003</year>
          ), http://www.sciencedirect.com/science/article/B6V0G-481FTVC-1/2/ 06400b60da99c41bc6e07596ff8950c1
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. van den Berg, R.,
          <string-name>
            <surname>Hoefsloot</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Westerhuis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smilde</surname>
          </string-name>
          , A., van der Werf, M.:
          <article-title>Centering, scaling, and transformations: improving the biological information content of metabolomics data</article-title>
          .
          <source>BMC Genomics</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ),
          <volume>142</volume>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. da Costa,
          <string-name>
            <given-names>P.C.G.</given-names>
            ,
            <surname>Laskey</surname>
          </string-name>
          , K.B.:
          <article-title>PR-OWL: A framework for probabilistic ontologies</article-title>
          .
          <source>In: International Conference on Formal Ontology in Information Systems</source>
          . pp.
          <volume>237</volume>
          {
          <issue>249</issue>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. de Matos,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Alcantara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Dekker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Ennis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Hastings</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Haug</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Spiteri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Turner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Steinbeck</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Chemical Entities of Biological Interest: an update</article-title>
          .
          <source>Nucl. Acids Res</source>
          .
          <volume>38</volume>
          ,
          <issue>D249</issue>
          {
          <fpage>D254</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>