<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Formalization of indicators of diagnostic performance in a realist ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adrien Barton</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Régis Duvauferrier</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anita Burgun</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CHU de Martinique</institution>
          ,
          <addr-line>Université Antilles-Guyane</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>INSERM UMR 1099, LSTI</institution>
          ,
          <addr-line>Rennes</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>INSERM UMR 1138 team 22, Centre de Recherche des Cordeliers</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>The Institute of Scientific and Industrial Research, Osaka University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <abstract>
        <p>We  present  a  formalization  of  indicators  of  diagnostic  performance   (sensitivity,   specificity,   positive   predictive   value   and   negative   predic-­tive  value)  in  the  context  of  a  realist  ontology.  We  dissociate  the  indica-­tors   of   diagnostic   performance   from   their   estimations   and   argue   that   the  former  should  be  represented  in  a  first  place  in  biomedical  ontolo-­gies.   Our   formalization   does   not   require   to   introduce   any   possible,   non-­‐actual  entities  -­‐  like  the  result  a  person  would  get  if  a  medical  test   would  be  performed  on  her  -­‐  and  is  therefore  acceptable  in  an  ontology   built  in  a  realist  spirit.  We  formalize  an  indicator  of  diagnostic  perfor-­mance  as  a  data  item  that  is  about  a  disposition  borne  by  a  group;  the   diagnostic   value   of   this   indicator   is   given   by   the   objective   probability   value  assigned  to  this  disposition.  </p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION 1</title>
      <p>1.1</p>
      <sec id="sec-1-1">
        <title>Definition of indicators of diagnostic performance</title>
        <p>Biomedical ontologies aim at providing the most exhaustive
and rigorous representation of reality as described by
biomedical sciences. A large part of medical reasoning
concerns diagnosis and is essentially probabilistic. It would be
an asset for biomedical ontologies to be able to support such
a probabilistic reasoning.</p>
        <p>
          <xref ref-type="bibr" rid="ref7">Ledley &amp; Lusted (1959)</xref>
          ’s seminal article on Bayesian
reasoning in medicine defines different kind of probabilistic
entities. Consider for example the simple case of an instance
of test of type A aiming at detecting if a patient in a group g
has an instance of disease of type M1. The performance of
test A in diagnosing M can be quantified by the positive
predictive value of this test, hereafter abbreviated PPV, and
generally defined as the proportion of people who have the
disease among those who would be tested positive by A in g
(that is, the proportion of true positives among positives);
and by the negative predictive value, hereafter abbreviated
NPV, and generally defined as the proportion of people who
do not have the disease among those who would be tested
negative by A in g (that is, the proportion of true negatives
among negatives). Those two values provide the probability,
once the result of test A is observed, that the patient has the
disease M.
1 These will be abbreviated in the following as “a test A” and “the patient
has M”.
        </p>
        <p>However, such positive and negative predictive values
are typically not available in the scientific literature. Instead,
they are generally computed from other probabilistic values,
namely: the prevalence value of M in g, generally defined as
the proportion of people who have the disease M in g, and
hereafter abbreviated Prev(g,M); the sensitivity value of the
test A for M in g, generally defined as the proportion of
people who would get a positive result by A among those who
have the disease M in g (that is, the proportion of true
positives among diseased), hereafter abbreviated Se(g,A,M); and
the specificity value of A for M, generally defined as the
proportion of people who would get a negative result by A
among those who do not have the disease M in g (that is, the
proportion of true negatives among non-diseased), hereafter
abbreviated Sp(g,A,M). As a matter of fact, these values are
related through the following Bayesian equations:
PPV(g, A, M) =
NPV(g, A, M) =</p>
        <p>Prev(g, M) Se(g, A, M)
Prev(g, M) Se(g, A, M) + (1- Prev(g, M)) (1- Sp(g, A, M))</p>
        <p>(1- Prev(g,M)) Sp(g,A, M)</p>
        <p>Prev(g,M) (1- Se(g,A, M)) + (1- Prev(g,M)) Sp(g,A, M)</p>
        <p>
          In the wake of
          <xref ref-type="bibr" rid="ref7">Ledley &amp; Lusted (1959)</xref>
          , the sensitivity
and specificity values have often been considered as
depending only on the pathophysiological characteristics of
the disease, and thus as independent of the group of people
under consideration. However, sensitivity and specificity
values do in fact depend upon the group under
consideration: this is the “spectrum effect”
          <xref ref-type="bibr" rid="ref2 ref3">(Brenner &amp; Gefeller,
1997; for a detailed explanation, see Barton, Duvauferrier &amp;
Burgun, 2015)</xref>
          . Spectrum effect can be manifested, for
example, as a dependence of sensitivity and specificity on the
degree of severity of the disease in the group under
consideration
          <xref ref-type="bibr" rid="ref9">(Park, Yokota, Gill, El Rassi, &amp; McFarland, 2005)</xref>
          .
        </p>
        <p>In the remainder of the articles, sensitivity, specificity,
PPV and NPV will be called “indicators of diagnostic
performance” and abbreviated “IDPs”.
1.2</p>
      </sec>
      <sec id="sec-1-2">
        <title>The challenge of representing indicators of diagnostic performance in an ontology</title>
        <p>
          To the extent that they aim at representing biomedical
knowledge and enabling medical reasoning, biomedical
ontologies should provide a formalization of IDPs as well as
the prevalence. This article will propose such a
formalization in the context of the OBO Foundry
          <xref ref-type="bibr" rid="ref12">(Smith et al., 2007)</xref>
          ,
one of the most massive sets of interoperable ontologies in
the biomedical domain, built on the upper ontology BFO.
        </p>
        <p>
          The question of how probabilistic notions can be
represented in ontologies has been tackled from different
perspectives in the past. For example, da
          <xref ref-type="bibr" rid="ref5">Costa et al. (2008)</xref>
          have proposed the new PR-OWL format that extends the
classical OWL format; we take here a different approach,
which does not aim at changing the OWL format.
          <xref ref-type="bibr" rid="ref14">Soldatova, Rzhetsky, De Grave, &amp; King (2013</xref>
          ) have described a
model in which probabilities can be assigned to research
statements. We have proposed an alternative approach
          <xref ref-type="bibr" rid="ref1">(Barton, Burgun, &amp; Duvauferrier, 2012)</xref>
          in which we show
how probabilities can be assigned to dispositions, upon
which we are going to build here.
        </p>
        <p>
          Sensitivity and specificity have been recently introduced
in the Ontology of Biological and Clinical Statistics
          <xref ref-type="bibr" rid="ref15">(OBCS;
Zheng et al., 2014)</xref>
          as subclasses of Data item – a
classification that we will endorse here, and extend to PPV and NPV.
A data item, as defined by the Information Artifact
Ontology (IAO), is intended to be a truthful statement about
something. In order to formalize IDPs, one should thus clarify
what entities in the real world they are about.
        </p>
        <p>Sensitivity value2, as we said, is generally defined as the
proportion of people who would get a positive result by A
among those who have the disease M. But note here the
conditional structure: what is referred to is the proportion of
true positives among diseased if A was performed on them.
In practical situations, however, the sensitivity value will be
estimated by performing the test on a sample of the
population only – not the entire population g. This will lead to two
difficulties. First, it will be necessary to differentiate clearly
the IDPs’ values from their estimations, and to determine
which of those should be represented in a first place in an
ontology – part 2 will be devoted to this issue. Second,
possible-but-non-actual situations cannot be straightforwardly
defined in a realist ontology like BFO; this problem will be
explained and solved in part 3, by considering that an IDP is
a data item about a disposition borne by an instance of
group of individuals, whose probability value will be
identified to the diagnostic value of the IDP. This will provide a
formal characterization of IDPs.
2
2.1</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>THE INDICATORS AND THEIR</title>
    </sec>
    <sec id="sec-3">
      <title>ESTIMATIONS</title>
      <sec id="sec-3-1">
        <title>Two limits for the estimations of indicators of diagnostic performance</title>
        <p>
          Numerical estimations of IDPs face two limits
          <xref ref-type="bibr" rid="ref2">(Barton et
al., 2015)</xref>
          . First, frequencies will be measured on a sample
2 Note the distinction between a sensitivity and its value: a sensitivity is a
data item, but its value is a number.
judged to be representative of the population as a whole, and
these values are then extrapolated to the frequencies in the
entire population. Second, whether a given person has M or
not cannot generally be known for sure, through reasonable
means: sometimes, the only way to be certain is to perform
an autopsy on the deceased patient. Consequently, a “gold
standard” must be chosen, namely the best reasonable
available diagnostic test3. If a patient gets a positive result to this
gold standard test, one will conclude that he has the disease;
if he gets a negative result, one will conclude that he does
not have it.
        </p>
        <p>
          For example,
          <xref ref-type="bibr" rid="ref9">Park et al. (2005)</xref>
          estimate the sensitivity of
the Neer test for diagnosing the impingement syndrome;
their estimation is made on a sample of 552 patients
considered as representative of the general population, using as
gold standard surgical observation. The proportion of
patients tested positive by the Neer test among those who are
tested positive by surgical operation in the sample is
considered as representative of the sensitivity value - which can
be interpreted as the proportion of people who would be
tested positive by the Neer test among those who have an
impingement syndrome in the whole population. Similar
estimation strategies hold for prevalence, specificity, PPV
and NPV.
        </p>
        <p>Note that the estimations of the values of the prevalence,
sensitivity, specificity, PPV and NPV depend on both the
sample and the gold standard; however, the real values of
the prevalence, sensitivity, specificity, PPV and NPV, as
defined above, depend neither on the sample, nor on the
gold standard.
2.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>What should be represented in an ontology?</title>
        <p>This being clarified, one can ask which entities should be
preferably represented in an ontology: the IDPs’ values, or
their estimations?</p>
        <p>For sure, we have no direct access to such IDPs’ values;
but this does not imply that they should not be represented
in an ontology. To clarify why, consider an analogy: the
measure of the ambient temperature by reading the height of
a mercury column in a thermometer. Suppose that at a given
time, this height is aligned with the sign “20 °C” written on
the thermometer. In such a case, an ontology curator would
be in a first place interested in formalizing the fact that the
ambient temperature is 20°C, rather than in formalizing the
fact that the mercury column in the thermometer is at the
same height as the sign “20°C”.</p>
        <p>
          In a similar fashion, imagine that 65% of people are
tested positive for a gold standard of M in a sample s of a
population g. The ontology should then formalize in a first place
the fact that 65% of the people in g have M, rather than the
3 Even if the gold standard consists in the naked-eye observation of a
macroscopic disorder associated exclusively with this disease, this can still
theoretically lead to a diagnostic error: any empirical evidence is
defeasible.
fact that 65% of the people in s have a positive result to this
gold standard. This estimation of this prevalence value may
be false (it is indeed very likely to be false, strictly
speaking), but future estimations will lead to its being corrected to
bring it closer to the real value. As a matter of fact, realist
ontologies are built according to a fallibilist methodology
          <xref ref-type="bibr" rid="ref13">(Smith &amp; Ceusters, 2010)</xref>
          : they represent the state of the
world according to our best knowledge at the present
instant, and can be corrected as our knowledge of the world is
refined.
        </p>
        <p>
          That being said, it is possible to represent in an ontology
the measurement process of a temperature involving the
height of a mercury column in a thermometer. Similarly,
one could represent the different estimation processes of the
IDPs, and the results to which they led. Such processes are
biomedical investigations, and should therefore be
formalized in an ontology like OBI
          <xref ref-type="bibr" rid="ref4">(Ontology for Biomedical
Investigations, Brinkman et al., 2010)</xref>
          , a prominent OBO
Foundry candidate dedicated to these issues. This would be
relevant in order to formalize in an ontology different
estimations given by various samples and gold standards.
However, medical practitioners are first and foremost interested
in the IDPs’ values themselves, rather than in their
estimations, and thus we will deal here with the formalization of
the former.
        </p>
        <p>This clarification being made, we can now consider the
second difficulty mentioned at the end of part 1, namely the
formalization of possible-but-non-actual situations in BFO.
3</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>A FORMALIZATION OF INDICATORS OF</title>
    </sec>
    <sec id="sec-5">
      <title>DIAGNOSTIC PERFORMANCE IN APPLIED</title>
    </sec>
    <sec id="sec-6">
      <title>ONTOLOGIES</title>
      <p>Sensitivity value has been interpreted as the proportion of
people who would get a positive result to A among M’s
bearers in g. This definition thus involves the condition of
performing the test A on the members of g. As we said, such
a condition is never realized, because the test is performed
(at best) on a sample of the population, not on the whole
population g: the performance of test A on g’s members is a
possible (leaving aside practical difficulties), non-actual
condition. Interpreting specificity, PPV, and NPV along the
former lines would also imply such possible, non-actual
conditions.</p>
      <p>
        However, BFO is built according to the realist
methodology, which implies that all the instances it recognizes should
be actual entities
        <xref ref-type="bibr" rid="ref13">(cf. Smith &amp; Ceusters, 2010)</xref>
        . Thus, one
cannot represent directly such a possible-but-not-actual
condition in an ontology based on BFO. In order to solve
this difficulty, we will introduce a strategy named
“randomization”, enabling to formalize the probabilities of interest as
assigned to an actual entity, namely a disposition. This
strategy will enable to represent IDPs in a realist fashion,
compliant with BFO’s spirit.
3.1
      </p>
      <sec id="sec-6-1">
        <title>From proportions to objective probabilities: the randomization strategy</title>
        <p>We will explain first how the proportion of a subgroup in a
group can be formalized as a probability value assigned to a
disposition; this will help explaining later how the
proportion of a subgroup in a group undergoing a possible,
nonactual condition can be formalized along similar lines.</p>
        <p>
          Dispositions are entities that can exist without being
manifested; an example of disposition is the fragility of a
glass, which can exist even when the glass does not break.
We will use
          <xref ref-type="bibr" rid="ref10">Röhl &amp; Jansen's (2011)</xref>
          model of disposition in
BFO, which associates to every instance of disposition one
or several instances of realizations, and one or several
instances of triggers (a trigger is the specific process that can
lead to a realization occurring). In this model, the fragility
of a glass is a disposition of the glass to break (the breaking
process is the realization) when it undergoes some kind of
stress (the process of undergoing such a stress is the
trigger); this disposition inheres in the glass. Starting with the
definition of these entities and their relations at the instance
level, Röhl &amp; Jansen proceed to formalize them at the
universal level. We have shown in a former article
          <xref ref-type="bibr" rid="ref1">(Barton,
Burgun &amp; Duvauferrier, 2012)</xref>
          how to adapt this model to
probabilistic dispositions. Thus, an instance of balanced
coin is the bearer of a disposition instance to fall on heads
(the realization process) when it is tossed (the trigger
process), to which an objective probability 1/2 can be assigned.
        </p>
        <p>
          We will now apply this model to the situation at hand.
Consider the prevalence Prev(g,M), which was defined
above as the proportion of bearers of M in the actual
population g. We can define the disposition dg,M, borne by the
group g, that a person randomly drawn in g has M. More
specifically, let’s write Tg the process “randomly drawing a
person in g”, and Rg,M the process “drawing by Tg someone
who has M”: the triggers of dg,M are instances of Tg and its
realizations are instances of Rg,M. Following the lines of
          <xref ref-type="bibr" rid="ref1">Barton et al. (2012)</xref>
          , one can thus define the probability
assigned to the disposition4 dg,M, which is the probability of
drawing randomly someone who has M in g. This
probability is equal to the proportion of individuals who have M in
g, that is, to Prev(g,M): as a matter of fact, if there are
e.g. 10% diseased people in g, then the probability of
drawing randomly a diseased person in g is 10%. Thus, the
prevalence value can be identified to the objective probability
assigned to the disposition dg,M. We name this strategy the
“randomization” of the proportion of M’s bearers among g.
4 In
          <xref ref-type="bibr" rid="ref1">Barton et al. (2012)</xref>
          , a probability was assigned to a triplet (d, T, R)
rather than to a disposition d, because we had to take into account
disposition that may have several classes of triggers or realizations
          <xref ref-type="bibr" rid="ref10 ref6">(that is,
multitrigger and multi-track dispositions, cf. Röhl &amp; Jansen, 2011)</xref>
          . However, in
the present situation, dg,M is simple-trigger and simple-track: all its triggers
are instances of Tg , and all its realizations are instances of Rg,M. Therefore,
the probability value assigned to (dg,M, Tg , Rg,M) can be, for practical
matters, assigned directly to dg,M.
        </p>
        <p>The randomization strategy may not be necessary to
formalize a prevalence, which characterizes a proportion in an
actual group, and thus could be formalized as such in an
ontology based on BFO. But this strategy can also be
applied to proportions of people in groups subject to a
possible, non-actual condition – and thus, be relevant to
formalize sensitivity and other IDPs. As a matter of fact, the
sensitivity value Se(g,A,M) was defined as the proportion of
people who would get a positive result to A among M’s bearers
in g. This value can be “randomized” as follows. We can
define dSe,g,A,M as the disposition to draw someone randomly
who is tested positive by A, among the individuals of g who
have M. More specifically, let’s define the process
TSe,g,A,M as the “performance of test A on the individuals in g,
and random draw of an individual among those who have
the disease M”5; and the process RSe,g,A,M as the “drawing by
TSe,g,A,M of someone who got a positive result to A”. The
triggers of dSe,g,A,M are instances of TSe,g,A,M, and its
realizations are instances of RSe,g,A,M . One can then define the
sensitivity value Se(g,A,M) as the objective probability
assigned to this disposition dSe,g,A,M,: indeed, if there are e.g.
15% of the diseased people in g who would get a positive
result by A, then the probability of randomly drawing
someone who would get a positive test result by A among
diseased people in g is equal to 15%.</p>
        <p>Specificity value can be defined along similar lines, as
probabilities assigned to actual dispositions borne by the
group g noted dSp,g,A,M (and similarly for the PPV and NPV).
Although dSe,g,A,M and dSp,g,A,M are both dispositions inhering
in g, they have different triggers and different realizations;
the process TSp,g,A,M is the “performance of test A on the
individuals in g, and random draw of an individual among
those who do not have the disease M” and the process
RSp,g,A,M is the “drawing by TSp,g,A,M of someone who got a
negative result to A”.
3.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>A formal model of indicators of diagnostic performance in ontologies</title>
        <p>Let us now consider how to formalize these probability
values in ontologies. First, a group g will be considered as any
collection of humans (for more on collections, see Jansen &amp;
Schultz, 2010). dSe,g,A,M is a disposition individual inhering
in the group g; and a probability value can be assigned to
this disposition using a datatype property
has_probability_value. Sensitivity of A for M in g will be
denoted Seg,A,M, and following OBCS, it will be defined as a
data item. Thanks to our analysis above, we can now answer
our original question, and state what this sensitivity is about:
Seg,A,M is_about dSe,g,A,M. We can also introduce a relation
has_diagnostic_value that relates a sensitivity to its value.
5 In general, we cannot determine in practice with certainty which
individuals of g have M, and which do not; but the practical impossibility to realize
this trigger does not preclude to define this entity.</p>
        <p>In our framework, the (diagnostic) value of a sensitivity
Seg,A,M is the probability value assigned to the disposition
dSe,g,A,M; this can be formalized by writing that if s is a
sensitivity, then:</p>
        <p>s has_diagnostic_value p ⇔ ∃ d ∧ d is_a
Disposition ∧ s is_about d ∧ d has_probability_value p</p>
        <p>
          As dSe,g,A,M is an individual, it cannot be related directly to
the universals A and M. However, it can be related indirectly
to them, by the following formalization. First, dSe,g,A,M can
be seen as an instance of a disposition universal symbolized
as DSe,A,M, which has as trigger the processus universal
TSe,A,M: “performance of test A on the members of a group,
and random draw of a person among those who have the
disease M”; and as realization the process universal
RSe,A,M defined as “drawing by TSe,A,M of someone who got a
positive result to A”. We can then introduce two new
relations sensitivity_disposition_of_test and
sensitivity_disposition_for_disease (abreviated as se_of_test and
se_for_disease) such that DSe,A,M se_of_test A and
DSe,A,M se_for_disease M. These two relations are introduced
for pragmatic reasons of facility of use: on a foundational
level, DSe,A,M and M (resp. A) could be related through a
complex array of relations and entities that involve the
relation has_trigger between DSe,A,M and TSe,A,M, as well as a
sequence of relations between TSe,A,M and M (resp. A). Such
an analysis would raise theoretical issues though, as
instances of DSe,A,M can exist even if no instance of M or A do exist.
We would therefore face here issues similar to the ones
addressed by Röhl &amp;
          <xref ref-type="bibr" rid="ref6">Jansen (2011)</xref>
          and
          <xref ref-type="bibr" rid="ref11">Schulz et al. (2014)</xref>
          .
        </p>
        <p>Finally, we introduce a class Sensitivity that can be
characterized as a subclass of Data item, which is related to a
disposition through the above-mentioned relations:
s instance_of Sensitivity ⇒ s instance_of Data item ∧
∃ d instance_of Disposition ∧ ∃ a instance_of Test ∧
∃ m instance_of Disease ∧ s is_about d ∧
d se_of_test a ∧ d se_for_disease m</p>
        <p>We can also introduce SeA,M, the class of sensitivities of
test A for disease M (in whatever group), which can be
formalized as a subclass of Sensitivity related to M and A
through the following relations:</p>
        <p>s instance_of SeA,M ⇒ s instance_of Sensitivity ∧
∃ d instance_of Disposition ∧ ∃ a instance_of A ∧
∃ m instance_of M ∧ s is_about d ∧ d se_of_test a ∧
d se_for_disease m</p>
        <p>Figure 1 summarizes this formalization of sensitivity
(with universals in boxes, instances in diamonds, and the
numerical value assigned by datatype properties in a circle).
Specificity, PPV and NPV can be formalized along similar
lines, as data items about dispositions related to tests and
diseases through relations that could be labeled sp_of_test,
sp_of_disease, ppv_of_test, etc.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>4 CONCLUSION</title>
      <p>We have thus provided a practically tractable formalization
of IDPs in a realist ontology, which clearly dissociates IDPs
from their estimations (which are relative to a sample and a
gold standard). It also solves the difficulty of considering
possible, non-actual conditions in a realist ontology based
on BFO.</p>
      <p>Note that IDPs raise also other theoretical issues. For
example, one may want to aggregate two sensitivity values
Se(g,A,M) and Se(g’,A,M) assigned to two different groups
g and g’ in order to reach a finer assessment of the
sensitivity in a larger group; how to do this is a question for the
meta-analyst though, not the ontologist, who is first and
foremost concerned with representational issues.</p>
      <p>This model could then be extended in three directions. A
first step would consist in formalizing the estimations of the
IDPs, and their relations to a given sample and gold
standard. Second, the relations se_of_test and se_for_disease
could be reduced to basic relations and entities already
accepted in the OBO Foundry. Third, it could be used by
ontology-based diagnostic systems that would compute
positive predictive values or negative predictive values from the
prevalence, sensitivity and specificity values; more
generally, it could be articulated with medical Bayesian networks.</p>
      <p>As it takes into account the dependence of IDPs upon the
group of people considered, it has the potential to contribute
to the development of precision medicine (Mirnezami,
Nicholson &amp; Darzi, 2012), an emerging approach that takes
into consideration patients characteristics and dispositions,
including individual variability in genes, to offer more
personalized preventive, diagnostic and therapeutic strategies.</p>
    </sec>
    <sec id="sec-8">
      <title>ACKNOWLEDGEMENTS</title>
      <p>We would like to thank the audience at several seminars, as
well as four anonymous reviewers, for their helpful
comments. Adrien Barton thanks the Japanese Society for
Promotion of Science for financial support.
Figure 1 Sensitivity of a test A for a disease M in a group g with
probability value 0.75</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burgun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Duvauferrier</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2012</year>
          )
          <article-title>Probability assignments to dispositions in ontologies</article-title>
          .
          <source>Proc. 7th Int. Conf. Form. Ontol. Inf. Syst</source>
          .
          <string-name>
            <surname>FOIS2012 (M. Donnelly</surname>
          </string-name>
          &amp; G. Guizzardi, eds.),
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          , Amsterdam: IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duvauferrier</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Burgun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Une analyse philosophique des indicateurs de performance des tests diagnostiques médicaux</article-title>
          .
          <source>Submitted manuscript.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Brenner</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>and</article-title>
          <string-name>
            <surname>Gefeller</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          (
          <year>1997</year>
          )
          <article-title>Variation of sensitivity, specificity, likelihood ratios and predictive values with disease prevalence</article-title>
          .
          <source>Stat. Med</source>
          .,
          <volume>16</volume>
          (
          <issue>9</issue>
          ),
          <fpage>981</fpage>
          -
          <lpage>991</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Brinkman</surname>
            ,
            <given-names>R. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Courtot</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Derom</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fostel</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lord</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malone</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al. (
          <year>2010</year>
          )
          <article-title>Modeling biomedical experimental processes with OBI</article-title>
          .
          <source>J. Biomed. Semant.</source>
          , 1
          <issue>Suppl 1</issue>
          ,
          <fpage>S7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Costa</surname>
            , P. C. G. da, Laskey,
            <given-names>K. B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Laskey</surname>
            ,
            <given-names>K. J.</given-names>
          </string-name>
          (
          <year>2008</year>
          )
          <article-title>PR-OWL: A Bayesian ontology language for the semantic web</article-title>
          .
          <source>In: Uncertainty Reasoning for the Semantic Web I</source>
          ,
          <fpage>88</fpage>
          -
          <lpage>107</lpage>
          , Springer.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Grains, components and mixtures in biomedical ontologies</article-title>
          .
          <source>J. Biomed. Semant.</source>
          , 2
          <issue>Suppl 4</issue>
          ,
          <fpage>S2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Ledley</surname>
            ,
            <given-names>R. S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lusted</surname>
            ,
            <given-names>L. B.</given-names>
          </string-name>
          (
          <year>1959</year>
          )
          <article-title>Reasoning foundations of medical diagnosis</article-title>
          .
          <source>Science</source>
          ,
          <volume>130</volume>
          (
          <issue>3366</issue>
          ),
          <fpage>9</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Mirnezani</surname>
            ,
            <given-names>M.R.C.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nicholson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Darzi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2012</year>
          )
          <article-title>Preparing for precision medicine</article-title>
          .
          <source>N. Engl. J. Med</source>
          .,
          <volume>366</volume>
          (
          <issue>6</issue>
          ),
          <fpage>489</fpage>
          -
          <lpage>491</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>H. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yokota</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gill</surname>
            ,
            <given-names>H. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El Rassi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McFarland</surname>
            ,
            <given-names>E. G.</given-names>
          </string-name>
          (
          <year>2005</year>
          )
          <article-title>Diagnostic accuracy of clinical tests for the different degrees of subacromial impingement syndrome</article-title>
          .
          <source>J. Bone Joint Surg</source>
          . Am.,
          <volume>87</volume>
          (
          <issue>7</issue>
          ),
          <fpage>1446</fpage>
          -
          <lpage>1455</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Röhl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Representing dispositions</article-title>
          .
          <source>J. Biomed. Semant.</source>
          ,
          <volume>2</volume>
          (
          <issue>Suppl 4</issue>
          ),
          <fpage>S4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martínez-Costa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karlsson</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cornet</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brochhausen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Rector</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2014</year>
          )
          <article-title>An Ontological Analysis of Reference in Health Record Statements</article-title>
          .
          <source>Proc. 8th Int. Conf. Form. Ontol. Inf. Syst</source>
          .
          <string-name>
            <surname>FOIS2014 (P. Garbacz</surname>
          </string-name>
          &amp; O. Kutz, eds.),
          <fpage>289</fpage>
          -
          <lpage>302</lpage>
          , Amsterdam: IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>L. J.</given-names>
          </string-name>
          , et al. (
          <year>2007</year>
          )
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat. Biotechnol</source>
          .,
          <volume>25</volume>
          (
          <issue>11</issue>
          ),
          <fpage>1251</fpage>
          -
          <lpage>1255</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>Ontological realism: A methodology for coordinated evolution of scientific ontologies</article-title>
          .
          <source>Appl</source>
          . Ontol.,
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <fpage>139</fpage>
          -
          <lpage>188</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Soldatova</surname>
            ,
            <given-names>L. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rzhetsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>De and King</surname>
            ,
            <given-names>R. D.</given-names>
          </string-name>
          (
          <year>2013</year>
          )
          <article-title>Representation of probabilistic scientific knowledge</article-title>
          .
          <source>J. Biomed. Semant.</source>
          ,
          <volume>4</volume>
          (
          <issue>Suppl 1</issue>
          ),
          <fpage>S7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>M. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masci</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hero</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>OBCS: The Ontology of Biological and Clinical Statistics in ICBO2014</article-title>
          . Houston, TX, USA
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>