<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>What is a risk? A formal representation of risk of stroke for people with atrial fibrillation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adrien Barton</string-name>
          <email>adrien.barton@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ludger Jansen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arnaud Rosier</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean-François Ethier</string-name>
          <email>ethierjf@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>GRIIS, Université de Sherbrooke</institution>
          ,
          <addr-line>Québec</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>INSERM UMR_S 1138 Eq 22, Paris Descartes University</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Ruhr University Bochum and University of Rostock</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We propose a framework for the representation of medical risks in the context of the OBO Foundry using the Web Ontology Language (OWL). The framework is developed for the use case of risk of stroke for people with atrial fibrillation, for which we distinguish three classes of dispositions: the atrial fibrillation disease; the risk of stroke for a human who has atrial fibrillation; and the risk of stroke over 12 months for a human who has atrial fibrillation. The latter is quantified by risk estimates, which are informational entities extracted from documents - such as journal articles - and to which epistemic probability values can be assigned. We discuss the reference-class problem (i.e., the possibility to have several risk estimates with different epistemic probabilities for the same individual, depending on the reference class the risk estimate is based on) and clarify the philosophical hypotheses on which this dispositional framework is based.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Risks of adverse outcomes are ubiquitous in the medical
domain and have a central importance. An older population with
complex combinations of chronic diseases and many
medications makes simple deterministic treatment decisions
difficult. Instead, clinicians need to assess, manage, and balance
risks much more explicitly than ever before. It would
therefore be valuable if ontologies aiming at adequately
representing medical knowledge could formalize such risks. This
paper contributes to this aim by proposing a framework for the
representation of risks, illustrated by the risk of stroke in
people with atrial fibrillation. More specifically, we propose a
representation of absolute risks, such as a 3.2% risk of stroke
over 12 months for people with atrial fibrillation
        <xref ref-type="bibr" rid="ref14">(Nielsen et
al., 2016)</xref>
        .
      </p>
      <p>
        This formalization is expressed in the Web Ontology
Language (OWL), in the context of t
        <xref ref-type="bibr" rid="ref7">he OBO Foundry (Smith et
al., 2007</xref>
        ). The OBO Foundry is one of the most
comprehensive collections of interoperable ontologies in the biomedical
domain, built on the upper ontology Basic Formal Ontology
(BFO) 2.0
        <xref ref-type="bibr" rid="ref1 ref19">(Arp, Smith &amp; Spear, 2015)</xref>
        . A few ontologies
have formalized the notion of medical risk; see
        <xref ref-type="bibr" rid="ref20">Uciteli et al.
(2016)</xref>
        for a recent account (though not in the context of the
OBO Foundry), as well as a review of former accounts.
However, there is currently no comprehensive account of the
notion of risk in the OBO Foundry ontologies.
      </p>
      <p>One principle of BFO is the strict separation between
universals and their instances. We write names of instances as
well as relations between instances in bold, and names of
universals and defined classes in italics. When first introduced,
names of universals will be prefixed by the name of the
source ontology (e.g., “BFO:Disposition”), unless the context
makes it obvious.</p>
      <p>
        The OBO Foundry compliant Ontology of Biological and
Clinical Statistics
        <xref ref-type="bibr" rid="ref21">(OBCS; Zheng et al., 2016)</xref>
        defines
Absolute risk as a subclass of IAO:Information content entity
(“ICE” for short). However, arguably, a person with atrial
fibrillation has a risk to get a stroke independently of whether
or not there exists some ICE estimating his risk to get a
stroke. The risk itself has rather a dispositional character: an
instance of risk of an adverse outcome of type A may be
realized by an instance of A, but it may also never be realized;
however, whether it is realized or not, the risk still exists. For
this reason, this paper formalizes risks as dispositions that can
be estimated by a specific kind of ICE, risk estimates, and the
risk probability values as assigned to these risk estimates.
      </p>
      <p>We begin by distinguishing two types of dispositions: the
disease of atrial fibrillation on the one hand, and the risk of
stroke of a human who has atrial fibrillation on the other
(Sect. 2). We then show how a probability can be assigned to
a risk of stroke in 12 months for a human with atrial
fibrillation (Sect. 3). A discussion and conclusion follow.</p>
    </sec>
    <sec id="sec-2">
      <title>2 DIFFERENTIATING RISK DISPOSITION</title>
    </sec>
    <sec id="sec-3">
      <title>AND DISEASE</title>
      <p>
        The OGMS (Ontology for General Medical Science)
considers a Disease as a BFO:Disposition
        <xref ref-type="bibr" rid="ref17">(Scheuermann, Ceusters
&amp; Smith, 2009)</xref>
        . Röhl &amp;
        <xref ref-type="bibr" rid="ref10">Jansen (2011)</xref>
        developed an
axiomatisation of dispositions in the context of BFO. In this
model, a disposition is a BFO:Dependent continuant that
inheres_in his bearer (which is the bearer_of this property), a
Material entity, and may be realized (realized_in) via a
process. The realization process has the material entity as a
participant (has_participant), and the disposition is triggered by
(has_trigger) some event or process. Finally, according to
      </p>
      <p>BFO, a disposition has_material_basis some entity. For
example, the fragility of a glass is formalized as a disposition
inhering in the glass, that may be realized by a breaking
process when some form of stress (the trigger) happens;
moreover, the fragility exists because of some molecular structure
of the glass, which is its material basis.</p>
      <p>The OGMS model considers Disease as a disposition
realized_in a Disease course that has as parts some
Pathological process. The material basis of a disease is a Disorder in
the organism. For example, the disease epilepsy is seen as a
disposition to have a disease course composed by various
epileptic crises (pathological processes), because of some
disorder in the brain.</p>
      <p>
        The OGMS model is applied to cardiovascular diseases by
the Cardiovascular Disease Ontology
        <xref ref-type="bibr" rid="ref5">(CVDO; Barton et al.,
2014)</xref>
        . It formalizes the atrial fibrillation disease Atrial
fibrillation as a disposition realized by a disease course that has as
parts some processes of atrial fibrillation:
      </p>
      <sec id="sec-3-1">
        <title>Atrial fibrillation subClassOf Disease</title>
      </sec>
      <sec id="sec-3-2">
        <title>Atrial fibrillation process subClassOf Pathological process</title>
      </sec>
      <sec id="sec-3-3">
        <title>Atrial fibrillation subClassOf (realized_in some Disease course and (has_part some Atrial fibrillation process))</title>
        <p>
          This matches to an ambiguity in the natural language
term “atrial fibrillation”: it refers sometimes to a pathological
process of atrial fibrillation (namely, irregular, uncoordinated
contractions of the atria of the heart), and sometimes to a
disease – a disposition exceeding a given threshold
          <xref ref-type="bibr" rid="ref17">(Scheuermann et al., 2009)</xref>
          to atrial fibrillation processes.
        </p>
        <p>Consider a human Jones, who has an atrial fibrillation
disease (“AF” for short) afJones – an instance of the universal
Atrial fibrillation. Jones is an instance of HumanAF, the class
of humans with atrial fibrillation:</p>
        <p>HumanAF equivalentClass Human and (bearer_of some</p>
      </sec>
      <sec id="sec-3-4">
        <title>Atrial fibrillation)</title>
        <p>Suppose that afJones leads to the process instance
strokeJones (an instance of the universal Stroke) through the
following scenario. There is some fibrosis in Jones’ atrial
myocardium (the disorder atrium_fibrosisJones), which is the
material basis of the disposition afJones:
afJones inheres_in Jones
afJones has_material_basis atrium_fibrosisJones
afJones is realized by a (long) disease course af_courseJones,
that encompasses various pathological processes, including
several episodes of atrial fibrillation (ppaf,1,...,ppaf,n):
afJones realized_in af_courseJones</p>
        <sec id="sec-3-4-1">
          <title>For every iÎ[1,n]: af_courseJones has_part ppaf,i</title>
          <p>These pathological processes lead to the development of a
blood clot in Jones’ atrium, which is a new disorder. This
blood clot is the bearer of a disposition to dislodge and
migrate, which is at some point realized by the process of the
blood clot migrating to the brain. The migrating clot has then
a disposition to get stuck in a cerebral artery - by contrast to
dissolve. When it gets stuck, the blood flow is blocked and
strokeJones happens.</p>
          <p>On top of the disposition afJones, Jones is also the bearer of
another disposition: the risk of stroke riskJones,Stroke, which is
realized by his stroke:
riskJones,Stroke inheres_in Jones
riskJones,Stroke realized_in strokeJones
riskJones,Stroke is an instance of RiskAF,Stroke, the class of risks
of stroke for humans with atrial fibrillation, which is itself a
subclass of Risk:</p>
        </sec>
      </sec>
      <sec id="sec-3-5">
        <title>Risk subClassOf Disposition</title>
      </sec>
      <sec id="sec-3-6">
        <title>RiskAF,Stroke subClassOf Risk</title>
      </sec>
      <sec id="sec-3-7">
        <title>RiskAF,Stroke subClassOf (inheres_in some HumanAF)</title>
      </sec>
      <sec id="sec-3-8">
        <title>HumanAF subClassOf (bearer_of some RiskAF,Stroke)</title>
      </sec>
      <sec id="sec-3-9">
        <title>RiskAF,Stroke subClassOf (realized_in only Stroke)</title>
        <p>
          An instance of RiskAF,Stroke may be realized by one or
several instance(s) of stroke, or may remain unrealized. To
clarify the triggers of RiskAF,Stroke, we need to use BFO:History.
BFO defines the history of a material entity as the “process
that is the sum of the totality of processes taking place in the
spatiotemporal region occupied by a material entity or site,
including processes on the surface of the entity or within the
cavities to which it serves as host”
          <xref ref-type="bibr" rid="ref1 ref19">(Arp, Smith and Spear,
2015)</xref>
          . We define History-part as the class of temporal parts
of the history of any material object:
        </p>
        <p>History-part equivalentClass (part_of some History)
In our formalization, RiskAF,Stroke is triggered by any
History-part of its bearer:</p>
      </sec>
      <sec id="sec-3-10">
        <title>RiskAF,Stroke subClassOf (has_trigger some History-part)</title>
        <p>
          This way, risks of stroke are dispositions that are always
triggered. However, RiskAF,Stroke is not a sure-fire disposition
(that is, a disposition that is always realized when triggered),
but a tendency (that is, a disposition that is not always
realized w
          <xref ref-type="bibr" rid="ref7">hen triggered; Jansen 2007</xref>
          ; Röhl &amp; Jansen, 2011).
        </p>
        <p>The material basis of riskJones,Stroke is a disorder that has as
part Jones’ fibrosis, but also other entities. As a matter of fact,
Jones can have a stroke because of his atrial fibrillation, but
also because of various random or progressive factors, such
as the regular senescence of his blood vessels.
riskJones,Stroke has_material_basis some (Disorder and
has_part atrium_fibrosisJones and has_part
senescent_blood_vesselsJones)</p>
        <p>Thus, the disease afJones and the risk riskJones,Stroke are not
the same disposition: even if they both inhere in Jones, they
have distinct material basis and distinct realizations – namely,
af_courseJones vs. strokeJones. (The OGMS model leaves
open the question whether pathological processes that are
caused - or partially caused - by earlier pathological process
of a disease course are also part of this disease course; thus,
it is an open question whether strokeJones is a part of
af_courseJones; see Barton et al. 2014 for a discussion. In any
case, those two instances are distinct entities.)</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3 PROBABILITY ASSIGNMENTS TO RISK</title>
    </sec>
    <sec id="sec-5">
      <title>DISPOSITIONS</title>
      <p>We can now turn to the representation of a probability
assignment to a risk of stroke. For this purpose, we first discuss the
entity characterized by a probability assignment, then discuss
the nature of probabilities at play, and finally formalize the
probability assignment to a risk estimate.
3.1</p>
      <p>
        What kind of entity do risk probabilities
characterize?
        <xref ref-type="bibr" rid="ref14">Nielsen et al. (2016)</xref>
        studied a nationwide cohort of patients
for which the overall ischemic stroke rate was 3.20 per 100
person-years. However, this value 3.2% does not relate only
to a proportion in this cohort: we can infer from this
information that an unspecified human from the same population
with atrial fibrillation has a 3.2% probability to have a stroke
over 12 months – even, of course, if he was not one of the
patients in the cohort. Therefore, as we will now argue, the
probability 3.2% also characterizes a certain property of
HumanAF: their risk to have a stroke over 12 months.
      </p>
      <p>
        Dispositions are natural targets for probability
assignments
        <xref ref-type="bibr" rid="ref2">(Barton et al. 2012)</xref>
        . However, we cannot assign the
probability 3.2% to the disposition RiskAF,Stroke. As a matter of
fact, 3.2% is the probability of a human with AF to have a
stroke over 12 months – but RiskAF,Stroke does not have any
ontological connection with 12-months-long processes.
      </p>
      <p>To solve this issue, let’s define History-part12m as the
subclass of History-part with a 12 months-long duration. Let’s
now introduce RiskAF,12m,Stroke the class of risks of a human
with AF to get a stroke over 12 months, that we also formalize
as a Risk – and therefore, a disposition:</p>
      <sec id="sec-5-1">
        <title>RiskAF,12m,Stroke subClassOf Risk</title>
      </sec>
      <sec id="sec-5-2">
        <title>Like with RiskAF,Stroke, there is an instance of RiskAF,12m,Stroke</title>
        <p>inhering in any person with AF; and those instances can only
be realized by a stroke:</p>
      </sec>
      <sec id="sec-5-3">
        <title>HumanAF subClassOf (bearer_of some RiskAF,12m,Stroke)</title>
      </sec>
      <sec id="sec-5-4">
        <title>RiskAF,12m,Stroke subClassOf (realized_in only Stroke)</title>
        <p>By contrast to RiskAF,Stroke which is triggered by all
historyparts of its bearer, RiskAF,12m,Stroke is only triggered by all
12months-long history-parts of its bearer:</p>
      </sec>
      <sec id="sec-5-5">
        <title>RiskAF,12m,Stroke subClassOf (has_trigger some</title>
      </sec>
      <sec id="sec-5-6">
        <title>History-part12m)</title>
        <p>In order to determine how a probability can characterize
RiskAF,12m,Stroke, we need to clarify the ontological status of
probabilities.
3.2</p>
        <p>
          What are the probabilities?
Standardly, objective and epistemic interpretations of
probabilities are distinguis
          <xref ref-type="bibr" rid="ref8">hed (Hájek, 2012</xref>
          ). Objective
probabilities are meant to characterize the world independently of our
knowledge of it, while epistemic interpretations consider
probabilities to describe our knowledge of the world:
epistemic probabilities can be defined as degrees of belief or
degrees of confidence.
        </p>
        <p>
          Consider for example a biased coin that has three times
more chances to fall on heads than on tails. The objective
probability of the coin falling on heads is ¾, and its objective
probability to fall on tails is ¼. If Mr. Green knows about the
coin’s bias, he should assign epistemic probabilities with the
same values: ¾ to heads and ¼ to tails; this is a consequence
of a principle of rationality called the “principal principle”
          <xref ref-type="bibr" rid="ref11">(Lewis, 1980)</xref>
          . However, if Mr. White is not aware of this
bias and thinks that the coin is balanced, he would assign
epistemic probabilities ½ to heads and ½ to tails. Epistemic
probabilities can be operationalized as rational, hypothetical
betting coefficients – that is, coefficients indicating which
odds should be considered as acceptable by the agent to bet
on the occurrence of heads or tails
          <xref ref-type="bibr" rid="ref13">(Maher, 1997)</xref>
          .
        </p>
        <p>Suppose that the 3.2% value would be an objective
probability that could be assigned to the disposition RiskAF,12m,Stroke.
We would then have:</p>
        <p>RiskAF,12m,Stroke subClassOf (has_objective_probability
0.032)
This
would imply that for every
r instance_of</p>
      </sec>
      <sec id="sec-5-7">
        <title>RiskAF,12m,Stroke:</title>
        <p>r has_objective_probability 0.032</p>
        <p>
          Thus, all people with AF would have an objective
probability 3.2% to have a stroke over 12 months. However, this
cannot be the case: many people with AF have a lower or
higher objective probability to have a stroke over 12 months,
depending on various factors (see below the section 4.2 on
CHADS2 and CHADSVASC scores). Therefore, we would
rather interpret 3.2% as an epistemic probability that
characterizes the rational degree of confidence, given the evidence
provided by
          <xref ref-type="bibr" rid="ref14">Nielsen et al. (2016)</xref>
          , that a person with AF
would have a stroke over 12 months. We will now propose a
formalization along those lines.
        </p>
        <p>
          Epistemic probability assignment
We define the relation object_of as the inverse of
IAO:is_about (Ceusters &amp; Smith, 2010), which relates an
ICE to what it is about. We introduce the class Risk estimate
as a subclass of ICE. Let
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016) be the following instance of Risk estimate: the
estimate of the risk of a human with atrial fibrillation to have
a stroke over 12 months, extracted from the article
          <xref ref-type="bibr" rid="ref14">Nielsen et
al. (2016)</xref>
          1:
        </p>
      </sec>
      <sec id="sec-5-8">
        <title>RiskAF,12m,Stroke subClassOf (object_of</title>
        <p>risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016))</p>
        <p>To represent the assignment of the probability 3.2% to this
risk estimate, we use two OBI relations (which are currently
being formalized by the OBI development team), the object
property has_value_specification (that relates an
information content entity to an OBI:Value specification) and the
datatype property has_specified_value (that relates a value
specification to its numerical value). As a shortcut, let’s
introduce here the datatype property has_value defined as
has_value_specification o has_specified_value. We can
then write:
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016)
has_value 0.032</p>
        <p>Informally, if R is a risk and re is a risk estimate, (R
subClassOf object_of re) and (re has_value p) mean together
that according to the risk estimate re, it is rational to assign
an epistemic probability p to the risk R.</p>
        <p>The evidence for the estimate is documented in
Nielsen_et_al._(2016), an instance of IAO:Journal article
(which is, in turn, a subclass of IAO:Document). To formally
relate this journal article with
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016), we introduce a relation extracted_from, whose
domain is Information content entity and whose range is
Document. If r is a risk estimate and j is a document, r
extracted_from j implies that j participates in a IAO:Planned
process whose specified output is r:
extracted_from subRelationOf
(is_specified_output_of o has_participant)
Then, we can state:
risk_estimateAF,12m,Stroke, Nielsen_et_al._(2016) extracted_from
Nielsen_et_al._(2016)</p>
        <p>
          Altogether, the relations we have introduced here and in
section 3.1 mean that a risk estimate is extracted from
          <xref ref-type="bibr" rid="ref14">Nielsen
et al. (2016)</xref>
          , according to which it is rational to assign a 3.2%
epistemic probability to the risk of stroke over 12 months for
1 As in OWL the term “value” is used in a class restriction to introduce an
individual after an object property, this could more specifically be written as
a person who has AF (in the absence of additional
information).
4
4.1
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>DISCUSSION</title>
      <p>
        The reference class problem
As mentioned in section 3.2, all patients with atrial
fibrillation do not have the same objective probability of stroke. The
CHADS2 score (congestive heart failure, hypertension, age ³
75 years, diabetes mellitus, stroke) is a tool that has been
developed to predict the risk of stroke in patients with atrial
fibrillation by stratifying patients into risk groups
        <xref ref-type="bibr" rid="ref6">(Gage et al.,
2001)</xref>
        . It was later expanded into the CHA2DS2-VASc score
        <xref ref-type="bibr" rid="ref12">(Lip et al., 2010; written “CHADSVASC” from now on)</xref>
        ,
which includes three additional risk factors: vascular disease,
age 65-74 years, and female sex.
      </p>
      <p>Let HumanAF2 be the class of humans with atrial
fibrillation and a CHADSVASC score of 2 (“AF2” for short). We
can introduce a class of dispositions RiskAF2,12m,Stroke, the risk
of stroke over 12 months for people with AF2:</p>
      <sec id="sec-6-1">
        <title>RiskAF2,12m,Stroke equivalentClass (RiskAF,12m,Stroke and inheres_in some HumanAF2)</title>
        <p>Nielsen et al. (2016) state that the rate of stroke over 12
months among patients in the sample who had AF2 was
1.97%. Therefore, there is an instance
risk_estimateAF2,12m,Stroke,Nielsen_et_al._(2016) such that:</p>
      </sec>
      <sec id="sec-6-2">
        <title>RiskAF2,12m,Stroke subClassOf object_of</title>
        <p>risk_estimateAF2,12m,Stroke,Nielsen_et_al._(2016)
risk_estimateAF2,12m,Stroke,Nielsen_et_al._(2016)
has_value 0.0197</p>
        <p>Suppose that Jones has AF2. Jones is the bearer of the risk
to get a stroke over 12 months riskJones,12m,Stroke. Since
Jones instance_of HumanAF2, his risk to get a stroke over 12
months is an instance of the class of risks to get a stroke over
12 months for people with AF2:
riskJones,12m,Stroke instance_of RiskAF2,12m,Stroke
and therefore:
riskJones,12m,Stroke object_of
risk_estimateAF2,12m,Stroke,Nielsen_et_al._(2016)</p>
        <p>Moreover, since HumanAF2 subClassOf HumanAF,
Jones instance_of HumanAF. Therefore, his risk to get a
stroke over 12 months is an instance of the class of risks to
get a stroke over 12 months for people with AF:
riskJones,12m,Stroke instance_of RiskAF,12m,Stroke
and therefore:
RiskAF,12m,Stroke subClassOf (object_of value
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016))
riskJones,12m,Stroke object_of
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016)</p>
        <p>
          Thus, riskJones,12m,Stroke is the object of two different
estimates with two different probability values (0.032 and
0.0197), based on two different reference classes (AF or
AF2): this is the reference class problem (
          <xref ref-type="bibr" rid="ref7">Hájek, 2007</xref>
          ). This
is ontologically sound: if Dr. Khan only knows that Jones has
AF, it is rational for him, based on
          <xref ref-type="bibr" rid="ref14">Nielsen et al. (2016)</xref>
          , to
assign a probability 3.2% to the risk that Jones will have a
stroke over 12 months; and if Dr. Patel knows in addition that
Jones has a CHADSVASC score of 2, then it is rational for
him, based on
          <xref ref-type="bibr" rid="ref14">Nielsen et al. (2016)</xref>
          , to assign a probability of
1.97% to this risk.
        </p>
        <p>However, this raises practical difficulties. It might seem at
first sight rational, for a computer system who has the
information that Jones has a CHADSVASC score of 2, to always
give precedence to the 1.97% risk estimation over the 3.2%
estimation, as it is based on more specific factors. However,
other criteria may matter. For example, if both values had
been obtained from different studies, the 3.2% could be
considered as a more reliable value for other reasons – such as a
smaller 95% confidence interval.</p>
        <p>Moreover, different articles relating about different
cohorts or samples might give risk estimates with different
values of the same risk class. They might give also risk estimates
for risk classes that are not included into each others. Suppose
that Jones has atrial fibrillation and is a smoker, and that we
know two data from two different cohorts: the probability pAF
that someone with atrial fibrillation will have a stroke during
ten years; and the probability pSmoker that a smoker will have
a stroke during ten years. There is no easy way to decide
which epistemic probability is the best to estimate Jones’ risk,
or how they should be weighted in a common probability
estimate. Note however that this is a classical issue for
probabilistic reasoning, independent of the ontological
representation chosen here.
4.2</p>
        <p>Articulating objective and epistemic
probabilities
We have seen earlier that we could not formalize in OWL
3.2% as an objective probability assigned to RiskAF,12m,Stroke,
as it would imply that every instance of this risk
(riskJones,12m,Stroke, riskHubbard,12m,Stroke, etc.) would have the
same objective probability – which is false.</p>
        <p>An alternative reading would be to interpret the objective
probability 3.2% in line of Barton, Burgun &amp; Duvauferrier
(2012) as assigned only to the universal RiskAF,12m,Stroke, but
not to its instances – a conception that is not straightforwardly
implementable in OWL. Informally, this assignment would
be elucidated as follows: in a hypothetical, representative
sequence seq0 of instances of RiskAF,12m,Stroke inhering in
hypothetical instances of HumanAF, the proportion of those risks
who are realized – that is, the proportion of those humans
who have a stroke over 12 months – tends towards 0.032 as
the size of the sequence tends towards infinity.</p>
        <p>
          This conception raises the issue of what it means to have a
representative sequence of hypothetical instances of
RiskAF,12m,Stroke inhering in hypothetical instances of HumanAF.
Indeed, several factors can influence the risk of having a
stroke – in particular those involved in the CHADSVASC
score: hypertension, age ³ 75 years, etc. It makes sense to
speak of a representative sequence of instances only by
reference to an actual population pop0: to be representative of
pop0, the sequence seq0 should involve the same proportions
as in pop0 of people with hypertension, of people older than
75 years, etc. But this implies that the probability 0.032
would characterize the actual population pop0
          <xref ref-type="bibr" rid="ref10 ref15">(that is, a
collection of human particulars – cf. Jansen &amp; Schulz, 2011)</xref>
          rather than a subclass of Human. This is a possible orientation,
pursed by Barton, Ethier, Duvauferrier &amp; Burgun (2017) to
formalize indicators of diagnostic performance.
        </p>
        <p>This article has used an alternative interpretation of
probabilities as epistemic in nature. This interpretation makes the
formalization simpler in the present context, as it enables to
relate a probability estimate to a universal of human. Future
work will need to discuss further the articulation between the
objective and epistemic probability views, and compare the
strengths of each.
4.3</p>
        <p>Generalization of this formalization
Note that this formalization can be adapted to represent a risk
during a process that is not characterized by its duration (such
as 12 months), but by some other characteristics. Imagine for
example that we want to represent the probability p that a
human with AF would have a stroke during a hospitalization
process; we would then introduce the risk
riskJones,Hospitalization,Stroke that Jones would have a stroke during a
hospitalization process, and assign the probability p to its risk estimate
(the class of triggers of this risk would be the class of
historyparts of Jones that temporally span any of his hospitalization
process).
5</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>CONCLUSION</title>
      <p>This article has shown how a specific risk and its various
probability estimates could be formalized in the context of
the OBO Foundry. We took the example of the risk of stroke
for people with AF over 12 months RiskAF,12m,Stroke, which was
formalized as a disposition. The article introduced
risk_estimateAF,12m,Stroke,Nielsen_et_al._(2016), related (by the relation
extracted_from) to the instance of Journal article
Nielsen_et_al._(2016). It was also related (by the relation
is_about) to the risk RiskAF,12m,Stroke, which was itself related
to the following relevant classes: humans with atrial
fibrillation HumanAF (by the relation inheres_in); 12-months-long
history-parts History-part12m (by the relation has_trigger);
and Stroke (by the relation realized_in).</p>
      <p>
        This representation of risk of stroke for patients with atrial
fibrillation could also be used to stratify patients into risk
groups by computing their CHADSVASC score, using e.g.
SWRL rules
        <xref ref-type="bibr" rid="ref16">(Rosier, 2015)</xref>
        . This would also provide formal
definitions of classes HumanAF1, HumanAF2, etc.
      </p>
      <p>
        This paper has shown how a specific example of
probability assignment to a risk – the risk of stroke over 12 months of
a patient with atrial fibrillation – could be formalized. Future
work will need to systematize, using OBO-Foundry relations,
the relations involving the classes Risk or Risk estimate.
Elaborating on the work of
        <xref ref-type="bibr" rid="ref4">Barton &amp; Jansen (2016)</xref>
        , a relation of
disposition-parthood could also be introduced to represent
the connection between the risk of a human with AF to have
a stroke (RiskAF,Stroke) and the risk of a human with AF to have
a stroke over 12 months (RiskAF,12m,Stroke).
      </p>
      <p>The formalization presented in this paper relies on two
hypotheses. The first is that for every class of material objects
O, and every classes of processes T and R, there exists a class
of dispositions D that inheres in O, has T as maximally
specified class of triggers and R as maximally specified class of
realizations; and O, T and R together constitute the conditions
of identity of this disposition. This hypothesis was used to
define RiskAF,12m,Stroke (from the classes HumanAF,
Historypart12m and Stroke) as a class different from RiskAF,12m, which
has a different maximally specified class of triggers
(Historypart). The second hypothesis is that a realization r of a
disposition d can happen during a trigger t – not necessarily just
after the trigger ended. Thus, riskJones,12m,Stroke could be
realized during a 12-months-long history part that acted as a
trigger. Finally, this formalization raises the wider philosophical
issue whether the risk of stroke RiskHuman,Stroke could be
classified as a kind of disease, given OGMS definition of disease.</p>
    </sec>
    <sec id="sec-8">
      <title>ACKNOWLEDGEMENTS</title>
      <p>We thank three anonymous reviewers for their feedback. AB
acknowledges financial support by the “bourse de fellowship
du département de médecine de l’université de Sherbrooke”
and the “CIHR funded Quebec SPOR Support Unit”.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Arp</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Spear</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Building Ontologies with Basic Formal Ontology</article-title>
          . Mit Press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burgun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Duvauferrier</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2012</year>
          )
          <article-title>Probability assignments to dispositions in ontologies</article-title>
          . In Donnelly,
          <string-name>
            <given-names>M.</given-names>
            &amp;
            <surname>Guizzardi</surname>
          </string-name>
          ,
          <string-name>
            <surname>G</surname>
          </string-name>
          . (eds),
          <source>Proceedings of the 7th International Conference on Formal Ontology in Information Systems (FOIS2012)</source>
          . IOS Press, Amsterdam, pp.
          <fpage>3</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ethier</surname>
            ,
            <given-names>J.-F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duvauferrier</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Burgun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          )
          <article-title>An ontological analysis of medical Bayesian indicators of performance</article-title>
          .
          <source>J. Biomed. Semant., 8</source>
          ,
          <fpage>1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>A modelling pattern for multi-track dispositions for life-science ontologies</article-title>
          . In Loebe, F.,
          <string-name>
            <surname>Boeker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herre</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Schober</surname>
            ,
            <given-names>D</given-names>
          </string-name>
          . (eds),
          <source>ODLS 2016 - Ontologies and Data in Life Sciences, CEUR Workshop Proceedings</source>
          . p.
          <source>H.1-2.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosier</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burgun</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ethier</surname>
            ,
            <given-names>J.-F.</given-names>
          </string-name>
          (
          <year>2014</year>
          )
          <article-title>The Cardiovascular Disease Ontology</article-title>
          . In Garbacz,
          <string-name>
            <given-names>P.</given-names>
            &amp;
            <surname>Kutz</surname>
          </string-name>
          ,
          <string-name>
            <surname>O</surname>
          </string-name>
          . (eds),
          <source>Proceedings of the 8th International Conference on Formal Ontology in Information Systems (FOIS</source>
          <year>2014</year>
          ). IOS Press, Amsterdam, pp.
          <fpage>409</fpage>
          -
          <lpage>414</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Gage</surname>
            ,
            <given-names>B.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waterman</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shannon</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boechler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rich</surname>
            ,
            <given-names>M.W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Radford</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          (
          <year>2001</year>
          )
          <article-title>Validation of clinical classification schemes for predicting stroke: results from the National Registry of Atrial Fibrillation</article-title>
          . Jama,
          <volume>285</volume>
          ,
          <fpage>2864</fpage>
          -
          <lpage>2870</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Hájek</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2007</year>
          )
          <article-title>The reference class problem is your problem too</article-title>
          .
          <source>Synthese</source>
          ,
          <volume>156</volume>
          ,
          <fpage>563</fpage>
          -
          <lpage>585</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Hájek</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2012</year>
          )
          <article-title>Interpretations of Probability</article-title>
          . In
          <string-name>
            <surname>Zalta</surname>
          </string-name>
          , E.N. (ed),
          <source>The Stanford Encyclopedia of Philosophy</source>
          ,
          <year>Winter 2012</year>
          . edn.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2007</year>
          )
          <article-title>Tendencies and other realizables in medical information sciences</article-title>
          .
          <source>The Monist</source>
          ,
          <volume>90</volume>
          ,
          <fpage>534</fpage>
          -
          <lpage>554</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Grains, components and mixtures in biomedical ontologies</article-title>
          .
          <source>J. Biomed. Semant.</source>
          ,
          <volume>2</volume>
          ,
          <fpage>S2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>1980</year>
          )
          <article-title>A Subjectivist's Guide to Objective Chance</article-title>
          .
          <source>In Studies in Inductive Logic and Probability</source>
          . University of California Press, pp.
          <fpage>83</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Lip</surname>
            ,
            <given-names>G.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nieuwlaat</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pisters</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lane</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Crijns</surname>
            ,
            <given-names>H.J.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based approach: the euro heart survey on atrial fibrillation</article-title>
          .
          <source>Chest</source>
          ,
          <volume>137</volume>
          ,
          <fpage>263</fpage>
          -
          <lpage>272</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Maher</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>1997</year>
          )
          <article-title>Depragmatized Dutch book arguments</article-title>
          .
          <source>Philos. Sci.</source>
          ,
          <volume>64</volume>
          ,
          <fpage>291</fpage>
          -
          <lpage>305</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Nielsen</surname>
            ,
            <given-names>P.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Larsen</surname>
            ,
            <given-names>T.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skjøth</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Overvad</surname>
            ,
            <given-names>T.F.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lip</surname>
            ,
            <given-names>G.Y.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>Stroke and thromboembolic event rates in atrial fibrillation according to different guideline treatment thresholds: a nationwide cohort study</article-title>
          .
          <source>Sci. Rep</source>
          .,
          <volume>6</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Röhl</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Representing dispositions</article-title>
          .
          <source>J. Biomed. Semant.</source>
          ,
          <volume>2</volume>
          ,
          <fpage>S4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Rosier</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Raisonnement automatique basé ontologies appliqué à la hiérarchisation des alertes en télécardiologie</article-title>
          .
          <source>(doctoral dissertation, Rennes</source>
          <volume>1</volume>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Scheuermann</surname>
            ,
            <given-names>R.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2009</year>
          )
          <article-title>Toward an ontological treatment of disease and diagnosis</article-title>
          .
          <source>In Proceedings of the 2009 AMIA Summit on Translational Bioinformatics</source>
          . San Francisco CA, pp.
          <fpage>116</fpage>
          -
          <lpage>120</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , … &amp;
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2007</year>
          )
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nat Biotech</source>
          ,
          <volume>25</volume>
          ,
          <fpage>1251</fpage>
          -
          <lpage>1255</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Aboutness: Towards foundations for the information artifact ontology</article-title>
          .
          <source>In Proceedings of the 6th International Conference on Biomedical Ontology. Presented at the ICBO</source>
          <year>2015</year>
          , CEUR Workshop Proceedings, Lisbon, Portugal, pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Uciteli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tahar</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saleh</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stucke</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , … Herre,
          <string-name>
            <surname>H.</surname>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>Risk Identification Ontology (RIO): An ontology for specification and identification of perioperative risks</article-title>
          . In Loebe, F.,
          <string-name>
            <surname>Boeker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herre</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jansen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Schober</surname>
            ,
            <given-names>D</given-names>
          </string-name>
          . (eds),
          <source>ODLS 2016 - Ontologies and Data in Life Sciences, CEUR Workshop Proceedings</source>
          . p.
          <source>G.1-7.</source>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masci</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hero</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis</article-title>
          .
          <source>J. Biomed. Semant.</source>
          ,
          <volume>7</volume>
          ,
          <fpage>53</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>