<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Abductive Reasoning with Description Logics: Use Case in Medical Diagnosis</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Comenius University in Bratislava</institution>
          ,
          <addr-line>Mlynská dolina, 84248 Bratislava</addr-line>
          ,
          <country country="SK">Slovakia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontologies have been increasingly used as a core representation formalism in medical information systems. Diagnosis is one of the highly relevant reasoning problems in this domain. In recent years this problem has captured attention also in the description logics community and various proposals on formalising abductive reasoning problems and their computational support appeared. In this paper, we focus on a practical diagnostic problem from a medical domain the diagnosis of diabetes mellitus - and we try to formalize it in DL in such a way that the expected diagnoses are abductively derived. Our aim in this work is to analyze abductive reasoning in DL from a practical perspective, considering more complex cases than trivial examples typically considered by the theory- or algorithm-centered literature, and to evaluate the expressivity as well as the particular formulation of the abductive reasoning problem needed to capture medical diagnosis.</p>
      </abstract>
      <kwd-group>
        <kwd>Diagnosis</kwd>
        <kwd>abduction</kwd>
        <kwd>use case</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Abduction, originally introduced by Peirce [15], is a form of backward reasoning,
typically with a diagnostic rationale. We have a knowledge base K that is supposed to
model some problem, and we have an observation O which is supposed to follow in
situations captured by K , but we are not able to explain O deductively, i.e., K 6j= O. In
abductive reasoning we ask the question – why is it that O does not follow from K , and
we look for a hypothesis (or, explanation) H such that, if added to K , then O will follow
from the resulting knowledge base. Moreover, we most typically look for explanations
consisting of extensional rather than intensional knowledge, i.e., some set of ground
facts that will, together with K , explain O.</p>
      <p>Abduction only recently captured the researchers’ interest also in the area of
ontologies and DL [7], where it also has some interesting applications, including possible
explanations of incomplete modelling or incomplete matching [4], monitoring
malfunctions in complex systems [11], and multimedia interpretation [16], among others.</p>
      <p>The problem of diagnosis, often shown as a classic example of abductive reasoning
[7,9], is highly relevant in the medical domain. Not only for primary diagnosis of a
certain disease (as the model example in this use case), but also in emerging applications
such as telemedical monitoring systems and ambient assisted living, where the patient’s
condition is continually monitored and diagnosed for anomalies, therapy adherence,
etc. Most of these applications nowadays heavily rely on ontologies (i.e., DL-based
knowledge bases) that have been increasingly used as core representation formalism
for clinical knowledge. While abduction over DL has been studied especially from the
theoretical and from the algorithmic perspective, we are not aware of any case studies
focusing on the practical aspects of modelling problems for abductive reasoning.</p>
      <p>In this paper, we focus on a practical diagnostic problem from a medical domain:
the diagnosis of diabetes mellitus. Based on information from clinical guidelines and
other relevant sources (e.g., [1,2]) we formalize it in DL in such a way that the
expected diagnoses are abductively derived. While we simplify the problem for reasons
of conciseness, we do abstract a number of distinct, less or more problematic cases that
need to be addressed, including: (a) dealing with the hierarchy of symptoms and
possible diagnoses, (b) di erential and elimination diagnosis, (c) associated conditions with
similar symptoms, (d) distinguishing and reporting complications, and some more.</p>
      <p>In the analysis that follows, we evaluate the modelled examples from the perspective
of which particular variant of abduction is being addressed, what DL expressivity is
needed, and we highlight the most important modelling issues that we run into.</p>
      <p>In the end, we learned the following lessons: medical diagnosis especially requires
ABox abduction, as hypothesizing the intensional knowledge in this domain is
typically not desired – that is the area of domain experts. We were mostly able to model our
simplified examples with the rather less expressive DL ALC for which abductive
reasoning is available [9,12,13]. Though, examples requiring more complex constructs can
also be found. Finally, modelling diagnostic knowledge bases with DL is di erent from
modelling typical ontologies. In order to get the desired explanations the statements
often need to be formulated more strongly, so that the desired observations follow. Also,
to compare the generated hypotheses, at least part of the knowledge is used also
deductively. Combining abductive and deductive reasoning within one knowledge base poses
some di culties, even on simplistic examples.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Abductive Reasoning with DL</title>
      <p>The basic abductive framework for DL was introduced by Elsenbroich et al. [7] who
proposed formulations for a number of distinct abductive problems. The main three
types are summarized below. We will assume that the reader is already familiar with
DL, the split of the knowledge base into the TBox (intensional knowledge) and the
ABox (extensional knowledge), basic syntax, and semantics [3].</p>
      <p>Definition 1 (Abduction problems in DL [7]). An abduction problem is a pair P =
(K ; O) such that K is a knowledge base in DL, and O a TBox or ABox assertion. A
solution of P is any finite set H of TBox and ABox assertions such that K [ H is
consistent and K [ H j= O. In addition, P is called
– TBox abduction problem: if H is a set of TBox assertions and O is a TBox
assertion.
– ABox abduction problem: if H is a set of ABox assertions and O is an ABox
assertion.
– Knowledge base abduction problem: the general problem, i.e., if there are no
restrictions on H and O.</p>
      <p>The definition gives a generic framework for abduction, but it is not very useful
without further constraining the possible hypotheses. The number of possible
explanations is very high, even infinite, therefore we need to be able to compare them and select
the most preferred ones. The commonly used restrictions include [7]:
Definition 2. Given an abduction problem P = (K ; O) and hypotheses H; H0, we say
that:
1. H is consistent if H [ K 6j= ?, i.e. H is consistent w.r.t. K
2. H is relevant if H 6j= O, i.e. H does not entail O
3. H is explanatory if K 6j= O, i.e. K does not entail O
4. H is stronger than H0 (H K H0) if K [ H entails H0 (and vice-versa H is weaker
than H0 if K [ H0 entails H)
5. minimal if for every H0, H0 K H, i.e., H is weaker than any other H0</p>
      <p>In general, H is a preferred solution if it is consistent, relevant, explanatory and
there is no strictly weaker solution H0 (i.e., such that H K H0 and H0 K H). If there is
single such solution, it is called the most preferred. Such hypotheses are (semantically)
minimal. Minimality is important, because if there is a (strictly) weaker hypothesis than
H it means that H hypothesizes too much. As abduction amounts to guessing, in a sense
we do not want to guess more than necessary in order to derive the observation.</p>
      <p>Consistency is required, because from inconsistency (K [ H j= ?) one is able to
derive everything. Such hypotheses would explain every observation and so it is not
meaningful to find solutions not consistent with the knowledge base K . The knowledge
base K represents the background theory from which we are interested to derive the
hypotheses. Therefore we should not be able to explain the observation without it. Such
hypotheses (i.e., when H j= O) are therefore irrelevant. Similarly, an abduction problem
only needs explaining the observation does not already follow from K .</p>
      <p>The diagnostic problems in the medical domain, which is our interest in this paper,
will most often call for ABox abduction. This is because our aim is not to enrich the
knowledge base with new axioms; these are typically su ciently described by domain
experts. Therefore our purpose is to build a knowledge base formalizing the available
expert’s knowledge (TBox) and use it to find explanations in form of facts (ABox
assertions) to any observation, which is typically also a fact (ABox assertion).</p>
      <p>A number of researchers addressed the computational solution for abduction
problems. We will focus on those who addressed ABox abduction. Klarman et al. [12]
proposed an algorithm for ABox abduction on top of ALC based on resolution. They show
that it is sound and complete. Halland and Britz [9] developed, also for ALC, a method
based on the DL tableau algorithm. Ma et al. [13] also rely on th DL tableau algorithm,
but extend the approach towards ALCI. Completeness was not shown by Ma et al.,
while Halland and Britz explicitly note that their approach is incomplete.</p>
      <p>Du et al. [5] solve abduction reasoning in DL via a reduction to logic programming,
where abduction has been extensively studied [6]. In logic programming, abductive
explanations are not arbitrary, but are typically drawn from a set of distinctive literals
called abducibles. This is because the user is often able to charaterzie in which part
of the knowledge the hypothesis is expected, and thus to reduce the search space. To
transfer this notion to the area of DL, Du et al. introduced a new variant of the ABox
abduction problem, which we will call simple, in the form of P(K ; A; O). Besides for
the knowledge base K and the observation O (set of concept assertions or atomic roles
assertions), it adds the abducibles A – a set of atomic concepts and atomic roles. An
abductive solution H for P = (K ; A; O) is a minimal set of ABox axioms composed of
individuals of K and concepts or roles of A, such that K [ H j= O. The solution should
be relevant, and consistent. The simple ABox abduction problem may be solved by
reduction to logic programming, and consecutive evalaution by a readymade reasoning
engine for logic programming.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Diabetes Mellitus Use Case</title>
      <p>of the assumed simplification</p>
      <p>Diabetes mellitus (DM) is a group of metabolic diseases characterized by
hyperglycemia resulting from defects in insulin secretion, insulin action, or both. The chronic
hyperglycemia of diabetes is associated with long-term damage, dysfunction, and
failure of various organs, especially the eyes, kidneys, nerves, heart, and blood vessels
[2]. We chose DM for our use case, as its diagnosis is a complex problem, with need
to distinguish between particular subtypes and associated conditions, identification of
possible complications, etc.</p>
      <p>Our aim is to conceptualize a KB in DL that can be used for diagnosis of DM
relying on abductive reasoning. As the problem is rather complex, we will concentrate
on selected specific subproblems, and we will also abstract from some details which
can be implemented analogously.
3.1</p>
      <sec id="sec-3-1">
        <title>Hierarchy of Symptoms</title>
        <p>Typical symptoms of diabetes mellitus include: frequent urination, excessive thirst,
hyperglycemia, blurred vision, anorexia, weight loss, fatigue, and weakness. There are
some more, but they can be added into the formalization analogously. In the
following, diabetes mellitus is represented by the concept symbol DM, and the symptoms by
S1; : : : ; S8, respectively. In abductive reasoning, we observe some set of symptoms and
try to hypothesize the most relevant diagnosis. Therefore we record the relation between
the diagnosis and its symptoms by the axiom:</p>
        <sec id="sec-3-1-1">
          <title>9hasDiag:DM v 9hasSymp:(S1 u</title>
          <p>While literally (more precisely: deductively) this axiom means, that whoever has
DM must simultaneously manifest all eight symptoms S1; : : : S8, which is not always
true; it allows to generate relevant abductive hypotheses: if the patient p is observed
to have any subset of the symptoms S1; : : : ; S8 (e.g., we have an observation O1 =
p : (9hasSymp:S2) u (9hasSymp:S8)) then H1 = fp : 9hasDiag:DMg is among the
generated diagnoses. Note that this kind of modelling is simplified, as it ignores
uncertainties, often present in medical knowledge. In this paper we take this simplification as
we want to explore the possibilities of abduction with regular DL. Introducing
uncertainties is left for future work.</p>
          <p>There are, possibly, some relations between symptoms, like one symptom may be
a more specific or a synonymous name for another, for instance, polydipsia (S9) is a
synonym for excessive thirst. This is modelled by adding:</p>
          <p>S2</p>
          <p>
            S9
(
            <xref ref-type="bibr" rid="ref2">2</xref>
            )
Now, whenever we observe S9 instead of S2 among some other symptoms of DM we
derive equal hypotheses (e.g., if O2 = fp : (9hasSymp:S9) u (9hasSymp:S7)g, H1 is
still abductively derived).
          </p>
          <p>
            A more complex relationship among symptoms may occur in cases when some
symptoms are conditions which may themselves be manifested by some other
symptoms. For instance anorexia (S5) is a condition associated with weight loss, fatigue, and
weakness (S6; : : : ; S8). This may be modelled in two ways, adding either (
            <xref ref-type="bibr" rid="ref3 ref4">3–4</xref>
            ) or (
            <xref ref-type="bibr" rid="ref5">5</xref>
            ):
          </p>
        </sec>
        <sec id="sec-3-1-2">
          <title>S5 v 9hasSymp:(S6 u</title>
          <p>
            hasSymp hasSymp v hasSymp (
            <xref ref-type="bibr" rid="ref4">4</xref>
            )
S5 v S6 u
u S8
This then allows us to keep the axioms that relate diagnoses to symptoms more concise,
e.g., we may now replace (
            <xref ref-type="bibr" rid="ref1">1</xref>
            ) by:
Potential diagnoses of our interest are listed in Table 1. These concepts can be readily
taken from a number of medical ontologies. We chose to root them in SNOMED CT,
where they are present as disorders. As in many cases in the medical domain, the main
diagnosis (DM), has some subtypes which we want to distinguish. There are some
alternate diagnoses (e.g., diabetes insipidus) which need to be rooted out or confirmed
during the diagnostic process. And there are some related diagnoses (e.g., obesity,
ketoacidosis) which may take part in the diagnosis of DM as relevant symptoms. The
diagnoses thus form a hierarchy.
          </p>
          <p>We will narrow down our focus on the diagnoses listed in Table 1. The respective
part of the hierarchy is formalized using the following DL axioms:</p>
          <p>DM1 t DM2 t GD v DM</p>
        </sec>
        <sec id="sec-3-1-3">
          <title>LADA v DM1</title>
          <p>
            The hierarchy of diagnoses has significant influence on the generated hypotheses.
For instance, considering the axioms (
            <xref ref-type="bibr" rid="ref5 ref6">5–6</xref>
            ) formalizing the symptoms of DM, together
(
            <xref ref-type="bibr" rid="ref5">5</xref>
            )
(
            <xref ref-type="bibr" rid="ref6">6</xref>
            )
(
            <xref ref-type="bibr" rid="ref7">7</xref>
            )
(
            <xref ref-type="bibr" rid="ref8">8</xref>
            )
with (
            <xref ref-type="bibr" rid="ref7 ref8">7–8</xref>
            ), if some of the symptoms of DM are observed (e.g., O1 or O2), the
abductive reasoner will generate a number of diagnoses: H1 = fp : 9hasDiag:DMg as
before, but in addition also H2 = fp : 9hasDiag:DM1g, H3 = fp : 9hasDiag:DM2g,
H4 = fp : (9hasDiag:DM1) u (9hasDiag:DM2)g, and similar hypotheses for all
(asserted or derived) subconcepts of DM. This is because they now all allow to derive the
given observations. On the other hand, if we compare these hypotheses semantically,
we see that Hi K H1 for i &gt; 1 in the hypotheses above, as K [ Hi j= H1, and never vice
versa. Hence only H1 will be preferred.
A typical task in medical diagnostics is to distinguish one diagnosis from another, often
similar one. This is called di erential diagnosis. The two diagnoses may be
distinguished by considering some symptoms relevant to one of them, but not the other.
          </p>
          <p>
            We will consider DM1 and DM2; as di erentiating between them is a relevant
medical problem. DM1 and DM2 have some symptoms in common, but they also have some
di erent symptoms. In this case, the common symptoms are those of DM. When the
patient has some of these symptoms we say that she has DM – this case is covered by
axioms (
            <xref ref-type="bibr" rid="ref5 ref6">5–6</xref>
            ).
          </p>
          <p>Specific symptoms of DM1 include: belly pain, vomiting, fruity breath odor,
drowsiness, and coma (S10; : : : ; S14). This is formalized as follows:</p>
          <p>
            9hasDiag:DM1 v 9hasSymp:(S10 u : : : u S14)
Analogously, the symptoms of DM2 include: skin problems, slow healing, tingling,
numbness, and high BMI. We will name these as S15; : : : ; S19, which gives us the axiom:
9hasDiag:DM2 v 9hasSymp:(S15 u : : : u S19)
Together the three diagnoses are now covered with (
            <xref ref-type="bibr" rid="ref5 ref6 ref7">5–7</xref>
            ) and (
            <xref ref-type="bibr" rid="ref10 ref9">9–10</xref>
            ). Let us consider
some observations and the respective hypotheses:
(
            <xref ref-type="bibr" rid="ref9">9</xref>
            )
(
            <xref ref-type="bibr" rid="ref10">10</xref>
            )
– If we observe some set of symptoms that are common to both DM1 and DM2, e.g.,
having again the observation O1 = p : (9hasSymp:S2) u (9hasSymp:S8), then the
most preferred diagnosis will be H1 = fp : 9hasDiag:DMg. However, also H2 =
fp : 9hasDiag:DM1g, H3 = fp : 9hasDiag:DM2g, H4 = fp : (9hasDiag:DM1) u
(9hasDiag:DM2)g, will be valid abductive explanations (among others), but as we
already discussed in Sect. 3.2 they are all stronger than H1 and hence H1 will be the
most preferred.
– If we observe at least one symptom specific to DM1, e.g., having the
observation O3 = p : (9hasSymp:S2) u (9hasSymp:S10), this is no longer abductively
explained by H1, nor H3. The most preferred hypothesis will be H2. H4 is an
abductive explanation as well, but it is stronger than H2, and hence H2 is preferred.
– The case when we observe some specific symptoms of DM2 (but none of DM1) is
exactly analogous.
– Finally, if we observe specific symptoms of both DM1 and DM2, e.g., we have
O4 = p : (9hasSymp:S10) u (9hasSymp:S15), the most preferred hypothesis that
abductively explains this observation is H4. This is because in case of H1–H3 there
is always some symptom which is not explained. There are other explanations (e.g.,
H5 = fp : 9hasDiag:(DM1 u DM2)g), but they are all stronger than H4.
3.4
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Case of Secondary Explanation of the Observation</title>
        <p>During di erential diagnosis it is also important to recognize cases when the given set of
observed symptoms may have other possible explanations than the disease in question.
For instance, one of the major symptoms of DM (hyperglycemia named as S3) may be
caused as a side e ect of some medication.</p>
        <p>
          As the axiom (
          <xref ref-type="bibr" rid="ref6">6</xref>
          ) includes S3 as one of the DM symptoms, we are already able
to answer on the observation of having S3. Another possible explanation is taking the
medications (name them M1). So we add a new axiom:
        </p>
        <sec id="sec-3-2-1">
          <title>9hasMedication:M1 v 9hasSymp:S3</title>
          <p>
            (
            <xref ref-type="bibr" rid="ref11">11</xref>
            )
As this is a very simple conceptualization, there is only one observation by which this
axiom play role. This observation is O5 = p : 9hasSymp:S3. As well we have exactly
two hypotheses H6 = fp : 9hasMedication:M1g and H7 = fp : 9hasDiag:DMg. Neither
H6 nor H7 is stronger then the other one so both are preferred.
          </p>
          <p>This result means, that we cannot be sure, which explanation is correct and we have
to continue in diagnosting. We are satisfied with this answer because also in medical
domain information about hyperglycemia presence is insu cient condition to decide
between taking medications and having diagnosis DM.
3.5</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Associated Conditions</title>
        <p>
          In the medical domain, relations between diagnoses may come into play. Some
associated conditions may have similar symptoms, or subset of symptoms as other diagnoses.
Thus, in fact, in Axiom (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) the symptoms S10; : : : ; S14 are symptoms of an associated
diagnosis called ketoacidosis. Similarly the symptom S19 is a symptom of obesity, an
associated diagnosis of DM2.
        </p>
        <p>
          To take this into the account, we may add to (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) a new axiom, Axiom (
          <xref ref-type="bibr" rid="ref12">12</xref>
          ), and
analogously to (
          <xref ref-type="bibr" rid="ref10">10</xref>
          ) a new axiom, Axiom (
          <xref ref-type="bibr" rid="ref13">13</xref>
          ), as follows:
9hasDiag:KA v 9hasSymp:(S10 u : : : u S14)
        </p>
        <sec id="sec-3-3-1">
          <title>9hasDiag:Ob v 9hasSymp:S19</title>
          <p>
            (
            <xref ref-type="bibr" rid="ref12">12</xref>
            )
When we take the observations O1 and O3 and try to explain them using axioms (
            <xref ref-type="bibr" rid="ref5 ref6 ref7">5–7</xref>
            )
and (
            <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref9">9–13</xref>
            ), we see some changes:
– The case of observation O1 = p : (9hasSymp:S2) u (9hasSymp:S8) is not a ected
by the newly added axioms, as the symptom S2 is not explained by any of them.
          </p>
          <p>So, the most preferred hypothesis is again H1.
– If we observe at least one symptom specific to ketoacidosis together with some
symptoms of DM (which are all symptoms of DM1 as we already know), e.g.,
having the observation O3 = p : (9hasSymp:S2) u (9hasSymp:S10), then
besides for H2 = fp : 9hasDiag:DM1g we have to consider also hypothesis H6 =
fp : (9hasDiag:DM)u(9hasDiag:KA)g: they both explain O3 and both are preferred
to any other but they are mutually incomparable. But H6 is certainly unexpected as
so far we modelled the axioms in such a way that abductively the symptoms DM
and ketoacidosis explain the diagnosis of DM1. To solve this we have to add yet
another axiom:</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>9hasDiag:DM u 9hasDiag:KA v 9hasDiag:DM1</title>
          <p>
            (
            <xref ref-type="bibr" rid="ref14">14</xref>
            )
The axiom enables to compare the two hypotheses also deductively, i.e., any patient
with both diagnoses DM and ketoacidosis is inferred to have also DM1. Hence H6
is now stronger than H2, and so H2 is the single most preferred hypothesis.
– The case when we observe some specific symptoms of obesity together with
symptoms of DM is analogous. We get the expected hypothesis H3 = fp : 9hasDiag:
DM2g, but to suppress H7 = fp : (9hasDiag:DM) u (9hasDiag:Ob)g as less
preferred, we need to add:
9hasDiag:DM u 9hasDiag:Ob v 9hasDiag:DM2
(
            <xref ref-type="bibr" rid="ref15">15</xref>
            )
– If we observe only the symptoms shared by obesity and DM2, in our simplified
example, only O5 = p : (9hasSymp:S18), two preferred and incomparable hypotheses
are H8 = fp : (9hasDiag:Ob)g and H3 = fp : (9hasDiag:DM2)g. In this case, this is
in accord with the medical knowledge: this symptom can either be caused by one
condition or by the other.
3.6
          </p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>Case of Complications</title>
        <p>However, in certain circumstances, one may wish to model the associated diagnoses
di erently, to capture a closer relation between them. This is, for instance, the case of
DM1 and ketoacidosis. When we consider O3 = p : (9hasSymp:S2)u(9hasSymp:S10)
the modelling from the previous section gives the single most preferred diagnosis H2 =
fp : 9hasDiag:DM1g. While ketoacidosis is also indicated by symptom S10, the
hypothesis H9 = fp : 9hasDiag:KAg is not an abductive explanation as it does not explain S10.</p>
        <p>This solution does not correctly capture the importance of ketoacidosis presence in
diabetic patients. Ketoacidosis does not merely share symptoms with DM1, but it is an
acute complication thereof which rarely occurs in non-diabetic individuals.</p>
        <p>Therefore, we would intuitively want the explanation H10 = fp : 9hasDiag:DM1) u
(9hasDiag:KA)g in this case. In fact, H10 also explains O3 in this case but is is stronger
than H2 and hence also less preferred.</p>
        <p>
          To achieve this, we have to remodel the knowledge, so that, given some symptoms
that are shared by both DM1 and ketoacidosis, only the conjunction of these diagnoses
explains them: we exchange (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ) together with (
          <xref ref-type="bibr" rid="ref12">12</xref>
          ) with a new axiom:
(9hasDiag:DM1) u (9hasDiag:KA) v 9hasSymptom:(S10 u : : : u S14)
(
          <xref ref-type="bibr" rid="ref16">16</xref>
          )
This however still does not force H10 as single most preferred, as due to the presence
(
          <xref ref-type="bibr" rid="ref7">7</xref>
          ) and (
          <xref ref-type="bibr" rid="ref14">14</xref>
          ) we now have H10 K H6 and H6 K H10, hence both H10 and H6 are
equally preferred. However, recall that we have asserted (
          <xref ref-type="bibr" rid="ref14">14</xref>
          ) only to support a slightly
di erent relation between DM1 and ketoacedosis in the previous section, so when we
drop this axiom then the single, most preferred diagnosis remains to be H10.
        </p>
        <p>This last move reflects the fact that in order to model di erent relationships between
preferred explanations we need to manipulate also the deductive part of the KB (used
during deductive comparisons of hypotheses) and, what is more, these manipulation is
not just additive, it is selective, which may cause problems in more complex cases with
higher number of variously interrelated diagnoses.
3.7</p>
      </sec>
      <sec id="sec-3-5">
        <title>Case of Further Examination Needed</title>
        <p>Consider again the case of one hypothesis being a more specific case of another.
Typically, we have two diagnoses which have some common symptoms, while the more
specific one likely has some additional symptoms (like with DM1 and DM, or DM2 and
DM above). If we observe only some of the common symptoms, our previous modelling
derives the less specific hypothesis. The more specific hypothesis also explains the
observations, but there is no additional evidence for it, so we prefer the less specific one.</p>
        <p>Most of the time this is expected, but not all the time. It may be necessary to
differentiate between such hypotheses by all means, and so, if perhaps some evidence is
lacking it should be obtained by additional examination if possible, by a laboratory test
for instance. Hence the outcome from the diagnosis procedure should not be just the less
specific hypothesis, but instead it should indicate also that additional tests are needed.</p>
        <p>
          From a medical perspective this might be illustrated on the following problem: some
patients who fit a certain profile (they are older, and not obese) and typically show
symptoms common to DM1, may in fact have a specific type of DM1 called LADA (cf.
Table 1 and axiom (
          <xref ref-type="bibr" rid="ref8">8</xref>
          )). To di erentiate between these two diagnoses, medical
practitioners are advised to test antibodies (e.g., GADA) in a blood sample.
        </p>
        <p>
          As this case leads to some complex modelling, we will explain it on simplified
examples. Assume that LADA is a specific form of DM1, and that DM1 has some
symptom (S01) and LADA has an additional specific one (S02). That is, we start from:
(
          <xref ref-type="bibr" rid="ref17">17</xref>
          )
(
          <xref ref-type="bibr" rid="ref18">18</xref>
          )
(19)
        </p>
        <p>
          Now we end up exactly in the situation described above. If only DM1 symptoms
are observed (i.e., in our simplified case O6 = p : 9hasSymp:S01) then the preferred
hypothesis is H2 = fp : 9hasDiag:DM1g. In order to resolve this we may try to use (20)
instead of (
          <xref ref-type="bibr" rid="ref18">18</xref>
          ):
        </p>
        <p>(9hasDiag:DM1) u (9needS:LT) v 9hasSymp:S01</p>
        <p>We now get the expected most preferred hypothesis H11 = fp : (9hasDiag:DM1) u
(9needS:LT)g for O6 and so much for O7 = p : 9hasSymp:S02 we will get H12 =
fp : 9hasDiag:LADAg as most preferred , however for O8 = p : (9hasSymp:S01) u
(9hasSymp:S02) we now get H12 = fp : (9hasDiag:LADA) u (9needS:LT)g as most
preferred, as the need of the lab test is now necessary to explain S0 . This is unintuitive
1
as indeed once we observe S02 we know that the patient has LADA, no more tests are
needed. It is easy to verify that it does not help to alter (19) to (21):</p>
        <sec id="sec-3-5-1">
          <title>9hasDiag:LADA v 9hasSymp:(S 10 u S 20)</title>
          <p>
            (21)
This is due to (
            <xref ref-type="bibr" rid="ref17">17</xref>
            ), (20), and (21) now for O6 give both H11 and H12 as preferred, and
they are incomparable. But clearly H12 is exactly the wrong hypothesis here. We are
getting into some vicious circle.
          </p>
          <p>
            The only solution we were able to come up with is to start treating the symptoms
as completely specified. That is, either S01 or :S01 is always part of the observation, and
same for S02 or other symptoms possibly involved in this part of the derivation. Using
(
            <xref ref-type="bibr" rid="ref17">17</xref>
            ), (22), and (23), we always get the expected results:
(9hasDiag:DM1) u (9needS:LT) v 9hasSymp:(S01 u :S02)
          </p>
        </sec>
        <sec id="sec-3-5-2">
          <title>9hasDiag:LADA v 9hasSymp:(S01 u S02) u 9hasSymp:(:S01 u S02)</title>
          <p>(22)
(23)
However, we now have to ask queries di erently. For O06 = p : 9hasSymp:(S01 u :S02)
the most preferred hypothesis is H11, and for both for O07 = p : 9hasSymp:(:S1 u S02)
and for O8 (no need to change here) we will get H12.</p>
          <p>So, we were able to get the expected results, but for a considerable price. Treating
symptoms as completely specified is not in line with the usual intuitions behind
abduction, where it is normal to assume that the observations are incomplete, and we are
tasked to give the most appropriate explanation for any such given observation. In
addition, it leads to a considerable blow up in the axioms, where all possible combinations
of positive and negative symptoms need to be enumerated.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>All formalizations in our use case are instances of the ABox abduction problem as
defined in Definition 1, based on Elsenbroich et al. [7]. The explanations that we seek
are typically of a specific form of a single assertion:
p : (9R1:C1) u
u (9Rn:Cn)
involving single patient p, where R1, . . . , Rn most typically will come from some a
priori known set of roles which are relevant based on the domain knowledge. A similar
assumption may often hold also for the concepts C1, . . . , Cn (e.g., most often these will
be atomic concepts for various diagnoses and conditions relevant to the patients state).</p>
      <p>While exceptions to these constraints may certainly be found (e.g., a second person
involved from which the patient contracted the disease), in many cases the form of
expected explanations will be reducible into atomic diagnoses by adding “interface”
axioms of the form:</p>
      <p>Diag1 (9R1:C1) u
u (9Rn:Cn)
(24)
(25)
The hypothesis of the complex form (24) now reduces into the atomic form p : Diag1
which makes it possible to postulate the problem as simple ABox abduction. This is an
important observation, as Du et al. [5] showed that the simple abduction problem can
be e ectively solved even for DLs up to SH IQ.</p>
      <p>Most of the examples we have shown rely in a fairly basic DL ALC, abduction
support for which is known [12,13,5,9]. We used complex role inclusions to capture
dependencies between symptoms, but we also showed a simpler modelling which does
not require it. Di erent complex DL constructs that might possibly be needed in more
realistic situations certainly include number restrictions, and also restriction over
concrete domains (known, e.g., in OWL [14]), that would enable to support some more
elaborate statements about symptoms (the patient has at least some number of
symptoms, or has a numeric value of some symptom from a specific range). Extensions of
abductive reasoning for more expressive DLs that include such constructs are therefore
desirable.</p>
      <p>From a modelling perspective, an interesting lesson learned from the use case is that
modelling a knowledge base to be used in a classical, deductive way, and modelling it to
support abductive reasoning pose di erent, and sometimes conflicting requirements. As
we noted in Sect. 3.1, it is often the case that certain explanation is expected to be valid
for a number of symptoms, including any subsets thereof. Hence the typical approach
that we relied upon is to formulate the axioms in a stronger fashion – the explanation
implies all the symptoms, hence any subset follows. This kind of modelling is also
demonstrated in the literature [7,5].</p>
      <p>Firstly, we note that this requires the knowledge to be used in abductive application
to be modelled di erently than it is typically usual for ontologies, which normally are
subject to deductive reasoning. Secondly, as we have repeatedly observed in this paper,
even the process of abduction has an important subtask when hypotheses are compared
and strictly deductive reasoning is used for this. This results in complex inter-relations
between the “abductive” and the “deductive” part of the knowledge base and may lead
to rather complicated modelling. A possible way how to deal with this is to treat the
abductive and the deductive knowledge separately [16], or to used a more refined
formulation of the abduction problem [10]. We plan to try this in the future.</p>
      <p>Finally, we note that abduction has also a number of advantages, e.g., when
compared to deductive diagnostic reasoning. In number of works [8,17,18] relying upon the
latter, where deductive rules of the form S 1; ; S n ! D are used, where S i are
symptoms and D is the diagnosis, the authors point out the problem occurring in case of two
diagnoses D1, D2 with the former having a subset of symptoms of the latter. Using
deductive inference, and observing all symptoms of D2, both D1 and D2 are derived. This
is counterintuitive as specific symptoms of D2 were observed which are not symptoms
of D1. This problem has to be addressed by some suitable workarounds. In comparison,
as we have demonstrated in the use case, abductive reasoning naturally eliminates the
hypotheses which do not explain all of the observed symptoms.</p>
      <p>Acknowledgments. This work was supported by VEGA project no. 1/1333/12. Júlia
Pukancová is also supported by grant GUK no. UK/426/2015.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. American Association of Clinical Endocrinologists:
          <article-title>Diabetes care plan guidelines</article-title>
          .
          <source>Endocrine Practice</source>
          <volume>17</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <article-title>American diabetes association: Diagnosis and classification of diabetes mellitus</article-title>
          .
          <source>Diabetes Care</source>
          <volume>31</volume>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nardi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.F</given-names>
          </string-name>
          . (eds.):
          <article-title>The Description Logic Handbook: Theory, Implementation, and Applications</article-title>
          . Cambridge University Press (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Colucci</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noia</surname>
          </string-name>
          , T.D.,
          <string-name>
            <surname>Sciascio</surname>
            ,
            <given-names>E.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Donini</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mongiello</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Concept abduction and contraction in description logics</article-title>
          .
          <source>In: Proceedings of the 2003 International Workshop on Description Logics (DL2003)</source>
          , Rome,
          <source>Italy September 5-7</source>
          ,
          <year>2003</year>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Du</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>J.Z.</given-names>
          </string-name>
          :
          <article-title>Towards practical ABox abduction in large description logic ontologies</article-title>
          .
          <source>Int. J. Semantic Web Inf. Syst</source>
          .
          <volume>8</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>33</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Eiter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottlob</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leone</surname>
          </string-name>
          , N.:
          <article-title>Abduction from logic programs: Semantics and complexity</article-title>
          .
          <source>Theor. Comput. Sci</source>
          .
          <volume>189</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>129</fpage>
          -
          <lpage>177</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Elsenbroich</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kutz</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>A case for abductive reasoning over ontologies</article-title>
          .
          <source>In: Proceedings of the OWLED*06 Workshop on OWL: Experiences and Directions</source>
          , Athens, Georgia, USA, November
          <volume>10</volume>
          -
          <issue>11</issue>
          ,
          <year>2006</year>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>García-Crespo</surname>
          </string-name>
          , Á.,
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mencke</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gómez-Berbís</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colomo-Palacios</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          : Oddin:
          <article-title>Ontology-driven di erential diagnosis based on logical inference and probabilistic refinements</article-title>
          .
          <source>Expert Systems with Applications</source>
          <volume>37</volume>
          (
          <issue>3</issue>
          ),
          <fpage>2621</fpage>
          -
          <lpage>2628</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Halland</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Britz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Naïve ABox abduction in ALC using a DL tableau</article-title>
          .
          <source>In: Proceedings of the 2012 International Workshop on Description Logics, DL-2012</source>
          , Rome, Italy, June 7-10,
          <year>2012</year>
          .
          <source>Sun SITE Central Europe (CEUR)</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hubauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lamparter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pirker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Relaxed abduction: Robust information interpretation for incomplete models</article-title>
          .
          <source>In: Proceedings of the 24th International Workshop on Description Logics (DL</source>
          <year>2011</year>
          ), Barcelona, Spain,
          <source>July 13-16</source>
          ,
          <year>2011</year>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hubauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Legat</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seitz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Empowering adaptive manufacturing with interactive diagnostics: A multi-agent approach</article-title>
          .
          <source>In: Advances on Practical Applications of Agents and Multiagent Systems - 9th International Conference on Practical Applications of Agents and Multiagent Systems, PAAMS</source>
          <year>2011</year>
          , Salamanca, Spain,
          <fpage>6</fpage>
          -
          <lpage>8</lpage>
          April
          <year>2011</year>
          . pp.
          <fpage>47</fpage>
          -
          <lpage>56</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Klarman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Endriss</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>ABox abduction in the description logic ALC</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          <volume>46</volume>
          (
          <issue>1</issue>
          ),
          <fpage>43</fpage>
          -
          <lpage>80</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>An ABox abduction algorithm for the description logic ALCI</article-title>
          .
          <source>In: Intelligent Information Processing VI - 7th IFIP TC 12 International Conference</source>
          . pp.
          <fpage>125</fpage>
          -
          <lpage>130</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. OWL Working Group (ed.)
          <article-title>: OWL 2 Web Ontology Language Document Overview</article-title>
          . Recommendation,
          <volume>W3C</volume>
          (
          <issue>27</issue>
          <year>October 2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Peirce</surname>
            ,
            <given-names>C.S.</given-names>
          </string-name>
          :
          <article-title>Deduction, induction, and hypothesis</article-title>
          .
          <source>Popular science monthly 13</source>
          ,
          <fpage>470</fpage>
          -
          <lpage>482</lpage>
          (
          <year>1878</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Petasis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Möller</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karkaletsis</surname>
          </string-name>
          , V.:
          <article-title>BOEMIE: Reasoning-based information extraction</article-title>
          .
          <source>In: Proceedings of the 1st Workshop on Natural Language Processing and Automated Reasoning co-located with 12th International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR</source>
          <year>2013</year>
          ),
          <string-name>
            <given-names>A</given-names>
            <surname>Corunna</surname>
          </string-name>
          , Spain,
          <year>September 15th</year>
          ,
          <year>2013</year>
          . pp.
          <fpage>60</fpage>
          -
          <lpage>75</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Rodríguez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labra</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alor-Hernandez</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gómez</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Posada-Gomez</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          : ADONIS:
          <article-title>Automated diagnosis system based on sound and precise logical descriptions</article-title>
          .
          <source>In: Proc. of 22nd IEEE International Symposium</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Rodríguez-González</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labra-Gayo</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colomo-Palacios</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mayer</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>GómezBerbís</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>García-Crespo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>SeDeLo: Using semantics and description logics to support aided clinical diagnosis</article-title>
          .
          <source>Journal of Medical Systems</source>
          <volume>36</volume>
          (
          <issue>4</issue>
          ),
          <fpage>2471</fpage>
          -
          <lpage>2481</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>