=Paper=
{{Paper
|id=Vol-1350/paper-27
|storemode=property
|title=Advancing
                ELK: Not Only Performance Matters
|pdfUrl=https://ceur-ws.org/Vol-1350/paper-27.pdf
|volume=Vol-1350
|dblpUrl=https://dblp.org/rec/conf/dlog/KazakovK15
}}
==Advancing
                ELK: Not Only Performance Matters==
<pdf width="1500px">https://ceur-ws.org/Vol-1350/paper-27.pdf</pdf>
<pre>
     Advancing ELK: Not Only Performance Matters

                           Yevgeny Kazakov and Pavel Klinov

                             The University of Ulm, Germany
                        {yevgeny.kazakov, pavel.klinov}@uni-ulm.de


       Abstract. This paper reports on the recent development of ELK, a consequence-
       based reasoner for EL+  ⊥ ontologies. It covers novel reasoning techniques which
       aim at improving efficiency and providing foundation for new reasoning services.
       On the former front we present a simple optimization for handling of role com-
       position axioms, such as transitivity, which substantially reduces the number of
       rule applications. For the latter, we describe a new rule application strategy that
       takes advantage of concept definitions to avoid many redundant inferences with-
       out making rules dependent on derived conclusions. This improvement is not vis-
       ible to the end user but considerably simplifies implementation for incremental
       reasoning and proof generation. We also present a rewriting of low-level infer-
       ences used by ELK to higher-level proofs that can be defined in the standard DL
       syntax, and thus be used for automatic verification of reasoning results or (vi-
       sual) ontology debugging. We demonstrate the latter capability using a new ELK
       Protégé plugin.


1   Introduction
ELK is an ontology reasoner designed for top classification performance on OWL EL
ontologies [1]. Its characteristic features are consequence-based calculus, highly par-
allelizable reasoning, and aggressive optimizations to reduce the number of derived
axioms sufficient for classification (deriving all subsumptions between concept names).
     Sheer performance has been the sole goal for the first few versions of ELK and
it enabled it to become the reasoner of choice in biomedical circles where large EL
ontologies are built to manage scientific terminologies [2–7]. After that ELK started
to evolve towards providing additional reasoning-related services, such as incremental
classification [8] and proof tracing [9]. It turned out that some traits of ELK’s classi-
fication procedure, in particular, the non-deterministic saturation, can complicate the
development or weaken the guarantees of such extra services. For example, the compo-
sition/decomposition optimization (cf. [1]) has to be off when incrementally retracting
inferences [8]. Also, the proof tracing method guarantees only that all proofs performed
by ELK will be generated, not all proofs supported by the calculus [9]. This, in partic-
ular, means that one cannot in general obtain all justifications [10] from proofs.
     In this paper we describe the steps towards adapting the main reasoning procedure
to rectify this sort of issues without major performance setbacks. On the performance
front we show a technique to reduce the number of inferences on roles. We also present
a rewriting of the low-level traced inferences into higher-level proof-based explanations
which could be shown to the user or verified using automated reasoning tools. Due to
the space constraints, some results are deferred to the technical report [11].
                                 C v D1 C v D2
 E0                        E+
                            u
      CvC                          C v D1 u D2
E>                               C v ∃R.D D v ⊥
      Cv>                  E⊥
                                      Cv⊥
      CvD
Ev        : D v E ∈ O E C v ∃R.D D v E
      CvE              ∃
                            C v ∃R.E
        C v D1 u D2
E−
 u                               Ci−1 v ∃Ri .Ci (1 ≤ i ≤ k, k ≥ 0)
      C v D1 C v D2         E◦                                     : R1 · · · Rk v R ∈ O
                                           C0 v ∃R.Ck

                     Fig. 1. Basic inference rules for reasoning in EL+
                                                                      ⊥


2     Consequence-Based Reasoning in EL+
                                       ⊥

We first describe ELK’s consequence-based procedure for EL+       ⊥ reasoning. Most the-
oretical results, such as completeness, redundancy elimination, and goal-directed rule
application, are minor variations of [1] but the rules for dealing with role chain axioms
without binarization and rules for reasoning with reflexive roles are new.

2.1   The Description Logic EL+
                              ⊥

The syntax of EL+ ⊥ is defined using a vocabulary consisting of countably infinite sets
of (atomic) roles and atomic concepts. EL+     ⊥ concepts are defined using the grammar
C ::= A | > | ⊥ | C1 u C2 | ∃R.C, where A is an atomic concept, R a role, and
C(i) ∈ C. EL+  ⊥ role chains are defined using the grammar P ::=  | R·ρ, where  is
the empty role chain, R a role and ρ ∈ P. We usually write role chains as R1 ·R2 · · · Rn
instead of R1 ·(R2 · · · (Rn ·)). An EL+⊥ axiom is either a concept inclusion C1 v C2
for C1 , C2 ∈ C or a role inclusion ρ v R for ρ ∈ P and a role R. We regard the
concept equivalence C1 ≡ C2 as an abbreviation for two concept inclusions C1 v C2
and C2 v C1 . We also call  v R a role reflexivity axiom. An EL+  ⊥ ontology O is a finite
set of EL+ ⊥ axioms.  Semantics    of EL +
                                         ⊥ is defined in the usual way ( is interpreted as
identity). A concept C is subsumed by D w.r.t. O if O |= C v D. In this case, we call
C v D an entailed subsumption (w.r.t. O). The ontology classification task requires to
compute all entailed subsumptions between atomic concepts occurring in O.

2.2   Inference Rules
Classification of EL+ ⊥ ontologies is usually performed by applying rules that derive
logical consequences of axioms. Figure 1 lists the EL+   ⊥ -rules that are similar to those
usually considered in the literature [12, 13]. The premises of the rules are written above
the horizontal line, the conclusions below, and the axioms in the ontology (a.k.a. side
conditions) that trigger rule applications after the colon. Note that rule E◦ can be used
with k = 0, in which case it has no premises and uses the reflexivity axiom  v R ∈ O.

Example 1. Consider the EL+  ⊥ ontology O = {A v ∃R.B, B v ∃S.C, R·S·H v V ,
 v H}. Then it is possible to derive A v ∃V.C using the rules in Figure 1 as follows:
  AvA            by E0 (),                                                             (1)
  A v ∃R.B       by Ev (A v A) : A v ∃R.B ∈ O,                                         (2)
  BvB            by E0 (),                                                             (3)
  B v ∃S.C       by Ev (B v B) : B v ∃S.C ∈ O,                                         (4)
  C v ∃H.C       by E◦ () :  v H ∈ O,                                                 (5)
  A v ∃V.C       by E◦ (A v ∃R.B, B v ∃S.C, C v ∃H.C) : R·S·H v V ∈ O.                 (6)

    Formally, a derivation for EL ontology O (using the rules in Figure 1) is a sequence
of EL+ ⊥ axioms d = {αi | i ≥ 1} such that each αi with i ≥ 0 is obtained from axioms
{αj | 1 ≤ j < i} using one of the rules in Figure 1 and axioms in O as side conditions.
The size ||d|| of d is the number of axioms in d. For example, the sequence of axioms
(1)–(6) in Example 1 forms a derivation, in which every axiom is obtained from the
previous axioms by the rules in Figure 1 as indicated next to the axioms.
    The rules in Figure 1 are simple to understand but not very efficient to implement.
The problem is caused by rule E◦ , which may produce many conclusions for ontologies
with deep role hierarchies. For example, consider O = {Rj−1 v Rj | 1 ≤ j ≤ m} ∪
{Ci−1 v ∃R0 .Ci , ∃Rm .Di v Di−1 | 1 ≤ i ≤ n} ∪ {Cn v Dn }. Then one can only
derive C0 v D0 by the rules in Figure 1 by deriving quadratically-many intermediate
axioms Ci−1 v ∃Rj .Ci by E◦ using Rj−1 v Rj ∈ O (1 ≤ i ≤ n, 1 ≤ j ≤ m).
Therefore, ELK implements optimized rules listed in Figure 2 that help avoiding this
problem by deriving subsumptions on role (chains) separately [1]. To formulate these
rules, we have slightly extended the syntax of EL+  ⊥ . First, we can derive role chains on
the right-hand side of role inclusions: ρ1 v ρ2 (I |= ρ1 v ρ2 if ρI1 ⊆ ρI2 ). Second, we
allow role chains to occur in existential restrictions: ∃(R·ρ).C is rewritten to ∃R.C if
ρ = , or to ∃R.∃ρ.C otherwise (whenever we write ∃ρ.C we assume that ρ 6= ). The
extended EL+  ⊥ axioms can be used in derivations, but not in the ontology O.

Example 2. Below is the derivation for A v ∃V.C for the ontology O in Example 1
using the rules in Figure 2:

        AvA                   by C0 (),                                                (7)
        A v ∃R.B              by Cv (A v A) : A v ∃R.B ∈ O,                            (8)
        RvR                   by R0 (),                                                (9)
        BvB                   by C0 (),                                               (10)
        B v ∃S.C              by Cv (B v B) : B v ∃S.C ∈ O,                           (11)
        SvS                   by R0 (),                                               (12)
        CvC                   by C0 (),                                               (13)
         v                  by R0 (),                                               (14)
         vH                  by Rv ( v ) :  v H ∈ O,                              (15)
        S v S·H               by Rr (S v S,  v H),                                  (16)
        A v ∃(R·S·H).C        by C◦ (A v ∃R.B, R v R, B v ∃S.C, S v S·H),             (17)
                                              v R ρ 1 v ρ2
      R0                               Rl
           ρvρ                                  ρ1 v R·ρ2
           ρ1 v ρ2                           ρ 1 v R  v ρ2
      Rv           : ρ2 v R ∈ O        Rr
           ρ1 v R                                ρ1 v R·ρ2

                                             C v ∃ρ.D D v ⊥
      C0                              C⊥
           CvC                                    Cv⊥
                                             CvD vR
      C>                               C∃
           Cv>                                C v ∃R.D
           CvD                               C v ∃ρ.D ρ v R D v E
      Cv       :DvE∈O                  C∃
           CvE                                      C v ∃R.E
             C v D1 u D2                     C v ∃ρ1 .D   ρ1 v R D v ∃ρ2 .E      ρ2 v ρ
      C−
       u                               C◦
           C v D1 C v D2                                   C v ∃(R·ρ).E
           C v D1 C v D2
      C+
       u
             C v D1 u D2

           Fig. 2. Optimized inference rules for reasoning in EL+
                                                                ⊥ implemented in ELK


  R·S·H v R·S·H                 by R0 (),                                                 (18)
  R·S·H v V                     by Rv (R·S·H v R·S·H) : R·S·H v V ∈ O,                    (19)
           A v ∃V.C             by C∃ (A v ∃(R·S·H).C, R·S·H v V, C v C).                 (20)
   As can be seen from Examples 1 and 2, derivations using the rules in Figure 2 can
be more difficult to understand because they use relatively complex rules such as Rr
and C◦ and manipulate with extended EL+ axioms such as A v ∃(R·S·H).C and
S v S·H, the last of which is not even expressible in EL+ . Fortunately, it is always
possible to rewrite any derivation by the rules in Figure 2 into the one by the rules in
Figure 1 using a simple recursive procedure (with an unavoidable quadratic blowup):
Theorem 1. Let O be an EL+    ⊥ ontology, d a derivation by the rules in Figure 2 for O,
and F v G ∈ d an (ordinary) EL+    ⊥ axiom. Then one can construct a derivation e by
the rules in Figure 1 for O with F v G ∈ e such that ||e|| = O(||d||2 ).
    The proof of Theorem 1 can be found in the technical report [11], which also con-
tains an overview of the Protégé plug-in for displaying proofs based on this result.

2.3   Composed Conclusions, Redundancy and Completeness
From now on, we focus in the rules in Figure 2, so when we talk about axioms derived
by these rules, we mean extended EL+   ⊥ axioms. It is easy to see that the rules in Fig-
ure 2 are sound, that is, the conclusions of the rules are logical consequences of the
premises and the side conditions, if there are any. Therefore, every derivation contains
only axioms entailed by the ontology. It turns out that the converse property also holds:
Theorem 2. Let O be an EL+                                   +
                            ⊥ ontology and F v G an EL⊥ concept inclusion such
that O |= F v G. Then there exists a derivation d using the rules in Figure 2 such that
either F v G ∈ d or F v ⊥ ∈ d.
    Similarly to existing results [12, 1], Theorem 2 is proved by constructing a canonical
model using the set of all derivable axioms. One can actually prove a stronger version
of this theorem, namely that every entailed subsumption is derivable by an optimized
derivation—a derivation which does not use a certain kind of redundant inferences [11].
Definition 1. We say that an axiom α in a derivation d is composed if can be obtained
by rules C+       
            u , C∃ or C∃ from the previous axioms. An application of a rule in Figure 2
to premises in d is redundant if it is an application by rule C−
                                                               u in which the premise is
composed, by rule C∃ in which the first premise is composed, or by rule C◦ in which
the first or the third premise is composed. The derivation d is optimized if every axiom
α in d is obtained from the previous axioms using a non-redundant rule application.
Example 3. Consider the ontology O = {A v ∃R.B, B v C, C v D}. Then the fol-
lowing derivation using the rules in Figure 2 is possible:
                 AvA                    by C0 (),                                    (21)
                 A v ∃R.B               by Cv (A v A) : A v ∃R.B ∈ O,                (22)
                 BvB                    by C0 (),                                    (23)
                 BvC                    by Cv (B v B) : B v C ∈ O,                   (24)
                 BvD                    by Cv (B v C) : C v D ∈ O,                   (25)
                 RvR                    by R0 (),                                    (26)
           (+) A v ∃R.C                 by C∃ (A v ∃R.B, R v R, B v C),              (27)
                 CvC                    by C0 (),                                    (28)
               CvD                      by Cv (C v C) : C v D ∈ O,                   (29)
           (+) A v ∃R.D                 by C∃ (A v ∃R.C, R v R, C v D).              (30)
In this derivation, axioms (27) and (30) (labeled with +) are composed because they
were obtained by rule C∃ . Hence the inference that has produced (30) is redundant
because it is made by an application of C∃ to a composed first premise (27). Still,
the derivation (21)–(30) is optimized because (30) can be obtained from the previous
axioms by another (non-redundant) application of C∃ to a non-composed premise (22):
           (+)   A v ∃R.D               by C∃ (A v ∃R.B, R v R, B v D).              (31)
In other words, since a derivation is a sequence of axioms and not a sequence of rule
applications, it matters by which inferences axioms can be obtained from the previous
axioms. Note that if (25) is removed from the derivation, then the inference (31) is no
longer possible and the derivation becomes non-optimized.
    In practice, the optimization above means that when applying the rules in Figure 2
to check entailment of concept inclusion, it is not necessary to apply C−
                                                                        u to premises
derived by C+ u or apply C∃ and C◦ to premises derived by C∃ [1].


2.4   The Subformula Property and Goal-Directed Rule Application
So far Theorem 2 cannot be used for effectively checking if a given subsumption C v
D is entailed by the ontology O since there are infinitely many axioms that can be
derived by the rules in Figure 2—already C0 can produce infinitely many conclusions.
It turns out, it is sufficient to derive only axioms of the form ρ1 v ρ2 , C1 v C2 , and
C1 v ∃ρ2 .C2 such that all concepts Ci and role chains ρi (i = 1, 2) occur (possibly as
sub-expressions) either in the ontology O or in the given subsumption F v G tested for
entailment [1]. In other words, if O |= F v G then F v G or F v ⊥ is derivable by the
rules in Figure 2 without creating new concepts or role chains. We refer to this property
as subformula property. The subformula property implies that checking entailment O |=
F v G can be done in polynomial time since there are at most polynomially-many
different axioms of the above forms, and one can compute a derivation d containing all
such axioms by repeatedly applying the rules in Figure 2.
     The rules, however, can be restricted even further. Specifically, an axiom C1 v C2
needs to be derived in d only if C1 = F for the tested subsumption F v G or if some
non-composed axiom of the form C v ∃ρ.C1 is already derived in d. Indeed, it is easy
to observe from the rules in Figure 2 that an axiom C1 v C2 can be used in a rule
application only if this rule derives a concept inclusion axiom with the same left-hand
side C1 or it uses another non-composed axiom of the form C v ∃ρ.C1 (rules C∃
and C◦ ). Similarly, an axiom ρ1 v ρ2 needs to be derived only if ρ1 =  or if some
other non-composed axiom of the form C v ∃ρ1 .D is already derived. We call this
optimization goal-directed rule application.

Example 4. Consider the ontology O = {A v ∃R.B, A v ∃R.C, B v C, C v B}.
Suppose we want to check whether O |= A v B. Then the following goal-directed
non-redundant rule applications can be performed:

                 AvA                    by C0 (),                                   (32)
                 A v ∃R.B               by Cv (A v A) : A v ∃R.B ∈ O,               (33)
                 BvB                    by R0 (),                                   (34)
                 BvC                    by Rv (B v B) : B v C ∈ O,                  (35)
                 RvR                    by R0 (),                                   (36)
           (+) A v ∃R.C                 by C∃ (A v ∃R.B, R v R, B v C),             (37)
           (+) A v ∃R.C                 by Cv (A v A) : A v ∃R.C ∈ O.               (38)

Note that deriving axioms with the left-hand side C (e.g., C v C by rule C0 ) is not
necessary since the axiom A v ∃R.C is composed (and thus, e.g., cannot be used in
rule C∃ like axiom A v ∃R.B). Since no further rules need to be applied and the axiom
A v B is not derived, we can conclude that O 6|= A v B. Note that if we swap (33)
with (38), then the axiom A v ∃R.C would not be composed and we would need to
derive subsumptions C v C and C v B by rules C0 and Cv . Thus the set of derived
axioms depends on the order in which the rules are applied.

    Note that if a derivation d contains C v D or ρ1 v ρ2 then C v C or ρ1 v ρ1 must
be derived in d respectively by C0 and R0 before that. Therefore, to save space, from
now on we skip applications of the rules C0 and R0 (e.g., like (32), (34), and (36) in
Example 4). We will also skip application of the rules Cv and Rv producing axioms in
the ontology from the conclusions of C0 and R0 (e.g., like (33) and (35) in Example 4).
3   Avoiding Duplicate Role Compositions
In this section, we present an optimization, using which one can avoid some duplicate
conclusions of rule C◦ . Intuitively, this optimization is designed to deal more efficiently
with specific role chain axioms such as transitivity T ·T v T . It is closely related to a
similar optimization for role chain axioms presented previously for a fragment of EL+     ⊥
[13]. To illustrate the problem addressed, consider the following example.

Example 5. Consider the ontology O = {A v ∃L.B, B v ∃P.C, C v ∃P.D, L·P v
L, P ·P v P }. The roles L and P can be thought of as expressing the ‘located-in’ and
‘part-of’ relations. So the last axiom of O, in particular, expresses that if x is located
in y and y is a part of z then x is located in z. Let us try to determine whether O |=
A v B using goal-direct application of rules (skipping applications of C0 and R0 , and
applications of Cv , Rv producing axioms in O as noted before):

    A v ∃(L·P ).C       by C◦ (A v ∃L.B, L v L, B v ∃P.C, P v P ),                     (39)
    A v ∃(L·P ).D       by C◦ (A v ∃(L·P ).C, L·P v L, C v ∃P.D, P v P ),              (40)
    B v ∃(P ·P ).D      by C◦ (B v ∃P.C, P v P , C v ∃P.D, P v P ),                    (41)
    A v ∃(L·P ).D       by C◦ (A v ∃L.B, L v L, B v ∃(P ·P ).D, P ·P v P ).            (42)

Note that the axiom A v ∃(L·P ).D was derived two times by C0 in (40) and (42).
Intuitively, this is because the role chain inclusion L·P ·P v L·P can be proved in two
ways: either as (L·P )·P v L·P using L·P v L or as L·(P ·P ) v L·P using P ·P v P .

    In general, suppose that we have a derivation where some axiom is derived by C◦ :

    C v ∃(R·ρ).F        by C◦ (C v ∃S.D, S v R, D v ∃(T ·µ).F , (T ·µ) v ρ).           (43)

Let us try to determine when the same conclusion can be derived differently. Suppose
that µ 6= . Then D v ∃(T ·µ).F can be only derived by C◦ :

       D v ∃(T ·µ).F           by C◦ (D v ∃τ.E, τ v T , E v ∃η.F , η v µ).             (44)

Now suppose that we also have S v S, S·T v R and η v ρ in the derivation. Then
C v ∃(R·ρ).F can be derived as follows:

    C v ∃(S·T ).E        by C◦ (C v ∃S.D, S v S, D v ∃τ.E, τ v T ),                    (45)
    C v ∃(R·ρ).F         by C◦ (C v ∃(S·T ).E, S·T v R, E v ∃η.F , η v ρ).             (46)

     Note that when we replace the rule application (43) with the two rule applications
(45) and (46), the third premises D v ∃τ.E and E v ∃η.F used in these applications
appear in the derivation before the third premise D v ∃(T ·µ).F of (43) since the
latter was obtained from the former by (44). Therefore, if we apply this transformation
repeatedly to all inferences by C◦ , it will always terminate.
     To summarize, we can always avoid applying C◦ with S v R and T ·µ v ρ as
the second and the last premises if µ 6=  and we can derive S·T v R (S v S is
always derivable by R0 ) and τ v ρ for every derivable τ v µ. Due to the subformula
property, we can precompute all subsumptions on role chains occurring in the ontology
and compute all pairs hρ1 v R, ρ2 v ρi of such role subsumptions with ρ1 , ρ2 and R·ρ
occurring in the ontology, excluding the pairs hS v R, T ·µ v ρi for which the above
condition holds. Only the remaining pairs of subsumptions should then be used in C◦ .

Example 6. Continuing Example 5, we can show that the above conditions hold for the
pair hS v R, T ·µ v ρi = hL v L, P ·P v P i. Indeed, S·T v R = L·P v L can
be derived, and since µ = ρ = P , τ v ρ is derivable if τ v µ is. Thus, the pair
hL v L, P ·P v P i should not be used in C◦ , and thus inference (42) is not necessary.
One can show that the pair hP v P, P ·P v P i should not be used in C◦ as well.

    As mentioned, there is a close relation of the optimization above with a similar
optimization for rules in Figure 1 [13]. The main idea is to identify the role chain axioms
ρ v R for which rule E◦ can be applied in a left-linear way, that is, only if all premises
starting from the second one are derived by rules other than E◦ with k ≥ 2. This is the
case, e.g., for both axioms L·P v L and P ·P v P from ontology O in Example 5.
For example, rule E◦ using L·P v L ∈ O would be applied for A v ∃L.B and
B v ∃P.C (derived by Ev ), but would not be applied for A v ∃L.B and B v ∃P.D
if B v ∃P.D is derived by E◦ from B v ∃P.C and C v ∃P.D using P ·P v P ∈ O.
The reason is that the same conclusion A v ∃L.D can be derived in a left-linear way
from A v ∃L.C (derived by E◦ using L·P v L ∈ O) and C v ∃P.D (derived by Ev ).
The main difference between the two optimizations, is that, due to the differences in the
rules in Figure 1 and Figure 2, instead of classifying which stated role chain axioms
can be used in a left-linear way, we determine which derived role chain axioms should
be ‘concatenated’ in rule C◦ . The latter is algorithmically easier to determine using the
condition formulated above.


4   Deterministic Saturation
Recall from Example 4 that the set of axioms obtained by applying the rules in a goal-
directed way may depend on the order in which the rules are applied. Although this
side effect has no impact on reasoning results, it introduces some difficulties when ex-
tending ELK reasoning services beyond checking of logical entailment. Specifically,
the procedures for incremental reasoning [8] and proof generation [9] implemented in
ELK, require repeating some rule applications performed in the derivation, and if the
rules are applied in a different order than originally, the procedures may result in incor-
rect results. In this section we describe a modification of our rule application procedure
for which the derived axioms do not depend on the order of rule application.
     Recall from Section 2.3 that to determine whether a rule such as R∃ should be
applied to some axioms in the derivation, i.e., it is not redundant, one has to check which
of these axioms are composed, i.e., can be derived by certain rules from the previous
axioms in the derivation. This property ensures that only necessary rules are applied, but
it can make rule application dependent on when (i.e, after which axioms) the premises
were derived. Instead, we may slightly relax this requirement and decide whether an
axiom is composed or not only based on the rules by which it was actually derived,
which would make rule applications not to depend on other axioms in the derivation.
                 CvA                                      CvD
            C−
             ≡       :A≡D∈O                          C+
                                                      ≡       :A≡D∈O
                 CvD                                      CvA

                   Fig. 3. The new inference rules for concept definitions


For example, axiom A v ∃R.C in Example 4 was derived by both rules C∃ and Cv .
We would then not apply C∃ to the first conclusion of a ‘composition’ inference by C∃ ,
but apply it to the second conclusion of ‘non-composition’ inference by Cv . Clearly,
the advantage here is that it does not matter which of the two inferences was made
first. It may seem that axioms are rarely derived by multiple rules, so the relaxed rule
application strategy might not result in too many unnecessary inferences. The following
example illustrates that this may not be really the case in practice.
Example 7. Consider the ontology O = {A v B u ∃R.A, C ≡ ∃R.B}. Recall, that
concept equivalence C ≡ ∃R.B represents two axioms C v ∃R.B and ∃R.B v C. To
test O |= A v C, we apply the rules in Figure 2 in a goal-directed way:

                 AvB                   by C−
                                           u (A v B u ∃R.A),                       (47)
                 A v ∃R.A              by C−
                                           u (A v B u ∃R.A),                       (48)
          (+) A v ∃R.B                 by C∃ (A v ∃R.A, R v R, A v B),             (49)
                 AvC                   by Cv (A v ∃R.B) : ∃R.B v C ∈ O,            (50)
                 A v ∃R.B              by Cv (A v C) : C v ∃R.B.                   (51)

Note that axiom A v ∃R.B is derived by rules C∃ and Cv , so we would need to
consider the second conclusion (51) for applications of C∃ . Note that the second rule
application is a direct result of the equivalence axiom C ≡ ∃R.B, which was used to
replace the subsumer ∃R.B in (49) with C and back with ∃R.B. So it is actually not
possible to have application (51) before (49).
    The scenario illustrated in Example 7 is rather common: whenever an axiom C v D
is derived and D occurs in some concept equivalence in the ontology, the same axiom
C v D will be derived again. To avoid such duplicate inferences, we introduce new
rules in Figure 3 to deal specifically with concept definitions—concept equivalences
A ≡ D where A is a (defined) atomic concept. Most concept equivalences in existing
ontologies are of this form. We assume that all concept equivalences in O are concept
definitions and each atomic concept A is defined in at most one of them; remaining con-
cept equivalences can be always replaced with concept inclusions. Finally, we extend
Definition 1 by allowing composed axioms to be obtained by rule C+      ≡ and redundant
rule applications to include applications by rule C−≡  in which the premise is composed.
Note that if the premise of C−  ≡ is composed,  then  it can only be obtained  by C+
                                                                                   ≡ us-
ing the same concept definition A ≡ D, and hence the conclusion of this rule must be
already derived.
Example 8. Continuing Example 7, with the new rules in Figure 3 we will have just rule
application (52) instead of (50)–(51); the application of rule C−
                                                                ≡ to (52) is redundant.

      (+) A v C                       by C+
                                          ≡ (A v ∃R.B) : C ≡ ∃R.B ∈ O.             (52)
Table 1. Summary information for the test ontologies (numbers of the different kinds of axioms)

          Ontology                  C v D C ≡ D R v S R · R v R R1 · R2 v S
          EL-GALEN              25,563 9,968            958          58             0
          GALEN7                27,820 15,270          1000           0           385
          GALEN8                53,449 113,622         1024           0           385
          SNOMED CT (Jan 2014) 229,330 69,908            11           0             1
          ANATOMY               17,551 21,831             4           3             2


5       Preliminary Experimental Evaluation

In this section we present the preliminary results of empirical evaluation of the two
new reasoning techniques described in Sections 3 and 4: the role composition opti-
mization and the deterministic saturation using new rules for concept definitions. For
both experiments we use a set of the well-known biomedical ontologies:1 the July 2014
version of SNOMED CT,2 three versions of OpenGALEN3 (EL-GALEN, GALEN7,
and GALEN8), and ANATOMY (an experimental version of SNOMED CT which uses
role chain axioms to model the body structure). These ontologies have been frequently
used in the past for evaluation of EL reasoners [14, 15, 1]. The summary information
about these ontologies is presented in Table 1.
     For experiments we used a development version of ELK 0.5. The experimental setup
is the same for all experiments: each ontology is classified 20 times, 10 warm-up runs
to exclude the effects of JIT compilation and 10 measured runs, for which the results
are averaged. The combined loading + classification wall clock time (in ms.) is used as
the main performance metric. We used a PC with Intel Core i5-2520M 2.50GHz CPU,
Java 1.6 and 4GB RAM available to JVM.
     The first experiment evaluates effectiveness of the role composition optimization
described in Section 3. All ontologies (except for SNOMED CT in which the only sub-
role chain axiom has no effect) are classified with the optimization being turned on and
off. The results are shown in Table 2. It can be seen that in most cases the optimization
considerably reduces the number of inferences as well the number of derived subsump-
tions (many subsumptions are derived by several inferences). The difference translates
into time savings. The ANATOMY ontology stands out as the case where the optimiza-
tion is critically important since it cuts down the number of inferences by rule C◦ by
nearly an order of magnitude.
     The aim of the the second experiment is to evaluate the effectiveness of the deter-
ministic saturation optimization described in Section 4. We compare the classification
time and the number of derived axioms in three cases: a) with the ELK’ current non-
deterministic saturation [1], b) with deterministic saturation using the rules in Figure 2,
and c) with deterministic saturation using the additional rules for concept definitions
(see Figure 3). The results are shown in Table 3.
    1
     Unless specified otherwise, the ontologies can be downloaded from the ELK project page
elk.semanticweb.org
   2
     http://www.ihtsdo.org/licensing/
   3
     http://www.opengalen.org/sources/sources.html
               Table 2. Evaluation of the role composition optimization from Section 3

        Ontology    Classification time          Inferences         Derived subsumptions
        Optimization On         Off            On         Off          On         Off
        EL-GALEN    964             1,039 2,080,194 2,207,823 1,485,247 1,550,695
        GALEN7    1,632             1,998 5,707,082 7,001,796 2,787,261 2,952,952
        GALEN8   15,587            16,981 40,239,327 49,719,156 19,377,172 20,136,178
        ANATOMY 8,957              27,766 45,466,892 176,130,924 7,105,923 10,223,484

Table 3. Evaluation of deterministic saturation (see Section 3). The shortcuts “non-det”, “det”,
and “det+defn” stand for non-deterministic saturation, unoptimized deterministic saturation, de-
terministic saturation with the new optimized rules for handling of concept definitions.

    Ontology           Classification time     Inferences      Derived subsumptions
                    non-det det det+defn non-det det det+defn non-det det det+defn
    EL-GALEN     959 1,513             1,032 2.1M 4.2M           2.3M 1.5M 2.7M           1.9M
    GALEN7     1,984 3,408             2,699 5.7M 11.6M          9.9M 2.8M 5.3M           4.7M
    GALEN8    16,750 27,339           19,866 40.2M 93.7M        65.3M 19.4M 37.2M        28.4M
    SNOMED CT 14,441 21,791           15,431 25.5M 54.1M        31.2M 17.1M 30.3M        23.6M
    ANATOMY    9,183 15,546            9,875 45.5M 73.0M        47.2M 7.1M 12.1M          8.3M


    One can see that deterministic saturation without further optimizations is signifi-
cantly slower and often makes nearly twice as many inferences as non-deterministic
saturation. This is largely because many axioms are derived by multiple inferences due
to equivalence axioms (as illustrated in Example 3). The special rules to deal with con-
cept definitions reduce such redundant derivations and improve performance so that it
is close to that of non-deterministic saturation. Still in some cases, e.g., for GALEN7
and GALEN8, the difference between non-deterministic and optimized deterministic
strategies is visible and it remains our goal to investigate how it can be reduced even
further.


6     Summary
The paper presented several recent developments in ELK which range from novel rea-
soning optimizations, such as efficient handling of role chain axioms, to modifications
aimed at supporting additional reasoning services, such as proof-based explanations.
Our experiments show that the latter changes may result in minor performance setbacks,
and it remains our future goal to investigate how to avoid even such minor performance
compromises.


References
 1. Kazakov, Y., Krötzsch, M., Simančík, F.: The incredible ELK: From polynomial procedures
    to efficient reasoning with EL ontologies. J. of Automated Reasoning 53(1) (2014) 1–61
 2. Harris, M.A., Lock, A., Bühler, J., Oliver, S.G., Wood, V.: FYPO: the fission yeast phenotype
    ontology. Bioinformatics 29(13) (2013) 1671–1678
 3. Hoehndorf, R., Dumontier, M., Gkoutos, G.V.: Identifying aberrant pathways through inte-
    grated analysis of knowledge in pharmacogenomics. Bioinformatics 28(16) (2012) 2169–
    2175
 4. Hoehndorf, R., Harris, M.A., Herre, H., Rustici, G., Gkoutos, G.V.: Semantic integration of
    physiology phenotypes with an application to the cellular phenotype ontology. Bioinformat-
    ics 28(13) (2012) 1783–1789
 5. Jupp, S., Stevens, R., Hoehndorf, R.: Logical gene ontology annotations (GOAL): exploring
    gene ontology annotations with OWL. J. of Biomedical Semantics 3(Suppl 1)(S3) (2012)
    1–16
 6. Osumi-Sutherland, D., Reeve, S., Mungall, C.J., Neuhaus, F., Ruttenberg, A., Jefferis,
    G.S.X.E., Armstrong, J.D.: A strategy for building neuroanatomy ontologies. Bioinformatics
    28(9) (2012) 1262–1269
 7. The Gene Ontology Consortium: Gene ontology annotations and resources. Nucleic Acids
    Res (2012)
 8. Kazakov, Y., Klinov, P.: Incremental reasoning in OWL EL without bookkeeping. In: The
    Semantic Web - ISWC 2013 - 12th International Semantic Web Conference, Sydney, NSW,
    Australia, October 21-25, 2013, Proceedings, Part I. (2013) 232–247
 9. Kazakov, Y., Klinov, P.: Goal-directed tracing of inferences in EL ontologies. In: The
    Semantic Web - ISWC 2014 - 13th International Semantic Web Conference, Riva del Garda,
    Italy, October 19-23, 2014. Proceedings, Part II. (2014) 196–211
10. Kalyanpur, A., Parsia, B., Horridge, M., Sirin, E.: Finding all justifications of OWL DL
    entailments. In Aberer, K., Choi, K.S., Noy, N., Allemang, D., Lee, K.I., Nixon, L., Golbeck,
    J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P., eds.: Proc. 6th
    Int. Semantic Web Conf. (ISWC’07). Volume 4825 of LNCS., Springer (2007) 267–280
11. Kazakov, Y., Klinov, P.: Advancing ELK: Not only perormance matters. Technical report,
    University of Ulm (2015) available from http://http://elk.semanticweb.org/
    publications/elk-advancing-trdl-2015.pdf.
12. Baader, F., Brandt, S., Lutz, C.: Pushing the EL envelope. In Kaelbling, L., Saffiotti, A., eds.:
    Proc. 19th Int. Joint Conf. on Artificial Intelligence (IJCAI’05), Professional Book Center
    (2005) 364–369
13. Kazakov, Y., Krötzsch, M., Simančík, F.: Unchain my EL reasoner. In Rosati, R., Rudolph,
    S., Zakharyaschev, M., eds.: Proc. 24th Int. Workshop on Description Logics (DL’11). Vol-
    ume 745 of CEUR Workshop Proceedings., CEUR-WS.org (2011) 202–212
14. Baader, F., Lutz, C., Suntisrivaraporn, B.: Efficient reasoning in EL+ . In Parsia, B., Sattler,
    U., Toman, D., eds.: Proc. 19th Int. Workshop on Description Logics (DL’06). Volume 189
    of CEUR Workshop Proceedings., CEUR-WS.org (2006)
15. Lawley, M.J., Bousquet, C.: Fast classification in Protégé: Snorocket as an OWL 2 EL
    reasoner. In: Proc. 6th Australasian Ontology Workshop (IAOA’10). (2010) 45–49

</pre>