<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Diversity of Reason: Equivalence Relations over Description Logic Explanations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Samantha Bail</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ulrike Sattler</string-name>
          <email>sattler@cs.man.ac.ukg</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>The University of Manchester Oxford Road</institution>
          ,
          <addr-line>Manchester, M13 9PL</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Given the high expressivity of modern ontology languages, such as OWL, there is the possibility for great diversity in the logical content of ontologies. Informally, this can be seen by the constant evolution of reasoners to deal with new sorts of content and the range of optimisations reasoners need in order to be competitive. More formally, the fact that many naturally occurring entailments have multiple justications (i.e., minimal entailing subsets) indicates that ontologies often overdetermine their consequences, indicating a diversity in supporting reasons. However, the multiplicity of justi cations might be due mostly to diverse material, not formal, grounds for an entailment. That is, the logical form of these multiple reasons could be less diverse than their numbers suggest. In the present paper, we introduce and explore several equivalence relations over justi cations for entailments of OWL ontologies. These equivalence relations range from strict isomorphism to a looser notions which cover similarities between justi cations containing di erent concept expressions or possibly di erent numbers of axioms. We survey a corpus of ontologies from the bio-medical domain and nd that large numbers of justi cations can often be reduced to a signi cantly smaller set of justi cations which are isomorphic with respect to one of the given de nitions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Since its standardisation by the W3C in 2004, the Web Ontology Language
OWL has been used to represent domain knowledge from diverse areas, such
as medicine and general biology. OWL 21 is based on the highly expressive
Description Logic SROIQ [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] as underlying formalism: An OWL 2 ontology
corresponds to a set of SROIQ axioms.
      </p>
      <p>Justi cations, minimal entailing subsets of an OWL ontology, provide helpful
and easy-to-understand explanation support when repairing unwanted
entailments in the ontology debugging process. While previous research has focused
on the issue of making individual justi cations easier to understand, only little
attention has been paid to the occurrence of multiple justi cations other than</p>
    </sec>
    <sec id="sec-2">
      <title>1 http://www.w3.org/TR/owl2-syntax</title>
      <p>the computational problems they often imply. An entailment of an OWL
ontology can have a large number of justi cations (potentially exponential in the
number of axioms in the ontology), with up to several thousands found in large
real-life ontologies.</p>
      <p>When encountering justi cations for a nite set of entailments of an ontology
(e.g. the set of entailed atomic subsumptions), we are often faced with a
seemingly large and diverse body of reasons why these entailments hold. On closer
inspection, however, we frequently nd that multiple justi cations for di erent
entailments are identical sets of axioms, or that justi cations are structurally
identical and only diverge in the concept, role, and individual names they use.</p>
      <p>In order to draw a clearer picture of the logical diversity of a corpus of
justications, we need to take into account these similarities between justi cations.
Grouping the set of justi cations into subsets according to some similarity
measure helps structuring the unsorted pool of justi cations. Rather than trying to
understand each individual justi cation, users can focus on understanding the
template of a particular subset of justi cations. This can lead to signi cantly
fewer justi cations that the user has to deal with, and therefore to a reduction
in user e ort.</p>
      <p>A well-known syntactical equivalence relation in OWL is structural
equivalence. The OWL Structural Speci cation2 states the condition for two OWL
objects (named concepts, roles, or instances, complex expressions, or OWL axioms)
to be structurally equivalent. In short, it de nes the objects to be equivalent if
they contain the same names and constructors, regardless of ordering and
repetition (in an unordered association). The OWL API,3 a Java API which is used
to manipulate OWL ontologies, implements this notion of structural equivalence
by default.</p>
      <p>
        Justi cation isomorphism [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] was rst introduced in a study of the cognitive
complexity of justi cations: Two justi cations are isomorphic if they are
structurally identical,4 i.e. the axioms contain the same concept expressions and only
di er in the concept, role and individual names.
      </p>
      <p>While justi cation isomorphism helps to eliminate the e ects of diverging
concept, role, and individual names, we can also identify types of justi cations
which may be considered to be very similar despite their use of di erent
constructors:</p>
      <sec id="sec-2-1">
        <title>Example 1</title>
        <p>J1 = fA v B u C; B u C v Dg j= A v D</p>
        <p>J2 = fA v 9r:C; 9r:C v Dg j= A v D
In this example, the semantics of the complex concept expressions BuC in J1 and
9r:C in J2 are not relevant for the respective entailment; their occurrences in the
2 http://www.w3.org/TR/owl2-syntax
3 http://owlapi.sourceforge.net
4 Modulo structural equivalence, which takes into account redundant expressions and
the commutativity of constructors.
justi cations and their entailments can be replaced by freshly generated atomic
concept names without a ecting the entailment relation. Such a substitution in
turn would make the two justi cations isomorphic.</p>
        <p>Likewise, justi cations of di erent lengths may be considered similar if their
general structure of reasoning is identical:</p>
      </sec>
      <sec id="sec-2-2">
        <title>Example 2</title>
        <p>J1 = fA v B; B v Cg j= A v C</p>
        <p>J2 = fA v B; B v C; C v Dg j= A v D</p>
        <p>These two justi cations clearly require the same form of reasoning from a
user. Strict isomorphism only applies to justi cations which contain the same
number of axioms; it does not cover situations like the above. However, for the
purpose of structuring sets of justi cations and analysing the logical diversity of
a corpus of justi cations, capturing those kinds of similarities illustrated in the
above examples would be highly desirable.</p>
        <p>These examples motivate a looser notion of isomorphism, which allows us to
identify justi cations as equivalent if they require the same reasoning
mechanisms, regardless of size, signature and logical constructors used. In the present
paper, we introduce two new types of equivalence relations based on matching
subexpressions and lemmas, and show how these extended relations are applied
to a corpus of justi cations from the bio-medical domain.
2</p>
        <sec id="sec-2-2-1">
          <title>Preliminaries</title>
          <p>Justi cations in OWL We assume the reader to be familiar with OWL and the
underlying Description Logic SROIQ. In what follows, A; B; : : : denote concept
names in an ontology O, r; s role names, and sig( ) denotes the set of concept,
role, and individual names in an OWL axiom .</p>
          <p>
            Justi cations [
            <xref ref-type="bibr" rid="ref11 ref13">13,11</xref>
            ] are a popular type of explanation for entailments of
OWL ontologies. A justi cation is de ned as a minimal subset of an ontology O
that causes an entailment to hold:
De nition 1 (Justi cation) J is a justi cation for O j=
and, for all J 0 J , it holds that J 0 2 .
if J
          </p>
          <p>O; J j=</p>
          <p>For every axiom which is asserted in the ontology, the axiom itself naturally
is a justi cation. We call an entailment which has only itself as a justi cation a
self-supporting entailment, and the justi cation a self-justi cation.</p>
          <p>It is important to note that a justi cation is always de ned with respect
to an entailment . In the remainder of this paper we will therefore use the
term justi cation to describe a justi cation-entailment pair (J , ) where J is a
justi cation for .</p>
          <p>
            Justi cation Isomorphism Isomorphism between justi cations was rst
introduced as a method to reduce the number of similar justi cations when
sampling from a large corpus to justi cations of distinct types [
            <xref ref-type="bibr" rid="ref5">5</xref>
            ].
          </p>
          <p>De nition 2 (Justi cation Isomorphism) Two justi cations (J1; 1), (J2; 2)
are isomorphic ((J1; 1) i (J1; 1)) if there exists an injective renaming
which maps concept, role, and individual names in J1 and 1 to concept, role,
and individual names in J2 and 2, respectively, such that (J1) = J2 and
( 1) = 2.</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>Example 3 (Isomorphic Justi cations)</title>
        <p>J1 = fA v B u 9r:C; B u 9r:C v Dg
J2 = fE v B u 9s:F; B u 9s:F v Dg
= fA 7! E; C 7! F; r 7! sg
j= A v D
j= E v D</p>
        <p>The relation i is symmetric, re exive and transitive, from which it
follows that i is an equivalence relation and thus partitions a set of justi cations.
Justi cation isomorphism preserves the numbers and types of axioms and
subexpressions in the justi cations:
1. J1 i J2 ! jJ1j = jJ2j
2. J1 i J2 ! jsig(J1)j = jsig(J2)j
3. The sets of logical constructs used in J1 and J2 coincide.</p>
        <p>In the remainder of this paper, we may refer to the isomorphism de ned above as
strict isomorphism in order to distinguish it from the other equivalence relations.
3</p>
        <sec id="sec-2-3-1">
          <title>Subexpression-Isomorphism</title>
          <p>From the above de nition of isomorphism it follows that only justi cations which
have the same number and types of axioms and subexpressions can be
isomorphic. It is easy to see, however, that justi cations can have a similar structure
despite their use of di erent concept expressions, as demonstrated in Example
1. This motivates a notion of isomorphism which allows not only the mapping
of concept names, but also that of subexpressions.</p>
          <p>
            The idea of nding similarities between concepts in Description Logics has
been widely explored in the work on uni cation and matching, e.g. [
            <xref ref-type="bibr" rid="ref1 ref2 ref3">2,1,3</xref>
            ], for
the purpose of detecting redundant concept descriptions in knowledge bases.
The aim of uni cation is to nd a suitable substitution which maps atomic
concepts in a concept term C to (possibly non-atomic) concepts in a concept
term D such that the two terms are made equivalent.
          </p>
          <p>While the basic idea behind extended subexpression-isomorphism is based on
uni cation and matching, the concepts are not directly applicable to the given
problem of matching justi cations. We therefore introduce a unifying justi
cation J , which functions as the template for the isomorphic justi cations:
De nition 3 (Subexpression-Isomorphism) Two justi cations (J1; 1),
(J2; 2) are s-isomorphic ((J1; 1) s (J2; 2)) if there exists a justi cation
(J ; ) and two injective substitutions 1; 2, such that 1(J ) = J1, 2(J ) = J2,
1( ) = 1, and 2( ) = 2. The mappings 1 and 2 map concept, role, and
individual names in (J ; ) to subexpressions of (J1, 1) and (J2; 2), respectively.</p>
          <p>It can be shown that the relation s is re exive, transitive and symmetric;
it is therefore an equivalence relation and thus partitions a set of justi cations.</p>
          <p>The substitutions 1; 2 are injective, but not surjective, as the set of
subexpressions in the justi cations J1 and J2 can be of higher arity than the set of
concept names in the unifying justi cation J (unless the justi cations
themselves contain no complex expressions).</p>
          <p>S-isomorphism can easily be shown to be a more general case of strict
isomorphism between two justi cations J1 and J2: The unifying justi cation J is set
to J1, 1 is the identity relation id which maps J1 to itself, and 2 corresponds
to the mapping , which maps concept, role, and individual names in J2 to the
respective names in J1. It follows that J1 i J2 ! J1 s J2.</p>
          <p>In order to be s-isomorphic, the justi cations may di er in the number of
subexpressions. They must, however, have the same number of axioms: J1 s
J2 ! jJ1j = jJ2j</p>
          <p>While we focus on entailed atomic subsumptions in the present paper, we
point out that s-isomorphism between justi cations is not restricted to a speci c
axiom type as entailment, as the substitutions 1; 2 preserve all entailment
relations regardless of the axiom type and constructor usage.
4</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>Lemma-Isomorphism</title>
          <p>
            While s-isomorphism covers a number of justi cations that can be regarded
as equivalent due to them requiring the same type of reasoning to reach the
entailment, it only applies to justi cations which have the same number of
axioms. This does not take into account cases where the justi cations di er only
marginally in some subset, but where the general reasoning may be regarded as
similar nonetheless. We therefore introduce the notion of lemma-isomorphism,
which extends subexpression-isomorphism with the substitution of subsets of
justi cations through intermediate entailments, so-called lemmas [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ]. The
general motivation behind lemma-isomorphism is demonstrated by the following
example:
          </p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>Example 4</title>
        <p>J1 = fA v B; B v Cg
J2 = fA v B; B v C; C v Dg
j= A v C
j= A v D</p>
        <p>It is straightforward to see that both J1 and J2 require the same type of
reasoning from a human user. As the justi cations only di er in the length of
the atomic subsumption chains that lead to the entailment, we can certainly
consider them to be similar with respect to some similarity measure. However, the
two justi cations are not considered isomorphic with respect to the de nitions
for strict isomorphism or subexpression-isomorphism. We therefore introduce
a new type of isomorphism which takes into account the fact that subsets of
justi cations can be replaced with intermediate entailments which follow from
them.</p>
        <p>
          Lemmas of OWL justi cations have previously found use in the extension of
justi cations to justi cation-oriented proofs [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. The following de nitions
introduce simpli ed variants of the de nitions [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] of justi cation lemmas and
lemmatisations. Please note that for the purpose of illustrating the e ect of
lemmaisomorphism, we will simplify the lemmatisations to a more speci c type of
lemmas in the next section.
        </p>
        <p>De nition 4 (Lemma) Let J be a justi cation for an entailment . A lemma
of (J ; ) is an axiom for which there exists a subset S J such that S j= .
A summarising lemma of (J ; ) is a lemma for which there exists an S J
such that J n S [ f g j= for S j= .</p>
        <p>De nition 5 (Lemmatisation) Let (J ; ) be a justi cation, let S1 : : : Sk be
subsets of J , and let 1 : : : k be axioms satisfying Si j= i for i 2 f1; : : : ; kg.
Then the set J := (J n S Si) [ S i is called a lemmatisation of J if J j= .</p>
        <p>If clear from the context, a lemmatisation J may also be called a lemmatised
justi cation. Given the de nitions for lemmatisations, we can now de ne
lemmaisomorphism as an extension to subexpression-isomorphism:
De nition 6 (Lemma-isomorphism) Two justi cations (J1; 1), (J2; 2) are
`-isomorphic ((J1; ) ` (J2; )) if there exist lemmatisations J1 1 ; J2 2 which
are s-isomorphic: J1 1 s J2 2 .</p>
        <p>Lemma-isomorphism using arbitrary lemmas as de ned above carries some
undesirable properties: First, unlike the previously de ned relations, it describes
a relation which is not transitive. Further, the lemmatised justi cations might
di er strongly from the original justi cations; in the most extreme case, the
lemmatisation of a justi cation can be the entailment itself. We therefore have to
introduce some constraints on the admissible lemmatisations in order to
guarantee the transitivity of the isomorphism relation, as well as preserve the nature
of the original justi cations. In what follows we will focus on lemmatisations
through obvious steps in justi cations.
4.1</p>
      </sec>
      <sec id="sec-2-5">
        <title>Lemmatisations and Obvious Steps</title>
        <p>
          The notion of obvious proof steps [
          <xref ref-type="bibr" rid="ref12 ref4">4,12</xref>
          ] describes how proof steps which are
intuitively obvious can be replaced with their conclusion (thereby shortening
the proof) without omitting important information. We loosely base the lemma
restriction on this obviousness and select an example of obvious proof steps which
commonly occur in DL justi cations, namely chains of atomic subsumptions.
        </p>
        <p>In atomic subsumption chains of the type A0 v A1; A1 v A2 : : : An 1 v An
only the relation between the subconcept A0 in the rst axiom and the
superconcept An in the last axiom are relevant for understanding the subsumption chain;
i.e. the step from the subconcept to the nal superconcept is obvious. We can
say that it is only important to understand that there is a connection between
the subconcept and the nal superconcept, but we do not need to know what
this connection is. Therefore, it seems reasonable to substitute the chain with its
conclusion in the form of a single axiom A0 v An. Restricting lemmatisations to
atomic subsumption chains leads to the de nition for a transitivity-preserving
type of l-isomorphism:</p>
      </sec>
      <sec id="sec-2-6">
        <title>De nition 7 (Transitivity-preserving l-isomorphism) Two justi cations</title>
        <p>(J1; 1), (J2; 2) are `t-isomorphic ((J1; ) `t (J2; )) if there exist
summarising lemmatisations J1 1 ; J2 2 which are s-isomorphic, and every Si Ji where
Si j= i 2 1( i 2 2, respectively) is of type fAi v Ai+1 j i 2 f0; : : : ; ngg where
n is the number of axioms in the respective chain of subsumption axioms, and
Ai a concept name.</p>
        <p>Atomic subsumption chains represent only one of many examples of such
lemmatisations which preserve transitivity. For the purpose of introducing
lemmaisomorphism as an equivalence relation, it su ces to focus on this particular
type of lemmatisations, as it captures a frequently occurring pattern in OWL
justi cations.
5
5.1</p>
      </sec>
      <sec id="sec-2-7">
        <title>Test Corpus</title>
        <sec id="sec-2-7-1">
          <title>Diversity of Reason in the NCBO BioPortal Ontologies</title>
          <p>We performed a survey of equivalence relations in OWL- and OBO-ontologies
from the NCBO BioPortal. The BioPortal provides ontologies from various
groups from the biomedical domain, including the full set of daily updated OBO
Foundry5 ontologies, which are built based on common design principles. OBO
ontologies use a at- le format, which can be translated into OWL 2 and were
therefore included in the test corpus.</p>
          <p>At the time of downloading (January 2012), the BioPortal listed 278
OWLand OBO-ontologies, of which 241 could be downloaded, merged with their
imports, and serialised as OWL/XML. 15 of those ontologies could not be processed
in the given time frame of 30 minutes using the selected reasoner, and another 25
did not contain any relevant entailments (direct subsumptions between named
classes). For the remaining 201 ontologies, we computed justi cations for all
entailments with a maximum of 500 justi cations per entailment. Self-supporting
entailments and self-justi cations were excluded from the survey, which led to
the discarding of further ontologies.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>5 http://obofoundry.org</title>
      <p>The nal corpus of justi cations consisted of 6,744 justi cations from 83
ontologies, covering a broad spectrum of sizes and complexity. Half of the ontologies
had less than 1000 named concepts and axioms, with the other half reaching a
maximum of 13,959 concepts and 70,015 axioms. Likewise, the expressivity of
the corpus ranged from AL to several highly expressive samples in SROIQ,
which corresponds to the OWL 2 language. A detailed listing of the surveyed
ontologies, as well the complete data from the study is available online.6
5.2</p>
      <sec id="sec-3-1">
        <title>Implementation</title>
        <p>The algorithm for detecting structural similarities between justi cations (and
therefore, OWL axioms) uses a purely syntactic search for matching
subexpressions of OWL axioms. The axioms are rst parsed into a tree form, which allows
for pairwise comparison of the nodes which represent connectives as well as
concept and role names. The implementation uses the OWL API (v3.2.4.) and the
HermiT (v1.3.6.) reasoner. Our experiments were performed on a 3GHz Intel
Core2 Duo machine with 4GB of RAM allocated to the Java Virtual Machine.
The 7051 justi cations in the test corpus could be analysed for strict
isomorphism in approximately 3 minutes.
5.3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Results of the Survey</title>
        <p>Isomorphic Justi cations Strict isomorphism applied to all justi cations of
the individual ontologies drastically reduces the number of regular justi cations
from an average of 81.3 ( = 185:5) justi cations per ontology to 10.5 ( = 18:0)
templates for equivalent justi cations. The mean number of justi cations per
template is 7.7 ( = 41:7) , which means that in each ontology nearly 8 justi
cations have an identical structure. This e ect is highly visible in the Orphanet
Ontology of Rare Diseases, where the a single template covers 901 (of 1139)
justi cations for distinct entailments. These justi cations are all of the type
fgen x v (9geneOf:pat y); Domain(geneOf; gen id 1)g j= gen x v gen id 1
where x and y denote identi ers used in the ontology.</p>
        <p>Likewise, in the Cognitive Atlas ontology, all 401 justi cations for a single
entailment are reduced to only one template. Intuitively, it seems that we can
observe a much stronger reduction on a large set of justi cations than those
ontologies that produce only 1 or 2 justi cations. And indeed, there exists a
strong correlation between the number of justi cations of an ontology and the
amount of reduction (Spearman's = 0:78). However, even some of the larger
sets of justi cations are only reduced by a small proportion. The 95 justi cations
of the Gene Regulation Ontology are represented by 61 distinct templates, which
indicates a fairly diverse corpus.</p>
        <p>When applied across all justi cations from the corpus, strict isomorphism
reduces the corpus from 6,744 justi cations to only 614 templates, a reduction
6
http://owl.cs.manchester.ac.uk/research/publications/supporting-material/justiso-dl2012
to only 9.1% of the original corpus. On average, 11 justi cations share the same
template, with the most frequent template occurring 1,603 times across 18 di
erent ontologies; coincidentally, this template is of the same form as the Orphanet
Ontology described above. The most prevalent template in the corpus, based on
its occurrence in 37 distinct ontologies, is an atomic subsumption chain with 2
axioms.</p>
        <p>Subexpression-Isomorphism The reduction from strict isomorphism to
sisomorphism is clearly less drastic than the di erence between the main pool
and the non-isomorphic pool. The justi cations of the 83 ontologies are reduced
from an average of 81.3 justi cations to 8.8 templates ( = 13:1), which is a
reduction by 1.7 templates compared to strict isomorphism. An average of 9.2
justi cations ( = 46:6) in an ontology share the same template.</p>
        <p>Surprisingly, the majority of ontologies (67) does not show any di erence
between strict isomorphism and s-isomorphism. Only 2 ontologies, the Lipid
Ontology and Bleeding History Phenotype, are signi cantly a ected by s-isomorphism,
with a reduction from 118 to 13 templates (11.0%) and 32 to 14 templates
(43.8%), respectively. Recall that two justi cations are s-isomorphic, if their
different complex subexpressions can be mapped to an atomic variable name. The
signi cant reduction therefore suggests that in these two ontologies complex
expressions are frequently used in the same way as atomic concepts.</p>
        <p>Closer inspection reveals, however, that a large number of justi cations in
the Lipid Ontology consist of one axiom of the form A B u x, with the
entailment being A v B. Here, x represents a number of complex expressions of
varying nesting depth. While s-isomorphism captures these types of justi
cations as equivalent, the actual reason for their similarity lies in their identical
cores, with the remainder x being a super uous part (with respect to the given
entailment).</p>
        <p>Across the entire corpus, the number of justi cations is reduced from 6,744
to 456 templates (6.8% of the corpus), which is a further reduction compared
to strict isomorphism (614 templates). The most frequent templates in terms of
number of justi cations and prevalence across all ontologies are the same as for
strict isomorphism, with numbers only di ering slightly.</p>
        <p>Lemma-Isomorphism As with s-isomorphism, the e ects of l-isomorphism
were not as signi cant as the rst reduction through strict isomorphism. The
justi cations were further reduced to an average of 7.4 templates per
ontology ( = 11:4), with 11 justi cations per template ( = 51:5). Still, 35 of the
83 ontologies show at least a minor di erence between s-isomorphism and
lisomorphism, which indicates that they contain at least 1 atomic subsumption
chain.</p>
        <p>L-isomorphism reduces the 106 justi cations generated for the Cereal Plant
Gross Anatomy ontology to only 14 templates, compared to 29 templates for
sisomorphism. This shows that, while not very frequent, there are indeed
ontologies in which justi cations with subsumption chains of di ering lengths occur.</p>
        <p>In the Plant Ontology, l-isomorphism reduces the 74 justi cations to 12
templates, with one template capturing 32 justi cations of varying sizes. These
justi cations contain atomic subsumption chains ranging from 2 to 6 atomic
subsumption axioms, which can all be reduced to a single axiom (namely the
entailment of the subsumption chain) in the lemmatised version of the justi
cation.</p>
        <p>Across the corpus, l-isomorphism reduces the 6,744 justi cations to a mere
384 templates, which is an overall reduction of 94.3%. The e ect of
lemmaisomorphism is visible when we look at the most prevalent justi cation, an atomic
subsumption chain of size 2, which occurs in 44 (compared to previously 37)
ontologies. This chain represents all 701 atomic subsumption chains of di ering
sizes that can be found in the corpus.
5.4</p>
      </sec>
      <sec id="sec-3-3">
        <title>Counting Distinct Reasons</title>
        <p>For the purpose of detecting how many distinct types of justi cations there are
for a given set of entailments, we have seen that it is crucial to focus not on
the material form of a justi cation, but rather on the justi cation templates in
an ontology. By only considering the abstract template of a set of justi cations,
we can represent the reasoning that underlies not only one, but a whole class of
justi cations in the ontology.</p>
        <p>Surprisingly, while the newly introduced equivalence relations, s-isomorphism
and l-isomorphism, could be shown to capture some of the structural
similarities in the surveyed corpus, a large number of justi cations were indeed strictly
isomorphic. We can argue, however, that even a small reduction can be bene
cial when presenting multiple justi cations to OWL ontology developers for the
purpose of explaining entailments, as it prevents repetitive actions and gives a
higher-level view of the set of justi cations. This is made obvious in the example
of the Plant Ontology, where a large number of justi cations that di ered only
in the length of their atomic subsumption chains, could be captured by a single
template.</p>
        <p>
          Another aspect to take into account when dealing with OWL justi cations
is the super uousness of expressions, as we have seen in the case of the Lipid
Ontology. Two justi cations may be nearly identical and only di er in
expressions that are not necessary for the entailment to hold; these super uous parts
would prevent them from being isomorphic with respect to any of the above
de nitions, but it is clear that their form is the same. This situation describes
one of several types of justi cation masking [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. In order to prevent distortion
caused by masking e ects, we may want to focus on a type of justi cations which
is minimal with respect to its subexpressions.
        </p>
        <p>
          A laconic justi cation [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] is a justi cation which does not contain any
superuous parts, with every subexpression being as weak as possible. A comparison
of the equivalence relations between the regular justi cations for the above set of
ontologies and their laconic versions as part of future work will allow us to gain
further insight into the e ect of super uousness on the diversity of justi cations.
In this paper, we introduced new types of equivalence relations between OWL
justi cations, subexpression-isomorpism and lemma-isomorphism. We
demonstrated how a seemingly diverse corpus of justi cations from the NCBO
BioPortal could be reduced by over 90% to a much smaller set of non-isomorphic
justi cations. We have found that, surprisingly, most justi cations are in fact
strictly isomorphic, with only a few ontologies being a ected by the other
equivalence relations.
        </p>
        <p>Future work will involve exploring further notions of obvious proof steps in
order to extend lemma-isomorphism beyond atomic subsumption chains. We will
also consider the issue of overlapping chains, i.e. subsumption chains which lead
to non-summarising lemmas. Since the present paper demonstrates the e ects
of the di erent equivalence relations on only a subset of the justi cations from
the BioPortal corpus, we are planning a survey of the entire set of justi cations,
taking into account the di erences between laconic and non-laconic justi
cations. Finally, we aim to obtain more detailed knowledge about the application
of ontology design patterns in the surveyed ontologies, which will allow us to
investigate the relations between these design patterns and the form and frequency
of justi cation templates.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , Kusters, R.,
          <string-name>
            <surname>Borgida</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Matching in description logics</article-title>
          .
          <source>Journal of Logic and Computation</source>
          <volume>9</volume>
          (
          <issue>3</issue>
          ),
          <volume>411</volume>
          {
          <fpage>447</fpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Narendran</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Uni cation of concept terms in description logics</article-title>
          .
          <source>In: Proc. of ECAI-98</source>
          . pp.
          <volume>331</volume>
          {
          <issue>335</issue>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Brandt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Implementing matching in ALE| rst results</article-title>
          .
          <source>In: Proc. of DL-03</source>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Obvious logical inferences</article-title>
          .
          <source>In: Proc. of IJCAI-81</source>
          . pp.
          <volume>530</volume>
          {
          <issue>531</issue>
          (
          <year>1981</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bail</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>The cognitive complexity of OWL justi cations</article-title>
          .
          <source>In: Proc. of ISWC-11</source>
          . pp.
          <volume>241</volume>
          {
          <issue>256</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Laconic and precise justi cations in OWL</article-title>
          .
          <source>In: Proc. of ISWC-08</source>
          . pp.
          <volume>323</volume>
          {
          <issue>338</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Lemmas for justi cations in OWL</article-title>
          .
          <source>In: Proc. of DL</source>
          <year>2009</year>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Justi cation masking in OWL</article-title>
          .
          <source>In: Proc. of DL</source>
          <year>2010</year>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Justi cation oriented proofs in OWL</article-title>
          .
          <source>In: Proc. of ISWC-10</source>
          . pp.
          <volume>354</volume>
          {
          <issue>369</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kutz</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>The even more irresistible SROIQ</article-title>
          .
          <source>In: Proc. of KR-06</source>
          . pp.
          <volume>57</volume>
          {
          <issue>67</issue>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Debugging OWL ontologies</article-title>
          .
          <source>In: Proc. of WWW-05</source>
          . pp.
          <volume>633</volume>
          {
          <issue>640</issue>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>P.N.</given-names>
            ,
            <surname>Johnson-Laird</surname>
          </string-name>
          :
          <article-title>Mental models in cognitive science</article-title>
          .
          <source>Cognitive Science</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <volume>71</volume>
          {
          <fpage>115</fpage>
          (
          <year>1980</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cornet</surname>
          </string-name>
          , R.:
          <article-title>Non-standard reasoning services for the debugging of description logic terminologies</article-title>
          .
          <source>In: Proc. of IJCAI-03</source>
          . pp.
          <volume>355</volume>
          {
          <issue>362</issue>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>