<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Facilitating the Analysis of Ontology Di erences</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rafael S. Goncalves</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijan Parsia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ulrike Sattler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computer Science, University of Manchester</institution>
          ,
          <addr-line>Manchester</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The analysis of changes between OWL ontologies (in the form of a di ) is an important service for ontology engineering. A purely syntactic analysis of changes is insu cient to distinguish between changes that have logical impact and those that do not. The current state of the art in semantic di ng ignores logically ine ectual changes and lacks any further characterisation of even signi cant changes. We present and demonstrate a di method based on an exhaustive categorisation of e ectual and ine ectual changes between ontologies. In order to verify the applicability of our approach we apply it to 88 versions of the National Cancer Institute (NCI) Thesaurus (NCIt), and 5 versions of SNOMED CT, demonstrating that all categories are realized throughout the corpus. Based on the outcome of these studies we argue that the devised categorisation of changes is helpful for ontology engineers and their understanding of changes carried out between ontologies.</p>
      </abstract>
      <kwd-group>
        <kwd>OWL</kwd>
        <kwd>Ontologies</kwd>
        <kwd>SNOMED CT</kwd>
        <kwd>NCI Thesaurus</kwd>
        <kwd>Di</kwd>
        <kwd>Ontology Evolution</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The comparison of ontologies is a valuable service whether used as a subroutine in
a version control system or to support users in understanding changes. Di erent
di methods vary in their sensitivity to changes: e.g., a di method based on
character di erences will nd that two notationally distinct serializations of the
same ontology are radically di erent. If a di method is too sensitive to irrelevant
changes then the user will be faced with determining which reported changes
are actually signi cant. On the other hand, a hard requirement is that the di
method does not miss any change of signi cance.</p>
      <p>
        The OWL 2 speci cation1 de nes a high level notion of syntactic equivalence,
so-called \structural equivalence" (and thus the associated notion of structural
di erence), which abstracts from certain neglectable changes such as the
order of axioms or concrete syntax. A di erent syntactic approach is that of an
edit-based di , wherein change records are produced within the ontology editor
being used, thereby capturing the history and intent of change, as implemented
in Swoop [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The di s mentioned so far, as well as PROMPTDIFF [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and
1 http://www.w3.org/TR/2009/CR-owl2-syntax-20090611/
Bubastis [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], do not perform any further characterisation of reported changes
(e.g., whether these have logical impact). This forces the user to analyse each
change in the di , and determine whether it a ects the set of entailments of an
ontology; thus whether the change is logically e ectual. When analysing a set of
changes it would be useful to not only distinguish between those logically e
ectual and ine ectual ones, but also to characterise reported changes according to
their impact. Certain classes of ine ectual changes are neglectable, e.g., order of
conjuncts in an axiom. However other ine ectual changes provide useful insights,
such as the introduction of redundant or rewritten axioms. Semantic di s, such
as CEX [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] or ContentCVS [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], regard all ine ectual changes as neglectable,
though in general knowing the relative proportions of e ectual and meaningful
ine ectual changes gives us a better understanding of what has changed between
ontologies. For example, if most changes are meaningfully ine ectual it might
be a sign of wasted e ort, and therefore it would be useful to know if and why
this happens. Contrariwise, while those changes might be logically ine ectual,
other tools might be sensitive to the variant syntactic forms. So on the one hand,
syntactic di s report without distinction both e ectual and ine ectual changes,
and on the other hand semantic di s do not present ine ectual changes.
      </p>
      <p>
        A major problem with the output of change sets is that the user is given a
(possibly large) set of axioms (or terms) to analyse, with no indication as to what
kind of change each of them represents. A reasonable presentation of changes
will cluster changes according to relevant properties. In this paper we discuss
and elaborate on the di method presented in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],2 referred to as intentional
di erence, which incorporates a categorisation of changes. This categorisation
attempts to capture the impact of each change (e.g., whether it is a rewrite
of another axiom). Aside from the intuitive appeal of categorising changes, our
approach recti es a problem with existing di s, in that it aligns changes with
what they are a change of. E.g., axioms in the category of `rewritings' are shown
together with the rewritten axioms.
      </p>
      <p>
        For the purpose of verifying the suitability of the approach, we collected
88 versions of the National Cancer Institute (NCI) Thesaurus (NCIt) available
in OWL, as well as 5 of the latest Systematized Nomenclature of Medicine {
Clinical Terms (SNOMED CT)3 versions, and conducted a diachronic study of
each corpus. These studies aimed at showing the computational feasibility of our
approach and con rming that the devised categories are realized throughout each
corpus. Additionally we investigate whether this di method helps us understand
the evolution of both the NCIt and SNOMED CT. Moreover we demonstrate
via walkthroughs how the categorisation can support change analysis by users:
for the rst walkthrough we use toy ontologies, while for the second we use two
versions of the NCIt.
2 In addition to material in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we carry out a cognitive walkthrough of a particular
di instance, discuss tool implementation, and provide a study of the Systematized
Nomenclature of Medicine { Clinical Terms (SNOMED CT).
3 http://www.ihtsdo.org/index.php?id=545
      </p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>
        We assume the reader to be reasonably familiar with ontologies and OWL, as
well as the underlying description logics (DLs) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], though detailed knowledge
is not required. We do use the notion of entailment [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which is identical to
the standard rst order logic entailment (an axiom entailed by an ontology
O is denoted O j= ). The signature of an ontology O is denoted Oe. The di
categories discussed in the paper are de ned in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], though the categories will be
explained by means of examples.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Ontology Di ng</title>
      <p>The problem of computing the di erence between pairs of ontologies has been
approached both syntactically and semantically. We distinguish two major
aspects of ontology di ng: (i) the detection of changes, and (ii) the presentation
of changes to the user. We note that most e ort has been largely dedicated to
(i). It is often the case that the output (ii) of di operations is the set of axioms
or terms in the di . While this may re ect the desired detection of change, it
does not necessarily convey su cient information to the user w.r.t. the impact
of changes.
3.1</p>
      <sec id="sec-3-1">
        <title>Related Work and Di</title>
      </sec>
      <sec id="sec-3-2">
        <title>Desiderata</title>
        <p>Within the detection of changes (i), one would expect a preliminary distinction
of axioms in the di according to their logical e ectuality. As such, a purely
syntactic change analysis does not su ce to achieve this. Standard semantic di
tools treat all ine ectual changes as neglectable. This goes too far, for example:
consider the case where two ontologies di er in a substantial number of axioms,
but the axioms in the di are only equivalences rewritten into subsumptions.
Semantic di s would point out that there are no di erences, which may seem
counter-intuitive to the user since a shallow inspection of the ontologies would
reveal a discrepancy in number of axioms. In this case the ine ectual changes
point to possibly unnecessary work. Though if these were intentional, then other
developers should be aware of it rather than rewriting the axioms once again. So
for this kind of change we would expect a more granular analysis to be helpful.</p>
        <p>The second fundamental aspect of any di is the presentation of changes
to the user (ii). Currently ontology di s return as the output an unstructured,
uncharacterised set of changes. As a consequence the task of change analysis is
not particularly appealing. Consider the fact that the average di size across the
NCIt corpus is over 6,000 changes; relying on current di methods for change
analysis would be frightening, to say the least. At this point it would be useful
to determine further properties of individual changes which might help the user
understand changes. A change could relate to, e.g, a newly introduced term, or
an adjustment to the class hierarchy. Whatever it may be, the current
state-ofthe-art in ontology di ng does not carry out such a characterisation of changes.</p>
        <p>
          In terms of computability, one would expect an ontology di to be e ciently
computable for OWL 2 ontologies.4 CEX [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], e.g., computes di erences e
ciently only for a fragment of OWL.5
        </p>
        <p>
          Overall there is little tool support available to end-users for analysing and
understanding di erences between ontologies. Particularly when it comes to large
ontologies, such as SNOMED CT, browsing through substantial change sets
while inspecting two of its versions is not only tedious, but also requires
aboveaverage hardware to browse through SNOMED CT seamlessly. Based on the di
method described in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], given two ontologies O1 and O2 we obtain categorised
sets of changes according to their apparent impact from O1 to O2, and vice-versa.
This approach seems more reasonable to grasp what has actually changed from
one ontology to another. In this paper we demonstrate via walkthroughs how one
can have a better understanding of a change set based on such a categorisation,
while for the actual de nitions we refer the reader to [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. We will describe how
the output of this di is obtained based on examples, as well as how di erent
categories can be interpreted by users.
3.2
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Intentional Di</title>
      </sec>
      <sec id="sec-3-4">
        <title>Walkthrough</title>
        <p>In order to demonstrate the usefulness of the di categories, let us compare
ontologies O1 and O2, de ned in Table 1.
From O1 and O2 we have the following structural di erences:
Additions(O1; O2) = f 1; 2; 4; 5; 6; 7; 9; 10; 11; 12; 13g
Removals(O1; O2) = f 1; 3; 4; 5; 7; 8g</p>
        <p>
          Note that 6 is not syntactically equal to 8 ( 6 6= 8), however they are
structurally equivalent ( 6 s 8). Therefore these axioms are not reported as
changes. Given the sets of structural additions and removals from O1 to O2,
we check which axioms in the Removals(O1; O2) are entailed by O2 (ine ectual
removals), and vice-versa for the Additions(O1; O2). Thus we obtain a distinction
between e ectual and ine ectual changes, as follows:
4 Based on DLs up to SROIQ [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
5 Speci cally acyclic EL-terminologies [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>E ectualAdditions(O1; O2) = f 2; 6; 10; 11; 12; 13g
E ectualRemovals(O1; O2) = f 8g
Ine ectualAdditions(O1; O2) = f 1; 4; 5; 7; 9g
Ine ectualRemovals(O1; O2) = f 1; 3; 4; 5; 7g</p>
        <p>There are several ine ectual changes in the change set, while e ectual changes
are mostly additions (and a single removal). The changes are categorised as
shown in Table 2.</p>
        <p>Let us consider two ine ectual additions; 9 is a rewrite of f 7; 5g, as well as
an avoided redundancy (i.e., had it been added to O1 it would be redundant). The
axiom is also weakened, due to 8. This may seem like an unintentional change,
since now we face a loss of information regarding J , which is no longer mentioned
in O2. Such a change may be worth revising. The axiom 1 is redundant, since
we have from O1 that A v C, which is also entailed from O2. Therefore the user
can dispose of this axiom.</p>
        <p>Bear in mind that the existence of a rewritten axiom from O1 to O2 does
not imply that the same holds in the opposite direction. This is applicable to
all categories. Also we can have that an axiom is in more than one category.
Consider axiom 1; a justi cation J1 for 1 is J1 = f 2; 3g, which indicates a
strengthening (since we have that 2 2 E ectualAdditions(O1; O2)), as well as
a redundancy ( 3 2 O1 \ O2). Another justi cation J2 = f 1; 3g indicates a
redundancy; 1 2 Ine ectualAdditions(O1; O2).</p>
        <p>In terms of e ectual changes there is only one removal, and six additions. The
e ectual removal ( 8) represents a weakening of 9 with retired terms (J is not
mentioned in O2). In the analysis of the ine ectual changes it was already noted
that axiom 8 should be revised. The pure additions appear to be adjustments
to the class hierarchy, some associated with new terms in O2. Both axioms 11
and 12 are strengthenings of 4, which suggests that they could be merged,
especially since there is intra-axiom redundancy. Finally there is a new term K
in O2 being described via axiom 10.</p>
        <p>Generally speaking, with such a categorisation it becomes conceivably more
intuitive to navigate and understand sets of changes. In addition to this, we
gathered from the analysis of ine ectual changes useful information about the
changes between O1 and O2, e.g., that axiom 4 is strengthened in two distinct,
yet partially super uous axioms ( 11 and 12). Similarly we discover that axiom
9 is weakened, from 8, which should be reconsidered as we now have that
O2 2 F v J (J becoming a retired term).</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.3 Implementation</title>
        <p>
          The algorithm to compute the di and its categories is straightforwardly
derivable from the de nitions in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], and heavily relies on decision procedures for
entailments [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], justi cation nding [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], and module extraction algorithms [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
The di itself is implemented in Java, relying on the OWL API, and is made
available on the Web as a Java Servlet.6 The output of each di is an XML
le,7 containing the axioms in the di and their respective categories. In order
to present this output in a more sensible way, we use an XSL Transformation
(XSLT)8 and a Cascading Style Sheet (CSS),9 which given the XML le presents
the axioms in the di according to their respective categories. The resulting web
page also contains the pairing of each di erence, as well as the new or retired
terms used, where applicable. An example of the Servlet output is shown in
Figure 1, where axioms are represented in Manchester Syntax.10
6 http://owl.cs.manchester.ac.uk/diff
7 http://www.w3.org/XML/
8 http://www.w3.org/TR/xslt
9 http://www.w3.org/Style/CSS/
10 http://www.w3.org/TR/owl2-manchester-syntax/
Throughout the case studies we have excluded a class of ine ectual changes:
changes to annotations. Since there are ontologies whose major focus is the
annotations, as the NCIt, these represent a topic of study in themselves. E.g.,
in the NCIt the average proportion of axiomatic changes throughout the corpus
is 15%, while the remaining are annotation changes. But for the purposes of
this paper we focus on axiom changes only. The experiment machine is an Intel
Xeon Quad-Core 3.20GHz, with 16Gb DDR3 RAM dedicated to the Java Virtual
Machine (JVM v1.5). The system runs Mac OS X 10.6.8, and all tests were run
using the OWL API (v3.1.0).
        </p>
        <p>The NCIt archive11 contains 88 versions of the ontology in OWL format, two
of which were unparsable (releases 05.03F and 05.04d) with the OWL API,12
11 http://evs.nci.nih.gov/ftp1/NCI_Thesaurus
12 http://owlapi.sourceforge.net/
and consequently Protege.13 The test data is published on Google Public Data
Explorer,14 and can be visualised at http://bit.ly/leZ6fM.</p>
        <p>SNOMED CT is not readily available in OWL, however IHTSDO15
supplies a Perl16 script to transform the published concept and stated relationships
tables into OWL or KRSS formats. We collected all 5 International Releases
of SNOMED CT that can be converted into OWL via this script, since
January 2009 through to January 2011 (the versions before 2009 are not published
with this transformation script into OWL). Upon executing the conversion of
SNOMED CT releases into OWL we discovered a discrepancy between the OWL
and KRSS transformation outputs. Speci cally in the January and July 2009
releases (bundled with version 1.1 of the Perl script), the OWL and KRSS
serializations di er in over 100,000 subclass axioms. The OWL transformation of
the January 2009 release contains 115,941 more subclass axioms than the KRSS
output, while in the July 2009 release the OWL version has 113,665 more
subclass axioms. This situation no longer presents itself with subsequent versions of
the transformation script; the 2010 releases are bundled with version 2.0 of the
script, while the 2011 release comes with version 2.1. Both these versions yield
the same OWL and KRSS outputs. By applying version 2.1 of the Perl
transformation script to the 2009 releases we get a match on the output (of OWL and
KRSS). As such, we used version 2.1 of the transformation script on the 2009
releases in order to carry out this study.
4.1</p>
      </sec>
      <sec id="sec-3-6">
        <title>NCIt Di</title>
      </sec>
      <sec id="sec-3-7">
        <title>Results</title>
        <p>
          As reported in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], the logical di erence throughout the NCIt time-line consists
of an average of 87% e ectual changes, while the remaining are ine ectual (see
Figures 2 and 3 and Table 3). There are more additions than removals throughout
the corpus, with an average of 60% additions versus 40% removals, which is
hardly surprising for a constantly evolving ontology.
10000 
1000 
100 
10 
1  v2  v3  v4  v5  v6  v7  v8  v9  v10  v11  v12  v13  v14  v15  v16  v17  v18  v19  v20  v21  v22  v23  v24  v25  v26  v27  v28  v29  v30  v31  v32  v33  v34  v35  v36  v37  v38  v39  v40  v41  v42  v43  v44  v45  v46  v47  v48  v49  v50  v51  v52  v53  v54  v55  v56  v57  v58  v59  v60  v61  v62  v63  v64  v65  v66  v67  v68  v69  v70  v71  v72  v73  v74  v75  v76  v77  v78  v79  v80  v81  v82  v83  v84  v85  v86 
        </p>
        <p>Effectual  Ineffectual 
13 http://protege.stanford.edu/
14 http://www.google.com/publicdata/home
15 http://www.ihtsdo.org/
16 http://www.perl.org/
10000 
1000 
100 
10 
1  v2  v3  v4  v5  v6  v7  v8  v9  v10  v11  v12  v13  v14  v15  v16  v17  v18  v19  v20  v21  v22  v23  v24  v25  v26  v27  v28  v29  v30  v31  v32  v33  v34  v35  v36  v37  v38  v39  v40  v41  v42  v43  v44  v45  v46  v47  v48  v49  v50  v51  v52  v53  v54  v55  v56  v57  v58  v59  v60  v61  v62  v63  v64  v65  v66  v67  v68  v69  v70  v71  v72  v73  v74  v75  v76  v77  v78  v79  v80  v81  v82  v83  v84  v85  v86 </p>
        <p>Effectual  Ineffectual </p>
        <p>Throughout the corpus there is a high number of ine ectual removals, with an
average of 35% of all removals. Out of these ine ectual removals 92% turned out
to be strengthened axioms, which indicates a continuous re ning of information
throughout the corpus. There is a high number of removed redundancies as
well, constituting 49% of ine ectual removals. This tells us that there is some
pruning of redundant information going on in the corpus. On average 5% of
additions are ine ectual; among these, 73% are added redundancies, and 82%
are weakened axioms. The latter occur typically due to adjustments in the class
hierarchy. Despite being a high percentage, the number of ine ectual additions
is generally low (Table 3). In cases where the number of ine ectual changes is
quite high (e.g., O24 where 52% of changes are ine ectual, O27, O29 and O30 with
48% each) note that semantic di s would signi cantly understate the amount of
activity performed. While structural di captures this, it does not analyse the
logical impact of such changes.</p>
        <sec id="sec-3-7-1">
          <title>Axiom Terms</title>
        </sec>
        <sec id="sec-3-7-2">
          <title>Total</title>
        </sec>
        <sec id="sec-3-7-3">
          <title>Strengthening</title>
        </sec>
        <sec id="sec-3-7-4">
          <title>New Description</title>
        </sec>
        <sec id="sec-3-7-5">
          <title>Pure Addition</title>
        </sec>
        <sec id="sec-3-7-6">
          <title>Total</title>
        </sec>
        <sec id="sec-3-7-7">
          <title>Weakening</title>
        </sec>
        <sec id="sec-3-7-8">
          <title>Retired Description</title>
        </sec>
        <sec id="sec-3-7-9">
          <title>Pure Removal</title>
          <p>We also identi ed a number of rewrites in the corpus. Particularly in O33,
there are 227 rewritten axioms. Upon inspecting the rewritten axioms, we noticed
that these changes are not only syntactic but also trivial and easily detected.
While ideally the underlying structural di would not include these, at least
with our categorisation and alignment with source axioms, it is easy to spot and
recognise the triviality.</p>
          <p>Among the categories of e ectual changes we discovered that the majority
of these are new and retired descriptions. In terms of e ectual additions the
average of new descriptions is 60%, while retired descriptions average 51% of
e ectual removals. This high number of new descriptions is unsurprising, as the
terminology keeps increasing. Despite the high values of retired descriptions, it
does not mean that such concepts are deleted, it could instead suggest concept
renamings. Strengthenings average around 4% of all additions (with and without
new terms), indicating re nements of concepts with additional constraints. It is
natural that upon introducing new terms, others need to be re-described, thus
explaining the strengthenings with new terms. There are not as many
weakenings in the corpus as there are strengthenings. This tells us that typically there
is not much reduction of information from version to version. The average of
weakenings (with or without retired terms) throughout is below 2% of all
removals. Pure additions account for 37% of all additions, divided between 24% of
additions with new terms and 13% without. Typically pure changes with shared
terms suggest adjustments to the class hierarchy, while pure additions involving
new terms point to the insertion of said terms and subsequent re-adjustment of
the hierarchy accordingly. The average of pure removals throughout the corpus
is 47%, split between 23% with retired terms and 24% without.
4.2</p>
        </sec>
      </sec>
      <sec id="sec-3-8">
        <title>SNOMED CT Di Results</title>
        <p>Throughout the 5 versions of SNOMED CT there are typically more e ectual
changes (74%) than ine ectual (26%), as shown in Figure 4. However, within
the removals there is a high number of ine ectual changes (10,214), averaging
37% of all removals (see Table 4). In the additions there are on average more
e ectual changes (84%) than ine ectual (16%).</p>
        <p>The majority of e ectual additions (43% of all changes) are pure additions,
suggesting numerous alterations to the class hierarchy, possibly as a result of</p>
        <p>Change</p>
        <p>Type</p>
        <sec id="sec-3-8-1">
          <title>Axiom Terms</title>
        </sec>
        <sec id="sec-3-8-2">
          <title>Total</title>
        </sec>
        <sec id="sec-3-8-3">
          <title>Strengthening</title>
        </sec>
        <sec id="sec-3-8-4">
          <title>New Description</title>
        </sec>
        <sec id="sec-3-8-5">
          <title>Pure Addition</title>
        </sec>
        <sec id="sec-3-8-6">
          <title>Total</title>
        </sec>
        <sec id="sec-3-8-7">
          <title>Weakening</title>
        </sec>
        <sec id="sec-3-8-8">
          <title>Retired Description</title>
        </sec>
        <sec id="sec-3-8-9">
          <title>Pure Removal</title>
          <p>the addition of new terms, as there is also a high number of pure additions
involving new terms. Naturally when introducing a new, non-leaf term in the
class hierarchy it causes shifts throughout one or more branches. Thus, given
the high number of new descriptions (6,266) throughout SNOMED CT, the
detection of many pure additions is not surprising.</p>
          <p>Within the e ectual removals (31% of all changes) we see a similar pattern as
in the additions: the majority (63%) of e ectual removals are pure removals, some
with retired terms. There is also a high number of retired descriptions, suggesting
that as terms are retired (either deleted or declared as `retired concepts') this
causes modi cations to the class hierarchy. This also explains the high number of
pure additions, with shared and retired terms. Weakenings are not as common,
amounting to 10% of e ectual removals.</p>
          <p>In terms of ine ectual additions (8% of all changes), there are mostly
weakened axioms (3,897), as well as a high number of avoided redundancies (2,561).
The latter are derived from modi cations to the class hierarchy, which in this case
can be ignored. Had there been redundancies it would be advisable to inspect
them, and possibly dispose of these from the ontology. The weakened axioms,
on the other hand, should be veri ed for correctness, as they reveal a loss of
constraints w.r.t. the terms described in such axioms.</p>
          <p>The number of ine ectual removals (18% of all changes) is much higher than
the additions, and consists of mostly strengthened axioms (9,386). This suggests
that a constant tightening of the ontology is taking place; despite the fact that
these axioms are removed, it is only due to the introduction of stronger
constraints on the meaning of such axioms. The remaining ine ectual removals are
avoided redundancies (4,395).</p>
          <p>Overall we see a good balance of e ectual additions vs removals, and a lot
more ine ectual removals than additions. By inspecting the di erent types of
changes throughout the evolution of SNOMED CT, we can single out particular
occurrences in more detail; e.g., when a new term is introduced in the class
hierarchy (new description), it will typically lead to adjustments to the hierarchy
(pure additions) involving both shared terms which have to be switched around,
and the new term itself (or other new ones). Using a standard syntactic or
semantic di such an analysis would be impractical, and much of the work would
be left up to the user (such as checking which changes have logical impact).
With this categorisation even a user who is not a domain expert or an ontology
engineer can spot trends in the di between ontology versions, as we did here
with SNOMED CT.
Note that in SNOMED CT there is a bigger proportion of ine ectual changes
than in the NCIt, and the number of certain types of change is similar in both
cases, e.g., within ine ectual removals there are mostly strengthened axioms,
followed by redundancies. However, in the case of SNOMED CT, there are only
avoided redundancies, while in the NCIt there are redundancies which could be
disposed of. Clearly certain ine ectual changes are in fact \refactorings", albeit
in the case of strengthened and weakened axioms the refactoring would have to
be of a set of axioms rather than a single axiom. Thus a strengthened axiom
does not necessarily mean strengthening of the ontology, since the change might
either introduce a redundancy or redistribute information from other axioms.
Consider an ontology O1 = f 1 : A v B; 2 : A v Cg, and a change of 1
into A v B u C. The axiom 1 was strengthened, but the resulting ontology
O2 = f 1 : A v B u C; 2 : A v Cg was not. However, if we change 2 2 O2 into
A v C u D, we can say both the axiom 2 and the ontology O2 are strengthened.</p>
          <p>In terms of e ectual changes there are mostly new and retired descriptions
in the NCIt, while in SNOMED CT the majority of e ectual changes are pure
additions or removals. So we see that a major focus in SNOMED CT seems to be
class hierarchy oriented, with many adjustments going on. In the NCIt, on the
other hand, we see that there are a lot of new terms being introduced, and since
we know that terms are never deleted, the high number of retired descriptions
suggests there are many renamings of term names.
In order to elaborate on the potential usefulness of the devised categories, let
us look at a particular comparison, di (O32; O33), and walk through the output
of the di from the perspective of a user. The output of di (O32; O33) is
outlined in Table 5. A user whose role is to assure the overall progress and quality of
changes is particularly interested in ensuring that e ectual changes are
appropriate, and may also be interested in understanding why committed changes have
no logical e ect. Looking at the di , the user immediately sees that the changes
are balanced between e ectual and ine ectual, with most of the action in the
e ectual additions and ine ectual removals. Being more concerned with added
information, the user begins by inspecting the e ectual additions. Particularly
by looking through new descriptions, the user is immediately aware of the new
terms that have been introduced, as the following axioms demonstrate:
1: Oxidized Glutathione v P rotective Agent
2: CHP -HER-2 P eptide V accine v 9Chemical Or Drug Has
P hysiologic Ef f ect.Immunopotentiation Ef f ect</p>
          <p>Change</p>
          <p>Type</p>
        </sec>
        <sec id="sec-3-8-10">
          <title>Total</title>
        </sec>
        <sec id="sec-3-8-11">
          <title>Weakening</title>
        </sec>
        <sec id="sec-3-8-12">
          <title>Retired Description</title>
        </sec>
        <sec id="sec-3-8-13">
          <title>Pure Removal</title>
          <p>The axiom 1 introduces the term Oxidized Glutathione, while 2
introduces CHP -HER-2 P eptide V accine. The user can restrict his/her attention
to those additions that concern his domain expertise, thus being also
interested in strengthenings within this domain. Let us look at the latter:
axiom 3 is a strengthening of 3, and 4 a strengthening with new terms
(Anterior F oramen M agnum) of 4, as follows:
3: Skin Appendage Adenoma Adenoma u Benign Epithelial Skin</p>
          <p>N eoplasm u Benign Skin Appendage N eoplasm
3: Skin Appendage Adenoma v Adenoma
4: Anterior F oramen M agnum M eningioma F oramen M agnum
M eningioma u 8Disease Has P rimary Anatomic Site.Anterior
F oramen M agnum
4: Anterior F oramen M agnum M eningioma v F oramen M agnum</p>
          <p>M eningioma</p>
          <p>The user sees in both axioms an addition of information w.r.t. the previous
version, and can narrow down the search to axioms concerning his/her domain
area to verify correctness. In the same manner the user goes through those
alterations with and without new terms. The following axiom 5 is an alteration,
and 6 is an alteration with a new term c Concept Status:
5: Epidermal Involvement v Cutaneous Involvement
6: U M LS Cross-Ref erence Concept v c Concept Status</p>
          <p>This type of axiom generally indicates adjustments to the class hierarchy,
which the user may want to verify in his respective domain area. Switching
over to ine ectual changes, the user begins by inspecting the rewrites. Since
rewritten axioms are supposed to convey the same logical meaning in both O32
and O33, the user wants to understand why these are presented by the di . The
users nds axiom 7 rewritten into 7, as follows:</p>
          <p>The change from 7 to 7 is purely syntactic, and thus both axioms carry the
same logical meaning. As discussed in Section 4.1, we nd here a particularity
of OWL's notion of structural equivalence that could be re ned. Seeing as the
inspected rewrites exhibit this form, the user skips further analysis of this type
of change. Next the user looks at redundant axioms; since redundancies do not
add any logical meaning the user may want to prune them. Such redundancies
are guaranteed not to alter the semantics of the ontology alone, and so can be
immediately disposed of. The user nds redundancies of similar form to 8, with
a justi cation J 8 O33, as follows:
8: CS-1008 v 9Chemical Or Drug Has M echanism Of Action.</p>
          <p>Antigen Binding Interaction
J 8 = fCS-1008 v M onoclonal Antibody,</p>
          <p>M onoclonal Antibody v 9Chemical Or Drug Has M echanism
Of Action.Antigen Binding Interactiong</p>
          <p>Upon nding these, the user proceeds to remove them from the ontology.
Meantime it would be helpful to investigate with the corresponding developers
why these redundancies were being added in the rst place. The user then carries
on inspecting avoided redundancies; among these the user comes across several
of the same kind as 9, with corresponding justi cation J 9 O32:
9: Lactic Acid L v Industrial P roduct
J 9 = fLactic Acid L v P harmaceutical Excipient,</p>
          <p>P harmaceutical Excipient v Industrial Aid,</p>
          <p>Industrial Aid v Industrial P roductg</p>
          <p>This change reveals to the user that some adjustments to the class hierarchy
took place involving terms in f9. The user can inspect the class hierarchy to
con rm this, and recognise that such changes would be redundant in the previous
ontology. However, while 9 is redundant w.r.t. to O32 it is still the case that
removing it from O33 would cause loss of information. As such, the user leaves
these avoided redundancies behind and begins to verify weakened axioms. A
weakened axiom implies a reduction of constraints on models of the ontology, and
so should be carefully reviewed. The user comes across 10 with a justi cation</p>
          <p>The user realises that 10 is indeed weaker than the axiom in J 10 O32,
and not being the domain expert dispatches this change to the respective expert
for review and con rmation of the original intent. Subsequently the user inspects
the strengthened axioms, since these are more constricting in O33 than they were
in O32. Consider axiom 11 and its justi cation J 11 O33:</p>
          <p>11: Sporadic Cylindroma v Cylindroma
J 11 = fSporadic Cylindroma Cylindroma u 8Disease Has F inding:</p>
          <p>N on-Hereditary Lesiong</p>
          <p>Given changes of this type it may be desirable to con rm them, and so
the user takes the same action as for weakened axioms and delegates them to
appropriate domain experts while inspecting other changes of interest.</p>
          <p>We have shown here an example of how such categorisation makes change
analysis far more manageable, as opposed to having users inspect a whole set
of axioms or terms. The devised categories allow users to focus on speci c
types of change, and particularly see what these are a change of (in the
previous or subsequent version). Moreover a categorisation of this type produces
a reasonable division of labour, thus making collaboration e orts more practical.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion and Outlook</title>
      <p>The diachronic studies of the NCIt and SNOMED CT revealed that all the
categories of changes occur throughout each corpus, providing indications as to the
impact of changes. We showed that such a categorisation of change sets is useful
for change analysis, with particular bene ts for division of labour by ontology
engineers. By means of this categorisation we can group changes according to
their impact, allowing users to shift their attention to speci c types of changes,
rather than going through a change set while inspecting both ontologies. With
our correspondence of changes between ontologies we can show the changed
axioms and what they are a change of. Consequently, by analysing changes in this
way there is no need for constantly having to inspect ontologies manually. As
such, we can support users in understanding the impact of their changes (or lack
thereof), and re ne these before publishing newer versions.</p>
      <p>We found that ine ectual changes account for a signi cant amount of changes
throughout the NCIt, as well as in SNOMED CT with an even higher proportion.
Despite the fact that semantic di s ignore these changes in their output, we show
that they provide helpful modelling insights, and thus are worth examining.
For instance, we discovered a high number of redundant axioms in the NCIt,
some of which could be disposed of. Also we found a number of structurally
distinct ine ectual changes that are clearly neglectable: the rewrites in the NCIt.
These indicate the need for improvement of the underlying di . In general the
inspection of ine ectual changes is helpful to prevent, e.g., re-doing work or
introducing redundancy. Relying on semantic di s one would be missing out on
these meaningful ine ectual changes, which in turn could help users recognise
the impact of certain types of change.</p>
      <p>The next step in our study is to evaluate the di tool with users. In particular
we expect to con rm that the categorisation helps users in understanding changes
between ontologies. In terms of further analysis of the NCIt and SNOMED CT we
intend to inspect the history of axioms, as in checking the progress of each axiom
throughout the corpus since it was introduced (e.g., changes in constructors or
strengthenings that the axiom went through). Another future survey involves
checking for patterns of change throughout the corpus, e.g., if all strengthenings
exhibit a common form.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Cuenca</given-names>
            <surname>Grau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Kazakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Sattler</surname>
          </string-name>
          ,
          <string-name>
            <surname>U.</surname>
          </string-name>
          :
          <article-title>Modular reuse of ontologies: Theory and practice</article-title>
          .
          <source>J. of Arti cial Intelligence Research</source>
          <volume>31</volume>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Cuenca</given-names>
            <surname>Grau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Motik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Parsia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Sattler</surname>
          </string-name>
          ,
          <string-name>
            <surname>U.</surname>
          </string-name>
          :
          <article-title>OWL 2: The next step for OWL</article-title>
          .
          <source>J. of Web Semantics</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Goncalves</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Analysing the evolution of the NCI Thesaurus</article-title>
          .
          <source>In: Proc. of DL 2011</source>
          . pp.
          <volume>147</volume>
          {
          <fpage>157</fpage>
          . CEUR Workshop Proceedings (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Goncalves</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Categorising logical di erences between OWL ontologies</article-title>
          .
          <source>In: Proc. of CIKM</source>
          <year>2011</year>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kutz</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>The even more irresistible SROIQ</article-title>
          .
          <source>In: Proc. of KR-06</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>A tableaux decision procedure for SHOIQ</article-title>
          .
          <source>J. of Automated Reasoning</source>
          <volume>39</volume>
          ,
          <issue>249</issue>
          {
          <fpage>276</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Jimenez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berlanga</surname>
            <given-names>Llavori</given-names>
          </string-name>
          , R.:
          <article-title>Supporting concurrent ontology development: Framework, algorithms and tool</article-title>
          .
          <source>Data and Knowledge Engineering</source>
          <volume>70</volume>
          (
          <issue>1</issue>
          ),
          <volume>146</volume>
          {
          <fpage>164</fpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
          </string-name>
          , J.:
          <article-title>Swoop: A Web ontology editing browser</article-title>
          .
          <source>J. of Web Semantics</source>
          <volume>4</volume>
          (
          <issue>2</issue>
          ) (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirin</surname>
          </string-name>
          , E.:
          <article-title>Finding all justi cations of OWL DL entailments</article-title>
          .
          <source>In: Proc. of ISWC/ASWC-07</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Konev</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walther</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>The logical di erence problem for description logic terminologies</article-title>
          .
          <source>In: IJCAR-08. LNCS</source>
          , vol.
          <volume>5195</volume>
          . Springer-Verlag (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Malone</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holloway</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adamusiak</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kapushesky</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolesnikov</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhukova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brazma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.E.</given-names>
          </string-name>
          :
          <article-title>Modeling sample variables with an experimental factor ontology</article-title>
          .
          <source>Bioinformatics</source>
          <volume>26</volume>
          (
          <issue>8</issue>
          ),
          <volume>1112</volume>
          {
          <fpage>1118</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          :
          <article-title>PROMPTDIFF: A xed-point algorithm for comparing ontology versions</article-title>
          .
          <source>In: Proc. of AAAI-02</source>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>