<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Applying Evaluation Criteria to Ontology Modules</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Zubeida Casmod KHAN</string-name>
          <email>zkhan@csir.co.za</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Council for Scientific</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Industrial Research</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pretoria</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>South Africa</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>Modularity has been increasingly used as a solution to assist with the use and maintenance of large ontologies. However, there is lack of evaluation methods available for determining the quality of an ontology module. While initial work has been done on producing a comprehensive list of evaluation characteristics that can be used to check the quality of a module, certain aspects are still unclear. For instance, the initial list has not been structured into different groups for different types of evaluation criteria. It is also unclear on how to apply these characteristics to a particular module, and the metrics have not been experimentally validated for some types of ontology modules. In this paper, we structure the comprehensive evaluation criteria into groups, provide practical examples on how to use the evaluation characteristics to assess a module, and validate the evaluation characteristics with a set of ontology modules.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>ontology evaluation</kwd>
        <kwd>ontology metrics</kwd>
        <kwd>ontology modules</kwd>
        <kwd>ontology modularisation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Over the last few years, there has been a growth in using modularity to assist with
maintaining and using large ontologies. The general concept of modularity refers to dividing
and separating the components of a large system such that modules can be recombined.
Modularity is used to simplify and downsize an ontology for the task at hand; to
modularise a large ontology into smaller manageable ontologies. Modularity has been
successfully applied to a number of different ontologies to improve usability and assist with
complexity. Examples include the myExperiment ontology [16], which is a collaborative
environment where scientists publish and share their work-flows and experiment plans
among groups, the Semantic Sensor Net ontology where there are various modules to
describe sensors and observations [9], and BioTop ontologies for life sciences in which
the principle of modularisation have been applied [22].</p>
      <p>An issue concerning ontology modules is the lack of evaluation metrics. The existing
works on evaluation metrics focus on only some metrics that suit the modularisation
technique [20,21,26], and there is not always a quantitative approach to calculate them.
Overall, the metrics are not comprehensive enough to apply to a variety of modules. It is
therefore not clear on how to determine whether a module is of good quality.</p>
      <p>We have already published initial work done on determining whether a module is
a good or bad module with evaluation metrics [10]. The initial study [10] revealed 13
criteria from the literature, and 3 new ones, some of which were short of a metric for
quantitative evaluation that have now been devised. In this work, we categorise the 16
criteria into 5 broader categories to structure them so that ontology developers can easily
identify which ones to use for an application. We also demonstrate usage of the criteria
with practical examples. Lastly, we perform an experimentation evaluation on a set of
ontology modules using the evaluation criteria revealing how the modules measure in
terms of quality.</p>
      <p>The remainder of the paper is structured as follows. We discuss related works in
Section 2. This is followed by a section on the evaluation criteria in Section 3. In Section 4
we discuss the types of modules we are going to evaluate. The experimental evaluation
using the evaluation metrics is performed in Section 5. Finally we conclude in Section 6.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <p>
        d’Aquin et. al [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] present some criteria for evaluating ontology modules including logical
criteria, e.g., local correctness, structural criteria e.g., size of module, and intra-module
distance, software criteria, e.g., encapsulation, and independence, quality criteria, e.g.,
module cohesion, and relational criteria, e.g., connectedness, and inter-module distance.
The study [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] also revealed that existing criteria for ontology modularisation evaluation
is not sufficient, and that not all the proposed criteria can be used on all ontology
modules. In other work, Loebe [14], proposed a number of requirements for logical modules,
such as logical correctness and completeness. Loebe also acknowledges that the
requirements do not hold for all applications and that specialised methods should be applied for
different applications.
      </p>
      <p>Tartir et. al [23] propose richness criteria to measure the quality of ontologies. This
criteria is based on how rich the ontology is with regard to attributes and subclasses.
For cohesion metrics, Yao et. al propose metrics such as the number of root classes,
number of leaf classes, and average depth of inheritance tree of all leaf node [26]. These
metrics, however, does not reveal how the entities are related in a module as compared
to the original ontology. The work done by Oh et. al on cohesion metrics, however,
does measure the strength of the relations in a module [18]. To measure the coupling
of an ontology, researchers propose metrics based on the number of externally defined
referenced concepts [19]. This, however, does not take into account external links that
different modules share. Oh and Ahn [17] have improved on this to consider the external
links between different modules based on whether the link is hierarchical or relational.
However, their metric is simply a sum value of the number of each type of links between
modules, which does not measure the complete interdependence of a module since it
only considers one type of variable in the module.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Evaluation Metrics for Modules</title>
      <p>In previous work, we already defined a list of evaluation criteria alongside their
mathematical metrics for modularity [10]. We now group them into categories of criteria, and
provide some examples demonstrating how to use them. For the set of criteria, let O be
an ontology with a corresponding set of axioms, Axioms(O), and M be a module with a
corresponding set of axioms, Axioms(M). Let i, j be arbitrary entities in an ontology.</p>
      <sec id="sec-3-1">
        <title>3.1. Structural Criteria</title>
        <p>Structural criteria are calculated based on the structural and hierarchical properties of
the module. These criteria are calculated by inspecting the syntax of the ontology. It is
usually based on counting components of the ontology such as axioms, entities, etc., and
is a numerical value. Calculating structural criteria involves evaluating the size, relations,
and placement of entities within a module. We now list the structural criteria, alongside
practical examples.</p>
        <p>
          Size: Size is the number of entities in a module (the number of classes, object properties,
data properties, and individuals in a module) [
          <xref ref-type="bibr" rid="ref2 ref3">2,3,18,21,20</xref>
          ].
        </p>
        <p>Relative size: The relative size is the size of the module, i.e., the number of entities in a
module compared to the original ontology [12].</p>
        <p>Relative size(M) = jMj</p>
        <p>O
j j
Atomic Size(M) = jAxiomj
jAtomj</p>
      </sec>
      <sec id="sec-3-2">
        <title>Example 3 Consider the example in Figure 1 of an atomic decomposition [25].</title>
      </sec>
      <sec id="sec-3-3">
        <title>The number of atoms in the example is 6 and there are 7 axioms in total. The</title>
        <p>atomic size is hence 67 = 1:17. This tells us that there is an average of 1.17 axioms
per atom for the example.</p>
      </sec>
      <sec id="sec-3-4">
        <title>Example 1 The GFO-Basic ontology [8] module contains 47 classes, 0 individu</title>
        <p>als, 41 object properties, and 0 data properties. The source ontology, GFO,
contains 78 classes, 0 individuals, 67 object properties, and 0 data properties. Hence
the relative size is 7487++00++6471++00 = 0.61.</p>
        <p>Appropriateness: Appropriateness is measured by mapping the size of an ontology
module to some appropriateness function value between 0 and 1 to reflect the defect
density [21].
(1)
(2)
(3)
Appropriate(x) =</p>
        <p>cos(x:
where x is the number of axioms in the module. The authors propose the function
with 250 axioms as it is based on the fact that the optimal size of software lines in
a code is 200-300 lines [21].</p>
      </sec>
      <sec id="sec-3-5">
        <title>Example 2 The Temporal Relations module of the DOLCE-ExtendedDnS descrip</title>
        <p>
          tions and situations ontology [15] has 435 axioms. Therefore, based on the
equation for calculating appropriateness, the value is 12 21 cos(435: 2p50 ) = 0.16.
Atomic size: The atomic size of a module is the average size of a group of
interdependent axioms in a module. This is based on the notion of atoms in modules,
which are defined as a group of axioms with dependencies between each other
[24]. Dependent axioms are those that describe the same named entity.
Intra-module distance: The intra-module distance in a module is the distance between
entities in a module [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. To measure it, Freeman’s farness equation is used to
measure the sum of distances to all other nodes [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]; the full equation is in [12].
Relative Intra-module distance: The relative intra-module distance is the difference
between entities in a module and entities in a source ontology [12].
        </p>
        <p>Relative intra-module distance(M) =</p>
      </sec>
      <sec id="sec-3-6">
        <title>Intra-module distance(O)</title>
      </sec>
      <sec id="sec-3-7">
        <title>Intra-module distance(M)</title>
        <p>(5)</p>
      </sec>
      <sec id="sec-3-8">
        <title>Example 4 Consider the example of a source ontology O and a module M shown</title>
        <p>in Figure 2, and the farness values in Table 1. Using the farness values in the
intramodule distance equation, we calculated the intra-module distance of the source
ontology O to be 32 and of the module M to be 16. The relative intra-module
distance is 1362 = 2, hence the module entities are twice as close as the original
ontology.</p>
        <p>
          Cohesion: Cohesion refers to the extent to which entities in a module are related to
each other. [
          <xref ref-type="bibr" rid="ref5">5,18,17,26</xref>
          ]. To measure it, we use Oh et. al’s equation [18]; the full
equation is in [12].
        </p>
        <p>A
B
C
D
E</p>
        <p>Farness, O
A B
- 1
1
1 0
4 3
4 3
(7)
Correctness(M) =
(true
f alse
i f Axioms(M)
otherwise</p>
      </sec>
      <sec id="sec-3-9">
        <title>Axioms(O)</title>
      </sec>
      <sec id="sec-3-10">
        <title>Example 6 The GFO-Abstract-Top ontology is a subset of the GFO ontology [8].</title>
      </sec>
      <sec id="sec-3-11">
        <title>No new axioms have been added to the GFO-Abstract-Top ontology; it only con</title>
        <p>tains those axioms which exist in the GFO ontology. Thus, the GFO-Abstract-Top
ontology is logically correct. The GFO-Basic ontology, however, is a smaller
module based on the GFO ontology but it also contains new axioms that do not exist in
the GFO ontology. For instance, the entity Processual Structure exists in the
GFO</p>
      </sec>
      <sec id="sec-3-12">
        <title>Basic module but not in the source ontology, GFO. Thus, the logical correctness property does not hold for the GFO-Basic module.</title>
        <p>
          Completeness: Completeness is when the meaning of every entity in a module is
preserved as in the source ontology [
          <xref ref-type="bibr" rid="ref1 ref3">14,1,3,20</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-13">
        <title>Example 5 Consider module M, from Figure 2. The sum of all the 1/farness values</title>
        <p>is 10 as shown in Table 1. The number of entities in M is 5. Hence the cohesion
value is as follows: Cohesion = 51(40) = 0.5.</p>
      </sec>
      <sec id="sec-3-14">
        <title>3.2. Logical Criteria</title>
        <p>
          By definition, an ontology is: “a logical theory accounting for the intended meaning of
a formal vocabulary, i.e. its ontological commitment to a particular conceptualisation of
the world” [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. As such, it is possible to evaluate ontology modules by the logical criteria
that they hold. We now list the logical criteria, alongside practical examples.
Correctness: This states that every axiom that exists in the module also exists in the
source ontology and that nothing new should be added to the module [
          <xref ref-type="bibr" rid="ref1 ref3">14,1,3,20</xref>
          ].
Completeness(M) =
8
&gt;&lt;true
&gt;: f alse
        </p>
        <p>jMj
i f å Axioms(Entityi(M)) j= Axioms(Entityi(O))</p>
        <p>i
otherwise
Example 7 Consider a source ontology, DOLCE-Lite [15], where the endurant
entity is defined as follows:
endurant v 8 part.endurant
endurant v spatio-temporal-particular
endurant v 9 participant-in.perdurant
endurant v 8 speci c-constant-constituent.endurant
endurant v : quality
endurant v : perdurant
endurant v : abstract</p>
      </sec>
      <sec id="sec-3-15">
        <title>If DOLCE were to be modularised to create a branch module, containing only the</title>
        <p>branch of Endurant entities, with the removal of perdurant entities called
DOLCEendurants, then the endurant entity is defined as follows:
endurant v 8 part.endurant
endurant v spatio-temporal-particular
endurant v 9 participant-in.perdurant</p>
      </sec>
      <sec id="sec-3-16">
        <title>The meaning of the endurant entity was not preserved in the module since the ax</title>
        <p>iom endurant v 8 speci c-constant-constituent.endurant existed in the original
ontology but not in the module. Therefore the DOLCE-endurants module has a
false value for the completeness metric.</p>
      </sec>
      <sec id="sec-3-17">
        <title>3.3. Relational Criteria</title>
        <p>Relational criteria deal with the relations and behaviour that modules exhibit with other
modules in a system of interrelated modules. We now list the relational criteria, alongside
practical examples.</p>
        <p>
          Inter-module distance: The inter-module distance in a set of modules has been described
as the number of modules that have to be considered to relate two entities [
          <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
          ].
This is measured by calculating the sum of modules that have to be considered
to relate two entities divided by the number of all possible relations in a set of
modules.
        </p>
        <p>Inter-module-distance(M)=
8&lt;Ei;E j2å(Mi;;Mn) j(Mi;::;MNn)Mj((j(EMi;iE;:j:);Mn)j 1) j(Mi; ::; Mn)j &gt; 1
:1
(9)
where NM(Ei; E j) is the number of modules to consider to relate entities i and j.
The product of j(Mi; ::; Mn)j(j(Mi; ::; Mn)j 1) represents the number of possible
relations between entities in a set of modules Mi; :; Mn.</p>
      </sec>
      <sec id="sec-3-18">
        <title>Example 8 Consider the set of inter-related modules in Figure 3. For each en</title>
        <p>
          tity pair, we have the number of modules, NM, that have to be considered to
relate them in Table 2. The sum of NM is 126. The number of entities is 9, hence
the j(Mi; ::; Mn)j(j(Mi; ::; Mn)j 1) value = 9(8). Thus the inter-module distance is
91(182) = 1:75. For the set, it takes 1.75 modules to relate two entities in the set.
Coupling: A measure of the degree of interdependence of a module [
          <xref ref-type="bibr" rid="ref5">5,18,17,19</xref>
          ]. To
measure the coupling of a module, we calculate a ratio of the number of external
links between a modules to every possible external link between modules.
where jMij is the number of entities in the current module and jM jj is the number
of entities in a related module in the set of n modules.
        </p>
      </sec>
      <sec id="sec-3-19">
        <title>Example 9 Consider module M1 from the set of inter-related modules in Fig</title>
        <p>ure 3. The number of external links that have to be considered to relate M1 to
other modules in the set, is 2. The number of possible external link between
a module M1 and the other modules in the system is calculated as follows:
jM1j(jM2j) + jM1j(jM3j) = 18. Hence the coupling(M1) = 128 = 0:11 which
indicates a low interdependence toward other modules in the system.</p>
        <p>Redundancy: Redundancy is the duplication of axioms within a set of ontology modules
[21]. To measure redundancy in a set of modules, we calculate the fraction of
duplicated axioms.
k
where å ni is the total number of axioms and n is the number of distinct axioms
i=1
in a module. The resulting fraction is a value of redundancy.</p>
      </sec>
      <sec id="sec-3-20">
        <title>Example 10 Consider the class declarations and axioms in the set of modules with</title>
        <p>no inter-related links that have been partitioned from a food ontology as shown in
Figure 4. There are 3 ontology modules: Fruit, Vegetable, and Meat. Axioms that
have been repeated more than once (redundant axioms) are shown in blocks. From
the three modules, there is a total of 21 axioms, i.e., the Axt value is 21. There are
15 distinct axioms that exist in the set of modules (these axioms exist at most once
and are those that are not blocks), hence Axd is 15. The redundancy of the set of
partitioned modules is thus 212115 = 0:29. Hence, 29% of the axioms in the set of
modules are redundant.</p>
      </sec>
      <sec id="sec-3-21">
        <title>3.4. Information Hiding Criteria</title>
        <p>Ontology modules sometimes deal with hiding aspects of the source ontology from the
module for privacy and simplification reasons. Information hiding within modules
assesses whether the module encapsulates all the information in the module such that the
privacy is preserved for each module. We now list the information hiding criteria,
alongside practical examples.</p>
        <p>
          Encapsulation: Encapsulation is a metric that holds when “a module can be easily
exchanged for another, or internally modified, without side-effects on the application
can be a good indication of the quality of the module” [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. It is measured using
the number of axioms in the given module and the number of axioms that occur in
both the given module and related modules.
n 1 jAxi jj
å
j=1 jAxij
        </p>
        <p>n
Encapsulation(M i) = 1
(12)</p>
      </sec>
      <sec id="sec-3-22">
        <title>Example 11 Consider the 3 ontology modules Fruit, Vegetable, and Meat, from</title>
        <p>Example 10. We calculate the encapsulation of the Fruit module as follows. There
are 7 axioms in the Fruit module, i.e., the Axi = 7. In the Vegetable module, there
are 3 overlapping axioms, i.e., they also exist in the Fruit module. In the Meat
module, there are 2 overlapping axioms, i.e., they also exist in the Fruit module. Hence,
the Encapsulation(Fruit) is calculated as 1 37 +3 72 = 0:76. Thus, 0.76 (76%), or a
large amount of the domain knowledge is encapsulated in the Fruit module but the
complete privacy of the Fruit module is not preserved.</p>
        <p>
          Independence: Independence evaluates whether a module is self-contained and can be
updated and reused separately [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. A module is independent if it has an
encapsulation value of 1 and a coupling value of 0.
        </p>
        <p>Ind(M i) =
(true</p>
        <p>Encapsulation(Mi) = 1 and Coupling(Mi) = 0
f alse otherwise
(13)
where jMij is the number of entities in the current module and jM jj is the number
of entities in a related module in the set of n modules.</p>
      </sec>
      <sec id="sec-3-23">
        <title>Example 12 Consider the 3 ontology modules in Example 10. We have already</title>
        <p>worked out the encapsulation value for the Fruit module in Example 11 as 0.76.</p>
      </sec>
      <sec id="sec-3-24">
        <title>There are no inter-related links between the modules hence the coupling value is 0. Since the encapsulation value is not 1, the conditions for independence do not hold for the Fruit module hence it is not independent.</title>
      </sec>
      <sec id="sec-3-25">
        <title>3.5. Richness Criteria</title>
        <p>The richness or amount of information in an ontology is designed as one aspect to
measure the quality of an ontology. For modules, this is important to measure in cases where
abstraction is employed to compare the granularity of the source ontology to that of the
module. Tartir et al. [23] propose measurable richness schema metrics. We now list the
richness criteria, alongside practical examples.</p>
        <p>Attribute richness: Attribute richness is defined as the average number of attributes per
class [23].
where att is measured by the number of data properties in the module jDPj and
jCj is the number of classes in the module. In an ontology, an attribute is used to
describe an entity and each attribute, or data type, has a name and value.</p>
      </sec>
      <sec id="sec-3-26">
        <title>Example 13 The pizza ontology has no data properties (attributes) defined. The</title>
      </sec>
      <sec id="sec-3-27">
        <title>AR value is 0, therefore there is no attribute richness in the pizza ontology.</title>
        <p>Inheritance richness: Inheritance richness is defined as the number of subclasses per
class in an ontology [23].</p>
        <p>AR(M) = jattj</p>
        <p>jCj
å jHC(C1;Ci)j
IRS(M) = Ci2C
(14)
(15)
where jHC(C1;Ci)j is the number of subclasses per class and jCj is the total number
of classes in the ontology.</p>
      </sec>
      <sec id="sec-3-28">
        <title>Example 14 Refer back to ontology O and module M from Figure 2. For module</title>
      </sec>
      <sec id="sec-3-29">
        <title>M, the entities which have subclasses are entity A with 2 subclasses, and entity</title>
        <p>B with 2 subclasses. Hence the sum of these subclasses is 2 + 2 = 4. There are 5
classes in total in M. The inheritance richness value for M is thus 45 = 0:8. Using
the same method we work out the inheritance richness value for ontology O, which
is 67 = 0:85.</p>
        <p>To assist with the lack of evaluation metrics and corresponding formulae in ontology
modules, we presented a comprehensive list of evaluation criteria for modules together
with examples on how to operationalise them for ontology modules. To put context to
the values for the metrics, experimentation was performed in other work stating which
values are appropriate for the metrics for particular module types [12]. In Section 5, we
show how each metric is used and what some expected values are.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Types of Modules for Evaluation</title>
      <p>In this work, we focus on evaluating certain types of ontology modules, i.e., those that are
lacking from an existing ontology modularisation framework and experiment [11]. The
modules were generated using the NOMSA modularisation tool 2. We briefly describe
each module type here with its corresponding abbreviation which we will use to describe
them in the remainder of the paper.</p>
      <p>Axiom abstraction (AxAbs): This is a module containing hierarchical relations
between entities, i.e., other relations are removed resulting in a bare taxonomy.
Vocabulary abstraction (VocAbs): This is a module where certain types of entities
are removed, for instance, the object properties or data properties of an ontology.
High-level abstraction (HLAbs): This is a module where entities at a higher level
in the hierarchy have precedence over the others.</p>
      <p>Weighted abstraction (WeiAbs): This is a module where weighting is assigned to
entities that are referenced by axioms more than others.</p>
      <p>Feature expressiveness (FeatExp): This is a module where some axioms of the
ontology are removed based on its language features.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experimental Evaluation</title>
      <p>In order to uncover information about how evaluation metrics relate to ontology modules,
we use the Tool for Ontology Modularity Metrics (TOMM) software tool [12] which
encompasses all the evaluation metrics described in Section 3. In this experiment, we
evaluate the module types from Section 4, in terms of its quality.</p>
      <sec id="sec-5-1">
        <title>5.1. Materials and Methods</title>
        <sec id="sec-5-1-1">
          <title>The method for the experiment is as follows:</title>
          <p>2http://www.thezfiles.co.za/modularisation/</p>
        </sec>
        <sec id="sec-5-1-2">
          <title>1. Take a set of modules.</title>
          <p>2. Run the TOMM metrics tool [12] for the modules to acquire module metrics.
3. Conduct an analysis from the metrics for each module.</p>
          <p>
            The materials used for the experiment were as follows: TOMM metrics tool [12],
and a set of 128 ontologies that had been modularised using NOMSA ontology
modularisation tool 2 but derived from the set of ontologies described elsewhere [
            <xref ref-type="bibr" rid="ref6">6,13</xref>
            ]. Our
tests were carried out on a 3.00 GHz Intel Core 2 Duo PC with 4 GB of memory running
Windows 7 Enterprise. All the test files are available at http://www.thezfiles.co.
za/modularisation/testfiles_NOMSA.zip.
          </p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Results</title>
        <p>Each of the modules were evaluated using the metrics that were implemented in the
TOMM tool and the results for the numerical metrics are shown in Table 3. The atomic
sizes of the modules indicates that there are on average between 2.34- 3.80 axioms that
are grouped together in an atom for the modules. The appropriateness, which maps the
size of the ontology module to a function is less than 0.3 for all the modules. Seeing
that 1.0 is the optimal value for appropriateness, all the modules perform poorly for
this metric. For a module to have an optimal value, it must have a value close to 250
axioms. The intra-module distance values which indicate the distance between entities
in a module differ considerably, with the HLAbs modules having the lowest value of 142
698.4 and the AxAbs having the highest value of 866 354.60. The cohesion of a module
indicates how closely related its entities are to each other, with higher values having a
large number of relations among entities. The cohesion is small for all the modules in
the set (less than 0.07). Most of the modules in this set do not contain attributes, as the
attribute richness is less than 1 for all the modules. The number of subclasses per class
is between 2.72 to 4.84 as indicated by the inheritance richness for the modules. In this
set, all the modules were considerably smaller than the original ontologies, as indicated
by relative size values that are less than 1. WeiAbs modules are the smallest at 0.26
(26% the size of the original ontology) while VocAbs modules are 0.85 (85% the size
of the original ontology). It is also important to determine whether the entities in the
modules have moved closer to each other in a module compared to the original ontology.
AxAbs modules, VocAbs modules, and FeatExp modules have values less than 1 for the
relative intra-module distance, which indicate that their entities are further in relation
to the entities of the original ontology. While in HLAbs and WeiAbs the intra-module
distance is larger, indicating that for these modules, the entities have moved closer as
compared to the original ontology.</p>
        <p>We have discussed the evaluation metric values for the set of ontology modules
in isolation, i.e., without an indication of whether the modules in question appear to
be good or of poor quality. To check if the modules are of good quality we refer to
an existing benchmark dependency between the proposed ontology metrics and various
types of ontology modules. The notion here is that for each module type, there is a pattern
or dependencies between type and some set of metrics. This is shown in Table 4 with
expected values for each type of module. The values that are underlined were not met
for this experiment. For instance, reading Table 4, it states that for WeiAbs modules
the cohesion value of range 0-0.25 and the relative size value of range 0.26-0.5 is met.
However, for AxAbs, the cohesion value is met but the correctness value is not met.</p>
        <p>We now examine the failed metrics. For appropriateness, it appears that the number
of axioms in a module need to be between 167- 333 to have a value of between
0.751. In some cases, the original ontologies were already less than 167 axioms in size, so
there could never reach an appropriate value. For the case of ontology modules, which
by definition are a reduction of an ontology, this appropriateness metric needs to be
redesigned to include modules that are less than 167 axioms. The next failed metric is the
correctness value, which needs to be true, but tested false for some modules, meaning that
some new axioms were added to the modules, i.e., axioms that were not from the original
ontology. An inspection of the log files for the metrics revealed that OWL enumeration
was used to declare individuals in an original ontology. In resulting modules, when the
collection of individuals were broken up, they were re-represented as named instances
using class identifiers. This means that new knowledge was not added to the module, but
because of language syntax, it appears so. The ontology metrics tool therefore needs to
recognise that this is not a new axiom to measure the correctness value accurately.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Discussion</title>
        <p>The evaluation criteria for ontology modules were already presented in previous work
[10]. However, there was limited support for operationalising these metrics. In this paper,
we provide practical examples on how the metrics can be applied to an ontology module
to measure them. We also structured and grouped together the list of evaluation criteria to
various higher level categories to differentiate those that examine, say structural aspects
of a module (structural criteria) to those that examine how well the modules hide certain
information of the source ontology (information hiding criteria). This could aid ontology
developers in selecting the relevant evaluation criteria for their use-case.</p>
        <p>We performed an experiment using various types of modules that were generated
using NOMSA modularisation tool. The modules were evaluated with TOMM to reveal
their metrics. The metrics indicate important information such as, the rate at which
entities in a module move closer to each other (using the relative intra-module distance
metrics), whether a module is rich with attributes, at what rate the size of the module differs
as compared to the source ontology. These metrics are important for gauging how well
the modules would fare with visualisation and comprehension tools or if there would be
some performance issues due to redundancy, among other cases.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>Initial work on evaluation criteria was refined by grouping the evaluation criteria list into
categories to structure them to aid ontology developers. We summarised the list of
evaluation criteria, together with practical examples on how to use them. An experimental
evaluation revealed how the metrics can be used to evaluate modules and uncover
information about how well certain modules fare for certain applications. We also
identified a problem with the appropriateness value, which cannot be applied in some cases
if a module has a small axiom size which needs to be changed. Another problem that
was identified is that some knowledge is recognised as new knowledge by the evaluation
metrics tool due to OWL representation issues.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Cuenca</given-names>
            <surname>Grau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Parsia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Sirin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Kalyanpur</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Modularity and Web Ontologies</article-title>
          .
          <source>In: 10th International Conference on Principles of Knowledge Representation and Reasoning (KR'06)</source>
          . pp.
          <fpage>198</fpage>
          -
          <lpage>209</lpage>
          . AAAI Press (
          <year>2006</year>
          ), june 2-5,
          <string-name>
            <given-names>Lake</given-names>
            <surname>District</surname>
          </string-name>
          , United Kingdom.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlicht</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Ontology modularization for knowledge selection: Experiments and evaluations</article-title>
          . In: Wagner,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Revell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Pernul</surname>
          </string-name>
          ,
          <string-name>
            <surname>G</surname>
          </string-name>
          . (eds.)
          <source>The 18th International Conference on Database and Expert Systems Applications (DEXA'07). Lecture Notes in Computer Science</source>
          , vol.
          <volume>4653</volume>
          , pp.
          <fpage>874</fpage>
          -
          <lpage>883</lpage>
          . Springer (
          <year>2007</year>
          ), regensburg, Germany, September 3-
          <issue>7</issue>
          ,
          <fpage>2007</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlicht</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Criteria and evaluation for ontology modularization techniques</article-title>
          .
          <source>In: Modular Ontologies: Concepts</source>
          ,
          <source>Theories and Techniques for Knowledge Modularization, Lecture Notes in Computer Science</source>
          , vol.
          <volume>5445</volume>
          , pp.
          <fpage>67</fpage>
          -
          <lpage>89</lpage>
          . Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Freeman</surname>
            ,
            <given-names>L.C.</given-names>
          </string-name>
          :
          <article-title>Centrality in social networks conceptual clarification</article-title>
          .
          <source>Social Networks</source>
          <volume>1</volume>
          (
          <issue>3</issue>
          ),
          <fpage>215</fpage>
          -
          <lpage>239</lpage>
          (
          <year>1978</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] Garc´ıa, J., Pen˜alvo,
          <string-name>
            <surname>F.J.G.</surname>
          </string-name>
          , Thero´n, R.:
          <article-title>A survey on ontology metrics</article-title>
          . In: Lytras, M.D., de Pablos,
          <string-name>
            <given-names>P.O.</given-names>
            ,
            <surname>Ziderman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Roulstone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Maurer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.A.</given-names>
            ,
            <surname>Imber</surname>
          </string-name>
          , J.B. (eds.)
          <source>Third World Summit on the Knowledge Society</source>
          ,
          <source>(WSKS'10)</source>
          .
          <source>Communications in Computer and Information Science</source>
          , vol.
          <volume>111</volume>
          , pp.
          <fpage>22</fpage>
          -
          <lpage>27</lpage>
          . Springer (
          <year>2010</year>
          ), corfu, Greece,
          <source>September 22-24</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Gardiner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsarkov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Framework for an automated comparison of description logic reasoners</article-title>
          .
          <source>In: The 5th International Semantic Web Conference (ISWC'06). Lecture Notes in Computer Science</source>
          , vol.
          <volume>4273</volume>
          , pp.
          <fpage>654</fpage>
          -
          <lpage>667</lpage>
          . Springer (
          <year>2006</year>
          ), athens, GA, USA, November 5-
          <issue>9</issue>
          ,
          <fpage>2006</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Guarino</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          :
          <article-title>Formal ontology and information systems</article-title>
          . In: Guarino, N. (ed.)
          <source>Proceedings of the 1st International Conference on Formal Ontology in Information Systems (FOIS'98)</source>
          . pp.
          <fpage>3</fpage>
          -
          <lpage>15</lpage>
          .
          <source>Frontiers in Artificial intelligence and Applications</source>
          , IOS Press, Amsterdam, The Netherlands (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Herre</surname>
          </string-name>
          , H.:
          <article-title>General Formal Ontology (GFO): A foundational ontology for conceptual modelling</article-title>
          .
          <source>In: Theory and applications of ontology: computer applications</source>
          , pp.
          <fpage>297</fpage>
          -
          <lpage>345</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>