<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SOMET: Algorithm and Tool for SPARQL Based Ontology Module Extraction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Paul Doran</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ignazio Palmisano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valentina Tamma pdoran</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>i.palmisano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>valli@liverpool.ac.uk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Liverpool</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>The di erent ontology module extraction techniques need drawing together under a common framework, to allow for the selection, adaptation and combination of existing techniques. The current work upon a common framework places a signi cant overhead upon the user. The SOMET algorithm and tool uses the W3C standards, RDF and SPARQL. This allows the selection, adaptation and combination to be a simple manipulation of a SPARQL query. SOMET replicates the behavior of some of the existing techniques and there is work in progress in approximating approaches not fully captured. A preliminary evaluation of the e ciency of these techniques within SOMET is also conducted. A discussion is presented highlighting the advantages and disadvantages of SOMET.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>There are a plethora of techniques for carrying out ontology module extraction
[1{5]. All of these techniques are designed for di erent applications and contain
di erent assumptions about the problem as whole. Thus, there is a need to draw
this work together within a common framework; the key advantage of a common
framework is the ability to select, adapt and combine the di erent approaches.
Also, development of new techniques is made easier, since common technical
issues, such as whether reasoning is used or not, are already tackled in the
framework and no e ort is required by the developer.</p>
      <p>
        The notion of a common framework is not new and has already been raised by
d'Aquin et al [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Borgida and Giunchiglia[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] propose that the common
framework be based upon graph transformations, whereby the ontologies are
represented as a graph. In contrast [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] propose the use of a Tell/Ask interface.
      </p>
      <p>
        The drawback of these approaches is that there could be a substantial
overhead in adapting and combining the existing ontology module extraction
approaches within their frameworks. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]'s use of graph transformations requires
Semantic Web practitioners and Ontology Engineers to become familiar with a
new technique, which currently has no standardised syntax. Whilst the Tell/Ask
interface used by [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] may be more familiar, since there is the DIG Interface 1 for
      </p>
      <sec id="sec-1-1">
        <title>1 http://dig.sourceforge.net/</title>
        <p>Description Logic, there can be a signi cant overhead in adding functionality to
the interface.</p>
        <p>
          As such this paper proposes to use SPARQL[
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] and RDF[
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] as the common
framework, both of which are W3C2 standards. All OWL ontologies can be
represented as an RDF graph and SPARQL is a query language for RDF. Thus,
this paper shows how it is possible to cast the traversal based ontology module
extraction approaches as a series of SPARQL queries upon an RDF graph. Note
that adapting and combining the di erent ontology module extraction techniques
only requires the manipulation of SPARQL queries. A more detailed explanation
of how this can be accomplished is in Section 4.
        </p>
        <p>This has been implemented in SOMET, which is available as a stand-alone
tool. The tool replicates the behavior of the current traversal based extraction
methods. Mechanisms for allowing the user to add their own queries to the
extraction process are also available. Thus, SOMET achieves the aim of providing a
common framework in which to represent these approaches. This is done within
an environment with which most Semantic Web practitioners and Ontology
Engineers are familiar with.</p>
        <p>In Section 2 the related work is presented. Section 3 introduces RDF and
SPARQL. Then Sections 4 and 5 present the SOMET framework and algorithm
respectively. SOMET is evaluated in Section 6. A discussion is presented in
Section 7. Lastly, Section 8 concludes.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        The literature on ontology modularization can be divided into two categories:
ontology partitioning and ontology module extraction. Ontology partitioning
[
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ] divides an input ontology into a number of partitions, not necessarily
disjoint. In contrast, ontology module extraction[1{5] takes an input ontology
and extracts an ontology module about a supplied signature. Each approach has
a bias in how to extract an ontology module based upon their assumptions and
standpoint.
      </p>
      <p>
        Doran, Tamma and Iannone[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] focus on extracting an ontology module that
describes a single, user supplied, concept for the purpose of ontology reuse. Their
approach is agnostic with respect to the language the ontology is expressed in,
as long as the ontology can be transformed into their Abstract Graph Model
for Ontology Module Extraction. The traversal carried out for extraction is done
conditionally, with the conditions changing to suit the language that the ontology
is expressed in. For example, if the start concept is involved in a disjoint relation
then this will not be traversed.
      </p>
      <p>
        d'Aquin, Sabou and Motta[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] present an extraction process which forms part
of a knowledge selection process. The constraints posed by this process prioritise
a reduction in the size of the module produced. To achieve this not all the
superclasses of the included classes are included in the module. Shortcuts are taken in
      </p>
      <sec id="sec-2-1">
        <title>2 http://www.w3.org/</title>
        <p>
          the class hierarchy by only including the most speci c common super-classes (by
applying the LCS, Least Common Subsumer[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], restricting the possible values
for LCS to the classes already existing in the ontology).
        </p>
        <p>
          Seidenberg and Rector[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] developed a technique speci cally for the Galen3
ontology, but it is possible to take the generic core and apply it to any ontology.
It takes one or more classes of the ontology as an input, and anything that
participates (even indirectly) to the description (and so the de nition) of an
included class has to be included. To reduce module size without losing relevant
information, the authors lter Galen properties on the base of the property
hierarchy.
        </p>
        <p>
          Noy and Musen's[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] approach is based upon the notion of traversal view
extraction. It is made available via the PROMPT[
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] plugin for Protege. Starting
from one class of the considered ontology, this approach traverses the relations of
this class recursively to include related entities. It can be distinguished from the
other approaches in that it is intended as an interactive tool. This allows the user
to incrementally build ontology modules by extending the currently extracted
one. The user is allowed to select the relations to be traversed and associates to
each of them a depth at which they should stop being traversed.
        </p>
        <p>
          To some extent it is possible to cast [
          <xref ref-type="bibr" rid="ref2 ref3 ref5">2, 3, 5</xref>
          ] as instantiations of [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. However,
[
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ] cannot be completely captured by the functionalities of [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] do not allow
for the collapsing of the hierarchy, as used by [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]; or for the non-traversal of
certain relations at speci c points, as required by [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. However, despite this all
of these approaches can be seen as traversal approaches.
        </p>
        <p>The orthogonal approach to the traversal approaches is the logical approach.</p>
        <p>
          Cuenca-Grau et al [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] de ne a module as a minimal, conservative extension
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] of the original ontology with respect to the considered sub-vocabulary. For
Cuenca-Grau et al [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] a module is that if the new axioms that are added to the
original ontology are a conservative extension[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] of the original signature, then
the original ontology (not the extension) is said to be a module of all the axioms
together. Being a conservative extension means that the meaning of the terms
in the input sub-vocabulary is completely captured by the module, as it is in the
source ontology. In [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], Cuenca-Grau et al also show that computing a minimal
module considering the de nition they provide is undecidable, and describe two
di erent approximations based on the notion of locality. The rst method makes
use of a reasoner to check the semantic locality of the axioms to be included.
This procedure is decidable. The second one syntactically tests the locality of
axioms and can be realised in polynomial time. This approach is not currently
included in SOMET, but doing so is a focus of the future developments.
        </p>
        <p>
          There has been some preliminary work on providing a common framework
for ontology module extraction. d'Aquin et al [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] propose to use graph
transformations as a common framework. An abstract model for representing ontologies
as attributed graphs is described, along with the reformulation of existing
module extraction techniques as graph transformation rules. The aim being to
produce a parametric modularization tool. The approach presented by Borgida and
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>3 http://www.opengalen.org/</title>
        <p>
          Giunchiglia[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] can also be viewed as a step towards a common framework. They
provide a Tell/Ask interface for the functional importing of knowledge bases.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>RDF Graph and SPARQL</title>
      <p>The Resource Description Framework (RDF) is a W3C standard which provides
a common syntax and semantics for metamodelling of data. The semantics of
RDF can be represented as a graph. An RDF graph (a labelled, directed graph)
is a set of RDF triples. In RDF triples represent &lt;subject, predicate, object &gt;
relations. For example, consider the following set of triples.</p>
      <p>&lt; C, referees, : x &gt;
&lt; A, playsIn, : x &gt;
&lt; B, playsIn, : x &gt;</p>
      <p>These triples are also a valid RDF graph and can be represented by the graph
shown in Figure 1. ( : x in the above is a blank node4)
playsIn
A</p>
      <p>_:x
referees</p>
      <p>C
playsIn</p>
      <p>B</p>
      <sec id="sec-3-1">
        <title>SELECT ?x WHERE f ?z referees ?x g</title>
        <p>This query will return all ?x where the given pattern is matched. Following
from the example the result for this query will be C. It should be noted that it
is possible that ?z could be satis ed by multiple resources.</p>
        <sec id="sec-3-1-1">
          <title>4 http://www.w3.org/TR/rdf-concepts/#dfn-blank-node</title>
        </sec>
        <sec id="sec-3-1-2">
          <title>5 http://www.w3.org/TR/owl-semantics/mapping</title>
        </sec>
        <sec id="sec-3-1-3">
          <title>6 The PREFIX part of the query has not been included to improve readability.</title>
          <p>However, for SOMET we need to be able to return an RDF graph. The query
shown below will return an RDF graph of the results.</p>
          <p>CONSTRUCT f ?z referees ?x g WHERE f ?z referees ?x g
The ability to return an RDF graph as a result of a query is important for
SOMET as it allows for the iterative construction of an ontology module.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>SOMET Framework</title>
      <p>The SOMET framework is shown in Figure 2. The di erent approaches can be
assimilated to a set of SPARQL queries. This is extensible as it allows the user
to add/remove SPARQL queries from the set of queries that will be passed to
the Traversal Extraction Engine. This exibility is important because it allows
the user to tailor the extraction process to their speci c needs. This bene t is
further enhanced due to the fact that the queries are represented in SPARQL,
which is a standard that Semantic Web practitioners and Ontology Engineers
are familiar with.</p>
      <p>The di erent ontology module extraction techniques are represented as a
set of SPARQL queries. These sets are not disjoint and so their intersections
highlight the areas in common between the techniques.</p>
      <p>SPARQL Queries</p>
      <p>Traversal
Extraction
Engine
Ontology</p>
      <p>Module
PROMPT</p>
      <p>Galen</p>
      <p>D'Aquin</p>
      <p>Doran
Signature</p>
      <p>Ontology</p>
      <p>
        The Traversal Extraction Engine (TEE) is implemented in Java and makes
use of Jena7, a programmatic environment for RDF, OWL and SPARQL; and
Pellet [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], a OWL-DL reasoner. The user supplies the extraction engine with an
ontology to extract an ontology module from and a signature that the ontology
module should be about. From this the Traversal Extraction Engine is able to
produce an ontology module by applying the algorithm presented in the following
section.
      </p>
      <p>Since the description of a strategy is broken down in a set of SPARQL queries,
building a hybrid technique is simple. Each SPARQL query is represented by
a ExtractionQuery object, which contains the query template, the number of
arguments the query accepts and whether or not the arguments are allowed to
be blank nodes. The set of queries is then the input to the TEE, along with
the ontology. In order to add the characteristics of di erent approaches to an
existing one, new ExtractionQuery objects must be created and added to the set
used to invoke the TEE.</p>
      <p>The TEE also has an optional ltering step, where the resulting module is
reduced in size by running another set of SPARQL queries (in this phase limited
to SELECT queries). Any resource matched by the queries is then removed from
the resulting module.
5</p>
    </sec>
    <sec id="sec-5">
      <title>SOMET Algorithm</title>
      <p>The algorithm used by SOMET is a paramaterised graph traversal algorithm,
see Algorithm 1. The parameters to the algorithm are the ontology you wish to
extract a module from, a signature to describe the module and a set of SPARQL
queries. The algorithm iteratively applies the queries to the elements of the
signature to construct a module; any element returned by a query that is not
in the signature also has the queries applied to it. By doing this the ontology
module is iteratively constructed.
5.1</p>
      <p>Breakdown of SPARQL queries used.</p>
      <p>
        This section will present the queries used by the techniques currently
implemented in the SOMET framework. A brief description will be provided for each
query; %1$s present in these descriptions represents the current resource of
focus8. This section does not list all the queries used, due to space limitations, but a
full list can be found at http://www.csc.liv.ac.uk/~pdoran/SOMET/queries.html.
Doran, Tamma and Iannone [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] present an approach focused on extracting
an ontology module about a single concept, with the aim that this ontology
module can be reused. The queries used are:
      </p>
      <sec id="sec-5-1">
        <title>7 http://jena.sourceforge.net</title>
      </sec>
      <sec id="sec-5-2">
        <title>8 this syntax is the Java way of representing an argument which must be replaced with</title>
        <p>a string. http://java.sun.com/j2se/1.5.0/docs/api/java/lang/String.html#format
Algorithm 1 SOMET Graph Traversal Algorithm for Ontology Module
Extraction</p>
        <p>INPUT
{ O - An RDF Graph (O=(V ,E))
{ S - a signature ; where S V \ E
{ Q - a set of SPARQL queries.</p>
        <p>OUTPUT</p>
        <p>{ M - An RDF Graph
procedure graphT raversalExtraction(O, S, Q)
Visited - a container of V that have been visited
ToVisit - a container of V to be visited.
insert S into ToVisit
while ToVisit is not empty do
x = rst element of ToVisit
remove x from ToVisit
if x 2= Visited then
for all q 2 Q do
r = apply q to O
subjects in r are added to ToVisit</p>
        <p>M = M + r
end for
end if
end while
DESCRIBE %1$s</p>
        <p>Describes the current resource. The DESCRIBE keyword de nition in the
SPARQL Recommendation is labeled as \informative", which means the
speci c SPARQL implementation answering the query has some degree of freedom
about what to consider part of the description of a resource. We use ARQ9, which
implements the DESCRIBE as a list of the statements in which the resource
appears as subject, plus the closure computed from any blank nodes involved.</p>
        <p>CONSTRUCT f?y rdfs:domain %1$s.g WHERE f?y rdfs:domain %1$s.g
Returns all the ?y where the resource of interest is the domain.</p>
        <sec id="sec-5-2-1">
          <title>DESCRIBE ?y WHERE f?y rdfs:subClassOf %1$s.g</title>
          <p>Returns the subclasses of the current resource.</p>
          <p>CONSTRUCT f?y owl:equivalentClass %1$s.g WHERE f?y owl:equivalentClass
%1$s.g
Returns all the ?y that the current resource that are owl:equivalentClass</p>
        </sec>
      </sec>
      <sec id="sec-5-3">
        <title>9 http://jena.sourceforge.net/ARQ/</title>
        <p>
          d'Aquin et al [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] present an approach requiring the extraction of a module for
the dynamic scenario of knowledge selection. Our implementation includes the
implementation of a LCS operator as a ARQ extension10. In a similar way, MSC
and most speci c property ancestor are implemented, as property functions.
        </p>
        <p>It is important to note that this approach requires that the input Jena Model
containing the ontology be backed by a reasoner, in order to get correct results;
in fact, subsumption is necessary to correctly compute the LCS.</p>
        <p>An example of how to use the extensions is in the following query:
DESCRIBE ?msc
WHERE fGRAPH %2$sfff?a a owl:Classg
FILTER(isURI(?a) &amp;&amp; !sameterm(?a,%1$s))g
UNIONff?a a rdfs:Classg
FILTER(isURI(?a) &amp;&amp; !sameterm(?a,%1$s))gg
f?msc &lt;java:sparking.propertyfunctions.lcs&gt; (?a %1$s).gg</p>
        <p>It returns a graph containing the de nition of the class that is the LCS that
subsumes the current class and another class, which is already included in the
module (this condition is enforced by the GRAPH instruction, which uses the
support to Named Graphs in SPARQL to restrict the solutions to a subset of
the whole ontology).</p>
        <p>
          Seidenberg and Rector [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] present an approach focused on extracting a
module from Galen, but the core of the technique can be applied to any ontology. The
queries for subclasses and equivalences are similar to those used in the Doran,
Tamma and Iannone queries.
        </p>
        <p>Our replication of this approach includes a ltering step, in which a set of
properties de nition is discarded, as in the original approach:</p>
        <p>SELECT ?y WHERE ff?y rdfs:subPropertyOf</p>
        <p>&lt;http://www.co-ode.org/ontologies/galen#FunctionalAttribute&gt;g"
This query selects any subproperty of http://www.co-ode.org/ontologies/galen#FunctionalAttribute;
the TEE will then remove those de nitions from the module.</p>
        <p>
          Noy and Musen [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] present an approach for extracting a module based on
the notion of view and this approach is highly user con gurable. The query used
is:
        </p>
        <sec id="sec-5-3-1">
          <title>CONSTRUCT f ?x %2$s %1$s g</title>
          <p>WHERE f ?x %2$s %1$s g
As PROMPT is user con gurable the query is also user con gurable. %$1s
remains the current resource, whilst we introduce a second parameter (%$2s) to
represent the predicate. The traversal depth becomes an extra parameter to the
traversal algorithm, which needs to be passed to PROMPT for processing.</p>
          <p>In SOMET, we don't attempt to reimplement the whole of PROMPT so far;
we plan to do so in future developments.
10 ARQ extensions are explained at http://jena.sourceforge.net/ARQ/extension.html</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Preliminary Evaluation</title>
      <p>There are two aspects to consider in evaluating SOMET, these are:
1. What can be said about these techniques in terms of e ciency?
2. Does SOMET replicate the existing ontology module extraction techniques?</p>
      <p>The rst aspect of the evaluation is di cult. There is a need for the ontology
modularisation community as whole to address the issue of evaluation. Existing
evaluation criteria used to measure the `quality' of the modules produced are
insu cient; however, this point is beyond the scope of this paper. Considering
this we evaluated how well the di erent techniques performed on the SOMET
framework. These results are shown in Table 1. In this table we report running
time average (in milliseconds) and triple size average for the three approaches.
For the three ontologies presented here, we ran a di erent number of tests: 50 on
the Galen fragment, 48 for Mind.owl and 285 for Portal.owl; in the last two cases,
these are the number of classes in the ontologies, while for Galen we limited the
test to only 50 of the 4500 classes in there (choosing a random set of concepts),
due to the average time requested to run on this larger ontology.</p>
      <p>Seidenberg Doran d'Aquin Seidenberg tDroiprlaen
time avg time avg time avg triple avg
Galen
fragment
Mind.owl
Portal.owl</p>
      <p>In Figure 3, the performance of SOMET on the Galen fragment is depicted
graphically; in Figure 4, we report the triple size of the extracted modules (here,
Terms refers to the starting points for each extracted modules). The data in
Table 1 shows that the d'Aquin et al approach on average is quicker and produces
smaller modules; this is highlighted further by Figures 3 and 4. However, this
does not say anything about the quality of the modules produced by the di erent
approaches. Looking across the data as a whole it is possible to see that there
are certain 'zones', this would suggest that these 'zones' conform to a coherent
module. This aspect requires further investigation.</p>
      <p>The second aspect of the evaluation requires an in-depth set of experiments
to prove that the SOMET framework fully captures all the functionalities of
the di erent ontology module extraction techniques. This has been completed
for Seidenberg and Doran approach, over Galen (Seidenberg implementation is
targeted at this ontology, therefore validating using di erent ontologies is hard).
Further experiments on d'Aquin approach are ongoing at the time of writing;
avg
we do not report them here since including a partial set of possibly inconclusive
results would not be appropriate.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Discussion</title>
      <p>A common framework provides a fair basis for comparing the di erent ontology
module extraction techniques. The ability to compare the techniques in this
way is important because it will allow an Ontology Engineer to discriminate
objectively between the di erent techniques. Indeed the user can try the di erent
techniques in a convenient manner.</p>
      <p>
        Whilst a common framework is an important step forward it further
highlights the need for robust mechanisms for evaluation. Current attempts at an
objective evaluation based on size, or precision and recall [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] are not optimal.
      </p>
      <p>
        The precision and recall metrics applied by [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] only consider the taxonomic
structure and, as such, Doran et al state that these need extending to capture
the more expressive elements of an ontology. This is a focus of current research.
      </p>
      <p>
        The aim of modularization in general is to reduce the size of an ontology,
but this is not an end in itself because it introduces the obvious paradox that
the optimum module size is 0. Whilst an ontology module of size 0 would be
highly reusable and the e ort required negligible, it is evident that it is also
useless, therefore size must be traded o with some other metric. Finding such
objective metric is not a straightforward task; for a comprehensive overview of
the literature on ontology evaluation see [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
8
      </p>
    </sec>
    <sec id="sec-8">
      <title>Conclusion and Future Work</title>
      <p>SOMET provides a common framework, based on RDF and SPARQL, for the
selection, adaptation and combination of di erent ontology module extraction
techniques. It was shown that it is possible to replicate the current techniques as
SPARQL queries. Even di cult aspects of certain techniques, such as the most
speci c concept, could be implemented. The preliminary evaluation conducted
allowed us to compare the e ciency of the di erent techniques, both in terms of
time and the size of the modules produced. However, this preliminary evaluation
highlighted the need for more robust objective criteria to compare the quality
of the modules produced.</p>
      <p>
        Future work includes assimilating the logical approaches, such as [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] into
the SOMET framework. This is an important step that needs to be taken. In
addition, there is a need to investigate more robust mechanisms for evaluating
the quality of a module. This is because the current objective criteria used are
not powerful enough.
      </p>
      <p>Acknowledgements This work was supported by the Engineering and Physical
Sciences Research Council (EPSRC).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Cuenca-Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Just the right amount: Extracting modules from ontologies</article-title>
          .
          <source>In: Proc. of the 16th International World Wide Web Conference (WWW</source>
          <year>2007</year>
          ), pages
          <fpage>717</fpage>
          -
          <lpage>727</lpage>
          , Ban ,
          <string-name>
            <surname>AB</surname>
          </string-name>
          , Canada, May
          <year>2007</year>
          . ACM Press.
          <article-title>(</article-title>
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Modularization: a key for the dynamic selection of relevant knowledge components</article-title>
          . In: First International Workshop on Modular Ontologies,
          <string-name>
            <surname>ISWC</surname>
          </string-name>
          <year>2006</year>
          , First International Workshop on Modular Ontologies,
          <string-name>
            <surname>ISWC</surname>
          </string-name>
          <year>2006</year>
          , Athens, Georgia, USA. (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Doran</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tamma</surname>
            ,
            <given-names>V.A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iannone</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Ontology module extraction for ontology reuse: an ontology engineering perspective</article-title>
          . In Silva, M.J.,
          <string-name>
            <surname>Laender</surname>
            ,
            <given-names>A.H.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>BaezaYates</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Olstad</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , Olsen, .H.,
          <article-title>Falca~o, A</article-title>
          .O., eds.: CIKM,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2007</year>
          )
          <volume>61</volume>
          {
          <fpage>70</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          :
          <article-title>Specifying ontology views by traversal</article-title>
          . In: International Semantic Web Conference. (
          <year>2004</year>
          )
          <volume>713</volume>
          {
          <fpage>725</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Seidenberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rector</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Web ontology segmentation: analysis, classi cation and use</article-title>
          .
          <source>In: WWW '06: Proceedings of the 15th international conference on World Wide Web</source>
          , New York, NY, USA, ACM Press (
          <year>2006</year>
          )
          <volume>13</volume>
          {
          <fpage>22</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>d'Aquin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doran</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tamma</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Towards a parametric ontology modularization framework based on graph transformation</article-title>
          . [
          <volume>17</volume>
          ]
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Borgida</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giunchiglia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Importing from functional knowledge bases - a preview</article-title>
          . [
          <volume>17</volume>
          ]
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. SPARQL:
          <article-title>Query language for RDF. W3C Recommendation -</article-title>
          15th
          <source>January 2008</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. RDF:
          <article-title>Resource description framework RDF</article-title>
          .
          <source>W3C Recommendation - 10th February 2004</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Cuenca-Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Automatic partitioning of owl ontologies using e-connections</article-title>
          .
          <source>In: Proceedings of the 2005 International Workshop on Description Logics (DL-2005)</source>
          .
          <article-title>(</article-title>
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Structure-based partitioning of large concept hierarchies</article-title>
          .
          <source>In: Proceedings of the 3rd International Semantic Web Conference</source>
          , Hiroshima, Japan (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgida</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirsh</surname>
          </string-name>
          , H.:
          <article-title>Computing Least Common Subsumers in Description Logics</article-title>
          . In: AAAI. (
          <year>1992</year>
          )
          <volume>754</volume>
          {
          <fpage>760</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          :
          <article-title>Prompt: Algorithm and tool for automated ontology merging and alignment</article-title>
          .
          <source>In: Proceedings of the Seventeenth National Conference on Arti cial Intelligence and Twelfth Conference on Innovative Applications of Arti cial Intelligence</source>
          , AAAI Press / The MIT Press (
          <year>2000</year>
          )
          <volume>450</volume>
          {
          <fpage>455</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walther</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Conservative extensions in expressive description logics</article-title>
          . In Veloso, M.M., ed.
          <source>: IJCAI</source>
          . (
          <year>2007</year>
          )
          <volume>453</volume>
          {
          <fpage>458</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Pellet: A practical OWL-DL reasoner</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>5</volume>
          (
          <issue>2</issue>
          ) (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Catenacci</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ciaramita</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
          </string-name>
          , J.:
          <article-title>Ontology evaluation and validation: an integrated formal model for the quality diagnostic task</article-title>
          .
          <source>Technical report</source>
          , Laboratory for Applied Ontology (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Cuenca-Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Honavar</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlicht</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
          </string-name>
          , F., eds.
          <source>: Proceedings of the 2nd International Workshop on Modular Ontologies, WOMO'07</source>
          ,
          <string-name>
            <surname>Whistler</surname>
          </string-name>
          , Canada, October
          <volume>28</volume>
          ,
          <year>2007</year>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>