<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On the Impact of Ontological Commitment</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Marian Nodine Telcordia Technologies 106 E. 6</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontological commitment, or the agreement to have your applications and users conform to a common domain understanding as encapsulated in one or more shared ontologies, is a noble goal and essential for open agent systems. Our experiences building ontology-based agent systems in multiple domains have shown us that the intention for a new application to locate and conform to some existing ontology or ontologies within its domain has many impediments to its success. For instance, the goals of the designer of a domain ontology include developing a complete and comprehensive domain description; however, the application developer may only require a small fragment of that ontology. Multiple applications that conform to the ontology may, in fact, use completely orthogonal fragments of the ontology, and not be able to interact at all. Users may insist on importing into the ontology sets of terms that are neither logically consistent nor easily modelable. With these issues in mind, in this paper we propose some guidelines for ontology development and evolution paradigm that should facilitate ontology reuse. These guidelines could underpin a usage model for ontologies; one that enables the application designer to reuse ontological concepts from multiple ontologies in a more flexible manner, while retaining the essentially good properties of ontology sharing and reuse. These guidelines affect both the design and use of ontology-based applications, as well as the way applications advertise themselves to other agents with which they may interoperate.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>There is a strong relationship between some specific ontology
and the logical rules and computational artifacts that use that
ontology, in that when they communicate among themselves, they
have some level of assurance that the same terms have the same
meanings to all. However, this use requires that the logical rules
and the computational artifacts have explicit linkages with the
ontology; often in the form of hard-coding the ontological terms
into the rules and/or the application code itself.</p>
      <p>Jerry Fowler
setenv, LLC</p>
    </sec>
    <sec id="sec-2">
      <title>Houston, TX</title>
      <p>
        (713)526-8641
In an agent-based system, common ontologies specify the
ontological commitments of a set of participating agents [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. An
ontological commitment is an agreement to use a vocabulary in a
way that is consistent with an ontology. An agent or human
committed to an ontology understands (some subset of) the
ontology and agrees to use it in a manner consistent with the
semantics of the ontology. Agents and humans committed to the
same ontology can share knowledge among themselves with
some confidence that they share an underlying understanding of
what is being said. Commitment to common, shared ontologies
facilitates openness in an agent-based system.
      </p>
      <p>
        In this paper, we examine the conflicting requirements and goals
of ontology designers, ontology-committed applications, and
ontology-aware users, and their respective impact on the problem
of ontology commitment and reuse. As ontological sharing and
reuse increases, the gap between the ontology designers and the
ontology users grows larger. Our goal is to analyze what issues
inhibit reuse and to propose strategies for facilitating reuse. In
particular, we consider the problem of reuse of ontologies whose
specification is complete, for applications whose requirements
were not considered during the design of the ontology. This
problem is not addressed in the ontology design methodologies
summarized in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. We develop guidelines and approaches for
agents to use existing ontologies in a more flexible manner.
Throughout the paper, we relate the issues to real issues we have
encountered within the context of one of our applications, EDEN
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. EDEN1 is an agent-based system developed for the purpose
of inter-organizational sharing of environmental data collected,
stored and monitored by multiple government agencies and
nongovernment scientists spread throughout the US and Europe, and
relating information from these disparate data sources and
schemas at a semantic level as needed by the users. EDEN uses
ontologies to represent the semantics of the underlying
information, and real and varied databases to populate those
ontologies with instances.
      </p>
      <sec id="sec-2-1">
        <title>2. ONTOLOGICAL COMMITMENT</title>
        <p>
          Because ontologies are meant to facilitate sharing and reuse of
knowledge, it is important that the ontology and its collection of
users (both human and agent) align themselves to a shared view
of the domain during the process of designing and evolving the
ontology for that domain. However, many existing ontologies
have been developed either by designers attempting to
characterize a domain (with no real computational applications
that use them) or by application developers to support individual
applications (with no real sharing of the ontology with other
1 EDEN was funded jointly by the DOD, DOE, EPA and EEA.
applications). The plethora of existing ontologies argues that
many concepts are already represented within some ontology, so
reuse of these ontologies, increasing the ontological commitment
level, is now feasible. For example, some ontologies such as the
Unified Medical Language System (UMLS) Metathesaurus [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]
have achieved a higher level of commitment.
        </p>
        <p>There are several issues that impede this sharing and knowledge.
First are issues related to the conflicting goals and objectives of
ontology definers, ontology-committed applications, and users of
such applications. Second are issues of the mutual conflict
between ontological commitment and ontological evolution.
Third are issues of conceptual mismatches between ontologies
and the applications and users that use them. Unfortunately,
these issues stem from fundamental issues and characteristics of
the problem of sharing ontologies so broadly.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.1 Conflict: Definer, Application, User</title>
        <p>The commitment to ontologies is hampered by the conflicting
goals of ontology definers, developers of ontology-committed
applications and ontology-committed application users, and often
by the confusion of users and definers over the demands of
ontology-committed applications. Here, we use the descriptor
“ontology-committed” to mean that something purports to use the
terms in the ontology(-ies) in a manner consistent with its (their)
definitions.</p>
        <p>
          The goal of the ontology designer, at least one working towards
maximizing the usefulness of his ontology to a wide variety of
applications, is to completely characterize a particular domain at
the semantic level. The ontology designer needs expertise in
knowledge representation and in the domain of the ontology. His
intent is to develop a comprehensive and up-to-date ontology,
with a broad set of acceptable terms. His job is hampered by the
fact that the different domain experts have different viewpoints
of the domain, and these viewpoints must be reconciled within
the ontology itself, placing pressure on the designer to either take
a commonly agreed-upon but consistent subset of the domain, or
to attempt to placate everyone by including everything. The likely
result is either the ontology will express concepts at a higher
level than can be used effectively by an application that seeks to
attach suitable labels to real data, or the terminology within the
ontology will not be logically consistent and thus easily
incorporatable into an application. Fortunately, our experience
indicates that many terminology disagreements can be addressed
by synonymy. For instance, what the EPA refers to as a “site” the
DOD calls a “facility.” What the EPA calls an “operable unit” (a
specific location contained in an EPA “site”) the DOD calls a
“site.”
The goal of the application designer is to use the ontology to
support an application. The application designer has expertise in
building applications, especially within given domains. His intent
is to develop an efficient and useful application. It is likely that
the application will not cover the whole domain, or may span
multiple domains. Also, applications need to focus on some
issues that are unimportant to the designer. For instance, a failure
to take sufficient care of value representation issues in the
ontology may force the application designer to code this
information directly into the application, even though it is really
domain-related knowledge. For example, within the
Environmental Data Registry [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], it was difficult to relate
measurements of chemicals between different representations
because the measurement ontology was “well understood,” and
often explicit in the data representations themselves. Even terms
such as “milligrams per kilogram” were sometimes represented
inconsistently in text fields. This posed a problem in extracting
values for computation in this “well understood” measurement
system.
        </p>
        <p>The desire of the application user is to be able to do his job well
and easily. Application users have expertise in their own jobs
and potentially in the domain of the application, but may have
minimal understanding of knowledge representation or
application design issues. Typical users want an understandable,
natural, easy-to-use interface to the application that facilitates
their work. This implies that they want the scope of the ontology
restricted to the exact area within the domain that they are
specifically concerned with. Another issue is that a user may
have a comfortable vocabulary of domain-related terms that do
not map well to the ontology representation model, to the domain
model that the ontology developer had in mind or to the needs of
the application itself.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.2 Conflict: Commitment, Evolution</title>
        <p>
          A second hampering factor when considering ontological
commitment itself is the fact that ontological commitment
impedes ontology evolution, and vice versa. Yet, both of these
are necessary for an ontology to be successful. As Mark Tuttle
et.al. say, “if you don’t use it, it won’t improve, if you don’t
improve it, it won’t get used.” [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. In other words, use brings
out the need to change, change is required for continued use.
As stated before, a measure of ontological commitment is the
number of agents and humans that are committed to using the
ontology. Each of these committed entities has made the effort to
learn the ontology and to use the concepts therein in a manner
consistent with their definition. However, the ontology itself may
have problems, or may be incomplete in some aspect, or not
reflect changes in the domain itself. In these situations, it
becomes desirable at some point to evolve the ontology,
providing an updated version. Initially, there is no real
commitment to the updated version; rather, the entities
committed to the old ontology must explicitly learn the new
ontology, and adapt to the changes therein. This leads to the
observation that, as commitment to an ontology rises, so does the
level of effort required to accommodate an evolutionary step.
Development of an application represents a strong degree of
ontological commitment. The incorporation of an ontology into an
application may require specific coding with respect to the
ontology, so application developers tend not to appreciate it when
the ontology evolves. This tends to discourage users from
employing the precision they need to express their views of the
data in an application. In EDEN, the first version of our ontology
easily evolved into the second, because there were only three
small database fragments and a single user view in the initial
demonstration. As the complexity of the user interface increased
to make multiple views possible, and as we added larger, more
complex resources, the task of evolving the ontology from version
two to version three became more demanding due simply to the
numbers of agents impacted.
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>2.3 Impedance Mismatch</title>
        <p>
          There are several areas where mismatch and other problems
seem to crop up. This problem has been studied extensively
within the heterogeneous database community [
          <xref ref-type="bibr" rid="ref19 ref20">19,20</xref>
          ]. Some
examples of these issues are as follows:
Uneven concept granularity: Some ontologies and other
dictionaries were developed with a focus on one particular aspect
of the domain. This aspect was well-developed and well-tested,
but other aspects of the same domain are ignored. For instance,
the ontology could have very strong and detailed concepts for
contaminants and hazardous waste sites on land and in lakes, but
not for ocean contaminants. However, a general application
concerning toxic waste sites would need all of these concepts.
Concept modeling mismatch: Some applications and
information sources model individual concepts differently from
others, thus it was hard to develop a single ontological concept
that would relate well to all the ways it could be represented. For
instance, the concept of a set of measurements of contaminant
levels taken daily could be represented in the ontology and/or the
data itself either with a class of measurement vectors (one
instance per day), or a class of measurements, with one instance
per measurement type per day, specialized on measurement type.
Value representation mismatch: Many ontologies we have
found do not even consider issues relating to the representation
of the values of the attributes in the instances, a requirement for
many applications. For instance, it would be easy to say that
“area” was an integer; therefore setting its range as the range of
positive integers. However, the Americans measured area in
square feet, etc, while the Europeans measured it in square
meters, etc. The application needed to know in which unit the
integer in a particular instance of the area is represented, and
how the units relate. These concepts should therefore exist in the
domain ontology. In a more complex example, one information
source in the EDEN application had “soil” as one of the media
that could be polluted; but a second had “topsoil” and “subsoil”
as separate concepts. A third site had multiple, more fine-grained
classifications for soil. These did not all map easily onto any
single ontological representation for soil, as mapping between the
different concepts required additional knowledge (e.g. the depth
of the soil), and sometimes was lossy (e.g. if everything was just
mapped to “soil”).
        </p>
        <p>Terminology mismatch: Sometimes different sets of users use
the same term for different concepts, or the term best suited to
the user is already present in the ontology with a different
meaning. This means that necessary concepts are sometimes
omitted because the lexical terms needed to express them in
human language are all present in other usages. For instance, the
UMLS set of Semantic Relations was slow to develop the
concept of genetic relationship because the terms parent and
child existed for their use in logical tree structures.</p>
        <p>All of these issues conspire to make ontology sharing
challenging.</p>
      </sec>
      <sec id="sec-2-5">
        <title>3. ONTOLOGY DEVELOPMENT ISSUES</title>
        <p>For ontological commitment to become real, ontology designers,
application designers and users within a domain need all to be
attracted to the same ontology. This affects not only the original
design, but also the process of evolving the ontology.</p>
      </sec>
      <sec id="sec-2-6">
        <title>3.1 Design and Representation</title>
        <p>Ideally, the design and representation of an ontology should be
done in collaboration with users and applications; the
applications then can incorporate the ontology as designed, and
the users can validate the concepts, relationships and axioms that
they access in the ontology via the application. These
interrelationships are shown in Figure 1.</p>
        <sec id="sec-2-6-1">
          <title>Representability vs.</title>
        </sec>
        <sec id="sec-2-6-2">
          <title>Implementability</title>
          <p>Domain
knowledge,
validation
Specification
Implementation
Ontology
Designer
Requirements,
validation</p>
        </sec>
        <sec id="sec-2-6-3">
          <title>Usability vs.</title>
        </sec>
        <sec id="sec-2-6-4">
          <title>Representability</title>
          <p>Specification</p>
          <p>Domain
knowledge,
validation
Application
Designer</p>
        </sec>
        <sec id="sec-2-6-5">
          <title>Implementability vs.</title>
        </sec>
        <sec id="sec-2-6-6">
          <title>Usability</title>
          <p>Application</p>
          <p>
            User
Partly because of the desire to maximize ontological
commitment, it is normally the goal of the ontology designer to
represent the domain as completely as possible. Ontologies such
as UMLS or environmental thesauri such as the General
Multilingual Environmental Thesaurus (GEMET) [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ] or the
Terminology Reference System (TRS) [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] provide a broad base
for discourse over their relative domains, medicine and the
environment. Their definition has largely been driven, not by the
implementers of the computer systems that use the ontologies,
but rather by the practitioners within the domain itself. An
example of the practical impact of this high-level design on the
EDEN system was the design of value mapping across conceptual
domains. The Environmental Data Register’s (EDR) [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]
implementation of its value domain model was fully normalized.
This produced response times that were suitable for individual
term lookups and mappings, but the performance scaled
inadequately when several agents were performing simultaneous
large-scale term conversions for entire tables. To obtain any
tolerable performance at all, we were compelled to revise the
ontology of the conceptual domain to support a flatter,
nonnormal data structure.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-7">
        <title>3.2 Evolution</title>
        <p>As application developers develop applications using ontologies,
and users use those applications, they identify weaknesses,
missing concepts, and different issues that need to be addressed.
For instance, when we developed our initial environmental
ontology, some of the users who were used to the terminology in
GEMET requested that we evolve our own ontology to be
consistent with GEMET, or otherwise incorporate its terms. This
request mandated an incorporation of that ontology into the
environmental ontology. Once this incorporation was complete,
all agents had to be adapted to the new ontology (this required a
varying amount of effort depending on how closely the agent was
hardwired to the ontology). Some of the users who were not used
to the GEMET terminology also had to learn it. Thus, the
evolution of the ontology engendered quite a bit of effort on many
people. One could envision that the evolution of an ontology that
has a higher level of ontological commitment, such as UMLS,
could get very complex.</p>
        <p>Clearly, the evolution of an ontology to a new version must be
done carefully and collaboratively, involving the same types of as
were involved in the original design. This indicates a slower,
more coarse-grained type of evolution, which in turn affects the
contents of the ontology. An ontology may contain several types
of information useful to the domain that it represents. These
include:
•
•
•
•</p>
        <p>Concepts/classes, their attributes, and the types of
representations their attributes can take.</p>
        <p>Relationships between concepts, including subclassing,
synonyms, and containment.</p>
        <p>Axioms and functions over those concepts.</p>
        <p>Instances of those concepts the relationships between them,
and axioms over the instances.</p>
        <p>The first three of these are referred to as the schema information
(also known as the conceptual model or meta-knowledge) and the
last is referred to the instance knowledge.</p>
        <p>
          Experience has shown us that instances, because they represent
changing objects in the real world, tend to evolve much faster
than the concepts themselves. Because of this, it is easier to
factor the ontology into two pieces, the schema and the instances.
Within the EDEN application, the environmental ontology itself
only contained schema-level domain information, and we
populated the schema using information from different
environmental databases. Because the agents themselves
naturally were written only against the schema concepts, and the
schema-level ontology information evolved more slowly, ontology
evolution did not create as significant a problem as it could have.
This experience has carried over into numerous other
applications developed using the InfoSleuth agent system [
          <xref ref-type="bibr" rid="ref3 ref8">8,3</xref>
          ];
thus, we recommend this factoring unless the instance
information is guaranteed to be stable (e.g., constants).
Ontological representation systems maintain this natural
factoring between schema and instances. In OIL [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], the schema
level is encapsulated within Standard OIL. OKBC clearly
distinguishes between schema and instances in its own
vocabulary. Therefore, there is no real barrier to this factoring
approach.
        </p>
      </sec>
      <sec id="sec-2-8">
        <title>4. COMPOSITIONAL ONTOLOGIES</title>
        <p>Given that the best approach to ontology design and use is a
collaborative one among ontology designers, application
developers and users, let us now turn our attention to the issue of
reuse. Consider an application developer, developing an
application to fulfill some of the needs of a specific class of users
in a given application domain. The developer has decided to
reuse as much already-defined ontological information as he can
within his application, tagging the imported concepts back to the
ontologies from which they were taken to facilitate the process of
evolving the application within the ontology. Unfortunately, up to
this point the application developer has had no input whatsoever
into the ontological design, though he has been working closely
with the prospective application users. Thus, the necessary
collaboration between the ontology designer, the application
developer and the users is missing in this situation. Furthermore,
the users’ input into the application design is much more direct
than that of the ontology designer. The result often is that there is
no good existing ontological “match” to the application and/or its
users.</p>
        <p>Our experiences with our own ontology-based agent applications
has led us to the following observations:
1.
2.
3.</p>
        <p>Reusing ontologies is necessary for acceptance by domain
specialists, but many mature ontologies specify a lot more
than current agent systems can actually use.</p>
        <p>Most of the applications we have developed require
concepts from multiple ontologies.</p>
        <p>Most of the applications that we have developed require
some concepts defined in no ontologies, because of
mismatch issues.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>GEMET</title>
    </sec>
    <sec id="sec-4">
      <title>EDEN</title>
      <p>GPS</p>
    </sec>
    <sec id="sec-5">
      <title>Positioning</title>
    </sec>
    <sec id="sec-6">
      <title>Facility</title>
      <p>Extension</p>
      <p>EDR
In order to facilitate reuse, we have designed our applications to
use a compositional notion of ontologies, where we select
concepts from different ontologies as needed, and supplement the
result with any new concepts that we cannot locate elsewhere.
This leads us to support three operations over ontologies
themselves, which we term subset, compose and extend,
respectively, as shown in Figure 2.</p>
      <sec id="sec-6-1">
        <title>4.1 Subset</title>
        <p>
          Agents are frequently coded as specialists, understanding a
focused subset of the domain itself. For example, an agent that is
interfacing with information repositories on environmental
remediation techniques would not necessarily understand the
information related to companies and their responsibilities for
cleaning up specific toxic waste sites; yet the scope of the domain
encompasses both areas. These agents need not incorporate entire
ontologies; in fact, this may not even be possible for lightweight
agents using large ontologies. Standards such as FIPA [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] allow
agents to advertise the ontologies they understand, However,
such an advertisement may be misleading to other agents if the
agent only understands a subset of the ontology, in that two
agents advertising the same ontology may in fact understand
orthogonal subsets of the ontology, rendering communication
impossible.
        </p>
        <p>Furthermore, too much ontology information may also confuse
users. As a practical matter, users of several of our agent
applications including EDEN were confused by the presence of
ontological terms for which no agent had advertised any
knowledge. The presence of the term implied to them that they
should be able to extract information (other than definition and
ontological relationships) relevant to the term. That no agent had
advertised the term was an unsatisfying explanation. Because of
this, the agents themselves that implement the ontology must be
careful how they present themselves to their users.</p>
        <p>We define this subsetting using ontological fragments. An
ontological fragment consists of:
•
•
•
•</p>
        <p>The name of the ontology
The classes or entities in the ontology that are supported
within the application, and their superclasses.</p>
        <p>Constraints on those classes including such things as ranges
of values allowed for specific slots</p>
        <p>The axioms that reference the supported classes/ attributes.
The fragment can be expressed as a set of constraints over the
classes and attributes. As an example from the EDEN system,
each agent that provided a wrapper for a data resource advertised
only a fraction of the domain ontology – precisely that portion for
which it could provide data instances. In some cases, a wrapper
agent advertised only a fraction of the slots of a class, and further
placed semantic constraints on the content of the slots based on
knowledge of the data instances about which it could report. In
other cases, an agent might fill a slot with a static value from the
canonical data representation for that slot. For example, because
there was no record in a particular remediation database for land
use, one of the attributes of a site, certain government database
wrappers returned the generic value “Federal Facility” which
was one of the values used in other databases.</p>
        <p>Another example decision that needs to be made on any system is
whether a request for everything matching a query should return
an empty set if an agent does not advertise all elements of a
class, or should return nulls for the unadvertised elements. This
provoked strong disagreement on our team between those who
felt that the former was algebraically preferable and those who
felt that the latter was more intuitive to users already intimidated
by the challenge of understanding the nature of the results
produced by a dynamic distributed information system.</p>
      </sec>
      <sec id="sec-6-2">
        <title>4.2 Compose</title>
        <p>Another issue that is brought about by ontological commitment is
the issue that an agent often requires access to terms from
multiple ontologies. There exist a growing number of useful and
public ontologies, and it often seems more appropriate to pick
and choose terms from those ontologies than to build a new one
from scratch, with just the terms that you need for your ontology.
The result is a virtual ontology that imports fragments of other
ontologies, into some sort of unified and useful whole. For
example, within the EDEN application the diverse ways of
representing values relating to various concepts such as chemical
or geographic identifiers made it necessary to compose the
domain ontology incorporating terminology from the field of
hazardous waste pollution and remediation with the EDR
ontology for value representation.</p>
        <p>The ability to compose ontologies has an immediate benefit in
modular ontology design. For instance, if you look within the
EDR tags, there are tags relating to location (street address,
postal code) as well as geographic (latitude and longitude). These
tags are not specific to the environmental domain, but are shared
among other domains such as address books and GPS positioning
systems. Furthermore, such structures seem relatively stable.
This indicates that it may be much more useful to have, say, a
single ontology for geographic positions, their representations,
and the relationships between them that can be incorporated into
environmental applications as well as those in other domains.</p>
      </sec>
      <sec id="sec-6-3">
        <title>4.3 Extend</title>
        <p>Unfortunately, applications frequently need to incorporate
concepts that are not represented in any ontology, or they may
need to “glue” concepts from different ontologies together using
new concepts. Thus, it is not sufficient to compose subsets of
ontologies; frequently it is necessary also to add some new
concepts. For example, within the setting of our EDEN
application, the domain ontologies covered the concepts for
which users wanted to retrieve information; for instance, they
enabled the user to answer questions such as, “What toxic waste
sites are within 10 miles of Houston, TX?” However, the
agentbased system that supported the application also was able to ask
higher-level, more abstract questions such as “Are the number of
toxic waste sites within 10 miles of Houston increasing faster
than they can be remediated?” These higher-level questions used
concepts present neither in the domain ontology, nor in any other
ontology available at the time.</p>
        <p>The important issue with extension is to do it in a manner that
will not create further problems later. There are two ways to
extend an ontology; extensions that are safe and can easily be
incorporated into other applications, and extensions that are
unsafe and should eventually be agreed upon and then folded
back by consensus into the ontology. We define safe and unsafe
extensions with respect to the permissible definitional changes
with respect to the underlying ontology(-ies). A safe extension
can be characterized as follows:
Safe extension: an extension incorporating existing concepts
from one or more ontologies, where any new concepts are defined
axiomatically against the existing concepts in such a way that
instances of the new concepts can be determined computationally
from instances of the existing concepts. The transformation
axioms must be invertible; the conversion of any instance
between its view in the existing concept(s) and its view in the
new concept must be lossless in both directions.</p>
        <p>Safe extensions are “safe” primarily because they can be shared
among the agents easily, simply by sharing the axioms that allow
for conversion from the existing classes to the extended classes.
We note that new concepts within such a safe extension cannot
have new attributes that would need to be present in any instance
of that concept, as this would violate the lossless conversion
property. So, for example, “milliliters per liter” would be a safe
extension for a concept if the concepts “water pollutant” and
“parts per million” were already present in the ontology being
extended.</p>
        <p>In contrast, unsafe extensions are “unsafe” because they are
harder to share with other agents, as the other agents may not
maintain the information necessary to populate instances of the
unsafe classes. For example, adding a new concept, “Water
pollutant” with a new property, “concentration” would not be a
safe extension if there were no concept of containing medium in
the original ontology. Other agents, committed to the ontology,
may not have the information necessary to provide a
concentration for each pollutant they have, or may have different
ways of modeling the concept of how badly the pollutant is
contaminating the water.</p>
        <p>
          Because unsafe extensions allow agents to instantiate classes that
have been extended unsafely and thus cannot easily be shared;
these extensions limit the openness of the agent. However,
unsafe extensions are sometimes necessary; for instance, they are
often required to deal with some of the impedance mismatch
issues discussed earlier. Allowing unsafe extensions increases
the possibility that different agents will create divergent
extensions to the ontology, and become incompatible with one
another. Therefore, unsafe extensions should be treated with
caution. One approach is to limit such extensions to those that
are monotonic [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] -- not requiring modifications to existing
ontological elements. Idealistically, the need for specific unsafe
extensions should be propagated to the ontology designers, who
can determine whether or not there is a consensus on the need for
the particular extension, and whether they should be incorporated
into the next version.
        </p>
      </sec>
      <sec id="sec-6-4">
        <title>4.4 Example</title>
      </sec>
      <sec id="sec-6-5">
        <title>GPS Ontology</title>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Lat/Long</title>
      <sec id="sec-7-1">
        <title>EDEN Ontology</title>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Measured</title>
    </sec>
    <sec id="sec-9">
      <title>Contamination</title>
      <sec id="sec-9-1">
        <title>Measurement</title>
      </sec>
      <sec id="sec-9-2">
        <title>Ontology</title>
        <p>canonical unit
unit
atPoint</p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>Meter</title>
      <p>isa
canonical unit
unit</p>
    </sec>
    <sec id="sec-11">
      <title>Unit of</title>
    </sec>
    <sec id="sec-12">
      <title>Measure</title>
      <p>isa</p>
    </sec>
    <sec id="sec-13">
      <title>Locator</title>
      <p>isa</p>
    </sec>
    <sec id="sec-14">
      <title>GeoLocation</title>
      <p>location</p>
    </sec>
    <sec id="sec-15">
      <title>Sampling</title>
    </sec>
    <sec id="sec-16">
      <title>Point</title>
    </sec>
    <sec id="sec-17">
      <title>Distance</title>
    </sec>
    <sec id="sec-18">
      <title>Quantity</title>
      <p>depth
isa
unit</p>
    </sec>
    <sec id="sec-19">
      <title>Foot</title>
      <sec id="sec-19-1">
        <title>5. IMPLEMENTATION ISSUES</title>
        <p>Given that we allow the operations of subsetting, composition
and (safe) extension over ontologies within an agent system, the
agents themselves need to be able to represent exactly what
combination they are committed to, in order to ensure that they
do not mislead others with respect to their capabilities.
There are three issues with representation and implementation.
The first is that the agent needs to be able to represent internally
the full set of details on how the ontology fragments it is using fit
together. This can be done with a suitable abstract ontology
representation. Secondly, the agent must be able to converse in
some detail with other agents about the parts of the ontologies
that they understand. This must be done using a suitable
exchange representation. Thirdly, messages should be able to
represent the ontology fragments that the message covers. This
requires a more specific ontology field within the agent
communication language.</p>
      </sec>
      <sec id="sec-19-2">
        <title>5.1 Internal Representation</title>
        <p>
          Internally, an agent needs to understand in detail the specific
schema-level knowledge to which it is committed. This
understanding may be complicated by the fact that ontological
knowledge may be represented using many different
representational models; for instance, there are
subject-verbobject models such as DAML, frame-oriented models such and
OKBC, and object-oriented models such as UML [
          <xref ref-type="bibr" rid="ref1 ref14">14, 1</xref>
          ]. These
all differ in key issues such as whether or not attributes are
firstclass objects, whether or not multiple inheritance is allowed, and
how strongly-typed classes must be. Further complicating this, if
the operations of subsetting, composition and extension are used,
a single application may look into a set of ontologies whose
underlying ontological models may diverge.
        </p>
        <p>
          Recent discussions have included the notion of developing
abstract ontology representations for the internal representation
of ontological knowledge. As with all applications that attempt to
span multiple viewpoints, key choices in specifying an abstract
ontology representation revolve around whether it should
incorporate every modeling features present in some ontology
representation, or it should only look at the modeling features
used by every representation, or it should take some path in the
middle. Wilmott et.al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] discuss in depth some of the issues
associated with the specification and implementation of an
abstract ontology representation from a theoretical standpoint.
Fortunately, an abstract ontology representation is implemented
within an ontology service, and thus only needs to incorporate the
requirements of the agents that it serves. In the InfoSleuth
system, our ontology service focused on frame-based ontologies,
as they were most compatible with the databases with which we
were working.
        </p>
        <p>Internally, an agent that is using ontologies need only store the
subsets of the ontology that it needs, and the glue that puts them
together. So, for instance, in the ontology of Figure 2, the agent
itself only needs to understand the concepts in the shaded areas
of the GEMET, GPS Positioning, Facility and EDR ontologies, as
well as the additional information associated with their
composition and extension. Managing this information in an
efficient way is the work of the ontology service.</p>
        <p>Our experience has shown us that a good abstract ontology
representation and a good ontology service are essential to cope
with the incorporation of existing ontologies and their
evolutionary steps. The abstract ontology representation
engenders a unified methodology by which the agent can absorb
existing ontology information and its particular use within the
application. Agent implementers then use the ontologies in a
more declarative and less hard-wired manner, which in turn
facilitates the incorporation of new ontological information.</p>
      </sec>
      <sec id="sec-19-3">
        <title>5.2 External Representation</title>
        <p>Agents need to be able to share which parts of the different
ontologies that they use or require, and how they put them
together, with their other collaborating agents. This happens
when the agent advertises its capabilities to another agent, when
an agent is looking to locate another agent that can help it with
some task, or when agents are negotiating over who is
responsible for what subtasks. We have identified three basic
ways that ontologies may be adapted dynamically: subsetting,
composition, and extension:
Subsetting: The notion of subsetting an ontology is not reflected
in the definition of the ontology itself, but rather in artifacts
related to the use of the ontology. For example, fragments may be
advertised from one agent to another, so that agents can inform
other agents within their community of their exact capabilities.
Alternatively, an application may be supported at the user end by
an agent that interfaces with the user, ensuring that the user is
presented only with the fragments of the ontology that are
supported within the underlying agent community. Any user
interfaces can then be tailored to the ontology fragment.
Composition: As with subsetting, composition should be
specified at the time of use, however, there are some
representational difficulties at this point. While subsetting can be
defined as a set of axioms that overlays a single ontology,
composition spans multiple ontologies and thus presents different
representational challenges. For instance, a composition may
wish to state that the values of the “LatitudeMeasure” slot for the
“FacilityIdentification” in the EDR may be understood by the
agent as represented in the “LatLongCoordinate” units defined in
the “GPS Coordinates” ontology. In EDEN, we defined a
geographic location as comprising a (selectable) coordinate
scheme and a coordinate value.</p>
        <p>OKBC and DAML present different challenges with respect to
composition. Within OKBC, composition must be done by
explicitly importing the component ontologies. Existing tools do
not necessarily facilitate importing ontologies from remote
locations, or ones that are represented in different languages
(e.g., DAML). The resulting new ontology, even if it can be
defined is a potentially huge superset of what is actually needed
by the agent. Furthermore, it may not be possible to relate the
terms in the new ontology back to the ontologies that it imported,
so it may be computationally difficult to re-relate terms imported
from the same ontology into two such composed ontologies.
DAML+OIL, on the other hand, uses namespaces to facilitate the
combination of ontologies and the relating of concepts among
multiple ontologies. However, this ability to use namespaces
relies on having the component ontologies’ contents also within a
namespace.</p>
        <p>Extension: Safe extensions by definition should be able to be
specified in terms of logical axioms over existing concepts in the
ontology. While axioms are strongly considered among ontology
designers, many ontology exchange languages hardly consider
them at all; for instance, DAML+OIL at the moment has no good
foundation for representing axioms and is thus unsuitable for
exchanging knowledge concerning safe extensions.</p>
        <p>In EDEN, we prepared a number of ad hoc lexical translation
processes to translate between an old version and an extended
version of the ontology. This did not yield a reusable method.</p>
      </sec>
      <sec id="sec-19-4">
        <title>5.3 Representation in ACL Messages</title>
        <p>
          The last area of application impact concerns the communication
of ontological reference information in ACL messages. Currently,
agent communication languages such as the FIPA ACL [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] carry a
specification of the ontology from which the message content
takes its concepts in an “:ontology” property. Compositional
ontologies do not map directly to a single ontology, so the
message format needs to change in one of the following ways:
1.
2.
3.
        </p>
        <p>Give the compositional ontology a name and make it
explicitly available, and use this in the message’s :ontology
property.</p>
        <p>Specify all ontologies using multiple :ontology properties.
This may not cover concepts in safe extensions to the
ontology.</p>
        <p>Allow the ontology field to contain a description of the
concepts in the in the ontologies, including extensions, that
are used in the message. This could result in unnecessary
performance impact on communicative actions.</p>
      </sec>
      <sec id="sec-19-5">
        <title>6. CONCLUSIONS</title>
        <p>
          Ontological commitment is the decision by a particular group of
applications or users within an application domain to use the
terms defined in a given ontology. High ontological commitment
occurs when many users and/or applications within an
application domain commit to sharing the same vocabulary of
concepts, meanings and relationships defined within a specific
ontology. Ontological commitment to a common set of ontologies
is a key feature to provide for openness in an agent system, as it
provides a common vocabulary over which agents can converse.
This paper focused mainly on two areas relating to ontological
commitment and use. The first area is the clash of goals between
ontology definers, application developers and users. Ontology
definers are concerned with the completeness and purity of their
ontological design. Application developers are often concerned
only with specific subsets of the ontology that relate directly to
the application, and with application-specific representational
and computational issues. Users are concerned mainly with
sticking with known and familiar terminology and vocabularies,
which may not be logically structured and therefore may not be
amenable to being reformulated into computationally-accessible
concepts. This clash is complicated by the widening gap between
these groups – developers, for instance, may be using ontologies
that are already well-established. Thus, while several good
design paradigms involving ontology designers, application
developers, and users, exist for initial ontology development
[
          <xref ref-type="bibr" rid="ref16 ref17 ref18">16,17,18</xref>
          ], none of these seem to encompass these more
longterm concerns. We recommend the development of an extended
ontology lifecycle that fosters evolution. Components required to
support this lifecycle should include long-term ontology design
support, a widely-available feedback channel from developers
and users back to designers, and an open forum for discussing
issues and extensions to the ontology.
        </p>
        <p>
          An application committed to reuse existing ontologies may
encounter several difficulties, as search for the perfect ontology
for the application – one that is acceptable to both the application
and its users – is not necessarily fruitful. We described a
compositional approach to ontological use. Compositional
ontologies base themselves in existing ontologies as much as
possible, using the operations of ontological subsetting,
ontological composition and ontological extension to tailor them
to the specific needs of the applications. Compositional
ontologies fit well with the needs of the application, and the
approach has the potential to raise the level of ontological
commitment to existing ontologies. However, they require more
sophisticated ontology-related exchanges among collaborating
agents. In order to incorporate these compositional ontologies,
further work needs to be done on the development of a more
formal algebra to support the subset, compose and extend
operations over ontologies (in contrast to our current ad-hoc
methods). Visser and Cui [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] attacked a related problem of
heterogeneous ontology structures, and this may provide further
insight into this formal algebra. As a second task, we need to
develop a methodology for ensuring the correctness and
consistency of these compositional ontologies.
        </p>
        <p>Many approaches to the representation and specification of
ontologies exist; some common ones for the exchange of
ontological information include OKBC, DAML+OIL, OIL and its
varieties, and UML. Each of these has features amenable to their
adoption as a means to exchange information on ontological
components and extensions, though none cover all of the issues
addressed in this paper. Therefore, the exchange of such
information using these standards is still problematic.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Cranefield</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haustein</surname>
            ,
            <given-names>S</given-names>
          </string-name>
          and Purvis,
          <string-name>
            <surname>M. “</surname>
          </string-name>
          <article-title>UML-based ontology modeling for software agents”</article-title>
          .
          <source>In Proceedings of the workshop on Ontologies in Agent Systems, Autonomous Agents</source>
          <year>2001</year>
          , Montreal, Canada,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] DAML.org</article-title>
          . http://www.daml.org,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Deschaine</surname>
            ,
            <given-names>L. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brice</surname>
            ,
            <given-names>R. S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Nodine</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          “
          <article-title>Use of InfoSleuth to Coordinate Information Acquisition, Tracking and Analysis in Complex Applications</article-title>
          .”
          <source>In Proceedings of Advanced Simulation Technologies Conference, April</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>“EPA Environmental Data</surname>
          </string-name>
          <article-title>Registry”</article-title>
          . http://www.epa.gov/edr,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] FIPA. “
          <article-title>Agent Communication Language Specifications”</article-title>
          . http://www.fipa.org/repository/aclspecs.html,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] “
          <article-title>General Multilingual Environmental Thesaurus”</article-title>
          . http://www.nu.niedersachsen.de/cds/etccds_neu/library/select.html
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Humphreys</surname>
            ,
            <given-names>B.L.</given-names>
          </string-name>
          et.al. “
          <article-title>Assessing and Enhancing the Value of the UMLS Knowledge Sources”</article-title>
          .
          <source>In Proceedings of the 15th Annual Symposium on Computer Applications in Medical Care</source>
          , pp.
          <fpage>78</fpage>
          -
          <lpage>82</lpage>
          , Washington,
          <string-name>
            <given-names>D.C.</given-names>
            ,
            <surname>November</surname>
          </string-name>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Nodine</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et.al. “
          <article-title>Active Information Gathering in InfoSleuth</article-title>
          .”
          <source>International Journal of Cooperative Information Systems 91/2</source>
          , 2000, pp.
          <fpage>3</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>[9] “Welcome to the OIL Page”</article-title>
          . http://www.ontoknowledge.org/oil,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>“OKBC Home</surname>
          </string-name>
          <article-title>Page”</article-title>
          . http://www.ksl.stanford.edu/software/OKBC,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Sowa</surname>
            ,
            <given-names>J. F. Knowledge</given-names>
          </string-name>
          <string-name>
            <surname>Representation</surname>
          </string-name>
          . Brooks/Cole, California,
          <year>2000</year>
          , pp.
          <fpage>xi</fpage>
          -xii.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>“EPA Terminology Reference System</surname>
          </string-name>
          .” http://www.epa.gov/trs/index.htm ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Tuttle</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          et al., “Merging terminologies”.
          <source>Medinfo</source>
          .
          <year>1995</year>
          ;
          <volume>8 Pt 1</volume>
          :
          <fpage>162</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14] Object Management Group.
          <source>OMG Unified Modeling Language Specification, version 1</source>
          .3. http://www.omg.org/technology/documents/formal/unified_model ing_language.htm.
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Willmott</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Constantinescu</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Callisti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          “Multilingual Agents: Ontologies, Languages and Abstractions.”
          <source>In Proceedings of the First International Workshop on Ontologies in Agent Systems, Autonomous Agents</source>
          <year>2001</year>
          , Montreal, Canada, May,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Gruber</surname>
            ,
            <given-names>T.R. “</given-names>
          </string-name>
          <article-title>A Translation Approach to Portable Ontology Specifications</article-title>
          .”
          <source>Knowledge Acquisition</source>
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Gomez-Perez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , “Ontological Engineering:
          <article-title>A State of the Art</article-title>
          .
          <source>” Expert Update</source>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Gruninger</surname>
            ,
            <given-names>M</given-names>
          </string-name>
          and Fox,
          <string-name>
            <surname>M. S.</surname>
          </string-name>
          “
          <article-title>Methodology for the Design and Evaluation of Ontologies</article-title>
          .”
          <source>In Proceedings of the Workshop on Basic Ontological Issues and Knowledge Sharing</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Hammer</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>McLeod</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          “
          <article-title>An Approach to Resolving Semantic Heterogeneity in a Federation of Autonomous, Heterogeneous Database Systems</article-title>
          .” In
          <source>International Journal of Intelligent and Cooperative Information Systems</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Kent</surname>
            ,
            <given-names>W. “</given-names>
          </string-name>
          <article-title>The Many Forms of a Single Fact”</article-title>
          .
          <source>In Proceedings of IEEE COMPCON</source>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>