=Paper=
{{Paper
|id=Vol-66/paper-3
|storemode=property
|title=On the Impact of Ontological Commitment
|pdfUrl=https://ceur-ws.org/Vol-66/oas02-11.pdf
|volume=Vol-66
|authors=Marian Nodine and Jerry Fowler
}}
==On the Impact of Ontological Commitment==
On the Impact of Ontological Commitment
Marian Nodine Jerry Fowler
Telcordia Technologies setenv, LLC
106 E. 6th Street, Suite 415 Houston, TX
Austin, TX 78733 (713)526-8641
(512)478-8923
gfowler@acm.org
nodine@research.telcordia.com
ABSTRACT In an agent-based system, common ontologies specify the
Ontological commitment, or the agreement to have your ontological commitments of a set of participating agents [16]. An
applications and users conform to a common domain ontological commitment is an agreement to use a vocabulary in a
understanding as encapsulated in one or more shared ontologies, way that is consistent with an ontology. An agent or human
is a noble goal and essential for open agent systems. Our committed to an ontology understands (some subset of) the
experiences building ontology-based agent systems in multiple ontology and agrees to use it in a manner consistent with the
domains have shown us that the intention for a new application semantics of the ontology. Agents and humans committed to the
to locate and conform to some existing ontology or ontologies same ontology can share knowledge among themselves with
within its domain has many impediments to its success. For some confidence that they share an underlying understanding of
instance, the goals of the designer of a domain ontology include what is being said. Commitment to common, shared ontologies
developing a complete and comprehensive domain description; facilitates openness in an agent-based system.
however, the application developer may only require a small In this paper, we examine the conflicting requirements and goals
fragment of that ontology. Multiple applications that conform to of ontology designers, ontology-committed applications, and
the ontology may, in fact, use completely orthogonal fragments of ontology-aware users, and their respective impact on the problem
the ontology, and not be able to interact at all. Users may insist of ontology commitment and reuse. As ontological sharing and
on importing into the ontology sets of terms that are neither reuse increases, the gap between the ontology designers and the
logically consistent nor easily modelable. ontology users grows larger. Our goal is to analyze what issues
inhibit reuse and to propose strategies for facilitating reuse. In
With these issues in mind, in this paper we propose some
particular, we consider the problem of reuse of ontologies whose
guidelines for ontology development and evolution paradigm that
specification is complete, for applications whose requirements
should facilitate ontology reuse. These guidelines could
were not considered during the design of the ontology. This
underpin a usage model for ontologies; one that enables the
problem is not addressed in the ontology design methodologies
application designer to reuse ontological concepts from multiple
summarized in [17]. We develop guidelines and approaches for
ontologies in a more flexible manner, while retaining the
agents to use existing ontologies in a more flexible manner.
essentially good properties of ontology sharing and reuse. These
guidelines affect both the design and use of ontology-based Throughout the paper, we relate the issues to real issues we have
applications, as well as the way applications advertise encountered within the context of one of our applications, EDEN
themselves to other agents with which they may interoperate. [3]. EDEN1 is an agent-based system developed for the purpose
of inter-organizational sharing of environmental data collected,
stored and monitored by multiple government agencies and non-
1. INTRODUCTION government scientists spread throughout the US and Europe, and
The goal of knowledge representation is to make explicit the
relating information from these disparate data sources and
semantics of a particular domain of interest for the purposes of
schemas at a semantic level as needed by the users. EDEN uses
sharing the knowledge among humans and computer artifacts.
ontologies to represent the semantics of the underlying
Sowa [11] subdivides knowledge representation into categories:
information, and real and varied databases to populate those
“Logic provides the formal structure and rules of inference. ontologies with instances.
Ontology defines the kinds of things that exist in the
application domain. 2. ONTOLOGICAL COMMITMENT
Because ontologies are meant to facilitate sharing and reuse of
Computation supports the applications that distinguish
knowledge, it is important that the ontology and its collection of
knowledge representation from pure philosophy.”
users (both human and agent) align themselves to a shared view
There is a strong relationship between some specific ontology of the domain during the process of designing and evolving the
and the logical rules and computational artifacts that use that ontology for that domain. However, many existing ontologies
ontology, in that when they communicate among themselves, they have been developed either by designers attempting to
have some level of assurance that the same terms have the same characterize a domain (with no real computational applications
meanings to all. However, this use requires that the logical rules that use them) or by application developers to support individual
and the computational artifacts have explicit linkages with the applications (with no real sharing of the ontology with other
ontology; often in the form of hard-coding the ontological terms
into the rules and/or the application code itself.
1
EDEN was funded jointly by the DOD, DOE, EPA and EEA.
applications). The plethora of existing ontologies argues that Environmental Data Registry [4], it was difficult to relate
many concepts are already represented within some ontology, so measurements of chemicals between different representations
reuse of these ontologies, increasing the ontological commitment because the measurement ontology was “well understood,” and
level, is now feasible. For example, some ontologies such as the often explicit in the data representations themselves. Even terms
Unified Medical Language System (UMLS) Metathesaurus [7] such as “milligrams per kilogram” were sometimes represented
have achieved a higher level of commitment. inconsistently in text fields. This posed a problem in extracting
There are several issues that impede this sharing and knowledge. values for computation in this “well understood” measurement
First are issues related to the conflicting goals and objectives of system.
ontology definers, ontology-committed applications, and users of The desire of the application user is to be able to do his job well
such applications. Second are issues of the mutual conflict and easily. Application users have expertise in their own jobs
between ontological commitment and ontological evolution. and potentially in the domain of the application, but may have
Third are issues of conceptual mismatches between ontologies minimal understanding of knowledge representation or
and the applications and users that use them. Unfortunately, application design issues. Typical users want an understandable,
these issues stem from fundamental issues and characteristics of natural, easy-to-use interface to the application that facilitates
the problem of sharing ontologies so broadly. their work. This implies that they want the scope of the ontology
restricted to the exact area within the domain that they are
2.1 Conflict: Definer, Application, User specifically concerned with. Another issue is that a user may
The commitment to ontologies is hampered by the conflicting have a comfortable vocabulary of domain-related terms that do
goals of ontology definers, developers of ontology-committed not map well to the ontology representation model, to the domain
applications and ontology-committed application users, and often model that the ontology developer had in mind or to the needs of
by the confusion of users and definers over the demands of the application itself.
ontology-committed applications. Here, we use the descriptor
“ontology-committed” to mean that something purports to use the 2.2 Conflict: Commitment, Evolution
terms in the ontology(-ies) in a manner consistent with its (their) A second hampering factor when considering ontological
definitions. commitment itself is the fact that ontological commitment
The goal of the ontology designer, at least one working towards impedes ontology evolution, and vice versa. Yet, both of these
maximizing the usefulness of his ontology to a wide variety of are necessary for an ontology to be successful. As Mark Tuttle
applications, is to completely characterize a particular domain at et.al. say, “if you don’t use it, it won’t improve, if you don’t
the semantic level. The ontology designer needs expertise in improve it, it won’t get used.” [13]. In other words, use brings
knowledge representation and in the domain of the ontology. His out the need to change, change is required for continued use.
intent is to develop a comprehensive and up-to-date ontology, As stated before, a measure of ontological commitment is the
with a broad set of acceptable terms. His job is hampered by the number of agents and humans that are committed to using the
fact that the different domain experts have different viewpoints ontology. Each of these committed entities has made the effort to
of the domain, and these viewpoints must be reconciled within learn the ontology and to use the concepts therein in a manner
the ontology itself, placing pressure on the designer to either take consistent with their definition. However, the ontology itself may
a commonly agreed-upon but consistent subset of the domain, or have problems, or may be incomplete in some aspect, or not
to attempt to placate everyone by including everything. The likely reflect changes in the domain itself. In these situations, it
result is either the ontology will express concepts at a higher becomes desirable at some point to evolve the ontology,
level than can be used effectively by an application that seeks to providing an updated version. Initially, there is no real
attach suitable labels to real data, or the terminology within the commitment to the updated version; rather, the entities
ontology will not be logically consistent and thus easily committed to the old ontology must explicitly learn the new
incorporatable into an application. Fortunately, our experience ontology, and adapt to the changes therein. This leads to the
indicates that many terminology disagreements can be addressed observation that, as commitment to an ontology rises, so does the
by synonymy. For instance, what the EPA refers to as a “site” the level of effort required to accommodate an evolutionary step.
DOD calls a “facility.” What the EPA calls an “operable unit” (a
Development of an application represents a strong degree of
specific location contained in an EPA “site”) the DOD calls a ontological commitment. The incorporation of an ontology into an
“site.”
application may require specific coding with respect to the
The goal of the application designer is to use the ontology to ontology, so application developers tend not to appreciate it when
support an application. The application designer has expertise in the ontology evolves. This tends to discourage users from
building applications, especially within given domains. His intent employing the precision they need to express their views of the
is to develop an efficient and useful application. It is likely that data in an application. In EDEN, the first version of our ontology
the application will not cover the whole domain, or may span easily evolved into the second, because there were only three
multiple domains. Also, applications need to focus on some small database fragments and a single user view in the initial
issues that are unimportant to the designer. For instance, a failure demonstration. As the complexity of the user interface increased
to take sufficient care of value representation issues in the to make multiple views possible, and as we added larger, more
ontology may force the application designer to code this complex resources, the task of evolving the ontology from version
information directly into the application, even though it is really two to version three became more demanding due simply to the
domain-related knowledge. For example, within the numbers of agents impacted.
2.3 Impedance Mismatch attracted to the same ontology. This affects not only the original
There are several areas where mismatch and other problems design, but also the process of evolving the ontology.
seem to crop up. This problem has been studied extensively
within the heterogeneous database community [19,20]. Some
3.1 Design and Representation
Ideally, the design and representation of an ontology should be
examples of these issues are as follows:
done in collaboration with users and applications; the
Uneven concept granularity: Some ontologies and other applications then can incorporate the ontology as designed, and
dictionaries were developed with a focus on one particular aspect the users can validate the concepts, relationships and axioms that
of the domain. This aspect was well-developed and well-tested, they access in the ontology via the application. These
but other aspects of the same domain are ignored. For instance, interrelationships are shown in Figure 1.
the ontology could have very strong and detailed concepts for
contaminants and hazardous waste sites on land and in lakes, but Representability vs. Usability vs.
not for ocean contaminants. However, a general application Implementability Representability
concerning toxic waste sites would need all of these concepts.
Concept modeling mismatch: Some applications and
information sources model individual concepts differently from Domain Speci- Speci- Domain
others, thus it was hard to develop a single ontological concept knowledge, fication Ontology fication knowledge,
validation Designer validation
that would relate well to all the ways it could be represented. For
instance, the concept of a set of measurements of contaminant
levels taken daily could be represented in the ontology and/or the
Implementation
data itself either with a class of measurement vectors (one
instance per day), or a class of measurements, with one instance
Requirements,
per measurement type per day, specialized on measurement type. validation
Value representation mismatch: Many ontologies we have Application
Application
found do not even consider issues relating to the representation Implementability vs.
Designer User
of the values of the attributes in the instances, a requirement for Usability
many applications. For instance, it would be easy to say that
“area” was an integer; therefore setting its range as the range of
positive integers. However, the Americans measured area in Figure 1. Ontological Interactions
square feet, etc, while the Europeans measured it in square
meters, etc. The application needed to know in which unit the Partly because of the desire to maximize ontological
commitment, it is normally the goal of the ontology designer to
integer in a particular instance of the area is represented, and
represent the domain as completely as possible. Ontologies such
how the units relate. These concepts should therefore exist in the
domain ontology. In a more complex example, one information as UMLS or environmental thesauri such as the General
Multilingual Environmental Thesaurus (GEMET) [6] or the
source in the EDEN application had “soil” as one of the media
Terminology Reference System (TRS) [12] provide a broad base
that could be polluted; but a second had “topsoil” and “subsoil”
for discourse over their relative domains, medicine and the
as separate concepts. A third site had multiple, more fine-grained
environment. Their definition has largely been driven, not by the
classifications for soil. These did not all map easily onto any
implementers of the computer systems that use the ontologies,
single ontological representation for soil, as mapping between the
but rather by the practitioners within the domain itself. An
different concepts required additional knowledge (e.g. the depth
of the soil), and sometimes was lossy (e.g. if everything was just example of the practical impact of this high-level design on the
EDEN system was the design of value mapping across conceptual
mapped to “soil”).
domains. The Environmental Data Register’s (EDR) [4]
Terminology mismatch: Sometimes different sets of users use implementation of its value domain model was fully normalized.
the same term for different concepts, or the term best suited to This produced response times that were suitable for individual
the user is already present in the ontology with a different term lookups and mappings, but the performance scaled
meaning. This means that necessary concepts are sometimes inadequately when several agents were performing simultaneous
omitted because the lexical terms needed to express them in large-scale term conversions for entire tables. To obtain any
human language are all present in other usages. For instance, the tolerable performance at all, we were compelled to revise the
UMLS set of Semantic Relations was slow to develop the ontology of the conceptual domain to support a flatter, non-
concept of genetic relationship because the terms parent and normal data structure.
child existed for their use in logical tree structures.
All of these issues conspire to make ontology sharing 3.2 Evolution
challenging. As application developers develop applications using ontologies,
and users use those applications, they identify weaknesses,
3. ONTOLOGY DEVELOPMENT ISSUES missing concepts, and different issues that need to be addressed.
For ontological commitment to become real, ontology designers, For instance, when we developed our initial environmental
application designers and users within a domain need all to be ontology, some of the users who were used to the terminology in
GEMET requested that we evolve our own ontology to be
consistent with GEMET, or otherwise incorporate its terms. This
request mandated an incorporation of that ontology into the reuse as much already-defined ontological information as he can
environmental ontology. Once this incorporation was complete, within his application, tagging the imported concepts back to the
all agents had to be adapted to the new ontology (this required a ontologies from which they were taken to facilitate the process of
varying amount of effort depending on how closely the agent was evolving the application within the ontology. Unfortunately, up to
hardwired to the ontology). Some of the users who were not used this point the application developer has had no input whatsoever
to the GEMET terminology also had to learn it. Thus, the into the ontological design, though he has been working closely
evolution of the ontology engendered quite a bit of effort on many with the prospective application users. Thus, the necessary
people. One could envision that the evolution of an ontology that collaboration between the ontology designer, the application
has a higher level of ontological commitment, such as UMLS, developer and the users is missing in this situation. Furthermore,
could get very complex. the users’ input into the application design is much more direct
Clearly, the evolution of an ontology to a new version must be than that of the ontology designer. The result often is that there is
done carefully and collaboratively, involving the same types of as no good existing ontological “match” to the application and/or its
were involved in the original design. This indicates a slower, users.
more coarse-grained type of evolution, which in turn affects the Our experiences with our own ontology-based agent applications
contents of the ontology. An ontology may contain several types has led us to the following observations:
of information useful to the domain that it represents. These 1. Reusing ontologies is necessary for acceptance by domain
include: specialists, but many mature ontologies specify a lot more
• Concepts/classes, their attributes, and the types of than current agent systems can actually use.
representations their attributes can take. 2. Most of the applications we have developed require
• Relationships between concepts, including subclassing, concepts from multiple ontologies.
synonyms, and containment. 3. Most of the applications that we have developed require
• Axioms and functions over those concepts. some concepts defined in no ontologies, because of
mismatch issues.
• Instances of those concepts the relationships between them,
and axioms over the instances. GEMET GPS
The first three of these are referred to as the schema information EDEN Positioning
(also known as the conceptual model or meta-knowledge) and the
last is referred to the instance knowledge.
Experience has shown us that instances, because they represent
changing objects in the real world, tend to evolve much faster Facility Extension EDR
than the concepts themselves. Because of this, it is easier to
factor the ontology into two pieces, the schema and the instances.
Within the EDEN application, the environmental ontology itself
only contained schema-level domain information, and we
populated the schema using information from different
environmental databases. Because the agents themselves Figure 2. A Compositional Ontology
naturally were written only against the schema concepts, and the In order to facilitate reuse, we have designed our applications to
schema-level ontology information evolved more slowly, ontology use a compositional notion of ontologies, where we select
evolution did not create as significant a problem as it could have. concepts from different ontologies as needed, and supplement the
This experience has carried over into numerous other result with any new concepts that we cannot locate elsewhere.
applications developed using the InfoSleuth agent system [8,3]; This leads us to support three operations over ontologies
thus, we recommend this factoring unless the instance themselves, which we term subset, compose and extend,
information is guaranteed to be stable (e.g., constants). respectively, as shown in Figure 2.
Ontological representation systems maintain this natural
factoring between schema and instances. In OIL [9], the schema 4.1 Subset
level is encapsulated within Standard OIL. OKBC clearly Agents are frequently coded as specialists, understanding a
distinguishes between schema and instances in its own focused subset of the domain itself. For example, an agent that is
vocabulary. Therefore, there is no real barrier to this factoring interfacing with information repositories on environmental
approach. remediation techniques would not necessarily understand the
information related to companies and their responsibilities for
4. COMPOSITIONAL ONTOLOGIES cleaning up specific toxic waste sites; yet the scope of the domain
Given that the best approach to ontology design and use is a encompasses both areas. These agents need not incorporate entire
collaborative one among ontology designers, application ontologies; in fact, this may not even be possible for lightweight
developers and users, let us now turn our attention to the issue of agents using large ontologies. Standards such as FIPA [5] allow
reuse. Consider an application developer, developing an agents to advertise the ontologies they understand, However,
application to fulfill some of the needs of a specific class of users such an advertisement may be misleading to other agents if the
in a given application domain. The developer has decided to agent only understands a subset of the ontology, in that two
agents advertising the same ontology may in fact understand or geographic identifiers made it necessary to compose the
orthogonal subsets of the ontology, rendering communication domain ontology incorporating terminology from the field of
impossible. hazardous waste pollution and remediation with the EDR
Furthermore, too much ontology information may also confuse ontology for value representation.
users. As a practical matter, users of several of our agent The ability to compose ontologies has an immediate benefit in
applications including EDEN were confused by the presence of modular ontology design. For instance, if you look within the
ontological terms for which no agent had advertised any EDR tags, there are tags relating to location (street address,
knowledge. The presence of the term implied to them that they postal code) as well as geographic (latitude and longitude). These
should be able to extract information (other than definition and tags are not specific to the environmental domain, but are shared
ontological relationships) relevant to the term. That no agent had among other domains such as address books and GPS positioning
advertised the term was an unsatisfying explanation. Because of systems. Furthermore, such structures seem relatively stable.
this, the agents themselves that implement the ontology must be This indicates that it may be much more useful to have, say, a
careful how they present themselves to their users. single ontology for geographic positions, their representations,
We define this subsetting using ontological fragments. An and the relationships between them that can be incorporated into
ontological fragment consists of: environmental applications as well as those in other domains.
• The name of the ontology 4.3 Extend
• The classes or entities in the ontology that are supported Unfortunately, applications frequently need to incorporate
within the application, and their superclasses. concepts that are not represented in any ontology, or they may
need to “glue” concepts from different ontologies together using
• Constraints on those classes including such things as ranges new concepts. Thus, it is not sufficient to compose subsets of
of values allowed for specific slots ontologies; frequently it is necessary also to add some new
• The axioms that reference the supported classes/ attributes. concepts. For example, within the setting of our EDEN
application, the domain ontologies covered the concepts for
The fragment can be expressed as a set of constraints over the which users wanted to retrieve information; for instance, they
classes and attributes. As an example from the EDEN system, enabled the user to answer questions such as, “What toxic waste
each agent that provided a wrapper for a data resource advertised sites are within 10 miles of Houston, TX?” However, the agent-
only a fraction of the domain ontology – precisely that portion for based system that supported the application also was able to ask
which it could provide data instances. In some cases, a wrapper higher-level, more abstract questions such as “Are the number of
agent advertised only a fraction of the slots of a class, and further toxic waste sites within 10 miles of Houston increasing faster
placed semantic constraints on the content of the slots based on than they can be remediated?” These higher-level questions used
knowledge of the data instances about which it could report. In concepts present neither in the domain ontology, nor in any other
other cases, an agent might fill a slot with a static value from the ontology available at the time.
canonical data representation for that slot. For example, because
there was no record in a particular remediation database for land The important issue with extension is to do it in a manner that
use, one of the attributes of a site, certain government database will not create further problems later. There are two ways to
wrappers returned the generic value “Federal Facility” which extend an ontology; extensions that are safe and can easily be
was one of the values used in other databases. incorporated into other applications, and extensions that are
unsafe and should eventually be agreed upon and then folded
Another example decision that needs to be made on any system is
back by consensus into the ontology. We define safe and unsafe
whether a request for everything matching a query should return
extensions with respect to the permissible definitional changes
an empty set if an agent does not advertise all elements of a
with respect to the underlying ontology(-ies). A safe extension
class, or should return nulls for the unadvertised elements. This
can be characterized as follows:
provoked strong disagreement on our team between those who
felt that the former was algebraically preferable and those who Safe extension: an extension incorporating existing concepts
felt that the latter was more intuitive to users already intimidated from one or more ontologies, where any new concepts are defined
by the challenge of understanding the nature of the results axiomatically against the existing concepts in such a way that
produced by a dynamic distributed information system. instances of the new concepts can be determined computationally
from instances of the existing concepts. The transformation
4.2 Compose axioms must be invertible; the conversion of any instance
Another issue that is brought about by ontological commitment is between its view in the existing concept(s) and its view in the
the issue that an agent often requires access to terms from new concept must be lossless in both directions.
multiple ontologies. There exist a growing number of useful and Safe extensions are “safe” primarily because they can be shared
public ontologies, and it often seems more appropriate to pick among the agents easily, simply by sharing the axioms that allow
and choose terms from those ontologies than to build a new one for conversion from the existing classes to the extended classes.
from scratch, with just the terms that you need for your ontology. We note that new concepts within such a safe extension cannot
The result is a virtual ontology that imports fragments of other have new attributes that would need to be present in any instance
ontologies, into some sort of unified and useful whole. For of that concept, as this would violate the lossless conversion
example, within the EDEN application the diverse ways of property. So, for example, “milliliters per liter” would be a safe
representing values relating to various concepts such as chemical extension for a concept if the concepts “water pollutant” and
“parts per million” were already present in the ontology being Figure 3 shows a fragment of an example compositional
extended. environmental ontology closely related to the one we used in the
In contrast, unsafe extensions are “unsafe” because they are EDEN application. This ontology used subsets of the GPS and
harder to share with other agents, as the other agents may not Measurement ontologies, composed together with a fragment of
maintain the information necessary to populate instances of the the environmental ontology. The necessary concepts for this
unsafe classes. For example, adding a new concept, “Water particular application span all three ontologies. The gray boxes
pollutant” with a new property, “concentration” would not be a indicate the three ontologies, and the white boxes within indicate
safe extension if there were no concept of containing medium in the classes taken from the individual ontology into the
the original ontology. Other agents, committed to the ontology, compositional ontology. Note that the attributes from the
may not have the information necessary to provide a “Sampling Point” class refer to concepts in the other ontologies.
concentration for each pollutant they have, or may have different
5. IMPLEMENTATION ISSUES
ways of modeling the concept of how badly the pollutant is
Given that we allow the operations of subsetting, composition
contaminating the water.
and (safe) extension over ontologies within an agent system, the
Because unsafe extensions allow agents to instantiate classes that agents themselves need to be able to represent exactly what
have been extended unsafely and thus cannot easily be shared; combination they are committed to, in order to ensure that they
these extensions limit the openness of the agent. However, do not mislead others with respect to their capabilities.
unsafe extensions are sometimes necessary; for instance, they are
There are three issues with representation and implementation.
often required to deal with some of the impedance mismatch
The first is that the agent needs to be able to represent internally
issues discussed earlier. Allowing unsafe extensions increases
the full set of details on how the ontology fragments it is using fit
the possibility that different agents will create divergent
together. This can be done with a suitable abstract ontology
extensions to the ontology, and become incompatible with one
representation. Secondly, the agent must be able to converse in
another. Therefore, unsafe extensions should be treated with
some detail with other agents about the parts of the ontologies
caution. One approach is to limit such extensions to those that
that they understand. This must be done using a suitable
are monotonic [16] -- not requiring modifications to existing
exchange representation. Thirdly, messages should be able to
ontological elements. Idealistically, the need for specific unsafe
represent the ontology fragments that the message covers. This
extensions should be propagated to the ontology designers, who
requires a more specific ontology field within the agent
can determine whether or not there is a consensus on the need for
communication language.
the particular extension, and whether they should be incorporated
into the next version. 5.1 Internal Representation
Internally, an agent needs to understand in detail the specific
4.4 Example schema-level knowledge to which it is committed. This
GPS Ontology understanding may be complicated by the fact that ontological
Locator knowledge may be represented using many different
isa representational models; for instance, there are subject-verb-
canonical unit object models such as DAML, frame-oriented models such and
Lat/Long GeoLocation OKBC, and object-oriented models such as UML [14, 1]. These
unit all differ in key issues such as whether or not attributes are first-
class objects, whether or not multiple inheritance is allowed, and
EDEN Ontology location how strongly-typed classes must be. Further complicating this, if
the operations of subsetting, composition and extension are used,
Measured atPoint Sampling a single application may look into a set of ontologies whose
Contamination Point underlying ontological models may diverge.
depth Recent discussions have included the notion of developing
abstract ontology representations for the internal representation
Measurement of ontological knowledge. As with all applications that attempt to
Distance span multiple viewpoints, key choices in specifying an abstract
Ontology
isa ontology representation revolve around whether it should
canonical unit incorporate every modeling features present in some ontology
Meter Quantity representation, or it should only look at the modeling features
unit used by every representation, or it should take some path in the
isa unit middle. Wilmott et.al. [15] discuss in depth some of the issues
Unit of associated with the specification and implementation of an
isa abstract ontology representation from a theoretical standpoint.
Measure Foot
Fortunately, an abstract ontology representation is implemented
within an ontology service, and thus only needs to incorporate the
Figure 3. A part of the EDEN compositional ontology. requirements of the agents that it serves. In the InfoSleuth
system, our ontology service focused on frame-based ontologies,
as they were most compatible with the databases with which we defined is a potentially huge superset of what is actually needed
were working. by the agent. Furthermore, it may not be possible to relate the
Internally, an agent that is using ontologies need only store the terms in the new ontology back to the ontologies that it imported,
subsets of the ontology that it needs, and the glue that puts them so it may be computationally difficult to re-relate terms imported
together. So, for instance, in the ontology of Figure 2, the agent from the same ontology into two such composed ontologies.
itself only needs to understand the concepts in the shaded areas DAML+OIL, on the other hand, uses namespaces to facilitate the
of the GEMET, GPS Positioning, Facility and EDR ontologies, as combination of ontologies and the relating of concepts among
well as the additional information associated with their multiple ontologies. However, this ability to use namespaces
composition and extension. Managing this information in an relies on having the component ontologies’ contents also within a
efficient way is the work of the ontology service. namespace.
Our experience has shown us that a good abstract ontology Extension: Safe extensions by definition should be able to be
representation and a good ontology service are essential to cope specified in terms of logical axioms over existing concepts in the
with the incorporation of existing ontologies and their ontology. While axioms are strongly considered among ontology
evolutionary steps. The abstract ontology representation designers, many ontology exchange languages hardly consider
engenders a unified methodology by which the agent can absorb them at all; for instance, DAML+OIL at the moment has no good
existing ontology information and its particular use within the foundation for representing axioms and is thus unsuitable for
application. Agent implementers then use the ontologies in a exchanging knowledge concerning safe extensions.
more declarative and less hard-wired manner, which in turn In EDEN, we prepared a number of ad hoc lexical translation
facilitates the incorporation of new ontological information. processes to translate between an old version and an extended
version of the ontology. This did not yield a reusable method.
5.2 External Representation
Agents need to be able to share which parts of the different 5.3 Representation in ACL Messages
ontologies that they use or require, and how they put them The last area of application impact concerns the communication
together, with their other collaborating agents. This happens of ontological reference information in ACL messages. Currently,
when the agent advertises its capabilities to another agent, when agent communication languages such as the FIPA ACL [5] carry a
an agent is looking to locate another agent that can help it with specification of the ontology from which the message content
some task, or when agents are negotiating over who is takes its concepts in an “:ontology” property. Compositional
responsible for what subtasks. We have identified three basic ontologies do not map directly to a single ontology, so the
ways that ontologies may be adapted dynamically: subsetting, message format needs to change in one of the following ways:
composition, and extension: 1. Give the compositional ontology a name and make it
Subsetting: The notion of subsetting an ontology is not reflected explicitly available, and use this in the message’s :ontology
in the definition of the ontology itself, but rather in artifacts property.
related to the use of the ontology. For example, fragments may be 2. Specify all ontologies using multiple :ontology properties.
advertised from one agent to another, so that agents can inform This may not cover concepts in safe extensions to the
other agents within their community of their exact capabilities. ontology.
Alternatively, an application may be supported at the user end by
an agent that interfaces with the user, ensuring that the user is 3. Allow the ontology field to contain a description of the
presented only with the fragments of the ontology that are concepts in the in the ontologies, including extensions, that
supported within the underlying agent community. Any user are used in the message. This could result in unnecessary
interfaces can then be tailored to the ontology fragment. performance impact on communicative actions.
Composition: As with subsetting, composition should be 6. CONCLUSIONS
specified at the time of use, however, there are some Ontological commitment is the decision by a particular group of
representational difficulties at this point. While subsetting can be applications or users within an application domain to use the
defined as a set of axioms that overlays a single ontology, terms defined in a given ontology. High ontological commitment
composition spans multiple ontologies and thus presents different occurs when many users and/or applications within an
representational challenges. For instance, a composition may application domain commit to sharing the same vocabulary of
wish to state that the values of the “LatitudeMeasure” slot for the concepts, meanings and relationships defined within a specific
“FacilityIdentification” in the EDR may be understood by the ontology. Ontological commitment to a common set of ontologies
agent as represented in the “LatLongCoordinate” units defined in is a key feature to provide for openness in an agent system, as it
the “GPS Coordinates” ontology. In EDEN, we defined a provides a common vocabulary over which agents can converse.
geographic location as comprising a (selectable) coordinate
This paper focused mainly on two areas relating to ontological
scheme and a coordinate value.
commitment and use. The first area is the clash of goals between
OKBC and DAML present different challenges with respect to ontology definers, application developers and users. Ontology
composition. Within OKBC, composition must be done by definers are concerned with the completeness and purity of their
explicitly importing the component ontologies. Existing tools do ontological design. Application developers are often concerned
not necessarily facilitate importing ontologies from remote only with specific subsets of the ontology that relate directly to
locations, or ones that are represented in different languages the application, and with application-specific representational
(e.g., DAML). The resulting new ontology, even if it can be
and computational issues. Users are concerned mainly with [4] “EPA Environmental Data Registry”. http://www.epa.gov/edr,
sticking with known and familiar terminology and vocabularies, 2002.
which may not be logically structured and therefore may not be [5] FIPA. “Agent Communication Language Specifications”.
amenable to being reformulated into computationally-accessible http://www.fipa.org/repository/aclspecs.html, 2001.
concepts. This clash is complicated by the widening gap between
these groups – developers, for instance, may be using ontologies [6] “General Multilingual Environmental Thesaurus”.
that are already well-established. Thus, while several good http://www.nu.niedersachsen.de/cds/etc-
design paradigms involving ontology designers, application cds_neu/library/select.html
developers, and users, exist for initial ontology development [7] Humphreys, B.L. et.al. “Assessing and Enhancing the Value
[16,17,18], none of these seem to encompass these more long- of the UMLS Knowledge Sources”. In Proceedings of the 15th
term concerns. We recommend the development of an extended Annual Symposium on Computer Applications in Medical Care,
ontology lifecycle that fosters evolution. Components required to pp. 78-82, Washington, D.C., November, 1991.
support this lifecycle should include long-term ontology design
[8] Nodine, M. et.al. “Active Information Gathering in
support, a widely-available feedback channel from developers
InfoSleuth.” International Journal of Cooperative Information
and users back to designers, and an open forum for discussing
issues and extensions to the ontology. Systems 91/2, 2000, pp.3-28.
An application committed to reuse existing ontologies may [9] “Welcome to the OIL Page”.
encounter several difficulties, as search for the perfect ontology http://www.ontoknowledge.org/oil, 2002.
for the application – one that is acceptable to both the application [10] “OKBC Home Page”.
and its users – is not necessarily fruitful. We described a http://www.ksl.stanford.edu/software/OKBC, 2002.
compositional approach to ontological use. Compositional [11] Sowa, J. F. Knowledge Representation. Brooks/Cole,
ontologies base themselves in existing ontologies as much as California, 2000, pp. xi-xii.
possible, using the operations of ontological subsetting,
ontological composition and ontological extension to tailor them [12] “EPA Terminology Reference System.”
to the specific needs of the applications. Compositional http://www.epa.gov/trs/index.htm , 2002.
ontologies fit well with the needs of the application, and the [13] Tuttle, M. S. et al., “Merging terminologies”. Medinfo.
approach has the potential to raise the level of ontological 1995;8 Pt 1:162-6.
commitment to existing ontologies. However, they require more
[14] Object Management Group. OMG Unified Modeling
sophisticated ontology-related exchanges among collaborating
Language Specification, version 1.3.
agents. In order to incorporate these compositional ontologies,
http://www.omg.org/technology/documents/formal/unified_model
further work needs to be done on the development of a more
ing_language.htm. 2000.
formal algebra to support the subset, compose and extend
operations over ontologies (in contrast to our current ad-hoc [15] Willmott, S., Constantinescu, I. and Callisti, M.
methods). Visser and Cui [19] attacked a related problem of “Multilingual Agents: Ontologies, Languages and Abstractions.”
heterogeneous ontology structures, and this may provide further In Proceedings of the First International Workshop on
insight into this formal algebra. As a second task, we need to Ontologies in Agent Systems, Autonomous Agents 2001,
develop a methodology for ensuring the correctness and Montreal, Canada, May, 2001.
consistency of these compositional ontologies. [16] Gruber, T.R. “A Translation Approach to Portable Ontology
Many approaches to the representation and specification of Specifications.” Knowledge Acquisition 5(2), 1993.
ontologies exist; some common ones for the exchange of [17] Gomez-Perez, A., “Ontological Engineering: A State of the
ontological information include OKBC, DAML+OIL, OIL and its Art.” Expert Update, 1999.
varieties, and UML. Each of these has features amenable to their
adoption as a means to exchange information on ontological [18] Gruninger, M and Fox, M. S. “Methodology for the Design
components and extensions, though none cover all of the issues and Evaluation of Ontologies.” In Proceedings of the Workshop
addressed in this paper. Therefore, the exchange of such on Basic Ontological Issues and Knowledge Sharing, 1995.
information using these standards is still problematic. [19] Hammer, J. and McLeod, D. “An Approach to Resolving
Semantic Heterogeneity in a Federation of Autonomous,
7. REFERENCES Heterogeneous Database Systems.” In International Journal of
[1] Cranefield, S., Haustein, S and Purvis, M. “UML-based Intelligent and Cooperative Information Systems 2(1), 1993.
ontology modeling for software agents”. In Proceedings of the [20] Kent, W. “The Many Forms of a Single Fact”. In
workshop on Ontologies in Agent Systems, Autonomous Agents Proceedings of IEEE COMPCON, 1998.
2001, Montreal, Canada, 2001.
[2] DAML.org. http://www.daml.org, 2002.
[3] Deschaine, L. M., Brice, R. S. and Nodine, M. H. “Use of
InfoSleuth to Coordinate Information Acquisition, Tracking and
Analysis in Complex Applications.” In Proceedings of Advanced
Simulation Technologies Conference, April, 2000.