=Paper=
{{Paper
|id=Vol-1/paper-7
|storemode=property
|title=Do we need the closed world assumption in knowledge representation?
|pdfUrl=https://ceur-ws.org/Vol-1/hustadt-long.pdf
|volume=Vol-1
|authors=U. Hustadt
}}
==Do we need the closed world assumption in knowledge representation?==
<pdf width="1500px">https://ceur-ws.org/Vol-1/hustadt-long.pdf</pdf>
<pre>
        Do we need the closed-world assumption in knowledge
                          representation?
                                            Ullrich Hustadt
                                  Max-Planck-Institut fur Informatik
                                 Im Stadtwald, D-66123 Saarbrucken
                                   e-mail hustadt@mpi-sb.mpg.de

1   Introduction                                         typically used in the construction of the knowledge
Database systems and knowledge representation sys-       base of a reasoning agent. A knowledge base can
tems represent and reason about some aspect of the       be thought of as representing the beliefs of such
real world. In both it is common to separate the         an agent. One of the most prominent knowledge
two functions of representation, i.e. describing the     representation formalisms is kl-one [Brachman and
conceptual scheme and the actual data, and compu-        Schmolze,1985] which has been used in the construc-
tation, i.e. answering of queries and manipulation of    tion of natural language processing systems.
data.                                                       The knowledge representation language of kl-one
   The database management system of a database          and all it's derivates can be considered as a subset
system provides a data definition language to de-        of rst-order logic with equality. With respect to
scribe the conceptual scheme. The data de nition         describing structural properties of objects and con-
language is used to describe the database in terms       ceptual schemes they are more expressive than the
of a data model. Operations on the database re-          data de nition languages corresponding to the rela-
quire a specialized language, called a data manipu-      tional or object-oriented model.
lation language or query language. One of the most          In the late eighties inference in kl-one was shown
important data models is the relational model which      to be undecidable [Schmidt-Schauss,1989]. Since
describes the world in terms of atomic values and re-    then the emphasis in research has been on devel-
lations on the set of all atomic values. Data manip-     oping and investigating systems that are computa-
ulation languages of the relational model comprise       tionally well behaved, i.e. are tractable or at least
the relational algebra, and the domain and tuple re-     decidable [Brachman et al.,1991; Donini et al.,1991;
lational calculi. The object-oriented model supports     Buchheit et al.,1993]. As a result many commonly
a more elaborated description of the world by allow-     used knowledge representation languages have re-
ing complex objects, i.e. objects constructed using      stricted expressiveness and are in their current form
record formation and set formation, classes, i.e. ab-    no longer suitable for natural language applications.
stract data types describing methods, which are op-      They are still more expressive than data de nition
erations to be performed on the objects, and class       languages, but the question can be risen whether
hierarchies.                                             there is an application needing this additional ex-
   The data manipulation languages of these data         pressive power.
models are based on the following assumptions.              Nevertheless, data manipulation languages and
                                                         query languages of knowledge representation for-
The closed-world assumption                              malisms di er considerably in their underlying as-
    which says that all information that is not true     sumptions.
    in the database is considered as false.              The open-world assumption
The unique-name assumption                                   which says that there can be true facts that are
    which says that two distinct constants (either           not contained in the knowledge base.
    atomic values or objects) necessarily designate      The unique-name assumption
    two di erent objects in the universe.                    which says that two distinct constants (either
The domain-closure assumption                                atomic values or objects) necessarily designate
     which says that there are no other objects in the       two di erent objects in the universe.
     universe than those designated by constants of      The open-domain assumption
     the database.                                           which says that there can be more objects in the
These assumptions are important to understand the            universe than those designated by constants in
way computations are performed in databases.                 the knowledge base unless a constraint in the
  Knowledge representation formalisms are aimed              knowledge base prevents this.
to represent general conceptual information and are      That means, that even if the data de nition language
    Acknowledgments: This work has been supported       and the data manipulation language of a database
by the German Ministry for Research and Technology       management system and a knowledge base manage-
(BMFT) under grant ITS 9102 (Project Logo). Respon-      ment system would coincide, the results of data ma-
sibility for the contents lies with the author.          nipulations would di er.
   In the next section I will give some examples that     following way.
show the usefulness of closed-world inferences in nat-     (?lambda ?x car(inst: ?x) or
ural language processing. Thus knowledge represen-
tation languages sticking to the open-world assump-                         truck(inst: ?x))                  (4)
tion seem to be insucient for natural language pro-      An expression of the (?lambda ?x P ) denotes the
cessing.                                                  set of all ?x satisfying P . The answer we have to
                                                          infer from the knowledge base is that veh1 and veh2
2   Query answering in Natural                            both belong to this set.
    Language Processing                                      Obviously, this answer cannot be computed by the
                                                          comment and question handler without taking dec-
In cooperation with the PRACMA Project1 (De-              laration (1) into account. For instance, it is not pos-
partment of Computer Science, University of Saar-         sible to nd the correct answer to (4) by computing
brucken) we have been developing a suitably ex-          the answer sets for (?lambda ?x car(inst: ?x))
tended knowledge representation system, called mo-        and (?lambda ?x truck(inst: ?x)) and to return
tel [Hustadt and Nonnengart,1993], which is in-           the union of the resulting sets as an answer.
tended to be a module of the pracma system. The              A question of the buyer concerning which objects
PRACMA Project [Jameson et al.,1994] is concerned         do not belong to the set of trucks is translated into
with the modeling of noncooperative information-          the following NLL expression.
providing dialogues. An example from pracma's
domain is the dialogue between a person S trying to        (?lambda ?x not car(inst: ?x))                     (5)
sell her used car to a potential buyer B. Naturally,      Whereas the closed-world assumption would allow
the goals of S con ict in part with those of B.           us to infer that veh1 belongs to this set, the open-
   In the nal implementation, the natural language        world assumption underlying NLL doesn't support
analysis module of the pracma system will use             this conclusion.
the semantic representation language NLL [Laub-              The question whether all cars are vehicles can also
sch and Nerbonne,1991] to represent the German-           be formulated in NLL. To answer this question we
language input strings. The resulting NLL expres-         can try to infer
sions will be stored in the pragmatic dialogue mem-
ory. Various modules will process the content of the       (forall ?x vehicle(inst: ?x) if
dialogue memory, the most important one for us is                          car(inst: ?x))                    (6)
the comment and question handler. The result of           from the knowledge base. The answer to this ques-
this module is transfered to the natural language         tion has to be independent of the constants currently
generator which is responsible for verbalizing NLL        occurring in our knowledge base. On the basis of
expressions.                                              declaration (1), the answer has to be positive.
   NLL contains a rst-order logic core with anadic
predicates, generalized quanti ers, plural reference         Now let us assume that the left front seat of veh2
expressions, and -abstraction. To t the pur-             is red. Choosing lfseat to designate the left front
poses of pracma the language has been extended            seat, this can be represented in the following way.
by modal operators.                                         hasPart(inst: veh2, theme: lfseat)               (7)
   Suppose the knowledge base of the car seller S           seat(inst: lfseat)                               (8)
contains declarations de ning that vehicles are either
cars or trucks, veh1 is a truck, and veh2 is a vehicle.     hasColour(inst: lfseat, theme: red)              (9)
This can be represented in NLL in the following           To answer the question whether all seats of veh2
way.                                                      are red we have to try to infer the following NLL
 (forall ?x vehicle(inst: ?x) iff                         expression.
               (car(inst: ?x) or                           (forall ?x
                truck(inst: ?x))                  (1)                hasColour(inst: ?x, theme: red)
  truck(inst: veh1)                               (2)                if hasPart(inst: veh2, theme: ?x)
  vehicle(inst: veh2)                             (3)                      and seat(inst: ?x))             (10)
Here veh1 and veh2 are constants, vehicle, car,           Because of the open-domain and open-world as-
and truck are predicate symbols. In NLL argu-             sumption, the answer to the question cannot be pos-
ments of predicates are identi ed via keywords, e.g.      itive. Although the only seat the car seller knows to
inst, rather than positions in argument vectors.          be part of veh2 is actually red, there may be other
Any identi er preceded by a question mark, e.g.           seats of veh2 and these seats may not be red.
?x, is a variable. In addition we have used the              Intuitively, a positive answer is much more plau-
boolean operators iff (equivalence) and or (disjunc-      sible. We would assume that the car seller knows all
tion), and the universal quanti er forall in decla-       the seats of veh2 and knows the colour of every seat
ration (1).                                               of veh2. It is possible to extend the knowledge base
   Now a question of the buyer concerning which ob-       using number restrictions in such a way that we can
jects are either cars or trucks is represented in the     infer a positive answer, e.g.
   1
     PRACMA is short for `PRocessing Arguments be-         ((= 1) ?x hasPart(inst: veh2, theme: ?x)
tween Controversially Minded Agents.'                                  and seat(inst: ?x))                  (11)
declares that veh2 has exactly one seat. decla-          [Brachman et al., 1991] Ron J. Brachman, Debo-
rations (7),(8),(9), and (11) taken together allow         rah L. McGuinness, Peter F. Patel-Schneider, and
us to answer query (10) positively. However, it            A. Borgida. Living with classic: When and how
seems to be more natural to extend the language            to use a kl-one-like language. In J. F. Sowa,
by an epistemic modal operator in the style of Lif-        editor, Principles in Semantic Networks: Explo-
schitz [Lifschitz,1991] to solve the problem. For a        rations in the Representation of Knowledge, pages
description of an extension of the knowledge repre-        401{456. Morgan Kaufmann, San Mateo, Califor-
sentation language ALC by an epistemic operator            nia, 1991.
refer to Donini et al. [Donini et al.,1992].             [Buchheit et al., 1993] M. Buchheit, F. M. Donini,
   Suppose our language contains such an epistemic         and A. Schaerf. Decidable reasoning in terminolo-
operator K. Then we have two possibilities to get a        gical knowledge representation systems. Research
positive answer to the question. The rst possibility       Report RR-93-10, Deutsches Forschungszentrum
is to reformulate the question slightly in the follow-     fur Kunstliche Intelligenz, Saarbrucken, Germany,
ing way.                                                   1993.
 (forall ?x                                              [Donini et al., 1991] F. M. Donini, M. Lenzerini,
      hasColour(inst: ?x, theme: red) if                   D. Nardi, and W. Nutt. The complexity of concept
      K(hasPart(inst: veh2, theme: ?x)                     languages. In J. Allen, R. Fikes, and E. Sandewall,
         and seat(inst: ?x)))                   (12)       editors, Proceedings of the Second International
                                                           Conference on Principles of Knowledge Represen-
Now the question is whether all known seats of veh2        tation and Reasoning, pages 151{162, Cambridge,
are red and the answer has to be positive. This            USA, April 22{25 1991. Morgan Kaufmann.
approach causes the problem how the natural lan-         [Donini et al., 1992] F. M. Donini, M. Lenzerini,
guage analysis module should determine the epis-           D. Nardi, A. Schaerf, and W. Nutt. Adding
temic character of question (12) opposed to the non-       Epistemic Operators to Concept Languages. In
epistemic character of question (6).                       B. Nebel, C. Rich, and W. Swartout, editors, Pro-
  The second possibility is to add a declaration of        ceedings of the Third International Conference on
the following form to the knowledge base                   Principles of Knowledge Representation and Rea-
 not (hasPart(inst: veh2, theme: ?x)                       soning, pages 342{353, Cambridge, USA, 1992.
       and seat(inst: ?x)) if                               Morgan Kaufmann.
    not K(hasPart(inst: veh2, theme: ?x)                 [Hustadt and Nonnengart, 1993] U. Hustadt and A.
          and seat(inst: ?x))                    (13)       Nonnengart. Modalities in knowledge representa-
                                                            tion. In Chris Rowles, Huan Liu, and Norman Foo,
This declaration allows to conclude that an object          editors, Proceedings of the 6th Australian Joint
is either not part of veh2 or not a seat if it is not       Conference on Arti cial Intelligence, pages 249{
known to be part of veh2 and a seat.                        254, Melbourne, Australia, 16{19 November 1993.
   Obviously, we are now able to turn our knowledge         World Scienti c.
base system into a database system either by suit-       [Jameson et al., 1994] Anthony Jameson, B. Kip-
ably adding epistemic operators to all the queries or       per, A. Ndiaye, R. Schafer, J. Simons, T. Weis,
by adding enough epistemic rules to the knowledge           and D. Zimmermann. Cooperating to be noncoop-
base. Therefore, the extension of knowledge repre-          erative: The dialog system pracma. To appear in
sentation languages with an epistemic operator is a         the Proceedings of the 18th Annual German Con-
  rst step to unify the database world and the knowl-       ference on Arti cial Intelligence, 1994. Springer.
edge base world.
                                                         [Laubsch and Nerbonne, 1991] J. Laubsch and J.
3    Future Work                                            Nerbonne. An Overview of NLL. Technical re-
It is well-known that theorem proving in a rst-             port, Hewlett Packard Laboratories, May 1991.
order language containing an epistemic operator is       [Lifschitz, 1991] Vladimir Lifschitz. Nonmonotonic
not even semi-decidable. Although the answers to            databases and epistemic queries. In Proceedings
the example questions presented in the previous sec-       of the Twelfth International Conference on Arti-
tion seem to be derived easily, there is no hope to nd      cial Intelligence, pages 381{386, Sydney, Aus-
a correct and complete inference mechanism which            tralia, August 24{30 1991. Morgan Kaufmann.
is able to deduce them.                                  [Schmidt-Schauss, 1989] M. Schmidt-Schauss. Sub-
   If we need a correct inference mechanism, the only       sumption in kl-one is Undecidable. In R. J.
possibility we have is to restrict the knowledge rep-       Brachman and H. J. Levesque, and R. Reiter, edi-
resentation language, i.e. we have to identify a de-        tors,Proceedings of the First International Confer-
cidable fragment of NLL to which we can add an             ence on Principles of Knowledge Representation
epistemic operator without loosing decidability.           and Reasoning, pages 421{431, Toronto, Canada,
References                                                 May 15{19 1989. Morgan Kaufmann.
[Brachman and Schmolze, 1985] Ron J. Brachman
  and J. G. Schmolze. An Overview of the kl-one
  knowledge representation system. Cognitive Sci-
  ence, 9(2):171{216, 1985.

</pre>