A modal reconstruction of Rational Closure Laura Giordano1, Valentina Gliozzi2 , Nicola Olivetti3 , and Gian Luca Pozzato2 1 Dipartimento di Informatica - Universitá del Piemonte Orientale “Amedeo Avogadro”, Alessandria, Italy laura@mfn.unipmn.it 2 Dipartimento di Informatica - Universitá degli Studi di Torino, Italy {gliozzi,pozzato}@di.unito.it 3 LSIS-UMR CNRS 6168, Université “Paul Cézanne” Aix-Marseille 3, France nicola.olivetti@univ-cezanne.fr Abstract. This paper provides a general semantic framework for nonmonotonic reasoning, based on a minimal models semantics on the top of KLM systems for nonmonotonic reasoning. This general framework can be instantiated in order to provide a semantic reconstruction within modal logic of the notion of rational closure, introduced by Lehmann and Magidor. We give two characterizations of rational closure: the first one in terms of minimal models where propositional in- terpretations associated to worlds are fixed along minimization, the second one where they are allowed to vary. In both cases a knowledge base must be expanded with a suitable set of consistency assumptions, represented by negated condition- als. The correspondence between rational closure and minimal model semantics suggests the possibility of defining variants of rational closure by changing either the underlying modal logic or the comparison relation on models. 1 Introduction In a seminal work Kraus Lehmann and Magidor [7] (henceforth KLM) proposed an ax- iomatic approach to nonmonotonic reasoning. Plausible inferences are represented by nonmonotonic conditionals of the form A |∼ B, to be read as “typically or normally A entails B”: for instance monday |∼ go work, “normally on Monday I go to work”. The conditional is nonmonotonic since from A |∼ B one cannot derive A ∧ C |∼ B, in our example, one cannot derive monday ∧ ill |∼ go work. KLM proposed a hierarchy of four systems, from the weakest to the strongest: cumulative logic C, loop-cumulative logic CL, preferential logic P and rational logic R. Each system is characterized by a set of postulates expressing natural properties of nonmonotonic inference. We present below an axiomatization of the two stronger logics P and R (C and CL being too weak to be taken as an axiomatic base for nonmonotonic reasoning). But before presenting the axiomatization, let us clarify one point: in the original presentation of KLM sys- tems, [7] a conditional A |∼ B is considered as a consequence relation between a pair of formulas A and B, so that their systems provide a set of “postulates” (or closure conditions) that the intended consequence relations must satisfy. Alternatively, these postulates may be seen as rules to derive new conditionals from given ones. We take a slightly different viewpoint, shared among others by Halpern and Friedman [4] (see Section 8) and Boutilier [2] who proposed a modal interpretation of KLM systems P and R: in our understanding these systems are ordinary logical systems in which a con- ditional A |∼ B is a propositional formula belonging to the object language. Whenever we restrict our consideration, as done by Kraus Lehmann and Magidor, to the entailment of a conditional from a set of conditionals, the two viewpoints coincide: a conditional is a logical consequence in logic P/R of a set of conditionals if and only if it belongs to all preferential/rational consequence relations extending that set of conditionals, or (in semantic terms), it is valid in all preferential/rational models (as defined by KLM) of that set. Here is the axiomatization of logics P and R, in our presentation KLM postu- lates/rules are just axioms. We use ⊢P C (resp. |=P C ) to denote provability (resp. va- lidity) in the propositional calculus . All axioms and rules of propositional logic A |∼ A (REF) if ⊢P C A ↔ B then (A |∼ C) → (B |∼ C), (LLE) if ⊢P C A → B then (C |∼ A) → (C |∼ B) (RW) ((A |∼ B) ∧ (A |∼ C)) → (A ∧ B |∼ C) (CM) ((A |∼ B) ∧ (A |∼ C)) → (A |∼ B ∧ C) (AND) ((A |∼ C) ∧ (B |∼ C)) → (A ∨ B |∼ C) (OR) ((A |∼ B) ∧ ¬(A |∼ ¬C)) → (A ∧ C) |∼ B) (RM) The axiom (CM) is called cumulative monotony and it is characteristic of all KLM logics, axiom (RM) is called rational monotony and it characterizes the logic of rational entailment R. The weaker logic of preferential entailment P contains all axioms, but (RM). P and R seem to capture the core properties of nonmonotonic reasoning, as shown in [4] they are quite ubiquitous being characterized by different semantics (all of them being instances of so-called plausibility structures). Logics P and R enjoy a very simple modal semantics, actually it turns out that they are the flat fragment of some well-known conditional logics. For P the modal semantics is defined by considering a set of worlds W equipped by an accessibility (or preference) relation < assumed to be transitive, irreflexive, and satisfying the so- called Smoothness Condition. For the stronger R < is further assumed to be modular. Intuitively the meaning of x < y is that x is more normal/less exceptional than y. We say that A |∼ B is true in a model if B holds in all most normal worlds where A is true, i.e. in all <-minimal worlds satisfying A. KLM systems formalize desired properties of nonmonotonic inference. However, they are too weak to perform useful nonmonotonic inferences. For instance KLM sys- tems cannot handle irrelevant information in conditionals: from monday |∼ go work, there is no way of concluding monday ∧ shines |∼ go work in any one of KLM systems. Partially motivated by the weakness of the axiomatic approach, Lehmann and Magidor have proposed a true nonmonotonic mechanism on the top of logic R called rational closure. Rational clsure on the one hand preserves the properties of R, on the other hand allows one to perform some truthful nonmonotonic inferences, like the one just mentioned (monday ∧ shines |∼ go work).4 The authors has given a syntactic procedure to calculate the set of conditionals entailed by the rational closure as well as a quite complex semantic construction. It is worth noticing that a strongly related construction has been proposed by Pearl [9] with his notion of 1-entailment, motivated by a probabilistic interpretation of conditionals. In this work we tackle the problem of giving a purely semantic characterization of rational closure, stemming directly from the modal semantics of logic R. Notice that we restrict our attention to finite knowledge bases. More precisely, we try to answer to the following question: given the fact that logic R is characterized by a specific class of Kripke models, how can we characterize the Kripke models of the rational closure of a set of positive conditionals? The characterization we propose may be seen as an instance of a general recipe for defining nonmonotonic inference: (i) fix an underlying modal semantics for condi- tionals (such as the one of P or R), (ii) obtain nonmonotonic inference by restricting semantic consequence to a class of “minimal” models according to some preference relation on models. The preference relation in itself is defined independently from the language and from the set of conditionals (knowledge base) whose nonmonotonic con- sequences we want to determine. In this respect our approach is similar in spirit to “minimal models” approaches to nonmonotonic reasoning, such as circumscription. The general recipe for defining nonmonotonic inference we have sketched may have a wider interest than that of capturing Lehmnan and Magidor’s rational closure. First of all, we may think of studying variants of rational closure based on other modal logics and/or on other comparison relations on models. Secondly, being a purely semantic approach (the preference relation is independent from the language), our semantics can cope with a larger language than the one considered in KLM framework. To this regard, already in this paper, we consider a richer language allowing boolean combinations of conditionals5. In the future, we may think of applying our semantics to Nonmonotonic Description Logics, where an extension of rational closure has been recently considered [3]. In any case, the quest of a modal characterization of rational closure turns out to be harder than expected. Our semantic account is based on the minimization of the height of worlds in models, where the height of a world is defined in terms of length of the <-chains starting from the world. Intuitively, the lower the height of a world, the more normal (or less exceptional)is the world and our minimization corresponds intuitively to the idea of minimizing less-normal or less-plausible worlds (or maximizing most plausible ones). 4 Actually the main motivation of Lehmann and Magidor leading to the definition of rational closure was technical: it turns out that the intersection of all rational consequence relations satisfying a set of conditionals coincides with the least preferential consequence relation sat- isfying that set, so that (i) the axiom/rule (RM) does not add anything and (ii) such relation in itself fails to satisfy (RM). Their notion of rational closure provides a solution to both prob- lems and can be seen as the “minimal” (in some sense) rational consequence completing a set of conditionals. 5 An extension of rational closure to knowledge bases comprising both positive and negative conditionals has been proposed in [1]. We begin by considering the nonmonotonic inference relation determined by re- stricting considerations to models which minimize the height of worlds. In this first characterization we keep fixed the propositional interpretation associated to worlds. The consequence relation makes sense in its own, but as we show it is strictly weaker than rational closure. We can obtain nonetheless a first characterization of rational closure if we further restrict attention to minimal canonical models that is to say, to models that contain all propositional interpretations compatible with the knowledge base K (i.e. all propositional interpretations except those that satisfy some formulas inconsis- tent with the knowledge base K). Restricting attention to canonical models amounts to expanding K by all formulas ¬(A |∼ ⊥) (read as “A is possible”, as it encodes S5 3A) for all formulas A such that K 6|=R A |∼ ⊥. We thus obtain a very simple and neat characterization of rational closure, but at the price of an exponential increase of the K. We then propose a second characterization that does not entail this exponential blow up. In analogy with circumscription, we consider a stronger form of minimization where we minimize the height of worlds, but we allow to vary the propositional interpretation associated to worlds. Again the resulting minimal consequence relation makes sense in its own, but as we show it still does not correspond to rational closure. In order to capture rational closure, we must basically add the assumption that there are “enough” worlds to satisfy all conditionals consistent with the knowledge base K. This amounts to adding a small set of consistency assumptions (represented by negative conditionals). In this way we capture exactly rational closure, without an exponential increase of the knowledge base. 2 General Semantics In KLM framework the language of both logics P and R consists only of conditionals A |∼ B. We consider here a richer language allowing boolean combinations of condi- tionals (and propositional formulas). Our language L is defined from a set of proposi- tional variables ATM . We use A, B, C, . . . to denote propositional formulas (not con- taining |∼), and F, G, . . . to denote arbitrary formulas. More precisely, the formulas of L are defined as follows: if A is a propositional formula, A ∈ L; if A and B are proposi- tional formulas, A |∼ B ∈ L; if F is a boolean combination of formulas of L, F ∈ L. A knowledge base K is any set of formulas: as already mentioned in this work we restrict our attention to finite knowledge bases. The semantics of P and R is defined respectively in terms of preferential and ra- tional6 models, that are possible world structures equipped with a preference relation <, intuitively x < y means that the world/individual x is more normal/ more typical than the world/individual y. The intuitive idea is that A |∼ B holds in a model if the most typical/normal worlds/individuals satisfying A satisfy also B. Preferential models presented in [7] characterize the system P, whereas the more restricted class of rational models characterize the system R [8]. 6 We use the expression “rational model” rather than “ranked model” which is also used in the literature in order to avoid any confusion with the notion of rank used in rational closure. Definition 1. A preferential model is a triple M = hW, <, V i where W is a non-empty set of items, < is an irreflexive, transitive relation on W satisfying the Smoothness relation defined below. V is a function V : W 7−→ 2ATM , which assigns to every world w the set of atoms holding in that world. If F is a boolean combination of formulas, its truth conditions (M, w |= F ) are defined as for propositional logic. Let A be a propositional formula; we define M inM < (A) = {w ∈ W | M, w |= A and ∀w , ′ ′ ′ ′ ′ w < w implies M, w 6|= A}. We also define M, w |= A |∼ B if for all w , if w ∈ M inM ′ < (A) then M, w |= B. Last we define the Smoothness Condition: if M, w |= A, then w ∈ M in< (A) or there is w′ ∈ M inM M ′ < (A) such that w < w. Validity and satisfiability of a formula are defined as usual. Given a set of formulas K of L and a model M = hW, <, V i, we say that M is a model of K, written M |= K, if, for every F ∈ K, and every w ∈ W, we have that M, w |= F . K preferentially entails a formula F , written K |=P F if F is valid in all preferential models of K. Since we limit our attention to finite knowledge bases, we can restrict our attention to finite models, as the logic enjoys the finite model property (observe that in this case the smoothness condition is ensured trivially by the irreflexivity of the preference relation). From Definition 1, we have that the truth condition of A |∼ B is “global” in a model M = hW, <, V i: given a world w, we have that M, w |= A |∼ B if, for all w′ , if w′ ∈ M inM ′ < (A) then M, w |= B. It immediately follows that A |∼ B holds in w if only if A |∼ B is valid in a model, i.e. it holds that M, w′ |= A |∼ B for all w′ in W; for this reason we will often write M |= A |∼ B. Moreover, when the reference to the model M is unambiguous, we will simply write M in< (A) instead of M inM < (A). Definition 2. A rational model is a preferential model in which < is further assumed to be modular: for all x, y, z ∈ W, if x < y then either x < z or z < y. K rationally entails a formula F , written K |=R F if F is valid in all rational models of K. When the logic is clear from the context we shall write K |= F instead of K |=P F or K |=R F . From now on, we restrict our attention to rational models. Definition 3. The height kM of a world w in M is the length of any chain w0 < . . . < w from w to a w0 such that for no w′ it holds that w′ < w0 7 . Notice that in a rational model hW, V, 0) of a world x is determined by the height kM (C) of the antecedents C of conditionals falsified by x. Given a model M = hW, <, V i and x ∈ W, we define Sx = {C |∼ D ∈ K | M, x |= C ∧ ¬D}. Proposition 2. Let K be a knowledge base and M a model, then M |= K if and only if M satisfies the following, for every x ∈ W: 1. if kM (x) = 0 then Sx = ∅ 2. if Sx 6= ∅, then kM (x) > kM (C) for every C |∼ D ∈ Sx . Observe that condition 1 is a consequence of condition 2, since by 2 if Sx 6= ∅ then trivially kM (x) > 0; we have explicitly mentioned it for clarity (see the subsequent proposition and theorem, whose proofs can be found in [6]). Proposition 3. Let K be a knowledge base and let M be a minimal model of K with respect to FIMS ; then M satisfies for every x ∈ W: 1. if Sx = ∅ then kM (x) = 0. 2. if Sx 6= ∅, then kM (x) = 1 + max{kM (C) | C |∼ D ∈ Sx }. Theorem 1. Let K be a knowledge base and let M be any model, then M is a FIMS minimal model of K if and only if M satisfies for every x ∈ W: 1. Sx = ∅ iff kM (x) = 0. 2. if Sx 6= ∅, then kM (x) = 1 + max{kM (C) | C |∼ D ∈ Sx }. In our second semantics, we let the interpretations vary. The semantics is called variable interpretations minimal semantics, for short VIMS . Definition 6 (VIMS ). Given M = hW, <, V i and M′ = hW ′ , <′ , V ′ i we say that M is preferred to M′ with respect to the variable interpretations minimal semantics, and write M 0, Ci = E(Ci−1 ). Observe that, being K finite, there is a n ≥ 0 such that for all m > n, Cm = Cn or Cm = ∅. Definition 8. A propositional formula A has rank i for K iff i is the least natural num- ber for which A is not exceptional for Ci . If A is exceptional for all Ci then A has no rank. The notion of rank of a formula allows to define the rational closure of a knowledge base K. Definition 9. Let K be a conditional knowledge base. The rational closure K̄ of K is the set of all A |∼ B such that either (1) the rank of A is strictly less than the rank of A ∧ ¬B (this includes the case A has a rank and A ∧ ¬B has none), or (2) A has no rank. The rational closure of a knowledge base K seemingly contains all conditional asser- tions that, in the analysis of nonmonotonic reasoning provided in [8], one rationally wants to derive from K. For a full discussion, see [8]. Can we capture rational closure within our semantics? A first conjecture might be that the FIMS of Definition 5 could capture rational closure. However, we are soon forced to recognize that this is not the case. For instance, Example 1 above illustrates that {penguin |∼ bird, penguin |∼ ¬f ly, bird |∼ f ly} 6|=FIMS penguin ∧ black |∼ ¬f ly. On the contrary, it can be easily verified that penguin ∧ black |∼ ¬f ly is in the rational closure of {penguin |∼ bird, penguin |∼ ¬f ly, bird |∼ f ly}. Therefore, FIMS as it is does not allow us to define a semantics corresponding to rational clo- sure. Things change if we consider FIMS applied to models that contain all possible valuations compatible with a given knowledge base K. We call these models canonical models. 8 In [8], |=P is used instead of |=R . However when K contains only positive conditionals the two notions coincide (see footnote 1) and we prefer to use |=R here since we consider rational models. Example 2. Consider Example 1 above. If we restrict our attention to models that also contain a w with V (w) = {penguin, bird, black} which is a black penguin that does not fly and can therefore be assumed to be a typical penguin, we are able to conclude that typically black penguins do not fly, as in rational closure. Indeed, in all minimal models of K that also contain w with V (w) = {penguin, bird, black}, it holds that penguin ∧ black |∼ ¬f ly. We are led to the conjecture that FIMS restricted to canonical models could be the right semantics for rational closure. Fix a propositional language LProp comprising a finite set of propositional variables ATM , a propositional interpretation v : ATM −→ {true, f alse} is compatible with K, if there is no formula A ∈ LProp such that v(A) = true and K |=R A |∼ ⊥. Definition 10. A model M = hW, <, V i satisfying a knowledge base K is said to be canonical if it contains (at least) a world associated to each propositional interpretation compatible with K, that is to say: if v is compatible with K, then there exists a world w in W, such that for all propositional formulas B M, w |= B iff v(B) = true. It can be easily shown that: Theorem 2. For a given domain W, there exists a unique canonical model M for K over W such that, for all other canonical models M′ over W, we have M