=Paper=
{{Paper
|id=Vol-1577/paper_14
|storemode=property
|title=Forgetting Concept and Role Symbols in ALCOIH\mu+(\top,\sqcap)-Ontologies
|pdfUrl=https://ceur-ws.org/Vol-1577/paper_14.pdf
|volume=Vol-1577
|authors=Yizheng Zhao,Renate A. Schmidt
|dblpUrl=https://dblp.org/rec/conf/dlog/ZhaoS16
}}
==Forgetting Concept and Role Symbols in ALCOIH\mu+(\top,\sqcap)-Ontologies==
Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies Yizheng Zhao and Renate A. Schmidt The University of Manchester, UK Abstract. Forgetting is a non-standard reasoning problem that is con- cerned with creating restricted views of ontologies relative to a subset of the initial signature and preserving pertinent logical consequences up to the symbols in the restricted views. In this paper, we present an Ackermann-based approach for forgetting of concept and role symbols in ontologies expressible in the description logic ALCOIHµ+ (O, u). The method is one of only few approaches that can eliminate role symbols, that can handle role inverse and ABox statements (via nominals), and the only approach so far providing support for forgetting in description logics with nominals. Despite the inherent difficulty of forgetting for this level of expressivity, performance results with a prototypical implemen- tation have shown very good success rates on real-world ontologies. 1 Introduction This paper presents a practical forgetting method for creating restricted views of ontologies expressed in the language of the description logic ALCOIHµ+ (O, u). The work is motivated by the high demand for advanced techniques for ontology- based knowledge processing. Research of forgetting, often referred to as uniform interpolation (UI) in this area (or, variable elimination, projection or second- order quantifier elimination in knowledge representation and logic) has gained significant momentum since the work of various groups developing forgetting methods for the description logic ALC and logics weaker than ALC, e.g., [11, 18, 19, 21, 25–27], and the foundational studies of the properties of forgetting for description logics by, e.g., [10, 19, 20, 28]. These works give arguments for the important role of forgetting in realising various tasks that are crucial for effective processing and management of ontologies. For example, forgetting can be used for ontology analysis and summary, for ontology reuse, for information hiding, for computing the logical difference between ontologies, for ontology debugging and repair, and for query answering. Early work in the area primarily focused on forgetting concept symbols, as role forgetting was realised to be significantly harder than forgetting of concept symbols [28]. The dominant reason is probably that the result of forgetting role symbols may not always be expressible in the source language. Although the earliest work did study concept and role forgetting, e.g. [25], most subsequent work considered only concept forgetting with the exception of [12–15]. Their method can perform both concept and role forgetting for description logics ex- tending ALC up to and including description logics with the expressiveness of 2 Yizheng Zhao and Renate A. Schmidt SH, SIF and SHQ. In addition they have extended their method to forgetting for description logics with ABoxes (for logics between ALC and SHI) [16]. The work on forgetting for description logics is predated by work on second- order quantifier elimination [5–7], which can be traced to questions posed by Boole and seminal work of [1]. These works triggered and influenced work in knowledge representation [17], but also led to striking results in the automation of correspondence theory of modal logics [3, 22]. Because of the close relationship between description logics and modal logics, besides the work on modal corre- spondence theory, investigations of UI for modal logics [4, 9, 24] are relevant. These are particularly related to concept forgetting, but not to role forgetting. The contribution of this paper is an approach for forgetting of concept and role symbols in expressive description logics not considered so far. The method accommodates ontologies expressible in the description logic ALCOIH and the extension allowing positive occurrences of the least fixpoint operator µ, the top role O and role intersection u. This means the method is one of only few ap- proaches that can eliminate role symbols, while also handling role inverse and ABox statements via nominals, and the only approach so far providing support for forgetting in description logics with nominals. Being based on the Ackermann approach to second-order quantifier elimination [5, 22, 29] the method terminates always. We have shown the method is correct, i.e., the forgetting solution com- puted is equivalent to the original ontology up to the symbols that have been eliminated. A general problem of forgetting is marking out a language that is expressive enough to allow for solutions to be expressed by finitely many formu- las. Completeness results are rare and become harder to achieve with increased expressivity, unless one is willing to admit second-order quantification, for which the forgetting problem becomes trivially solvable. Despite our method not be- ing complete, results of an evaluation with a prototypical implementation have shown high success rates on real-world ontologies of various sizes. 2 The Description Logic ALCOIHµ+ (O, u) Let NC , NR and NO be mutually disjoint sets of concept symbols (names), role symbols (names) and individuals, respectively. Let Nµ be a set of concept vari- ables disjoint with NC , NR and NO . Concepts in ALCOIH(O, u) have one of the following forms: a | > | A | ¬C | C t D | ∀R.C, where a ∈ NO , A ∈ NC , C and D are any ALCOIH(O, u)-concepts, and R is any role expression. Role expressions in ALCOIH(O, u) can be the top role O, a role symbol r ∈ NR , the inverse r− of a role symbol r, or a conjunction of these. The language of ALCOIHµ+ (O, u) ex- tends that of ALCOIH(O, u) with atomic least fixpoint expressions of the form µX.C[X], where X ∈ Nµ , and C[X] is a concept expression in which X occurs only positively (under an even number of negations). Moreover, µ-expressions may occur only positively. Because of this restriction, µ-expressions can be sim- ulated in ALCOIH(O, u) with auxiliary concept symbols as in [13]. The semantics of the ALCOIH(O, u)-fragment is as expected. Intuitively, µX.C[X] denotes the least general concept Cµ w.r.t. a concept expression for Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 3 which Cµ ≡ C[Cµ ] holds, where C[Cµ ] is a concept expression obtained from replacing every occurrence of X in C[X] by Cµ . A formal definition of the se- mantics of fixpoint expressions can be found in [2]. We assume without loss of generality that a TBox T is a set of concept axioms of the form C v D, where C and D are closed concepts, i.e., contain no concept variable not in the scope of a µ. An RBox R is a set of role axioms of the form R v S, where R and S is a role symbol or an inverted role symbol. Nominals in a description logic make ABox assertions superfluous, since these can be captured by TBox inclusions via nominals. An ALCOIHµ+ (O, u)-ontology is therefore assumed to be the union of the TBox and the RBox in this paper. Theorem 1. Reasoning in ALCOIHµ+ (O, u) is decidable. Theorem 1 follows from the decidability results of guarded fixpoint logic [8] and the description logic ALBOid [23], which both subsume the languages of ALCOIH(O, u) and ALCOIHµ+ (O, u). In the sequel we describe the normal form on which our forgetting method works. A TBox clause is a disjunction of ALCOIHµ+ (O, u)-concepts, in which role expressions can be a role symbol, an inverted role symbol, or a conjunction of these. A role literal is either a role symbol or an inverted role symbol, or their negations. An RBox clause is a disjunction of two role literals of complementary polarity. RBox clauses are transformed from the clausification of role axioms, where role negation is introduced. A clause that contains occurrences of a designated concept symbol and role symbol S is called an S-clause. Given an S-clause C, an occurrence of S is posi- tive in C if it is under an even number of negations. Otherwise it is negative. A role symbol that occurs immediately under a (negated) universal role restriction is assumed to be negative (positive). C is positive (negative) w.r.t. S if every oc- currence of S in C is positive (negative). A set N of clauses is positive (negative) w.r.t. S if every occurrence of S in N is positive (negative). By sig(E) we denote the concept and role symbols occurring in E, where E ranges over concepts, clauses, and ontologies. Σ is assumed to be the concept and role symbols to be forgotten in this paper. Let S be any concept or role symbol and let I and I 0 be interpretations. We say I and I 0 are equivalent up to S, or S- equivalent, if I and I 0 coincide but differ possibly in the interpretations of S. More generally, I and I 0 are equivalent up to Σ, or Σ-equivalent, if I and I 0 are the same but differ possibly in the interpretations of the symbols in Σ. Definition 1 (Forgetting in ALCOIHµ+ (O, u)-Ontologies). Let O and O0 be ALCOIHµ+ (O, u)-ontologies and let the signature be Σ ⊆ sig(O). O0 is the solution of forgetting Σ-symbols in O, if the following conditions hold: (i) sig(O0 ) ⊆ sig(O) and sig(O0 ) ∩ Σ = ∅, and (ii) for any interpretation I: I |= O0 iff I 0 |= O, for some interpretation I 0 Σ-equivalent to I. This states that the given ontology O and the forgetting solution O0 are equivalent up to the interpretations of the symbols in Σ. It follows that if both O0 and O00 are solutions of forgetting symbols in Σ from O, then they are equivalent. 4 Yizheng Zhao and Renate A. Schmidt 3 Overview of the Forgetting Method The forgetting process in our method consists mainly of three phases: the reduc- tion to a set of ALCOIHµ+ (O, u)-clauses, the forgetting phase and the definer elimination phase (see Figure 1). Process to Transform to Ontology O set of clauses N r-reduced form R Ackermann Σ to forget r Σ = {S1 , ..., S2 } into Analyser Transform to Σ A-reduced form C Forgetting Elimination of Ackermann 0 Solution O Definer Symbols to forget A Fig. 1. Overview of the forgetting method Initially given, as the input to the method, are an ontology O of TBox and RBox axioms expressible in ALCOIHµ+ (O, u), and a set Σ that contains the concept and role symbols to be forgotten. The Σ-symbols are entirely determined by the users and their application demands; Σ can thus be any subset of the signature of the initial ontology as the user wishes. Once received by the system, O is transformed into a set N of clauses, i.e., the normal form of our method. The forgetting phase is an iteration of several rounds in which individual concept and role symbols are eliminated. An important feature of the method is that concept symbols and role symbols are forgotten in a focused way, that is, the rules for concept forgetting and the rules for role forgetting are mutually independent; concept and role symbols can thus be forgotten in any desired order. In the forgetting phase, in order for the forgetting rules to be applied to eliminate S ∈ Σ, the S-clauses must be transformed into S-reduced form. Thus, whether the Σ-symbols are eliminable depends entirely on whether the current set of clauses can be transformed into reduced form. This reduction is performed using two Ackermann-based calculi working independently, which also lead to goal-oriented elimination of the concept and role symbols. The calculi are described in the next sections. Equivalence-preserving simplification rules are applied throughout the forgetting process, ensuring that the current clauses are always in simpler representations for efficiency. What the method returns, if the forgetting is successful, is an ontology O0 that does not contain the symbols in Σ, i.e., the returned ontology is formulated using only symbols in sig(O)\Σ. Previous research has shown that the success rates of forgetting depend very much on the order in which the Σ-symbols are forgotten [3, 5, 6, 14, 22], even Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 5 when forgetting only concept symbols, e.g., [29, 18]. An important challenge therefore is to find good orders of eliminating Σ-symbols. Our method either follows a user-specified order, or it uses a heuristic analysis based on the fre- quency counts of the Σ-symbols, where concept symbols and role symbols are analysed separately. We refer to the maximal symbol of Σ w.r.t. the forgetting order as the pivot in our method. Following (and starting with the pivot), the method forgets the Σ-symbols one by one. If the pivot is successfully eliminated from N , we attempt to forget the next symbol in , which has become the new pivot. Otherwise the pivot remains in Σ, flagged as a currently non-forgettable symbol, and the next symbol in becomes the pivot. When the end of the ordering has been reached, the calculus is applied to the flagged symbols, attempting again to eliminate them. Success will always be pursued until a forgetting solution is found. Though the ordering provides useful guidance during the forgetting process, it does not generally guarantee the success of forgetting. On the symbols not eliminated, a different ordering, not attempted before, will be used. When all possible orderings have been attempted and there are still Σ-symbols remaining, this means that the method fails to find a forgetting solution. Theorem 2. For any ALCOIHµ+ (O, u)-ontology O and any Σ ⊆ sig(O), the method always terminates and returns a set O0 of clauses. If O0 does not contain any Σ-symbols, the method was successful. O0 is then a solution of forgetting the symbols in Σ from O. If neither O nor O0 uses µ, O0 is Σ-equivalent to O in ALCOIH(O, u). Otherwise, it is Σ-equivalent to O in ALCOIHµ+ (O, u). 4 Calculus for Concept Forgetting We first present the calculus for forgetting of concept symbols in ontologies ex- pressible in ALCOIHµ+ (O, u). The calculus extends the calculus of [29] for concept forgetting in ALCOI. Since concept symbols occur only in TBox ax- ioms, only the TBox needs to be processed for concept forgetting. Non-Cyclic AckermannC Cyclic AckermannC N , C1 t A, . . . , Cn t A N , C1 [A] t A, . . . , Cn [A] t A A A N¬C 1 t...t¬Cn NµX.(¬C 1 t...t¬Cn )[X] provided: (i) A does not occur in the provided: (i) the Ci are negative w.r.t. Ci , and (ii) N is negative w.r.t. A. A, and (ii) N is negative w.r.t. A. PurifyC,p PurifyC,n N N N is positive w.r.t. A. N is negative w.r.t. A. N>A A N¬> Fig. 2. Forgetting concept symbol A in a set N of ALCOIHµ+ (O, u)-clauses Definition 2 (A-Reduced Form). Suppose A is a concept symbol. A clause is in A-reduced form if it is negative w.r.t. A, or it has the form A t C, where C is an ALCOIHµ+ (O, u)-concept that (Case I) does not contain any occurrence of A, or (Case II) is negative w.r.t. A. A set N of clauses is in A-reduced form if every A-clause in N is in A-reduced form. 6 Yizheng Zhao and Renate A. Schmidt The Ackermann and Purify rules, given in Figure 2, are the forgetting rules that lead to the elimination of concept symbols in ALCOIHµ+ (O, u)-clauses. Suppose A ∈ NC is the pivot and C is a concept expression, then NCA denotes the set obtained from N by replacing every occurrence of A by C. We refer to the clauses of the form Ci t A (1 ≤ i ≤ n) as positive premises of the rule. The Ackermann rule is applicable (to forget A in N ) only if N is in A-reduced form. Theorem 3 (Ackermann Lemma for Concept Forgetting). Let I be an arbitrary ALCOIHµ+ (O, u)-interpretation. For A the pivot, when the Non- Cyclic AckermannC rule is applicable, the conclusion of the rule is true in I iff for some interpretation I 0 A-equivalent to I, the premises are true in I 0 . The same is true for the Cyclic AckermannC rule and the PurifyC,p(n) rules. This states that eliminating the pivot symbol in NC with the AckermannC and PurifyC rules preserves equivalence up to the pivot. Given the pivot A and a set N of clauses in A-reduced form, the Non-Cyclic Ackermann rule is applied when A does not occur in the Ci (1 ≤ i ≤ n) of the positive premises. The Cyclic Ackermann rule is applied when A occurs in the Ci but only negatively, e.g., A occurs in a cyclic clause ¬∀r.A t A. Fixpoints are introduced in this case in order to facilitate finite representation of the result, where every occurrence of A in ¬C1 t . . . t ¬Cn is replaced by X, a fresh concept variable, and every occurrence of A in N is replaced by µX.(¬C1 t . . . t ¬Cn )[X]. The Purify rule is applied whenever A is pure in N , i.e., N is positive (or negative) w.r.t. A. Concept Clausify Case Splitting N , C t ¬(D1 t . . . t Dn ) N , ¬a t C1 t . . . t Cn N , C t ¬D1 , . . . , C t ¬Dn N , ¬a t C1 | . . . | ¬a t Cn provided: A has positive occurrences provided: A occurs positively in C1 t in ¬(D1 t . . . t Dn ). . . . t Cn . SkolemizationO SkolemizationR N , C t ¬∀O.D N , ¬a t ¬∀R.C N , ¬b t ¬D t ∀O.C N , ¬a t ¬∀R.¬b, ¬b t ¬C provided: (i) b is a fresh nominal, and provided: (i) b is a fresh nominal, and (ii) A occurs positively in ¬∀O.D. (ii) A occurs positively in ¬∀R.C. Concept Surfacing Sign Switching N , C t ∀R.D N N , (∀R− .C) t D (N¬A A ¬¬A )A provided: (i) A does not occur pos- provided: (i) Sign Switching has not itively in C, and (ii) A occurs posi- been performed on A, and (ii) N is tively in ∀R.D. closed to other rewrite rules. Fig. 3. The rewrite rules for finding A-reduced form of N Clauses are usually not given in reduced form initially. Figure 3 lists the set of rewrite rules for finding the A-reduced form of a set of clauses for A the pivot. The Case Splitting rule splits the derivation into several branches. The intermediate forgetting solution returned at the end of a symbol elimination round is the disjunction of the solutions of each branch in the derivation. New compared to [29], besides the Cyclic Ackermann rule, is the SkolemizationO rule Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 7 that rewrites the existential quantification in the premise by the introduction of a fresh nominal. Crucial for the practicality of the method are a number of equivalence-preserving simplification rules, not described here though. 5 Calculus for Role Forgetting The main contribution of this paper is a calculus for forgetting of role symbols in ontologies expressible in ALCOIHµ+ (O, u). Since role symbols occur in the TBox and the RBox, both need to be processed when role symbols are forgotten. Definition 3 (r-Reduced Form). Suppose r is a role symbol. A clause is in r-reduced form if it has the form C t ∀r.D or C t ¬∀(r u Q).D, where C and D are ALCOIHµ+ (O, u)-concepts that do not contain any occurrence of r and Q is an ALCOIHµ+ (O, u) role expression that does not contain any occurrence of r; or it has the form ¬S t r or S t ¬r, where S ∈ {s, s− } for s (6= r) a role symbol. A set N of clauses is in r-reduced form if every r-clause in N is in r-reduced form. As in concept forgetting, the pivot-clauses are first transformed into pivot- reduced form, so that the Ackermann rule for role forgetting becomes applicable. Finding pivot-reduced form of a clause is not always possible unless definer sym- bols are introduced. Definer symbols are specialised concept symbols that do not occur in the present ontology [13], and are introduced as follows: given a clause of the form C t ∀r(−) .D or C t ¬∀r(−) .D, with r being the pivot and occur- ring in Q ∈ {C, D}, the definer symbols are used as substitutes, incrementally replacing C and D until C and D do not contain any occurrences of r. A new clause ¬D t Q is added to the clause set for each replaced sub-concept Q, where D is a fresh definer symbol. For example, introducing a definer symbol D1 leads to A t ∀r.¬∀r.B being rewritten with A t ∀r.D1 and ¬D1 t ¬∀r.B, where A and B are concept symbols. The definer symbols are eliminated using the rules for concept forgetting once all role symbols in Σ have been forgotten. It is for this reason that our system defaults to forgetting role symbols first so that the definer symbols can be eliminated as part of subsequent concept forgetting. Role Surfacing to TBox Role Surfacing to RBox N , C t ∀r− .D N , ¬S t r− N , S t ¬r− − N , ¬S t r N , S − t ¬r N , D t ∀r.C provided: r is a role symbol that does provided: S is either a role symbol or not occur in C and D. an inverted role symbol. Fig. 4. The rewrite rules for finding r-reduced form of N Since the underlying language accommodates role inverse, the calculus in- cludes two Role Surfacing rules, shown in Figure 4, in order to reformulate expressions without occurrences of inverses of the pivot in the respective TBox and RBox clauses, so that the other rules in the calculus do not need to cater for role inverse. The Role Surfacing rules are applied after definer symbols have been introduced, and before the application of the forgetting rules. 8 Yizheng Zhao and Renate A. Schmidt AckermannR + + PT PR z }| { z }| { N , C 1 t ¬∀r u Q1 .D1 , . . . , C k t ¬∀r u Qk .Dk , ¬T 1 t r, . . . , ¬T u t r , P−T P− R z }| { z }| { C1 t ∀r.D1 , . . . , Cm t ∀r.Dm , ¬r t S1 , . . . , ¬r t Sn N , T -BlockH (1, m), . . . , T -BlockH (k, m) , R-BlockC (1, m), . . . , R-BlockC (u, m) , R-BlockR (1, n), . . . , R-BlockR (u, n) ? T -BlockH (j, m) is the set (1 ≤ j ≤ k): {C j t C Y t ¬∀Hj .(Dj t DY )|Y ⊆ [m]}, where ¬> if Y = ∅ ¬> if Y = ∅ CY = F Ci otherwise, D Y = F ¬Di otherwise, and i∈Y i∈Y − − Hj = S1 u . . . u Sn u Qj if PR 6= ∅, otherwise (PR = ∅) Hj = O u Qj . ? R-BlockC (l, m) is the set {C1 t ∀T l .D1 , . . . , Cm t ∀T l .Dm } (1 ≤ l ≤ u). ? R-BlockR (l, n) is the set {¬T l t S1 , . . . , ¬T l t Sn } (1 ≤ l ≤ u). provided: r is the pivot that does not occur in N , and N is in r-reduced form. PurifyR N r provided: N is positive (negative) w.r.t. r, where r is the pivot. N(¬)O Fig. 5. Forgetting role symbol r in a set N of ALCOIHµ+ (O, u)-clauses Theorem 4 (Ackermann Lemma for Role Forgetting). Let I be an arbi- trary ALCOIHµ+ (O, u)-interpretation. For r the pivot, when the AckermannR or PurifyR rule in Figure 5 is applicable then the conclusion of the rule is true in I iff for some interpretation I 0 r-equivalent to I, the premises are true in I 0 . Given a set N of clauses with r ∈ NR being the pivot, once N has been transformed into r-reduced form, we apply the AckermannR rule and the PurifyR rule given in Figure 5 to eliminate r. By definition, clauses in r-reduced form have four distinct forms. We refer to the clauses of the form C j t ¬∀r u Qj .Dj (1 ≤ j ≤ k) as positive TBox premises (denoted by PT+ ) and clauses of the form Ci t ∀r.Di (1 ≤ i ≤ m) as negative TBox premises (denoted by PT− ) of the rule. We refer to the clauses of the form ¬T l t r (1 ≤ l ≤ n) as positive RBox premises + (denoted by PR ) and clauses of the form ¬r t Si (1 ≤ i ≤ u) as negative RBox − premises (denoted by PR ) of the rule. Since it is possible for r to occur in any of these premises (and the premises do not necessarily wholly exist), there are several situations where the AckermannR rule is applicable. For different PT− − and PR the AckermannR rule is performed as follows. Case (I): If PT− 6= ∅ and PR − 6= ∅, then PT− and PR − (the negative premises) + + + are combined with every clause in PT (if PT 6= ∅) and every clause in PR + (if PR 6= ∅), which leads to the elimination of r, and a forgetting solution O0 where r 6∈ sig(O0 ). Specifically, combining PT− and PR − with one of the positive Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 9 TBox premises (PT+ ) leads to a set of TBox clauses (denoted by T -Block H (j, m)), − where Hj is the conjunction of Qj and the Si (1 ≤ i ≤ n) occurring in PR (Hj := Q u S1 u . . . u Sn ), m denotes the number of negative TBox premises (|PT− |), j and j refers to the positive TBox premise with which PT− and PR − combine. [m] j Y j j Y denotes the set {1, . . . , m}, and {C t C t ¬∀H .(D t D )|Y ⊆ [m]} is the set that contains all clauses corresponding to every assignment of Y . The following example illustrates this case. Given Σ = {r} and a set N of clauses in r-reduced form with PT− = {A1 t ∀r.B1 , A2 t ∀r.B2 }, PR − = {¬r t − + − − s1 , ¬r t s2 }, and A t ¬∀r.B ∈ PT , combining PT and PR with A t ¬∀r.B leads to T -Block H (j, m) = {A t C Y t ¬∀(s1 u s− Y 2 ).(B t D )|Y ⊆ [2]} that consists of the following clauses: ¬> t¬∀(s1 u s− 1. A t |{z} 2 ).(B t |{z} ¬> ) when Y = ∅ CY DY 2. A t A1 t¬∀(s1 u s− 2 ).(B t ¬B ) when Y = {1} |{z} |{z}1 CY DY 3. A t A2 t¬∀(s1 u s− 2 ).(B t ¬B ) when Y = {2} |{z} |{z}2 CY DY 4. A t A1 t A2 t¬∀(s1 u s− 2 ).(B t ¬B t ¬B2 ) when Y = {1, 2} | {z } | 1 {z } CY DY Combining PT− with one of the positive RBox premises leads to a set of TBox clauses (denoted by R-Block C (l, m)), where m are the negative TBox premises and l refers to the positive RBox premise with which PT− combines; − combining PR with one of the positive RBox premises leads to a set of RBox clauses (denoted by R-Block R (l, n)), where n are the negative RBox premises − and l refers to the positive RBox premise with which PR combines. Case (II): If PT 6= ∅ and PR = ∅, then combining PT− with one of the − − positive TBox premises leads to the same result as in Case (I) (T -Block H (j, m)), only that a top role (O) replaces S1 u . . . u Sn , i.e., H = O. Combining PT− with one of the positive RBox premises leads to the same result as in Case (I), i.e., R-Block C (l, m) (1 ≤ l ≤ u). Case (III): If PT− = ∅ and PR − 6= ∅, then combining − PR with one of the positive TBox premises and with one of the positive RBox premises leads to the same results as in Case (I), i.e., T -Block H (j, m) (1 ≤ j ≤ k) and R-Block R (l, n) (1 ≤ l ≤ u) respectively. Case (IV): The case PT− = ∅ and − PR = ∅ can be seen as an instance of the case where r is pure. The pivot is eliminated using the PurifyR rule in this case. The pivot is forgotten in the ontology once every positive premise (in PT+ and PR + ) has been combined with PT− (if PT− 6= ∅) and PR − − (if PR 6= ∅). Given a set of clauses in pivot-reduced form with m negative TBox premises (|PT− | = m), − n negative RBox premises (|PR | = n), k positive TBox premises (|PT+ | = k), and u positive RBox premises (|PR | = u), combining PT− and PR + − with all positive TBox premises yields a set of k · 2m clauses (exponential growth); combining PT− and PR − with all positive RBox premises yields a set of um + un clauses (polynomial growth). The size of the forgetting solution therefore depends largely on the number of the negative TBox premises (m). 10 Yizheng Zhao and Renate A. Schmidt 6 Evaluation and Empirical Results We implemented a prototype of the forgetting method in Java using the OWL- API1 , and conducted a number of experiments on real-world ontologies that are taken from the NCBO BioPortal2 and Oxford Ontology3 repositories to evaluate the practicality of the method. The experiments were run on a desktop with an Intelr Coretm i7-4790 processor, and four cores running at up to 3.60 GHz and 8 GB of DDR3-1600 MHz RAM. The ontologies used for our evaluation were restricted to ALCOIH(O, u)-fragments, and any sub-concepts beyond the scope of ALCOIH(O, u) were replaced by >. Consequently, 180 and 200 ontologies of various sizes were selected from the NCBO BioPortal and Oxford Ontology Library, respectively. We repeated the experiments 100 times on each ontology and averaged the results to verify the accuracy of our findings. To fit in with real-world applications such as computing logical difference between ontologies and predicate hiding, where forgetting a small number of symbols is in demand, we set up a series of experiments where we forgot 10% and 30% of concept and role symbols that were randomly selected in each ontology. There are also situations where it would be of interest to forget a large number of symbols; ontology reuse is such an example [12–15]. We therefore set up a series of experiments with a large number of concept and role symbols to be forgotten (80% of the concept symbols and 50% of the role symbols in the initial signature). The heuristic for determining the order of eliminating concept and roles symbols (C and R ) was also tested. The Σ-symbols were eliminated in the order as returned by the OWL-API function that gets all concept symbols in the ontology, when the heuristic was not applied. We started with the evaluation of concept symbol forgetting, where a timeout of 15 minutes was imposed. Input Setting Results Setting Results Ontology |Σ| (%) C Time (sec.) Timeout Success R. Fixp. |Σ| (%) R Definer in Time (sec.) Success R. Clause ↑ 7 4.890 0.0% 100.0% 0.0% 7 1.1/onto. 2.120 sec. 100.0% 4.1% 20 (10%) 4 (10%) 3 3.260 0.0% 100.0% 0.0% 3 10 onto. 2.101 sec. 100.0% 4.1% 7 18.672 4.4% 94.4% 7.2% 7 1.9/onto. 8.658 sec. 100.0% 14.5% BioPortal 60 (30%) 12 (30%) 3 9.336 1.1% 98.3% 7.8% 3 13 onto. 8.314 sec. 100.0% 14.5% (354 Axioms 7 70.416 13.8% 83.3% 13.3% 7 3.0/onto. 20.913 sec. 100.0% 26.5% on Avg.) 160 (80%) 20 (50%) 3 29.340 5.6% 91.7% 17.2% 3 16 onto. 20.566 sec. 100.0% 26.5% 7 31.326 6.3% 92.6% 6.8% 7 2.0/onto. 10.564 sec. 100.0% 15.0% 80 Avg. Avg. 3 13.979 2.4% 96.7% 8.3% 3 13 onto. 10.327 sec. 100.0% 15.0% 7 44.392 3.0% 97.0% 0.5% 7 2.4/onto. 3.187 sec. 100.0% 2.2% 36 (10%) 5 (10%) 3 27.745 1.5% 98.5% 0.5% 3 25 onto. 3.072 sec. 100.0% 2.2% 7 193.106 17.0% 79.5% 11.5% 7 3.7/onto. 18.537 sec. 100.0% 6.9% Oxford 108 (30%) 15 (30%) 3 80.461 9.5% 88.5% 14.5% 3 28 onto. 17.998 sec. 100.0% 6.9% (875 Axioms 7 412.852 34.5% 61.5% 19.0% 7 5.7/onto. 36.292 sec. 100.0% 14.6% on Avg.) 288 (80%) 25 (50%) 3 166.270 17.5% 78.5% 26.5% 3 39 onto. 34.117 sec. 100.0% 14.6% 7 216.783 18.2% 79.3% 10.3% 7 3.9/onto. 19.339 sec. 100.0% 7.9% 144 Avg. Avg. 3 91.492 9.5% 88.5% 13.8% 3 30 onto. 18.396 sec. 100.0% 7.9% Fig. 6. Results for concept forgetting Fig. 7. Results for role forgetting The results (of forgetting only concept symbols) obtained from forgetting 10%, 30% and 80% of the concept symbols from the respective BioPortal and Oxford ontologies are shown in Figure 6, which is quite revealing in several ways. The most notable observation to emerge from the results is that, with the 1 http://owlapi.sourceforge.net/ 2 http://bioportal.bioontology.org/ 3 http://www.cs.ox.ac.uk/isg/ontologies/ Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 11 heuristically determined elimination order (indicated by 3), our implementation was successful (forgot all Σ-symbols) in 96.7% of the BioPortal-cases within a limited period of time, with 8.3% of them using fixpoints in the result. In the case of 10% of the concept symbols specified to be forgotten, the success rate rose to 100%. Even without the heuristic (7), the forgetting solution could be found by our implementation in 92.6% of the cases. Since the Oxford ontologies were more than twice as large as the BioPortal ontologies and a larger set of concept symbols was specified to be forgotten, a reduction in the performance was expected. The implementation was unable to compute the solution in 11.5% of the Oxford-cases, and in 13.8% of the solved cases fixpoints occurred in the result. The use of the heuristic boosted the overall success rate by 4% and 9.2%, and improved the time efficiency by 124.1% and 136.9% in the BioPortal and Oxford ontologies, respectively. We also evaluated the performance of forgetting different number of role symbols with the same ontologies used for the evaluation of concept forgetting (using a timeout of 5 minutes). The results (of forgetting only role symbols) are shown in Figure 7, from which it can be seen that our method successfully eliminated the role symbols in Σ in all cases. The time used for forgetting role symbols, as expected, was significantly longer than forgetting the same number of concept symbols, despite the 100% success rate. Because of the nature of the AckermannR rule, role symbol forgetting leads to growth of clauses in the forget- ting solution, which was however modest (Clause ↑) compared to the theoretical worst case. Definer symbols were introduced only in a small proportion of the ontologies to help conversion to reduced form. This indicates that most clauses in the ontologies were flat. By m/onto, we mean that there were m definer sym- bols introduced in each ontology, and by n onto we mean that n ontologies out of the total introduced the definer symbol. 7 Conclusion and Future Work In this paper we have developed a method of concept and role forgetting for expressive description logics with nominals, non-empty TBoxes and ABoxes, and a rich language for expressing properties of roles. The method can handle role inclusion statements, role conjunctions and role inverses. This is extremely useful from the perspective of ontology engineering as it increases the arsenal of tools available to create restricted views of ontologies. The results of the evaluation on real-world ontologies has shown that often fixpoints and role conjunction are not needed to express forgetting solutions, and overall the performance results are very positive. Currently beyond the scope of our method are forgetting transitive roles. We can extend the method to eliminate concept symbols in the presence of transitive roles, but the interaction between transitive roles and role hierarchy inclusion statements can lead to results where it is not clear how to represent them finitely. The problem of extending the method to handle number restrictions remains completely open; possibly the techniques of [14, 15] can help extend the method. 12 Yizheng Zhao and Renate A. Schmidt References 1. W. Ackermann. Untersuchungen über das Eliminationsproblem der mathematis- chen Logik. Mathematische Annalen, 110(1):390–413, 1935. 2. D. Calvanese, G. De Giacomo, and M. Lenzerini. Reasoning in expressive descrip- tion logics with fixpoints based on automata on infinite trees. In Proc. IJCAI’99, pages 84–89. Morgan Kaufmann, 1999. 3. W. Conradie, V. Goranko, and D. Vakarelov. Algorithmic correspondence and completeness in modal logic. I. The core algorithm SQEMA. Logical Methods in Computer Science, 2(1), 2006. 4. G. D’Agostino and M. Hollenberg. Logical questions concerning the µ-Calculus: Interpolation, Lyndon and Los-Tarski. J. Symb. Log., 65(1):310–332, 2000. 5. P. Doherty, W. Lukaszewicz, and A. Szalas. Computing circumscription revisited: A reduction algorithm. Journal of Automated Reasoning, 18(3):297–336, 1997. 6. D. M. Gabbay and H. J. Ohlbach. Quantifier elimination in second–order predicate logic. In Proc. KR’92, pages 425–435. Morgan Kaufmann, 1992. 7. D. M. Gabbay, R. A. Schmidt, and A. Szalas. Second Order Quantifier Elimination: Foundations, Computational Aspects and Applications. College Publications, 2008. 8. E. Grädel and I. Walukiewicz. Guarded fixed point logic. In Proc. LICS’99, pages 45–54. IEEE Computer Society, 1999. 9. A. Herzig and J. Mengin. Uniform interpolation by resolution in modal logic. In Proc. JELIA’08, volume 5293 of Lecture Notes in Computer Science, pages 219– 231. Springer, 2008. 10. B. Konev, C. Lutz, D. Walther, and F. Wolter. Model-theoretic inseparability and modularity of description logic ontologies. Artificial Intelligence, 203:66–103, 2013. 11. B. Konev, D. Walther, and F. Wolter. Forgetting and uniform interpolation in extensions of the description logic EL. In Proc. DL’09, volume 477 of CEUR Workshop Proceedings. CEUR-WS.org, 2009. 12. P. Koopmann and R. A. Schmidt. Forgetting concept and role symbols in ALCH- ontologies. In Proc. LPAR’13, volume 8312 of Lecture Notes in Computer Science, pages 552–567. Springer, 2013. 13. P. Koopmann and R. A. Schmidt. Uniform interpolation of ALC-ontologies using fixpoints. In Proc. FroCos’13, volume 8152 of Lecture Notes in Computer Science, pages 87–102. Springer, 2013. 14. P. Koopmann and R. A. Schmidt. Count and forget: Uniform interpolation of SHQ-ontologies. In Proc. IJCAR’14, volume 8562 of Lecture Notes in Computer Science, pages 434–448. Springer, 2014. 15. P. Koopmann and R. A. Schmidt. Saturated-based forgetting in the description logic SIF. In Proc. DL’15, volume 1350 of CEUR Workshop Proceedings. CEUR- WS.org, 2015. 16. P. Koopmann and R. A. Schmidt. Uniform interpolation and forgetting for ALC- ontologies with ABoxes. In Proc. AAAI’15, pages 175–181. AAAI Press, 2015. 17. F. Lin and R. Reiter. Forget it! In Proc. AAAI Fall Symposium on Relevance, pages 154–159, 1994. 18. M. Ludwig and B. Konev. Practical uniform interpolation and forgetting for ALC TBoxes with applications to logical difference. In Proc. KR’14. AAAI Press, 2014. 19. C. Lutz, I. Seylan, and F. Wolter. An automata-theoretic approach to uniform interpolation and approximation in the description logic EL. In Proc. KR’12, pages 286–297. AAAI Press, 2012. Forgetting Concept and Role Symbols in ALCOIHµ+ (O, u)-Ontologies 13 20. C. Lutz and F. Wolter. Foundations for uniform interpolation and forgetting in expressive description logics. In Proc. IJCAI’11, pages 989–995. IJCAI/AAAI, 2011. 21. N. Nikitina and S. Rudolph. (Non-)Succinctness of Uniform Interpolants of General Terminologies in the Description Logic EL. Artificial Intelligence, 215:120–140, 2014. 22. R. A. Schmidt. The Ackermann approach for modal logic, correspondence theory and second-order reduction. Journal of Applied Logic, 10(1):52–74, 2012. 23. R. A. Schmidt and D. Tishkovsky. Using tableau to decide description logics with full role negation and identity. ACM Trans. Comput. Log., 15(1):7:1–7:31, 2014. 24. A. Visser. Bisimulations, Model Descriptions and Propositional Quantifiers. Logic Group Preprint Series. Department of Philosophy, Utrecht Univ., 1996. 25. K. Wang, Z. Wang, R. Topor, J. Z. Pan, and G. Antoniou. Concept and role forgetting in ALC ontologies. In Proc. ISWC’09, volume 5823 of Lecture Notes in Computer Science, pages 666–681. Springer, 2009. 26. K. Wang, Z. Wang, R. Topor, J. Z. Pan, and G. Antoniou. Eliminating concepts and roles from ontologies in expressive description logics. Computational Intelligence, 30(2):205–232, 2014. 27. Z. Wang, K. Wang, R. Topor, and J. Z. Pan. Forgetting for knowledge bases in DL-Lite. Annals of Mathematics and Artificial Intelligence, 58(1-2):117–151, 2010. 28. Z. Wang, K. Wang, R. W. Topor, and J. Z. Pan. Forgetting concepts in DL-Lite. In Proc. ESWC’08, volume 5021 of Lecture Notes in Computer Science, pages 245–257. Springer, 2008. 29. Y. Zhao and R. A. Schmidt. Concept Forgetting in ALCOI-Ontologies Using an Ackermann Approach. In Proc. ISWC’15, volume 9366 of Lecture Notes in Computer Science, pages 587–602. Springer, 2015.