=Paper=
{{Paper
|id=Vol-3263/paper-17
|storemode=property
|title=Fine-Grained Forgetting for the Description Logic ALC
|pdfUrl=https://ceur-ws.org/Vol-3263/paper-17.pdf
|volume=Vol-3263
|authors=Mostafa Sakr,Renate A. Schmidt
|dblpUrl=https://dblp.org/rec/conf/dlog/SakrS22
}}
==Fine-Grained Forgetting for the Description Logic ALC==
Fine-Grained Forgetting for the Description Logic ALC Mostafa Sakr, Renate A. Schmidt The University of Manchester, Manchester, UK Abstract Forgetting is an important ontology extraction technique. A variant of forgetting which has received significant attention in the literature is deductive forgetting (or uniform interpolation). While deductive forgetting is attractive as it generates the forgetting view in a language with the same complexity as the language of the original ontology, it is known that with a slightly extended target language using definer symbols more information can be preserved. In this paper, we study deductive forgetting of concept names with the aim of understanding the unpreserved information. We present a system that performs deductive forgetting and produces a set Ξ of axioms representing the unpreserved information in the forgetting view. Our system allows a new fine-grained ontology extraction process that gives the user the option to enhance the informativeness of the deductive forgetting view by appending to it axioms from Ξ. 1. Introduction Forgetting is an important ontology extraction technique. It eliminates from an ontology a given subset of its vocabulary. The result is a focused ontology, called forgetting view, which preserves the content of the ontology relative to the non-forgotten vocabulary. Forgetting offers solutions for many applications such as: computing logical difference [1], information hiding [2], abduction [3], resolving conflicts [4], relevance [5, 6, 7], and forgetting actions in planning [8]. A variant of forgetting that has been studied in the literature is deductive forgetting (or uniform interpolation). Given an πβπ ontology and a forgetting signature, deductive forgetting produces a view of the ontology which preserves only the information expressible in πβπ over the non-forgetting signature [9, 10, 11]. Deductive forgetting is however not precise, because information expressible with more expressivity may not be preserved. Yet, deductive forgetting remains an appealing variant of forgetting because when performed on πβπ ontologies, the generated forgetting views are (infinitely) representable in πβπ, or finitely representable if fixpoint operators are allowed [12, 13]. When a deductive view is computed, the following questions arise: Does the deductive view preserve all information of the non-forgotten vocabulary? If not, what information is not preserved and what does this information represent? Can some of this information be partially preserved, i.e., can a view more informative than the deductive forgetting view be computed? In this paper, we DL 2022: 35th International Workshop on Description Logics, August 7β10, 2022, Haifa, Israel $ Mostafa.Sakr@manchester.ac.uk (M. Sakr); Renate.Schmidt@manchester.ac.uk (R. A. Schmidt) 0000-0003-4296-5831 (M. Sakr); 0000-0002-6673-3333 (R. A. Schmidt) Β© 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) aim to address these questions, gain a better understanding of the information not preserved by deductive forgetting, and provide a practical tool to compute this information. We focus on concept forgetting for ontologies in the description logic πβπ, the basic logic in the family of expressive description logics [14]. The main contribution is a novel forgetting method. (1) The method converts the input ontology into an intermediate ontology in which the forgetting signature has been eliminated. This intermediate ontology is semantically equivalent to the input ontology with respect to the non-forgotten vocabulary. That is, the reducts of the models of both ontologies to the non- forgotten vocabulary coincide. The intermediate ontology may use foreign concept symbols, or definers, to represent subsets of role successors. (2) The method obtains from the intermediate ontology two sets πͺπππ and Ξ of axioms. The set πͺπππ approximates the deductive view by allowing the use of foreign symbols. We present a method to eliminate these foreign symbols from πͺπππ and obtain the final deductive view in πβπ. Complete elimination of the foreign symbols may not however succeed when the deductive view does not exist finitely in πβπ due to cycles occurring over forgetting symbols. The set Ξ represents the information difference between the intermediate ontology and πͺπππ . If all foreign symbols are successfully eliminated from πͺπππ , then Ξ also represents the information difference between the input ontology and the deductive view. Several benefits are obtained from our forgetting method. (1) In two different evaluations an implementation of our forgetting method was compared against the state-of-the-art deductive forgetting tool Lethe [15]. Our implementation was found to be faster than Lethe. Our analysis shows that this improvement can be attributed to a novelty of the language of the intermediate ontology as it allows for avoiding time-consuming operations performed by Lethe. (2) By inspecting the set Ξ, we now understand the difference between the input ontology and the deductive view as information on the conjunctions of different subsets of role successors. (3) An empty Ξ indicates that the deductive view coincides semantically with the input ontology with respect to the non-forgotten vocabulary. (4) By incrementing πͺπππ with axioms from Ξ, our method allows for a fine-grained forgetting framework where views of the original ontology that are more informative than the deductive view can be obtained based on user requirements. All proofs are provided in the long version https://github.com/e73898ms/ FineGrainedForgetting/blob/main/Fine_Grained_Forgetting-Long%20Version-DL22.pdf. 2. History and Related Work Forgetting can be traced back to Boole who referred to it as elimination of the middle terms. In propositional logic, it was studied in relation to relevance, independence, and variable elimina- tion [6, 7]. A variant of forgetting which preserves semantic equivalences is semantic forgetting. In the context of first-order logic (FOL), semantic forgetting was viewed as a second-order quantifier elimination problem [16, 17] finding that the semantic view of a FO theory is not in general expressible in FOL but is always expressible in second-order logic (SOL). A way to view the foreign symbols in the intermediate ontology created by our method is as second-order existentially quantified concept symbols. This is because they are used to represent subsets of role successors whose first-order definition, as we show, cannot be computed in general. Therefore, our intermediate ontology adheres to results in the literature [9, 16], and can be viewed as an approximation to the semantic view of the input ontology. We show that standard reasoning operations can be performed on the intermediate ontology using standard πβπ reasoning methods. Deductive forgetting was considered in [18] under the name weak forgetting. The proposal in [18] builds on previous work in modal logics which views deductive forgetting as uniform interpolation [19, 20], i.e., forgetting a signature β± from an ontology πͺ is equivalent to computing the uniform interpolant over the remaining vocabulary of πͺ after excluding β±. This allows characterizing the relationship between the original ontology and the deductive view in terms of bisimulation over the non-forgotten symbols [10, 9, 21]. Deductive and semantic forgetting are also closely related to the notions of concept inseparability and model inseparability [22, 23, 24, 25]. Several deductive forgetting methods were proposed in [13, 20, 1]. Deciding the existence of a finitely representable deductive view is 2ExpTime, and its size is, at most, triple exponential in the size of the original ontology [10]. For πβπ ontologies, the deductive view can be captured, possibly infinitely, in πβπ. Infinite forgetting views occur when cycles over some forgetting symbols exist [13]. In this case, finite representations may be approximated by fixpoint operators [12, 13, 26, 27] or by using foreign symbols to witness these cycles [13]. The latter representation can be converted to the former [13]. 3. Basic Definitions Let ππ , ππ be two disjoint sets of concept symbols and role symbols. Concepts in πβπ are of the following forms: β₯ | π΄ | Β¬πΆ | πΆ β π· | βπ.πΆ where π΄ β ππ , π β ππ and πΆ and π· are πβπ concepts. We also allow the following abbreviations: β€ β‘ Β¬β₯, βπ.πΆ β‘ Β¬βπ.Β¬πΆ, πΆ β π· β‘ Β¬(Β¬πΆ β Β¬π·). An interpretation in πβπ is a pair β = β¨Ξβ , Β·β β© where the domain Ξβ is a nonempty set and Β·β is an interpretation function that assigns to each concept symbol π΄ β ππ a subset of Ξβ and to each π β ππ a subset of Ξβ Γ Ξβ . The language constructs are interpreted as follows: β₯β := β , (Β¬πΆ)β := Ξβ βπΆ β , (πΆ β π·)β := πΆ β β© π·β , (βπ.πΆ)β := {π₯ β Ξβ |βπ¦ : (π₯, π¦) β πβ β§ π¦ β πΆ β }. A TBox, or an ontology, is a set of axioms of the form πΆ β π·, where πΆ and π· are concepts. β is model of an ontology πͺ if all axioms πΆ β π· β πͺ are true in β, in symbols β |= πΆ β π·. And, β |= πΆ β π· if and only if πΆ β β π·β . We say that πΆ β π· is satisfiable with respect to πͺ if and only if β |= πΆ β π· for some model β of πͺ. We also say that πΆ β π· is a consequence of πͺ, in symbols πͺ |= πΆ β π·, if and only if β |= πΆ β π· for every model β of πͺ. Let πΆ be an πβπ concept, we denote βοΈ by π ππ(πΆ) the set of concept and role symbols appearing in πΆ. For an ontology πͺ, π ππ(πͺ) = πΆβπ·βπͺ π ππ(πΆ) βͺ π ππ(π·). The size of an ontology is the number of axioms in it. Definition 1. Two models β and π₯ Ξ£-coincide iff Ξβ = Ξπ₯ and πβ = ππ₯ for every concept or role symbol π β Ξ£. Definition 2. Let πͺ1 and πͺ2 be two ontologies and Ξ£ a set of symbols where Ξ£ β ππ βͺ ππ . We say πͺ1 and πͺ2 are semantically Ξ£-equivalent, in symbols πͺ1 β‘β³ Ξ£ πͺ2 , iff for every model β1 of πͺ1 there is a model β2 of πͺ2 , and vice versa, such that β1 and β2 Ξ£-coincide. Resolution (Res) πΆ1 β π΄ πΆ2 β Β¬π΄ πΆ1 β πΆ2 where π΄ is a forgetting symbol and πΆ1 , πΆ2 are general concept expressions. Figure 1: Binary resolution rule Definition 3. Let πͺ1 and πͺ2 be two πβπ ontologies, and let Ξ£ a set of symbols where Ξ£ β ππ βͺππ . We say πͺ1 and πͺ2 are deductively Ξ£-equivalent, in symbols πͺ1 β‘πΆ Ξ£ πͺ2 , iff for every πβπ concept inclusion πΌ, where π ππ(πΌ) β Ξ£, we have πͺ1 |= πΌ iff πͺ2 |= πΌ. Deductive forgetting is defined using deductive equivalence [10]. Definition 4. Let πͺ be an πβπ ontology, and let β± β π ππ(πͺ) β© ππ be a forgetting signa- ture. An ontology π± is a deductive forgetting view of πͺ w.r.t. β± iff π ππ(π±) β π ππ(πͺ)ββ±, and πͺ β‘πΆ π ππ(πͺ)ββ± π±. 4. Computing the Intermediate Ontology The first stage of our method is to compute the intermediate ontology πͺπππ‘ of the input on- tology πͺ w.r.t. the given forgetting signature β±. The method applies resolution to the input ontology written in clausal form. πͺππππ’π ππ is computed by: (1) converting πͺ into negation normal form (NNF), with negation applied only to concept names, (2) miniscoping, i.e., re- placing βπ.πΆ β βπ.π· with the semantically equivalent βπ.(πΆ β π·), and βπ.πΆ β βπ.π· with the semantically equivalent βπ.(πΆ β π·), (3) applying structural transformations to extract the formulas under role restriction that contain the forgetting symbols by introducing fresh concept symbols (called definers) [28], and (4) converting the result to conjunctive normal form (CNF). Example 1. Consider the axiom π΄ β βπ.(π΅ β πΆ) where π΅ is a forgetting symbol. It is first converted to NNF by eliminating the connective β, giving π1 = {Β¬π΄ β βπ.(π΅ β πΆ)}. Structural transformation is applied to extract π΅ β πΆ, giving π2 = {Β¬π΄ β βπ.π·1 , Β¬π·1 β (π΅ β πΆ)} where π·1 β ππ is a definer symbol. Finally, π2 is converted to CNF, giving π3 = {Β¬π΄ β βπ.π·1 , Β¬π·1 β π΅, Β¬π·1 β πΆ}. The forgetting symbols in β± are then eliminated from πͺππππ’π ππ by iteratively eliminating them using the Resolution rule in Figure 1. When all possible resolution inferences have been performed on a concept symbol in β±, clauses that contain this concept symbol are removed in a purity deletion step. Additionally, the following operations are applied eagerly: (1) Tautology deletion: clauses of the form πΆ β Β¬πΆ β πΈ are deleted, where πΆ and πΈ are πβπ concepts. (2) Purification: if a forgetting symbol π΄ occurs only positively or only negatively in πͺ, then π΄ is replaced everywhere by β€ and β₯ respectively. Assume πͺπππ‘ is the set of clauses that remain. Example 2. Let πͺ = {π΄ β βπ.π΅ β βπ .Β¬π΅, πΊ β βπ.(Β¬π΅ β πΆ), π΅ β π»}, and β± = {π΅}. The method starts by generating πͺππππ’π ππ = {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π·1 β π΅, Β¬π·2 β Β¬π΅, Β¬π·3 β Β¬π΅ β πΆ, Β¬π΅ β π»}, where π·1 , π·2 , and π·3 are fresh definers. Then, it resolves on the concept symbol π΅ using the Resolution rule in Figure 1 generating additionally the clauses {Β¬π·1 β Β¬π·2 , Β¬π·1 β Β¬π·3 β πΆ, Β¬π·1 β π»}. Finally, the clauses {Β¬π·1 β π΅, Β¬π·2 β Β¬π΅, Β¬π·3 β Β¬π΅ β πΆ, Β¬π΅ β π»} are removed by purity deletion. So the intermediate ontology πͺπππ‘ is {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π·1 β Β¬π·2 , Β¬π·1 β Β¬π·3 β πΆ, Β¬π·1 β π»}. Theorem 1. πͺ β‘β³ πππ‘ π ππ(πͺ)ββ± πͺ . Theorem 2. The size of πͺπππ‘ is in the worst case exponential in the size of the given ontology πͺ and double exponential in the number of forgetting symbols. Definers are used in the intermediate ontology πͺπππ‘ to represent subsets of role successors. Precise definitions of definers cannot always be given without knowledge of the forgetting symbols. For instance in Example 1, π·1 is interpreted as π·1β = {π¦ β π΅ β β© πΆ β | (π₯, π¦) β πβ , π₯ β π΄β } where β is a model of π. Since the user of the forgetting view may not be aware of π΅, a definition of π·1 in terms of π΅ is not conveyed in πͺπππ‘ . Thus, definers may be viewed as second-order existentially quantified concept symbols because they represent some subsets of the domain whose definitions cannot be precisely captured. One can observe that πͺπππ‘ can be viewed as an approximation to the semantic forgetting view of πͺ with respect to β±. 5. Extracting Ξ and πͺπππ The second stage of the method is to obtain the sets Ξ and πͺπππ from πͺπππ‘ , where πͺπππ is an ontology approximating the deductive view and Ξ is the information difference between πͺπππ‘ and πͺπππ . The Reduction rule in Figure 2 removes from πͺπππ‘ the clauses with two or more negative definers. These clauses constitute the set Ξ. The Role Propagation rule in Figure 2 computes the πβπ consequences that are otherwise lost when removing the clauses of Ξ from πͺπππ‘ by the Reduction rule. Therefore, we require it to be applied before removing these clauses. The premises of the Role Propagation rule start with the clause π0 β πΆ0 , where π0 takes the form Β¬π·0 β Β¬π·1 β Β· Β· Β· β Β¬π·π . The second premise is a set of clauses ππ β πΆπ . Here, the concepts ππ takes the same form as the concept π0 , i.e., is a disjunction of negative definers, but also Definers(ππ ) β Definers(π0 ) where Definers(π ) denotes the set of definer symbols in π ππ(π ). The intuition here is that ππ β π0 . Therefore, every domain element that is not in the interpretation of π0 , consequently ππ , must be in the interpretation of πΆ0 and πΆπ . The clauses in the third and the fourth premises take the same form, except that existential role restriction is only allowed in the third premise. β¨οΈπ By the thirddand fourth premises, every domain element must be in the interpretation of d π=0 πΈπ or π¬π.( ππ=0 π·π ). But the latter can be rewritten as π¬π.Β¬π0 , which is subsumed by π¬π.( π π=0 πΆπ ) as concluded by the rule. Example 3. Continuing with Example 2, the Role Propagation rule applies with its four premises being: 1. π0 β πΆ0 = Β¬π·1 β Β¬π·3 β πΆ Role Propagation π βοΈ π βοΈ π0 β πΆ0 , {ππ β πΆπ },πΈ0 β π¬π.π·0 , {πΈπ β βπ.π·π } π=1 π=1 π β¨οΈ π d ( πΈπ ) β π¬π.( πΆπ ) π=0 π=0 π β¨οΈ where π0 = Β¬π·π , ππ is any sub-concept of π0 , π¬ β {β, β}, and πΆ0 and πΆπ do not contain a π=0 definer. Reduction πͺ βͺ {Β¬π·1 β ... β Β¬π·π β πΆ} πͺ where πΆ is a concept expression that does not contain a negative definer, π·1 , ..., π·π are definer symbols, and π β₯ 2. Figure 2: πβπ reduction rules. π βοΈ 2. {ππ β πΆπ } = {Β¬π·1 β π»} π=1 3. πΈ0 β π¬π.π· = Β¬πΊ β βπ.π·3 π βοΈ 4. {πΈπ β βπ.π·π } = {Β¬π΄ β βπ.π·1 } π=1 The conclusion is Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»). Note that the generated conclusion preserves information that would otherwise be lost when the clause Β¬π·1 β Β¬π·3 β πΆ is removed by the Reduction rule. After the Role Propagation rule has been exhaustively applied, the clauses of Ξ are removed. The remaining clauses constitute πͺπππ . Denote by πͺππ the ontology obtained from πͺπππ‘ by applying the Role Propagation rule. Example 4. Continuing with Example 2. We have πͺππ = πͺπππ‘ βͺ {Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»)} where the axiom on the right of the union operator is the conclusion of the Role Propagation rule obtained in Example 3. We also have Ξ = {Β¬π·1 β Β¬π·2 , Β¬π·1 β Β¬π·3 β πΆ}, and πͺπππ = {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π·1 β π», Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»)}. Theorem 3. πͺππ β‘πΆ π ππ(πͺ)ββ± πͺ πππ . Theorem 3 proves a main contribution of the paper. First, observe that πͺππ β‘β³ πππ‘ π ππ(πͺ)ββ± πͺ . It follows from this observation, and Theorems 1 and 3 that πͺ β‘πΆ π ππ(πͺ)ββ± πͺ πππ . Therefore, if πͺπ’π is a deductive view of πͺπππ with respect to π ππ(πͺ)ββ±, then πͺπππ is deductively equivalent to πͺπ’π with respect to π ππ(πͺ)ββ±. We shall strengthen this in the next section and show that if no cycles occur in πͺπππ then it is semantically equivalent to the deductive view with respect to π ππ(πͺ)ββ±. Second, observe that πͺππ = πͺπππ βͺ Ξ. Additionally, the clauses in Ξ have been generated in πͺπππ‘ by resolution inferences over the forgetting symbols, and the premises of these inferences were removed by purity deletion. Therefore, we find in general that πͺπππ ΜΈ|= Ξ. This implies that Ξ can be viewed as representing the information difference between πͺππ and πͺπππ . Altogether we therefore conclude that Ξ can be viewed as the information difference between πͺ and the deductive view with respect to π ππ(πͺ)ββ±. We end this section with a discussion on the extracted set Ξ which consists of clauses of the format Β¬π·1 β Β· Β· Β· β Β¬π·π β πΉ where π β₯ 2, or in axiom form, π·1 β Β· Β· Β· β π·π β πΉ . Since we introduced the definer symbols to represent subsets of role successors, these clauses can be understood as information on the conjunctions of different subsets of role successors. For instance, in Example 2, the clause Β¬π·1 β Β¬π·2 β Ξ specifies the constraint that the subset of π-successors and the subset of π -successors of domain elements in the interpretation of π΄ are disjoint. It was not a coincidence that we introduced definer symbols to represent subsets of role successors. Using them in this way and introducing them via structural transformation forces the clauses in Ξ to be explicit members of πͺπππ‘ which simplifies their extraction, giving us a representation of the difference between πͺ and the deductive view. 6. Eliminating the Definer Symbols We identified Ξ as the axioms that contain two or more negative definers. πͺπππ‘ and πͺπππ may contain definer symbols that appear negatively in clauses where no other negative definer is present. These definers can be eliminated safely while preserving the interpretations of the non-forgotten vocabulary. For this, we use the Definer Elimination rule in Figure 3. The side conditions of the Definer Elimination rule exclude the elimination of definers that may appear both positively and negatively in a clause. We call such definer symbols cyclic definers. The existence of cyclic definers signifies cycles in the original ontology over some forgetting symbols. In this case the deductive view may not exist as it requires an infinite representation. An approach to eliminate cyclic definers and obtain a finite approximation of the deductive view is using fixpoint operators [12]. As an alternative, cyclic definers can be left in the deductive view as witnesses of these cycles [13]. We find this the best option because it defers the decision of a suitable representation to a later stage. For clauses that contain only one negative definer symbol, possibly with other positive definers, the Definer Elimination rule in Figure 3 is applied exhaustively. The rule replaces the definer symbol π· with its super-concept πΆ1 β ... β πΆπ . Note that, in the Definer Elimination rule, πΆ may be β₯. Besides the Definer Elimination rule, we also eagerly apply the Tautology Deletion and the Purification rules (see Section 4). Example 5. Continuing with Example 4, π± is extracted from πͺπππ as follows: 1. The definers π·2 and π·3 are eliminated using Purification. Since π·2 and π·3 appear only pos- itively in πͺπππ , they are purified by replacing them with β€ which gives {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .β€, Β¬πΊ β βπ.β€, Β¬π·1 β π», Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»)}. As βπ .β€ evaluates to β€, the result can be simplified further to {Β¬π΄ β βπ.π·1 , Β¬πΊ β βπ.β€, Β¬π·1 β π», Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»)}. Definer Elimination πͺ βͺ {Β¬π· β πΆ1 , ..., Β¬π· β πΆπ } πͺ[π·/πΆ] where πΆ = βππ=1 πΆπ and π· β / π ππ(πΆ), πΆ does not contain any negative definers, and πͺ does not contain π· negatively. Figure 3: Definer elimination rule 2. The definer π·1 is eliminated by the Definer Elimination rule in Figure 3 giving {Β¬π΄ β βπ.π», Β¬πΊββπ.β€, Β¬π» βπ», Β¬π΄βΒ¬πΊββπ.(πΆ βπ»)}. The clause Β¬π» βπ» is then eliminated by Tautology Deletion giving π± = {Β¬π΄ β βπ.π», Β¬πΊ β βπ.β€Β¬π΄ β Β¬πΊ β βπ.(πΆ β π»)} Theorem 4. Let π± be generated from πͺπππ by applying the Definer Elimination rule from Figure 3 exhaustively. Then, (1) πͺπππ β‘β³π ππ(πͺ)ββ± π±; and (2) if π ππ(π±) β© ππ = β then π± is a deductive forgetting view of πͺ w.r.t. β±. 7. Computing More Informative Forgetting Views Our forgetting method allows customizing the informativeness of the final forgetting view according to user requirements, which reveals a spectrum of forgetting views that are more informative than the deductive view and at most as informative as the intermediate ontology. This can be done by overriding the Reduction rule as illustrated in the following example. Example 6. Consider πͺπππ‘ from Example 2. Applying the rules in Figure 2 gives πͺπππ and Ξ from Example 4. We may increase the informativeness of the final forgetting view by overriding the Reduction rule. We describe three different forgetting views that can be generated in this way. 1. If we want to preserve all the information about the π-successors of π΄, then we override the Reduction rule to retain the clauses where π·1 occurs. In this case, Ξ1 = β and the final forgetting view will be πͺ1πππ = π±1 = {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π·1 β Β¬π·2 , Β¬π·1 β Β¬π·3 β πΆ, Β¬π΄ β Β¬πΊ β βπ.πΆ}. 2. If we want to preserve the information about the π-successors of π΄ in relation to the π- successors of πΊ, we override the Reduction rule to retain the clauses where both π·1 and π·3 occur. That is, Ξ2 = {Β¬π·1 β Β¬π·2 } and Β¬π·1 β Β¬π·3 β πΆ ΜΈβ Ξ. Then, πͺ2πππ = {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π·1 β Β¬π·3 β πΆ, Β¬π΄ β Β¬πΊ β βπ.πΆ}, and π±2 = {Β¬π΄ β βπ.π·1 , Β¬πΊ β βπ.π·3 , Β¬π·1 β Β¬π·3 β πΆ, Β¬π΄ β Β¬πΊ β βπ.πΆ} with π·2 purified away. 3. If we are interested in the relation between the π and π successors of π΄, then we override the Reduction rule to remove Β¬π·1 β Β¬π·3 β πΆ but not Β¬π·1 β Β¬π·2 . That is Ξ3 = {Β¬π·1 β Β¬π·3 β πΆ}. Consequently, we get πͺ3πππ = {Β¬π΄ β βπ.π·1 , Β¬π΄ β βπ .π·2 , Β¬πΊ β βπ.π·3 , Β¬π΄ β Β¬πΊββπ.πΆ, Β¬π·1 βΒ¬π·2 }. The final forgetting view then becomes π±3 = {Β¬π΄ββπ.π·1 , Β¬π΄β βπ .π·2 , Β¬πΊ β βπ.β€, Β¬π΄ β Β¬πΊ β βπ.πΆ, Β¬π·1 β Β¬π·2 } with π·3 purified away. Observe that in π±1 and π±2 the clause Β¬π΄ β Β¬πΊ β βπ.πΆ is redundant. We can eliminate this redundancy by applying the Role Propagation rule only when a premise of the rule occurs in Ξ. Since the final forgetting views may use definers, the following question may be asked: Are definers in forgetting views limiting? We argue that since definers are existentially quantified and the forgetting view is expressed using πβπ syntax, standard reasoning tasks such as satisfiability checking, and query answering, can be performed with respect to the non-forgotten vocabulary using the existing πβπ methods. The following example explains the idea. Example 7. Let ontology πͺ = {π΄1 β βπ.π΅, π΄2 β βπ.Β¬π΅} be an ontology, and πͺπππ‘ = {Β¬π΄1 β βπ.π·1 , Β¬π΄2 β βπ.π·2 , Β¬π·1 β Β¬π·2 β β₯} the intermediate ontology of πͺ with respect to β± = {π΅} where π·1 and π·2 are definers. Assume Ξ = β , then πͺπππ‘ is the final forgetting view. Both πͺ and πͺπππ‘ model the information that the π-successors of the elements in the interpretation of π΄1 are disjoint from the π-successors of the elements in the interpretation of π΄2 . Suppose we additionally have a database π = {π΄1 (π1 ), π΄2 (π2 ), π(π1 , π)}, and we want to prove the unsatisfiability of π(π2 , π) with respect to the knowledge base consisting of πͺ and π. This can be done by using a standard πβπ reasoner to show that πͺ, π, π(π2 , π) |= β₯. Replacing πͺ with πͺπππ‘ , would still prove the unsatisfiability of π(π2 , π). Moreover, the reasoner does not require the full interpretation of π·1 and π·2 to prove that πͺπππ‘ , π, π(π2 , π) |= β₯. 8. Evaluation We implemented a prototype of our method based on Java 12 and the OWL API 5.1.11. We refer to our prototype as SeD. We used a random corpus of 50 ontologies from the NCBO Bioportal repository to perform the evaluation. Details of the corpus are given in the long version. We performed two evaluations, each corresponding to a different selection of the forgetting signature β±. Evaluation 1 selected β± as a segment of the ππ sorted by name (recall that ππ is the set of concept names of the input ontology), so β± contained related concept names. E.g., βAbdomenβ and βAbdomen-painβ were likely to be together in β± as they would be adjacent in the sorted ππ . The intuition was to simulate the use case of extracting the knowledge of a single topic, e.g., the digestive system from a large biomedical ontology. Evaluation 2 selected the concept names that occurred most frequently under role restrictions, aiming for more definers being introduced in πͺπππ‘ and bigger Ξ sets. The intuition was to simulate the worst case when the information difference between the intermediate ontology and the deductive views would be large, and the performance of our two-stage forgetting method would be expected to degrade. In every evaluation and for each ontology in the corpus, three forgetting experiments were performed to forget 10%, 30%, and 50% of concept symbols in ππ giving a total of 150 experiments in each evaluation. Each experiment compared SeD and Lethe (version 2.11-0.026)1 [15]. Lethe is an implementation of the deductive forgetting method in [13]. Each tool was allocated 2GB of memory and five hours time out. All experiments were run on a x64-based processor Intel(R) Core(TM) i5 CPU @ 2.7GHz with a 64-bit operating system (macOS Catalina 10.15.7). 1 http://www.cs.man.ac.uk/~koopmanp/lethe/index.html Table 1 Timeouts of SeD and Lethe Evaluation 1 Evaluation 2 10% 30% 50% 10% 30% 50% SeD 2 3 5 2 3 3 Lethe 2 7 8 1 6 7 Figure 4: Normal Distribution of the Gain values. Range of X-axes is Average Β± 3 Standard Deviations Table 1 shows the timeouts of SeD and Lethe. SeD appeared to be more reliable than Lethe in both evaluations. Also, SeD was less affected by increasing the size of β± from 10% to 30% to 50% of ππ , suggesting that SeD is more scalable to harder problems than Lethe. Next we compared the execution times of SeD and Lethe to compute the deductive view. We first computed the time gained by using SeD over Lethe with the formula Gain = (ππΏ β ππ )/ππΏ , where ππΏ and ππ are the times consumed by Lethe and SeD respectively. Second, we computed the averages and standard deviations of the Gain values. In line with standard data analysis methods, outliers were excluded. These were experiments with extreme Gain values compared to the rest of the experiments. In Evaluation 1 we excluded two experiments in the 10% setting and one in the 50% setting, whereas in Evaluation 2 we excluded one experiment in the 10% setting and one from the 50% setting. The averages and standard deviations in Evaluation 1 were: (0.58, 0.78), (0.60, 0.68), and (0.66, 0.68) in the 10%, 30%, and 50% settings respectively. In Evaluation 2 they were: (0.65, 0.46), (0.74, 0.50), and (0.74, 0.67) in the 10%, 30%, and 50% settings respectively. Figure 4 shows the normal distributions of the Gain values in the three settings in the two evaluations. The graphs reflect the attained positive averages stated above, and compare the gain values across the three settings in each evaluation, also allowing the two evaluations to be compared. Surprisingly, a better and more consistent performance was found in Evaluation 2 over Evaluation 1, against our expectation that Evaluation 2 would represent the worst case scenario for SeD. This is indicated by the higher peaks indicating higher probability of achieving the average Gain, and the narrower curves indicating less variation in the results. The performance improvement happens due to the forgetting method itself not the corpus. While Lethe translates the input ontology to a clausal form that is similar to ours, it disallows clauses with two or more negative definers. To compensate for this restriction, Lethe introduces definer symbols as part of the forgetting calculus, and builds a subsumption hierarchy between the definers. This hierarchy forces extra resolution inferences to be performed. The following example illustrates dynamic introduction of definers in Lethe. Example 8. Let πͺ = {π΄1 β βπ.Β¬π΅, π΄2 β βπ.π΅}, and β± = {π΅}. Lethe generates πͺππππ’π ππ as {Β¬π΄1 β βπ.π·1 , Β¬π΄2 β βπ.π·2 , Β¬π·1 β Β¬π΅, Β¬π·2 β π΅}. Instead of resolving π΅ directly to compute the clause Β¬π·1 β Β¬π·2 , Lethe introduces a new definer π·3 and generates an intermediate ontology πͺ1 = πͺππππ’π ππ βͺ {Β¬π΄1 β Β¬π΄2 β βπ.π·3 , Β¬π·3 β π·1 , Β¬π·3 β π·2 }. The last two clauses on πͺ1 are resolved with Β¬π·1 β Β¬π΅ and Β¬π·2 β π΅ to give Β¬π·3 β Β¬π΅ and Β¬π·3 β π΅ which in turn are resolved together to give Β¬π·3 . All clauses where π΅ occurs are then removed to give πͺ2 = {Β¬π΄1 β βπ.π·1 , Β¬π΄2 β βπ.π·2 , Β¬π΄1 β Β¬π΄2 β βπ.π·3 , Β¬π·3 }. The definers π·1 , π·2 , and π·3 are eliminated in a similar way to the method described in Section 6 giving π± = {Β¬π΄1 β βπ.β€, Β¬π΄1 β Β¬π΄2 }. Our normal form is more flexible as it allows several negative definers to appear in a clause, thus avoids the extra resolution inferences performed by Lethe. We measured the size of the extracted Ξ set. In Evaluation 1, Ξ was on average 0.01%, 0.66%, and 18.3% of the size of the original ontology, in the 10%, 30%, and 50% settings respectively. In Evaluation 2, Ξ was on average 0.23%, 0.88%, and 7.13% of the size of the original ontology, in the 10%, 30%, and 50% settings respectively. We also measured the number of definers in Ξ as a ratio to the forgetting signature. In Evaluation 1, they were on average 0.03%, 0.63%, and 1.5% in the 10%, 30%, and 50% settings respectively. In Evaluation 2, they were on average 0.04%, 0%, and 1.4% in the 10%, 30%, and 50% settings respectively. These ratios indicate that our fine-grained method was feasible since appending πͺπππ with axioms from Ξ introduced few definers relative to the size of β±. Finally, we measured the size of the deductive view relative to the original ontology. In Evaluation 1 it was on average 114%, 103%, and 117% of the size of the original ontology in the 10%, 30%, and 50% settings respectively. In Evaluation 2, it was on average 113%, 95%, and 103% of the size of the original ontology in the 10%, 30%, and 50% settings respectively. 9. Conclusions and Future Work We presented a new forgetting method that performs deductive forgetting, and extracts a set Ξ of axioms representing the information difference between the original ontology and the deductive view. Not only does this give a clearer understanding, in terms of the modelled information, on the difference between the input ontology and the deductive view, but also it allows a fine-grained forgetting system that gives control over the information modelled in the forgetting view. Empirical evaluation suggests that our forgetting method is faster than the state-of-the-art forgetting tool Lethe despite computing more information. Nevertheless, our evaluation suggested that appending the deductive forgetting view with information from Ξ introduces few foreign symbols compared to the forgotten symbols. The final forgetting view therefore remains a compact extract of the original ontology for the use in applications. Future work will study in greater depth the newly revealed spectrum of forgetting variants, and their intersections with other forgetting variants in the literature. References [1] M. Ludwig, B. Konev, Practical uniform interpolation and forgetting for πβπ TBoxes with applications to logical difference, in: Proc. KR 2014, AAAI Press, 2014. [2] B. C. Grau, B. Motik, Pushing the limits of reasoning over ontologies with hidden content, in: Proc. KR 2010, AAAI Press, 2010, pp. 214β224. URL: http://www.comlab.ox.ac.uk/ people/Boris.Motik/pubs/gm10pushing-ibq.pdf. [3] W. Del-Pinto, R. A. Schmidt, ABox abduction via forgetting in ALC, in: Proc. AAAI 2019, AAAI Press, 2019, pp. 2768β2775. [4] J. Lang, P. Marquis, Reasoning under inconsistency: A forgetting-based approach, Artificial Intelligence 174 (2010) 799β823. URL: https://www.sciencedirect.com/science/article/pii/ S0004370210000676. doi:https://doi.org/10.1016/j.artint.2010.04.023. [5] D. Subramanian, M. R. Genesereth, The relevance of irrelevance., in: Proc. IJCAI, 1987. [6] G. Lakemeyer, Relevance from an Epistemic Perspective, Artif. Intell. 97 (1997) 137β167. doi:10.1016/S0004-3702(97)00038-6. [7] J. Lang, P. Liberatore, P. Marquis, Propositional independence: Formula-variable indepen- dence and forgetting, Artificial Intelligence Research 18 (2003) 391β443. [8] E. Erdem, P. Ferraris, Forgetting Actions in Domain Descriptions, in: Proc. AAAI 2007, AAAI Press, 2007, pp. 409β414. [9] Y. Zhang, Y. Zhou, Knowledge forgetting: Properties and applications, Artificial Intelli- gence 173 (2009) 1525β1537. doi:10.1016/j.artint.2009.07.005. [10] C. Lutz, F. Wolter, Foundations for uniform interpolation and forgetting in expressive description logics, in: Proc. IJCAI 2011, 2011, p. 989β995. [11] J. Delgrande, A Knowledge Level Account of Forgetting, Artificial Intelligence Research 60 (2017) 1165β1213. doi:10.1613/jair.5530. [12] A. Nonnengart, A. Szalas, A fixpoint approach to second-order quantifier elimination with applications to correspondence theory, in: Logic at Work, Springer, 1998. [13] P. Koopmann, R. A. Schmidt, Uniform interpolation of πβπ-ontologies using fixpoints, in: Proc. FroCoS 2013, volume 8152 of LNAI, Springer, 2013, pp. 87β102. [14] F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, P. F. Patel-Schneider, The Description Logic Handbook, Cambridge Univ. Press, 2003. [15] P. Koopmann, R. A. Schmidt, Implementation and evaluation of forgetting in ALC- ontologies, in: Proc. WoMo 2013, volume 1081, CEUR-WS.org, 2013. [16] F. Lin, R. Reiter, Forget it, in: AAAI Fall Symp. on Relevance, 1994, pp. 154β159. [17] D. M. Gabbay, H. J. Ohlbach, Quantifier elimination in second-order predicate logic, in: Proc. KR 1992, Morgan Kaufmann, 1992, pp. 425β435. [18] Y. Zhang, Y. Zhou, Forgetting revisited, in: Proc. KR, AAAI Press, 2010. [19] S. Ghilardi, An algebraic theory of normal forms, Ann. Pure Appl. Logic 71 (1995) 189 β 245. [20] A. Herzig, J. Mengin, Uniform interpolation by resolution in modal logic, in: Proc. JELIA 2008, volume 5293 LNAI, Springer, 2008, pp. 219β231. [21] H. Ditmarsch, A. Herzig, J. Lang, P. Marquis, Introspective forgetting, in: Proc. AI 2008, Springer, 2008, pp. 18β29. [22] B. Konev, D. Walther, F. Wolter, Forgetting and uniform interpolation in large-scale description logic terminologies, in: Proc. IJCAI 2009, Morgan Kaufmann, 2009, p. 830β835. [23] B. Konev, C. Lutz, D. Walther, F. Wolter, Model-theoretic inseparability and modularity of description logic ontologies, Artificial Intelligence 203 (2013) 66β103. doi:10.1016/j. artint.2013.07.004. [24] E. Botoeva, B. Konev, C. Lutz, V. Ryzhikov, F. Wolter, M. Zakharyaschev, Inseparability and conservative extensions of description logic ontologies: A survey, in: Reasoning Web 2016, Springer, 2017, pp. 27β89. URL: https://doi.org/10.1007/978-3-319-49493-7_2. doi:10.1007/978-3-319-49493-7_2. [25] C. Lutz, F. Wolter, Deciding inseparability and conservative extensions in the description logic β°β, Symbolic Computation 45 (2010) 194β228. doi:10.1016/j.jsc.2008.10.007. [26] G. DβAgostino, M. Hollenberg, Uniform interpolation, automata and the modal π-calculus, Logic Group Preprint Series 165 (1996). [27] D. Calvanese, G. De Giacomo, M. Lenzerini, Reasoning in Expressive Description Log- ics with Fixpoints Based on Automata on Infinite Trees, in: Proc. IJCAI 1999, Morgan Kaufmann, 1999, pp. 84β89. [28] A. Nonnengart, C. Weidenbach, Computing small clause normal forms, in: Handbook of Automated Reasoning, North-Holland, 2001, pp. 335 β 367.