<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title></journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>SAT-Based Bounded Fitting for the Description Logic 𝒜ℒ𝒞 (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Maurice Funk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean Christoph Jung</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tom Voellmer</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI)</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Leipzig University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>TU Dortmund University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>13870</volume>
      <fpage>3</fpage>
      <lpage>6</lpage>
      <abstract>
        <p>Bounded fitting is a general paradigm for learning logical formulas from positive and negative data examples that has received considerable interest recently. We investigate bounded fitting for concepts formulated in the description logic 𝒜ℒ𝒞 and its syntactic fragments. We show that the underlying size-restricted fitting problem is NP-complete for all studied fragments, even in the special case of a single positive and a single negative example. By design, bounded fitting is an Occam algorithm and thus a sample-efficient PAC learning algorithm, regardless of the studied fragment. We complement this by showing that efficient PAC learning is impossible under standard complexity-theoretic assumptions, and that other natural learning algorithms are typically not sample-efficient PAC learning algorithms. Finally, we present an implementation of bounded fitting in 𝒜ℒ𝒞 and its fragments based on a SAT solver. We discuss optimizations and compare our implementation to other concept learning tools.</p>
      </abstract>
      <kwd-group>
        <kwd>Description Logic</kwd>
        <kwd>Bounded Fitting</kwd>
        <kwd>PAC Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Learning description logic (DL) concepts from given data examples is an important task when working
with large knowledge bases [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. For the purpose of this paper, an example is a pair (ℐ, d) where ℐ
is a finite interpretation (describing, e.g., a database or a knowledge graph) and d is some individual
in ℐ. Moreover, a DL concept C fits a set P of positive examples and a set N of negative examples if
ℐ |= C(d) for all (ℐ, d) ∈ P and ℐ ̸|= C(d) for all (ℐ, d) ∈ N. We mention three applications. First,
the fitting concept may be used as an explanation of the separation between good and bad “scenarios”,
described by P and N, respectively. For example, P and N could be data describing users who visited
(resp., did not visit) a certain page, and a fitting C would explain the users’ behavior from their data.
Second, under the classical query-by-example paradigm [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], a human user may reverse-engineer a
DL concept to be used as a query by manually selecting elements they want to have returned (P) or not
returned (N), and the system comes up with an expression satisfying the demands. Finally, an ontology
engineer may seek a definition of some symbol A satisfied in the interpretation, so they may ask for a
concept separating the instances of A from the non-instances.
      </p>
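To make the fitting semantics concrete, the following is a minimal sketch of evaluating a concept C over a finite interpretation ℐ and checking whether it fits example sets P and N. The nested-tuple concept encoding and all names are our own illustrative assumptions, not notation from the paper:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Interp:
    domain: frozenset  # set of individuals
    conc: dict         # concept name -> set of individuals
    role: dict         # role name -> set of (d, e) pairs

def extension(c, i):
    """Extension of concept c (a nested-tuple AST) in interpretation i."""
    op = c[0]
    if op == "top":
        return set(i.domain)
    if op == "bot":
        return set()
    if op == "name":
        return set(i.conc.get(c[1], set()))
    if op == "not":
        return set(i.domain) - extension(c[1], i)
    if op == "and":
        return extension(c[1], i) & extension(c[2], i)
    if op == "or":
        return extension(c[1], i) | extension(c[2], i)
    if op == "exists":  # ∃r.C: some r-successor is in C
        succ = extension(c[2], i)
        return {d for d in i.domain
                if any((d, e) in i.role.get(c[1], set()) for e in succ)}
    if op == "forall":  # ∀r.C: every r-successor is in C
        succ = extension(c[2], i)
        return {d for d in i.domain
                if all(e in succ for (x, e) in i.role.get(c[1], set()) if x == d)}
    raise ValueError(op)

def fits(c, pos, neg):
    # C fits P, N iff d ∈ C^I for all (I, d) ∈ P and d ∉ C^I for all (I, d) ∈ N
    return (all(d in extension(c, i) for (i, d) in pos)
            and all(d not in extension(c, i) for (i, d) in neg))
```

Note that evaluation over a finite interpretation is polynomial, which is what makes "check whether a candidate concept fits" easy; the hardness discussed later lies in finding a small fitting concept.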
      <p>
        In this paper, we study the problem of learning concepts formulated in the description logic 𝒜ℒ𝒞,
which is the basic logic underlying the web ontology language OWL 2 DL [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and its syntactic
fragments. The importance of finding fitting description logic concepts has resulted in both foundational
work [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ] and systems. While most systems are based on heuristic search and refinement operators [
        <xref ref-type="bibr" rid="ref10 ref9">9,
10, 11, 12, 13</xref>
        ] or, more recently, also on neural techniques [14, 15], we approach the problem via bounded
fitting. Bounded fitting is a general paradigm for fitting logical formulas to positive and negative examples
that has been investigated recently for the description logic ℰℒ [16] and a range of other logics such as
linear temporal logic LTL [17, 18] and computation tree logic CTL [19]. Algorithm 1 provides an
abstract description of bounded fitting for a given description logic ℒ.
      </p>
      <p>Input: Positive examples P, negative examples N
1 for s := 1, 2, . . . do
2     if there is a concept C ∈ ℒ of size s that fits P, N then
3         return C</p>
      <p>Algorithm 1: Bounded Fitting for a description logic ℒ.
It should be clear that, if any
fitting concept exists, bounded fitting always returns a fitting concept of minimal size, which is often
a desirable property. From the practical perspective, human users typically prefer shorter, that is,
simpler concepts in the applications sketched above. From a theoretical perspective, this property
makes bounded fitting an Occam algorithm, which implies that it comes with probabilistic generalization
guarantees in Valiant’s probably approximately correct (PAC) learning framework [20, 21]. Intuitively,
this means that bounded fitting needs only few examples to be able to generalize to unseen examples.</p>
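The loop of Algorithm 1 can be sketched as follows. As a stand-in for ℒ concepts we use a toy hypothesis class (conjunctions of attributes over set-valued examples); `fits` and the enumeration are illustrative assumptions, not the paper's implementation. The point is only that the first hypothesis found is size-minimal by construction:

```python
from itertools import combinations

def fits(conj, pos, neg):
    # a conjunction (set of attributes) fits if it is contained in every
    # positive example and in no negative example
    return all(conj <= p for p in pos) and all(not conj <= n for n in neg)

def bounded_fitting(attrs, pos, neg, max_size=10):
    for s in range(0, max_size + 1):              # s := 0, 1, 2, ...
        for conj in map(frozenset, combinations(sorted(attrs), s)):
            if fits(conj, pos, neg):
                return conj                        # minimal size guaranteed
    return None                                    # no fitting up to max_size
```

Replacing the inner loop by a SAT call over a size-s concept encoding gives the scheme actually studied in the paper.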
      <p>The basic DL 𝒜ℒ𝒞 provides the logical constructors conjunction ⊓, disjunction ⊔, negation ¬,
existential restriction ∃, and universal restriction ∀ to build complex concepts from concept names
and ⊥ and ⊤. Motivated by the fact that, depending on the application, one may not need all concept
constructors, fragments 𝒜ℒ𝒞(𝒪) of 𝒜ℒ𝒞 have been studied which allow only a subset 𝒪 ⊆ {⊓, ⊔, ¬, ∃, ∀}
of the available constructors. For instance, the mentioned DL ℰℒ is defined by {⊓, ∃}; another popular
logic is ℱℒ0, which is defined by {⊓, ∀}. Bounded fitting has been studied recently for ℰℒ, and we
extend this study here to all other syntactic fragments.</p>
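Membership of a concept in such a fragment is a purely syntactic condition, which can be checked by a walk over the concept's syntax tree. The nested-tuple AST below is our own assumed encoding, used for illustration only:

```python
def in_fragment(c, allowed):
    """Check that concept c (nested-tuple AST) only uses constructors
    from allowed ⊆ {"and", "or", "not", "exists", "forall"}."""
    op = c[0]
    if op in ("top", "bot", "name"):
        return True                     # atoms belong to every fragment
    if op not in allowed:
        return False
    # recurse into subconcepts; skip non-tuple parts such as role names
    return all(in_fragment(sub, allowed)
               for sub in c[1:] if isinstance(sub, tuple))
```

For example, with `allowed = {"and", "exists"}` this accepts exactly the ℰℒ concepts in this encoding.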
      <p>The paper corresponding to this extended abstract has been accepted for publication at ISWC 2025 [22].
The full paper with all proof details is available on arXiv [23].</p>
    </sec>
    <sec id="sec-2">
      <title>2. Contributions</title>
      <p>Our main contributions are as follows. First, we study the size-restricted fitting problem: given positive
examples P, negative examples N, and a size bound s in unary encoding, determine whether there
is a concept of size at most s that fits P and N. Clearly, this is precisely the problem to be solved in
Line 2 of bounded fitting. Then, motivated by the ability of bounded fitting to generalize well from
few examples, we investigate the generalization abilities of fitting algorithms for DLs 𝒜ℒ𝒞(𝒪) in Valiant’s
PAC learning framework. Finally, we provide an implementation of bounded fitting for 𝒜ℒ𝒞 and its
fragments that relies on a SAT solver to solve size-restricted fitting in Line 2 of bounded fitting. We
now give a more detailed overview.</p>
      <p>Complexity of size-restricted fitting. We show that size-restricted fitting is NP-complete for 𝒜ℒ𝒞
and all its syntactic fragments 𝒜ℒ𝒞(𝒪) such that 𝒪 contains at least ∃ or ∀. This was known for the
fragment ℰℒ [24]. Containment in NP can be shown by a simple guess-and-check argument. The
lower bound is more technical and rather strong: it applies already in the case of only one positive
and one negative example and over a signature consisting of two role names and one concept name.
It thus strengthens the mentioned result for ℰℒ, which requires a non-constant number of positive
examples. The proof is by reduction from the hitting set problem. The examples constructed in the
reduction admit a fitting 𝒜ℒ𝒞 concept if and only if there is a fitting 𝒜ℒ𝒞(∃) concept, which means that the reduction
shows NP-hardness for all 𝒜ℒ𝒞(𝒪) with 𝒪 containing ∃. NP-hardness for the other fragments follows by
applying a duality principle.</p>
      <p>Theorem 1. Size-restricted fitting for 𝒜ℒ𝒞(𝒪) is NP-complete for every 𝒪 ⊆ {⊓, ⊔, ¬, ∃, ∀} with {∃, ∀} ∩
𝒪 ̸= ∅. This already holds if only a single positive and a single negative example are allowed, and over a
signature consisting of two role names and one concept name.</p>
      <p>Generalization. We investigate the learnability of 𝒜ℒ𝒞 concepts in Valiant’s PAC learning
framework [20]. A PAC learning algorithm is a fitting algorithm that, given sufficiently many labeled examples
drawn from an unknown distribution, returns a concept that generalizes well (that is, has a small error
when evaluated over the entire distribution) with high probability. We call such an algorithm efficient if
it runs in polynomial time and sample-efficient if a polynomial number of examples suffices to ensure
the described probabilistic generalization guarantees. For a precise definition, see the full paper but
also [25]. We start by observing that, under reasonable complexity-theoretic assumptions, no 𝒜ℒ𝒞(𝒪)
admits an efficient PAC learning algorithm, that is, an algorithm that runs in polynomial time and
produces a concept that satisfies the definition of PAC learning. This is stated in the following theorem.
Theorem 2. Let 𝒪 ⊆ {⊓, ⊔, ¬, ∃, ∀}. If there is an efficient PAC learning algorithm for
𝒜ℒ𝒞(𝒪), then:
1. NP = RP, if 𝒪 contains at least one of ∃/∀ and {⊓, ⊔} ̸⊆ 𝒪;
2. RSA encryption is polynomial-time invertible, if {⊓, ⊔} ⊆ 𝒪.</p>
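For context, the sample-efficiency of Occam algorithms such as bounded fitting can be traced to the standard finite-hypothesis-class bound, a textbook PAC fact rather than a result specific to this paper: a learner that always returns a fitting concept from the class H_s of concepts of size at most s achieves error at most ε with probability at least 1 − δ once the number m of examples satisfies

```latex
m \;\geq\; \frac{1}{\varepsilon}\left(\ln |H_s| + \ln \frac{1}{\delta}\right).
```

Since the number of concepts of size at most s over a fixed signature is at most exponential in s, the term ln |H_s| is polynomial in s, which is why returning size-minimal fitting concepts yields a polynomial sample complexity.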
      <p>We then analyze the generalization ability of fitting algorithms that have favorable properties from
a logical perspective in that they return fitting concepts that are most specific, most general, or of
minimal quantifier depth among all fitting concepts. More precisely, we investigate whether there can
be PAC learning algorithms with such properties that are sample-efficient. We show that, with one
exception, all such algorithms are not sample-efficient, and hence do not generalize well. This was
already known for the fragment ℰℒ of 𝒜ℒ𝒞 [16], and some of our proofs rely on similar techniques.
Our results are summarized by the following theorem.</p>
      <p>Theorem 3. Let 𝒪 ⊆ {⊓, ⊔, ¬, ∃, ∀} be any set containing at least one of ∃/∀ and at least one of ⊓/⊔,
and let A be a fitting algorithm for 𝒜ℒ𝒞(𝒪). Then A is not a sample-efficient PAC learning algorithm, if:
1. 𝒪 ̸= {∃, ⊔} and A always returns a most specific fitting if one exists;
2. 𝒪 ̸= {∀, ⊓} and A always returns a most general fitting if one exists;
3. A always returns a fitting of minimal quantifier depth if some fitting exists.</p>
      <p>The exceptions in the theorem are the cases 𝒪 = {∃, ⊔} and 𝒪 = {∀, ⊓}. For these fragments,
bounded fitting is a sample-efficient PAC learning algorithm that returns a most specific or most general,
respectively, fitting concept if it exists.</p>
      <p>Implementation. We implemented bounded fitting for 𝒜ℒ𝒞 and its fragments using a SAT solver
to decide the NP-complete size-restricted fitting problems by encoding size-restricted fitting into a
propositional formula. We present two optimizations of the basic encoding: one where the structure of
concepts is precomputed and then supplied to the SAT solver, and another where types of elements
are used instead of individual concept names. Additionally, our implementation supports approximate
fitting, the optimization variant of the fitting problem, where one searches for a concept that fits as
many positive and negative examples as possible.</p>
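The approximate-fitting objective can be illustrated as follows. In the actual tool the maximization is delegated to the SAT solver; here, purely for illustration, candidate concepts are opaque predicates and the search is plain enumeration (all names are our own assumptions):

```python
def score(pred, pos, neg):
    # number of correctly handled examples: positives the concept covers
    # plus negatives it excludes
    return sum(pred(d) for d in pos) + sum(not pred(d) for d in neg)

def best_approximate_fit(candidates, pos, neg):
    # candidates: list of (name, predicate) pairs; return the candidate
    # fitting as many positive and negative examples as possible
    return max(candidates, key=lambda c: score(c[1], pos, neg))
```

An exact fitting is then simply a candidate whose score equals the total number of examples.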
      <p>
        We compare our implementation ALC-SAT+ to other systems that support learning of 𝒜ℒ𝒞 concepts,
namely CELOE [12], SParCEL [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], and EvoLearner [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], considering both exact fitting and approximate
fitting. Our implementation is available at https://github.com/SAT-based-Concept-Learning/ALCSAT.
For evaluating exact fitting, we generated sets of positive and negative examples from a fragment
of the YAGO knowledge graph [26]; see the full paper for details. For approximate fitting, we compared
the systems on the SML benchmarks [27]. We measured the accuracy and length of the returned
concepts using 10-fold cross-validation. Our results on the SML benchmarks are shown in Table 1,
where the first line in each cell is the accuracy and the second line is the length of the returned concept;
in both cases, the ±-term denotes the standard deviation. Our tool achieves competitive values for both
accuracy and concept length. In some instances, ALC-SAT+ may return a larger concept than
the other tools; however, this means that the reported accuracy cannot be achieved with a smaller
concept.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Acknowledgments</title>
      <sec id="sec-3-1">
        <p>Jean Christoph Jung was supported by DFG project JU 3197/1-1.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Declaration on Generative AI</title>
      <sec id="sec-4-1">
        <p>The authors have not employed any Generative AI tools.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>[1] J. Lehmann, Learning OWL Class Expressions, volume 6 of Studies on the Semantic Web, IOS Press, 2010. doi:10.3233/978-1-61499-340-7-i.</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>[2] J. Lehmann, N. Fanizzi, L. Bühmann, C. d'Amato, Concept learning, in: Perspectives on Ontology Learning, AKA / IOS Press, 2014, pp. 71-91. URL: https://jens-lehmann.org/files/2014/pol_concept_learning.pdf.</mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>[3] M. M. Zloof, Query by example, in: American Federation of Information Processing Societies: 1975 National Computer Conference, 19-22 May 1975, Anaheim, CA, USA, volume 44 of AFIPS Conference Proceedings, AFIPS Press, 1975, pp. 431-438. doi:10.1145/1499949.1500034.</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>[4] D. M. L. Martins, Reverse engineering database queries from examples: State-of-the-art, challenges, and research opportunities, Information Systems 83 (2019) 89-100. doi:10.1016/J.IS.2019.03.002.</mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>[5] I. Horrocks, P. F. Patel-Schneider, F. van Harmelen, From SHIQ and RDF to OWL: the making of a web ontology language, Journal of Web Semantics 1 (2003) 7-26. doi:10.1016/J.WEBSEM.2003.07.001.</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>[6] M. Funk, J. C. Jung, C. Lutz, H. Pulcini, F. Wolter, Learning description logic concepts: When can positive and negative examples be separated?, in: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19, 2019, pp. 1682-1688. doi:10.24963/ijcai.2019/233.</mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>[7] J. Lehmann, P. Hitzler, Concept learning in description logics using refinement operators, Machine Learning 78 (2010) 203-250. doi:10.1007/s10994-009-5146-2.</mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>[8] M. Funk, J. C. Jung, C. Lutz, Actively learning concept and conjunctive queries under ℰℒ-ontologies, in: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, 2021, pp. 1887-1893. doi:10.24963/ijcai.2021/260.</mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>[9] S. Heindorf, L. Blübaum, N. Düsterhus, T. Werner, V. N. Golani, C. Demir, A. Ngonga Ngomo, EvoLearner: Learning description logics with evolutionary algorithms, in: WWW '22: The ACM Web Conference 2022, ACM, 2022, pp. 818-828. doi:10.1145/3485447.3511925.</mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>[10] A. C. Tran, J. Dietrich, H. W. Guesgen, S. Marsland, Parallel symmetric class expression learning, Journal of Machine Learning Research 18 (2017) 64:1-64:34. URL: https://jmlr.org/papers/v18/14-317.html.</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>