Finding Good Proofs for Answers to Conjunctive
Queries Mediated by Lightweight Ontologies
Christian Alrabbaa, Stefan Borgwardt, Patrick Koopmann and Alisa Kovtunova
Institute of Theoretical Computer Science, Technische Universität Dresden, 01062 Dresden, Germany


                                      Abstract
                                      In ontology-mediated query answering, access to incomplete data sources is mediated by a conceptual
                                      layer constituted by an ontology. To correctly compute answers to queries, it is necessary to perform
                                      complex reasoning over the constraints expressed by the ontology. In the literature, there exists a
                                      multitude of techniques incorporating the ontological knowledge into queries. However, few of these
                                      approaches were designed for comprehensibility of the query answers. In this article, we try to bridge
                                      these two qualities by adapting a proof framework originally applied to axiom entailment for conjunctive
                                      query answering. We investigate the data and combined complexity of determining the existence of
                                      a proof below a given quality threshold, which can be measured in different ways. By distinguishing
                                      various parameters such as the shape of a query, we obtain an overview of the complexity of this problem
                                      for the lightweight ontology languages DL-Lite𝑅 ,and also have a brief look at temporal query answering.


1. Introduction
Explaining description logic (DL) reasoning has a long tradition, starting with the first works on
proofs for standard DL entailments [1, 2]. A popular and very effective method is justifications,
which simply point out the axioms from an ontology that are responsible for an entailment [3,
4, 5, 6]. More recently, work has resumed on techniques to find proofs for explaining more
complex logical consequences [7, 8, 9, 10, 11]. On the other hand, if a desired entailment
does not hold, one needs different explanation techniques such as abduction [12, 13, 14] or
counterinterpretations [15]. Explaining answers to conjunctive queries (CQs) has also been
investigated before, in the form of abduction for missing answers over DL-Lite ontologies [14],
provenance for positive answers in DL-Lite and ℰℒ [16, 17], as well as proofs for DL-Lite query
answering [18, 19, 20].
   Here, we also investigate proofs for CQ answers, inspired by [18, 19, 20], but additionally
consider the problem of generating good proofs according to some quality measures and provide
a range of complexity results focussing on DL-Lite𝑅 .In addition to classical OMQA, we also have
a brief look at explaining inferences over temporal data using a query language incorporating
metric temporal operators. Our results are based on a framework developed for proofs of
standard DL reasoning [9]. There, proofs are formalized as directed, acyclic hypergraphs and

   DL 2022: 35th International Workshop on Description Logics, August 7–10, 2022, Haifa, Israel
" christian.alrabbaa@tu-dresden.de (C. Alrabbaa); stefan.borgwardt@tu-dresden.de (S. Borgwardt);
patrick.koopmann@tu-dresden.de (P. Koopmann); alisa.kovtunova@tu-dresden.de (A. Kovtunova)
 0000-0002-2925-1765 (C. Alrabbaa); 0000-0003-0924-8478 (S. Borgwardt); 0000-0001-5999-2583 (P. Koopmann);
0000-0001-9936-0943 (A. Kovtunova)
                                    © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
 CEUR
 Workshop
 Proceedings
               http://ceur-ws.org
               ISSN 1613-0073
                                    CEUR Workshop Proceedings (CEUR-WS.org)
proof quality can be measured in different ways. We mainly consider the size (the number of
formulas) of a proof as well as its tree size, which corresponds to the size when the proof is
presented in a tree-shaped way (which may require repeating subproofs), as it is often done
in practice [8, 21]. The quest for good proofs is formalized as a search problem in a so-called
derivation structures produced by a deriver, which specifies the possible inferences.
   In this paper, we consider two different kinds of derivers for generating proofs for CQ answers.
These loosely correspond to the approaches in [18, 19, 20], but are generalized to apply to a larger
class of DLs. Specifically, our structures rely on a translation of DLs to existential rules [22], and
thus apply to any DL that can be expressed in this formalism. One deriver, denoted by Dcq and
inspired by [19, 20], focuses on the derivation of CQs, which can be derived from other CQs and
ontology axioms. Inferences in Dcq are logically sound, but can be harder to understand. The
reason is the local scope of existential quantification in a CQ, which forces atoms connected
by the same variables to be carried along inferences they are not relevant for. This problem
is circumvented with the deriver Dsk , which relies on a Skolemized version of the TBox. This
allows one to focus on inferences of single atoms that are only later aggregated into the final
CQ, leading to simpler sentences within the proof. Focusing on the particular cases of DL-Lite𝑅
and ℰℒ, we consider the complexity of the decision problems of finding proofs of (tree) size
below a given threshold 𝑛 in these derivation structures. We find that for DL-Lite𝑅 and any
DL in which CQ answering is UCQ-rewritable, all of these problems (regardless of derivation
structure and quality measure) are in AC0 in data complexity. In combined complexity, these
problems are NP-complete in general, but polynomial when considering only acyclic queries
and tree size. We also obtain similar results for the case of Dsk w.r.t. ℰℒ ontologies and tree size,
but for size the situation is not clear yet because we suspect that for ℰℒ proofs may actually get
exponentially large. To explain answers to temporal queries, we extend our derivers with new
inference schemes to deal with metric temporal operators, allowing us to lift some of our results
also to this setting. The full details can be found in a technical report [23], but we describe the
main ideas here.


2. Preliminaries
Proofs In our setting, a logic ℒ = (𝒮ℒ , |=ℒ ) consists of a set 𝒮ℒ of ℒ-sentences and a con-
sequence relation |=ℒ ⊆ 𝑃 (𝒮ℒ ) × 𝒮ℒ between ℒ-theories (subsets of ℒ-sentences) and single
ℒ-sentences; we usually write only |= instead of |=ℒ . We assume that the size |𝜂| of an ℒ-
sentence 𝜂 is defined in some way, e.g. by the number of symbols in 𝜂. We require that ℒ is
monotonic, i.e. that 𝒯 |= 𝜂 implies 𝒯 ′ |= 𝜂 for all 𝒯 ′ ⊇ 𝒯 . For example, ℒ could be first-order
logic or some DL.
   As in [9, 10, 11], we view proofs as directed hypergraphs (see the appendix for details).

Definition 1 (Derivation Structure). A derivation structure 𝒟 = (𝑉, 𝐸, ℓ) over a theory 𝒰 is a
directed, labeled hypergraph that is

    • grounded, i.e. every leaf 𝑣 in 𝒟 is labeled by ℓ(𝑣) ∈ 𝒰; and

    • sound, i.e. for every hyperedge (𝑆, 𝑑) ∈ 𝐸, the entailment {ℓ(𝑠) | 𝑠 ∈ 𝑆} |= ℓ(𝑑) holds.
  We call hyperedges (𝑆, 𝑑) ∈ 𝐸 inferences or inference steps, with 𝑆 being the premises and 𝑑
the conclusion, and may write them like
                          𝑝       𝑝→𝑞                     𝑝                           𝑝→𝑞
                                 𝑞            or
                                                                         𝑞

Proofs are special derivation structures that derive a goal sentence.

Definition 2 (Proof). Given a sentence 𝜂 and a theory 𝒰, a proof of 𝒰 |= 𝜂 is a finite derivation
structure 𝒫 = (𝑉, 𝐸, ℓ) over 𝒰 such that

        • 𝒫 contains exactly one sink 𝑣𝜂 ∈ 𝑉 , which is labeled by 𝜂,

        • 𝒫 is acyclic, and

        • every vertex has at most one incoming hyperedge, i.e. there exist no two hyperedges
          (𝑆1 , 𝑣), (𝑆2 , 𝑣) ∈ 𝐸 with 𝑆1 ̸= 𝑆2 .

A tree proof is a proof that is a tree. A subproof 𝑆 of a hypergraph 𝐻 is a subgraph of 𝐻 that is
a proof with leaf(𝑆) ⊆ leaf(𝐻).

   To compute proofs, we assume that there is some reasoning system or calculus that defines
a derivation structure for a given entailment 𝜂, and the structure may contain several proofs
for that entailment. Formally, a deriver D for a logic ℒ takes as input an ℒ-theory 𝒰 and an
ℒ-sentence 𝜂, and returns a (possibly infinite) derivation structure D(𝒰, 𝜂) over 𝒰 that describes
all inference steps that D could perform in order to derive 𝜂 from 𝒰. This derivation structure
is not necessarily computed explicitly, but can be accessed through an oracle (which checks,
for example, whether an inference conforms to the underlying calculus). The task of finding a
good proof then corresponds to finding a (finite) proof that can be homomorphically mapped
into this derivation structure and which is minimal according to some measure of proof quality.
We consider two such measures here: the size of a proof 𝒫 = (𝑉, 𝐸, ℓ) is ms (𝒫) := |𝑉 |,1 and
the tree size mt (𝒫) is the size of a tree unraveling of 𝒫 [11]. The depth of 𝒫 is the length of the
longest path from a leaf to the sink (see appendix).

DLs and Existential Rules We assume that the reader is familiar with DLs, in particular
DL-Lite𝑅 [24] and ℰℒ [25], where theories 𝒰 = 𝒯 ∪ 𝒜 are called ontologies or knowledge bases
and are composed of a TBox 𝒯 and an ABox 𝒜. Many DL ontologies can be equivalently
expressed using the formalism of existential rules [22]. Existential rules are first-order sentences
of the form ∀𝑦           ⃗ , ⃗𝑧 ) → ∃𝑢
              ⃗ , ⃗𝑧 . 𝜓(𝑦           ⃗ . 𝜒(𝑧
                                           ⃗ , ⃗𝑢), with the body 𝜓(𝑦
                                                                    ⃗ , ⃗𝑧 ) and the head 𝜒(𝑧
                                                                                            ⃗ , ⃗𝑢) being
conjunctions of atoms of the form 𝐴(𝑥) or 𝑃 (𝑥1 , 𝑥2 ), for a concept name 𝐴, role name 𝑃 and
terms 𝑥, 𝑥1 and 𝑥2 , which are individual names or variables from ⃗𝑧 , ⃗𝑢 and ⃗𝑦 . We usually omit the
universal quantification. Notable DLs that can be equivalently expressed as sets of existential
rules are ℰℒ, Horn-𝒮ℛℐ𝒬 and DL-Lite𝑅 .


1
    Since every vertex has at most one incoming hyperedge, the size of 𝐸 is at most quadratic in |𝑉 |.
Conjunctive Queries In this paper, we want to construct proofs for ontology-mediated
conjunctive query entailments. A conjunctive query (CQ) q(𝑥                  ⃗ ) is an expression of the form
∃𝑦
 ⃗ . 𝜑(𝑥
       ⃗ , ⃗𝑦 ), where 𝜑(𝑥   ⃗ , ⃗𝑦 ) is a conjunction of atoms using answer variables ⃗𝑥 and existentially
quantified variables ⃗𝑦 . If ⃗𝑥 = (), then q(𝑥     ⃗ ) is called Boolean. ABox assertions are a special case
of Boolean CQs with only one atom and no variables. A tuple ⃗𝑎 of individual names from 𝒜 is a
certain answer to q(𝑥      ⃗ ) over 𝒯 ∪ 𝒜, in symbols 𝒯 ∪ 𝒜 |= q(𝑎        ⃗ ), if, for any model of 𝒯 ∪ 𝒜, the
sentence q(𝑎     ⃗ ) is true in this model. Any CQ q(𝑥        ⃗ ) = ∃𝑦
                                                                     ⃗ . 𝜑(𝑥
                                                                           ⃗ , ⃗𝑦 ) is associated with the set of
atoms in 𝜑, so we can write e.g. 𝐴(𝑧) ∈ q(𝑥            ⃗ ).

Example 1. For the following DL-Lite𝑅 ontology and query, we have 𝒯 ∪ {𝐵(b)} |= q(b).

        𝒯 = {𝐴 ⊑ ∃𝑅,              ∃𝑅− ⊑ ∃𝑇,             𝐵 ⊑ ∃𝑃,           ∃𝑃 − ⊑ ∃𝑆,           𝑃 ⊑ 𝑅− }
    q(𝑦 ′′ ) = ∃𝑥, 𝑥′ , 𝑥′′ , 𝑦, 𝑦 ′ , 𝑧, 𝑧 ′ . 𝑅(𝑥, 𝑦) ∧ 𝑇 (𝑦, 𝑧) ∧ 𝑇 (𝑦 ′ , 𝑧) ∧ 𝑅(𝑥′ , 𝑦 ′ ) ∧ 𝑆(𝑥′ , 𝑧 ′ )
                                                                                ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (𝑦 ′′ , 𝑥′′ ).

In the next section, we explore different ways to explain this inference (see Figures 2 and 4).


3. Derivation Structures for Certain Answers
In the following, let 𝒯 ∪ 𝒜 be a knowledge base in some DL ℒ, q a conjunctive query, and ⃗𝑎 a
certain answer, i.e. 𝒯 ∪ 𝒜 |= q(𝑎
                                ⃗ ), which we want to explain. We can use derivation structures
over ℒcq (the extension of ℒ with all Boolean CQs) to explain query answers. For example, the
following derivation step involving the ontology from Example 1 is a sound inference:
                                                  𝐵(b)    𝒯
                                                     q(b)
   However, to define a derivation structure that yields proofs suitable for explanations to users,
inferences that only make small deduction steps are more valuable. For this purpose, we define
derivers that capture which inference steps are admitted. For TBox entailment, in [9, 10, 11], we
considered derivers based on the inference schemas used by a consequence-based reasoner. To
obtain proofs for CQ entailment, we follow the ideas of chase procedures that replace atoms in
CQs by other atoms by “applying” rules to them [26, 22, 24, 18]. We will introduce two derivers
that represent different paradigms of what constitutes a proof.

3.1. The CQ Deriver
Similarly to the approach used in [19, 20], inferences in our first deriver, Dcq , always produce
Boolean CQs. This deriver is defined for DLs that can be expressed using existential rules. An in-
ference step is obtained by matching the left-hand side of a rule to part of a CQ and then replacing
it by the right-hand side. For example, starting from ∃𝑧. 𝑃 (b, 𝑧) and 𝑃 (𝑥, 𝑦) → 𝑅(𝑦, 𝑥), we can
apply the substitution {𝑥 ↦→ b, 𝑦 ↦→ 𝑧} to obtain ∃𝑧. 𝑅(𝑧, b). Additionally, we allow to keep any
of the replaced atoms from the original CQ, e.g. to produce the conclusion ∃𝑧. 𝑃 (b, 𝑧) ∧ 𝑅(𝑧, b).
A second type of inference allows one to combine two Boolean CQs using conjunction. To
duplicate variables, we additionally introduce tautological rules such as 𝑃 (𝑥, 𝑧) → ∃𝑧 ′ . 𝑃 (𝑥, 𝑧 ′ ),
   ∃𝑥
    ⃗ . 𝜑(𝑥
          ⃗)         ⃗ , ⃗𝑧 ) → ∃𝑢
                   𝜓(𝑦           ⃗ . 𝜒(𝑧
                                       ⃗ , ⃗𝑢)                      ∃𝑥
                                                                     ⃗ . 𝜑(𝑥
                                                                           ⃗)        ∃𝑦
                                                                                      ⃗ . 𝜓(𝑦
                                                                                            ⃗)
                                               (MP)                         ′
                                                                                                    (C)
                   ∃𝑤
                    ⃗ .𝜌(𝑤 ⃗)                                         ∃𝑥                 ⃗ ′)
                                                                                 ⃗ ) ∧ 𝜓(𝑦
                                                                       ⃗ , ⃗𝑦 .𝜑(𝑥

                                           (T)                            ∃𝑥
                                                                           ⃗ . 𝜑(𝑥  ⃗ , ⃗𝑎)
              ⃗ , ⃗𝑦 ) → ∃𝑥
            𝜑(𝑥           ⃗ . 𝜑(𝑥
                                ⃗ , ⃗𝑦 )                                                      (E)
                                                                         ∃𝑥
                                                                          ⃗ , ⃗𝑦 . 𝜑(𝑥
                                                                                     ⃗ , ⃗𝑦 )

Figure 1: Inference schemas for Dcq . (MP) and (T) refer to modus ponens and tautology.


which yields ∃𝑧, 𝑧 ′ . 𝑃 (b, 𝑧) ∧ 𝑃 (b, 𝑧 ′ ) when combined with ∃𝑧. 𝑃 (b, 𝑧). Finally, we use an
inference schema that allows us to replace constants by variables, e.g. to capture that ∃𝑧. 𝑃 (b, 𝑧)
implies ∃𝑥, 𝑧. 𝑃 (𝑥, 𝑧).
   The detailed inference schemas can be found in Figure 1. (MP) is admissible only if there
exists a substitution 𝜋 such that 𝜋(𝜓(𝑦   ⃗ , ⃗𝑧 )) ⊆ 𝜑(𝑥
                                                        ⃗ ), and then 𝜌(𝑤  ⃗ ) is the result of replacing any
subset of 𝜋(𝜓(𝑦 ⃗ , ⃗𝑧 )) in 𝜑(𝑥
                               ⃗ ) by any subset of 𝜋(𝜒(𝑧   ⃗ , ⃗𝑢′ )), where the variables ⃗𝑢 are renamed
into new existentially quantified variables ⃗𝑢′ to ensure that they are disjoint with ⃗𝑥. In (C),
we again rename the variables ⃗𝑦 to ⃗𝑦 ′ to avoid overlap with ⃗𝑥. Since every ABox assertion
corresponds to a ground CQ, this inference also allows one to collect ABox assertions into a
single CQ. (T) introduces an existential rule that allows us, together with (MP), to create copies
of variables in CQs (see Fig. 2). Finally, (E) transforms individual names in some positions into
existentially quantified variables.
Definition 3 (CQ Deriver). Dcq (𝒯 ∪ 𝒜, q(𝑎  ⃗ )) is a derivation structure over 𝒯 ∪ 𝒜 with vertices
labeled by the axioms in 𝒯 ∪ 𝒜 and all Boolean CQs over the signature of 𝒯 ∪ 𝒜, and its
hyperedges represent all possible instances of (MP), (C), (T), and (E) over these vertices. An
(admissible) proof in Dcq (𝒯 ∪ 𝒜, q(𝑎
                                    ⃗ )) is a proof of 𝒯 ∪ 𝒜 |= q(𝑎  ⃗ ) that has a homomorphism
into this derivation structure.
  It is easy to check that the inferences used by Dcq are sound. Moreover, we can show that
they are complete, i.e. that any CQ entailed by 𝒜 ∪ 𝒯 has a proof in Dcq (𝒯 ∪ 𝒜, q(𝑎⃗ )) (see
Lemma 5). A proof for Example 1 w.r.t. Dcq is depicted in Figure 2.

3.2. Skolemized Derivation Structure
To explain a Boolean CQ, using a derivation structure that works on CQs seems natural. However,
a downside is that we have to “collect” quantified variables along the proof and label vertices with
complex expressions. Since the inference rules apply on sub-expressions, it may be challenging
to understand on which part of the CQ an inference is performed—indeed, finding a match for the
body o a rule in a CQ is NP-hard. The problem is that we cannot separate inference steps on the
same variable without affecting soundness, as the existential quantification only applies locally
in the current CQ. To follow our example: 𝑥′′ and 𝑧 ′ in Figure 2 are connected to each other and to
the constant b, and thus have to be kept together: although ∃𝑥′′ , 𝑧 ′ .𝑃 (b, 𝑥′′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) implies
∃𝑥′′ .𝑃 (b, 𝑥′′ ) and ∃𝑥′′ , 𝑧 ′ .𝑆(𝑥′′ , 𝑧 ′ ), those two CQs do not imply the original CQ anymore. To
overcome these issues, we consider a second type of deriver that relies on Skolemization, and is
inspired by the approach from [18].
                                                                (MP)
                                        𝐵(b)                                            𝐵 ⊑ ∃𝑃

                                                      ∃𝑥′′ . 𝑃 (b, 𝑥′′ )                   (MP)
                                                                                                              ∃𝑃 − ⊑ ∃𝑆

                    𝑃 ⊑ 𝑅−
                                                     (MP)
                                                                           ∃𝑥′′ , 𝑧 ′ 𝑃 (b, 𝑥′′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ )

                             ∃𝑥′′ , 𝑧 ′ . 𝑅(𝑥′′ , b) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (b, 𝑥′′ )                     (MP)
                                                                                                                    ∃𝑅− ⊑ ∃𝑇


                                 (T)
                                                                 ∃𝑥′′ , 𝑧, 𝑧 ′ . 𝑅(𝑥′′ , b) ∧ 𝑇 (b, 𝑧) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (b, 𝑥′′ )
   𝑅(𝑥, 𝑦) ∧ 𝑇 (𝑦, 𝑧) → ∃𝑦 . 𝑅(𝑥, 𝑦 ′ ) ∧ 𝑇 (𝑦 ′ , 𝑧)
                                   ′
                                                                                                         (MP)


                                 (T)               ∃𝑥 , 𝑦 , 𝑧, 𝑧 . 𝑅(𝑥 , b) ∧ 𝑇 (b, 𝑧) ∧ 𝑇 (𝑦 ′ , 𝑧) ∧ 𝑅(𝑥′′ , 𝑦 ′ ) ∧ . . .
                                                       ′′   ′          ′         ′′

                     𝑆(𝑥, 𝑧) → 𝑆(𝑥, 𝑧)                                                                   (MP)

                                       (T)                                                              ...
   𝑅(𝑥′′ , 𝑦 ′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) → ∃𝑥′ . 𝑅(𝑥′ , 𝑦 ′ ) ∧ 𝑆(𝑥′ , 𝑧 ′ )
                                                                                                         (MP)

                                        (T)
                     𝑅(𝑥′′ , 𝑦) → ∃𝑥. 𝑅(𝑥, 𝑦)                      (MP)                                 ...


     ∃𝑥, 𝑥′ , 𝑥′′ , 𝑦 ′ , 𝑧, 𝑧 ′ . 𝑅(𝑥, b) ∧ 𝑇 (b, 𝑧) ∧ 𝑇 (𝑦 ′ , 𝑧) ∧ 𝑅(𝑥′ , 𝑦 ′ ) ∧ 𝑆(𝑥′ , 𝑧 ′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (b, 𝑥′′ )
                                                                           (E)
    ∃𝑥, 𝑥′ , 𝑥′′ , 𝑦, 𝑦 ′ , 𝑧, 𝑧 ′ . 𝑅(𝑥, 𝑦) ∧ 𝑇 (𝑦, 𝑧) ∧ 𝑇 (𝑦 ′ , 𝑧) ∧ 𝑅(𝑥′ , 𝑦 ′ ) ∧ 𝑆(𝑥′ , 𝑧 ′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (b, 𝑥′′ )

Figure 2: A CQ proof for Example 1 (inferences (E) and (T) are delayed to the last steps)


   This deriver, Dsk , mainly operates on ground CQs, and requires the theory to be Skolemized.
This means that it cannot contain existential quantification, it may however contain function
symbols. To Skolemize existential rules, for each existentially quantified variable a fresh function
symbol is introduced; for the CI ∃𝑃 − ⊑ ∃𝑆 this results in 𝑃 (𝑥, 𝑦) → 𝑆(𝑦, 𝑔(𝑦)), where 𝑔 is
a unary function symbol whose argument denotes the dependency on the variable 𝑦 shared
between the body and head of the rule. Let 𝒯 𝑠 be the set of Skolemized rules resulting from this
transformation and note that the entailments 𝒯 ∪ 𝒜 |= q(𝑎             ⃗ ) and 𝒯 𝑠 ∪ 𝒜 |= q(𝑎⃗ ) are equivalent
for CQs q(𝑥     ⃗ ) that do not use function symbols. Our deriver internally considers two kinds of
formulas: 1) CQs that may use function symbols and 2) rules of the form ∀𝑥                   ⃗ .𝜑(𝑥⃗ ) → 𝜓(𝑥 ⃗ ),
where 𝜓(𝑥     ⃗ ) may now contain function terms, but no further quantified variables. Since we
are only interested in CQs that are entailed by 𝒯 𝑠 ∪ 𝒜, we can assume w.l.o.g. that this
entailment can be shown solely using domain elements denoted by ground terms, e.g. 𝑓 (𝑓 (a)),
which allows us to eliminate variables from most of the inferences. For example, instead of
∃𝑥′′ , 𝑧 ′ . 𝑃 (b, 𝑥′′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) in Figure 2 we now use 𝑃 (b, 𝑓 (b)) ∧ 𝑆(𝑓 (b), 𝑔(𝑓 (b))). Since these
atoms do not share variables, in our derivation structure we mainly need to consider inferences
on single atoms, which allows for more fine-grained proofs (see Figure 4). Only at the end we
need to compose atoms to obtain a CQ.
   The simplified inference schemas are shown in Figure 3. In (MPs ), 𝛼𝑖 (𝑡          ⃗𝑖 ) and 𝛽(𝑠⃗ ) are ground
atoms with terms composed from individual names and Skolem functions, and likewise 𝜒(𝑧                        ⃗)
                              ⃗1 )
                          𝛼1 (𝑡      ...         ⃗𝑛 ) 𝜓(𝑦
                                             𝛼𝑛 (𝑡       ⃗ , ⃗𝑧 ) → 𝜒(𝑧
                                                                      ⃗)
                                                                          (MPs )
                                                𝛽(𝑠⃗)
                              ⃗1 ) . . . 𝛼𝑛 (𝑡
                          𝛼1 (𝑡                  ⃗𝑛 )                  ⃗)
                                                                     𝜑(𝑡
                                                      (Cs )                 (Es )
                              ⃗                  ⃗
                          𝛼1 (𝑡1 ) ∧ · · · ∧ 𝛼𝑛 (𝑡𝑛 )              ∃𝑥
                                                                    ⃗ .𝜑(𝑥
                                                                         ⃗)

Figure 3: Inference schemas for Dsk .


may contain Skolem functions; similar to (MP), we require that there is a substitution 𝜋 such
that 𝜋(𝜓(𝑦                 ⃗1 ), . . . , 𝛼𝑛 (𝑡
          ⃗ , ⃗𝑧 )) = {𝛼1 (𝑡                 ⃗𝑛 )} and 𝛽(𝑠         ⃗ )). In (Es ), ⃗𝑡 is now a vector of
                                                         ⃗ ) ∈ 𝜋(𝜒(𝑧
ground terms which may contain function symbols. Since (MPs ) works only with ground
atoms, (Cs ) and (Es ) can now only be used at the end of a proof to obtain the desired CQ (see
Figure 4). Moreover, we do not need a version of (T) here since it would be trivial for ground
atoms. Its effects in Dcq can be simulated here due to the fact that the same atom can be used
several times as a premise for (MPs ) or (Cs ).

Definition 4 (Skolemized Deriver). The derivation structure Dsk (𝒯 𝑠 ∪ 𝒜, q(𝑎    ⃗ )) is defined
similarly to Definition 3, but using 𝒯 𝑠 and the inference schemas (MPs ), (Cs ) and (Es ).

   Though different presentations with different advantages and disadvantages, it is not hard to
translate proofs based on Dsk into proofs in Dcq and vice versa.

Lemma 5. Any proof 𝒫 in Dcq (𝒯 ∪𝒜, q(𝑎   ⃗ )) can be transformed into a proof in Dsk (𝒯 𝑠 ∪𝒜, q(𝑎
                                                                                                ⃗ ))
in time polynomial in the sizes of 𝒫 and 𝒯 , and conversely any proof 𝒫 in Dsk (𝒯 𝑠 ∪ 𝒜, q(𝑎    ⃗ ))
can be transformed into a proof in Dcq (𝒯 ∪ 𝒜, q(𝑎 ⃗ )) in time polynomial in the sizes of 𝒫 and 𝒯 .
The latter also holds for tree proofs.

  However, it is not the case that minimal proofs are equivalent for these two derivers, i.e. a
minimal proof may become non-minimal after the transformation.
  This lemma also shows that our derivation structures are complete, i.e. if 𝒯 ∪ 𝒜 |= q(𝑎  ⃗ ) holds,
then we can provide a proof for it. To see this, consider the minimal Herbrand model 𝐻 of 𝒯 𝑠 ∪𝒜,
which can be computed using the (Skolem) chase procedure for existential rules—essentially,
applying the rules step-by-step to obtain new ground atoms, in a way very similar to (MPs ).
This model is a universal model for CQ answering over 𝒯 ∪ 𝒜, which means that 𝒯 ∪ 𝒜 |= q(𝑎        ⃗)
implies 𝐻 |= q(𝑎⃗ ), which, in turn, means that there must be a proof in Dsk (𝒯 𝑠 ∪ 𝒜, q(𝑎 ⃗ )), and
hence by Lemma 5 also one in Dcq (𝒯 ∪ 𝒜, q(𝑎      ⃗ )). For convenience, we assume in the following
that TBoxes are silently Skolemized when constructing derivation structures using Dsk , that is,
we identify Dsk (𝒯 ∪ 𝒜, q(𝑎 ⃗ )) with Dsk (𝒯 𝑠 ∪ 𝒜, q(𝑎   ⃗ )).


4. The Complexity of Finding Good Proofs
It is our intution that proofs in Dsk are more comprehensible than in Dcq because of its simpler
labels. Moreover, we assume small proofs (w.r.t. size ms or tree size mt ) to be more comprehen-
sible than large ones (but one can certainly also consider other measures [10, 11]). Therefore,
                                                                          (MPs )
                                                    𝐵(b)                                      𝐵 ⊑ ∃𝑃

                                                   (MPs )                                       (MPs )
                           𝑃 ⊑ 𝑅−                                      𝑃 (b, 𝑓 (b))                                ∃𝑃 − ⊑ ∃𝑆

                            (MPs )
   ∃𝑅− ⊑ ∃𝑇                                     𝑅(𝑓 (b), b)                              𝑆(𝑓 (b), 𝑔(𝑓 (b)))

                         𝑇 (b, ℎ(b))


                                                                  (Cs )

   𝑅(𝑓 (b), b) ∧ 𝑇 (b, ℎ(b)) ∧ 𝑇 (b, ℎ(b)) ∧ 𝑅(𝑓 (b), b) ∧ 𝑆(𝑓 (b), 𝑔(𝑓 (b))) ∧ 𝑆(𝑓 (b), 𝑔(𝑓 (b))) ∧ 𝑃 (b, 𝑓 (b))
                                                                  (Es )

    ∃𝑥, 𝑥′ , 𝑥′′ , 𝑦, 𝑦 ′ , 𝑧, 𝑧 ′ . 𝑅(𝑥, 𝑦) ∧ 𝑇 (𝑦, 𝑧) ∧ 𝑇 (𝑦 ′ , 𝑧) ∧ 𝑅(𝑥′ , 𝑦 ′ ) ∧ 𝑆(𝑥′ , 𝑧 ′ ) ∧ 𝑆(𝑥′′ , 𝑧 ′ ) ∧ 𝑃 (b, 𝑥′′ )

Figure 4: A Skolemized proof for Example 1


we now study the complexity of finding small proofs automatically (which is independent of the
comprehensibility of the resulting proofs). More precisely, we are interested in the following
decision problem OPx (ℒ, m) for a deriver Dx ∈ {Dcq , Dsk }, a DL ℒ ∈ {ℰℒ, DL-Lite𝑅 }, and
a measure m ∈ {ms , mt }: given an ℒ-KB 𝒯 ∪ 𝒜, a query q(𝑥        ⃗ ) with certain answer ⃗𝑎, and a
natural number 𝑛 (in binary encoding), is there a proof 𝒫 for q(𝑎      ⃗ ) in Dx (𝒯 ∪ 𝒜, q(𝑎⃗ )) with
m(𝒫) ≤ 𝑛? To better distinguish the complexity of finding small proofs from that of query
answering, we assume 𝒯 ∪ 𝒜 |= q(𝑎    ⃗ ) as prerequisite, which fits the intuition that users request
an explanation only after they know that ⃗𝑎 is a certain answer. Lemma 7 in [11] shows that,
instead of looking for arbitrary proofs and homomorphisms into the derivation structure, one
can restrict the search to subproofs of Dx (𝒯 ∪ 𝒜, q(𝑎   ⃗ )), which we will often do implicitly.
   It is common in the context of OMQA to distinguish between data complexity, where only
the data varies, and combined complexity, where also the influence of the other inputs is taken
into account. This raises the question whether the bound 𝑛 is seen as part of the input or not.
It turns out that fixing 𝑛 trivializes the data complexity, because then 𝑛 also fixes the set of
relevant ABoxes modulo isomorphism.

Theorem 6. For a constant bound 𝑛, OPx (ℒ, m) is in AC0 in data complexity.

   One may argue that, since the size of the proof depends on 𝒜, the bound 𝑛 on the proof size
should be considered part of the data as well. Under this assumption, our decision problem is not
necessarily in AC0 anymore. For example, consider the ℰℒ TBox {∃𝑟.𝐴 ⊑ 𝐴} and 𝑞(𝑥) ← 𝐴(𝑥).
For every 𝑛, there is an ABox 𝒜 such that 𝐴(𝑎) is entailed by a seqeuence of 𝑛 role assertions,
and thus needs a proof of size at least 𝑛. Deciding whether this query admits a bounded proof is
thus as hard as deciding whether it admits an answer at all in 𝒜, i.e. P-hard [27]. However, we at
least stay in AC0 for DLs over which CQs are rewritable, e.g. DL-Lite𝑅 [24], because the number
of (non-isomorphic) proofs that we need to consider is bounded by the size of the rewriting,
which is constant in data complexity.

Theorem 7. If all CQs are UCQ-rewritable over ℒ-TBoxes, then OPx (ℒ, m) is in AC0 in data
complexity.
   We now consider the combined complexity. In [9, 11], we established general upper bounds
for finding proofs of bounded size. These results depend only on the size of the derivation
structure obtained for the given input. Both Dcq and Dsk may produce derivation structures of
infinite size, as Dcq contains CQs of arbitrary size, and Dsk also has Skolem terms of arbitrary
nesting depth. However, we can sometimes bound the number of relevant Skolem terms in Dsk
by considering only the part of the minimal Herbrand model 𝐻 that is necessary to satisfy
the query q(𝑎  ⃗ ). For example, in logics with the polynomial witness property [28], including
DL-Lite𝑅 , we know that any query that is entailed is already satisfied after polynomially many
chase steps used to construct 𝐻. In particular, this means that the nesting depth of Skolem
terms in a proof is bounded polynomially (in the size of the TBox and the query), and hence the
part of Dsk (𝒯 𝑠 ∪ 𝒜, q(𝑎 ⃗ )) that we need to search for a (small) proof is bounded exponentially.
For such structures, our results from [9, 11] give us a NExpTime-upper bound for size, and a
PSpace-upper bound for tree size, upon which we can improve with the following lemma.

Lemma 8. There is a polynomial 𝑝 such that for any DL-Lite𝑅 KB 𝒯 ∪ 𝒜, CQ q(𝑥            ⃗ ), and certain
answer ⃗𝑎, there is a proof in Dsk (𝒯 ∪ 𝒜, q(𝑎
                                             ⃗ )) of tree size at most 𝑝(|𝒯 |, |q(𝑥
                                                                                  ⃗ )|).

   A direct consequence of Lemmas 5 and 8 is the upper bound in the following theorem.
The lower bound can be shown by a reduction from Boolean query entailment over DL-Lite𝑅
ontologies: for this, we extend the KB in a given query answering problem by axioms that
trivially entail the query, but only yield proofs larger than 𝑛.

Theorem 9. OPx (DL-Lite𝑅 , m) is NP-complete.

   To obtain tractability, we can restrict the shape of the query. Recall that the Gaifman graph
of a query q is the undirected graph using the terms of q as nodes and has an edge between
terms occurring together in an atom. A query is tree-shaped if its Gaifman graph is a tree.

Theorem 10. Given a DL-Lite𝑅 KB 𝒯 ∪ 𝒜 and a tree-shaped CQ q(𝑥       ⃗ ) with certain answer ⃗𝑎,
one can compute in polynomial time a proof of minimal tree size in Dsk (𝒯 ∪ 𝒜, q(𝑎⃗ )).

  The central property used in the proof of Theorem 10 is that for tree size every atom in q(𝑎
                                                                                             ⃗)
has a separate proof, even if two atoms are proven in the same way. To avoid this redundancy,
one could think about modifying (Es ) slightly:

                  𝜑(𝑡⃗)                                                    ′ ⃗ )𝜎 = 𝜑(𝑡
                     ′    (E′s ) , provided there exists 𝜎 : ⃗𝑥 → ⃗𝑡 s.t. 𝜑 (𝑥        ⃗)
                ∃𝑥
                 ⃗ .𝜑 (𝑥
                       ⃗)
Denote the resulting deriver by D′sk . Using (E′s ), we can derive ∃𝑥, 𝑦. 𝐴(𝑥) ∧ 𝐴(𝑦) from 𝐴(𝑎);
with (Es ), the premise would need to be 𝐴(𝑎) ∧ 𝐴(𝑎). However, this modification is already
sufficient to make our problem NP-hard for tree-shaped queries, even without a TBox. The same
problem arises in Dcq (where atoms can be duplicated using (T)), and if we consider ms .

Theorem 11. For tree-shaped CQs, OP′x (ℒ, mt ) is NP-hard. The same holds for OPsk (ℒ, ms ) and
OPcq (ℒ, mt ).
Table 1
Semantics of (Boolean) MTCQs for I = (ΔI , (ℐ𝑖 )𝑖∈Z ) and 𝑖 ∈ Z.
                   𝜑           I, 𝑖 |= 𝜑 iff
                   CQ 𝜓        ℐ𝑖 |= 𝜓
                   ⊤           true
                   𝜑∧𝜓         I, 𝑖 |= 𝜑 and I, 𝑖 |= 𝜓
                   𝜑∨𝜓         I, 𝑖 |= 𝜑 or I, 𝑖 |= 𝜓
                   ⊞𝐼 𝜑        ∀𝑘 ∈ 𝐼 such that I, 𝑖 + 𝑘 |= 𝜑
                   ⊟𝐼 𝜑        ∀𝑘 ∈ 𝐼 such that I, 𝑖 − 𝑘 |= 𝜑
                   𝜑 𝒰𝐼 𝜓      ∃𝑘 ∈ 𝐼 such that I, 𝑖+𝑘 |= 𝜓 and ∀𝑗 : 0 ≤ 𝑗 < 𝑘 : I, 𝑖+𝑗 |= 𝜑
                   𝜑 𝒮𝐼 𝜓      ∃𝑘 ∈ 𝐼 such that I, 𝑖−𝑘 |= 𝜓 and ∀𝑗 : 0 ≤ 𝑗 < 𝑘 : I, 𝑖−𝑗 |= 𝜑


5. Metric Temporal CQs
We now consider proofs for temporal query answering. In this setting, TBox axioms hold
globally, i.e. at all time points, the ABox contains information about the state of the world in
different time intervals, and the query contains (metric) temporal operators.
   An interval 𝜄 is a nonempty subset of Z of the form [𝑡1 , 𝑡2 ], where 𝑡1 , 𝑡2 ∈ Z ∪ {∞} and
𝑡1 ≤ 𝑡2 (for simplicity, we write [∞, 𝑡2 ] for (−∞, 𝑡2 ] and [𝑡1 , ∞] instead of [𝑡1 , ∞));2 𝑡1 and
𝑡2 are encoded in binary. A temporal ABox 𝒜 is a finite set of facts of the form 𝐴(𝑎)@𝜄 or
𝑃 (𝑎, 𝑏)@𝜄, where 𝐴(𝑎) and 𝑃 (𝑎, 𝑏) are assertions and 𝜄 is an interval. The fact 𝐴(𝑎)@𝜄 states
that 𝐴(𝑎) holds throughout the interval 𝜄. We denote by tem(𝒜) the multiset of intervals that
occur in 𝒜 and |tem(𝒜)| is the sum of their lengths. A temporal interpretation I = (ΔI , (ℐ𝑖 )𝑖∈Z ),
is a collection of DL interpretations ℐ𝑖 = (ΔI , ·ℐ𝑖 ), 𝑖 ∈ Z, over ΔI . I satisfies a TBox axiom 𝛼 if
each ℐ𝑖 , 𝑖 ∈ Z, satisfies 𝛼, and it satisfies a temporal assertion 𝛼@𝜄 if each ℐ𝑖 , 𝑖 ∈ 𝜄, satisfies 𝛼.
   We use the finite-range positive version of metric temporal conjunctive queries (MTCQs)
introduced in [29, 30], combining CQs with MTL operators [31, 32, 33].
Definition 12. An MTCQ is of the form q(𝑥
                                        ⃗ , 𝑤) = 𝜑(𝑥
                                                   ⃗ )@𝑤, where 𝜑 is built according to

                          𝜑 ::= 𝜓 | ⊤ | 𝜑 ∧ 𝜑 | 𝜑 ∨ 𝜑 | ⊟𝐼 𝜑 | ⊞𝐼 𝜑 | 𝜑 𝒰𝐼 𝜑 | 𝜑 𝒮𝐼 𝜑,

with 𝑤 an interval variable, 𝜓 a CQ, 𝐼 a finite interval with non-negative endpoints, and ⃗𝑥 the
                                                       ⃗ , 𝑤) over 𝒯 ∪ 𝒜 is a pair (𝑎
 free variables of all CQs in 𝜑. A certain answer to q(𝑥                            ⃗ , 𝜄) such that
⃗𝑎 ⊆ ind(𝒜), 𝜄 is an interval and, for any 𝑡 ∈ 𝜄 and any model I of 𝒯 ∪ 𝒜, we have I, 𝑡 |= 𝜑(𝑎    ⃗)
 according to Table 1. We denote this as 𝒯 ∪ 𝒜 |= q(𝑎  ⃗ , 𝜄).
  For temporal extensions of Definitions 3 and 4, we will interpret 𝐴 ⊑ 𝐴′ now as the global
temporal rule 𝐴(𝑥) → 𝐴′ (𝑥) holding in any possible interval.
                               (∃𝑥
                                 ⃗ . 𝜑(𝑥
                                       ⃗ ))@𝜄          ⃗ , ⃗𝑧 ) → ∃𝑢
                                                     𝜓(𝑦           ⃗ . 𝜒(𝑧
                                                                         ⃗ , ⃗𝑢)
                                                                                 (TMP)
                                                (∃𝑤
                                                  ⃗ .𝜌(𝑤
                                                       ⃗ ))@𝜄
  Similarly, we need temporal versions of (C) and (E), where all CQs are annotated with the
same interval variable. In addition, we need an inference for disjunctive MTCQS:
2
    This allows us to avoid considering special cases in the interval arithmetic below.
                                            𝜑(𝑥
                                              ⃗ )@𝜄
                                                         (DISJ)
                                          ⃗ ) ∨ 𝜓(𝑦
                                       (𝜑(𝑥       ⃗ ))@𝜄
  To provide a proof for a temporal query, we need to be able to coalesce, i.e. merge intervals:
                       ∃𝑥
                        ⃗ 1 . 𝜑(𝑥
                                ⃗ 1 )@𝜄1      ...     ∃𝑥
                                                       ⃗ 𝑛 . 𝜑(𝑥
                                                               ⃗ 𝑛 )@𝜄𝑛
                                                  ⋃︀𝑛                   (COAL)
                                     (∃𝑥
                                       ⃗ . 𝜑(𝑥
                                             ⃗ ))@ 𝑖=1 𝜄𝑖
where 𝑠𝑖=1 𝜄𝑖 is a single interval and 𝜑(𝑥
      ⋃︀
                                         ⃗ 1 ), . . . , 𝜑(𝑥
                                                          ⃗ 𝑛 ) are identical up to variable renaming.
On the other hand, we also need an inverse operation to shrink intervals:
                                          ∃𝑥
                                           ⃗ . 𝜑(𝑥
                                                 ⃗ )@𝜄
                                                        (SEP)
                                          ∃𝑥     ⃗ )@𝜄′
                                           ⃗ . 𝜑(𝑥
where 𝜄′ ⊆ 𝜄. Both inferences are needed to infer all intervals 𝜄 with 𝒯 ∪ 𝒜 |= ∃𝑥⃗ . 𝜑(𝑥
                                                                                        ⃗ )@𝜄.
  Finally, we need inferences for the temporal operators, where for 𝒰[𝑟1 ,𝑟2 ] we only consider
the case where 𝑟1 > 0 since 𝜑 𝒰[0,𝑟2 ] 𝜓 is equivalent to 𝜓 ∨ (𝜑 𝒰[1,𝑟2 ] 𝜓):

                  𝜑(𝑥⃗ )@[𝑡1 , 𝑡2 ]                         𝜑(𝑥 ⃗ )@𝜄         ⃗ )@𝜄′
                                                                            𝜓(𝑦
                                            (⊞)                                                 ( 𝒰)
                   ⃗ )@[𝑡1 − 𝑟1 , 𝑡2 − 𝑟2 ]
      ⊞[𝑟1 ,𝑟2 ] 𝜑(𝑥                                  ⃗ ) 𝒰[𝑟1 ,𝑟2 ] 𝜓(𝑦
                                                    𝜑(𝑥                ⃗ )@(𝜈 − [𝑟1 , 𝑟2 ]) ∩ 𝜄

where 𝜈 := (𝜄 + 1) ∩ 𝜄′ (all time points where 𝜓-s are immediately preceded by 𝜑-s) and
[𝑤1 , 𝑤2 ] − [𝑟1 , 𝑟2 ] := [𝑤1 − 𝑟2 , 𝑤2 − 𝑟1 ], and none of the involved intervals should be empty.
Inferences for ⊟ and 𝒮 are similar. We denote the resulting deriver by Dtcq . A Skolemized
variant Dtsk can be defined similarly with temporalized versions of (MPs ), (Cs ), and (Es ). We
can now lift Theorems 7 and 9 to this setting.
Theorem 13. If CQ answering in ℒ is UCQ-rewritable, then MTCQ answering is also UCQ-
rewritable and OPtx (ℒ, m) is in AC0 in data complexity. Moreover, OPtx (DL-Lite𝑅 , m) is NP-
complete. Let D ∈ {Dtcq , Dtsk }. Then, it is NP-complete to decide whether, given a DL-Lite𝑅 TBox
𝒯 , a temporal ABox 𝒜, q(𝑎⃗ , 𝜄) s.t. 𝒯 ∪ 𝒜 |= q(𝑎   ⃗ , 𝜄), and 𝑛 in unary or binary encoding, there
exists a proof in D(𝒯 ∪ 𝒜, q(𝑎⃗ , 𝜄)) of (tree) size at most 𝑛.


6. Conclusion
We started to explore a framework for proofs of answers to conjunctive queries. In the future,
we want to extend our complexity results to other DLs, and our framework to DLs that cannot
be translated to existential rules. Other interesting research questions include derivers that
combine TBox and query entailment rules, e.g. Dcq plus the rules of the ELK reasoner [34].
Instead of proofs, one could also try to show a canonical model to a user in order to explain
query answers. For explaining missing answers, we also want to continue investigating how to
find (optimal) counter-interpretations or abduction results [12].


Acknowledgments
This work was supported by DFG in grant 389792660, TRR 248 (https://perspicuous-computing.
science), and QuantLA, GRK 1763 (https://lat.inf.tu-dresden.de/quantla).
References
 [1] D. L. McGuinness, Explaining Reasoning in Description Logics, Ph.D. thesis, Rutgers
     University, NJ, USA, 1996. doi:10.7282/t3-q0c6-5305.
 [2] A. Borgida, E. Franconi, I. Horrocks, Explaining 𝒜ℒ𝒞 subsumption, in: ECAI 2000,
     Proceedings of the 14th European Conference on Artificial Intelligence, 2000, pp. 209–213.
     URL: http://www.frontiersinai.com/ecai/ecai2000/pdf/p0209.pdf.
 [3] S. Schlobach, R. Cornet, Non-standard reasoning services for the debugging of description
     logic terminologies., in: G. Gottlob, T. Walsh (Eds.), Proc. of the 18th Int. Joint Conf.
     on Artificial Intelligence (IJCAI 2003), Morgan Kaufmann, 2003, pp. 355–362. URL: http:
     //ijcai.org/Proceedings/03/Papers/053.pdf.
 [4] F. Baader, R. Peñaloza, B. Suntisrivaraporn, Pinpointing in the description logic ℰℒ+ , in:
     KI 2007: Advances in Artificial Intelligence, 30th Annual German Conference on AI, KI
     2007, 2007, pp. 52–67. doi:10.1007/978-3-540-74565-5_7.
 [5] R. Peñaloza, Axiom-Pinpointing in Description Logics and Beyond, Ph.D. thesis, Technis-
     che Universität Dresden, Germany, 2009. URL: https://nbn-resolving.org/urn:nbn:de:bsz:
     14-qucosa-24743.
 [6] M. Horridge, Justification Based Explanation in Ontologies, Ph.D. thesis, University of
     Manchester, UK, 2011. URL: https://www.research.manchester.ac.uk/portal/files/54511395/
     FULL_TEXT.PDF.
 [7] M. Horridge, B. Parsia, U. Sattler, Justification oriented proofs in OWL, in: The Semantic
     Web - ISWC 2010 - 9th International Semantic Web Conference, ISWC 2010, Part I, 2010,
     pp. 354–369. doi:10.1007/978-3-642-17746-0_23.
 [8] Y. Kazakov, P. Klinov, A. Stupnikov, Towards reusable explanation services in protege, in:
     A. Artale, B. Glimm, R. Kontchakov (Eds.), Proc. of the 30th Int. Workshop on Description
     Logics (DL’17), volume 1879 of CEUR Workshop Proceedings, 2017. URL: http://www.
     ceur-ws.org/Vol-1879/paper31.pdf.
 [9] C. Alrabbaa, F. Baader, S. Borgwardt, P. Koopmann, A. Kovtunova, Finding small proofs
     for description logic entailments: Theory and practice, in: E. Albert, L. Kovacs (Eds.),
     LPAR-23: 23rd International Conference on Logic for Programming, Artificial Intelligence
     and Reasoning, volume 73 of EPiC Series in Computing, EasyChair, 2020, pp. 32–67. doi:10.
     29007/nhpp.
[10] C. Alrabbaa, F. Baader, S. Borgwardt, P. Koopmann, A. Kovtunova, On the complexity of
     finding good proofs for description logic entailments, in: S. Borgwardt, T. Meyer (Eds.),
     Proceedings of the 33rd International Workshop on Description Logics (DL’20), volume
     2663 of CEUR Workshop Proceedings, 2020. URL: http://ceur-ws.org/Vol-2663/paper-1.pdf.
[11] C. Alrabbaa, F. Baader, S. Borgwardt, P. Koopmann, A. Kovtunova, Finding good proofs
     for description logic entailments using recursive quality measures, in: A. Platzer,
     G. Sutcliffe (Eds.), Proceedings of the 28th International Conference on Automated
     Deduction (CADE’21), volume 12699 of LNCS, 2021, pp. 291–308. doi:10.1007/
     978-3-030-79876-5_17.
[12] P. Koopmann, Signature-based abduction with fresh individuals and complex concepts
     for description logics, in: Z. Zhou (Ed.), Proceedings of the Thirtieth International Joint
     Conference on Artificial Intelligence, IJCAI 2021, ijcai.org, 2021, pp. 1929–1935. doi:10.
     24963/ijcai.2021/266.
[13] İ. İ. Ceylan, T. Lukasiewicz, E. Malizia, C. Molinaro, A. Vaicenavicius, Explanations for
     negative query answers under existential rules, in: D. Calvanese, E. Erdem, M. Thielscher
     (Eds.), Proceedings of KR 2020, AAAI Press, 2020, pp. 223–232. URL: https://doi.org/10.
     24963/kr.2020/23. doi:10.24963/kr.2020/23.
[14] D. Calvanese, M. Ortiz, M. Simkus, G. Stefanoni, The complexity of explaining negative
     query answers in dl-lite, in: G. Brewka, T. Eiter, S. A. McIlraith (Eds.), Principles of
     Knowledge Representation and Reasoning: Proceedings of the Thirteenth International
     Conference, KR 2012, AAAI Press, 2012. URL: http://www.aaai.org/ocs/index.php/KR/
     KR12/paper/view/4537.
[15] C. Alrabbaa, W. Hieke, A. Turhan, Counter model transformation for explaining non-
     subsumption in EL, in: C. Beierle, M. Ragni, F. Stolzenburg, M. Thimm (Eds.), Proceedings
     of the 7th Workshop on Formal and Cognitive Reasoning, volume 2961 of CEUR Workshop
     Proceedings, CEUR-WS.org, 2021, pp. 9–22. URL: http://ceur-ws.org/Vol-2961/paper_2.pdf.
[16] D. Calvanese, D. Lanti, A. Ozaki, R. Peñaloza, G. Xiao, Enriching ontology-based data
     access with provenance, in: S. Kraus (Ed.), Proceedings of the Twenty-Eighth International
     Joint Conference on Artificial Intelligence, IJCAI 2019, ijcai.org, 2019, pp. 1616–1623.
     doi:10.24963/ijcai.2019/224.
[17] C. Bourgaux, A. Ozaki, R. Peñaloza, L. Predoiu, Provenance for the description logic elhr,
     in: C. Bessiere (Ed.), Proceedings of the Twenty-Ninth International Joint Conference on
     Artificial Intelligence, IJCAI 2020, ijcai.org, 2020, pp. 1862–1869. doi:10.24963/ijcai.
     2020/258.
[18] A. Borgida, D. Calvanese, M. Rodriguez-Muro, Explanation in the DL-Lite family of
     description logics, in: R. Meersman, Z. Tari (Eds.), On the Move to Meaningful Internet
     Systems: OTM 2008, volume 5332 of Lecture Notes in Computer Science, Springer, 2008, pp.
     1440–1457. doi:10.1007/978-3-540-88873-4_35.
[19] G. Stefanoni, Explaining query answers in lightweight ontologies, Diploma thesis, Tech-
     nische Universität Wien, 2011. URL: http://www.cs.ox.ac.uk/files/7942/thesis.pdf.
[20] F. Croce, M. Lenzerini, A framework for explaining query answers in DL-Lite, in: C. Faron-
     Zucker, C. Ghidini, A. Napoli, Y. Toussaint (Eds.), Knowledge Engineering and Knowledge
     Management - 21st International Conference, EKAW 2018, volume 11313 of Lecture Notes
     in Computer Science, Springer, 2018, pp. 83–97. doi:10.1007/978-3-030-03667-6_6.
[21] C. Alrabbaa, F. Baader, R. Dachselt, T. Flemisch, P. Koopmann, Visualising proofs and the
     modular structure of ontologies to support ontology repair, in: S. Borgwardt, T. Meyer
     (Eds.), Proceedings of the 33rd International Workshop on Description Logics (DL 2020),
     volume 2663 of CEUR Workshop Proceedings, CEUR-WS.org, 2020. URL: http://ceur-ws.org/
     Vol-2663/paper-2.pdf.
[22] A. Calì, G. Gottlob, T. Lukasiewicz, A general datalog-based framework for tractable query
     answering over ontologies, J. Web Semant. 14 (2012) 57–83. doi:10.1016/j.websem.
     2012.03.001.
[23] C. Alrabbaa, S. Borgwardt, P. Koopmann, A. Kovtunova, Finding good proofs for answers
     to conjunctive queries mediated by lightweight ontologies (technical report), 2022. URL:
     https://arxiv.org/abs/2206.09758. doi:10.48550/ARXIV.2206.09758.
[24] D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, R. Rosati, Tractable reasoning
     and efficient query answering in description logics: The DL-Lite family, J. of Automated
     Reasoning 39 (2007) 385–429. doi:10.1007/s10817-007-9078-x.
[25] F. Baader, Terminological cycles in a description logic with existential restrictions, in:
     G. Gottlob, T. Walsh (Eds.), IJCAI-03, Proceedings of the Eighteenth International Joint
     Conference on Artificial Intelligence, Morgan Kaufmann, 2003, pp. 325–330. URL: http:
     //ijcai.org/Proceedings/03/Papers/048.pdf.
[26] R. Fagin, P. G. Kolaitis, R. J. Miller, L. Popa, Data exchange: semantics and query answering,
     Theor. Comput. Sci. 336 (2005) 89–124. doi:10.1016/j.tcs.2004.10.033.
[27] R. Rosati, On conjunctive query answering in EL, in: D. Calvanese, E. Franconi, V. Haarslev,
     D. Lembo, B. Motik, A. Turhan, S. Tessaris (Eds.), Proceedings of the 2007 International
     Workshop on Description Logics (DL2007), volume 250 of CEUR Workshop Proceedings,
     CEUR-WS.org, 2007. URL: http://ceur-ws.org/Vol-250/paper_83.pdf.
[28] G. Gottlob, S. Kikot, R. Kontchakov, V. V. Podolskii, T. Schwentick, M. Zakharyaschev,
     The price of query rewriting in ontology-based data access, Artif. Intell. 213 (2014) 42–59.
     doi:10.1016/j.artint.2014.04.004.
[29] S. Borgwardt, W. Forkel, A. Kovtunova, Finding new diamonds: Temporal minimal-
     world query answering over sparse ABoxes, in: Proc. of the 3rd Int. Joint Conf. on
     Rules and Reasoning, RuleML+RR 2019, volume 11784 of LNCS, Springer, 2019, pp. 3–18.
     doi:10.1007/978-3-030-31095-0_1.
[30] S. Borgwardt, W. Forkel, A. Kovtunova, Temporal minimal-world query answering over
     sparse ABoxes, Theory Pract. Log. Program. 22 (2022) 193–228. URL: https://doi.org/10.
     1017/S1471068421000119. doi:10.1017/S1471068421000119.
[31] R. Alur, T. A. Henzinger, A really temporal logic, J. ACM 41 (1994) 181–204. doi:10.1145/
     174644.174651.
[32] V. Gutiérrez-Basulto, J. C. Jung, A. Ozaki, On metric temporal description logics, in: Proc.
     ECAI, IOS Press, 2016, pp. 837–845. doi:10.3233/978-1-61499-672-9-837.
[33] F. Baader, S. Borgwardt, P. Koopmann, A. Ozaki, V. Thost, Metric temporal description
     logics with interval-rigid names, in: Proc. of the 11th Int. Symp. on Frontiers of Combining
     Systems (FroCoS’17), Springer, 2017, pp. 60–76. doi:10.1007/978-3-319-66167-4_4.
[34] Y. Kazakov, M. Krötzsch, F. Simancik, The incredible ELK - From polynomial procedures
     to efficient reasoning with ℰℒ ontologies, J. Autom. Reason. 53 (2014) 1–61. doi:10.1007/
     s10817-013-9296-3.