<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Axiom Pinpointing is Hard</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Rafael Pen~aloza</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bar s Sertkaya?</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Theoretical Computer Science TU Dresden</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We investigate the complexity of several decision, enumeration and counting problems in axiom pinpointing in Description Logics. We prove hardness results that already hold for the propositional Horn fragment. We show that for this fragment, unless p = np, all minimal subsets of a given TBox that have a given consequence, i.e. MinAs, cannot be enumerated in a speci ed lexicographic order with polynomial delay. Moreover, we show that recognizing the set of all MinAs is at least as hard as recognizing the set of all minimal transversals of a given hypergraph, however whether this problem is intractable remains open. We also show that checking the existence of a MinA that does not contain any of the given sets of axioms, as well as checking the existence of a MinA that contains a speci ed axiom are both np-hard. In addition we show that counting all MinAs and counting the MinAs that contain a certain axiom are both #p-hard.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Introduction
As the number and the size of DL-based ontologies grow, tools that support
knowledge engineers in improving and maintaining the quality of these ontologies
become more important. In real world applications often the knowledge engineer
not only wants to know whether her knowledge base has a certain (unwanted)
consequence or not, but also wants to know the reasons of this consequence.
Even for knowledge bases of moderate size, nding explanations for a given
consequence is not an easy task without getting support from an automated tool.
The task of nding explanations for a given consequence, i.e., nding minimal
subsets of the original knowledge base that have the given consequence is called
axiom pinpointing in the literature.</p>
      <p>
        Axiom pinpointing has been considered by several authors. In [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] Schlobach
and Cornet have described an extension of the tableau-based satis ability
algorithm for ALC that given an ALC knowledge base and a consequence of this
knowledge base computes all minimal subsets of the original knowledge base that
have the given consequence. The same problem had been addressed earlier in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
in the context of extending DLs by default rules. Later in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] Parsia et al. have
extended the approach in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] to more expressive DLs, and in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] Lee et al. have
extended the approach in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] to ALC knowledge bases with GCIs. In contrast to
? Supported by the German Research Foundation (DFG) under grant BA 1122/12-1
these works on axiom pinpointing in expressive DLs, in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] Baader et al. have
addressed axiom pinpointing in the less expressive DL E L+. They have investigated
the complexity of nding minimal subsets of a given E L+ TBox (there called
MinAs) that have a given consequence. They have shown that checking the
existence of a MinA within a speci ed cardinality bound is np-complete. Moreover,
they have shown that a given consequence can have exponentially many MinAs
in a given E L+ TBox. Given this fact, it is de nitely not possible to compute
all MinAs in polynomial time. Baader et al. have considered this problem in a
setting where the TBox has a so-called static part which is always present, and
a refutable part from which MinAs are computed. They have shown that in this
setting computing all MinAs cannot be performed in output polynomial time
unless p = np. In fact, all these hardness results are shown for the propositional
Horn fragment (there called HL), that allows for only conjunction and GCIs.
      </p>
      <p>
        In the present paper we investigate the complexity of several decision,
enumeration and counting problems related to MinAs. Following [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we show our
hardness results for the logic HL. First we examine whether MinAs can be
enumerated in a speci ed lexicographic order with polynomial delay. It turns
out that even for HL already generating the lexicographically rst MinA is
intractable, thus unless p = np, MinAs cannot be enumerated in a speci ed
lexicographic order with polynomial delay. Next we examine whether all MinAs can
be computed in output polynomial time without the static part considered in
the setting in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. We show that for this setting recognizing the set of all MinAs
is at least as hard as recognizing the set of all minimal transversals of a given
hypergraph, which is a prominent open problem [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] in hypergraph theory [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
However, whether this problem is intractable remains open. Next we investigate
the problem of existence of a MinA that does not contain any of the given sets
of axioms. This problem, which can be useful in practical applications where one
wants to avoid certain combinations of axioms in the MinAs, turns out to be
np-hard. We also consider the problem of checking the existence of a MinA that
contains a speci ed axiom. We show that this problem is also np-hard. Finally,
we show that both counting all MinAs and counting the MinAs that contain a
certain axiom are #p-hard problems.
2
      </p>
      <p>Complexity of computing all MinAs
In the present section we examine the complexity of computing all MinAs. The
hardness results we show in this work actually already hold for the propositional
Horn fragment HL. For this reason, from now on we are going to formulate our
problems for HL TBoxes. Note that HL is a sublogic of E L, which implies that
the lower bounds for the complexity found for HL also hold for E L and E L+.
We start with formally de ning some basic notions.</p>
      <p>De nition 1. Let T be an HL TBox and C and D be two HL concept
descriptions such that C vT D. We call a set T 0 T a minimal axiom set or MinA
for T w.r.t. C v D if C vT 0 D and it is minimal w.r.t. set inclusion. In this
case we also say that T ' explains C v D.</p>
      <p>Based on this, our problem is formulated as follows:</p>
    </sec>
    <sec id="sec-2">
      <title>Problem: mina-enum</title>
      <p>Input: An HL TBox T and a GCI C v D such that T j= C v D.
Output: The set of all MinAs for T w.r.t. C v D.
2.1</p>
      <sec id="sec-2-1">
        <title>Enumerability with polynomial delay</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] it has been shown that there can be exponentially many MinAs for a given
HL TBox w.r.t. a given consequence of this TBox. Given this fact, it is de nitely
not possible to solve mina-enum in polynomial time. In complexity theory, for
analyzing the performance of enumeration algorithms where the number of
solutions can be exponential in the size of the input, one considers other measures.
One such measure is the time the algorithm spends between two consequent
solutions. An algorithm is said to run with polynomial delay [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] if the time
until the rst solution is generated, and thereafter the time between any two
consecutive solutions is bounded by a polynomial in the size of the input.
        </p>
        <p>In the following we investigate whether it is possible to enumerate all MinAs
in a speci ed lexicographic order with polynomial delay. The lexicographic order
we will use is de ned as follows:
De nition 2. Let the elements of a set S be linearly ordered. This order induces
another linear strict order on P(S), which is called the lexicographic order. We
say that a set R S is lexicographically smaller than a set T S where R 6= T
if the rst element at which they disagree is in R.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Problem: first-mina</title>
      <p>Input: An HL TBox T , a GCI C v D such that T j= C v D, a set S T such
that S j= C v D, and a linear order on T .</p>
      <p>Question: Is S the rst MinA w.r.t. the lexicographic order induced by the given
linear order?</p>
      <sec id="sec-3-1">
        <title>Theorem 1. first-mina is conp-complete.</title>
        <p>Proof. The problem is in conp since if S is not the lexicographically rst MinA,
a proof of this can be given by nondeterministically guessing a subset of T and
verifying in polynomial time that it is a MinA that is lexicographically smaller
than S.</p>
        <p>
          In order to show conp-hardness, we are going to give a reduction from the
problem of checking whether a given maximal independent set is the
lexicographically last maximal independent set of a given graph. Recall that a maximal
independent set of a graph G = (V; E ) is a subset V 0 V of the vertices such
that no two vertices in V 0 are joined by an edge in E , and each vertex in V n V 0
is joined by an edge to some vertex in V 0. This problem has been shown to be
conp-complete in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>T := fPv v</p>
        <p>l
v2E;E2E</p>
        <p>PE j v 2 V g [ f l PE v Ag;
S := fPv v</p>
        <p>l
v2E;E2E</p>
        <p>PE j v 2 (V n S)g [ f l PE v Ag:</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Problem: last maximal independent set (last-mis)</title>
      <p>Input: A graph G = (V; E), a maximal independent set S V , and a linear order
on V .</p>
      <p>Question: Is S the last maximal independent set w.r.t. the lexicographic order
induced by the given linear order?</p>
      <p>Let an instance of last-mis be given with the graph G = (V; E) and the
maximal independent set S. From G and S we construct an instance of
firstmina as follows: For every v 2 V we introduce a concept name Pv, for every edge
E 2 E we introduce a concept name PE, and nally one more concept name A.
Using these we construct the TBox
E2E</p>
      <p>E2E
and the GCI dv2V Pv v A that obviously follows from T . Additionally, by
using S we construct a set S T as:
Note that T contains exactly jV j + 1 axioms. We order these axioms such that
an axiom with premise Pv comes before the axiom with premise Pv0 if and only
if the vertex v comes before the vertex v0 in the originally given linear order on
V . Finally we place the axiom dE2E PE v A as the last one. It is easy to see
that this construction indeed creates an instance of first-mina and the TBox
T , as well as the MinA S and the GCI dv2V Pv v A can be created in time
polynomial in the sizes of G and S. We claim that S is lexicographically the last
maximal independent set if and only if S is the lexicographically rst MinA.</p>
      <p>()) Assume S is the lexicographically last maximal independent set. Then
V n S contains at least one vertex from every edge (i.e., it is a vertex cover), since
otherwise S would not be an independent set. Thus every PE, for E 2 E, appears
on the righthand side of at least one axiom in S. That is dv2V Pv vS dE2E PE,
thus dv2V Pv vS A, i.e., S explains this axiom. Since S is maximal, V n S is
minimal, and thus S is minimal, i.e., S is a MinA. Moreover, it is the
lexicographically rst one since S is the lexicographically last maximal independent
set.</p>
      <p>(() Assume S is the lexicographically rst MinA. Then every PE, for E 2 E,
appears on the righthand side of at least one axiom in S, since otherwise it would
not be an explanation. That is, V nS contains at least one vertex from every edge.
Then S contains at most one vertex from every edge, i.e., it is an independent
set. Since S is minimal V n S is minimal, and thus S is maximal, i.e., S is a
maximal independent set. Moreover it is the lexicographically last one since S
is the lexicographically rst MinA. 2</p>
      <p>Theorem 1 has the following consequence: Since generating the
lexicographically rst MinA is already intractable, unless p = np, all MinAs cannot be
enumerated in lexicographic order with polynomial delay.</p>
      <p>Corollary 1. Unless p = np, there is no algorithm that enumerates MinAs in
a given lexicographic order with polynomial delay.</p>
      <p>Interestingly, it is easy to generate the lexicographically last MinA for a given
TBox w.r.t. a given GCI. We start with the axioms in the speci ed order, and
remove an axiom if the resulting set of axioms still explains the given GCI. Once
we have scanned all axioms in this way, the resulting set of axioms is a MinA, and
it is the lexicographically last one. For a TBox of size n, i. e. having n axioms,
this operation requires at most n subsumption tests each of which can be done
in polynomial time for E L and E L+. Nonetheless, it is still unclear whether this
fact can be used to enumerate all MinAs in the reverse lexicographical order.
2.2</p>
      <sec id="sec-4-1">
        <title>Computability in output polynomial time</title>
        <p>
          One other measure for analyzing the performance of enumeration algorithms
is to take into account not only the size of the input, but also the size of the
output. An algorithm is said to run in output polynomial time (or polynomial total
time) [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] if it outputs all solutions in time polynomial in the size of the input
and the output. One advantage of an output polynomial algorithm is that it runs
in polynomial time (in the size of the input) when there are only polynomially
many solutions.
        </p>
        <p>In the following we investigate whether mina-enum can be solved in output
polynomial time. For solving this enumeration problem, the following decision
problem has crucial importance:</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Problem: all-minas</title>
      <p>Input: An HL TBox T , a GCI C v D such that T j= C v D, and T
Question: Is T precisely the set of all MinAs of T w.r.t. C v D?
As Proposition 1 below shows, if this problem cannot be decided in polynomial
time, then unless p = np, mina-enum cannot be solved in output polynomial
time.</p>
      <p>Proposition 1. If all-minas cannot be decided in polynomial time, then unless
p = np, mina-enum cannot be solved in output-polynomial time.
Proof. Assume that we have an algorithm A that solves mina-enum in
outputpolynomial time. Let its runtime be bounded by a polynomial p(IS; OS) where
IS denotes the size of the input TBox and OS denotes the size of the output,
i.e., the set of all MinAs.</p>
      <p>In order to decide all-minas for an instance given by the TBox T , the GCI
C v D, and a set T P(T ), we construct another algorithm A0 that works
as follows: it runs A on T and C v D for at most p(jT j; jT j)-many steps. If A
terminates within this many steps, then A0 compares the output of A with T
and returns yes if and only if they are equal. If they are not equal, A0 returns
no. If A has not yet terminated after p(jT j; jT j)-many steps, this implies that
there is at least one MinA that is not contained in T , so A0 returns no. It is easy
to see that the runtime of A0 is bounded by a polynomial in jT j and jT j, that is
A0 decides all-minas in time polynomial in the size of the input. 2
P(T ).</p>
      <p>The proposition shows that determining the complexity of all-minas is
indeed crucial for determining the enumeration complexity of mina-enum. It is
not di cult to see that all-minas is in conp. Given an instance of all-minas,
a nondeterministic algorithm can guess a subset of T that is not in T , and in
polynomial time verify that this is a MinA, thus T is not the set of all MinAs.
In the following we show that all-minas is at least as hard as recognizing the
set of all minimal transversals of a given hypergraph. However, whether it is
conp-hard remains unfortunately open.</p>
      <p>
        First we brie y recall some notions of hypergraphs. A hypergraph H = (V; E)
consists of a set of vertices V = fvi j 1 i ng, and a set of (hyper)edges
E = fEj j 1 j mg where Ej V . Following the convention in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we assume
that the set of edges as well as the set of vertices is nonempty, and the union
of all edges yields the vertex set. A set W V is called a transversal of H if it
intersects all edges of H, i.e., 8E 2 E: E \W 6= ;. A transversal is called minimal
if no proper subset of it is a transversal. The set of all minimal transversals of
H constitute another hypergraph on V called the transversal hypergraph of H,
which is denoted by T r(H). Generating T r(H) is an important problem which
has applications in many elds of computer science (see e.g. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). The well-known
decision problem associated to this computation problem is de ned as follows:
      </p>
    </sec>
    <sec id="sec-6">
      <title>Problem: transversal hypergraph (trans-hyp)</title>
      <p>Input: Two hypergraphs H = (V; EH) and G = (V; EG).</p>
      <p>
        Question: Is G the transversal hypergraph of H, i.e., does T r(H) = G hold?
trans-hyp is known to be in conp, but so far neither a polynomial time
algorithm has been found, nor has it been proved to be conp-hard. In a landmark
paper [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] Fredman and Khachiyan proved that trans-hyp can be solved in
no(log n) time, which implies that this problem is most likely not conp-hard. It is
conjectured that this problem, together with several computationally equivalent
problems, forms a class properly contained between p and conp [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
    </sec>
    <sec id="sec-7">
      <title>Theorem 2. all-minas is trans-hyp-hard.</title>
      <p>Proof. Let an instance of trans-hyp be given by the hypergraphs H = (V; EH)
and G = (V; EG). From H and G we construct an instance of all-minas as
follows: for every vertex v 2 V we introduce a concept name Pv, for every edge
E 2 EH of H a concept name PE, and nally one additional concept name A.
For a set of vertices F V , we de ne the TBox</p>
      <p>TF := fPv v</p>
      <p>l
v2E;E2EH</p>
      <p>PE j v 2 F g [ f</p>
      <p>l PE v Ag:
E2EH
Using these we construct the TBox T := TV , a set of TBoxes T := fTF j F 2
EGg P(T ), and the GCI dv2V Pv v A that obviously follows from T .</p>
      <p>It is easy to see that this construction indeed creates an instance of
allminas, and the TBox T , as well as the set T and the GCI dv2V Pv v A can be
constructed in time polynomial in the sizes of H and G. We claim that G is the
transversal hypergraph of H if and only if T is precisely the set of all MinAs.
Note that d PE v A is the only axiom in T that has A in its right-hand
side; this impEl2ieEsHthat every MinA must contain this axiom. Hence, every MinA
is of the form TF for some F V . To prove our claim, it su ces to show that a
set of vertices F V is a minimal transversal of H if and only if the TBox TF
is a MinA.</p>
      <p>()) Assume that F is a minimal transversal of H. By de nition F satis es
F \ E 6= ; for every E 2 EH . Then TF is an axiom set explaining the above
GCI. Moreover, it is minimal since F is minimal. Thus, it is indeed a MinA for
T w.r.t. dv2V Pv v A.</p>
      <p>(() Now assume that TF is a MinA. Then for every PE (where E 2 EH)
there is at least one axiom in TF with lefthand side Pv (where v 2 F and F 2 EG)
such that Pv is subsumed by PE . That is F intersects every E 2 EH, i.e., F is
a transversal. Moreover, it is minimal since the original set of axioms TF is a
minimal axiom set.</p>
      <p>It is not di cult to see that from the above it follows that G is the transversal
hypergraph of H if and only if T := fTF j F 2 EG g is the set of all MinAs. 2
3</p>
      <p>Preferred and unwanted axioms
Next we consider the problem of computing MinAs that contain a preferred
axiom and MinAs that do not contain some unwanted combination of axioms.
First we investigate the problem of existence of a MinA that does not contain
any of the given sets of axioms. This problem can be useful in applications where
one wants to avoid certain combinations of axioms in the MinAs. It is de ned
as follows:</p>
    </sec>
    <sec id="sec-8">
      <title>Problem: add-mina</title>
      <p>Input: An HL TBox T , a GCI C v D such that T j= C v D, and T
Question: Is there a MinA T 0 that satis es S 6 T 0 for every S 2 T ?</p>
      <sec id="sec-8-1">
        <title>Theorem 3. add-mina is np-complete.</title>
        <p>Proof. The problem is clearly in np. A nondeterministic algorithm for solving it
rst guesses a T 0 T , then tests in polynomial time whether T 0 is a MinA that
does not contain any of the S in T .</p>
        <p>
          Hardness is shown by a reduction from the following np-hard problem [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]:
        </p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Problem: hypergraph 2-coloring</title>
      <p>Input: A hypergraph H = (V; E ).</p>
      <p>Question: Is H 2-colorable, i.e., is there a W
W \ E 6= ; and (V n W ) \ E 6= ;?
V such that for all E 2 E ,
Let an instance of hypergraph 2-coloring be given with the hypergraph
H = (V; E ). From H we construct an instance of add-mina as follows: just like
in the construction in the proof of Theorem 2, we introduce a concept name
Pv for every v 2 V , a concept name PE for every edge E 2 E , and nally one
more concept name A. We also construct exactly the same TBox T and the GCI
dv2V Pv v A constructed there. In addition to these, for an edge E 2 E we
de ne the set</p>
      <p>VE := fPv v</p>
      <p>l
v2F;F 2E</p>
      <p>PF j v 2 Eg [ f l PF v Ag;</p>
      <p>F 2E
and using this construct the set</p>
      <p>T := fVE j E 2 Eg:
It is easy to see that the TBox T , the GCI above and the set T can be constructed
in time polynomial in the size of H. We claim that H is 2-colorable if and only
if there is a MinA T 0 for T w.r.t. dv2V Pv v A such that T 0 satis es S 6 T 0 for
every S 2 T .</p>
      <p>()) Assume H is 2-colorable. Then there is a W V such that W \ E 6= ;
and (V n W ) \ E 6= ; for every E 2 E, i.e., both W and its complement are
transversals of H. Assume w.l.o.g. that W is minimal. Using W we de ne the
set TW := fPv v dv2E;E2E PE j v 2 W g [ fdE2E PE v Ag and claim that
this is the MinA we are looking for. Since W is a transversal every PE, for
E 2 E, appears on the righthand side of at least one axiom in TW . That is
dv2V Pv vTW dE2E PE, thus dv2V Pv vTW A, i.e., TW explains this axiom.
TW is minimal since W is minimal. Moreover, since V n W is also a transversal,
every edge E 2 E contains at least one vertex v that is not in W . Thus, every
VE 2 T contains at least one axiom Pv v dv2F;F 2E PF that is not in TW . That
is TW is a MinA that is not a superset of any VE 2 T .</p>
      <p>(() Assume there is a MinA T 0 that is not a superset of any VE 2 T . De ne
the set WT 0 := fv j Pv v dv2E;E2E PE 2 T 0g. Since T 0 explains the constructed
GCI, for every E 2 E it contains at least one axiom in whose righthandside
PE occurs. That is, WT 0 intersects every E 2 E. Since T 0 is not a superset of
any VE 2 T , every such VE contains at least one axiom that is not in T 0. This
means that every E 2 E contains at least one vertex that is not in WT 0 . That is,
V n WT 0 intersects every E 2 E. Thus we have shown that WT 0 is a 2-coloring
of H. 2</p>
      <p>Next we consider the dual problem, which is checking the existence of a MinA
that contains a certain axiom. This problem can be useful in applications where
one is interested in MinAs that contain a speci c axiom, and is known as the
relevance problem:</p>
    </sec>
    <sec id="sec-10">
      <title>Problem: mina-relevance</title>
      <p>Input: An HL TBox T , a GCI C v D such that T j= C v D, and an axiom
t 2 T .</p>
      <p>
        Question: Is there a MinA S of T w.r.t. C v D such that t 2 S?
If one could e ciently solve the relevance problem, then black-box techniques
for computing the set of all MinAs (see, e.g. [
        <xref ref-type="bibr" rid="ref20 ref3">3, 20</xref>
        ]) based on Reiter's Hitting
Set Tree algorithm [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] could bene t from Rymon's further optimizations [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
As we will show next, this problem is also np-hard.
      </p>
    </sec>
    <sec id="sec-11">
      <title>Theorem 4. mina-relevance is np-complete.</title>
      <p>
        Proof. The problem is clearly in np. A nondeterministic algorithm for solving it
rst guesses a subset of T , then tests in polynomial time whether it is a MinA
containing t. For showing hardness we are going to give a reduction from the
following np-complete problem in [
        <xref ref-type="bibr" rid="ref11 ref7">11, 7</xref>
        ]:
      </p>
    </sec>
    <sec id="sec-12">
      <title>Problem: horn-relevance</title>
      <p>Input: Two sets of propositional variables H and M , a set T of de nite Horn
clauses over H [ M , and a propositional variable p 2 H.</p>
      <p>Question: Is there a minimal H0 H such that H0 [ T j= M and p 2 H0?
Recall that a de nite Horn clause is a Horn clause with exactly one positive
literal, i.e., the elements of T are formulae of the form V ! f , where
G H [ M and f 2 H [ M . Given an instance of horgn2Grgelevance we
construct an instance of mina-relevance as follows: for every propositional
variable h 2 H we introduce a concept name Ph, for every propositional variable
m 2 M we introduce a concept name Pm, and nally two more fresh concept
names A and B. Using these we construct the TBox</p>
      <p>T := fA v Ph j h 2 Hg [ f l Pg v Pf j ^ g ! f 2 T g [ f l Pm v Bg
g2G g2G m2M
where G H [ M and f 2 H [ M . Using A and B we construct the GCI A v B.
This construction indeed creates an instance of mina-relevance and it can be
done in polynomial time. We claim that there is a minimal H0 H such that
H0 [ T j= M and p 2 H0 if and only if there is a MinA S of T w.r.t. A v B such
that A v Pp 2 S.</p>
      <p>()) Assume there is a minimal H0 H such that H0 [ T j= M and p 2 H0.
Using H0 de ne the set
TH0 := fA v Ph j h 2 H0g [ f l Pg v Pf j ^ g ! f 2 T g [ f l Pm v Bg:
g2G g2G m2M
TH0 explains A v B since H0 [ T j= M . It is minimal since H0 is minimal, and
A v Pp 2 TH0 since p 2 H0.</p>
      <p>(() Assume there is a MinA S such that A v Pp 2 S. Since it explains
A v B it contains the axiom d
dg2G Pg v Pf such that for everymm2M2PMm tvheBco,nccoenpttainnasmaexiPommsococfurtsheonfotrhme
righthand side of at least one such axiom. Additionally S contains axioms of the
form A v Ph such that A vS dm2M Pm. Then the set S = fh j A v Ph 2 Sg
satis es S [ T j= M . Moreover p 2 S since A v Pp 2 S, and S is minimal since
S is minimal. 2
4</p>
      <p>
        Counting MinAs
In applications where one is interested in computing all MinAs, it might also
be useful to know in advance how many of them exist. Next we consider this
counting problem. It turns out that this is hard for the counting complexity
class #p [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], which is de ned as the class of functions counting the number of
accepting paths of nondeterministic Turing machines.
      </p>
      <sec id="sec-12-1">
        <title>Problem: #mina</title>
        <p>Input: An HL TBox T and a GCI C v D such that T j= C v D.
Output: The number of all MinAs of T w.r.t. C v D.</p>
        <sec id="sec-12-1-1">
          <title>Theorem 5. #mina is #p-complete.</title>
          <p>Proof. The problem is clearly in #p. Given an HL TBox T , a GCI C v D that
follows from T and a candidate solution T 0 T , we can in polynomial time
verify that T 0 is a MinA.</p>
          <p>
            For showing #p-hardness, we are going to give a parsimonious reduction from
#hypergraph transversal, which is the problem of counting the minimal
transversals of a given hypergraph H. It has been shown to be #p-complete in
[
            <xref ref-type="bibr" rid="ref6">6</xref>
            ]. In the reduction, from a given hypergraph H we construct the same TBox
and the GCI constructed in the proof of Theorem 2. Note that this construction
establishes a bijection between the minimal transversals of H and MinAs of T
w.r.t. dv2V Pv v A. That is, a W V is a minimal transversal of H if and only
if the set T := fPv v dv2E;E2EH PE j v 2 W g [ fd PE v Ag de ned using
W is a MinA. That is, this construction preserves tEh2eEnHumber of solutions, i.e.,
it is parsimonious. 2
          </p>
          <p>
            Instead of counting all MinAs, one can also try to count the MinAs that
contain a speci c axiom. If we are trying to explain an unwanted consequence,
the solution of this counting problem will allow us to detect axioms that are
most likely to be faulty, i. e. those that appear in the most MinAs. This idea has
been proposed in [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ] as a heuristic for correcting an error while minimizing the
changes in the set of axioms.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-13">
      <title>Problem: #mina-relevance</title>
      <p>Input: An HL TBox T , a GCI C v D such that T j= C v D and an axiom
t 2 T .</p>
      <p>Output: The number of all MinAs of T w.r.t. C v D that contain t.</p>
    </sec>
    <sec id="sec-14">
      <title>Theorem 6. #mina-relevance is #p-complete.</title>
      <p>Proof. The problem is in #p since given an HL TBox T , a GCI C v D that
follows from T an axiom t 2 T and a candidate solution T 0 T , we can in
polynomial time verify that T 0 is a MinA and it contains t.</p>
      <p>We show #p-hardness by a parsimonious reduction from #mina, which has
been shown to be #p-complete above. Given an instance of #mina with the
TBox T and the GCI A v B we construct the TBox T 0 := T [ S0, where
S0 = fA v C; B u C v Dg, and C and D are two concept names not occurring
in T . It is not di cult to see that a set S T is a MinA for T w.r.t. A v B
if and only if S [ S0 is a MinA for T 0 w.r.t. A v D. Moreover, every MinA for
T 0 w.r.t. A v D contains the axioms in S0. Thus, there are exactly as many
MinAs for T w.r.t. A v B as there are for T 0 w.r.t. A v D containing the axiom
A v C. 2</p>
      <p>
        Concluding remarks and future work
We have analyzed the complexity of nding all the MinAs for a given HL TBox,
and some other related problems in axiom pinpointing. The bottom line of this
analysis is that axiom pinpointing is hard, even in this restricted setting, where
we can only express conjunctions and implications. Obviously, the hardness
results transfer to any logic more expressive than HL with GCIs. It is however
unclear whether better bounds can be obtained if we restrict our attention to
acyclic TBoxes, or to logics like DL-Lite [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] in which conjunction is not allowed
on the lefthand side of the axioms.
      </p>
      <p>
        One problem that remains open is the exact complexity of computing all
MinAs. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] it was shown that if we allow for an irrefutable portion of the
TBox, then there cannot be an algorithm that outputs all MinAs in output
polynomial time. Here we show that if all axioms are refutable then this problem
is at least as hard as generating the minimal transversals of a given hypergraph.
However, whether it is intractable remains open. As future work, we are going to
work on determining whether this problem is intractable or it is computationally
equivalent to generating minimal transversals.
      </p>
      <p>Another branch for future work refers to e ciently nding approximate
solutions for the hard problems presented here. For instance nding an
approximation to the total number of MinAs allows us at least to get an idea on how
problematic is a given (unwanted) consequence. Alternatively, a set of axioms
that explains a given consequence but is not necessarily minimal, can be helpful
for a better understanding of the given consequence.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Hollunder</surname>
          </string-name>
          .
          <article-title>Embedding defaults into terminological representation systems</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          ,
          <volume>14</volume>
          :
          <fpage>149</fpage>
          {
          <fpage>180</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          , R. Pen~aloza, and
          <string-name>
            <given-names>B.</given-names>
            <surname>Suntisrivaraporn</surname>
          </string-name>
          .
          <article-title>Pinpointing in the description logic EL+</article-title>
          . In J. Hertzberg,
          <string-name>
            <given-names>M.</given-names>
            <surname>Beetz</surname>
          </string-name>
          , and R. Englert, editors,
          <source>Proceedings of the 30th German Conference on Arti cial Intelligence (KI2007)</source>
          , volume
          <volume>4667</volume>
          <source>of Lecture Notes in Arti cial Intelligence</source>
          , pages
          <fpage>52</fpage>
          {
          <fpage>67</fpage>
          . Springer-Verlag,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Suntisrivaraporn</surname>
          </string-name>
          .
          <article-title>Debugging SNOMED CT using axiom pinpointing in the description logic EL+</article-title>
          .
          <source>In Proceedings of the International Conference on Representing and Sharing Knowledge Using SNOMED (KR-MED'08)</source>
          , Phoenix, Arizona,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>C.</given-names>
            <surname>Berge</surname>
          </string-name>
          . Hypergraphs. Elsevier Science Publishers
          <string-name>
            <surname>B.V.</surname>
          </string-name>
          (North Holland),
          <year>1989</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. D.</given-names>
            <surname>Giacomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Vetere.</surname>
          </string-name>
          DL-Lite:
          <article-title>Practical reasoning for rich DLs</article-title>
          .
          <source>In Proceedings of the 2004 Description Logic Workshop (DL</source>
          <year>2004</year>
          ), volume
          <volume>104</volume>
          <source>of CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>A.</given-names>
            <surname>Durand</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Hermann</surname>
          </string-name>
          .
          <article-title>On the counting complexity of propositional circumscription</article-title>
          .
          <source>Information Processing Letters</source>
          ,
          <volume>106</volume>
          (
          <issue>4</issue>
          ):
          <volume>164</volume>
          {
          <fpage>170</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          and
          <string-name>
            <surname>G. Gottlob.</surname>
          </string-name>
          <article-title>The complexity of logic-based abduction</article-title>
          .
          <source>Journal of the ACM</source>
          ,
          <volume>42</volume>
          (
          <issue>1</issue>
          ):3{
          <fpage>42</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob.</surname>
          </string-name>
          <article-title>Identifying the minimal transversals of a hypergraph and related problems</article-title>
          .
          <source>SIAM Journal on Computing</source>
          ,
          <volume>24</volume>
          (
          <issue>6</issue>
          ):
          <volume>1278</volume>
          {
          <fpage>1304</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          and
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob</surname>
          </string-name>
          .
          <article-title>Hypergraph transversal computation and related problems in logic and AI</article-title>
          . In S. Flesca,
          <string-name>
            <given-names>S.</given-names>
            <surname>Greco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Leone</surname>
          </string-name>
          , and G. Ianni, editors,
          <source>Proceedings of the European Conference on Logics in Arti cial Intelligence (JELIA</source>
          <year>2002</year>
          ), volume
          <volume>2424</volume>
          of Lecture Notes in Computer Science, pages
          <volume>549</volume>
          {
          <fpage>564</fpage>
          .
          <string-name>
            <surname>SpringerVerlag</surname>
          </string-name>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>M. L. Fredman</surname>
            and
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Khachiyan</surname>
          </string-name>
          .
          <article-title>On the complexity of dualization of monotone disjunctive normal forms</article-title>
          .
          <source>Journal of Algorithms</source>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          ):
          <volume>618</volume>
          {
          <fpage>628</fpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. G. Friedrich, G. Gottlob, and
          <string-name>
            <given-names>W.</given-names>
            <surname>Nejdl</surname>
          </string-name>
          .
          <article-title>Hypothesis classi cation, abductive diagnosis and therapy</article-title>
          . In G. Gottlob and W. Nejdl, editors,
          <source>Proceedings of the International Workshop on Expert Systems in Engineering, Principles and Applications</source>
          , volume
          <volume>462</volume>
          of Lecture Notes in Computer Science, pages
          <volume>69</volume>
          {
          <fpage>78</fpage>
          ,
          <string-name>
            <surname>Viena</surname>
          </string-name>
          , Austria,
          <year>1990</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>M. R. Garey</surname>
            and
            <given-names>D. S.</given-names>
          </string-name>
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          . Computers and
          <article-title>Intractability; A Guide to the Theory of NP-Completeness</article-title>
          .
          <string-name>
            <given-names>W. H.</given-names>
            <surname>Freeman</surname>
          </string-name>
          &amp; Company, New York, NY, USA,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. D. S. Johnson, M. Yannakakis, and
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Papadimitriou</surname>
          </string-name>
          .
          <article-title>On generating all maximal independent sets</article-title>
          .
          <source>Information Processing Letters</source>
          ,
          <volume>27</volume>
          (
          <issue>3</issue>
          ):
          <volume>119</volume>
          {
          <fpage>123</fpage>
          ,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          , T. Meyer, and
          <string-name>
            <given-names>J. Z.</given-names>
            <surname>Pan</surname>
          </string-name>
          .
          <article-title>Computing maximally satis able terminologies for the description logic ALC with GCIs</article-title>
          . In B. Parsia,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          , and D. Toman, editors,
          <source>Proceedings of the International Workshop on Description Logics (DL'06)</source>
          , Lake District, UK,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>B.</given-names>
            <surname>Parsia</surname>
          </string-name>
          , E. Sirin,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kalyanpur</surname>
          </string-name>
          .
          <article-title>Debugging OWL ontologies</article-title>
          . In A. Ellis and T. Hagino, editors,
          <source>Proceedings of the 14th international conference on World Wide Web (WWW</source>
          <year>2005</year>
          ), pages
          <fpage>633</fpage>
          {
          <fpage>640</fpage>
          . ACM,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>R.</given-names>
            <surname>Reiter</surname>
          </string-name>
          .
          <article-title>A theory of diagnosis from rst principles</article-title>
          .
          <source>Arti cial Intelligence</source>
          ,
          <volume>32</volume>
          (
          <issue>1</issue>
          ):
          <volume>57</volume>
          {
          <fpage>95</fpage>
          ,
          <year>1987</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>R.</given-names>
            <surname>Rymon</surname>
          </string-name>
          .
          <article-title>Search through systematic set enumeration</article-title>
          . In B. Nebel, C. Rich, and W. Swartout, editors,
          <source>Proceedings of the International Conference on the Principles of Knowledge Representation and Reasoning (KR'92)</source>
          , pages
          <fpage>539</fpage>
          {
          <fpage>550</fpage>
          , Cambridge, MA, USA,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>S.</given-names>
            <surname>Schlobach</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Cornet</surname>
          </string-name>
          .
          <article-title>Non-standard reasoning services for the debugging of description logic terminologies</article-title>
          . In G. Gottlob and T. Walsh, editors,
          <source>Proceedings of the Eighteenth International Joint Conference on Arti cial Intelligence (IJCAI'03)</source>
          , pages
          <fpage>355</fpage>
          {
          <fpage>362</fpage>
          . Morgan Kaufmann,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>S.</given-names>
            <surname>Schlobach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cornet</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Harmelen</surname>
          </string-name>
          .
          <article-title>Debugging incoherent terminologies</article-title>
          .
          <source>Journal of Automated Reasoning</source>
          ,
          <volume>39</volume>
          (
          <issue>3</issue>
          ):
          <volume>317</volume>
          {
          <fpage>349</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>B.</given-names>
            <surname>Suntisrivaraporn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ji</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          .
          <article-title>A modularization-based approach to nding all justi cations for OWL DL entailments</article-title>
          . In J. Domingue and C. Anutariya, editors,
          <source>Proceedings of the 3rd Asian Semantic Web Conference (ASWC'08)</source>
          , volume
          <volume>5367</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>1</fpage>
          <lpage>{</lpage>
          15. Springer-Verlag,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>L. G. Valiant.</surname>
          </string-name>
          <article-title>The complexity of computing the permanent</article-title>
          .
          <source>Theoretical Computer Science</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ):
          <volume>189</volume>
          {
          <fpage>201</fpage>
          ,
          <year>1979</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>