<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Automata-based Algorithm for ⋆ Description Logics around S RI Q</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Information Systems, Vienna University of Technology</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we use automata-theoretic techniques to tightly characterize the worst-case complexity of the knowledge base satisfiability problem for the very expressive Description Logics (DLs) ALCQIbr+eg and S RIQ. The logic ALCQIbr+eg extends ALC with qualified number restrictions, inverse roles, safe Boolean role expressions, regular expressions over roles, and concepts of the form ∃P.Self in the style of S RIQ, a well known DL closely related to the new Semantic Web Standard OWL 2. By reducing its knowledge base satisfiability problem to the emptiness test of an automaton on infinite trees, we show that all these additions do not increase the worst case complexity of ALC and provide a decision procedure for one of the most expressive DLs that have been shown to be decidable in exponential time. We also close the open question of the precise complexity of reasoning in S RIQ, exploiting a reduction from the S RIQ knowledge base satisfiability problem into the ALCQIbr+eg one.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Introduction
non-deterministic double exponential time even in the restricted case of SHIQ KBs. For
SROIQ, an optimal 2NEXPTIME upper bound could be established by reducing it (with an
exponential blow-up) to the two variable fragment of First Order Logic with counting, but the
tight complexity of SRIQ remained open [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>
        In this paper, we introduce an extension of the logic ALCQIbreg previously considered
in [
        <xref ref-type="bibr" rid="ref2 ref4">2, 4</xref>
        ]. Like SHIQ, SRIQ and other ‘main stream’ DLs, ALCQIbreg extends ALC with
inverses and counting. The main difference is that, instead of role inclusions and other axioms
asserting properties of roles, ALCQIbreg supports regular expressions over roles. They allow
ALCQIbreg to simulate the role hierarchies of SHIQ and SRIQ and, together with the
presence of (safe) Boolean role constructors, capture most of the features of SRIQ. Since all
remaining features are related to the ∃R.Self constructor, we add it to ALCQIbreg and obtain
its extension ALCQIbr+eg , which is (at least) as expressive as SRIQ (although less succinct).
      </p>
      <p>
        As we show in this paper, the ALCQIbr+eg knowledge base satisfiability problem allows
for an elegant reduction to the emptiness test of an automaton on infinite trees. The reduction
builds on [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and yields a worst-case optimal, single exponential time decision procedure. We
also present a reduction of the SRIQ knowledge base satisfiability problem to ALCQIbr+eg
that exponentially increases the size of the input knowledge base; this yields the first
2EXPTIME decision procedure for SRIQ and a tight characterization of its worst case complexity.
      </p>
      <p>
        The paper is organized as follows. In Section 2, after introducing the syntax and semantics
of ALCQIbr+eg knowledge bases, we define their syntactic closure, introduce a normal form
for them and a canonical form for their models. These allow us to extend the automata
algorithm of [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to ALCQIbr+eg in Section 3. In section 4 we show how SRIQ can be reduced to
ALCQIbr+eg . Final discussion and conclusions are given in Section 5.
2
      </p>
      <p>The DL ALCQIb+</p>
      <p>
        reg
We introduce the syntax and semantics of the DL ALCQIbr+eg , a natural extension of
ALCQIbreg [
        <xref ref-type="bibr" rid="ref2 ref4">2, 4</xref>
        ] with concepts of the form ∃S.Self and Boolean role inclusion axioms.
Definition 1 (ALCQIbr+eg concepts and roles). We consider fixed, countably infinite
alphabets of concept names C (also called atomic concepts), role names R and individual names
I. We assume that the set C contains the special concepts ⊤ (top) and ⊥ (bottom), while R
contains the top (universal) role T and the bottom (empty) role B.
      </p>
      <p>According to the following syntax, we define (ALCQIbr+eg ) concepts C, C′, atomic roles
Q, simple roles S, S′, and roles R, R′, where A ∈ C, P ∈ R and P 6= T.</p>
      <p>C, C′ −→ A | ¬C | C ⊓ C′ | C ⊔ C′ | ∀R.C | ∃R.C |&gt; n S.C |6 n S.C | ∃S.Self</p>
      <p>Q −→ P | P −
S, S′ −→ Q | S ∩ S′ | S ∪ S′ | S ∩ ¬S′</p>
      <p>R, R′ −→ T | S | R ∪ R′ | R ◦ R′ | R∗ | id (C)
We use S \ S′ as a shortcut for S ∩ ¬S′. An ALCQIbr+eg expression is a concept or a role.
Subconcepts, subroles and subexpressions are defined in the natural way.</p>
      <p>Definition 2 (Knowledge Bases). An (ALCQIbr+eg ) assertion is of the form C(a), S(a, b),
or a 6= b, where C is a concept, S a simple role and a, b ∈ I. An (ALCQIbr+eg ) ABox is a
set of assertions. An (ALCQIbr+eg ) concept inclusion axiom (CIA) is an expression C ⊑ C′
for arbitrary concepts C and C′. An Boolean role inclusion axiom (BRIA) is an expression
S ⊑ S′ for simple roles S and S′. An ALCQIbr+eg TBox is a set of CIAs and BRIAs.</p>
      <p>An (ALCQIbr+eg ) knowledge base (KB) is a pair K = hA, T i where T is an ALCQIbr+eg
TBox and A is a non-empty ALCQIbr+eg ABox.We denote by CK the set of atomic concepts
occurring in K, by RK the set of roles names occurring in K, by RK the set RK ∪ {P − | P ∈
RK}, and by IK the individuals in K.</p>
      <p>Definition 3 (Semantics). An interpretation I = (ΔI , ·I ) consists of a non-empty domain
ΔI and a valuation function ·I that maps each individual a ∈ I to an element aI ∈ ΔI ,
each concept name A ∈ C to a set AI ⊆ ΔI , and each role name P ∈ R to a set of pairs
RI ⊆ ΔI × ΔI , in such a way that ⊤I = ΔI , ⊥I = ∅, TI = ΔI × ΔI and BI = ∅. The
function ·I is inductively extended to all concepts and roles as follows:</p>
      <p>(¬C)I=ΔI \CI (P −)I = {(y, x) | (x, y) ∈ P I}
(C ⊓ C′)I=CI ∩ C′I (R ∩ R′)I = RI ∩ R′I
(C ⊔ C′)I=CI ∪ C′I (R ∪ R′)I = RI ∪ R′I
(∀R.C)I={x | ∀y.(x, y) ∈ RI → y ∈ CI } (¬R)I = (ΔI × ΔI) \ RI
(∃R.C)I={x | ∃y.(x, y) ∈ RI ∧ y ∈ CI} (R ◦ R′)I = RI ◦ R′I
(&gt; n S.C)I={x | |{y | (x, y) ∈ SI ∧ y ∈ CI }| ≥ n} (R∗)I = (RI)∗
(6 n S.C)I={x | |{y | (x, y) ∈ SI ∧ y ∈ CI }| ≤ n} (id(C))I = {(x, x) | x ∈ CI }
(∃S.Self)I={x | (x, x) ∈ SI}
where ∩, ∪ and \ are overridden to denote the standard set-theoretic operations, ◦ to denote
composition and ·∗ to denote the reflexive transitive closure of a binary relation.</p>
      <p>I satisfies an assertion α, denoted I |= α, if aI ∈ AI when α = A(a); haI , bI i ∈ P I
when α = P (a, b); and aI 6= bI when α = a 6≈ b. I is a model of an ABox A, denoted
I |= A if it satisfies every assertion in A.I satisfies a CIA or BRIA E ⊑ E′ if EI ⊆ E′I . I
is a model of a TBox T , denoted I |= T if it satisfies every CIA and BRIA in T . Finally, I is
a model of K = hT , Ai, denoted I |= K, if I |= T and I |= A. Given a KB K, Knowledge
base satisfiability is the problem of deciding whether there exists an I such that I |= K.
2.1 Syntactic Closure
We define the (syntactic) closure of a concept, which contains all the concepts and simple
roles that are relevant for deciding its satisfiability. It is analogous to the well-known
FischerLadner closure of PDL, and it is exploited by the automata construction in Section 3.</p>
      <p>In this section, we consider concepts and roles in a DL that we call ALCQIBr+eg ; it is
very similar to ALCQIbr+eg but supports arbitrary role negation ¬S instead of role difference
S ∩¬S′. If an ALCQIBr+eg expression is equivalent to an ALCQIbr+eg one, we call it safe.
Intuitively, since ALCQIBr+eg does not impose safety, it allows for a more flexible manipulation
of Boolean role expressions and a simpler notion of syntactic closure.</p>
      <p>In what follows, we use ≷ to denote either &gt; or 6. For a role name P ∈ C, we call P − the
inverse of P and P the inverse of P −; the inverse of an atomic role Q is denoted Inv(Q). For a
simple role S, Inv(S) denotes the role obtained by replacing each atomic role Q occurring in
S by its inverse Inv(Q). For brevity, we assume in this subsection that ⊔ and ∀ are expressed
by means ⊓, ∃ and using ¬, and that \ and ∪ are expressed using ∩ and ¬. As usual, C and
C′, S and S′ and R and R′ respectively stand for concepts, simple roles and arbitrary roles.
Definition 4. The closure Cl (D) of an ALCQIBr+eg concept D is defined as the smallest set
of ALCQIBr+eg expressions such that D ∈ Cl (D) and:
if C ∈ Cl (D) then ¬C ∈ Cl (D) (if C is not of the form ¬C′)
if ¬C ∈ Cl (D) then C ∈ Cl (D)
if C ⊓ C′ ∈ Cl (D) then C, C′ ∈ Cl (D)
if ∃R.C ∈ Cl (D) then C ∈ Cl (D)
if ∃(R ∪ R′).C ∈ Cl (D) then ∃R.C, ∃R′.C ∈ Cl (D)
if ∃(R ◦ R′).C ∈ Cl (D) then ∃R.∃R′.C ∈ Cl (D)
if ∃R∗.C ∈ Cl (D) then ∃R.∃R∗.C ∈ Cl (D)
if ∃id (C).C′ ∈ Cl (D) then C, C′ ∈ Cl (D)
if ∃S.C ∈ Cl (D) then S ∈ Cl (D)
if ≷ n S.C ∈ Cl (D) then S, C ∈ Cl (D)
if ∃S.Self ∈ Cl (D) then S ∈ Cl (D)
if S ∩ S′ ∈ Cl (D) then S, S′ ∈ Cl (D)
if ¬S ∈ Cl (D) then S ∈ Cl (D)
if S ∈ Cl (D) then ¬S ∈ Cl (D) (if S is not of the form ¬S′)
if S ∈ Cl (D) then Inv(S) ∈ Cl (D)</p>
      <p>Note that |Cl (D)| is linear in the length of D and that Cl (D) may contain non safe
ALCQIBr+eg expressions even when D is an ALCQIbr+eg concept.
2.2 Normalizing Knowledge Bases
We present now some simple reductions that allow us to transform a KB into an
equivalent one with a more restricted syntactic structure. We consider ALCQIbr+eg KBs as well as
ALCQIBr+eg ones, which are defined in the natural way.</p>
      <p>First of all, we consider the special expressions ⊤, ⊥, B and T, and show that they can be
expressed without the need of special symbols. Then we show how every KB can be
transformed into an extensionally reduced one, where all terminological information is expressed
by CIAs. Finally, we consider the well known negation normal form of KBs.
Universal role and special expressions. The special concepts ⊤ and ⊥ can be simulated via
any concept names C⊤ and C⊥ not occurring in K by adding, e.g., CIAs C ⊔ ¬C ⊑ C⊤,
¬C⊤ ⊑ C⊥ and ¬C⊥ ⊑ C⊤ for any concept C. Further, using ⊤ and ⊥, the empty role B
can also be easily simulated by a fresh role name PB by adding an axiom ⊤ ⊑ ∀PB.⊥. Note
that the above holds for every DL containing ALC.</p>
      <p>The universal role can be expressed in ALCQIBr+eg as ¬B, but this is not a safe role and
is hence disallowed in ALCQIbr+eg KBs. In fact, in the absence of the T symbol, there is no
ALCQIbr+eg role expression that is always equivalent to T. However, for each input KB K,
there exists a role expression U such that the resulting KB has a model I with U I = ΔI × ΔI
whenever K is satisfiable. This is similar to what occurs in other expressive DLs (e.g., SHIQ)
and sufficient for the problems we consider.</p>
      <p>Indeed, since every satisfiable ALCQIbr+eg concept C has a connected model, we
can transform C into an equisatisfiable concept replacing each occurrence of T by U =
(SQ∈RC Q)∗, where RC denotes the set of all atomic roles occurring in C. In the presence of
ABoxes, we additionally need to ensure that all pairs of ABox individuals are in the extension
of the role expression simulating T. Hence, given a KB K = hA, T i, we can eliminate the
universal role T as follows:
- For a fresh role name AU , an assertion AU (a, b) is added to A for every pair a, b ∈ IK.
- Each occurrence of T in T is replaced by the role U = (AU ∪ SQ∈RK Q)∗.
Extensionally reduced KBs. Let KB K = hA, T i be an ALCQIBr+eg KB. Of the
transformations below, the one w.r.t. to the ABox is well known. Here we use a similar technique to
rewrite by means of CIAs all terminological information in the ∃R.Self concepts and BRIAs.
Please note that all the transformations below involve only simple roles, hence the
resulting KB complies to the allowed syntax. Also, since they do not introduce any non-safe role
expressions, if K is an ALCQIbr+eg KB, it will remain in ALCQIbr+eg after the reduction.
– ABox reduction. K is extensionally reduced w.r.t. the ABox if only assertions of the forms
A(a), P (a, b) and a 6= b with A ∈ C and P ∈ R occur in A.</p>
      <p>The ABox reduction of K is the KB ΩABox(K) obtained as follows:
1. Each assertion of the form C(a) in A with C 6∈ C is replaced with an assertion</p>
      <p>AC (a) for a fresh AC ∈ C and an axiom AC ⊑ C is added to T .
2. Each assertion S(a, b) in A with S 6∈ R is replaced with an assertion PS (a, b) for a
fresh PS ∈ R and an axiom PS ⊑ S is added to T .
– Self concepts reduction. K is extensionally reduced w.r.t. Self concepts if P ∈ R for
every concept of the form ∃P.Self occurring in it.</p>
      <p>The Self concept reduction of K is the KB ΩSelf(K) obtained by replacing each concept of
the form ∃S.Self with S 6∈ R occurring in T with a concept ∃PS .Self for a fresh PS ∈ R
and an adding an axiom PS ⊑ S to T .
– BRIA reduction. K is extensionally reduced w.r.t. BRIAs if all axioms in T are CIAs.</p>
      <p>The BRIA reduction of K is the KB ΩBRIA(K) obtained by replacing in T each BRIA of
the form S ⊑ S′ by a CIA ∃(S \ S′).⊤ ⊑ ⊥.
– KB reduction. K is extensionally reduced if it is extensionally reduced w.r.t. the ABox,
Self concepts and BRIAs.</p>
      <p>The reduction of K is the KB Ω(K) = ΩBRIA(ΩSelf(ΩABox(K))).</p>
      <p>The above reductions preserve the semantics of the knowledge base K, and Ω(K)
additionally constrains the interpretation of some concept names not occurring in K. If we consider
the standard first order translation of K and Ω(K), then the latter is a conservative extension
of the former. With this observation, the proof of the following proposition is straightforward:
Proposition 1. For a given ALCQIBr+eg KB K, Ω(K) can be obtained in linear time. For
every interpretation I, I |= Ω(K)′ implies I |= K and I |= K implies I′ |= Ω(K), where I′
denotes the interpretation s.t. E′ I = EI for every E ∈ CK ∪ RK, (AC )I′ = CI for every
AC ∈ CΩ(K) \ CK, and (PS )I = SI for every PS ∈ RΩ(K) \ RK.</p>
      <p>
        Thus deciding the satisfiability of K can be reduced (in linear time) to used deciding the
satisfiability of Ω(K), and we can restrict or attention to extensionally reduced knowledge
bases. Note that we can also consider Ω(K) to decide the entailment of any sentence over the
language of K; this is useful, e.g., for query answering [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>Negation Normal Form. Finally, we transform KBs to negation normal form (NNF).</p>
      <p>An ALCQIBr+eg role R is in NNF form if Q is atomic for every subrole ¬Q of R.
Similarly, a concept C is in negation normal form (NNF) if A is atomic for every subconcept
¬A of C, and all roles occurring in C are in NNF. A knowledge base is in NNF if only
expressions in NNF occur in it. For an expression E, nnf (E) denotes the equivalent
expression in NNF obtained from E using the standard transformations. For a concept D, we let
Cl nnf (D) = {nnf (E) | E ∈ Cl (D)}. For a KB K, nnf (K) denotes the KB obtained from K
by replacing each expression E in K by nnf (E).</p>
      <p>Proposition 2. For every ALCQIBr+eg expression E, nnf (E) can be obtained in linear time,
and if E is an ALCQIbr+eg expression, then so is nnf (E). Further, for every KB K and every
interpretation I, I |= K iff I |= nnf (K) .</p>
      <p>An ALCQIbr+eg KB K = hA, T i is normal if it is extensionally reduced and in NNF, it
does not contain ⊤, ⊥, T and B, and each concept occurring in A also occurs in T .1
1 This is w.l.o.g. as we can add to T , e.g., A ⊑ ⊤ for each A ∈ C that occurs in A but not in T .
2.3</p>
      <p>Canonical Models and Trees
We have seen that, to decide KB satisfiability, we only need to consider KBs with a restricted
syntax. Now we consider some semantic properties by which the shape of the considered
interpretations can also be conveniently restricted.</p>
      <p>In what follows, we consider only ALCQIbr+eg KBs; the results of this section do not hold
for ALCQIBr+eg in general.2 Like many DLs, ALCQIbr+eg has some form of the tree model
property: every satisfiable TBox T (or similarly, every satisfiable concept C) has a model
that can be seen as a tree with possible additional loops at some nodes, say a ‘quasi-tree’. A
satisfiable ALCQIbr+eg KB K = hA, T i has a ‘quasi-forest’ shaped canonical model, in which
each ABox individual is the root of a quasi-tree shaped model of T .</p>
      <p>Definition 5. Let K = hT , Ai be an ALCQIbr+eg KB, and let 1 ≤ n ≤ |IK|, k ≥ 0. An
interpretation I = (ΔI , ·I ) for K is called a canonical interpretation (with n roots and branching
degree k) if:
(1) each i·x ∈ ΔI has i ∈ {1, . . . , n} and x ∈ {1, . . . , k}∗; if x = ε then i is called a root.
(2) {ε} ∪ ΔI is prefix closed, i.e., if x·c ∈ ΔI , then x ∈ {ε} ∪ ΔI .
(3) For each a ∈ IK there is exactly one root j such that aI = j.
(4) For each root j there is some a ∈ IK with aI = j.
(5) If (i·w, j·w′) ∈ P I for some atomic role P and two roots i, j, then either w = w′ = ε,
or i = j and one of the following holds: (i) w = w′, (ii) w′ = w·l or (iii) w = w′·l, for
some 1 ≤ l ≤ k.</p>
      <p>
        In ALCQIbr+eg , any TBox T can be ‘internalized’ into an equivalent concept CT , so that
the satisfiability of T can be established by obtaining a model of CT [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>Proposition 3. Consider an ALCQIbreg knowledge base K</p>
      <p>CT = ∀(SR∈RK R)∗. FC1⊑C2∈T (¬C1 ⊔ C2)
For every interpretation I = (ΔI , ·I ), I |= T iff (CT )I = ΔI . Furthermore, if I is a
canonical interpretation, then I |= T iff i ∈ (CT )I for each root i of I.
=
hA, T i, and let</p>
      <p>
        Now we can establish the canonical model property of ALCQIbr+eg ; it can be shown by
adapting corresponding proofs for related logics [
        <xref ref-type="bibr" rid="ref15 ref17">17, 15</xref>
        ]. Roughly, any model of a satisfiable
ALCQIbr+eg concept D can be unraveled into a model of D that is a quasi-tree and that has
branching degree kD = |Cl (D)| × n, where n is the maximal n occurring in a concept of the
form &gt; n S.C in Cl nnf (D), or 1 if there are no such concepts.
      </p>
      <p>Theorem 1. Every satisfiable ALCQIbr+eg KB K = hT , Ai has a canonical model I such
that I is a canonical interpretation for K with branching kCT , I |= A and i ∈ (CT )I for
each root i of I.</p>
      <p>
        By Theorem 1, which does not hold for ALCQIBr+eg in general, deciding the satisfiability
of an ALCQIbr+eg KB boils down to deciding whether it has a canonical model. To decide
the latter, we rely on a representation of canonical interpretations as infinite labeled trees and
define an automaton that decides the existence of such trees.
2 Theorem 1 fails already in the extension of ALC with non-safe Boolean role expressions, see [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
Definition 6. An (infinite) tree is a prefix-closed set T of words over the natural numbers IN.
The elements of T are called nodes, the empty word ε is its root. For every x ∈ T , the nodes
x·c with c ∈ IN are the successors of x, and x is the predecessor of each x·c. By convention,
x·0 = x, and (x·i)·−1 = x. The branching degree d(x) of a node x is the number of its
successors. If d(x) ≤ k for each node x of T , then T has branching degree k. An infinite path
π of T is a prefix-closed set π ⊆ T where for every i ≥ 0 there exists a unique node x ∈ P
with |x| = i. A labeled tree over an alphabet Σ (or simply a Σ-labeled tree) is a pair (T , V ),
where T is a tree and V : T → Σ maps each node of T to an element of Σ.
      </p>
      <p>In the following, we assume a (fixed) given normal ALCQIbr+eg KB K = hA, T i. We
denote IK by J and assume an arbitrary, fixed, enumeration a1, . . . , am of the elements of
J . We define PI = {P ij | ai, aj ∈ J and P ∈ RK} and PS = {PSelf | P ∈ RK}. We will
define the tree representation of a canonical interpretation I for K as a labeled tree TI over
the alphabet ΣK = 2CK∪RK∪J ∪{r}∪{d}∪PI∪PS.</p>
      <p>To define TI , we first note that the domain of a canonical interpretation is almost a tree;
we only need to add a root ε to it. The extensions of the interpretations of individuals, concepts
and roles can be represented as labels over the alphabet ΣK in a straightforward way. Roughly,
each element of x ∈ ΔI , which is a node of the tree, is labeled with a set V (x) that contains:
(i) the atomic concepts A such that x ∈ AI ; (ii) the atomic roles that connect the predecessor
of x to x, and (iii) a special symbol PSelf for each P such that x ∈ (∃P.Self)I . The root ε and
the roots of I (which are at the first level of the tree TI ) are treated differently. The label of
each root i of I, apart from the atomic concepts to which i belongs, also contains the name of
the individuals in J which it interprets, and it contains no basic roles. The relations between
level one nodes are stored in the label of the root ε. The root ε does not represent any object in
ΔI and is marked with a special identifier r and symbols of the form P ij indicating that the
pair of individuals (ai, aj ) belongs to the extension of the basic role P . For simplicity, if I has
n roots and |J | &gt; n, |J | − n dummy children labeled {d} are added to the root ε, ensuring
that it has exactly |J | children.</p>
      <p>Definition 7. Let I be a canonical interpretation for K with n roots. The tree representation
of I is the labeled tree TI = (T , V ) over the alphabet ΣK = 2CK∪RK∪J ∪{r}∪{d}∪PI∪PS
defined as follows:
• T = {ε} ∪ ΔI ∪ {n + 1, . . . , m}.
• V (ε) = {r} ∪ {P ij | ai, aj ∈ J , P ∈ CK and haiI , ajI i ∈ P I },
• for each 1 ≤ i ≤ n, V (i) = {aj ∈ J | ajI = i} ∪ {A ∈ CK | ajI ∈ AI and ajI = i},
• for each n + 1 ≤ i ≤ m, V (i) = {d},
• for all other nodes i·x of T , V (i·x) = {A ∈ CK | i·x ∈ AI } ∪ {Q ∈ RK | (i·w, i·x) ∈</p>
      <p>
        P I where x = w·j for some j, 1 ≤ j ≤ kCT } ∪ {PSelf | P ∈ RK and i·x ∈ (∃P.Self)I }
Note that the branching degree of TI is bounded by |J | at the root and by kCT at all other
levels, so the tree has branching degree max(kCT , |J |).
3
Automata on infinite trees allow for an elegant reduction of decision problems for temporal
and program logics [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and have been widely exploited for many variants of PDL, the
μcalculus and similar logics [
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ]. In DLs they have been used for concept satisfiability [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]
and, to a more limited extent, for KB satisfiability [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
We consider two-way alternating tree automata over infinite trees, introduced in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Let
B(I) be the set of positive Boolean formulas built inductively from true, false, and atoms from
a given set I applying ∧ and ∨. A set J ⊆ I satisfies a formula ϕ ∈ B(I), if assigning true to
the atoms in J and false to those in I \ J makes ϕ true. A two-way alternating tree automaton
(2ATA) running over infinite trees with branching degree k, is a tuple A = hΣ, Q, δ, q0, F i,
where Σ is the input alphabet; Q is a finite set of states; δ : Q × Σ → B([k] × Q), where
[k] = {−1, 0, 1, . . . , k}, is the transition function; q0 ∈ Q is the initial state; and F ⊆ Q is
the (Bu¨ chi) acceptance condition.
      </p>
      <p>The transition function δ maps a state q ∈ Q and an input letter σ ∈ Σ to a positive
Boolean formula ϕ over atoms [k] × Q. Intuitively, if δ(q, σ) = ϕ, then each atom (c, q′) in ϕ
corresponds to a new copy of the automaton going in the direction given by c and starting in
state q′. E.g., a transition of the form δ(q1, σ) = (1, q2) ∧ (1, q3) ∨ (−1, q1) ∧ (0, q3) indicates
that if A is in the state q1 and reads the node x labeled with σ, it proceeds by sending off
either two copies, in the states q2 and q3 respectively, to the first successor of x, or one copy
in the state q1 to the predecessor of x and one copy in the state q3 to x itself.</p>
      <p>Informally, a run of a 2ATA A over a labeled tree (T , V ) is a labeled tree (Tr, r) in which
each node y ∈ Tr is labeled by an element r(y) = (x, q) ∈ T × Q and describes a copy of A
that is in the state q and reads the node x of T ; the labels of adjacent nodes must satisfy the
transition function of A. The run is accepting if in every infinite path at least one state from
F occurs infinitely often in the node labels.</p>
      <p>Formally, given a 2ATA A = hΣ, Q, δ, q0, F i and a Σ-labeled tree (T , V ), a run of A
over (T , V ) is a T ×Q-labeled tree (Tr, r) satisfying:
• ε ∈ Tr and r(ε) = (ε, q0).
• For each y ∈ Tr with r(y) = (x, q) there is a (possibly empty) set S = {(c1, q1) , . . . ,
(cn, qn)} ⊆ [k] × Q such that S satisfies δ(q, V (x)) and for all 1 ≤ i ≤ n, we have that
y·i ∈ Tr, x·ci is defined and r(y·i) = (x·ci, qi).</p>
      <p>A run (Tr, r) over (T , V ) is accepting, if for each infinite path π of Tr there is some state
qf ∈ F such that the set {y ∈ π | r(y) = (x, qf ) for some x ∈ T } is infinite. A labeled tree
(T , V ) is accepted by A if there is an accepting run of A over (T , V ), and L(A) denotes the
set of Σ-labeled trees accepted by A.</p>
      <p>
        The non-emptiness problem for 2ATAs is to decide whether, given a 2ATA A, the set L(A)
is nonempty. It was shown in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] that non-emptiness of A = hΣ, Q, δ, q0, F i is decidable in
time single exponential in |Q| and polynomial in |Σ|.
In the following, we assume a given, fixed, normal ALCQIbr+eg KB K = hA, T i and provide
a construction of a 2ATA AK from K that accepts a given tree T iff it is the tree
representation of a canonical model of K. Similar automata constructions have been given in the
literature. In particular, this one is a simple extension of the one in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to handle the ∃S.Self
concepts of ALCQIbr+eg . In turn, [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] extends to KB satisfiability a previous construction of
an automaton that decides ALCQIbreg concept satisfiability [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Due to space constraints, we
briefly describe AK = hΣK, QK, δK, q0, FKi and focus on the extension from ALCQIbreg to
ALCQIbr+eg ; we refer the reader to [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] for details.
      </p>
      <p>
        The alphabet is ΣK = 2CK∪RK∪J ∪{r}∪{d}∪PI∪PS as given in Definition 7; note that the
only difference w.r.t. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is the presence of the set PS.
      </p>
      <p>
        Like in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the ‘basic’ states of AK are the role and concept expressions in the closure of
CT , which are used for checking whether a node of the tree is in the extension of an expression
or marked with some symbol. There are also additional special states for verifying some more
involved conditions like the satisfaction of the number restrictions, the correct representation
of the individuals and the ABox assertions, etc.
      </p>
      <p>
        More precisely, let S = J ∪ PS ∪ {c, d}. There is a state for each element of the the
set Cl ext = Cl nnf (CT ) ∪ {s, ¬s | s ∈ S} which we call the S-extended closure of CT . The
extended closure allows us to treat the individual names and special symbols in the node labels
as well as the atomic expressions in a uniform way. Note that, in contrast to [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], Cl ext may
contain concepts of the form ∃P.Self. The set of states is SK = {s0, s1} ∪ Cl ext ∪ Snum ∪
SA role ∪ SA quant ∪ SA num ∪ SSelf, where the sets SA quant Snum , SA num and SA role are
as in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. We use the states in SA quant of the forms hj, ∃S.Ci and hj, ∀S.Ci to respectively
check whether concepts of the form ∃Q.C and ∀Q.C are satisfied by a node representing
some ABox individual aj . For checking the satisfaction of number restrictions we need the
states Snum of the form h≷ n S.C, i, ji that store two counters i and j, plus the states SA num
of the form hm, ≷ nQC, i, ji, which also store the value of two counters but for a particular
individual am. For checking whether two ABox individuals ai, aj are related by a simple role
S, we use states of the form Sij in SA role . The only additional states w.r.t. to [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] are the ones
in the set SSelf = {SSelf | S a simple role in Cl nnf (CT )}, which we use for decomposing a
simple role and checking whether it links an individual to itself. Finally, there is the initial state
s0 and another special state s1. As in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], both are used to verify the satisfaction of general
conditions necessary for the input tree to represent a canonical model of K, e.g., that the root
is the only node labeled r, that its label correctly stores the role assertions in the ABox and
that the ABox individuals are correctly represented by the level one nodes.
      </p>
      <p>
        As in [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2–4</xref>
        ], FK = {∀R∗.C | ∀R∗.C ∈ Cl nnf (CT )} is the acceptance condition.
      </p>
      <p>
        The transition function δ : SK × ΣK → B([k] × SK), where k = max(kCT , |J |), is
defined as follows. First, for each σ in σ ∈ ΣK with r ∈ σ we define a transition δ(s0, σ)
=F1 ∧ · · · ∧ F8 from the initial state s0, which verifies (i) that the root is the only node
containing r and that the ABox individuals are properly represented by the level one nodes
(F1–F4); (ii) that all ABox assertions are satisfied (F5–F7); and (iii) that every non-dummy
node at level one is the root of a tree representing a model of CT (F8). F4 requires a transition
δ(s1, σ) from the other special state s1 for each σ ∈ ΣK, which is necessary for F4 to correctly
ensure that the symbol r only occurs at the root and the symbols in J only occur at the first
level of the tree. These transitions are as in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], to which the reader may refer for details.
      </p>
      <p>
        The transitions that inductively decompose complex concepts while navigating the tree
are as in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]; so are the ones that decompose simple roles. Additionally, for each concept
of the form ∃P.Self such that ∃P.Self ∈ SK and for each σ ∈ ΣK there is a transition
δ(∃P.Self, σ) = (0, PSelf). Special transitions are necessary to decompose the simple roles
for each possible self loop and for each pair of ABox individuals. Thus we have for each
σ ∈ ΣK and for each s ∈ SSelf ∪ SA role a transition δ(s, σ) as follows:
δ(S ∩ S′Self, σ) = (0, SSelf) ∧ (0, S′Self) δ(S ∩ S′ij, σ) = (0, Sij) ∧ (0, S′ij)
δ(S ∪ S′Self, σ) = (0, SSelf) ∨ (0, S′Self) δ(S ∪ S′ij, σ) = (0, Sij) ∨ (0, S′ij)
δ(Q \ Q′Self, σ) = (0, QSelf) ∧ (0, ¬Q′Self) δ(Q \ Q′ij, σ) = (0, Qij) ∧ (0, ¬Q′ij)
Informally, given a tree representing an interpretation, let us call potential neighbors of x
all the nodes that could be related to a node x via a simple role. If x does not represent an
ABox individual, then its potential neighbors are x itself, its predecessor and all its successors.
If x does represent an ABox individual, then instead of its predecessor (which is the dummy
root), all the other nodes representing ABox individuals are potential neighbors of x.
      </p>
      <p>
        Further transitions in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] verify whether a node satisfies an universal restriction ∀S.C, an
existential restriction ∃S.C or a number restriction ≷ nS.C, for a simple role S; they navigate
all the potential neighbors of a node and test which are reachable via S and labelled C. We use
similar transitions, but adapt them to take into account that, in an ALCQIbr+eg interpretation,
a node is a potential neighbor of itself. The transitions for the concepts of the form ∀S.C and
∃S.C are given below. The ones for the number restrictions (which use the states in Snum and
SA num to encode counters) are adapted similarly but omitted here due to space constraints.
      </p>
      <p>Since the potential neighbors of a node x are different depending on whether (i) x is a
level one node, or (ii) x is a node at any other level, the transitions must differ accordingly.
Hence, for each concept of the form ∃S.C or ∀S.C in Cl nnf (CT ) with S simple and each
σ ∈ ΣK, if σ ∩ (J ∪ {d}) = ∅ we define:
δ(∃S.C, σ) = ((0, SSelf) ∧ (0, C)) ∨ ((0, Inv(S)) ∧ (−1, C)) ∨</p>
      <p>W1≤i≤kCT ((i, S) ∧ (i, C))
δ(∀S.C, σ) = ((0, nnf (¬SSelf)) ∨ (0, C)) ∧ ((0, nnf (¬Inv(S))) ∨ (−1, C)) ∧</p>
      <p>V1≤i≤kCT ((i, nnf (¬S)) ∨ (i, C))
Otherwise, if σ ∩ (J ∪ {d}) 6= ∅, we have:
δ(∃S.C, σ) = ((0, SSelf) ∧ (0, C)) ∨ Waj∈σ(−1, hj, ∃S.Ci)∨</p>
      <p>W1≤i≤kCT((i, S) ∧ (i, C))
δ(∀S.C, σ) = ((0, nnf (¬SSelf)) ∨ (0, C)) ∧ Vaj∈σ(−1, hj, ∀S.Ci) ∧</p>
      <p>V1≤i≤kCT ((i, nnf (¬S)) ∨ (i, C))</p>
      <p>
        The first disjunct/conjunct in these transitions, which verifies whether the current node
reaches itself via S and is labelled C, is not present in the corresponding transitions from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        In the last set of transitions (case (i) above, σ ∩ (J ∪ {d}) 6= ∅), the second
disjunct/conjunct is responsible for sending a copy of the automaton to the root of the tree and
moving to the special states in SA quant , in order to traverse all the ABox individuals which are
potential neighbors of the current node. The latter is achieved as in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], i.e., for each σ ∈ ΣK
and each hj, ∃S.Ci or hj, ∀S.Ci in SA quant , there is a transition
δ(hj, ∃S.Ci, σ) = _ ( _ ((0, Sjk) ∧ (i, ak) ∧ (i, C)))
1≤i≤|J | 1≤k≤|J |
      </p>
      <p>^ (
δ(hj, ∀S.Ci, σ) =</p>
      <p>^ ((0, nnf (¬Sjk)) ∨ (i, ¬ak) ∨ (i, C)))
1≤i≤|J | 1≤k≤|J |</p>
      <p>The above transitions decompose all concepts and roles until they reach states
corresponding to atomic expressions or (possibly negated) special symbols in PI ∪ PS; it is then checked
whether the expression is contained in the node label σ. The transitions form the states s0 and
s1 may also move to states corresponding to possibly negated symbols in J ∪{r, d}, which are
similarly checked at the node labels. Thus, for each σ ∈ ΣK, each s ∈ CK ∪ RK ∪ J ∪ {r, d}
and each t ∈ PI ∪ PS, there are transitions:</p>
      <p>(true, if s ∈ σ (true, if s 6∈ σ
δ(s, σ) =
δ(t, σ) =</p>
      <p>
        false, if s 6∈ σ
(true, if t ∈ σ or Inv(t) ∈ σ
false, otherwise
δ(¬s, σ) =
δ(¬t, σ) =
false, if s ∈ σ
false, otherwise
(true, if t 6∈ σ and Inv(t) 6∈ σ
where Inv(t) = Inv(P )ji if t = P ij and Inv(t) = Inv(P )Self if t = PSelf for each P ∈ RK.
These transitions extend similar ones in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to include the symbols in PS.
      </p>
      <p>
        By a simple adaptation of the proofs in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], it can be easily verified that, given a canonical
model I of K, its tree representation TI is accepted by AK. Furthermore, a model IT of K
can be constructed from any labeled tree T = (T , V ) accepted by AK. The domain ΔIT is
given by the nodes x in T with ai ∈ V (x) for some individual ai, and the nodes in T that
are reachable from any such x through the roles. The extensions of concepts and roles are
determined by the labels of the nodes in T. The only difference w.r.t. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is that we also add a
pair (x, x) to P IT for every x whose label contains PSelf.
      </p>
      <p>This shows that the automaton AK can be used to decide the satisfiability of K.
Theorem 2. K is satisfiable iff the set of trees accepted by AK is nonempty.</p>
      <p>
        Under unary encoding of numbers in restrictions, the number of states of AK is polynomial
in the size of K. Since ΣK is single exponential in the size of K, we can exploit the results of
[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] to obtain an EXPTIME upper bound for satisfiability of ALCQIbr+eg KBs. This is optimal,
as a matching lower bound is known for much weaker DLs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>Corollary 1. Satisfiability of ALCQIbr+eg knowledge bases is EXPTIME-complete.
4</p>
      <p>
        Reasoning in SRI Q
The description logic SRIQ was introduced in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] as an extension of the DL RIQ [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], which
in turn extends the well known SHIQ [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] underlying OWL-Lite. It has gained considerable
attention recently due to its close relationship to the logic SROIQ underlying the new OWL
2 standard. The most prominent feature of SRIQ are complex role inclusion axioms of the
form R1 ◦ . . . ◦ Rn ⊑ R. Additionally, in an SRIQ KB it is possible to explicitly state certain
properties of roles like transitivity, (ir)reflexivity and disjointness. Some of these additions
increase the expressivity of the logic, while other are just ‘syntactic sugar’ and are intended to
be useful for ontology engineering. We now recall the definition of SRIQ.
Definition 8 (SRIQ knowledge bases). Let R = R ∪ {R− | R ∈ R}. Each R ∈ R is a
(SRIQ) role. As usual, P − is the inverse of P , P the inverse of P −, and Inv(R) denotes the
inverse of the role R. A (SRIQ) role inclusion axiom (SRIA) is an expression of the form
R1 ◦ . . . ◦ Rn ⊑ R, where n ≥ 1 and {R1, . . . , Rn, R} ⊆ R.
      </p>
      <p>A set Rh of SRIAs is regular if there exists a partial order ≺ on R such that Inv(R) ≺ R′
iff R ≺ R′ for every R, R′ ∈ R, and such that every SRIA in Rh is of one of the following
forms:
(i) R ◦ R ⊑ R, or
(ii) Inv(R) ⊑ R, or
(iii) R1 ◦ . . . ◦ Rn ⊑ R and Ri ≺ R for each 1 ≤ i ≤ n, or
(iv) R ◦ R1 ◦ . . . ◦ Rn ⊑ R and Ri ≺ R for each 1 ≤ i ≤ n, or
(v) R1 ◦ . . . ◦ Rn ◦ R ⊑ R and Ri ≺ R for each 1 ≤ i ≤ n.</p>
      <p>Given a set Rh of SRIAs, let RRh denote the set of roles occurring in Rh. The set of
simple roles in Rh is the minimal set SR(Rh) ⊆ RRh containing each role R such that (i)
there are no SRIAs of the form R1 ◦ . . . ◦ Rn ⊑ R in Rh, and (ii) if n = 1 and R1 ∈ SR(Rh)
for every SRIA of the form R1 ◦ . . . ◦ Rn ⊑ R in Rh, then R ∈ SR(Rh).</p>
      <p>A role assertion is an expression of the form Sym(R), Ref(R), Irr(R), or Dis(R, R′), where
R, R′ ∈ R.3 An RBox R is a finite set of SRIAs and role assertions such that R ∈ SR(Rh)
for each R occurring in a role assertion of the form Ref(R), Irr(R) or Dis(R, R′) in R,
where Rh denotes the set of SRIAs in R. We say that R is regular if Rh is regular, and define
SR(R) = SR(Rh).</p>
      <p>Similarly as in ALCQIbr+eg , (SRIQ) concepts C, C′ obey the following syntax
C, C′ −→ A | ¬C | C ⊓ C′ | C ⊔ C′ | ∀R.C | ∃R.C |&gt; n S.C |6 n S.C | ∃S.Self,
where A ∈ C and R, S ∈ R. A (SRIQ) concept inclusion axiom (SCIA) is an expression
C ⊑ C′ for arbitrary SRIQ concepts C and C′. A (SRIQ) TBox is a set of SCIAs. For an
RBox R, a concept C is R-simple if S ∈ SR(Rh) for each S occurring in a subconcept of C
of the form &gt; n S.C′, 6 n S.C′ or ∃S.Self; a TBox T is R-simple if all concepts occurring
in the SCIAs of T are R-simple.</p>
      <p>An (SRIQ) assertion is an expression C(a), R(a, b), ¬S(a, b) or a 6= b, where C is a
SRIQ concept, S, R are SRIQ roles and a, b ∈ I. An (SRIQ) ABox is a set of SRIQ
assertions. An ABox A is R-simple for an RBox R if S ∈ SR(Rh) for each S occurring in an
assertion of the form ¬S(a, b).</p>
      <p>An (SRIQ) knowledge base is a tuple K = hA, T , Ri where R is a regular SRIQ
RBox, A a non-empty R-simple SRIQ ABox and T an R-simple SRIQ TBox.</p>
      <p>The semantics of SRIQ TBoxes and ABoxes is defined as for ALCQIbr+eg . An
interpretation I satisfies a role assertion Sym(R), Ref(R) or Irr(R) if RI is a symmetric, reflexive
or irreflexive relation, respectively; I satisfies Dis(R, R′) if the relations are disjoint, i.e.,
RI ∩ R′I = ∅; I satisfies a SRIA R1 ◦ . . . ◦ Rn ⊑ R if R1I ◦ . . . ◦ RnI ⊆ RI , where again we
override the symbol ◦ and use it to denote binary role composition. An interpretation I is a
model of an RBox, in symbols I |= R, if it satisfies all SRIAs and role assertions in R.
Modelhood of a KB is restricted in the natural way to the models of the RBox, i.e., I |= hA, T , Ri
iff I |= A, I |= T and I |= R.
4.1 Reducing SRI Q to ALCQI br+eg
The restriction to regular RBoxes, which is crucial for the decidability of SRIQ, ensures that
all implications between roles can be captured by a set of regular expressions. This will allow
us to reduce reasoning in SRIQ to reasoning in ALCQI br+eg , and to show that the algorithm
we have presented is also an optimal decision procedure for SRIQ KB satisfiability.</p>
      <p>
        If R is a regular RBox, then for every role R ∈ R there is a regular expression ρR such
that a word w = R1 ◦ . . . ◦ Rn over R is in the language of ρR iff wI ⊆ RI in every model
I of R. This was shown in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for RIQ and its generalization to SRIQ is straightforward.4
Proposition 4. Given a regular RBox R, for each R ∈ R there is a regular expression ρR
such that RI = (ρR)I for every model I of R.
      </p>
      <p>
        Reducing TBoxes. Each regular expression ρR is an ALCQIbr+eg role. Further, if a role S
is simple in R, then ρS is the regular expression S(S′⊑S)∈R S′, which is in fact a simple
ALCQIbr+eg role. As a consequence, if we replace R by ρR is any SRIQ concept, the
resulting expression is an ALCQIbr+eg concept. This allows for a natural translation of SRIQ
concepts and TBoxes into ALCQI br+eg .
3 In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] also role assertions of the form Tra(R), asserting that R is transitive, are allowed. We omit
them since this can be equivalently expressed with an SRIA R ◦ R ⊑ R.
4 It follows also from [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and the equivalence of finite state automata and regular expressions.
Definition 9. Let R be a regular RBox. For each R ∈ R, ρR denotes a (fixed, arbitrary)
regular expression such that RI = (ρR)I for every model I of R. For any SRIQ concept C,
we denote by CR the ALCQIbr+eg concept that results from replacing each role R in C with
ρR. Similarly, for any SRIQ TBox T , we define T R = {CR ⊑ DR | C ⊑ D ∈ T }.
Proposition 5. Let R be a regular SRIQ RBox and I a model of R. Then RI = (ρR)I for
every SRIQ role R and CI = (CR)I for every SRIQ concept C, and I |= T iff I |= T R.
Reducing ABoxes. Now we consider ABoxes. First, we remove the negative role assertions,
which are not allowed in ALCQIbr+eg , as follows.
      </p>
      <p>Definition 10. Given an SRIQ ABox A, let Aneg contain all negative role assertions
¬S(a, b), and let Apos be obtained by replacing each assertion ¬S(a, b) in Aneg by P¬S (a, b),
where P¬S is a fresh role name. Further, let RA be the set of ALCQIbr+eg BRIAs containing
P¬S ∩ S ⊑ B for each ¬S(a, b) in A.</p>
      <p>If an ABox contains an assertion in which a non-simple role R occurs, ρR is not a
simple ALCQIbr+eg role and hence we can not replace R with it. Instead, we explicitly add as
assertions all relations between individuals that are implied by the RBox.</p>
      <p>Definition 11. Let A and R be a SRIQ ABox and a SRIQ RBox respectively. The
Rextension of A, denoted AR, is the minimal set of assertions containing A such that, if there
are individuals a1, . . . , an+1 ∈ IK and a SRIA R1◦. . .◦Rn ⊑ R in R such that Ri(ai, ai+1) ∈
AR for each 1 ≤ i ≤ n, then R(a1, an+1) ∈ AR.</p>
      <p>Reducing RBoxes. As for the RBox, the satisfaction of the SRIAs is ensured by the regular
expressions added to the TBox and the assertions added to the ABox. We only have to ensure
the satisfaction of the role assertions, which is easy in ALCQIbr+eg .</p>
      <p>Definition 12. Let R be a regular SRIQ RBox and let R′ denote the set of role assertions
in R. We denote by RB the set of BRIAs and CIAs that contains:
• R ⊑ R′ for every SRIA R ⊑ R′ in R such that R′ ∈ SR(R),
• R ⊑ Inv(R) and Inv(R) ⊑ R for every assertion Sym(R) in R′,
• ⊤ ⊑ ∃R.Self for every assertion Ref(R) in R′,
• ∃R.Self ⊑ ⊥ for every assertion Irr(R) in R′, and
• R ∩ R′ ⊑ B for every assertion Dis(R, R′) in R′.</p>
      <p>Reducing KBs. Now we can reduce any given SRIQ KB K = hA, T , Ri into an equivalent
ALCQIbr+eg KB K = hA, T i as follows.</p>
      <p>Definition 13. Let K = hA, T , Ri be a SRIQ knowledge base. We denote by K′ = hA′, T ′i
the following ALCQIbr+eg KB:
• The ABox A′ is AR, where A = (A \ Aneg) ∪ Apos.
• The TBox T ′ is T R ∪ RA ∪ RB .</p>
      <p>For any interpretation I, I |= K implies I |= K′. The converse holds in a slightly weaker
form. While a model of K′ need not be closed under the RBox and it may not be a model of K,
the models of K′ and K may only differ in the interpretation of some ‘implied’ roles. Adding
them to a model of K′, we obtain a model of K.</p>
      <p>For a model I of K′, the extension of I to R is the interpretation IR such that ΔI = ΔI ;
(A)IR = (A)I for every atomic concept A; and for every atomic role P occurring in R, if
R
(x, y) ∈ (ρP )I or (y, x) ∈ (ρP − )I , then (x, y) ∈ (P )I . It can be easily verified that if
I |= K′, then IR |= K. Hence, we can reduce the satisfiability problem for any SRIQ
knowledge bases to satisfiability of an ALCQIbr+eg one.
Proposition 6. Let K = hA, T , Ri be a SRIQ knowledge base. Then K is satisfiable iff K′
is satisfiable.</p>
      <p>As a consequence, the automata algorithm in Section 3 can be used to decide the
satisfiability of SRIQ knowledge bases.</p>
      <p>
        Furthermore, it is worst case optimal. All steps are clearly polynomial in the size of A, T
and exp(R). However, the size of exp(R) can be exponential in the size of R [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
Theorem 3. The satisfiability of an SRIQ KB K = hA, T , Ri is decidable in time
exponential in the combined sizes of T , A and exp(R), and double exponential in the size of K.
      </p>
      <p>
        It was recently shown that SRIQ is 2EXPTIME hard [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], hence our bound is optimal.
Corollary 2. The satisfiability of SRIQ knowledge bases is 2EXPTIME-complete.
      </p>
      <p>
        Note that the blow-up in complexity w.r.t. ALCQIbr+eg is due to the size of exp(R), and
that the algorithm is single exponential whenever exp(R) is polynomial in R; this includes,
for example, the so-called simple role hierarchies defined in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Our algorithm compares well
to the SRIQ algorithm in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], which may require non-deterministic double exponential time
even for such restricted cases.
5
      </p>
      <p>
        Conclusions
We have shown that the DL ALCQIbr+eg fully captures the DL SRIQ. Although the latter
is exponentially more succinct, reasoning is also (provably) exponentially harder in the worst
case. The syntax of ALCQIbr+eg allows for additional constructions not supported by SRIQ,
like Boolean role expressions and inclusions between them. Since these additional features
can be useful for modeling KBs and for ontology engineering [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], it seems reasonable to
consider ALCQIbr+eg as a suitable alternative for SRIQ. It would even be possible to
modify the syntax of ALCQIbr+eg and make it more similar to SRIQ from the perspective of
ontology engineering. For example, instead of using the regular expressions in concepts as
in ALCQIbr+eg , a special set of defined role names rR could be used, and assertions of the
form ρR ⊑ rR added to the KB. Provided that the defined roles rR are not used as atomic
roles in complex role expressions, reasoning could be done with the techniques described
here. The resulting logic would be a very simple and natural extension of SRIQ and would
avoid the effort of recognizing whether a given role hierarchy is regular and of transforming
it into a set of finite automata or regular expressions. Note that the language of the regular
expressions supported in (such a variation of) ALCQIbr+eg could also be extended to be
exponentially more succinct (e.g. by allowing squaring of expressions), resulting in a logic that
may be comparable to SRIQ in terms of succinctness.
      </p>
      <p>In order for ALCQIbr+eg to become a suitable alternative to SRIQ in practice, the main
challenge that remains ahead is to explore alternatives for practicable algorithms for this kind
of logics, and to implement them in actual reasoners. Unfortunately, DLs that support
transitive closure, or more generally, regular expressions over roles in the style of PDL, have largely
been displaced by the better known alternative of role hierarchies on which implemented
reasoners have focused. While it is often claimed that the development systems that support such
logics is problematic, to our knowledge there have been no major attempts to do it. Regular
expressions are nevertheless worth exploring, as they are well understood and widely used in
computer science, and are likely to be useful for ontology engineering.</p>
      <p>
        Another natural alternative is to extend SRIQ with more constructors from ALCQIbr+eg .
In particular, one can consider SRIQb, the extension of SRIQ with safe Boolean role
expressions; similar extensions of closely related DLs like SHIQ and SROIQ were
already considered in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. It is easy to verify that any SRIQb KB can be transformed into
an ALCQI br+eg one with the reduction above, and the results given for SRIQ extend to
SRIQb as well. Our results also show that SRIQ and SRIQb can be easily extended with
a universal role (which is already supported in SROIQ) as long as it is considered to be
non-simple. Note that relaxing this restriction and allowing the universal role in the Boolean
role expressions of SRIQb makes reasoning NEXPTIME hard, see [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        Finally, we point out that with the techniques given here, the query answering problem
for SRIQ can be reduced to ALCQIbr+eg , showing that answering two-way positive regular
path queries as defined in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] over SRIQ and SRIQb knowledge bases is decidable.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-</surname>
          </string-name>
          Schneider, editors.
          <source>The Description Logic Handbook</source>
          . Cambridge University Press,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          .
          <article-title>2ATAs make DLs easy</article-title>
          .
          <source>In Proc. of DL</source>
          <year>2002</year>
          , pages
          <fpage>107</fpage>
          -
          <lpage>118</lpage>
          .
          <source>CEUR Electronic Workshop Proceedings</source>
          , vol.
          <volume>53</volume>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ortiz</surname>
          </string-name>
          .
          <article-title>Answering regular path queries in expressive description logics: An automata-theoretic approach</article-title>
          .
          <source>Technical Report INFSYS RR-1843-08-05.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eiter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Ortiz</surname>
          </string-name>
          .
          <article-title>Answering regular path queries in expressive description logics: An automata-theoretic approach</article-title>
          .
          <source>In Proc. of AAAI'2007</source>
          , pages
          <fpage>391</fpage>
          -
          <lpage>396</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Emerson</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Jutla</surname>
          </string-name>
          .
          <article-title>Tree automata, mu-calculus and determinacy</article-title>
          .
          <source>In Proc. of FOCS'91</source>
          , pages
          <fpage>368</fpage>
          -
          <lpage>377</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>J.</given-names>
            <surname>Heflin</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          .
          <article-title>A portrait of the Semantic Web in action</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>16</volume>
          (
          <issue>2</issue>
          ):
          <fpage>54</fpage>
          -
          <lpage>59</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>The irresistible S RIQ</article-title>
          .
          <source>In Proc. OWLED</source>
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>The even more irresistible S ROIQ</article-title>
          .
          <source>In Proc. KR</source>
          <year>2006</year>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Decidability of S HIQ with complex role inclusion axioms</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2003</year>
          . Morgan-Kaufmann Publishers,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. I. Horrocks,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Tobies</surname>
          </string-name>
          .
          <article-title>Reasoning with individuals for the description logic S HIQ</article-title>
          .
          <source>In Proc. of CADE</source>
          <year>2000</year>
          , pages
          <fpage>482</fpage>
          -
          <lpage>496</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kazakov</surname>
          </string-name>
          .
          <article-title>RIQ and S ROIQ are harder than S HOIQ</article-title>
          .
          <source>In Proc. KR'08</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>C.</given-names>
            <surname>Lutz</surname>
          </string-name>
          and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>The complexity of reasoning with Boolean modal logic</article-title>
          .
          <source>In Proc. of Advances in Modal Logic</source>
          <year>2000</year>
          (AiML
          <year>2000</year>
          ),
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>S.</given-names>
            <surname>Rudolph</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Kro¨tzsch, and</article-title>
          <string-name>
            <given-names>P.</given-names>
            <surname>Hitzler</surname>
          </string-name>
          .
          <article-title>Cheap boolean role constructors for description logics</article-title>
          .
          <source>In Proc. of JELIA</source>
          <year>2008</year>
          , volume
          <volume>5293</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>362</fpage>
          -
          <lpage>374</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>K.</given-names>
            <surname>Schild</surname>
          </string-name>
          .
          <article-title>A correspondence theory for terminological logics: Preliminary report</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>1991</year>
          , pages
          <fpage>466</fpage>
          -
          <lpage>471</lpage>
          ,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>S.</given-names>
            <surname>Tobies</surname>
          </string-name>
          .
          <article-title>Complexity Results and Practical Algorithms for Logics in Knowledge Representation</article-title>
          .
          <source>PhD thesis</source>
          , LuFG Theoretical Computer Science, RWTH-Aachen, Germany,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Vardi</surname>
          </string-name>
          .
          <article-title>Reasoning about the past with two-way automata</article-title>
          .
          <source>In Proc. of ICALP</source>
          <year>2002</year>
          , volume
          <volume>1443</volume>
          <source>of LNCS</source>
          , pages
          <fpage>628</fpage>
          -
          <lpage>641</lpage>
          . Springer,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>M. Y. Vardi</surname>
            and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Wolper</surname>
          </string-name>
          .
          <article-title>Automata-theoretic techniques for modal logics of programs</article-title>
          .
          <source>J. Comput. Syst. Sci.</source>
          ,
          <volume>32</volume>
          :
          <fpage>183</fpage>
          -
          <lpage>221</lpage>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>