<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Goal-Oriented Algorithm for Unification in E L w.r.t. Cycle-Restricted TBoxes?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Franz Baader</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Borgwardt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Barbara Morawska</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>baader</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>stefborg</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>morawska}@tcs.inf.tu-dresden.de</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Theoretical Computer Science</institution>
          ,
          <addr-line>TU Dresden</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Unification in DLs has been proposed in [7] (for the DL F L0, which offers the constructors conjunction (u), value restriction (8r:C), and the top concept (&gt;)) as a novel inference service that can, for instance, be used to detect redundancies in ontologies. For example, assume that one developer of a medical ontology defines the concept of a patient with severe head injury as Patient u 9 nding:(Head_injury u 9severity:Severe); whereas another one represents it as Patient u 9 nding:(Severe_ nding u Injury u 9 nding_site:Head): These two concept descriptions are not equivalent, but they are nevertheless meant to represent the same concept. They can obviously be made equivalent by treating the concept names Head_injury and Severe_ nding as variables, and substituting the first one by Injury u 9 nding_site:Head and the second one by 9severity:Severe. In this case, we say that the descriptions are unifiable, and call the substitution that makes them equivalent a unifier. Intuitively, such a unifier proposes definitions for the concept names that are used as variables: in our example, we know that, if we define Head_injury as Injury u 9 nding_site:Head and Severe_ nding as 9severity:Severe, then the two concept descriptions (1) and (2) are equivalent w.r.t. these definitions. Here equivalence holds without additional GCIs. To motivate our interest in unification w.r.t. GCIs, assume that the second developer uses the description</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>(1)
(2)</p>
      <sec id="sec-1-1">
        <title>Patient u 9status:Emergency u</title>
        <p>(3)</p>
      </sec>
      <sec id="sec-1-2">
        <title>9 nding:(Severe_ nding u Injury u 9 nding_site:Head)</title>
        <p>instead of (2). The descriptions (1) and (3) are not unifiable without additional</p>
        <sec id="sec-1-2-1">
          <title>GCIs, but they are unifiable, with the same unifier as above, if the GCI</title>
        </sec>
      </sec>
      <sec id="sec-1-3">
        <title>9 nding:9severity:Severe v 9status:Emergency</title>
        <p>? Supported by DFG under grant BA 1122/14-1
is present in a background ontology.</p>
        <p>
          In [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], we were able to show that unification in the DL E L (which differs from
F L0 by offering existential restrictions (9r:C) in place of value restrictions) is of
considerably lower complexity than in F L0: the decision problem in E L is
NPcomplete rather than ExpTime-complete in F L0. In addition to a brute-force
“guess and then test” NP-algorithm [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], we have developed a goal-oriented
unification algorithm for E L, in which nondeterministic decisions are only made if
they are triggered by “unsolved parts” of the unification problem [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], and an
algorithm that is based on a reduction to satisfiability in propositional logic (SAT)
[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], which enables the use of highly-optimized SAT solvers. In [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] it was also
shown that the approaches for unification of E L-concept descriptions (without
any background ontology) can easily be extended to the case of an acyclic TBox
as background ontology without really changing the algorithms or increasing
their complexity. Basically, by viewing defined concepts as variables, an acyclic
        </p>
        <sec id="sec-1-3-1">
          <title>TBox can be turned into a unification problem that has as its unique unifier the substitution that replaces the defined concepts by unfolded versions of their definitions. For GCIs, this simple trick is not possible.</title>
        </sec>
        <sec id="sec-1-3-2">
          <title>In [2], we extended the brute-force “guess and then test” NP-algorithm from</title>
          <p>
            [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ] to the case of GCIs, which required the development of a new characterization
of subsumption w.r.t. GCIs in E L. Unfortunately, the algorithm is complete only
for general TBoxes (i.e., finite sets of GCIs) that satisfy a certain restriction
on cycles, which, however, does not prevent all cycles. For example, the cyclic
GCI 9child:Human v Human satisfies this restriction, whereas the cyclic GCI
          </p>
        </sec>
      </sec>
      <sec id="sec-1-4">
        <title>Human v 9parent:Human does not.</title>
        <sec id="sec-1-4-1">
          <title>In the present paper, we describe a goal-oriented algorithm for unification in</title>
          <p>
            E L w.r.t. cycle-restricted general TBoxes, which extends the one from [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ] and
reduces the amount of nondeterministic guesses considerably. Full proofs of the
presented results can be found in [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ].
2
          </p>
          <p>
            The Description Logic E L
Syntax and semantics of E L are defined in the usual way (see, e.g., [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ]). Here,
we just recall that E L-concept descriptions are built from a finite set NC of
concept names and a finite set NR of role names using the concept constructors
top-concept (&gt;), conjunction (C u D), and existential restriction (9r:C for every
r 2 NR). Nested existential restrictions 9r1:9r2: 9rn:C will sometimes also be
written as 9r1r2 : : : rn:C, where r1r2 : : : rn is viewed as a word over the alphabet
of role names, i.e., an element of NR. As usual, concepts C are interpreted as sets
          </p>
        </sec>
        <sec id="sec-1-4-2">
          <title>CI over some domain such that the semantics of the constructors is respected.</title>
          <p>
            A general concept inclusion (GCI) is of the form C v D for concept
descriptions C; D, and a general TBox is a finite set of GCIs. An interpretation I
satisfies such a GCI if CI DI , and it is a model of the general TBox T if it
satisfies all GCIs in T . Subsumption asks whether a given GCI C v D follows
from a general TBox T , i.e. whether every model of T satisfies C v D. In this
case we say C is subsumed by D w.r.t. T and write C vT D. Subsumption in
E L w.r.t. a general TBox is known to be decidable in polynomial time [
            <xref ref-type="bibr" rid="ref9">9</xref>
            ].
          </p>
          <p>An E L-concept description is an atom if it is an existential restriction or a
concept name. The atoms of an E L-concept description C are the subdescriptions
of C that are atoms, and the top-level atoms of C are the atoms occurring in
the top-level conjunction of C. Obviously, any E L-concept description is the
conjunction of its top-level atoms, where the empty conjunction corresponds to
&gt;. The atoms of a general TBox T are the atoms of all the concept descriptions
occurring in T .</p>
        </sec>
        <sec id="sec-1-4-3">
          <title>We say that a subsumption between two atoms is structural if their top-level</title>
          <p>
            structure is compatible. To be more precise, we define structural subsumption
between atoms as follows: the atom C is structurally subsumed by the atom D
w.r.t. T (C vsT D) iff either (i) C = D is a concept name, or (ii) C = 9r:C0,
D = 9r:D0, and C0 vT D0. It is easy to see that subsumption w.r.t. ; between
two atoms implies structural subsumption w.r.t. T , which in turn implies
subsumption w.r.t. T . The unification algorithm in [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] and the one presented below
crucially depend on the following characterization of subsumption:
Lemma 1. Let T be a general TBox and C1; : : : ; Cn, D1; : : : ; Dm atoms. Then
C1 u u Cn vT D1 u u Dm iff for every j 2 f1; : : : ; mg
s Dj , or
1. there is an index i 2 f1; : : : ; ng such that Ci vT
2. there are atoms A1; : : : ; Ak; B of T (k 0) such that
a) A1 u u Ak vT B,
b) for every 2 f1; : : : ; kg there is i 2 f1; : : : ; ng with Ci vsT A , and
c) B vsT Dj .
          </p>
        </sec>
        <sec id="sec-1-4-4">
          <title>Our proof of this lemma in [1] is based on a new Gentzen-style proof calculus</title>
          <p>
            for subsumption w.r.t. a general TBox, which is similar to the one developed in
[
            <xref ref-type="bibr" rid="ref10">10</xref>
            ] for subsumption w.r.t. cyclic and general TBoxes.
          </p>
        </sec>
        <sec id="sec-1-4-5">
          <title>As mentioned in the introduction, our unification algorithm is complete only for general TBoxes that satisfy a certain restriction on cycles.</title>
          <p>Definition 2. The general TBox T is called cycle-restricted iff there is no
nonempty word w 2 NR+ and E L-concept description C such that C vT 9w:C.</p>
        </sec>
        <sec id="sec-1-4-6">
          <title>In [1] we show that a given general TBox can easily be tested for cycle</title>
          <p>restrictedness. The main idea is that it is sufficient to consider the cases where
C is a concept name or &gt;.</p>
          <p>Lemma 3. Let T be a general TBox. It can be decided in time polynomial in
the size of T whether T is cycle-restricted or not.
3</p>
          <p>Unification in E L w.r.t. General TBoxes</p>
        </sec>
        <sec id="sec-1-4-7">
          <title>We partition the set NC into a set Nv of concept variables (which may be</title>
          <p>replaced by substitutions) and a set Nc of concept constants (which must not be
replaced by substitutions). A substitution maps every concept variable to an
E L-concept description. It is extended to concept descriptions in the usual way:
–
–
(A) := A for all A 2 Nc [ f&gt;g,
(C u D) := (C) u (D) and</p>
          <p>(9r:C) := 9r: (C).</p>
          <p>An E L-concept description C is ground if it does not contain variables.
Obviously, a ground concept description is not modified by applying a substitution.</p>
        </sec>
        <sec id="sec-1-4-8">
          <title>A general TBox is ground if it does not contain variables.</title>
          <p>Definition 4. Let T be a general TBox that is ground. An E L-unification
problem w.r.t. T is a finite set = fC1 v? D1; : : : ; Cn v? Dng of subsumptions
between E L-concept descriptions. A substitution is a unifier of w.r.t. T if
solves all the subsumptions in , i.e. if (C1) vT (D1); : : : ; (Cn) vT (Dn).
We say that is unifiable w.r.t. T if it has a unifier.</p>
          <p>
            Note that we have restricted the background general TBox T to be ground.
This is not without loss of generality. If T contained variables, then we would
need to apply the substitution also to its GCIs, and instead of requiring (Ci) vT
(Di) we would thus need to require (Ci) v (T ) (Di), which would change
the nature of the problem considerably (see [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ] for a more detailed discussion).
          </p>
        </sec>
        <sec id="sec-1-4-9">
          <title>Preprocessing To simplify the description of the algorithm, it is convenient to</title>
          <p>first normalize the TBox and the unification problem appropriately. An atom is
called flat if it is a concept name or an existential restriction of the form 9r:A
for a concept name A. The general TBox T is called flat if it contains only GCIs
of the form A u B v C, where A; B are flat atoms or &gt; and C is a flat atom.</p>
        </sec>
        <sec id="sec-1-4-10">
          <title>The unification problem is called flat if it contains only flat subsumptions of</title>
          <p>the form C1 u u Cn v? D, where n 0 and C1; : : : ; Cn; D are flat atoms.1</p>
          <p>
            Let be a unification problem and T a general TBox. By introducing
auxiliary variables and concept names, respectively, and T can be transformed in
polynomial time into a flat unification problem 0 and a flat general TBox T 0
such that the unifiability status remains unchanged, i.e., has a unifier w.r.t.
T iff 0 has a unifier w.r.t. T 0. In addition, if T was cycle-restricted, then so is
T 0 (see [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ] for details). Thus, we can assume without loss of generality that the
input unification problem and general TBox are flat.
          </p>
        </sec>
        <sec id="sec-1-4-11">
          <title>Local Unifiers The main idea underlying the “in NP” results in [4,2] is to show</title>
          <p>that any E L-unification problem that is unifiable has a so-called local unifier. Let
T be a flat cycle-restricted TBox and a flat unification problem. The atoms
of are the atoms of all the concept descriptions occurring in . We define
Atnv := At n Nv</p>
          <p>At := fC j C is an atom of T or of
(non-variable atoms).</p>
          <p>g and</p>
        </sec>
        <sec id="sec-1-4-12">
          <title>Every assignment S of subsets SX of Atnv to the variables X in Nv induces the following relation &gt;S on Nv: &gt;S is the transitive closure of</title>
          <p>f(X; Y ) 2 Nv</p>
          <p>Nv j Y occurs in an element of SX g:
1 If n = 0, then we have an empty conjunction on the left-hand side, which as usual
stands for &gt;.</p>
        </sec>
        <sec id="sec-1-4-13">
          <title>We call the assignment S acyclic if &gt;S is irreflexive (and thus a strict partial</title>
          <p>order). Any acyclic assignment S induces a unique substitution S , which can
be defined by induction along &gt;S :
– If X 2 Nv is minimal w.r.t. &gt;S , then we define S (X) := d
– Assume that (Y ) is already defined for all Y such that X &gt;D2SSYX .DT. hen we
define S (X) := dD2SX S (D):</p>
        </sec>
        <sec id="sec-1-4-14">
          <title>We call a substitution local if it is of this form, i.e., if there is an acyclic assign</title>
          <p>ment S such that = S . If the unifier of w.r.t. T is a local substitution,
then we call it a local unifier of w.r.t. T .</p>
          <p>
            The main technical result shown in [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] is that any unifiable E L-unification
problem w.r.t. a cycle-restricted TBox has a local unifier. This yields the
following brute-force unification algorithm for E L w.r.t. cycle-restricted TBoxes: first
guess an acyclic assignment S, and then check whether the induced local
substitution S solves . As shown in [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ], this algorithm runs in nondeterministic
polynomial time. NP-hardness follows from the fact that already unification in
E L w.r.t. the empty TBox is NP-hard [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ].
4
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>A Goal-Oriented Unification Algorithm</title>
      <p>
        The brute-force algorithm is not practical since it blindly guesses an acyclic
assignment and only afterwards checks whether the guessed assignment induces
a unifier. We now introduce a more goal-oriented unification algorithm, in which
nondeterministic decisions are only made if they are triggered by “unsolved parts”
of the unification problem. In addition, failure due to wrong guesses can be
detected early. Any non-failing run of the algorithm produces a unifier, i.e.,
there is no need for checking whether the assignment computed by this run
really produces a unifier. This goal-oriented algorithm generalizes the algorithm
for unification in E L w.r.t. the empty TBox introduced in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], though the rules
look quite different because in the present paper we consider unification problems
that consist of subsumptions whereas in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] we considered equivalences.
      </p>
      <p>We assume without loss of generality that the cycle-restricted TBox T and
the unification problem 0 are flat. Given T and 0, the sets At and Atnv are
defined as above. Starting with 0, the algorithm maintains a current unification
problem and a current acyclic assignment S, which initially assigns the empty
set to all variables. In addition, for each subsumption in it maintains the
information on whether it is solved or not. Initially, all subsumptions are unsolved,
except those with a variable on the right-hand side. Rules are applied only to
unsolved subsumptions. A (non-failing) rule application does the following:
– it solves exactly one unsolved subsumption,
– it may extend the current assignment S, and
– it may introduce new flat subsumptions built from elements of At.</p>
      <sec id="sec-2-1">
        <title>Each rule application that extends SX additionally expands w.r.t. X as follows:</title>
        <p>every subsumption s 2 of the form C1 u u Cn v? X is expanded by adding
the subsumption C1 u u Cn v? A to for every A 2 SX .</p>
        <p>Eager Ground Solving:
Eager Solving:</p>
        <p>Condition: This rule applies to s = C1 u u Cn v? D if it is ground.
Action: If C1 u u Cn vT D does not hold, the rule application fails.
Otherwise, s is marked as solved.</p>
        <p>Condition: This rule applies to s = C1 u u Cn v? D if either
– there is i 2 f1; : : : ; ng such that Ci = D or Ci = X 2 Nv and D 2 SX , or
– D is ground and d G vT D holds, where G is the set of all ground atoms
in fC1; : : : ; Cng [ SX2fC1;:::;Cng\Nv SX .</p>
        <p>Action: Its application marks s as solved.</p>
        <p>Eager Extension:</p>
        <p>Condition: This rule applies to s = C1 u uCn v? D if there is i 2 f1; : : : ; ng
with Ci = X 2 Nv and fC1; : : : ; Cng n fXg SX .</p>
        <p>Action: Its application adds D to SX . If this makes S cyclic, the rule
application fails. Otherwise, is expanded w.r.t. X and s is marked as solved.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Subsumptions are only added if they are not already present in . If a new</title>
        <p>subsumption is added to , either by a rule application or by expansion of ,
then it is initially designated unsolved, except if it has a variable on the
righthand side. Once a subsumption is in , it will not be removed. Likewise, if a
subsumption in is marked as solved, then it will not become unsolved later.</p>
      </sec>
      <sec id="sec-2-3">
        <title>If a subsumption is marked as solved, this does not mean that it is already</title>
        <p>solved by the substitution induced by the current assignment. It may be the
case that the task of satisfying the subsumption was deferred to solving other
subsumptions which are “smaller” than the given subsumption in a well-defined
sense. The task of solving a subsumption whose right-hand side is a variable is
deferred to solving the subsumptions introduced by expansion.</p>
      </sec>
      <sec id="sec-2-4">
        <title>The rules of the algorithm consist of the three eager rules Eager Ground</title>
      </sec>
      <sec id="sec-2-5">
        <title>Solving, Eager Solving, and Eager Extension (see Figure 1), and several nondeterministic rules (see Figures 2 and 3). Eager rules are applied with higher priority than nondeterministic rules. Among the eager rules, Eager Ground Solving has the highest priority, then comes Eager Solving, and then Eager Extension.</title>
        <p>Algorithm 5. Let 0 be a flat E L-unification problem. We set := 0 and
SX := ; for all X 2 Nv. While contains an unsolved subsumption, apply the
steps (1) and (2). Once all subsumptions are solved, return the substitution
induced by the current assignment.
(1) Eager rule application: If some eager rules apply to an unsolved
subsumption s in , apply one of highest priority. If the rule application fails,
then return “not unifiable”.
(2) Nondeterministic rule application: If no eager rule is applicable, let s be
an unsolved subsumption in . If one of the nondeterministic rules applies
to s, nondeterministically choose one of these rules and apply it. If none of
these rules apply to s or the rule application fails, then return “not unifiable”.</p>
        <p>Condition: This rule applies to s = C1 u u Cn v? 9s:D0 if there is at least
one index i 2 f1; : : : ; ng with Ci = 9s:C0.</p>
        <p>Action: Its application chooses such an index i, adds the subsumption C0 v?
D0 to , expands it w.r.t. D0 if D0 is a variable, and marks s as solved.
Extension:</p>
        <p>Condition: This rule applies to s = C1 u u Cn v? D if there is at least one
i 2 f1; : : : ; ng with Ci 2 Nv.</p>
        <p>Action: Its application chooses such an i and adds D to SCi . If this makes S
cyclic, the rule application fails. Otherwise, is expanded w.r.t. Ci and s is
marked as solved.</p>
      </sec>
      <sec id="sec-2-6">
        <title>In step (2), the choice which unsolved subsumption to consider next is don’t</title>
        <p>care nondeterministic. However, choosing which rule to apply to the chosen
subsumption is don’t know nondeterministic. Additionally, the application of
nondeterministic rules requires don’t know nondeterministic guessing.</p>
        <p>
          The eager rules are mainly there for optimization purposes, i.e., to avoid
nondeterministic choices if a deterministic decision can be made. For example,
a ground subsumption, as considered in the Eager Ground Solving rule, either
follows from the TBox, in which case any substitution solves it, or it does not,
in which case it does not have a solution. This condition can be checked in
polynomial time using the polynomial time subsumption algorithm for E L [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. In
the case considered in the Eager Solving rule, the substitution induced by the
current assignment already solves the subsumption. In fact, if the first (second)
condition of the rule is satisfied, then the first (second) condition of Lemma 1
applies. The Eager Extension rule solves a subsumption that contains only a
variable X and some elements of SX on the left-hand side. The rule is
motivated by the following observation: for any assignment S0 extending the current
assignment, the induced substitution 0 satisfies 0(X) 0(C1) u : : : u 0(Cn).
Thus, if SX0 contains D, then 0(X) vT 0(D), and 0 solves the subsumption.
Conversely, if 0 solves the subsumption, then 0(X) vT 0(D), and thus adding
D to SX0 yields an equivalent induced substitution.
        </p>
        <p>The nondeterministic rules only come into play if no eager rules can be
applied. In order to solve an unsolved subsumption s = C1 u u Cn v? D, we
consider the two conditions of Lemma 1. Regarding the first condition, which is
addressed by the rules Decomposition and Extension, assume that is induced
by an acyclic assignment S. To satisfy the first condition of the lemma with ,
the atom (D) must subsume a top-level atom in (C1) u u (Cn). This atom
can either be of the form (Ci) for an atom Ci, or it can be of the form (C) for
an atom C 2 SCi and a variable Ci. In the second case, the atom C can either
already be in SCi or it can be put into SCi by an application of the Extension rule.</p>
      </sec>
      <sec id="sec-2-7">
        <title>The Mutation rules cover the second condition in Lemma 1. For example, let us analyze in detail how Mutation 1 ensures that all the requirements of the second condition of Lemma 1 are satisfied. Whenever this condition requires a structural</title>
        <p>Mutation 1:</p>
        <p>Condition: This rule applies to s = C1 u u Cn v? D if n &gt; 1 and there are
atoms A1; : : : ; Ak; B of T such that A1 u u Ak vT B holds.</p>
        <p>Action: Its application chooses such atoms, marks s as solved, and generates
the following subsumptions:
– it chooses for each 2 f1; : : : ; kg an i 2 f1; : : : ; ng and adds the new
subsumption Ci v? A to ,
– it adds B v? D to .</p>
        <p>Mutation 2:</p>
        <p>Condition: This rule applies to s = 9r:X v? D if X is a variable, D is ground,
and there are atoms 9r:A1; : : : ; 9r:Ak of T such that 9r:A1 u u 9r:Ak vT D
holds.</p>
        <p>Action: Its application chooses such atoms, adds A1; : : : ; Ak to SX , expands
w.r.t. X, and marks s as solved.</p>
        <p>Mutation 3:</p>
        <p>Condition: This rule applies to s = 9r:X v? 9s:Y if X and Y are variables,
and there are atoms 9r:A1; : : : ; 9r:Ak; 9s:B of T with 9r:A1 u u 9r:Ak vT
9s:B.</p>
        <p>Action: Its application chooses such atoms, marks s as solved, and generates
the following subsumptions:
– it adds A1; : : : ; Ak to SX and expands w.r.t. X,
– it adds the subsumption B v? Y to and expands it w.r.t. Y .</p>
        <p>Mutation 4:</p>
        <p>Condition: This rule applies to s = C v? 9s:Y if C is a ground atom or &gt;,
Y is a variable, and there is an atom 9s:B of T such that C vT 9s:B holds.
Action: Its application chooses such an atom, adds the new subsumption B v?
Y to , expands this subsumption w.r.t. Y , and marks s as solved.
subsumption (E) vsT (F ) to hold for a (hypothetical) unifier of , the rule
creates the new subsumption E v? F , which has to be solved later on. This way,
the rule ensures that the substitution built by the algorithm actually satisfies
the conditions of the lemma. To check the subsumption A1 u u Ak vT B, the
rule again employs a polynomial-time subsumption algorithm.</p>
      </sec>
      <sec id="sec-2-8">
        <title>The other mutation rules follow the same idea, but they implicitly apply one or more Decomposition or Eager Extension rules after mutation. This ensures that the generated subsumptions are “smaller” than the subsumption that triggers their introduction.</title>
        <p>
          Soundness We will show that, if Algorithm 5 returns a substitution on input
0, then is a unifier of 0 w.r.t. T . In the following, let S be the final acyclic
assignment computed by a non-failing run of Algorithm 5 on input 0, and
the substitution induced by S. By b we denote the final set of subsumptions
computed by this run, i.e., the original subsumptions of 0 together with the
new ones generated by rule applications. To show that solves all subsumptions
in b, we use well-founded induction [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] on the well-founded order on b:
Definition 6. Let s = C1 u u Cn v? Cn+1 2 b.
        </p>
        <p>– s is small if n = 1 and C1 is ground or Cn+1 is ground.
– We define m(s) := (m1(s); m2(s); m3(s)), where
m1(s) := 0 if s is small, and m1(s) := 1 otherwise;
m2(s) := X if Cn+1 = X or Cn+1 = 9r:X for a variable X and some
r 2 NR, and m2(s) := ? otherwise;
m3(s) := maxfrd( (Ci)) j i 2 f1; : : : ; n + 1gg where rd yields the role
depth of a concept description, i.e., the maximal nesting of existential
restrictions.
– The strict partial order on such triples is the lexicographic order, where
the first and the third component are compared w.r.t. the normal order &gt; on
natural numbers. The variables in the second component are compared w.r.t.
the relation &gt;S induced by S, and ? is smaller than any variable.
– We extend to b by setting s1 s2 iff m(s1) m(s2).</p>
      </sec>
      <sec id="sec-2-9">
        <title>As the lexicographic product of well-founded strict partial orders is again</title>
        <p>
          well-founded [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], is a well-founded strict partial order on b.
        </p>
        <p>Lemma 7.</p>
        <p>is an E L-unifier of b w.r.t. T , and thus also of its subset 0.</p>
        <p>
          Proof. Let s 2 b and assume that solves all subsumptions s0 2 b with s0 s.
– If s has a non-variable atom as its right-hand side, then it was initially
marked as unsolved and must have been marked solved by a successful rule
application. As an example, we consider the application of the Decomposition
rule (the other rules can be treated similarly [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]). Then s is of the form
C1 u u Cn v? 9s:D0 with Ci = 9s:C0 for some i 2 f1; : : : ; ng and we
have s0 = C0 v? D0 2 b. We will show that s s0 holds. By induction, this
implies that solves s0, and by Lemma 1 thus also s.
        </p>
        <p>Observe first that m2(s) = m2(s0) since either 9s:D0 and D0 contain the
same variable or are both ground. We now make a case distinction based on
m1(s0). If s0 is small, then s is either non-small, i.e. m1(s) &gt; m1(s0), or small
and of the form 9s:C0 v? 9s:D0. In the second case, we have m1(s) = m1(s0)
and m3(s) &gt; m3(s0). If s0 is non-small, then both C0 and D0 are variables,
and thus s is also non-small, which yields m1(s) = m1(s0). Furthermore, the
maximal role depth obviously decreases when going from s to s0, and thus
m3(s) &gt; m3(s0). In all cases we have shown m(s) m(s0), i.e., s s0.
– If s has a variable as its right-hand side, it is of the form C1 u u Cn v? X
and for every A 2 SX there is a subsumption sA = C1 u u Cn v? A in b.
If s is small, then n = 1 and C1 is ground, and thus the subsumptions sA are
also small. Thus, we have m1(s) m1(sA) for every A 2 SX . Furthermore,
we have m2(s) &gt; m2(sA) since A is ground or contains a variable on which
X depends. This yields s sA, and thus by induction (C1)u u (Cn) vT
(A) for every A 2 SX , which implies that (C1) u u (Cn) vT (X) by
the definition of .
Completeness Assume that 0 is unifiable w.r.t. T and let be a ground unifier
of 0 w.r.t. T . We use this unifier to guide the application of the nondeterministic
rules such that Algorithm 5 does not fail. The following invariants for and S
will be maintained:
(I) is a unifier of .
(II) For all B 2 SX we have (X) vT
(B).</p>
        <p>Since SX is initialized to ; for all variables X 2 Nv and is initialized to 0,
these invariants are satisfied after the initialization of the algorithm.</p>
      </sec>
      <sec id="sec-2-10">
        <title>The invariants immediately rule out one cause of failure for the algorithm,</title>
        <p>namely that the current assignment becomes cyclic. This is the only place in the
whole proof where our assumption on cycle-restrictedness of T is needed.
Lemma 8. If invariant (II) is satisfied, then the current assignment S is acyclic.</p>
      </sec>
      <sec id="sec-2-11">
        <title>The proofs of this and of the next lemma can be found in [1].</title>
        <p>Lemma 9. Assume that the current set of subsumptions
signment S satisfy the invariants (I) and (II), and let s 2
and the current
asbe unsolved.
1. If an eager rule applies to s, then its application does not fail and the resulting
set 0 and assignment S0 also satisfy the invariants (I) and (II).
2. If no eager rule applies to s, then there is a nondeterministic rule that can
successfully be applied to s such that the resulting set 0 and assignment S0
also satisfy the invariants (I) and (II).</p>
      </sec>
      <sec id="sec-2-12">
        <title>An immediate consequence of this lemma is that, if 0 is unifiable, then there</title>
        <p>is a non-failing run of Algorithm 5 on 0 during which the invariants (I) and (II)
are satisfied. Together with the fact that every run of the algorithm terminates
(see below), this shows completeness, i.e., whenever 0 has a unifier w.r.t. T ,
the algorithm computes one.</p>
      </sec>
      <sec id="sec-2-13">
        <title>Termination Consider a run of Algorithm 5. It is easy to show that any sub</title>
        <p>sumption encountered during this run falls into one of the following categories:</p>
      </sec>
      <sec id="sec-2-14">
        <title>1. subsumptions from 0;</title>
        <p>2. subsumptions created by expansion from 0: these are of the form C1 u</p>
        <p>Cn v? A for a subsumption C1 u u Cn v? X 2 0 and A 2 Atnv;
3. subsumptions of the form C v? D for C; D 2 At.
u
Since the cardinality of At is polynomially bounded by the size of 0 and T ,
there are only polynomially many subsumptions of this form. Rules are only
applicable to subsumptions that are marked unsolved, and the application of a
rule marks at least one subsumption as solved. Thus, only polynomially many
rules can be applied during the run. In addition, each rule application takes
only polynomial time. This shows that every run of the algorithm terminates in
polynomial time.</p>
        <p>Theorem 10. Algorithm 5 is an NP-decision procedure for unifiability in E L
w.r.t. cycle-restricted TBoxes.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>
        We have presented a goal-oriented NP-algorithm for unification in E L w.r.t.
cycle-restricted TBoxes. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we developed a reduction of this problem to a
propositional satisfiability problem (SAT), which is based on a characterization
of subsumption different from the one in Lemma 1. Though clearly better than
the brute-force algorithm introduced in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], both algorithms suffer from a high
degree of nondeterminism introduced by having to guess atoms and GCIs from
the underlying cycle-restricted TBox. We have to find optimizations to tackle
this problem before an implementation becomes feasible.
      </p>
      <sec id="sec-3-1">
        <title>On the theoretical side, the main topic for future research is to consider</title>
        <p>unification w.r.t. unrestricted general TBoxes. In order to generalize the
bruteforce algorithm in this direction, we need to find a more general notion of locality.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Starting with the goal-oriented algorithm, the idea would be not to fail when a cyclic assignment is generated, but rather to add rules that can break such cycles, similar to what is done in procedures for general E-unification [11].</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Unification in the description logic EL w</article-title>
          .r.t. cycle
          <article-title>-restricted TBoxes</article-title>
          .
          <source>LTCS-Report 11-05</source>
          , Chair of Automata Theory, TU Dresden, Germany (
          <year>2011</year>
          ), see http://lat.inf.tu-dresden.de/research/reports.html.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Extending unification in EL towards general TBoxes</article-title>
          .
          <source>In: Proc. KR'12</source>
          . AAAI Press (
          <year>2012</year>
          ),
          <article-title>short paper</article-title>
          . To appear.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>SAT encoding of unification in ELHR+ w</article-title>
          .r.t. cycle
          <article-title>-restricted ontologies</article-title>
          .
          <source>LTCS-Report 12-02</source>
          ,
          <article-title>Chair for Automata Theory</article-title>
          , TU Dresden (
          <year>2012</year>
          ), see http://lat.inf.tu-dresden.de/research/reports.html.
          <article-title>A short version of this report has been submitted to a conference.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Unification in the description logic EL</article-title>
          .
          <source>In: Proc. RTA'09. LNCS</source>
          , vol.
          <volume>5595</volume>
          , pp.
          <fpage>350</fpage>
          -
          <lpage>364</lpage>
          . Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>SAT encoding of unification in EL</article-title>
          .
          <source>In: Proc. LPAR'10. LNCS</source>
          , vol.
          <volume>6397</volume>
          , pp.
          <fpage>97</fpage>
          -
          <lpage>111</lpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morawska</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Unification in the description logic EL</article-title>
          .
          <source>Log. Meth. Comput. Sci. 6</source>
          (
          <issue>3</issue>
          ) (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Narendran</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Unification of concept terms in description logics</article-title>
          .
          <source>J. Symb. Comput</source>
          .
          <volume>31</volume>
          (
          <issue>3</issue>
          ),
          <fpage>277</fpage>
          -
          <lpage>305</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nipkow</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Term Rewriting</article-title>
          and
          <string-name>
            <given-names>All</given-names>
            <surname>That</surname>
          </string-name>
          . Cambridge University Press, United Kingdom (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Brandt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Polynomial time reasoning in a description logic with existential restrictions, GCI axioms, and-what else?</article-title>
          <source>In: Proc. ECAI'04</source>
          . pp.
          <fpage>298</fpage>
          -
          <lpage>302</lpage>
          . IOS Press (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hofmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Proof-theoretic approach to description-logic</article-title>
          .
          <source>In: Proc. LICS'05</source>
          . pp.
          <fpage>229</fpage>
          -
          <lpage>237</lpage>
          . IEEE Press (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Morawska</surname>
          </string-name>
          , B.:
          <article-title>General E-unification with eager variable elimination and a nice cycle rule</article-title>
          .
          <source>J. Autom. Reasoning</source>
          <volume>39</volume>
          (
          <issue>1</issue>
          ),
          <fpage>77</fpage>
          -
          <lpage>106</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>