<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Tailoring OWL for Data Intensive Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Diego Calvanese</string-name>
          <email>calvanese@inf.unibz.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giuseppe De Giacomo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Domenico Lembo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maurizio Lenzerini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Rosati</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dip. di Informatica e Sistemistica Universita` di Roma “La Sapienza” Via Salaria</institution>
          <addr-line>113 I-00198 Roma</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty of Computer Science Free University of Bozen-Bolzano Piazza Domenicani 3 I-39100 Bolzano</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The idea of using ontologies as a conceptual view over data repositories is becoming more and more popular. In these contexts, data are typically very large (much larger than the intentional level of the ontologies), and query answering becomes the basic reasoning services. In these contexts query answering should be very efficient on the data, and currently the only technology that is available to deal with large amounts of data is the one provided by relational data management systems (RDBMS). In this paper we advocate that for such contexts a suitable fragment of OWL-DL should be devised. Such a fragment must allow forms of query answering that exploit RDBMS when reasoning on the data, while it must include the main modeling features of conceptual models like UML class diagrams and ER diagrams. In particular it must include cyclic assertions, ISA on concepts, inverses of roles, role typing, mandatory participation to roles, and functional restrictions on roles. Also the query language should go beyond the expressive capabilities of concept expressions in description logics, and include at least conjunctive queries (corresponding to the select-project-join fragment of SQL). We discuss this issues by exhibiting a fragment of OWL-DL that includes all such features, namely DL-Lite, and showing that such a fragment is essentially maximal.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The idea of using ontologies as a conceptual view over data repositories is becoming
more and more popular. For example, in Enterprise Application Integration Systems,
Data Integration Systems [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], and the Semantic Web [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], data become instances of
concepts in ontologies. In these contexts, data are typically very large and dominate
the intentional level of the ontologies. Hence, when measuring the computational
complexity of reasoning, the most important parameter is the size of the data, i.e., one is
interested in data complexity [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. While in all the above mentioned contexts one could
still accept reasoning that is exponential on the intentional part, it is mandatory that
reasoning is polynomial (actually less – see later) in the data. A second fundamental
requirement is the possibility to answer queries over an ontology that are more
complex than the simple queries (i.e., concepts and roles) usually considered in Description
Logics (DLs) research.
      </p>
      <p>
        Traditionally, research carried out in DLs has not paid much attention to data
complexity (see Section 4 for a detailed discussion), and only recently efficient management
of large amounts of data has become a primary concern in ontology reasoning
systems [
        <xref ref-type="bibr" rid="ref10 ref14">14, 10</xref>
        ]. Unfortunately, research on the trade-off between expressive power and
computational complexity of reasoning has shown that many DLs with efficient, i.e.,
worst-case polynomial time, reasoning algorithms lack the modeling power required
for capturing conceptual models (such as UML class diagrams and Entity-Relationship
diagrams) and basic ontology languages. On the other hand, whenever the complexity
of reasoning is exponential in the size of the instances (as for example for OWL-DL,
in Racer1, and in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]), there is little hope for effective instance management. Indeed,
the only technology that is currently available to deal with complex queries over large
amounts of data is the one provided by relational data management systems (RDBMS),
and it cannot be directly exploited in these cases.
      </p>
      <p>In this paper we advocate that for those contexts where ontologies are used to access
large amounts of data, a suitable fragment of OWL-DL should be devised, specifically
tailored to capture conceptual modeling constructs, while keeping query answering
efficient. Specifically, efficiency of query answering should be achieved by delegating data
storage and query answering to an RDBMS. The fragment should include the main
modeling features of conceptual models, which are also at the base of most ontology
languages. These features include cyclic assertions, ISA on concepts, inverses on roles,
role typing, mandatory participation to roles, and functional restrictions of roles. Also,
the query language should go beyond the expressive capabilities of concept expressions
in DLs, and include at least conjunctive queries (corresponding to the select-project-join
fragment of SQL).</p>
      <p>
        We present a DL, called DL-Lite, that exhibits all the above characteristics [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The
distinguishing features of DL-Lite are that the extensional component of a knowledge
base, the ABox, is maintained by an RDBMS in secondary storage, and that query
answering can be performed as a two step process: in the first step, a query posed
over the knowledge base is reformulated, taking into account the intensional component
(the TBox) only, obtaining a union of conjunctive queries; in the second step such a
union is directly evaluated over the ABox, and the evaluation can be carried out by an
SQL engine, taking advantage of well established query optimization strategies. Since
the first step does not depend on the data, and the second step is the evaluation of a
relational query over a databases, the whole query answering process is in LOGSPACE
in the data [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        We show also that DL-Lite is essentially the maximal fragment exhibiting such a
desirable property, and allowing one to delegate query evaluation to a relational
engine [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Indeed, even slight extensions of DL-Lite make query answering (actually
already instance checking, i.e., answering atomic queries) at least NLOGSPACE in data
complexity, ruling out the possibility that query evaluation could be performed by a
relational engine.
2
      </p>
      <p>DL-Lite
As usual in DLs, DL-Lite allows for representing the domain of interest in terms of
concepts, denoting sets of objects, and roles, denoting binary relations between objects.</p>
    </sec>
    <sec id="sec-2">
      <title>1 http://www.sts.tu-harburg.de/˜r.f.moeller/racer/</title>
      <p>In DL-Lite2, concepts and roles are defined as follows:</p>
      <p>C ::= A j ? j 9R j C1 u C2</p>
      <p>R ::= P j P ¡
where A denotes an atomic concept and P denotes an atomic role; R denotes a (generic)
role, which can either be an atomic role or its inverse; C (possibly with subscript)
denotes a (generic) concept that can be either an atomic concept, the empty concept
?, a concept of the form 9R, i.e., the standard DL construct of unqualified existential
quantification on roles, or the conjunction of two concepts. Note that we allow for a
limited form of negation through ? (sufficient to capture disjointness of concepts), but
we do not allow for disjunction.</p>
      <p>A DL-Lite knowledge base (KB) K = hT ; Ai is constituted by two components: a
TBox T , used to represent intensional knowledge, and an ABox A, used to represent
extensional information. DL-Lite TBox assertions are of the form</p>
      <p>C1 v C2
(funct R)
inclusion assertion
functionality assertion
An inclusion assertion expresses that a concept C1 is subsumed by a concept C2, while a
functionality assertion expresses the (global) functionality of a role (atomic or inverse).</p>
      <p>As for the ABox, DL-Lite allows for assertions of the form:</p>
      <p>C(a);</p>
      <p>R(a; b)</p>
      <p>membership assertions
where a and b are constants. These assertions state respectively that the object denoted
by a is an instance of the concept C, and that the pair of objects denoted by (a; b) is an
instance of the role R.</p>
      <p>Although DL-Lite is quite simple from the language point of view, it allows for
querying the extensional knowledge of a KB in a much more powerful way than usual
DLs, in which only membership to a concept or to a role can be asked. Specifically,
DLLite allows for using conjunctive queries of arbitrary complexity. A conjunctive query
(CQ) q over a knowledge base K is an expression of the form q(x) Ã 9y.conj (x; y),
where x are the so-called distinguished variables, y are existentially quantified
variables called the non-distinguished variables, and conj (x; y) is a conjunction of atoms
of the form C(z), or R(z1; z2), where C and R are respectively a concept and a role in
K, and z, z1, z2 are constants of a fixed infinite domain ¢ (see later) or variables in x
or y.</p>
      <p>
        The semantics of DL-Lite is given in terms of interpretations over a fixed infinite
domain ¢3. We assume to have one constant for each object, denoting exactly that
object. In other words, we have standard names [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and we will not distinguish between
the alphabet of constants and ¢.
2 The variant of DL-Lite presented here is a slight extension of the language presented in [
        <xref ref-type="bibr" rid="ref7 ref8">8, 7</xref>
        ],
since it allows for conjunction of concepts in the left-hand side of inclusion assertions.
3 The assumption of having a fixed interpretation domain is done for convenience in the
definition of query answering (see later).
      </p>
      <p>An interpretation I = (¢; ¢I ) consists of a first order structure over ¢ with an
interpretation function ¢I such that:</p>
      <p>AI µ ¢
(9R)I = fc j 9c0. (c; c0) 2 RI g
P I µ ¢ £ ¢
?I = ;
(C1 u C2)I = C1I \ CI</p>
      <p>2
(P ¡)I = f(c; c0) j (c0; c) 2 P I g</p>
      <p>An interpretation I is a model of an inclusion assertion C1 v C2 if and only if
C1I µ C2I ; I is a model of a functionality assertion (funct R) if (c; c0) 2 RI and
(c; c00) 2 RI implies c0 = c00; I is a model of a membership assertion C(a) (resp.,
R(a; b)) if a 2 CI (resp., (a; b) 2 RI ). A model of a KB K is an interpretation I that
is a model of all assertions in K. A KB is satisfiable if it has at least one model. A KB
K logically implies an assertion ® if all the models of K are also models of ®. A query
q(x) Ã 9y.conj (x; y) is interpreted in I as the set qI of tuples c 2 ¢ £ ¢ ¢ ¢ £ ¢ such
that, when we substitute the variables x with the constants c, the formula 9y.conj (x; y)
evaluates to true in I.</p>
      <p>Since DL-Lite deals with conjunctive queries, the basic reasoning services that are
of interest are:
– Query answering: given a query q with distinguished variables x and a KB K,
return the set ans(q; K) of tuples c of constants of K such that c 2 qI , for every
model I of K. Note that this task generalizes instance checking in DLs, i.e.,
checking whether a given object is an instance of a specified concept in every model of
the knowledge base.
– Query containment: given two queries q1 and q2 and a KB K, verify whether q1I µ
q2I , for every model I of K. Note that this task generalizes logical implication of
inclusion assertions in DLs.
– KB satisfiability: verify whether a KB is satisfiable.</p>
      <p>Although equipped with advanced reasoning services, at first sight DL-Lite might
seem rather weak in modeling intensional knowledge, and hence of limited use in
practice. In fact, this is not the case. Despite the simplicity of its language and the
specific form of inclusion assertions allowed, DL-Lite is able to capture the main notions
(though not all, obviously) of both ontologies, and of conceptual modeling formalisms
used in databases and software engineering, such as Entity-Relationship diagrams and
UML class diagrams. In particular, DL-Lite assertions allow us to specify: ISA, e.g.,
stating that a concept A1 is subsumed by a concept A2, using A1 v A2; disjointness,
e.g., between concepts A1 and A2, using A1 u A2 v ?; role-typing, e.g., stating that
the first (resp., second) component of the role P is an instance of A, using 9P v A
(resp., 9P ¡ v A); participation constraints, e.g., stating that all instances of a concept
A participate to a role P as the first (resp., second) component, using A v 9P (resp.,
A v 9P ¡); non-participation constraints, using Au9R v ?; functionality restrictions
on roles, using (funct R).</p>
      <p>Observe that DL-Lite does allow for cyclic assertions without falling into
intractability. Indeed, we can enforce the cyclic propagation of the existence of a P -successor
using the two DL-Lite inclusion assertions A v 9P and 9P ¡ v A. The constraint
imposed on a model is similar to the one imposed by the ALN cyclic assertion A v
T
A</p>
      <p>Perfect
reformulation
rq;T</p>
      <p>Query
evaluation</p>
      <p>
        ans(q; hT ; Ai)
9P u 8P .A, though stronger, since it additionally enforces the second component of P
to be typed by A. In order to keep tractability even in the presence of cycles, DL-Lite
imposes restrictions on the use of the 8R.C construct, which, if used together with
inclusion assertions, immediately would lead to intractability of TBox reasoning [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and
of query answering (cf. Table 1).
      </p>
      <p>Finally, notice that DL-Lite is a strict subset of OWL-Lite, the least expressive
variant of OWL4, which presents some constructs (e.g., some kinds of role restrictions) that
cannot be expressed in DL-Lite, and that make reasoning in OWL-Lite non-tractable in
general.
3</p>
      <sec id="sec-2-1">
        <title>Reasoning in DL-Lite</title>
        <p>
          We discuss now reasoning in DL-Lite, and concentrate on the basic reasoning task in the
context of using ontologies to access large data repositories, namely that of answering
(conjunctive) queries over a DL-Lite knowledge base. The other forms of reasoning can
be reduced to query answering [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. For example, to check whether K is unsatisfiable,
we can simply add the inclusion A1 u A2 v ? to the TBox and the assertion A1(a) to
the ABox (where A1, A2 are new atomic concepts and a is new constant), and check
whether a is in the answer to the query q(x) Ã A2(x). Similarly, to check whether K
implies A v C, we can simply add the assertion A(a) to the Abox (where a is new
constant), and check whether a is in the answer to the query q(x) Ã C0(x), where C0
is the conjunction of atoms corresponding to the concept C.
        </p>
        <p>
          Given the limited expressive power of DL-Lite TBoxes, it might seem that in order
to answer a query q over a KB K, we could simply build a finite first-order structure
on the basis of K, and then evaluate the query itself as an expression over this
firstorder structure. Actually, it is possible to show that this is not the case. In particular, it
can be shown that, in general, given a DL-Lite KB K, there exists no finite structure S
such that, for every conjunctive query q, the set of answers to q over K is the result of
evaluating q itself over S (see [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]). This property demonstrates that answering queries
in DL-Lite goes beyond both propositional logic and relational databases.
        </p>
        <p>Instead, in order to exploit query evaluation for query answering, and also properly
take into account that the size of the ABox (i.e., the data) largely dominates the size
of the TBox, we consider the query answering process as divided in two steps (cf.
Figure 1):</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4 http://www.w3.org/TR/owl-features/</title>
      <p>1. First, considering the TBox T only, the user query q is reformulated into a new
query rq;T (expressed in a suitable query language LQ).
2. Then, the reformulated query rq;T is evaluated over the ABox A only
(considered as a first-order structure, i.e., a database), producing the requested answer
ans(q; hT ; Ai).</p>
      <p>Notice that, in principle, such a two steps query answering process is always
possible for arbitrary TBoxes and user queries, provided we do not impose any restriction
on the query language LQ in which the reformulation rq;T is expressed. Indeed, in
order to ensure that the evaluation of rq;T over the ABox A produces the correct answer
ans(q; hT ; Ai) (i.e., that rq;T is a perfect reformulation of q given T ), we may have
to allow for LQ to be completely general. In other words, we may need to be able to
specify in rq;T an arbitrary Turing Machine computation, possibly one that performs
full inferences over the TBox and the query. However, if we pose no restriction on LQ,
and hence on the form of T and q, we have no guarantee that the evaluation of rq;T
over the ABox A can be performed efficiently, and hence that the separation between
query reformulation (using the TBox only) and query evaluation (over the ABox only)
makes sense from a computational complexity point of view.</p>
      <p>
        As shown in [
        <xref ref-type="bibr" rid="ref7 ref8">8, 7</xref>
        ], one of the distinguishing features of DL-Lite is that the above
described two steps query answering process makes sense, and allows us to be efficient
in the size of the data. Indeed, the perfect reformulation rq;T of a conjunctive query q
over a DL-Lite KB K = hT ; Ai can be expressed as a union of conjunctive queries,
i.e., a set of select-project-join SQL queries, and hence the query evaluation step can be
performed in LOGSPACE in the size of the ABox A [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Since the size of rq;T does not
depend on A, the data complexity (i.e., the complexity measured as a function of the
size of the ABox only) of the whole query answering algorithm is LOGSPACE.
      </p>
      <p>Moreover, by storing the ABox under the control of an RDBMS, which can
manage effectively large numbers (i.e., millions) of objects in the knowledge base, we can
delegate the query evaluation step to an SQL database engine. More precisely, we can
construct a relational database that faithfully represents an ABox A as follows. First of
all, we assume that A does not contain conjunction and ?. If this is not the case, we can
easily pre-process it and bring it in such a form in linear time (actually, in LOGSPACE
in the number of objects in the ABox). Let further A0 be the ABox obtained from A by
adding for each assertion R(a; b) 2 A the implied assertions 9R(a) and 9R¡(b). Then,
for each concept C in K that is either atomic or of the form 9R, we define a relational
table tabC of arity 1, such that hai 2 tabC if and only if C(a) 2 A0. Similarly, for each
atomic role P in K, we define a relational table tabP of arity 2, such that ha; bi 2 tabP
if and only if P (a; b) 2 A or P ¡(b; a) 2 A. Let us call DB (A) the resulting database.
The fact that the ABox A is managed in secondary storage by an RDBMS, together with
the fact that in DL-Lite the perfect reformulation of a select-project-join SQL query can
be expressed as a set of select-project-join SQL queries that can be directly evaluated
over DB (A), allows us to completely delegate the query evaluation process to the SQL
engine of the RDBMS, and to take advantage of well established query optimization
strategies.</p>
      <p>
        We have developed a prototype tool, called QUONTO [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] (see Figure 2), that
implements the DL-Lite query answering algorithm and delegates to an RDBMS the ABox
storage and the query evaluation step. QUONTO is able to answer queries over ABoxes
containing millions of assertions, and the limitations actually depend on the underlying
DBMS engine (currently we use MySQL).
      </p>
      <p>
        Notice that, as soon as we extend the expressive power of DL-Lite even slightly,
we lose the possibility of delegating query evaluation to an RDBMS. Indeed, Table 1,
drawn from [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], shows bounds on the data complexity of query answering for
various DLs obtained from DL-Lite by adding various constructs (and possibly removing
some). To give a more precise account of the complexity, we distinguish between the
constructs allowed in concepts on the left-hand side of inclusion assertions (denoted by
B), and those allowed in concepts on the right-hand side of inclusion assertions
(denoted by C). The language L1 is DL-Lite, and L2 is obtained from DL-Lite by allowing
for qualified existential quantification in C while forbidding the use of functionality
assertions. Similarly to DL-Lite, query answering in L2 can be performed in LOGSPACE
via query reformulation. Instead, in languages L3, L4, L5, one can encode reachability
in directed graphs (essentially through the use of qualified existentials in B). Hence
query answering (actually, already instance checking) becomes NLOGSPACE-hard in
data-complexity. By further allowing for the use of conjunction in B, one can encode
Path System Accessibility (a non-linear form of reachability), and instance checking
becomes PTIME-hard (L6, L7, L8). Finally, by allowing to denote two concepts that
together cover the whole domain, conjunctive query answering becomes even
coNPhard in data-complexity (L9, L10, L11).
      </p>
      <p>The fact that the data-complexity goes beyond LOGSPACE, means actually that
query answering (resp., instance checking) requires more powerful engines than those
available in standard relational database technology. Essentially NLOGSPACE-hardness
Legend: A (possibly with subscript)= atomic concept, P = atomic role,
B (possibly with subscript) = left-hand side of TBox inclusion assertions,
C = right-hand side of TBox inclusion assertions, R = arbitrary role.
means that at least the power to compute transitive closure (i.e., linear recursion) is
required, while PTIME-hardness essentially requires the power of Datalog. For the
coNPhard cases a technology based on Disjunctive Datalog could be adopted. An immediate
consequence of this fact is that, as soon as we go beyond DL-Lite, we lose the possibility
of delegating query answering to data management tools and we cannot take advantage
of query optimization techniques of current industrial strength RDBMSs.
4</p>
      <sec id="sec-3-1">
        <title>Discussion and related work</title>
        <p>
          DL-Lite is a fragment of expressive DLs with assertions and inverses studied in the 90’s
(see [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] for an overview), which are at the base of current ontology languages such as
OWL, and for which optimized automated reasoning systems such as Fact5, Racer, and
Pellet6 have been developed. Indeed, one could use, off-the-shelf, a system like Racer
or Pellet to perform KB satisfiability, instance checking (of concepts), and logical
implication of inclusion assertions in DL-Lite. Also, reasoning with conjunctive queries in
these DLs has been studied (see e.g., [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]), although not yet implemented in systems.
Unfortunately, the reasoning procedures for these DLs are all EXPTIME-hard, and more
importantly they are not tailored towards obtaining tight bounds with respect to data
complexity. Alternative reasoning procedures that allow for clearly isolating data
complexity have recently been proposed, how they will work in practice still needs to be
understood: a coNP upper bound for data complexity of instance checking in an
expressive DL has been shown, and a polynomial fragment has been isolated [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], though
it is open whether the technique can be extended to deal efficiently with conjunctive
queries; building on the technique in [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], coNP-completeness of answering
conjunc
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5 http://www.cs.man.ac.uk/˜horrocks/FaCT/</title>
    </sec>
    <sec id="sec-5">
      <title>6 http://www.mindswap.org/2003/pellet/</title>
      <p>
        tive queries for an expressive DL with assertions, inverse roles, and number restrictions
(that generalize functionality) has been shown [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>
        DL-Lite can also capture (the DL-subset of) RDFS7, except for role hierarchies. In
fact, the query answering technique for DL-Lite works also for full RDFS extended
with participation constraints (i.e., inclusion assertions with 9R on the right-hand side),
and one can show that in this case query answering is indeed LOGSPACE. However,
if we further extend RDFS with functionality assertions, it can be shown that query
answering becomes NLOGSPACE-hard. Finally, if we move from RDFS to DLP [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
query answering becomes PTIME-hard, since DLP is a superset of L6 in Table 1.
      </p>
      <p>
        There has been a lot of work in DLs on the boundary between polynomial and
exponential reasoning. This work first concentrated on DLs without the TBox component of
the KB, and led to the development of simple DLs, such as ALN , that admit polynomial
instance checking. However, for minor variants of ALN , such as ALE (where we
introduce qualified existential and drop number restrictions), F LE ¡ (where we additionally
drop negated atomic concept), and ALU (where we introduce union and drop number
restrictions), conjunctive query answering is coNP-hard in data complexity [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. If we
allow for cyclic inclusion assertions in the KB, then even subsumption in CLASSIC
and ALN becomes intractable [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]8.
      </p>
      <p>
        More recently languages equipped with qualified existential restrictions but no
universal restrictions (even expressed in a disguised way, as in DL-Lite, through inverse
roles) have been studied. In particular, in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] it has been shown that for the basic
language E L instance checking is polynomial even in the presence of general (i.e.,
cyclic) inclusion assertions, while extensions of the language lead easily to
intractability (cf. Table 1). Conjunctive query answering in E L has not been studied yet, however
the results in Table 1 show us that such a service is PTIME-hard in data complexity and
hence cannot be delegated to a relational DBMS (actually such a lower bound holds
already for instance checking).
5
      </p>
      <sec id="sec-5-1">
        <title>Conclusions</title>
        <p>For the management of data intensive ontologies, we have advocated the use of DL-Lite,
a subset of OWL-Lite that (i) is specifically tailored to capture conceptual data models
and basic ontology languages, and (ii) keeps the worst-case data complexity of sound
and complete reasoning in LOGSPACE. This allows one to effectively exploit current
industrial strength relational technology for query answering.</p>
        <p>We have also argued that by extending DL-Lite, we lose this possibility, and in
order to allow for the separation of TBox and ABox reasoning, we have to rely on more
powerful query evaluation engines, such as one for Datalog. Hence, the possibility of
efficient implementations and optimization strategies for query evaluation engines that
go beyond relational ones is worth investigating.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>7 http://www.w3.org/TR/rdf-schema/</title>
      <p>8 Note that a TBox with only acyclic inclusion assertions can always be transformed into an
empty TBox.</p>
      <p>Acknowledgments This research has been partially supported by the IST Specific
Targeted Research Project “Thinking ONtologies (TONES) – IST-007603 funded by
the European Union under the 6th Framework Programme.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>S.</given-names>
            <surname>Abiteboul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Vianu</surname>
          </string-name>
          .
          <source>Foundations of Databases. Addison Wesley Publ. Co.</source>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Acciarri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmieri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          . QUONTO:
          <article-title>QUerying ONTOlogies</article-title>
          .
          <source>In Proc. of AAAI</source>
          <year>2005</year>
          , pages
          <fpage>1670</fpage>
          -
          <lpage>1671</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brandt</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Lutz</surname>
          </string-name>
          .
          <article-title>Pushing the E L envelope</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-</surname>
          </string-name>
          Schneider, editors.
          <source>The Description Logic Handbook: Theory, Implementation and Applications</source>
          . Cambridge University Press,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. A. Cal`ı,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          .
          <article-title>Query rewriting and answering under constraints in data integration systems</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2003</year>
          , pages
          <fpage>16</fpage>
          -
          <lpage>21</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          .
          <article-title>Reasoning with inclusion axioms in description logics: Algorithms and complexity</article-title>
          .
          <source>In Proc. of ECAI'96</source>
          , pages
          <fpage>303</fpage>
          -
          <lpage>307</lpage>
          . John Wiley &amp; Sons,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          .
          <article-title>Data complexity of query answering in description logics</article-title>
          .
          <source>In Proc. of DL 2005. CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          .
          <article-title>DL-Lite: Tractable description logics for ontologies</article-title>
          .
          <source>In Proc. of AAAI</source>
          <year>2005</year>
          , pages
          <fpage>602</fpage>
          -
          <lpage>607</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          .
          <article-title>Answering queries using views over description logics knowledge bases</article-title>
          .
          <source>In Proc. of AAAI</source>
          <year>2000</year>
          , pages
          <fpage>386</fpage>
          -
          <lpage>391</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>C. Chen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Haarslev</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          . LAS:
          <article-title>Extending Racer by a Large ABox Store</article-title>
          .
          <source>In Proc. of DL 2005. CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>F. M. Donini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Nardi</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Schaerf</surname>
          </string-name>
          .
          <article-title>Deduction in concept languages: From subsumption to instance checking</article-title>
          .
          <source>J. of Log. and Comp</source>
          .,
          <volume>4</volume>
          (
          <issue>4</issue>
          ):
          <fpage>423</fpage>
          -
          <lpage>452</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>B. N.</given-names>
            <surname>Grosof</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Volz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Description logic programs: Combining logic programs with description logic</article-title>
          .
          <source>In Proc. of WWW</source>
          <year>2003</year>
          , pages
          <fpage>48</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>J.</given-names>
            <surname>Heflin</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          .
          <article-title>A portrait of the semantic web in action</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>16</volume>
          (
          <issue>2</issue>
          ):
          <fpage>54</fpage>
          -
          <lpage>59</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. I.
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Turi</surname>
            , and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Bechhofer</surname>
          </string-name>
          .
          <article-title>The Instance Store: DL reasoning with large numbers of individuals</article-title>
          .
          <source>In Proc. of DL 2004. CEUR Electronic Workshop Proceedings</source>
          , http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>104</volume>
          /,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. U. Hustadt,
          <string-name>
            <given-names>B.</given-names>
            <surname>Motik</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Data complexity of reasoning in very expressive description logics</article-title>
          .
          <source>In Proc. of IJCAI</source>
          <year>2005</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          .
          <article-title>Data integration: A theoretical perspective</article-title>
          .
          <source>In Proc. of PODS</source>
          <year>2002</year>
          , pages
          <fpage>233</fpage>
          -
          <lpage>246</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Levesque</surname>
          </string-name>
          and
          <string-name>
            <surname>G. Lakemeyer.</surname>
          </string-name>
          <article-title>The Logic of Knowledge Bases</article-title>
          . The MIT Press,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Levy and M.-C.</surname>
          </string-name>
          <article-title>Rousset. Combining Horn rules and description logics in CARIN</article-title>
          .
          <source>Artificial Intelligence</source>
          ,
          <volume>104</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>165</fpage>
          -
          <lpage>209</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>M. M. Ortiz de la Fuente</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Eiter</surname>
            , and
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Franconi</surname>
          </string-name>
          .
          <article-title>Data complexity of answering conjunctive queries over SHIQ knowledge bases</article-title>
          .
          <source>Technical report</source>
          , Fac. of Computer Science, Free Univ. of Bozen-Bolzano,
          <year>July 2005</year>
          .
          <article-title>Also available as CORR technical report</article-title>
          at http://arxiv.org/abs/cs.LO/0507059/.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Vardi</surname>
          </string-name>
          .
          <article-title>The complexity of relational query languages</article-title>
          .
          <source>In Proc. of STOC'82</source>
          , pages
          <fpage>137</fpage>
          -
          <lpage>146</lpage>
          ,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>