<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On Equality Constraints in Datalog+/- Knowledge Bases (Position Paper)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Calí</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Console</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Frosini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Birkbeck University of London</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Sapienza University of Rome</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Naples “Federico II”</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Commonly adopted database constraints in knowledge bases are tuple-generating dependencies (TGDs) and equality-generating dependencies (EGDs), which form the core of the Datalog+/- family of languages, which are able to capture a variety of ontology formalisms. The presence of EGDs in Datalog+/- programs, and even in simpler relational schema languages based e.g. on inclusion dependencies, is known to easily lead to undecidability or intractability of query answering. The notion of separability was hence introduced to characterise sets of EGDs that have limited interaction with TGDs. We review two notions of separability found in the literature, as well as syntactic conditions that are suficient to them. In particular, we define the notion of deep separability, which captures several analogous notions defined ad-hoc in the literature, and provide a suficient condition for it. We then establish a relationship between decidability of query answering and decidability of determining separability. This works sheds light on the interaction between EGDs and TGDs in Datalog+/- knowledge bases and beyond, providing a basis for further investigations.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Ontology Based Data Access</kwd>
        <kwd>Semantic Technologies</kwd>
        <kwd>Integrity Constraints</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction and Preliminaries</title>
      <sec id="sec-1-1">
        <title>When a database  is equipped with an ontology Σ , that is, a knowledge base consisting of</title>
        <p>constraints that express relevant properties of the underlying domain, queries are not answered
merely against the database instance , but against the logical theory  ∪ Σ . Several languages
have been proposed for ontologies, with diferent computational properties.</p>
        <sec id="sec-1-1-1">
          <title>The DL-Lite family [1] has the advantage of a low (ac0, which is contained in logspace) data</title>
          <p>complexity of conjunctive query answering and of knowledge base satisfiability.</p>
        </sec>
      </sec>
      <sec id="sec-1-2">
        <title>The ER± family of ER-like languages [2], in particular, comprises several tractable (w.r.t. con</title>
        <p>junctive query answering) ontology languages, which properly generalize the main languages
of the DL-Lite family.</p>
      </sec>
      <sec id="sec-1-3">
        <title>Another relevant, more general class of ontology languages, capable of capturing most DL</title>
        <p>lite language, is the Datalog± family, that is, a family of rule-based languages derived from</p>
      </sec>
      <sec id="sec-1-4">
        <title>Datalog (see, e.g., [3] and references therein) whose rules are (function-free) Horn rules, possibly</title>
        <p>with existentially quantified variables in the head, called tuple-generating dependencies (TGDs),
enriched with functionality constraints in the form of equality-generating dependencies (EGDs),
and negative constraints, a form of denial constraints.</p>
      </sec>
      <sec id="sec-1-5">
        <title>In this paper we focus on the interaction between TGDs and EGDs. A TGD is an first-order</title>
        <p>
          implication that forces the existence of tuples under certain conditions, and it is of the form
∀X∀Y (X, Y) → ∃Z (X, Z) , where (X, Y) and (X, Z) are conjunctions of atoms over
a relational schema. An EGD forces equality of values under certain conditions, and it is of
the form ∀X (X) →   =  , where (X) is a conjunction of atoms over a relational
schema, and {,  } ⊆ X . To answer a query  over an instance  and a set Σ of TGDs and
EGDs, we could in principle “expand”  according to Σ , inferring all the entailed additional
knowledge, and then evaluate  against the obtained instance. Said expansion is called chase
in the literature, for which we refer the reader to [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. In the chase, atoms are added according
to TGDs; in doing so, unknown values (those corresponding to the existentially-quantified
variables of TGDs) are represented by labelled nulls, which are a sort of placeholder. The set
of labelled nulls is denoted by Γ  , while the set of constants is denoted by Γ . EGDs force the
equality between a labelled null and a constant, or between two labelled nulls1. The application
of an EGD can trigger the application of TGDs. The chase as a procedure applies one TGD,
then all EGDs until no further EGD application is possible, and then starts again with an EGD
application, and so on, possibly to the infinite. The chase is a universal model in this case, which
entails that the correct answer to a query  on a database  under a set Σ of TGDs and EGDs
can be obtained by evaluating  on the chase of (only in principle, as the result of the chase,
also called chase for convenience of notation, can be infinite).
        </p>
      </sec>
      <sec id="sec-1-6">
        <title>In the rest of this section we show two examples to define the notion of separability, which</title>
        <p>shall be introduced formally in the next section.</p>
        <p>Example 1. Consider the following set Σ of TGDs and EGDs (we omit universal quantifiers to
avoid clutter):
 1 : 1() → ∃ 2(,  )
 2 : 2(,  ) → 3(,  )
 3 : 3(,  ) → 4(,  )
 4 : 4(,  ) → 5(,  )
 5 : 5(,  ) → 2(,  )</p>
        <p>:  3(,  ), 3(, ) →  = 
Notice that  is a key dependency, imposing that atoms in 3 have unique values on their first
attribute. Now, let us take  = {1(), 3(, )} and the (ground) Boolean conjunctive query 
defined as  ←  2(, ) ( is a propositional predicate), which simply asks whether 2(, ) holds.
Let us answer  by computing the chase of  according to Σ , denoted chase(, Σ) . We first
obtain 2(, ) from  1, where  is a labelled null. From  3 we get 4(, ). We then add 3(, )
from  2; at this point we need to apply  and enforce  =  , so that 3(, ) becomes 3(, ). Due
to  , therefore, the query has positive answer, written  ∪ Σ |=  . However, even in the absence of
 ,  would be answered positively. In fact, if we proceed with the expansion we get 5(, ) from
 4 and finally 2(, ) from  5. Therefore we would have had the same result without  . This can
be shown to hold for every query ; therefore, provided that the theory  ∪ Σ is satisfiable (that
is, the expansion does not fail), we can answer every query, for every , by considering the TGDs
only. This property is called separability, and it defines a form of lack of interaction (hence the
name) between EGDs and TGDs in a certain set.</p>
      </sec>
      <sec id="sec-1-7">
        <title>1Equating two distinct constants results in a logical inconsistency because the unique name assumption is adopted.</title>
        <p>Example 2. Suppose we have  = {(, )}, a TGD (,  ) → ∃ (, ), and an EGD
(1, 2), (3, 2) → 1 = 2. The chase (according to the dependencies above) then adds
(, ) by virtue of the TGD  turns into  and the added atom becomes (, ). Hence a query 
defined as  → (, ) has positive answer, and would have negative answer without the EGD
above. In this case separability does not hold.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. What is separability?</title>
      <sec id="sec-2-1">
        <title>It is well know that the interaction of TGDs and EGDs leads to undecidability of query answering;</title>
        <p>this happens even in simple sub-classes of constraints, such as key and inclusion dependencies [5,</p>
      </sec>
      <sec id="sec-2-2">
        <title>6]. It is therefore useful to identify classes of constraints which do not sufer from his “harmful”</title>
        <p>
          interaction. To this aim, a key condition is that of separability [
          <xref ref-type="bibr" rid="ref2 ref7">2, 7</xref>
          ], which we give below.
Definition 1 (Separability). Consider a Σ  of TGDs over a schema ℛ, and a set Σ  of EGDs
over ℛ. We say that the set Σ = Σ  ∪ Σ  is separable if, for every database  for ℛ, either
chase(, Σ) fails, or, chase(, Σ) |=  if chase(, Σ  ) |= , for every BCQ  over ℛ.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Notably, there is another definition of separability in the literature, which is adopted in [ 5, 6, 8].</title>
      </sec>
      <sec id="sec-2-4">
        <title>Such a definition is similar to the above Definition 1, but with a diference which makes it stronger. For the sake of completeness, we give such a definition below.</title>
        <p>Definition 2 (Old separability). Consider a Σ  of TGDs over a schema ℛ, and a set Σ  of
EGDs over ℛ. We say that the set Σ = Σ  ∪ Σ  is separable if, for every database  for ℛ, (i) if
the chase fails, then  ̸|= Σ  ( does not satisfy Σ ), and (ii) if the chase does not fail, we have
chase(, Σ) |=  if chase(, Σ  ) |= , for every BCQ  over ℛ.</p>
      </sec>
      <sec id="sec-2-5">
        <title>The old separability is a special case of the new separability as it enforces condition (i) (Definition 2), which we reformulate below, calling it EGD-stability.</title>
        <p>Definition 3 (EGD-stability). Consider a set Σ  of TGDs over a schema ℛ, and a set Σ  of
EGDs over ℛ. We say that the set Σ = Σ  ∪ Σ  is EGD-stable if, for every instance  for ℛ,
 |= Σ  implies that chase(, Σ) does not fail.</p>
        <p>EGD stability guarantees that the satisfiability of  ∪Σ (i.e., the existence of chase(, Σ) ) can
be determined by merely checking whether  |= Σ  . The satisfiability check is a fundamental
step in query answering (see Section 2.2), and EGD stability guarantees it can be easily done.</p>
      </sec>
      <sec id="sec-2-6">
        <title>However, with a more sophisticated version of the satisfiability check (see Section 3), the old separability can be relaxed to the new separability, giving raise to the discovery of new (syntactic) classes that enjoy (new) separability. Section 2.1 below provides an overview of the most relevant syntactic classes in the literature.</title>
        <p>
          2.1. Related Work
The notion of (old) separability was first introduced in [
          <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
          ], in the context of inclusion
dependencies (IDs) and key dependencies (KDs) (see e.g. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]). The general idea is to define a
suficient syntactic condition for separability, which can be eficiently checked. This was done
while extending an early class of IDs and KDs (see e.g. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]), called key-based. Key-based IDs
and KDs were proposed in the milestone work by Johnson and Klug [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], and they are in fact
separable, though not defined explicitly as such. Key-based IDs and KDs are defined as follows 2:
(i) for each relational predicate , there is only one KD defined on it; (ii) for each ID [X] ⊆ [Y] ,
where X and Y are set of attributes of  and  respectively (see [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] for the notation), (ii.a) X is
disjoint from any key set of attributes for , and (ii.b) Y is properly contained in the set of key
attributes for .
        </p>
      </sec>
      <sec id="sec-2-7">
        <title>The more general class of non-key-conflicting (NKC) IDs, with respect to a set of KDs, is</title>
        <p>defined as follows: (i) for each relational predicate , there is only one KD defined on it; (ii) for
every ID [X] ⊆ [Y] , Y is not a proper superset of the set of key attributes for  (if such set
exists). The class of KDs and NKC IDs is separable, and it properly captures the well known
class of foreign-key dependencies.</p>
      </sec>
      <sec id="sec-2-8">
        <title>The class of KDs and non-key conflicting IDs was generalized in [ 8] to general TGDs, which</title>
        <p>are assumed to have a single atom in the head, without loss of generality. The idea is analogous,
and the condition is as follows: (i) for each relational predicate , there is only one KD defined
on it; (ii) each existentially quantified variable in the head of a TGD must occur only once; (iii)
for each TGD  = (X, Y) → ∃Z (X, Z) , the set of X-attributes of (X, Z) is not a proper
superset of the set of key attributes of .</p>
      </sec>
      <sec id="sec-2-9">
        <title>In [11], the class of non-key-conflicting TGDs was straightforwardly extended to treat functional dependencies (FDs) rather than keys.</title>
      </sec>
      <sec id="sec-2-10">
        <title>The literature so far discussed deals with suficient syntactic conditions that guarantee, in</title>
        <p>fact, EGD-stability. However, there are classes of TGDs and EGDs such that EGDs are triggered
in the chase, but which enjoy separability.</p>
        <p>Example 3. Consider the set of TGDs and EGDs in Example 1. It is separable, but it is not
EGDstable. In fact, if we consider the instance ′ = {3(, ), 2(, )}, we have that ′ |= Σ  but
 ∪ Σ  is unsatisfiable because the chase fails due to a hard violation.</p>
      </sec>
      <sec id="sec-2-11">
        <title>This leads in [2] to the definition of separability as in Definition 1 in this paper. This work deals</title>
        <p>
          with IDs and KDs which express an expressive variant of the Entity-Relationship model [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. By
means of graph-related properties, necessary and suficient syntactic conditions for separability
are provided, thus defining useful tractable classes of constraints.
        </p>
      </sec>
      <sec id="sec-2-12">
        <title>The case of linear TGDs (TGDs with a single body-atom) and KDs was later considered</title>
        <p>
          in [
          <xref ref-type="bibr" rid="ref13 ref7">13, 7</xref>
          ]. In these works, suficient syntactic conditions are proposed that guarantee separability,
without imposing non-egd-triggerability. The conditions are quite involved and they make use
of backward-resolution. Interestingly, the complexity of checking the syntactic condition, called
non-conflict condition, is the same as that of query answering, that is pspace-complete.
        </p>
        <p>At this point, we considered separability but not the problem of satisfiability. While
separability allows us to ignore EGDs in the case of satisfiable theories, the problem of deciding the
satisfiability remains, as well as that of determining its complexity. This will be the subject of
next section.
2The definition in the original paper is diferent but equivalent; we choose a definition which is more clear in the
context of this paper.
2.2. Query Answering under Separable Constraints
In the case of a set Σ = Σ  ∪ Σ  of separable TGDs and EGDs, such that BCQ answering
under Σ  is decidable, given an instance  and a BCQ , to decide whether  ∪ Σ |=  , the
following steps are needed:
1. Check whether the chase fails, that is, whether  ∪Σ is satisfiable; if  ∪Σ is unsatisfiable,
then trivially  ∪ Σ |=  (“Ex falso quodlibet” ).
2. If  ∪ Σ is satisfiable, then by Definition 1 we know chase(, Σ) |=  if
chase(, Σ  ) |= , therefore we check whether chase(, Σ  ) |= .</p>
      </sec>
      <sec id="sec-2-13">
        <title>Apart from the complexity of the satisfiability check, we have that the complexity of query</title>
        <p>answering is the same as that of answering under TGDs only, which is a highly desirable
property. In the case of EGD-stable constraints, the satisfiability check amounts simply to
checking whether  |= Σ , which can be done in np, and in ptime (or better, in the even
lower complexity class ac0) if we consider Σ  fixed. However, in the cases of separable but not</p>
      </sec>
      <sec id="sec-2-14">
        <title>EGD-stable constraints, the problem is to be addressed diferently; this will be the subject of</title>
      </sec>
      <sec id="sec-2-15">
        <title>Section 3.</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Separability and Satisfiability</title>
      <sec id="sec-3-1">
        <title>In this section we address the problem of deciding whether, given an instance  and a set Σ of</title>
        <p>separable TGDs and EGDs,  ∪ Σ is satisfiable. As seen in Section 2, this preliminary check
is needed, in the case of separability, before one proceeds to answer a BCQ  by taking into
account the TGDs only.</p>
        <p>
          The satisfiability check is done as in [
          <xref ref-type="bibr" rid="ref2 ref8">8, 2</xref>
          ], by encoding hard violations of EGDs as a set  of
Boolean conjunctive queries. Given a separable set Σ = Σ  ∪ Σ , and an instance , we have
that satisfiability holds if and only if, for each  ∈  we have  ∪Σ  ̸|=  or, equivalently,
if and only if all queries in  have negative answer when evaluated against  and Σ  (TGDs
only). The encoding is done as follows. First, we need to add auxiliary facts to . For each
pair of distinct constants 1, 2 appearing in  as arguments, we add the facts neq (1, 2) and
neq (2, 1) to , where neq /2 is an auxiliary predicate expressing that two constants are distinct.
Then, for each EGD  of the form (X) →  =  , with  and  in X, we construct the
BCQ  as follows (quantifiers omitted for brevity):  :  ← (X), neq ( ,  ), where  is a
propositional predicate. It is not dificult to prove that if  ∪ Σ |=   then chase(, Σ) fails,
and therefore  ∪Σ is unsatisfiable. However, it still remains to determine whether  ∪Σ |=   .
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Notice that in the case of non-egd-triggerable constraints we do not have this problem, as we</title>
        <p>merely need to check whether  |= Σ . In principle, separability (see Definition 1 does not
tell us anything about the cases when the chase fails. However, we are still in luck because
we can evaluate the (Boolean) conjunctive queries in  against  ∪ Σ  rather than  ∪ Σ .</p>
      </sec>
      <sec id="sec-3-3">
        <title>This is proved, for instance, in [2, 13], but ad hoc, by using the properties of the particular</title>
        <p>class of constraints involved. Here we show a general condition that allows us to perform the
satisfiability check as above. We first need to introduce a preliminary condition, called deep
separability, which is apparently more restrictive than separability (we shall then prove that it
is implied by separability). We start by denoting by chase(, Σ) the result of the first  chase
steps under Σ applied to an instance , where a step is either an EGD or a TGD application.
Definition 4. Let Σ be a set of TGDs and EGDs, with Σ = Σ  ∪Σ , where the dependencies in Σ 
are TGDs and those in Σ  are EGDs. We say that Σ is deeply separable if, for each integer  ⩾ 0,
for each instance  with with values in Γ ∪ Γ  , and for each Boolean conjunctive query , the
following holds: if chase(, Σ) exists, then chase (, Σ) |=  implies chase(, Σ  ) |= .</p>
      </sec>
      <sec id="sec-3-4">
        <title>The notion of deep separability, intuitively, guarantees that, at any step before a possible</title>
        <p>failure, the chase does not entail any atoms that are not entailed by the chase computed according
to Σ  only. We now come to the main result about deeply separable TGDs and EGDs. The
result states that in the case of deep separability the satisfiability check done by answering
suitable queries as above is correct and complete.</p>
        <p>Theorem 1. Let Σ be a set of deeply separable TGDs and EGDs, with Σ = Σ  ∪ Σ , where the
dependencies in Σ  are TGDs and those in Σ  are EGDs. Let  = {1, . . . , } be the set of
BCQs encoding violations of the EGDs in Σ  as from the above construction. Then we have that
chase(, Σ  ) |=  for some  ∈ {1, . . . , } if and only if chase(, Σ) fails (or equivalently
 ∪ Σ is unsatisfiable).</p>
        <p>Proof (sketch). We prove the two directions of the implication separately.</p>
        <p>“Only if”. By contradiction, assume chase(, Σ  ) |=  for some  ∈ {1, . . . , }
but chase(, Σ) exists. It is not dificult to show that if chase(, Σ  ) |=  then also
chase(, Σ) |=  ; this holds because chase(, Σ  ) is a universal model for  ∪ Σ  and
chase(, Σ) is a model (not necessarily universal) for  ∪ Σ  ; therefore there exists a
homomorphism from the former to the latter. If chase(, Σ) |=   then the chase must necessarily
fail. Contradiction.</p>
        <p>“If”. Assume chase(, Σ) fails at step  by violation of the EGD  ∈ Σ . Then,
chase−1 (, Σ) exists; moreover, it is easily seen that chase−1 (, Σ) |=   , where  ∈ 
encodes the violation of  . By the hypothesis of deep separability we have chase(, Σ  ) |=  ,
hence the claim. □</p>
      </sec>
      <sec id="sec-3-5">
        <title>Notice that for the “If” direction we do not need deep separability nor separability; this direction of the implication holds for general TGDs and EGDs. Finally we show that, despite the appearances, deep separability is not a stricted condition than separability. In fact, separability implies deep separability.</title>
        <p>Theorem 2. Let Σ be a set of TGDs and EGDs. If Σ is separable, then it is deeply separable.</p>
      </sec>
      <sec id="sec-3-6">
        <title>Proof (sketch). We distinguish two cases.</title>
      </sec>
      <sec id="sec-3-7">
        <title>Case 1: chase(, Σ) exists . In this case the claim obviously holds.</title>
        <sec id="sec-3-7-1">
          <title>Case 2: chase(, Σ) fails . Assume chase(, Σ) fails at step ; therefore chase−1 (, Σ)</title>
          <p>exists. Take  obtained from  by replacing each constant in Γ with a fresh null from
Γ  . Obviously chase( , Σ) exists and so does chase−1 ( , Σ) . Take any BCQ  such
that chase−1 ( , Σ) |=  . It is straightforwardly seen that chase( , Σ) |=  and,
due to separability, chase( , Σ  ) |= . Since chase( , Σ  ) |=  is obtained from
chase( , Σ  ) by the above renaming of constants into nulls, we have chase(, Σ  ) |= .
Since this holds for every step ℓ ⩽  − 1, the claim is proved. □</p>
        </sec>
      </sec>
      <sec id="sec-3-8">
        <title>The following result immediately follow from the above.</title>
        <p>Corollary 1. For every separable set Σ of TGDs and EGDs, for every instance  and for every
BCQ :
• checking unsatisfiability of  ∪ Σ has the same complexity as query answering under Σ 
alone;
• checking whether  ∪ Σ |=  has the same complexity as query answering under Σ  alone.</p>
      </sec>
      <sec id="sec-3-9">
        <title>Notice that we mention unsatisfiability rather than satisfiability because Theorem 1 shows a</title>
        <p>reduction from failure (unsatisfiability) to BCQ answering under TGDs alone.
3.1. A Syntactic Condition for Separability</p>
      </sec>
      <sec id="sec-3-10">
        <title>We consider positions in the schema as arguments of predicates; if we have a predicate /3</title>
        <p>
          of arity 3, we denote its arguments as positions [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. Given a set Σ  of TGDs,
we say that a position is a bud position if, whenever it appears in a head-atom of any TGD
of Σ  , it contains an existentially-quantified variable. Given an EGD body (X) , we denote by
(X)| 1/2 , with {1, 2} ⊆ X , the conjunction of atoms obtained from (X) by replacing
1 with 2. Given an EGD (X) →  1 = 2, we call 1, 2 equality variables (EVs).
Definition 5. Given a set Σ = Σ  ∪ Σ  as before, we say that Σ is transparent if: (i) for every
EGD (X) →  1 = 2 in Σ , the equality variables appear in the body only in bud positions,
and (ii) every atom  in (X)| 1/2 or (X)| 2/1 is such that (X) ∪ Σ  |= .
        </p>
      </sec>
      <sec id="sec-3-11">
        <title>Intuitively, transparency ensures that (i) whenever an EGD is applied in the chase, the</title>
        <p>equation of two symbols (two labelled nulls, or one labelled null and a constant) does not
propagate backwards in the portion of the chase constructed at that point; in fact, any labelled
null corresponding to an EV in the application appears in the chase for the first time; (ii) all atoms
appearing as a consequence of the EGD application would appear anyway in chase(, Σ  ).</p>
      </sec>
      <sec id="sec-3-12">
        <title>The following theorem can then be proved.</title>
        <p>Theorem 3. If a set Σ = Σ</p>
        <p>∪ Σ  of TGDs and EGDs is transparent, then Σ is separable.
3.2. Separability and Decidability</p>
      </sec>
      <sec id="sec-3-13">
        <title>To establish a relationship between the problem of determining the separability of a set of TGDs plus EGDs and decidability of conjunctive query answering, we show the following result.</title>
        <p>Theorem 4. Consider a set Σ = Σ
is separable is undecidable.
 ∪ Σ  of TGDs and EGDs as before. Determining whether Σ</p>
      </sec>
      <sec id="sec-3-14">
        <title>The above theorem can be proved by reducing the problem of conjunctive query answering to the one of checking separability.</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion</title>
      <sec id="sec-4-1">
        <title>In this paper we have given an overview of the problem of separability between TGDs and EGDs</title>
        <p>
          in the context of ontological query answering, where queries are (Boolean) conjunctive queries.
We have reviewed the two main notions of separability found in the literature, the “old” one
and the “new” one, and we have clarified the diference between them. We have then addressed
the issue of checking satisfiability of a set of TGDs and EGDs together with an instance, and we
have shown that this can be done by merely answering suitable queries in the case of deeply
separable classes of constraints. We have shown the desirable property that all separable sets
of constraints are also deeply separable. We have therefore clarified and proved formally a
satisfiability check which was already employed in the literature, but proved on a case-by-case
basis [
          <xref ref-type="bibr" rid="ref2 ref7 ref8">7, 8, 2</xref>
          ], depending on the class of constraints. We believe that our generalisation provides
a useful tool for future studies, and a better insight into the satisfiability problem.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Further results. More results on this paper’s topics (for which we also refer the reader to</title>
        <p>
          an older paper [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]), which we did not include here due to space limitations, will be published
elsewhere. In [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] a suficient condition for separability (in the “new” meaning) is provided
for the case of EGDs and linear TGDs (which, we remind the reader, are TGDs with exactly
one head-atom and one body-atom), with ontologies that encode so-called Extended
Entity
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>Relationship schemata. Such a condition can be extended (with some changes) to the class of</title>
        <p>
          sticky sets of TGDs [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], a relevant class that allows for a form of joins in the body.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>Open problems. We currently have two main open problems at hand, which seem non</title>
        <p>trivial. First, we would like to study the decidability of determining whether a given set of</p>
      </sec>
      <sec id="sec-4-5">
        <title>TGDs and EGDs is separable. This has been proved to be undecidable for the case of arbitrary</title>
      </sec>
      <sec id="sec-4-6">
        <title>TGDs and EGDs [15]; however, the proof relies on the fact that query answering under arbitrary</title>
      </sec>
      <sec id="sec-4-7">
        <title>TGDs (without EGDs) is undecidable [16]. It is not clear whether undecidability holds also in</title>
        <p>
          the cases where query answering is undecidable under TGDs and EGDs together, but decidable
under TGDs alone; relevant cases are, for instance, IDs and KDs, sticky sets of TGDs and EGDs,
or guarded TGDs and KDs [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. We conjecture that determining separability is undecidable
for these classes. The second problem is more technical, and is determining the complexity of
checking the aforementioned, syntactic condition for the separability of sticky sets of TGDs and
        </p>
      </sec>
      <sec id="sec-4-8">
        <title>EGDs. We have an obvious exptime lower bound, but the upper bound is currently unknown.</title>
      </sec>
      <sec id="sec-4-9">
        <title>Ackowledgments. Andrea Calí acknowledges financial support from: project SERICS</title>
        <p>(PE00000014); “cascade” project SMIMI (CUP F53C24000190005), under the NRRP MUR program
funded by the EU-NGEU; PNRR ECS00000037 “cascade” project MUSA; project INFANT (CUP</p>
      </sec>
      <sec id="sec-4-10">
        <title>E23C24000390006); project MELODY (PRIN-PNRR, CUP E53D23017550001). The authors would like to thank Andreas Pieris for his insightful comments on this research.</title>
      </sec>
      <sec id="sec-4-11">
        <title>The authors have not employed any Generative AI tools.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Artale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kontchakov</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Zakharyaschev, The DL-Lite family and relations</article-title>
          ,
          <source>J. Artif. Intell. Res</source>
          .
          <volume>36</volume>
          (
          <year>2009</year>
          )
          <fpage>1</fpage>
          -
          <lpage>69</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <article-title>Ontological query answering under expressive EntityRelationship schemata</article-title>
          ,
          <source>Inf. Syst</source>
          .
          <volume>37</volume>
          (
          <year>2012</year>
          )
          <fpage>320</fpage>
          -
          <lpage>335</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          , G. Gottlob,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Marnette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          , Datalog+/
          <article-title>-: A family of logical knowledge representation and query languages for new applications</article-title>
          ,
          <source>in: Proc. of LICS</source>
          ,
          <year>2010</year>
          , pp.
          <fpage>228</fpage>
          -
          <lpage>242</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Console</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Frosini</surname>
          </string-name>
          , Deep separability of ontological constraints,
          <year>2013</year>
          . URL: https://arxiv.org/abs/1312.5914.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <source>Query Answering and Optimisation in Data Integration Systems, Ph.D. thesis</source>
          , Università di Roma “La Sapienza”,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <article-title>On the decidability and complexity of query answering over inconsistent and incomplete databases</article-title>
          ,
          <source>in: Proc. of PODS</source>
          ,
          <year>2003</year>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <article-title>Querying conceptual schemata with expressive equality constraints</article-title>
          ,
          <source>in: Proc. of ER</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>161</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          , G. Gottlob,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          ,
          <article-title>A general datalog-based framework for tractable query answering over ontologies</article-title>
          ,
          <source>J. Web Sem</source>
          . (
          <year>2012</year>
          ). To appear.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abiteboul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vianu</surname>
          </string-name>
          , Foundations of Databases, Addison-Wesley,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Johnson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Klug</surname>
          </string-name>
          ,
          <article-title>Testing containment of conjunctive queries under functional and inclusion dependencies</article-title>
          ,
          <source>J. Comput. Syst. Sci</source>
          .
          <volume>28</volume>
          (
          <year>1984</year>
          )
          <fpage>167</fpage>
          -
          <lpage>189</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gottlob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <article-title>Advanced processing for ontological queries</article-title>
          ,
          <source>PVLDB</source>
          <volume>3</volume>
          (
          <year>2010</year>
          )
          <fpage>554</fpage>
          -
          <lpage>565</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P. P.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>The entity-relationship model: towards a unified view of data</article-title>
          ,
          <source>ACM TODS 1</source>
          (
          <year>1995</year>
          )
          <fpage>124</fpage>
          -
          <lpage>131</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <article-title>On equality-generating dependencies in ontology querying - Preliminary report</article-title>
          ,
          <source>in: Proc. of AMW</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Console</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Frosini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <article-title>On Equality Constraints in Ontological Query Answering</article-title>
          ,
          <source>Technical Report</source>
          , University of London, Birkbeck College,
          <year>2012</year>
          .
          <article-title>Available from the authors</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Pieris</surname>
          </string-name>
          ,
          <year>2012</year>
          . Personal communication.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Beeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Vardi</surname>
          </string-name>
          ,
          <article-title>The implication problem for data dependencies</article-title>
          ,
          <source>in: Proc. of ICALP</source>
          ,
          <year>1981</year>
          , pp.
          <fpage>73</fpage>
          -
          <lpage>85</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Calì</surname>
          </string-name>
          , G. Gottlob,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          ,
          <article-title>A general datalog-based framework for tractable query answering over ontologies</article-title>
          ,
          <source>in: Proc. of PODS</source>
          ,
          <year>2009</year>
          , pp.
          <fpage>77</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>