<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Characterising Fixed Parameter Tractability of Query Evaluation Over Guarded TGDs (Extended Abstract)?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cristina Feier</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bremen</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>We consider the usual scenario of ontology mediated querying where queries are posed to a database whose knowledge is enriched by means of an ontology. A tuple (L; Q), where L is an ontology language and Q is a query language, is referred to as an OMQ language. One thoroughly explored fragment of FOL as a basis for ontology speci cation languages is that of tuple generating dependencies (TGDs). A tgd is a rule (logical implication) having as body and head conjunctions of atoms, where some variables occurring in head atoms might be existentially quanti ed (all other variables are universally quanti ed). As such, it potentially allows the derivation of atoms over fresh individuals (individuals not mentioned in the database). Answering (even atomic) queries with respect to sets of tgds is undecidable [7]. However, there has been lots of work on identifying decidable fragments [7, 1, 9]. A prominent such fragment is that of Guarded TGDs (GTGD) [7]: a tgd is guarded if all universally quanti ed variables occur as terms of some body atom, called guard. Many Description Logic languages like E LH or E LHI can be expressed using GTGDs. While query answering with respect to GTGDs is decidable, the combined complexity of the problem is high: exptime-complete for bounded arity schemas, and 2ExpTime-complete in general. At the same time, while the fragment is tractable w.r.t. data complexity [6], the coe cient of the polynomial depends on the size of the ontology, which might be relatively high. By xing the set of tgds, the combined complexity drops to NP for evaluating conjunctive queries (CQs), and to PTime for evaluating atomic queries and CQs of bounded treewdith [8] which is similar to the complexity of query evaluation over databases [15]. A natural question is when can OMQs from (GTGD, UCQ) be evaluated e ciently? The above question has been partially answered in [2]. There, the complexity of evaluating OMQs over bounded arity schemas is studied in a parameterized setting, with the parameter being the size of the OMQ. Under the assumption that FPT 6= W[1], it is shown that a recursively enumerable (r.e.) class of OMQs over bounded arity schemas can is xed parameter tracatable (fpt) i it has bounded semantic treewidth, i.e. there exists a k such that every OMQ from the class is equivalent to another OMQ from (GTGD, UCQ) whose UCQ has syntactic treewidth k [4]. The work generalizes previous results concerning</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        e ciency of evaluation of OMQs over bounded arity schemas based on the DL
languages E LH and E LHI from [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        The characterization from [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is similar to the well-known result of Grohe
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], concerning the parameterized complexity of CQ evaluation over databases:
the evaluation of a r.e. class of CQs is fpt i the class has bounded semantic
treewidth, i.e. there exists a class of equivalent CQs of bounded treewidth. The
similarity is not coincidental, as the results build from [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] on the results of Grohe
in a non-trivial way. In fact, the lower bound proof uses a central construction
from [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], but has to employ sophisticated techniques to adapt this to OMQs.
      </p>
      <p>The main research question we address in this paper is the following: when is
it possible to e ciently evaluate OMQs from (GTGD, UCQ) in the case where
there is no restriction concerning schema arity? We consider again a
parameterized setting, where the parameter is the size of the OMQ. Given a r.e. class
of OMQ Q from (GTGD, UCQ), we denote with p-OMQ(Q) the parameterized
problem of evaluating OMQs from Q. As our rst main result shows, in this case
another structural measure for OMQs/UCQs plays a central role: submodular
width. Similarly to the de nition of semantic treewidth for OMQs, it is possible
to de ne a notion of semantic submodular width of an OMQ. We obtain the
following:</p>
      <p>Main Result 1. Let Q be a r. e. class of OMQs from (GTGD, UCQ).
Assuming the Exponential Time Hypothesis, p-OMQ(Q) is xed-parameter tractable
i Q has bounded semantic submodular width.</p>
      <p>
        To prove the result, we use the fact that every OMQ from (GTGD, UCQ)
can be rewritten into an OMQ from (GDLog, UCQ) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where GDLog stands
for Guarded Datalog, the restriction of GTGD to rules with only universally
quanti ed variables. For OMQs from (GDLog, UCQ), we construct equivalent
OMQs called covers which witness bounded semantic submodular width. Covers
are based on so-called characteristic databases, which are databases that entail
an OMQ and which are su ciently minimal with respect to the homomorphism
order, in a very speci c sense.
      </p>
      <p>Typically, in a database setting, the database induced by a CQ (or its core)
can be seen as a canonical database which entails the query and on which to
base further constructions. However, in the case of OMQs which pose
restrictions on the database schema this is no longer possible: a CQ might contain
symbols which are not allowed to occur in a database. In fact, this is a typical
usage of ontologies: to enrich the database schema with new terminology. As
such, we identify other representative databases for OMQs. Based on these new
concepts, for OMQs from (GDLog, UCQ) it is possible to obtain an alternative
characterization for fpt evaluation based on syntactic measures:</p>
      <p>Main Result 2. For Q a r. e. class of OMQs from (GDLog, UCQ), let Qc
and DQ be the classes of covers and characteristic databases for OMQs from Q,
respectively. Under the Exponential Time Hypothesis, the following statements
are equivalent:</p>
    </sec>
    <sec id="sec-2">
      <title>1. p-OMQ(Q) is xed-parameter tractable;</title>
    </sec>
    <sec id="sec-3">
      <title>2. Qc has bounded submodular width;</title>
      <p>3. DQ has bounded submodular width.</p>
      <p>
        The hardness result for the above characterization exploits the seminal
result of Marx [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] concerning fpt evaluation of uniform CSPs closed under
underlying hypergraphs. For a r.e. class of structures A, the parameterized uniform
CSP problem p-CSP(A; ) asks whether there exists a homomorphism from some
structure A in A (with jAj being the parameter) to an arbitrary relational
structure. The result of Marx has been lifted to arbitrary uniform CSPs in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]:
Theorem 1 (Theorem 1, [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]). Let C be a r.e. class of structures. Assuming
the Exponential Time Hypothesis, p-CSP(C; ) is xed-parameter tractable i C
has bounded semantic submodular width.
      </p>
      <p>
        The problem of evaluating uniform CSPs can be seen as an alternative
presentation of the problem of evaluating a r.e. class of Boolean CQs over databases.
In fact, Grohe's characterization for fpt evaluation of r. e. classes of CQs in the
bounded arity case has been achieved via a uniform CSP detour [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. To exploit
the result from [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] concerning evaluation of uniform CSPs, we introduce an
fptreduction from parameterized uniform CSP evaluation to parameterized OMQ
evaluation which preserves semantic submodular width.
      </p>
      <p>Main Result 3. For Q an OMQ from (GDLog; UCQ), there exists an
fptreduction from p-CSP(DQ; ) to p-OMQ(fQg).</p>
      <p>
        The reduction is important also as a stand-alone result as it can be used as
a black-box tool to port results from the uniform CSP/database realm to the
OMQ one. To further showcase this, we revisit the problem of evaluating r.e.
classes of OMQs over bounded arity schemas, previously addresed in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and
establish analogues of Main Result 1 and Main Result 2. In this case, submodular
width is replaced with treewidth and the Exponential Time Hypothesis with
the assumption that FPT 6= W[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This is due to the fact that the set of
characteristic databases of an OMQ has the same treewidth as its cover. As such,
we retrieve the semantic characterization from [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] without going into details of
the construction employed by Grohe in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], and also establish an alternative
syntactic characterization for fpt evaluation of OMQs from (GDLog, UCQ) over
bounded arity schemas using the newly introduced notions of cover and
characteristic databases.
      </p>
      <p>
        So far we have only considered Boolean OMQs, as the corresponding results
for CQ evaluation in the unbounded arity case have also only been established
in the Boolean case. As future work, we plan to extend our work to the
nonBoolean case. Many results from the database world concerning e ciency of
performing tasks like counting [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], enumeration [
        <xref ref-type="bibr" rid="ref10 ref5">5, 10</xref>
        ], and so on, use structural
measures on the queries similar to the ones involved in the characterizations
for evaluation. Thus, by generalizing our constructs to the non-Boolean case,
depending on which structural measures are preserved when transitioning from
sets of characteristic databases to covers of OMQs, we might be able to lift such
results to the OMQ world.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Jean-Francois</surname>
            <given-names>Baget</given-names>
          </string-name>
          , Michel Leclere,
          <string-name>
            <surname>Marie-Laure Mugnier</surname>
            , and
            <given-names>Eric</given-names>
          </string-name>
          <string-name>
            <surname>Salvat</surname>
          </string-name>
          .
          <article-title>On rules with existential variables: Walking the decidability line</article-title>
          .
          <source>Artif</source>
          . Intell.,
          <volume>175</volume>
          (
          <fpage>9</fpage>
          - 10):
          <volume>1620</volume>
          {
          <fpage>1654</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Barcelo</surname>
          </string-name>
          , Victor Dalmau, Cristina Feier, Carsten Lutz, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Pieris</surname>
          </string-name>
          .
          <article-title>The limits of e ciency for open- and closed-world query evaluation under guarded tgds</article-title>
          .
          <source>In PODS</source>
          , pages
          <volume>259</volume>
          {
          <fpage>270</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Barcelo</surname>
          </string-name>
          , Cristina Feier, Carsten Lutz, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Pieris</surname>
          </string-name>
          .
          <article-title>When is ontologymediated querying e cient? In LICS</article-title>
          , pages
          <volume>1</volume>
          {
          <fpage>13</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Pablo</given-names>
            <surname>Barcelo</surname>
          </string-name>
          , Diego Figueira, Georg Gottlob, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Pieris</surname>
          </string-name>
          .
          <article-title>Semantic optimization of conjunctive queries</article-title>
          .
          <year>2019</year>
          . Under submission, available at http://homepages.inf.ed.ac.uk/apieris/BFGP.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Christoph</given-names>
            <surname>Berkholz</surname>
          </string-name>
          and
          <string-name>
            <given-names>Nicole</given-names>
            <surname>Schweikardt</surname>
          </string-name>
          .
          <article-title>Constant delay enumeration with fpt-preprocessing for conjunctive queries of bounded submodular width</article-title>
          .
          <source>In MFCS</source>
          <year>2019</year>
          , volume
          <volume>138</volume>
          , pages
          <issue>58:1</issue>
          {
          <fpage>58</fpage>
          :
          <fpage>15</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Cal</surname>
          </string-name>
          , Georg Gottlob, and
          <string-name>
            <given-names>Thomas</given-names>
            <surname>Lukasiewicz</surname>
          </string-name>
          .
          <article-title>A general datalog-based framework for tractable query answering over ontologies</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>14</volume>
          :
          <fpage>57</fpage>
          {
          <fpage>83</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Cal</surname>
          </string-name>
          , Georg Gottlob, and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kifer</surname>
          </string-name>
          .
          <article-title>Taming the in nite chase: Query answering under expressive relational constraints</article-title>
          .
          <source>J. Artif. Intell. Res.</source>
          ,
          <volume>48</volume>
          :
          <fpage>115</fpage>
          {
          <fpage>174</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Cal</surname>
          </string-name>
          , Georg Gottlob, and Thomas Lukasiewicz.
          <article-title>Datalog : a uni ed approach to ontologies and integrity constraints</article-title>
          .
          <source>In ICDT</source>
          , pages
          <volume>14</volume>
          {
          <fpage>30</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Andrea</given-names>
            <surname>Cal</surname>
          </string-name>
          , Georg Gottlob, and
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Pieris</surname>
          </string-name>
          .
          <article-title>Towards more expressive ontology languages: The query answering problem</article-title>
          .
          <source>Artif</source>
          . Intell.,
          <volume>193</volume>
          :
          <fpage>87</fpage>
          {
          <fpage>128</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <article-title>Nofar Carmeli and Markus Kroll. Enumeration Complexity of Conjunctive Queries with Functional Dependencies</article-title>
          .
          <source>In (ICDT</source>
          <year>2018</year>
          ), volume
          <volume>98</volume>
          , pages
          <issue>11:1</issue>
          {
          <fpage>11</fpage>
          :
          <fpage>17</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hubie</surname>
            <given-names>Chen</given-names>
          </string-name>
          , Georg Gottlob, Matthias Lanzinger, and
          <string-name>
            <given-names>Reinhard</given-names>
            <surname>Pichler</surname>
          </string-name>
          .
          <article-title>Semantic width and the xed-parameter tractability of constraint satisfaction problems</article-title>
          . In Christian Bessiere, editor,
          <source>IJCAI</source>
          , pages
          <volume>1726</volume>
          {
          <fpage>1733</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Hubie</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Mengel</surname>
          </string-name>
          .
          <article-title>A Trichotomy in the Complexity of Counting Answers to Conjunctive Queries</article-title>
          .
          <source>In (ICDT</source>
          <year>2015</year>
          ), volume
          <volume>31</volume>
          , pages
          <fpage>110</fpage>
          {
          <fpage>126</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Martin</given-names>
            <surname>Grohe</surname>
          </string-name>
          .
          <article-title>The complexity of homomorphism and constraint satisfaction problems seen from the other side</article-title>
          .
          <source>J. ACM</source>
          ,
          <volume>54</volume>
          (
          <issue>1</issue>
          ):1:
          <issue>1</issue>
          {1:
          <fpage>24</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Marx</surname>
          </string-name>
          .
          <article-title>Tractable hypergraph properties for constraint satisfaction and conjunctive queries</article-title>
          .
          <source>In STOC</source>
          , pages
          <volume>735</volume>
          {
          <fpage>744</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>Mihalis</given-names>
            <surname>Yannakakis</surname>
          </string-name>
          .
          <article-title>Algorithms for acyclic database schemes</article-title>
          .
          <source>In VLDB</source>
          , pages
          <volume>82</volume>
          {
          <fpage>94</fpage>
          ,
          <year>1981</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>