<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Width Revisited (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Georg Gottlob</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matthias Lanzinger</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Reinhard Pichler</string-name>
          <email>pichlerg@dbai.tuwien.ac.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>TU Wien</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Oxford</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Answering conjunctive queries (CQ) and, equivalently, solving constraint satisfaction problems (CSP), are among the most fundamental tasks in computer science. While these problems are NP-complete, there has been much success in identifying \islands of tractability". In this work, we want to further sharpen the image of this complexity landscape. In particular, we are interested in extending the pioneering work by Barcelo, Pieris, Romero, and Vardi [1, 2], Dalmau, Kolaitis, and Vardi [4], and Grohe [6] on the deep connections between query minimization and structural decompositions. To this end, the following notion of semantic widths will be the central theme of our work. Note that from now on, we only concentrate on CQs. Clearly, all results hold for CSPs as well. De nition 1. Let Q be the class of all conjunctive queries and w : Q ! R+ be some property of the query. We de ne semantic w as sem-w(q) := inffw(q0) j q0 ' qg, where ' denotes homomorphic equivalence.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Such semantic widths can be interpreted as measures of the inherent structural
complexity of the underlying question posed by the query, whereas classical
notions of width express the complexity of a speci c way to pose the question.</p>
      <p>
        So far, semantic notions of acyclicity [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], treewidth (tw ) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and (generalized)
hypertree width (hw and ghw , resp.) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] have been investigated, while more
powerful notions of width such as fractional hypertree width (fhw ) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], submodular
width (subw ) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], and adaptive width (adw ) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] have been left unexplored. The
goal of this work is to study semantic notions also of fhw , adw , and subw .
Recall that the semantic notions of acyclicity, tw , and ghw can be characterized in
terms of the core of the CQ. This naturally raises the question if such a
characterization is also possible for fhw , subw , and adw . We will give an a rmative
answer which, structurally, looks very similar to the previous results. However,
some additional machinery will be needed to prove our new results.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], subw was introduced to provide an in some sense \complete"
characterization of the xed-parameter tractability (FPT) of CQ Evaluation. Our
investigation of the semantic version of subw will provide new input for such an
FPT-characterization. More speci cally, our new notion of sem-subw will allow
us to identify an FPT-fragment of CQ Evaluation which is strictly bigger than
the fragment de ned via subw in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        Strongly related to the search for ( xed-parameter) tractable fragments of
CQ Evaluation is the complexity of the Check problem which, for given integer
k 1, is about deciding if a given CQ has width k. Among the width notions
mentioned above, this problem is tractable for tw and hw and NP-complete for
ghw and fhw . For subw and adw , the complexity is expected to be even higher.
When moving to semantic notions, the computation of the core introduces a
further source of intractability. In case of ghw and fhw , several structural
properties of the hypergraph underlying a CQ have been identi ed recently [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to
make the Check problem tractable. We will show that these properties may
also be helpful for the computation of the semantic variants of ghw and fhw .
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>
        We assume the reader to be familiar with basic concepts such as conjunctive
queries (CQs) and their associated hypergraphs. We implicitly extend
properties of hypergraphs to CQs through their associated hypergraphs. Due to space
limitations, we refer to [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] for de nitions of the fractional cover number ( ),
generalized hypertree width (ghw), fractional hypertree width (f hw), adaptive
width (adw), and submodular width (subw).
      </p>
      <p>We are mainly interested in two computational problems here. The rst one
is the usual query evaluation problem for a class of CQs Q, which we denote as
Eval(Q). The decision problem of checking, whether a query has width k for
width notion w, will be referred to as Check(w; k).</p>
      <p>
        The problem Check(w; k) with w 2 fghw ; fhw g has been shown NP-complete
even for k = 2 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. On the positive side, also tractable fragments of this problem
have been identi ed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] via the following hypergraph properties: for a
hypergraph H, the c-multi-intersection width of H refers to the maximum cardinality
of an intersection of any c distinct edges. For the special case c = 2, we simply
use the term intersection width. The degree of H is the maximum number of
edges a vertex of H occurs in. The rank of H is the maximum edge size in H.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>The following theorem is the basis of all our further considerations:
Theorem 1. For every conjunctive query q:
1. sem- (q) = (Core(q))
2. sem-f hw(q) = f hw(Core(q))
3. sem-adw(q) = adw(Core(q))
4. sem-subw(q) = subw(Core(q))</p>
      <p>
        An interesting consequence of Theorem 1 is that bounded semantic width
of a class Q of CQs, for the notions of width enumerated in the theorem,
implies FPT of the Eval(Q) problem when parameterized by the query. This
is due to the fact that core computation only depends on the query (not the
data). This is of particular interest in the case of bounded sem-subw, because
it properly generalizes bounded submodular width, and as such \escapes" the
FPT-characterization theorem of Marx [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. To see that the generalization is in
fact proper, consider the class of \grid queries" QGn , such that QGn asks if
a given undirected graph contains a grid of size n. The core of query QGn
simply asks for the existence of a single (undirected) edge. However, the
associated hypergraphs of the family (QGn )n 1 include grids of every dimension.
Hence, (QGn )n 1 has unbounded treewidth. By considering the fractional
independent set where every vertex has weight 1=rank (H), we get the inequality
subw(H) adw(H) (tw (H) + 1)=rank (H). It is then clear that QGn has
unbounded subw , which is in sharp contrast to sem-subw(QGn ) = 1.
Corollary 1. Let Q be a class of CQs. Then bounded sem-f hw, sem-subw, and
sem-adw are su cient conditions for the xed-parameter tractability of Eval(Q)
(parameterized by the query). Furthermore, they properly subsume bounded fhw ,
subw , and adw , respectively.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], it was shown that the Check problem of ghw and fhw becomes
tractable if the underlying hypergraphs satisfy certain properties such as bounded
(multi-)intersection width, bounded degree, and/or bounded rank. Note that all
these properties are hereditary in the sense that deletion of vertices and/or edges
from a hypergraph does not destroy these properties. Thus, using Theorem 1, we
can directly identify tractable fragments of Check for sem-ghw and sem-f hw.
We present Corollary 2 as an illustrative example for various similar results.
Corollary 2. For a constant k &gt; 0, let Q be a class of conjunctive queries with
bounded fractional hypertree width and bounded degree, then Check(sem-f hw; k)
is tractable in Q.
      </p>
      <p>
        There exist classes with bounded fhw but unbounded ghw [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, little
is known about the conditions under which this can occur. We show that for
classes with either bounded degree or bounded intersection width, the properties
of bounded fhw and bounded ghw (and, therefore, also bounded hw ) in fact
coincide. Furthermore, since sem-f hw and sem-ghw(cf. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]) are characterized by
the width of the core, it is easy to generalize the result to the semantic case. As a
direct consequence, the promise algorithm presented by Chen and Dalmau in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
can be adapted to classes with bounded sem-f hw and either bounded degree or
bounded intersection width, making evaluation of these classes tractable.
Theorem 2. Let q be a conjunctive query and let its associated hypergraph have
degree d and intersection width i. The following statements hold:
{ f hw(q) ghw(q) d f hw(q) (Implicit in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ])
{ sem-f hw(q)
sem-ghw(q)
      </p>
      <p>d sem-f hw(q)
{ f hw(q)
ghw(q)</p>
      <p>2i f hw(q)2 + 2 f hw(q)
{ sem-f hw(q)
sem-ghw(q)</p>
      <p>2i sem-f hw(q)2 + 2 sem-f hw(q)</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>So far, we have given a characterization of the semantic variants of , f hw, adw,
and subw. From this we are able to derive new insights into the complexity of
CQs. Figure 1 illustrates our current view of the complexity landscape of CQs.</p>
      <p>
        Many consequences of Theorem 1 are yet to be explored. Of particular
interest is the role of sem-subw. Marx has shown that bounded subw characterizes
those hypergraphs for which evaluation of the associated CQs is xed-parameter
tractable [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The natural next step is to characterize precisely those CQs for
which the evaluation is xed-parameter tractable. Semantic submodular width
is a natural candidate step in this direction but the question whether it provides
a necessary condition for xed-parameter tractable CQ evaluation remains an
important open problem for future work.
      </p>
      <p>sem-f hw
sem-ghw
sem-f hwBIP
sem-f hwBDP
sem-subw
sem-adw
ghw
tw; hw
ghwLogBMIP
f hwBDP
f hw
subw
adw</p>
      <p>Check &amp; Eval tractable</p>
      <sec id="sec-4-1">
        <title>Check hard</title>
        <p>Eval tractable</p>
      </sec>
      <sec id="sec-4-2">
        <title>Eval FPT</title>
        <p>Acknowledgements This work was supported by the Austrian Science Fund
(FWF):P30930-N35.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Barcelo</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pieris</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Semantic optimization in tractable classes of conjunctive queries</article-title>
          .
          <source>SIGMOD Rec</source>
          .
          <volume>46</volume>
          (
          <issue>2</issue>
          ),
          <volume>5</volume>
          {
          <fpage>17</fpage>
          (Sep
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Barcelo</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vardi</surname>
          </string-name>
          , M.Y.:
          <article-title>Semantic acyclicity on graph databases</article-title>
          .
          <source>SIAM Journal on Computing</source>
          <volume>45</volume>
          (
          <issue>4</issue>
          ),
          <volume>1339</volume>
          {
          <fpage>1376</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dalmau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Beyond hypertree width: Decomposition methods without decompositions</article-title>
          .
          <source>In: Proc. CP</source>
          <year>2005</year>
          . pp.
          <volume>167</volume>
          {
          <fpage>181</fpage>
          . Springer (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Dalmau</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolaitis</surname>
            ,
            <given-names>P.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vardi</surname>
          </string-name>
          , M.Y.:
          <article-title>Constraint satisfaction, bounded treewidth, and nite-variable logics</article-title>
          .
          <source>In: Proc. CP</source>
          <year>2002</year>
          . pp.
          <volume>310</volume>
          {
          <fpage>326</fpage>
          . Springer (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Fischl</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottlob</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pichler</surname>
          </string-name>
          , R.:
          <article-title>General and fractional hypertree decompositions: Hard and easy cases</article-title>
          .
          <source>In: Proc. PODS</source>
          <year>2018</year>
          . pp.
          <volume>17</volume>
          {
          <fpage>32</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Grohe</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The complexity of homomorphism and constraint satisfaction problems seen from the other side</article-title>
          .
          <source>Journal of the ACM (JACM) 54(1)</source>
          ,
          <volume>1</volume>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Grohe</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marx</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Constraint solving via fractional edge covers</article-title>
          .
          <source>ACM Trans. Algorithms</source>
          <volume>11</volume>
          (
          <issue>1</issue>
          ), 4:
          <issue>1</issue>
          {4:
          <issue>20</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Marx</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Tractable structures for constraint satisfaction with truth tables</article-title>
          .
          <source>Theory of Computing Systems</source>
          <volume>48</volume>
          (
          <issue>3</issue>
          ),
          <volume>444</volume>
          {
          <fpage>464</fpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Marx</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Tractable hypergraph properties for constraint satisfaction and conjunctive queries</article-title>
          .
          <source>Journal of the ACM</source>
          <volume>60</volume>
          (
          <issue>6</issue>
          ),
          <volume>42</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>