<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Relaxed Regular Path Queries in Lightweight DLs (Extended Abstract)?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oliver Fernández Gil</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anni-Yasmin Turhan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Theoretical Computer Science</institution>
          ,
          <addr-line>TU Dresden</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Regular path queries (RPQs) is a well-investigated query language that dates back to the early 90's, where its capabilities to navigate graph-structured data attracted much attention in research on semistructured data and graph databases [10,6]. This interest was revived in recent years, since in many application areas data is graph-structured and represented in graph database models. Notable examples of applications of RPQs are querying biological networks, the semantic web and social networks. Moreover, RPQs and its extensions are part of SPARQL, which is the standard language recommended by the W3C to query RDF data. Formally, a graph database consists of a labeled directed graph, where edge labels correspond to binary predicates stating relations between data items. A RPQ consists of a regular language over these labels, and retrieves pairs of data items (a; b) that are connected by paths complying to the specified regular language. The extension of two-way RPQs (2RPQ) allows to traverse edges backwards, and the more expressive language of conjunctive 2RPQs (C2RPQ) allows conjunctions of 2RPQs that can share variables. In scenarios where a RPQ yields no answers over a particular database, it can be useful to relax the query to retrieve more than the classical answers, i.e., pairs that are connected by paths that are “similar enough” to the paths required by the query. This can be practical to provide feasible alternatives in applications where data is gathered automatically from heterogeneous data sources and exact semantics need not yield the expected results. Similarly, in applications where the data is irregular and evolves in structure and content, it can be hard for users to have full knowledge of its vocabulary and structure and queries that approximate/relax the set of answers may be helpful. Several approaches have been considered to address this problem, as for instance, [8,9,7,12]. In particular, [7] proposes an elegant and tractable solution that uses a weighted finite-state transducers to define the approximation semantics. Roughly speaking, such a transducer is a mechanism that transforms input words into corresponding output words, and computes a weight quantifying the cost of the transformation. The idea is to use a transducer as a means to specify which paths are allowed to be considered approximations/distortions of the “ideal” paths specified by the query, and to specify their distortion costs.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Approximate answers are then tuples (a; b; ), where is the minimal cost of
distorting a path that complies with the query, into a path leading from a to b.</p>
      <p>
        Path queries have also been investigated for ontology-mediated query
answering (OMQA), in which semantic knowledge provided in a background ontology
is used to enrich the data. Ontologies are often formulated in a description logic
(DL) which can be used to represent the conceptual knowledge of an
application domain in a structured and formally well-understood way. In contrast to
query answering over a (graph) database, OMQA usually adopts the open world
assumption where all possible models of the ontology and the data are
considered when computing the answers. Answering conjunctive 2-way regular path
queries (C2RPQs) has been studied for very expressive DLs [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and for the
E L and DL-Lite families of lightweight DLs [
        <xref ref-type="bibr" rid="ref13 ref2 ref3">3,13,2</xref>
        ]. However, approaches for
query answering under approximate semantics in the OMQA setting are scarce.
There is prior work on the simple case of instance queries [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and on C2RPQs
in the restricted setting of acyclic ontologies using RDFs schema [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] or
nongradual variants of conjunctive queries [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The goal of this paper is to define
approximate semantics for answering C2RPQs in DLs and to devise computation
algorithms for answering them in the DLs E LH and DL-LiteR.
      </p>
      <p>Our contributions are threefold. First, we extend the known
transducerbased approximate semantics from RPQs to the more general query language
of C2RPQs in the graph database setting. In contrast to RPQs, C2RPQs may
contain quantified variables and more than one query atom. This requires to
regard several matchings of the quantified variables and to combine the costs of
the distortions of each query atom. Second, we consider the setting of OMQA
and define approximate semantics for answering C2RPQs over DL ontologies.
We define the notion of certain approximate answers as a generalization of the
classical certain answers. Third, we investigate two reasoning problems for
certain approximate answers. We start with the decision problem that asks, given
a threshold value and a tuple a, is a a certain approximate answer with
approximation cost of at most ? Then we consider the problem of computing the
exact approximation cost of a. For 2RPQs, we devise a polynomial time
algorithm that can be used to solve both problems. Regarding C2RPQs, we prove
that a) both problems can be solved in polynomial time in data complexity, b)
the decision problem is in NExpTime in combined complexity, and c) we provide
a double exponential time algorithm (in combined complexity) to compute the
approximation cost.</p>
      <p>
        Several of our upper bounds are not tight, in the sense that they do not match
the hardness results inherited from the classical case. The exact data complexity
for DL-LiteR and the combined complexity of C2RPQs in both DLs remain
open: NL PTime and PSpace NexpTime (2ExpTime), respectively. Trying to
close (or reduce) these gaps constitutes ongoing work, where the case concerning
the combined complexity appears to represent the biggest challenge. On the one
hand, we are analysing whether the PSpace rewriting procedure proposed in
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] can really be adapted to answer C2RPQs under approximate semantics. On
the other hand, we are investigating if the approximation mechanism allows to
simulate more difficult query answering problems, like for instance, answering
nested 2RPQs which is ExpTime-hard in both DLs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>Regarding future research, we plan to consider other semi-rings and study
which approximation patterns they allow to express, as well as the impact on
the computational complexity of the considered problems.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortiz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simkus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Nested regular path queries in description logics</article-title>
          . In: Baral,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Giacomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.D.</given-names>
            ,
            <surname>Eiter</surname>
          </string-name>
          , T. (eds.)
          <source>Principles of Knowledge Representation and Reasoning: Proceedings of the Fourteenth International Conference, KR 2014</source>
          , Vienna, Austria,
          <source>July 20-24</source>
          ,
          <year>2014</year>
          . AAAI Press (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortiz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simkus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Regular path queries in lightweight description logics: Complexity and algorithms</article-title>
          .
          <source>J. Artif. Intell. Res</source>
          .
          <volume>53</volume>
          ,
          <fpage>315</fpage>
          -
          <lpage>374</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Giacomo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lembo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosati</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Tractable reasoning and efficient query answering in description logics: The DL-Lite family</article-title>
          .
          <source>J. Autom. Reasoning</source>
          <volume>39</volume>
          (
          <issue>3</issue>
          ),
          <fpage>385</fpage>
          -
          <lpage>429</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eiter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortiz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Answering regular path queries in expressive description logics via alternating tree-automata</article-title>
          .
          <source>Inf. Comput</source>
          .
          <volume>237</volume>
          ,
          <fpage>12</fpage>
          -
          <lpage>55</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ecke</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peñaloza</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turhan</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          :
          <article-title>Similarity-based relaxed instance queries</article-title>
          .
          <source>Journal of Applied Logic</source>
          <volume>13</volume>
          (
          <issue>4</issue>
          , Part 1),
          <fpage>480</fpage>
          -
          <lpage>508</lpage>
          (
          <year>2015</year>
          ), special Issue for the Workshop on Weighted Logics for AI 2013
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Florescu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levy</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suciu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Query containment for conjunctive queries with regular expressions</article-title>
          . In: Mendelzon,
          <string-name>
            <given-names>A.O.</given-names>
            ,
            <surname>Paredaens</surname>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 1-3</source>
          ,
          <year>1998</year>
          , Seattle, Washington, USA. pp.
          <fpage>139</fpage>
          -
          <lpage>148</lpage>
          . ACM Press (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Grahne</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Regular path queries under approximate semantics</article-title>
          . Ann. Math. Artif. Intell.
          <volume>46</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>165</fpage>
          -
          <lpage>190</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jagadish</surname>
            ,
            <given-names>H.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mendelzon</surname>
            ,
            <given-names>A.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Milo</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Similarity-based queries</article-title>
          . In: Yannakakis, M. (ed.)
          <source>Proceedings of the Fourteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, May 22-25</source>
          ,
          <year>1995</year>
          , San Jose, California, USA. pp.
          <fpage>36</fpage>
          -
          <lpage>45</lpage>
          . ACM Press (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kanza</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sagiv</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Flexible queries over semistructured data</article-title>
          .
          <source>In: PODS. ACM</source>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mendelzon</surname>
            ,
            <given-names>A.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wood</surname>
          </string-name>
          , P.T.:
          <article-title>Finding regular simple paths in graph databases</article-title>
          .
          <source>SIAM J. Comput</source>
          .
          <volume>24</volume>
          (
          <issue>6</issue>
          ),
          <fpage>1235</fpage>
          -
          <lpage>1258</lpage>
          (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Peñaloza</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thost</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turhan</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          :
          <article-title>Query answering for rough EL ontologies</article-title>
          . In: Thielscher,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Toni</surname>
          </string-name>
          ,
          <string-name>
            <surname>F</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of 16. International Conference on Principles of Knowledge Representation and Reasoning (KR</source>
          <year>2018</year>
          ). pp.
          <fpage>399</fpage>
          -
          <lpage>408</lpage>
          . AAAI (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Poulovassilis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Selmer</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wood</surname>
          </string-name>
          , P.T.:
          <article-title>Approximation and relaxation of semantic web path queries</article-title>
          .
          <source>J. Web Semant</source>
          .
          <volume>40</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Stefanoni</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rudolph</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The complexity of answering conjunctive and navigational queries over OWL 2 EL knowledge bases</article-title>
          .
          <source>J. Artif. Intell. Res</source>
          .
          <volume>51</volume>
          ,
          <fpage>645</fpage>
          -
          <lpage>705</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>