<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>gOWL: A Fast Ontology-Mediated Query Answering</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Chenhong Meng</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaowang Zhang</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guohui Xiao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhiyong Feng</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guilin Qi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Faculty of Computer Science, Free University of Bozen-Bolzano</institution>
          ,
          <addr-line>Bolzano I-39100</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computer Science and Engineering, Southeast University</institution>
          ,
          <addr-line>Nanjing 211189</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Computer Science and Technology, Tianjin University</institution>
          ,
          <addr-line>Tianjin 300350</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Computer Software,Tianjin University</institution>
          ,
          <addr-line>Tianjin 300350</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Tianjin Key Laboratory of Cognitive Computing and Application</institution>
          ,
          <addr-line>Tianjin 300350</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This poster shows an ontology-mediated query answering system (gOWL) based on pure materialization to avoid query rewriting online. gOWL shows that answering queries over a partial materialization of the chase is complete for a large fragment of practical queries with bounded depth. From a system engineering perspective, the materialization approach allows us to design a modular architecture to integrate off-the-shelf efficient SPARQL query engines. The poster will be based on DBpedia and UOBM datasets. We will compare the performance of gOWL with PAGOdA, Ontop, and Pellet (with speedup up to three orders of magnitude).</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        OMQ (Ontology-mediated Query) is a core reasoning task in many applications [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
OMQs have been studies intensively on lightweight ontology languages, that have
canonical model properties. There are basically two approaches for query answering, namely,
materialization-based and query rewriting-based. Materialization-based approaches
normally compute and materialize the chase first and then execute the queries over the
materialization, such as RDFox [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and PAGOdA [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. However, materialization is often
infeasible when the chase contains infinitely existentially entailed elements. Instead,
query writing-based approaches avoid materializing the chase but rewrite input queries
by compiling the consequence of the reasoning into the query, such as QuOnto [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
Clipper [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Ontop [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Query rewriting comes at the cost of query rewriting and
query evaluation at runtime, and the possibility of missing optimization opportunity at
the data level.
      </p>
      <p>In this poster, we adopt a pure materialization-based approach which allows us to
design a modular architecture to integrate off-the-shelf efficient SPARQL query
engines. We implement the proposed approach in a prototype gOWL, and build an OMQ
systems gOWL-3X by employing RDF-3X. The preliminary encouraging experiments
on DBpedia and UOBM show that gOWL outperforms PAGOdA, Ontop, and Pellet.</p>
    </sec>
    <sec id="sec-2">
      <title>Framework of gOWL</title>
      <p>The approach proposed in this Section has been implemented in the gOWL system 6.
The framework of gOWL contains three modules, namely, Query Processor,
Normalized Model Constructor, and Query Execution shown in the following figure.</p>
      <sec id="sec-2-1">
        <title>SPARQL Query KB</title>
        <p>gOWL</p>
        <p>Query
Processor</p>
      </sec>
      <sec id="sec-2-2">
        <title>Normalized Model Constructor</title>
        <p>SPARQL
Graph
Query
RDF
Dataset</p>
      </sec>
      <sec id="sec-2-3">
        <title>Query Execution</title>
      </sec>
      <sec id="sec-2-4">
        <title>Model</title>
      </sec>
      <sec id="sec-2-5">
        <title>Selector</title>
      </sec>
      <sec id="sec-2-6">
        <title>SPARQL API</title>
      </sec>
      <sec id="sec-2-7">
        <title>Engine</title>
      </sec>
      <sec id="sec-2-8">
        <title>Selector</title>
      </sec>
      <sec id="sec-2-9">
        <title>SPARQL</title>
        <p>Query</p>
        <p>Engine
(Centralized/
Distributed)</p>
      </sec>
      <sec id="sec-2-10">
        <title>Result</title>
        <p>Query Processor Query Processor is a module to compute the depth of a query by
applying a depth-first-search method. We use dep(q) to denote the depth of q, which
represents the length of maximal certain path in q. The nodes in the path must be starting
of a quantified-free variable of q and all other nodes are quantified variables.
Normalized Model Constructor In Normalized Model Constructor, we compute
nstep universal models UKn (i.e., U K0 : : : UKn) for given a KB K = (T ; A) a natural number
0 means to
n. n represents the count of expanded steps from ABox. For example, UK
expand 0 step from ABox, which equivalent to itself; U K1 means to expand 1 step from
0 (i.e., ABox), the expansion condition is (T , U K0) j= 9 R1(a) but R1(a; b) 62 U K0,
UK
(n 1) (i.e., n-step from ABox),
for all b 2 Ind(U K0); UKn means to expand 1 step from UK
the expansion condition is (T , U K(n 1)) j= 9 R1(a) but R1(a; b) 62 U K(n 1), for all
b 2 Ind(U K(n 1)).</p>
        <p>
          Intuitively speaking, n-step universal models of a KB are hierarchical extensions of
the ABox via the TBox. In fact, based on the statistical analysis of practical SPARQL
queries, over 96% of queries contain at most 7 triple patterns (i.e.,7 triples in a SPARQL
query) [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Therefore, we only consider n no more than 7 in this poster.
Query Execution The module of Query Execution contains four parts, namely Model
Selector, SPARQL API, Engine Selector, and SPARQL Query Engine. Model Selector
i for a given query that i = dep can be satisfied. Through
is used to select a suitable UK
SPARQL API, the query and dataset are passed on Engine Selector which utilizes the
information of them and the characteristics of each engine to recommend the suitable
one. Finally we employ SPARQL Query Engine to return solutions.
6 https://github.com/liulovemeng/gOWL.git
        </p>
        <p>Q1</p>
        <p>Q2
107 gOWL-3X PAGOdA Ontop
106
105
i()snem104
m
i
try103
eu
Q
102
101</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments and evaluations</title>
      <p>The experiments are carried out on a machine running Linux, which has 4 CPUs with
6 cores and 64GB memory. RDF-3X is used as the underlying SPARQL query engines.
We utilized UOBM and DBpedia data as a standard of evaluation.</p>
      <p>
        We evaluate on a dataset (around 12 million triples) of DBpedia ontology in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and
U7K is computed in 48 hours. The experimental results of 10 queries (Q1 Q10) over
three engines (i.e., gOWL-3X, PAGOdA, Pellet) are shown in the Figure 2. In a same
way, the evaluate on UOBM [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] over three engines (i.e., gOWL-3X, PAGOdA, Ontop)
are shown in the Figure 3 and Figure 4 (Note that we only list experimental results in
24 hours and the ordinate represents the total online time of query answering).
      </p>
      <p>Q3</p>
      <p>Q4</p>
      <p>Q5</p>
      <p>Q6</p>
      <p>Q7</p>
      <p>Q8</p>
      <p>Q9</p>
      <p>Q10</p>
      <p>We find that gOWL-3X significantly improve the performance of all 21 queries
comparing to PAGOdA, Ontop and Pellet which represent materialization-based
approach, query rewriting-based approach and traditional reasoning machine, respectively.
Since Ontop doesn’t have mapping information about DBpedia, it cannot handle DBpdiea
dataset. From the figures we can see that gOWL-3X is several orders of magnitude
higher than other methods, because it has the advantage of changing data processing
to offline before the query coming. In addition, its performance remains good with
data size increases (UOBM100 UOBM1000 ). Under the same conditions, PAGOdA
couldn’t handle large-scale dataset effectively because its query execution level is
dependent on RDFox, which finishes processing the data online, so the memory
requirements are much stronger.
101
100</p>
      <p>Q1</p>
      <p>Q2</p>
      <p>Q3</p>
      <p>Q4</p>
      <p>Q5</p>
      <p>Q6</p>
      <p>Q7</p>
      <p>Q8</p>
      <p>Q9</p>
      <p>Q10</p>
      <p>Q11
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In this poster, we have presented the gOWL system for ontology-mediated query
answering. The approach take advantage of high-performance of off-on-shelf SPARQL
query engines for supporting large-scale ontology query answering in an efficient and
simple way. Based on DBpedia and UOBM datasets, gOWL outperforms existing
engines significantly.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work is supported by the National Natural Science Foundation of China (61502336),
the National Key R&amp;D Program of China (2016YFB1000603), the Key Technology
R&amp;D Program of Tianjin (16YFZCGX00210), and the Seed Foundation of Tianjin
University (2018XZC-0016).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bienvenu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Ontology-mediated query answering: Harnessing knowledge to get more from data</article-title>
          .
          <source>In Proc. of IJCAI'16</source>
          , pp.
          <fpage>4058</fpage>
          -
          <lpage>4061</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Botoeva</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santarelli</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Savo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Solimando</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            and
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Beyond OWL 2 QL in OBDA: Rewritings and approximations</article-title>
          .
          <source>In Proc. of AAAI'16</source>
          , pp.
          <fpage>921</fpage>
          -
          <lpage>928</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cogrel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Komla-Ebri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontchakov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lanti</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rezk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>RodriguezMuro</surname>
          </string-name>
          , M., and
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Ontop: Answering SPARQL queries over relational databases</article-title>
          .
          <source>Semantic Web J.</source>
          ,
          <volume>8</volume>
          (
          <issue>3</issue>
          ):
          <fpage>471</fpage>
          -
          <lpage>487</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Eiter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ortiz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simkus</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>T.K.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Query rewriting for HornSHIQ plus rules</article-title>
          .
          <source>In Proc. of AAAI'12</source>
          , pp.
          <fpage>726</fpage>
          -
          <lpage>733</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rao</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>On the statistical analysis of practical SPARQL queries</article-title>
          .
          <source>In Proc. of WebDB'11, article 2</source>
          , ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kontchakov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolter</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Zakharyaschev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>The combined approach to query answering in DL-Lite</article-title>
          .
          <source>In Proc. of KR'10</source>
          . AAAI Press.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nenov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piro</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Olteanu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Parallel materialisation of Datalog programs in centralised, main-memory RDF systems</article-title>
          .
          <source>In Proc. of AAAI'14</source>
          , pp.
          <fpage>129</fpage>
          -
          <lpage>137</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nenov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaminski</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>PAGOdA: Pay-as-yougo ontology query answering using a datalog reasoner</article-title>
          .
          <source>J. Artif. Intell. Res.</source>
          ,
          <volume>54</volume>
          (
          <issue>1</issue>
          ):
          <fpage>309</fpage>
          -
          <lpage>367</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>