<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OWL Ontology Optimization using Constrained Multi-ob jective Genetic Algorithm for Reducing Load of Inference Enabled SPARQL Query Execution</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Naoki Yamada</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Naoki Fukuta</string-name>
          <email>fukuta@inf.shizuoka.ac.jp</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Informatics, Graduate School of Integrated Science and Technology, Shizuoka University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics, Shizuoka University</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>On an inference-enabled Linked Open Data(LOD) endpoint, usually a query execution takes longer than on an LOD endpoint without inference engine due to its processing of reasoning. In optimizing an OWL ontology for executing SPARQL queries on an LOD endpoint with inference engine, there are a number of SPARQL queries that we would probably take into account and many of these factors possibly work against each other. In this paper, for reducing query execution time on an inference-enabled LOD endpoint, we propose an idea and its implementation to employ a constrained multi-objective evolutionary approach to make modi cation of ontologies based on the past-processed queries and their results. We employ a multi-objective evolutionary approach to realize better promoting and preserving diversity within the population. We also employ constraint handling to dealing with invalid solution such as inconsistent ontology. We show how the approach works well on implementing an inference-enabled LOD endpoint by a cluster of SPARQL endpoints.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic Web SPARQL Inference</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Linked Open Data (LOD) are retrieved by a query written in the standard query
language called SPARQL3 for the retrieval of RDF data stored in an endpoint.
Reasoning on LODs allows such queries to obtain implicitly stated knowledge or
data from the given explicit relations. These techniques are important to make
intelligent agents accessible to data on its external world. Techniques to utilize
reasoning capability based on ontology have been developed to overcome several
issues, such as higher complexity in worst case[1, 4{7]. We have presented an
approach to cut off the queries which put unusual heavy loads to the reasoning
3 http://www.w3.org/TR/rdf-sparql-query/
engine [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] as well as modifying the queries and ontologies to approximate such a
heavy load queries [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ]. However, due to the complexity of queries and strong
dependency on the features of the queries, it is not easy to implement `one-
tsall' solutions for all possible queries on the endpoint. In this paper, to overcome
this issue, we propose an idea and its implementation to employ a constrained
multi-objective evolutionary approach to make modi cation of ontologies based
on the past-processed queries to realize better promoting and preserving diversity
within the population.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>GA-Based Ontology Modi cation</title>
      <p>
        We employ GA-based modi cation approach to approximate an ontology for an
inference-enabled LOD endpoint [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In this paper, we use the query execution
result to calculate the tness value. It is time-consuming to relaunch an endpoint
every query execution for shelter a result of query execution from the effect of
other query execution cache. On the implementation of our GA, the number of
times of relaunching an endpoint and the number of times of query execution
will both be increased in proportion to a population size and the number of
generations. Therefore, nding the optimal solution of an ontology modi cation
method in a small population size and a small number of generations is important
for reducing the time cost of GA.
      </p>
      <p>Expressing an OWL ontology itself as a gene is a simple way to applying
genetic algorithm to an OWL ontology modi cation. For example, assuming that
we express ontology modi cation with 0 and one, if the value corresponding to an
axiom is 0, delete its axiom from the OWL ontology, if the value corresponding
to an axiom is 1, do nothing to the OWL ontology. It is difficult to apply the
solution obtained by this method to other ontologies.</p>
      <p>In our genetic algorithm, we express a modi cation rule for an OWL ontology
as a gene. To nd better solutions with an earlier generation than the result of
applying a normal genetic algorithm, we use constraint handling and
multiobjective optimization with genetic algorithm.</p>
      <p>
        In the multi-objective algorithm that work under the principle of domination,
tness values of the n-objective functions are expressed as n-dimensional vector
a. The domination between two individuals A ⪰ B can be de ned as equation (1)
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In multi-objective evolutionary algorithms that work under the domination
principle, it aims to calculate the domination and obtain a set of non-dominated
solutions [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>A ⪰ B ,ai</p>
      <p>bi(8i 2 f1;
ai &gt; bi(9i 2 f1;
; ng) and
; ng)
(1)
3</p>
    </sec>
    <sec id="sec-3">
      <title>Discussion</title>
      <p>
        We set up a SPARQL endpoint using Joseki (v3.4.4) in conjunction with
serverside Pellet [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] reasoner to enable OWL-level inference capability on the endpoint.
      </p>
      <p>OWL Ontology Optimization using Constrained MOGA
We used the dataset used in the conference track on Ontology Alignment
Evaluation Initiative 2013 (OAEI 2013)4. Here we used Linklings ontology from the
OAEI dataset in the preliminary experiment.</p>
      <p>Fitness values for individuals are set to equation (2). Here, Vo is a tness
value of modi ed ontology o. Fo is a F-measure of the precision and recall on
their output results over the modi ed ontology o. TO is a query execution time
of the original ontology O. To is a query execution time of the modi ed ontology
o. In this experiment, we use = 0:7 and = 0:3, respectively</p>
      <p>TO</p>
      <p>T o</p>
      <p>TO
Vo =</p>
      <p>
        Fo +
such that
+
= 1
(2)
4 http://oaei.ontologymatching.org/2013/conference/index.html
ment of approximation performance, applying techniques often used in
manyobjective evolutionary algorithms such as the NSGA-III [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is one of our future
work.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgment</title>
      <p>The work was partly supported by the JST CREST JPMJCR15E1.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baumgartner</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Furbach</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <article-title>Niemela, I.: Hyper Tableaux</article-title>
          . In: Alferes,
          <string-name>
            <given-names>J.J.</given-names>
            ,
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.M.</given-names>
            ,
            <surname>Orlowska</surname>
          </string-name>
          , E. (eds.)
          <article-title>Logics in Arti cial Intelligence</article-title>
          .
          <source>LNCS</source>
          , vol.
          <volume>1126</volume>
          , pp.
          <volume>1</volume>
          {
          <fpage>17</fpage>
          . Springer-Verlag (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Deb</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, part i: Solving problems with box constraints</article-title>
          .
          <source>IEEE Transactions on Evolutionary Computation</source>
          <volume>18</volume>
          (
          <issue>4</issue>
          ),
          <volume>577</volume>
          {601 (Aug
          <year>2014</year>
          ). https://doi.org/10.1109/TEVC.
          <year>2013</year>
          .2281535
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Eiben</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          : Introduction to Evolutionary Computing, pp.
          <volume>195</volume>
          {
          <fpage>213</fpage>
          . Springer Publishing Company, Incorporated, 2nd edn. (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Goncalves</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Performance Heterogeneity and Approximate Reasoning in Description Logic Ontologies</article-title>
          . In: Cudre-Mauroux,
          <string-name>
            <given-names>P.</given-names>
            , et al. (eds.) The Semantic Web{ISWC 2012
            <surname>Part</surname>
          </string-name>
          <string-name>
            <surname>I. LNCS</surname>
          </string-name>
          , vol.
          <volume>7649</volume>
          , pp.
          <volume>82</volume>
          {
          <fpage>98</fpage>
          .
          <string-name>
            <surname>SpringerVerlag</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>Y.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krishnaswamy</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Predicting Reasoning Performance Using Ontology Metrics</article-title>
          . In: Cudre-Mauroux,
          <string-name>
            <given-names>P.</given-names>
            , et al. (eds.) The Semantic Web{ISWC 2012
            <surname>Part</surname>
          </string-name>
          <string-name>
            <surname>I. LNCS</surname>
          </string-name>
          , vol.
          <volume>7649</volume>
          , pp.
          <volume>198</volume>
          {
          <fpage>214</fpage>
          . Springer-Verlag (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shearer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Hypertableau Reasoning for Description Logics</article-title>
          .
          <source>Journal of Arti cial Intelligence Research</source>
          <volume>36</volume>
          ,
          <volume>165</volume>
          {
          <fpage>228</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Romero</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
          </string-name>
          , I.:
          <article-title>MORe: Modular Combination of OWL Reasoners for Ontology Classi cation</article-title>
          . In: Cudre-Mauroux,
          <string-name>
            <given-names>P.</given-names>
            , et al. (eds.) The Semantic Web{ISWC 2012
            <surname>Part</surname>
          </string-name>
          <string-name>
            <surname>I. LNCS</surname>
          </string-name>
          , vol.
          <volume>7649</volume>
          , pp.
          <volume>1</volume>
          {
          <fpage>16</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Pellet: A practical owl-dl reasoner</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <volume>51</volume>
          {
          <fpage>53</fpage>
          (
          <year>2007</year>
          ). https://doi.org/https://doi.org/10.1016/j.websem.
          <year>2007</year>
          .
          <volume>03</volume>
          .004, http://www.sciencedirect.com/science/article/pii/S1570826807000169, software Engineering and the Semantic Web
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Yamada</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamagata</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fukuta</surname>
          </string-name>
          , N.:
          <article-title>Query rewriting or ontology modi cation? considering reasoning capability on lod endpoints</article-title>
          .
          <source>In: IEEE International Conference on Agents (IEEE ICA</source>
          <year>2016</year>
          ). pp.
          <volume>19</volume>
          {
          <issue>24</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Yamada</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamagata</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fukuta</surname>
          </string-name>
          , N.:
          <article-title>Query rewriting or ontology modi cation? toward a faster approximate reasoning on lod endpoints</article-title>
          .
          <source>IEICE Transactions on Information and Systems E100.D(12)</source>
          ,
          <volume>2923</volume>
          {
          <fpage>2930</fpage>
          (
          <year>2017</year>
          ). https://doi.org/10.1587/transinf.2016AGP0010
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Yamagata</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fukuta</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          :
          <article-title>An Approach to Dynamic Query Classi cation and Approximation on an Inference-enabled SPARQL Endpoint</article-title>
          .
          <source>Journal of Information Processing</source>
          <volume>23</volume>
          (
          <issue>6</issue>
          ),
          <volume>759</volume>
          {
          <fpage>766</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>