<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Preliminary Comparison of Forgetting Solutions Computed using SCAN, LETHE and FAME</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ruba Alassaf</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renate A. Schmidt</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computer Science, The University of Manchester</institution>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <fpage>21</fpage>
      <lpage>26</lpage>
      <abstract>
        <p>Forgetting, in description logic, is a non-standard reasoning technique. The aim of this technique is to eliminate concept and role symbols from a knowledge base, whilst preserving all logical consequences up to the remaining symbols. In this research, three forgetting tools were used to understand the relationship between the approaches on which they are based: SCAN, LETHE and FAME. The approach applied LETHE to description logic-based ontologies to investigate the solutions being computed. The solutions were then compared with those being computed using the other forgetting tools, SCAN and FAME.</p>
      </abstract>
      <kwd-group>
        <kwd>Forgetting</kwd>
        <kwd>Ontology</kwd>
        <kwd>Resolution</kwd>
        <kwd>Ackermann's Lemma</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In previous research, a number of practical approaches have been developed
to perform forgetting. Generally, these tools fall within two categories: those
which use resolution and others which use methods that exploit Ackermann’s
Lemma. The aim of this research was to compare the solutions being computed
between the different methods using real world ontologies. The three tools used
to represent the different approaches analyzed in this research are SCAN [
        <xref ref-type="bibr" rid="ref2 ref5">2, 5</xref>
        ],
LETHE [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], and FAME [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        The first tool, SCAN, computes forgetting solutions for knowledge-bases
expressed in first-order logic using a resolution-based approach, namely the SCAN
algorithm [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Since forgetting in first-order logic is not generally solvable, SCAN
is not guaranteed to find a solution. SCAN uses Skolemization to eliminate
existential quantifiers, and therefore Skolem terms may appear in SCAN’s solutions.
      </p>
      <p>
        Similarly, LETHE, the second tool, computes its forgetting solution for
description logic-based ontologies using a resolution-based method. However, it
has been proven that the approach used to develop LETHE will find forgetting
solutions for the description logics it can handle [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Furthermore, a point that
will be useful later is that the forgetting solutions computed using LETHE may
involve definer symbols, which are not part of the original vocabulary. This is
due to applying flattening techniques to the ontology and sometimes failing to
eliminate the extra symbols introduced during the computation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Copyright c 2017 by the paper's authors
      </p>
      <p>
        Finally, the last tool, FAME, computes its forgetting solutions for description
logic-based ontologies using a method based on Ackermann’s Lemma. Similar to
LETHE, the method is guaranteed to terminate. However, the method is not
forgetting complete and may fail to eliminate some of the symbols [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The
symbols it fails to eliminate are present in the result computed by the system.
      </p>
      <p>The difference between the tools lie in the expressivity of the logics being
supported, as well as the forgetting methodology being used. The aim of this
research was to apply one of the forgetting tools, LETHE, to description
logicbased ontologies to investigate the solutions computed, and compare them with
those computed using SCAN and FAME. The focus was on concept forgetting
in the description logic ALC.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Our Tool</title>
      <p>For the purposes of the comparison in this research, a tool has been developed
to investigate the solutions computed using LETHE, which were then compared
with those computed using SCAN and FAME. FAME and LETHE have the
advantage of being able to process large ontologies, hence two separate versions
of the system were created: one that compares LETHE and SCAN, and another
that compares FAME and LETHE using larger ontologies. An effort has been
made to maximize the size of the ontologies that SCAN can accept. However, the
considered ontologies were still not as large as desired. This is due to boundaries
in the static data structures of SCAN.</p>
      <p>
        Figure 1 and Figure 2 give an overview of the components of the system and
show how they are used. As can be seen in Figure 1, the tool parses an ontology
O expressed in description logic in the OWL syntax, using the OWL API, and
a set of concept symbols Σ, that is referred to as the forgetting vocabulary. A
description logic to first-order logic translator that uses the established
relationship between them [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] has been built. The tool uses the translator to translate
an ontology expressed in OWL syntax to an equivalent set of first-order
formulae expressed in Otter syntax. It is used to translate the input ontology into an
equivalent input for SCAN. After each of the two systems have computed its
solution, a first-order logic theorem prover called Prover91 was used to check
for entailments between the results. The translator was used again to translate
LETHE’s results to first-order logic as Prover9 uses the Otter syntax, too.
      </p>
      <p>The system shown in Figure 2 is another version of the one used in Figure 1,
except that the appropriate changes were made to use FAME instead of SCAN.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Results of Comparing SCAN and LETHE</title>
      <p>In this section, the results of the comparison made between the solutions
produced using SCAN and LETHE are presented. Following this, the results of the
comparison made between the solutions produced using LETHE and FAME are
presented in the next section.</p>
      <p>In both comparisons, a set of randomly selected concepts was forgotten. The
cardinality of the forgetting vocabulary varied depending on the number of
concepts present in the ontology. However, for each ontology, an effort has been
made to test and evaluate the forgetting solutions computed, as a result of
forgetting sets of concepts of different sizes, ranging from low to high. In ontologies
expressed in description logics that are more expressive than description logic
ALC, an ALC fragment of the ontologies were used. The tests where LETHE
1 https://www.cs.unm.edu/∼mccune/prover9/
produces results that contain definer symbols were excluded as they are not part
of the original vocabulary.</p>
      <p>After testing and evaluating fragments of different ontologies from the OBO
Foundry ontology repository2, the following results were observed.
1. The solutions computed using SCAN always entail the solutions computed
using LETHE.
2. The solutions computed using LETHE always entail the solutions computed
using SCAN if no Skolem terms occur in SCAN’s solutions. However, if
Skolem terms do occur in the solution, the solution computed using LETHE
entail the subset of clauses and formulas that do not contain Skolem terms.
3. In some cases, the subset of clauses and formulas in SCAN’s solution, that
do not contain Skolem terms, is sufficient to entail the forgetting solutions
computed using LETHE. The exact reason behind this occurrence raises
interesting new questions of research.
4. If SCAN and LETHE both terminate, the approach used to develop SCAN
outperforms the approach used to develop LETHE. However, it must be
noted that this is the average situation.
5. In some cases the two approaches agree on the difficulty of the forgetting
problem. However, in other cases, it was observed that LETHE found some
forgetting problems to be difficult, while SCAN found a solution relatively
quickly. It was speculated that the reverse occurs as well, but currently there
is not sufficient evidence to support it.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Results of Comparing LETHE and FAME</title>
      <p>Based on the testing and evaluation performed using fragments of ontologies
from the Oxford ontology repository3, the following results were observed:
1. If FAME successfully forgets all the symbols in the forgetting vocabulary
and produces a solution that is expressed in description logic ALC, rather
than a more expressive description logic such as ALCI, then the solutions
produced by FAME and LETHE entail one another.
2. If FAME successfully forgets all the symbols in the forgetting vocabulary,
but produces a solution in a more expressive description logic, namely ALCI,
then the solutions produced by FAME entail those produced by LETHE.
3. If FAME fails to forget a subset of the forgetting vocabulary and LETHE
produces a solution that does not contain any unwanted symbols such as
definer symbols, then the solutions produced by FAME entail those produced
by LETHE. This case captures the strength of resolution as it shows that
there exist cases where a forgetting problem can be solved using resolution
but not using an Ackermann-based approach. Consequently, this opens the
door to consider a hybrid technique that benefits from the speed of FAME
and the power of LETHE.
2 http://obofoundry.org
3 http://www.cs.ox.ac.uk/isg/ontologies/
4. One of the differences between FAME and LETHE is that LETHE
introduces definer symbols using standard flattening techniques to structurally
transform the ontology into what is called normal form, in order to simplify
its computations. We found that if LETHE introduces definer symbols and
fails to eliminate all of them, FAME fails to forget at least one symbol in
the ontology. Consequently, if FAME fails to forget at least one symbol and
LETHE fails to eliminate at least one definer symbol, then no entailments
can be inferred. This is obvious because each tool produces solutions that
contain symbols that are not present in the solutions of the other tool.
However, interestingly, one of the flattening techniques used to produce definer
symbols in LETHE is inspired by Ackermann’s Lemma; instead of using
the Lemma to eliminate a symbol, it is used to introduce one. Hence, this
raises the question: what is the relationship between the definer symbols
that LETHE failed to eliminate and the symbols FAME failed to forget?
We expect that these are cases of symbols with cyclic dependencies, which
remains to be checked.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Concluding Remarks</title>
      <p>Parts of the theory were confirmed and verified using a tool that was developed
for the purposes of this research. The tool aims to compare the solutions
computed using the three forgetting tools: SCAN, LETHE and FAME. It provided
insight into the relationship between the three forgetting tools in the absence of
a theoretical comparison.</p>
      <p>The possibilities of a hybrid system are currently being investigated in an
attempt to improve the performance while exploiting the power of resolution.
A naive implementation has shown interesting results, particularly, for one of
the forgetting problems. This problem took the resolution rules, implemented
in LETHE, more than 14 hours to compute a solution. On the other hand,
FAME, which exploits Ackermann’s Lemma, successfully forgets all the symbols
in the forgetting problem, except for one, in two seconds. The hybrid method
which first applies FAME and then LETHE to the result computed by FAME
successfully forgets all the symbols in the forgetting problem in three seconds.
It would be interesting to develop heuristics that select the approach to use for
each symbol in the forgetting vocabulary.</p>
      <p>Moreover, we are interested in investigating the relationships between the
tools and the underlying approaches more deeply. For example, we have not yet
found any counterexamples to the following statement: when FAME
successfully forgets all the symbols in the forgetting vocabulary, but produces results in
ALCI, we have that the solutions produced by LETHE entail a subset of those
produced by FAME, namely the subset of solutions that are expressed in
description logic ALC. Additionally, no cases where FAME successfully forgets all
the symbols and LETHE fails to eliminate a definer symbol were found during
this research. We would like to investigate if this can theoretically happen. For
concept forgetting in ALC, we expect that this cannot happen. Finally, we would
like to continue investigating the relationship between the approaches used in
the various systems but for more expressive description logics.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>F.</given-names>
            <surname>Baader</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-</surname>
          </string-name>
          Schneider, editors.
          <source>The Description Logic Handbook: Theory, Implementation and Applications</source>
          . University of Cambridge Press, Cambridge, UK,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>D. M. Gabbay</surname>
            ,
            <given-names>R. A.</given-names>
          </string-name>
          <string-name>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Szalas.</surname>
          </string-name>
          Second-Order Quantifier Elimination: Foundations,
          <source>Computational Aspects and Applications</source>
          , volume
          <volume>12</volume>
          <source>of Studies in Logic: Mathematical Logic and Foundations. College Publications</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>P.</given-names>
            <surname>Koopmann</surname>
          </string-name>
          .
          <article-title>Practical Uniform Interpolation for Expressive Description Logics</article-title>
          .
          <source>PhD thesis</source>
          , University of Manchester,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>P.</given-names>
            <surname>Koopmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          .
          <article-title>Implementation and evaluation of forgetting in ALC-ontologies</article-title>
          .
          <source>In Proceedings of the 7th International Workshop on Modular Ontologies co-located with the 12th International Conference on Logic Programming and Non-monotonic Reasoning (LPNMR</source>
          <year>2013</year>
          ), Corunna, Spain,
          <year>September 15</year>
          ,
          <year>2013</year>
          .,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Ohlbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Engel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Gabbay</surname>
          </string-name>
          . Scan:
          <article-title>Home page</article-title>
          . http://www.mettel-prover.org/scan/index.html. [Online; accessed 29-
          <fpage>October2017</fpage>
          ].
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          .
          <article-title>Concept forgetting in ALCOI-ontologies using an Ackermann approach</article-title>
          . In M. Arenas,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          , E. Simperl,
          <string-name>
            <given-names>M.</given-names>
            <surname>Strohmaier</surname>
          </string-name>
          , M. d'Aquin,
          <string-name>
            <given-names>K.</given-names>
            <surname>Srinivas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. T.</given-names>
            <surname>Groth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Heflin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Thirunarayan</surname>
          </string-name>
          , and S. Staab, editors,
          <source>The Semantic Web, 14th International Semantic Web Conference, ISWC</source>
          <year>2015</year>
          , volume
          <volume>9366</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>587</fpage>
          -
          <lpage>602</lpage>
          . Springer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>