<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>DL</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>On the Way to Diverse Datasets for Evaluating ABox Abduction Algorithms (Extended Abstract)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Janka Boborová</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jakub Kloc</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin Homola</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Júlia Pukancová</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Comenius University in Bratislava</institution>
          ,
          <addr-line>Mlynská dolina, 842 41 Bratislava</addr-line>
          ,
          <country country="SK">Slovakia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>38</volume>
      <fpage>3</fpage>
      <lpage>6</lpage>
      <abstract>
        <p>We propose a method for generating evaluation datasets for ABox abduction algorithms, using diverse real-world knowledge bases, logical consequences as observations to ensure meaningfulness, justifications to guarantee explanations exist, and ontology modules to constrain the search space. Abduction [1, 2] is a form of inference that explains an observation by identifying its possible causes (explanations). We focus specifically on ABox abduction, where both the observation and its explanations consist of ABox assertions. To enable meaningful comparison and evaluation of ABox abduction algorithms, a suitable dataset of ABox abduction problems is needed. Such a dataset should include multiple real-world knowledge bases, each with meaningful observations. For each observation, it should provide abducibles (a set representing the search space) of varying sizes. Additionally, the ABox abduction problems should vary in the number and length of explanations. Datasets in existing evaluations sufer from several limitations, including use of only one knowledge base [ 3]; artificial automatically generated observations [3, 4, 5]; observations that may have no explanations [4]; limited diversity in explanation length and number [3]; and weak constraints on the search space [6, 7, 3, 4, 5]. Building on prior approaches, we have begun constructing an evaluation dataset that addresses these shortcomings.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Description logics</kwd>
        <kwd>ABox abduction</kwd>
        <kwd>evaluation dataset</kwd>
        <kwd>test case</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Construction of a Robust Evaluation Dataset</title>
      <p>To generate meaningful, non-artificial observations and reduce the search space without losing
explanations, we propose two methods: one for generating observations (applicable to any knowledge base)
and another for generating abducibles (applicable to any ABox abduction problem). For simplicity, the
methods are defined for atomic concept assertions but extend easily to all atomic assertions and their
complements. Applying these methods to diverse real-world knowledge bases enables the construction
of a robust evaluation dataset.</p>
      <p>
        Real-World Knowledge Bases: Koopmann et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] proposed to use the 2015 OWL Reasoner
Competition Corpus1[8], providing 1,920 diverse real-world ontologies, as a suitable set of knowledge bases
for the evaluation of abduction algorithms. To focus only on relevant ontologies with the potential to
produce interesting problems, we defined the following requirements: consistency; consistency check
time ≤ 30 seconds; individual count ≥ 1. After applying these requirements, we obtained 865 ontologies
as candidate knowledge bases.
      </p>
      <p>Consequences as Observations: Although many knowledge bases are available, we are not aware of
real-world use cases with predefined observations; therefore, observations must be generated separately.
We aim to generate meaningful observations  by selecting logical consequences of a knowledge base
. To ensure explanatoriness ( ̸|= ), each  must be modified to no longer entail observation .
This is done by removing at least one assertion from each justification of , i.e., from a minimal set of
axioms responsible for the entailment of  [9]. Our approach is described in Algorithm 1.</p>
      <p>The core idea is to “corrupt”  by removing assertions that can later be recovered as explanations
through ABox abduction. As ABox abduction yields only ABox assertions, other axiom types cannot be
removed during the modification of .</p>
      <p>Algorithm 1 Generating ABox Abduction Problems
Input: knowledge base 
Output: a set of ABox abduction problems 
1:  ← {}
2:  ← {() |  ∈   ,  ∈ ,  |= (), () ∈/ }
3: for () in  do
4: (()) ← get the ABox parts of justifications for () using OWLExplanation
5: end for
6: for () in  do
7:  ← get the size of the largest justification in (())
8: for  in ⟨1, ⟩ do
190:: for ← in (()) do
11:  ← randomly select (, ||) assertions from 
12:  ←   ∖ 
13: end for
14:  ←   ∪ { = (, ())}
15: end for
16: end for
17: return 
◁ generate observations
◁ compute observation justifications
◁ generate ABox abduction problems
◁ modify</p>
      <p>To explore a wider range of possibilities, we generated multiple modified knowledge bases for each
observation by progressively removing more assertions, aiming to produce diferent explanations for
the same observation.</p>
      <p>Module-Based Abducibles: During evaluation, it is useful to examine how algorithms perform
with search spaces of varying sizes. However, reducing the search space requires a careful strategy to
preserve explanations.</p>
      <p>We propose using module extraction [10, 11], a technique that extracts a meaningful fragment of an
ontology while preserving all axioms relevant to the complete meaning of a given signature. This
technique can be used to generate module abducibles, i.e., assertions relevant to an observation , excluding
symbols unlikely to appear in explanations: Abdmodule = {() |  ∈  ,  ∈  from Σ()} .
Specifically, the ⊤-module, which includes all subclasses of the atomic classes in the signature of , as
explanations typically involve concepts subsumed by the concept in .</p>
      <p>To diferentiate within Abdmodule when generating abducibles of a given size, we prioritise assertions
involving individuals from , as they are more likely to appear in explanations.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Analysis of Generated Inputs</title>
      <p>Generating ABox Abduction Problems: The observation generation process applied to 865
knowledge bases resulted in 37,042 ABox abduction problems. The largest ABox part of a justification
contained 12 assertions, and the maximum number of justifications for an observation was 38 (all with
a single-assertion ABox part). Over 90% of observations had one justification with a single-element
ABox part.</p>
      <p>|ℰ | (length  : |{ℰ | ℰ ∈ ℰ , |ℰ| = }|)</p>
      <p>In theory, more justifications should lead to more explanations, and justifications with more assertions
should lead to longer explanations. In practice, justifications may contain complex assertions that
cannot be reconstructed by algorithms limited to atomic assertions and their complements. Additionally,
observations may have explanations beyond those found in the justifications. Therefore, given the large
number of generated problems, identifying those with interesting properties is challenging without
additional information.</p>
      <p>Generating Abducibles: To analyse abducible generation, we selected a sample of ABox abduction
problems (Table 1) by applying MergeXplain (MXP)2 [13] with Abddefault = {() |  ∈  ,  ∈
 from Σ( ∪ )} to a random subset of the generated problems. MergeXplain returns a set of
explanations, ℰ , containing all explanations of length 1 and, if present, at least one additional explanation of
a greater length. We selected the final sample based on the number and length of explanations. Notably,
only one problem (ont934_obs01) in the subset produced explanations of varying lengths, including
some longer than one.</p>
      <p>For each problem in the sample, we generated module abducibles, Abd, which on average
reduced the search space to 46%. Abd consistently included all explanations found by MergeXplain,
ensuring none were lost.</p>
      <p>To generate abducibles of varying sizes, three methods were applied: (a) module abducibles prioritising
assertions with individuals from the observation (our proposed approach), (b) module abducibles without
prioritisation, and (c) completely random selection. Each method was used to generate abducibles of
sizes 10, 25, 50, 100, 250, and 500, and was run three times per size to obtain averaged results. For each
method and size, we report the percentage of explanation assertions covered by the generated Abd
sets, relative to their size, computed as m|ienx(p|Al.bda|s,|seexrptli.onasssienrtAiobnds||) (e.g., a set of size 10 can cover at
most 10 explanation assertions, and the maximum possible coverage is bounded by the total number of
explanation assertions). The results (Table 2) were averaged over all problems and runs. Method (a)
was the most successful, consistently generating sets that covered the highest number of explanation
assertions across all sizes. At sizes 250 and 500, it achieved full coverage for all problems except
ont934_obs01, where it generated no more than two explanation assertions per Abd set. Still, even on
this problem, it outperformed the other methods, which on average generated none. Out of 108 runs,
the generated set contained no explanation assertions in 5 cases for method (a) (all for ont934_obs01),
57 for method (b), and 78 for method (c).
2MXP was run using CATS [12]: https://github.com/Comenius-Abduction-Team/CATS-Abduction-Solver</p>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion and Outlook</title>
      <p>The dataset generation process needs refinement, especially in producing ABox abduction problems.
To narrow down the generated problems and focus on the most relevant ones, we plan to analyse the
ABox parts of justifications. Since many observations produce only single-assertion explanations, we
aim to use observations composed of multiple assertions.</p>
      <p>In contrast, for generating abducibles, the module-based approach prioritising individuals from the
observation seems promising.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This research was sponsored by the Slovak Republic under the SRDA grant APVV-23-0292 (DyMAX)
and VEGA grant no. 1/0630/25 (eXSec), and it is also based on the work from COST Action CA23147
GOBLIN. J. Boborová was supported by Grant no. UK/1283/2025 awarded by Comenius University in
Bratislava.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used gpt-4o in order to: Text translation, Grammar
and spelling check, Improve writing style. After using these tools, the authors reviewed and edited the
content as needed and take full responsibility for the publication’s content.
[8] B. Parsia, N. Matentzoglu, R. S. Gonçalves, B. Glimm, A. Steigmiller, The OWL reasoner evaluation
(ORE) 2015 competition report, J. Autom. Reason. 59 (2017) 455–482. URL: https://doi.org/10.1007/
s10817-017-9406-8. doi:10.1007/S10817-017-9406-8.
[9] A. Kalyanpur, B. Parsia, M. Horridge, E. Sirin, Finding all justifications of OWL DL entailments,
in: The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web
Conference, ISWC 2007 + ASWC 2007, Busan, Korea, November 11-15, 2007, volume 4825 of
Lecture Notes in Computer Science, Springer, 2007, pp. 267–280. URL: https://doi.org/10.1007/
978-3-540-76298-0_20. doi:10.1007/978-3-540-76298-0\_20.
[10] C. D. Vescovo, D. Gessler, P. Klinov, B. Parsia, U. Sattler, T. Schneider, A. Winget, Decomposition and
modular structure of bioportal ontologies, in: The Semantic Web - ISWC 2011 - 10th International
Semantic Web Conference, Bonn, Germany, October 23-27, 2011, Proceedings, Part I, volume 7031
of Lecture Notes in Computer Science, Springer, 2011, pp. 130–145. URL: https://doi.org/10.1007/
978-3-642-25073-6_9.
[11] B. C. Grau, I. Horrocks, Y. Kazakov, U. Sattler, Just the right amount: Extracting modules from
ontologies, in: Proceedings of the 16th International Conference on World Wide Web, WWW
2007, Banf, Alberta, Canada, May 8-12, 2007, ACM, 2007, pp. 717–726. URL: https://doi.org/10.
1145/1242572.1242669. doi:10.1145/1242572.1242669.
[12] J. Kloc, J. Boborová, M. Homola, J. Pukancová, CATS solver: The rise of hybrid abduction
algorithms, in: Proceedings of the 38th International Workshop on Description Logics (DL 2025),
Opole, Poland, September 3–6, 2025, 2025. To appear.
[13] K. M. Shchekotykhin, D. Jannach, T. Schmitz, MergeXplain: Fast computation of multiple conflicts
for diagnosis, in: Proceedings of the Twenty-Fourth International Joint Conference on Artificial
Intelligence, IJCAI 2015, Buenos Aires, Argentina, July 25-31, 2015, AAAI Press, 2015, pp. 3221–3228.
URL: http://ijcai.org/Abstract/15/454.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Elsenbroich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kutz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Sattler</surname>
          </string-name>
          ,
          <article-title>A case for abductive reasoning over ontologies</article-title>
          ,
          <source>in: Proceedings of the OWLED*06 Workshop on OWL: Experiences and Directions</source>
          , Athens, GA, US, volume
          <volume>216</volume>
          <source>of CEUR-WS, CEUR-WS.org</source>
          ,
          <year>2006</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>216</volume>
          /submission_25.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Peirce</surname>
          </string-name>
          ,
          <article-title>Illustrations of the logic of science VI: Deduction, induction, and hypothesis</article-title>
          ,
          <source>Popular Science Monthly</source>
          <volume>13</volume>
          (
          <year>1878</year>
          )
          <fpage>470</fpage>
          -
          <lpage>482</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Homola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pukancová</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Balintová</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Boborová</surname>
          </string-name>
          ,
          <article-title>Hybrid MHS-MXP ABox abduction solver: First empirical results</article-title>
          ,
          <source>in: Proceedings of the 35th International Workshop on Description Logics (DL</source>
          <year>2022</year>
          )
          <article-title>co-located with Federated Logic Conference</article-title>
          (FLoC
          <year>2022</year>
          ), Haifa, Israel,
          <source>August 7th to 10th</source>
          ,
          <year>2022</year>
          , volume
          <volume>3263</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2022</year>
          . URL: https: //ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3263</volume>
          /paper-13.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Koopmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Del-Pinto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tourret</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <article-title>Signature-based abduction for expressive description logics</article-title>
          ,
          <source>in: Proceedings of the 17th International Conference on Principles of Knowledge Representation and Reasoning</source>
          , KR 2020, Rhodes, Greece,
          <year>2020</year>
          , pp.
          <fpage>592</fpage>
          -
          <lpage>602</lpage>
          . URL: https://doi.org/ 10.24963/kr.2020/59. doi:
          <volume>10</volume>
          .24963/KR.
          <year>2020</year>
          /59.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <article-title>A tractable approach to ABox abduction over description logic ontologies</article-title>
          ,
          <source>in: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, July 27 -31</source>
          ,
          <year>2014</year>
          ,
          <string-name>
            <given-names>Québec</given-names>
            <surname>City</surname>
          </string-name>
          , Québec, Canada, AAAI Press,
          <year>2014</year>
          , pp.
          <fpage>1034</fpage>
          -
          <lpage>1040</lpage>
          . URL: https://doi.org/10.1609/aaai. v28i1.8852. doi:
          <volume>10</volume>
          .1609/AAAI.V28I1.8852.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pukancová</surname>
          </string-name>
          , M. Homola,
          <article-title>ABox abduction for description logics: The case of multiple observations</article-title>
          ,
          <source>in: Proceedings of the 31st International Workshop on Description Logics</source>
          , Tempe, Arizona,
          <string-name>
            <surname>US</surname>
          </string-name>
          , volume
          <volume>2211</volume>
          <source>of CEUR-WS, CEUR-WS.org</source>
          ,
          <year>2018</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2211</volume>
          /paper-31.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mrózek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pukancová</surname>
          </string-name>
          , M. Homola,
          <article-title>ABox abduction solver exploiting multiple DL reasoners</article-title>
          ,
          <source>in: Proceedings of the 31st International Workshop on Description Logics co-located with 16th International Conference on Principles of Knowledge Representation and Reasoning (KR</source>
          <year>2018</year>
          ), Tempe, Arizona,
          <string-name>
            <surname>US</surname>
          </string-name>
          , October 27th - to - 29th,
          <year>2018</year>
          , volume
          <volume>2211</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2018</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2211</volume>
          /paper-24.pdf.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>