<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>In the Hand of the Beholder: Comparing Interactive Proof Visualizations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christian Alrabbaa</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Borgwardt</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nina Knieriemen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alisa Kovtunova</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anna Milena Rothermel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Frederik Wiehr</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>German Research Center for Artificial Intelligence (DFKI)</institution>
          ,
          <addr-line>Saarland Informatics Campus, Saarbrücken</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Theoretical Computer Science</institution>
          ,
          <addr-line>TU Dresden</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Although logical inferences are interpretable, actually explaining them to a user is still a challenging task. While sometimes it may be enough to point out the axioms from the ontology that lead to the consequence of interest, more complex inferences require proofs with intermediate steps that the user can follow. Our main hypothesis is that different users need different representations of proofs for optimal understanding. To this end, we undertook some user experiments related to logical proofs. We explored how a user's cognitive abilities influence the performance in and preference for certain proof representations. In particular, we compared tree-shaped representations with linear, textbased ones, and for each we offered an interactive and a static version. After each proof, participants had to solve some tasks measuring their level of understanding and rated each proof according to the perceived comprehensibility. At the end of the questionnaire, subjects ranked the proofs by comprehensibility. We found no differences between the general performance or the subjective ratings of the proof representations. However, in the final ranking participants preferred the conditions with tree-shaped proofs over the textual ones, with significant differences in the rankings of the higher cognitive ability group and across both groups but not in the low cognitive ability group.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Research on explaining description logic inferences has mostly focused on
computing justifications [
        <xref ref-type="bibr" rid="ref21 ref31 ref9">9, 21, 31</xref>
        ], i.e., minimal sets of axioms from which a consequence
follows. While justifications are already very helpful for designing or debugging
an ontology, depending on the complexity of the inference and the expertise
of the user, more detailed proofs of the consequence are needed. Following a
line of research on the understandability of description logic inferences and
proofs [
        <xref ref-type="bibr" rid="ref10 ref17 ref2 ref23 ref26 ref3 ref30 ref33 ref4">2–4, 10, 17, 23, 26, 30, 33</xref>
        ], in this paper we investigate the usefulness of
Copyright © 2021 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
different proof representations. We compare proofs in a traditional tree shape
with a linearized textual representation of proofs. For both variants, we provide
an interactive as well as a static version.
      </p>
      <p>
        Logical (abstract) reasoning is the ability to solve novel problems without
taskspecific knowledge and a core mechanism of human learning [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Logical reasoning
is closely related to fundamental cognitive functions. Generally, according to [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ],
there are two types of intelligence. Fluid intelligence involves processing new
information and solving novel types of problems, as opposed to those that require
crystallized intelligence, which entails the application of consolidated knowledge
typically acquired in academic settings. In this terminology, experience with
logic can be classified as an example of crystallized intelligence. And since it is
typically neither trained nor taught, logical ability is often assessed as part of
fluid intelligence tests [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        Our main goal in this paper is to find out whether a user’s logical ability
influences which of these four proof representations are preferred and lead to better
performance. A study from last year [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] attempted a similar comparison, but it
did not find significant differences based on the user’s self-reported experience
with logic. In this paper, we attempt to measure the user’s logical ability level
more directly and investigate the impact on performance and preferences of
different proof representations. Moreover, we were able to increase the number of
participants by creating a fully online survey and making it available at a study
participant recruitment platform.
      </p>
      <p>
        In a first experiment, we verified that the score on the standardized
International Cognitive Ability Resource (ICAR) test1 strongly correlates with the
ability to draw logical inferences and understand logical proofs. Based on this
insight, we used the ICAR in our second experiment to measure the user’s ability
levels and compare the different proof representations. To open our experiments
to a larger population, we decided not to use the formal DL syntax, but rather
hand-crafted textual representations of axioms, similar to the ones used in [
        <xref ref-type="bibr" rid="ref1 ref34">1, 34</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        There are various fluid intelligence tests assessing logical ability, e.g. The Stanford
Binet Intelligence Scale, Fifth Edition (SB:V) [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], Wechsler Adult Intelligence
Scale–Fourth Edition (WAIS-IV) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], and nonverbal Raven’s Progressive
Matrices [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. We refer the reader to [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ] for a comprehensive survey. These tests provide
reliable, and well-validated abstract reasoning items [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and are extensively used
in clinical, educational, occupational and research settings [
        <xref ref-type="bibr" rid="ref16 ref37">16, 37</xref>
        ]. However,
these tasks are not free-to-use, and copyright often prevents these pen-and-paper
tasks from being adapted into computerized tasks. Notable exceptions here are
the freely available web-based Hagen Matrices Test [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and the public-domain
International Cognitive Ability Resource (ICAR) [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ].
      </p>
      <p>In parallel, several approaches for converting description logic axioms and
proofs into textual representations have been developed and evaluated [1, 6, 29, 33,</p>
      <sec id="sec-2-1">
        <title>1 https://icar-project.com/projects/icar-project/wiki</title>
        <p>A v ⊥</p>
        <p>O = { A v ∃r.&gt;,</p>
        <p>
          C u B v ⊥,
A v ∀r.(B u C) }
34]. For example, generation of verbalized explanations for non-trivial derivations
in a real world domain was tested on computer scientists in [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ]. The authors
distinguish short and long textual explanations, but the participants’ opinions on
conciseness turned out to be mixed and not too strong. In the recent work [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ],
it has been shown that proofs should not contain too many trivial steps, i.e. be
relatively small, not only for a textual but also for a tree-shape representation.
        </p>
        <p>
          Concerning the representation of logical statements, we take the following
studies into the account. First, in [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] it has been confirmed that statements
in a controlled natural language are understood significantly better than the
Manchester OWL Syntax, where DL axioms are expressed by sentences with
the words like “SubTypeOf”, “DisjointWith”, “HasDomain”, etc. Second, the
experiment [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] has shown that the Manchester Syntax is not more effective
than the formal DL syntax. Therefore, similarly to the approaches [
          <xref ref-type="bibr" rid="ref29 ref34">29, 34</xref>
          ] and
in contrast to [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], in our experiment we do not use the formal DL notation,
since that would require readers to learn the syntax of ontology languages or
description logics: instead we use patterns to convert DL sentences into
naturallanguage explanations (see Section 3). Further, we arrange these sentences in
a tree-shaped representation, similarly to proofs based on consequence-based
reasoning procedures [
          <xref ref-type="bibr" rid="ref27 ref36">27,36</xref>
          ], and we order them in a linear sequence using English
connectives, e.g. as produced by various verbalization techniques [
          <xref ref-type="bibr" rid="ref29 ref30 ref34 ref6">6, 29, 30, 34</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Proofs</title>
      <p>
        We assume a basic familiarity with DLs, in particular ALCQ [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Let O be an
ontology and α a consequence of O (O |= α). The next step is to compute
justifications, i.e., minimal subsets J ⊆ O such that J |= α, which already
point out the axioms from O that are responsible for α. However, actually
understanding why α follows may require a more detailed proof. Informally, a
proof is a tree consisting of inference steps α1...αn , where each step is sound, i.e.,
α
{α1, . . . αn} |= α holds (see Figure 1). Our proofs conform to the framework [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ].
Often, such a proof is built from the inference rules of an appropriate calculus
[
        <xref ref-type="bibr" rid="ref36 ref7">7, 36</xref>
        ]. However, there also exist approaches to generate DL proofs that start
with a justification, and extend it with intermediate axioms (lemmas) using
heuristics [
        <xref ref-type="bibr" rid="ref21 ref22">21, 22</xref>
        ], concept interpolation [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ], or forgetting [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        It is important that proofs are neither too detailed nor too short. In fact, a
justification can itself be seen as a one-step proof of an unintended entailment α,
but if each element of the justifications seems reasonable to the user, then it
can be hard to track down the precise interaction between these axioms that
causes the problem. Recall that axioms may still not behave as the user expects,
e.g. “every A has only rs” does not imply “every A has an r”. On the other
hand, by assumptions also made in [
        <xref ref-type="bibr" rid="ref10 ref34">10, 34</xref>
        ], too many small proof steps can also
be detrimental for understanding, because they are distracting. For example, a
reasoner may add the step CBuuBCvv⊥⊥ to Figure 1 to make the two conjunctions
match syntactically. However, for a user this only adds clutter to the proof,
because the correspondence between “B and C” and “C and B” is intuitively
clear.
      </p>
      <p>A textual representation of a proof is necessarily a linearization, where the
inference steps are explained in a sequence, for example in a top-down left-right
order. A text corresponding to the formal proof in Figure 1 could be the following:
Since every A has an r and every A has only rs that are Bs and Cs, every
A has an r which is a B and a C. Since every A has an r which is a B and
a C and there is no object which is a C and a B at the same time, there
is no object in A.</p>
      <p>Other aspects in which a text differs from a proof tree are that conjunctions (e.g.
“since”, “and”) are used to illustrate proof steps and that statements may be
repeated if they are reused later. For our second experiment, we used a flexible
visualization of trees in which arrows are used instead of horizontal lines (see
Figure 2 on page 9).</p>
      <p>
        As described in the introduction, we do not use the formal DL syntax for
the experiments, i.e., also in the proof tree representation, A v ∃r.&gt; would be
shown as “Every A has an r.” Moreover, we use nonsense names that vaguely
look and sound English to enable more natural-sounding sentences, e.g. “Every
woal is munted only with luxis that are kakes” instead of “Every A has only rs
that are Bs and Cs” (A v ∀r.(B u C)). We already faced a problem concerning
prior knowledge about the example domains in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Thus, in this study we opted
for non-existing domains.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Connecting Logical Abilities and Proof Understanding</title>
      <p>We first conducted an experiment that shows a connection between participants’
understanding of logical proofs and their general cognitive abilities. A printable
version of the survey is available online.2
4.1</p>
      <p>Description of the experiment
Introduction and Goals. In our main experiment (Section 5), we want to
distinguish user preferences based on their baseline level of ability to solve logical
reasoning tasks. To this end, we want to employ a standardized measure that
allows us to predict the performance on such tasks. Here, we first tested whether
the ICAR16 questionnaire can be used for this purpose.</p>
      <sec id="sec-4-1">
        <title>2 https://cloud.perspicuous-computing.science/s/oHp9pRaoCx5SDsF</title>
        <p>Design. We used LimeSurvey3 for hosting our fully online survey. Since we did
not pre-screen (i.e., impose eligibility criteria on) our participants for experience
with logic or proofs, we inserted a short introduction explaining the structure of
proof trees. In order to exclude the effect of tiredness, the order of the ICAR16
questions and the proof tasks was randomized.</p>
        <p>Participants. The sample consisted of 101 participants (45 female, 56 male)
with a mean age of M = 24.52 (SD = 6.81). The age range spread between 18 and
48 years. Participants were recruited using Prolific4 and were paid 6€ for their
participation. Apart from being at least 18 years old, there were no exclusion
criteria. 28 participants who did not complete the survey were excluded (and not
paid).</p>
        <sec id="sec-4-1-1">
          <title>Materials.</title>
          <p>
            ICAR16. To assess the participants’ cognitive abilities, the abbreviated form
of the International Cognitive Ability Resource (ICAR16) [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] was applied.1
It consists of 16 questions equally distributed over four different types: matrix
reasoning, letter and number series, verbal reasoning, and 3-dimensional rotation.
The eight answer options were displayed in a single-choice format. In the end,
a mean score was calculated by coding correct answers with 1 and incorrect
answers with 0. Thus, the maximum score was 1, while the minimal score was 0.
The internal consistency of the ICAR16 questionnaire is α = .81 [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ].
Logical reasoning. To test the performance with logical reasoning, participants
had to solve two tasks. The first described a set of axioms (in natural language)
and they should decide which of the given statements follow from the axioms.
Each of the statements could be marked as “follows”, “does not follow” or “I
do not know”. In the second task, they were given a proof in tree shape (like
in Figure 2) that contained a blank node, and they were asked which of some
given statements would be valid labels for the node in the context of the proof
(“yes”, “no”, “I do not know”). The mean score of the performance in both tasks
was calculated from the number of correct answers. The highest possible score
was 24.
          </p>
          <p>Further Information. As further information, demographic data were collected
(age, gender) as well as experience with propositional logic, from 1 (“no knowledge
at all/no experience”) to 5 (“expert/a lot of experience”). After each proof task,
the participants were asked to rate its difficulty on a scale from 1 (“very easy”)
to 5 (“very difficult”).</p>
          <p>Hypothesis. The ICAR16 score is an independent variable that, according to
our hypothesis, predicts the performance in the logical proofs.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>3 https://www.limesurvey.org/</title>
      </sec>
      <sec id="sec-4-3">
        <title>4 https://www.prolific.co/</title>
        <p>Results</p>
        <p>Descriptive Results. The mean of the ICAR16 scores was M = .55 (SD =
.24) with the participants’ performance being spread in a normal distribution.
The maximal achieved score was 1, the minimum was 0. The mean of the score
for both logical reasoning tasks was M = 15.99 (SD = 3.3), with the maximum
score being 23 and the minimum 6. The performance in these tasks was also
normally distributed across the participants. Moreover, the difficulty of the first
task was rated with a mean of M = 3.64 (SD = .89). The difficulty of the second
task was rated as M = 3.55 (SD = 1.14).</p>
        <p>
          Regression analysis. A multiple regression analysis was carried out using the
performance in the logical reasoning tasks as the dependent and the ICAR16
performance as the independent variable. The ICAR16 score significantly predicted
the performance in the logical tasks (F (1, 99) = 43.15, p &lt; .001). The ICAR16
explained 30% of the variation in the score of the logical tasks (R2 = .3, p &lt;
.001), which can be interpreted as large effect size/high explained variance [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Logical Abilities and Proof Representation Preferences</title>
      <p>Given that ICAR16 scores are highly correlated with performance on logical
reasoning tasks, we used it in our main experiment to distinguish participants by
their logical ability level. A printable version of the survey is available online.5
5.1</p>
      <p>Description of the experiment
Introduction and Goals. With this experiment, we attempt to find out which
proof representation is most understandable for different users. The goal is to find
a difference in the (subjective) preferences and (objective) performance on each
proof representation, depending on the user’s level of logical reasoning ability.
Conditions. We used two different conditions (factors) with two levels each. One
condition was the representational form of the proof, which was either tree-shaped
or (linear) textual. The other condition was the interactivity of the proof
representation, which was either static or interactive. Thus, there were the four following
condition combinations: (ir) interactive tree, (sr) static tree, (ix) interactive text,
and (sx) static text (see Table 1). We used a 2 × 2 within-subjects design, which
means that each participant saw all four combinations. The independent variable
in the main study is the ICAR16 score. Objective (the number of correct answers)
and subjective (comprehensibility rating) performances on proofs as well as proof
rankings are dependent variables. We conducted the experiment in English via
LimeSurvey, with embedded links to the interactive proof representations that
were hosted on our own server.</p>
      <sec id="sec-5-1">
        <title>5 https://cloud.perspicuous-computing.science/s/dCSmbraoJ4RzDqG</title>
        <p>Design. The survey was again implemented using LimeSurvey. As in the first
experiment, the order of the ICAR16 and the proof question groups was
randomized. Moreover, each participant was randomly assigned to one of the four groups
in Table 1. Before the proof tasks, we added a short explanation of the proof
representation and a small training example on which the participants could
explore both interactive formats.</p>
        <p>Participants. The final sample consisted of 173 participants (41% female, 59%
male) with a mean age of M = 24.8 (SD = 8.21) and an age range from 18 to 65
years. The mean of the participants’ experience with propositional logic was M =
1.76 (SD = 1). Furthermore, 60.7% of the participants indicated that they never
worked with propositional logic. The participants were recruited using Prolific4
and were paid 10.20€ for their participation. Apart from being at least 18 years
old, there were no exclusion criteria. 83 participants who did not complete the
survey were excluded also by the platform. Due to technical errors, the proofs
were not displayed for 3 participants, which were also excluded. Additionally,
four attention checks were implemented in the study. 13 participants with more
than 2 incorrectly answered attention checks were excluded as well.</p>
        <sec id="sec-5-1-1">
          <title>Material.</title>
          <p>ICAR16. Again, we used the ICAR16 questionnaire to assess the participants’
cognitive abilities (see page 5).</p>
          <p>Proofs. We developed four artificial proofs of roughly the same difficulty level.
The statements of each proof were given in textual form (also for the tree-shaped
representations), with concept and role names replaced by nonsense words (see
Section 3). For each of the four proofs, the proofs in both representation formats
were created manually, not automatically. For the linear textual format, the
individual statements were connected by additional words and statements were
repeated if they were last mentioned more than 2 lines above. In Figures 2 and 2,
we depict the textual and tree-shaped representations of Proof 1. Green color
indicates the assumptions, intermediate deductions are marked in blue, and the
final conclusion is colored orange.</p>
          <p>
            The interactive version of the tree representation6 started with only the
final conclusion visible, and participants could interact with each node by using
three buttons. A button labeled “&lt;” revealed the immediate predecessors of
the current node in the proof. The second button labeled “↑” could reveal the
whole subtree above the current node, and the last button labeled “↓” allowed
to collapse the subtree back to a single node. The interactive text7 worked
in a different way. At the beginning, participants saw only the first sentence,
i.e., the first assumption. Using two buttons labeled “&gt;” and “&lt;” they could
reveal the next sentence or hide the most recently revealed sentence. Additionally,
clicking on a sentence highlighted the premises that were used to infer the selected
statement (corresponding to the predecessors of a node in the tree representation).
Moreover, both interactive representations could be freely zoomed and panned.
The interactive proofs were provided by a prototypical web application for
explaining DL entailments called Evonne [
            <xref ref-type="bibr" rid="ref18 ref5">5, 18</xref>
            ]. For this study, Evonne was
adapted to allow for (linear) interactive textual representation of proofs. In
addition, the modes of interaction in the tree representation were kept relatively
basic on purpose. This was done especially to avoid overwhelming participants
who had little experience with logic and proofs.
          </p>
          <p>For each proof, there were three pages of questions. For the interactive
conditions, the proofs on the first page were collapsed (to the root node or the
first sentence), but on subsequent pages they started fully expanded since they
had already been explored. Each question page contained a single question with 6
answer options (plus “none of these” and “I don’t know”). Questions were of the</p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>6 https://lat.inf.tu-dresden.de/evonne/proof1</title>
      </sec>
      <sec id="sec-5-3">
        <title>7 https://lat.inf.tu-dresden.de/evonne/textProof1</title>
        <p>form “Which of the following would be a correct replacement for the deduction
‘XYZ’ in the proof?” or “Which parts of the following summary/reformulation of
the proof are incorrect?” The former questions require to choose a statement,
which is not necessary equivalent to a given axiom but which aligns smoothly to
the rest of the proof. In the end, a score was calculated based on the number of
correct answers. Thus, the highest possible score for a user performance was 12.
Further Information. As further information, demographic data were collected
(age, gender, nationality), as well as experience with propositional logic, from
1 (“no knowledge at all/no experience”) to 5 (“expert/a lot of experience”).
Participants also indicated how frequently they use propositional logic with 1
being “Never” and 5 being “All the time”. After each proof, the participants were
asked to rate its comprehensibility on a scale from 1 (“not at all”) to 5 (“very
much”). At the end of the survey, the participants were asked to additionally
rank all the proofs they had seen according to their relative comprehensibility.
Hypotheses. We stated two hypotheses concerning the preferences and
performance differences between the proof representations.</p>
        <p>Hypothesis 1 : It is easier to understand interactive proofs than static proofs.
This will be shown by an increase in performance and by a higher
comprehensibility rating for the interactive conditions.</p>
        <p>Hypothesis 2 : The relative level of comprehensibility of a tree-shaped vs.
textual proof depends on the cognitive abilities. This will be shown by a difference
in performance and difficulty rating between the condition combinations and in
the final comprehensibility ranking, in dependence of the ICAR16 scores.
5.2</p>
        <p>
          Results
For the analyses, IBM SPSS Statistics (Version 26) predictive analytics software
for Windows [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] was used. A p-value threshold of 0.05 was used for the entire
analyses. After the assumptions were considered as tenable, a regression analysis
was carried out, to confirm the results of the pre-study. Again, the predictive
effect of the ICAR16 on the performance in the proofs was significant, F (1, 171)
= 24.8, p &lt; .001. With an R2 = .13 (corrected R2 = .12), the model shows a
moderate explained variance (Cohen, 1988).
        </p>
        <p>A median split (mdn = .44) was carried out to divide the participants into
those who achieved high scores in the ICAR16 and thus presumably also have
higher cognitive abilities and those who scored lower.</p>
        <p>For ICAR16 the mean was M = .46, while it was M = 2.36 for the proof
performance. The group containing those participants who scored low in the
ICAR16 achieved M = 1.9 across all proofs. In contrast, the group of participants
with high ICAR16 scores showed an overall proof performance of M = 2.87.
Performance and Comprehensibility Ratings. To compare the
performance in the proof tasks and the subjective comprehensibility ratings after
n Most
interactive text
interactive tree
15
15
static text
30
30
45
45
60
60
0
0
15
15
static tree
30
30
45
45
60
60
each proof, we ran a multivariate analysis of variance (MANOVA). All the
assumptions were considered as tenable. We found no significant overall
difference between the conditions across the two ICAR groups, Pillai’s Trace = .01,
F (6, 1376) = 1.41, p = .206. Also when looking at the groups separately, we
could not find any significant differences between the representations, neither in
the low-ICAR group (Pillai’s Trace =1.03, F (6, 712) = 1.90, p = .078) nor in the
group with high scores (Pillai’s Trace = .01, F (6, 656) = .53, p = .788). Thus,
we could not detect differences in the comprehensibility ratings as well as the
performance between the various representations in each cognitive ability group
and across the two groups.</p>
        <p>Ranking. To evaluate the ranking of the four representations (1 = most
comprehensible, 4 = least comprehensible), we ran a Friedman’s test revealing a
significant difference across both ICAR groups, χ2(3) = 17.16, p = .001, n =
173 (see Figure 3, light bars). Post-hoc pairwise comparisons were
Bonferronicorrected and showed three significant comparisons. The interactive tree was
significantly more often ranked higher than the interactive text (z = .40, p =
.024, Cohen’s effect size r = .03) and also higher than static text (z = -.50, p =
.002, Cohen’s effect size r = .04). The static tree representation was also ranked
significantly higher than static text, z = .39, p = .032, Cohen’s effect size r =
.03 (see Figure 3).</p>
        <p>A Friedman’s test in the group with high ICAR performance showed a
significant difference in the ranking of representations, χ2(3) = 12.73, p =
.005, n = 83 (see Figure 3, dark bars). Bonferroni-corrected post-hoc pairwise
comparisons revealed two significant comparisons. There is a significant difference
between static tree and static text (z = .59, p = .019, Cohen’s effect size r =
.06) with static tree being ranked higher than static text. Interactive tree was
also preferred before static text, (z = -.54, p = .041, Cohen’s effect size r = .06).</p>
        <p>The low-ICAR-performers showed no significant difference in the ranking of
representations, χ2(3) = 6.70, p = .082, n = 90.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Discussion</title>
      <p>Neither of our two hypotheses could be conclusively confirmed. We did not find a
significant advantage of using some proof representations over others, even when
distinguishing groups by their levels of cognitive abilities. The analysis of the
final ranking of the proof representations indicates a subjective preference of the
conditions with tree-shaped proofs over their textual counterparts, but this did
not seem to impact the objective performance measure nor the subjective ratings
the participants gave after each proof. These preferences are largely driven by
the group with higher ICAR performance (cf. Figure 3).</p>
      <p>
        Limitations. According to the aims of our study, we did not pre-select
participants according to their experience with logic or field of studies. 55.5% of the
participants had no experience with propositional logic and 60.7% had never
worked with it. For many participants, even the ones with higher ICAR scores,
the proof tasks were very challenging, resulting in a mean score of M =2.36 out
of a total of 12. 15 people commented about the high difficulty level in the end,
and only 3 said the proofs were easy to understand. This resulted in many data
points being clustered on the lower end of the scale and differences being more
difficult to detect. For similar future studies, it is important to calibrate the
difficulty of the tasks well; they should be neither too hard nor too easy.
Conclusion. In addition to previous observations that shorter proofs are
better [
        <xref ref-type="bibr" rid="ref10 ref34">10, 34</xref>
        ], we observed a subjective preference for tree-shaped proofs, although
this was not reflected by increased performance in our study. As a side result, we
demonstrated that cognitive abilities tested by the ICAR16 predict the reasoning
performance in formal logics. In future work, we want to further investigate
the trade-off between giving no details (i.e., justifications) and giving too many
details (i.e., full proofs) in various representation formats. For laypersons, it may
be better to quickly communicate the gist of a proof in natural language, whereas
experts may require access to the formal details.
      </p>
      <p>Acknowledgements This work was supported by the DFG grant 389792660 as
part of TRR 248 (https://perspicuous-computing.science), and QuantLA,
GRK 1763 (https://lat.inf.tu-dresden.de/quantla).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Alharbi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howse</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stapleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamie</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Touloumis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The efficacy of OWL and DL on user understanding of axioms and their entailments</article-title>
          . In: d'Amato,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Fernández</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Tamma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.A.M.</given-names>
            ,
            <surname>Lécué</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Cudré-Mauroux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.F.</given-names>
            ,
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Heflin</surname>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.)
          <source>The Semantic Web - ISWC 2017 - 16th International Semantic Web Conference</source>
          , Vienna, Austria,
          <source>October 21-25</source>
          ,
          <year>2017</year>
          , Proceedings,
          <source>Part I. Lecture Notes in Computer Science</source>
          , vol.
          <volume>10587</volume>
          , pp.
          <fpage>20</fpage>
          -
          <lpage>36</lpage>
          . Springer (
          <year>2017</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -68288-
          <issue>4</issue>
          _
          <fpage>2</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alrabbaa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopmann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovtunova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Finding small proofs for description logic entailments: Theory and practice</article-title>
          . In: Albert,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Kovacs</surname>
          </string-name>
          ,
          <string-name>
            <surname>L</surname>
          </string-name>
          . (eds.) LPAR-
          <volume>23</volume>
          : 23rd International Conference on Logic for Programming,
          <source>Artificial Intelligence and Reasoning</source>
          . EPiC Series in Computing, vol.
          <volume>73</volume>
          , pp.
          <fpage>32</fpage>
          -
          <lpage>67</lpage>
          . EasyChair (
          <year>2020</year>
          ). https://doi.org/10.29007/nhpp
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Alrabbaa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopmann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovtunova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>On the complexity of finding good proofs for description logic entailments</article-title>
          . In: Borgwardt, S., Meyer, T. (eds.)
          <source>Proceedings of the 33rd International Workshop on Description Logics (DL</source>
          <year>2020</year>
          )
          <article-title>co-located with the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR</article-title>
          <year>2020</year>
          ), Online Event [Rhodes, Greece],
          <source>September 12th to 14th</source>
          ,
          <year>2020</year>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2663</volume>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2663</volume>
          /paper-1.pdf
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Alrabbaa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopmann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovtunova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Finding good proofs for description logic entailments using recursive quality measures</article-title>
          . In: Platzer,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Sutcliffe</surname>
          </string-name>
          ,
          <string-name>
            <surname>G</surname>
          </string-name>
          . (eds.)
          <source>Proceedings of the 28th International Conference on Automated Deduction (CADE'21). Lecture Notes in Computer Science</source>
          , vol.
          <volume>12699</volume>
          , pp.
          <fpage>291</fpage>
          -
          <lpage>308</lpage>
          . Springer-Verlag (
          <year>2021</year>
          ). https://doi.org/10.1007/ 978-3-
          <fpage>030</fpage>
          -79876-5_
          <fpage>17</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Alrabbaa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dachselt</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flemisch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopmann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Visualising proofs and the modular structure of ontologies to support ontology repair</article-title>
          . In: Borgwardt, S., Meyer, T. (eds.)
          <source>Proceedings of the 33rd International Workshop on Description Logics (DL</source>
          <year>2020</year>
          )
          <article-title>co-located with the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR</article-title>
          <year>2020</year>
          ), Online Event [Rhodes, Greece],
          <source>September 12th to 14th</source>
          ,
          <year>2020</year>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2663</volume>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2663</volume>
          /paper-2.pdf
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Androutsopoulos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lampouras</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galanis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Generating natural language descriptions from OWL ontologies: The NaturalOWL system</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          <volume>48</volume>
          ,
          <fpage>671</fpage>
          -
          <lpage>715</lpage>
          (
          <year>2013</year>
          ). https://doi.org/10.1613/jair.4017
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brandt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Pushing the EL envelope</article-title>
          . In: Kaelbling,
          <string-name>
            <given-names>L.P.</given-names>
            ,
            <surname>Saffiotti</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (eds.)
          <source>Proc. of the 19th Int. Joint Conf. on Artificial Intelligence (IJCAI'05)</source>
          . pp.
          <fpage>364</fpage>
          -
          <lpage>369</lpage>
          . Professional Book Center (
          <year>2005</year>
          ), http://ijcai.org/ Proceedings/09/Papers/053.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>An Introduction to Description Logic</article-title>
          . Cambridge University Press (
          <year>2017</year>
          ). https://doi.org/10.1017/9781139025355
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Baader</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peñaloza</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suntisrivaraporn</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Pinpointing in the description logic EL+</article-title>
          .
          <source>In: Proc. of the 30th German Annual Conf. on Artificial Intelligence (KI'07). Lecture Notes in Computer Science</source>
          , vol.
          <volume>4667</volume>
          , pp.
          <fpage>52</fpage>
          -
          <lpage>67</lpage>
          . Springer, Osnabrück, Germany (
          <year>2007</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -74565-
          <issue>5</issue>
          _
          <fpage>7</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Borgwardt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirsch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kovtunova</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiehr</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <article-title>: In the Eye of the Beholder: Which Proofs are Best</article-title>
          ? In: Borgwardt, S., Meyer, T. (eds.)
          <source>Proc. of the 33rd Int. Workshop on Description Logics (DL</source>
          <year>2020</year>
          ).
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2663</volume>
          (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2663</volume>
          /paper-6.pdf
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Burke</surname>
            ,
            <given-names>H.R.</given-names>
          </string-name>
          :
          <article-title>Raven's progressive matrices: Validity, reliability, and norms</article-title>
          .
          <source>The Journal of Psychology</source>
          <volume>82</volume>
          (
          <issue>2</issue>
          ),
          <fpage>253</fpage>
          -
          <lpage>257</lpage>
          (
          <year>1972</year>
          ). https://doi.org/10.1080/00223980.
          <year>1972</year>
          .9923815
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Chierchia</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuhrmann</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knoll</surname>
            ,
            <given-names>L.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pi-Sunyer</surname>
            ,
            <given-names>B.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sakhardande</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blakemore</surname>
            ,
            <given-names>S.J.:</given-names>
          </string-name>
          <article-title>The matrix reasoning item bank (mars-ib): novel, open-access abstract reasoning items for adolescents and adults</article-title>
          .
          <source>Royal Society Open Science</source>
          <volume>6</volume>
          (
          <issue>10</issue>
          ),
          <volume>190232</volume>
          (
          <year>2019</year>
          ). https://doi.org/10.1098/rsos.190232
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Climie</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rostad</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Test review: Wechsler adult intelligence scale</article-title>
          .
          <source>Journal of Psychoeducational Assessment</source>
          <volume>29</volume>
          ,
          <fpage>581</fpage>
          -
          <lpage>586</lpage>
          (12
          <year>2011</year>
          ). https://doi.org/10.1177/ 0734282911408707
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Cohen</surname>
          </string-name>
          , J.:
          <article-title>Statistical Power Analysis for the Behavioral Sciences</article-title>
          . Lawrence Erlbaum Associates, 2nd edn. (
          <year>1988</year>
          ). https://doi.org/10.4324/9780203771587
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Condon</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Revelle</surname>
            ,
            <given-names>W.:</given-names>
          </string-name>
          <article-title>The international cognitive ability resource: Development and initial validation of a public-domain measure</article-title>
          .
          <source>Intelligence</source>
          <volume>43</volume>
          ,
          <fpage>52</fpage>
          -
          <lpage>64</lpage>
          (
          <year>2014</year>
          ). https://doi.org/10.1016/j.intell.
          <year>2014</year>
          .
          <volume>01</volume>
          .004
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Daniel</surname>
          </string-name>
          , M.H.:
          <article-title>Intelligence testing: Status and trends</article-title>
          .
          <source>American</source>
          psychologist
          <volume>52</volume>
          (
          <issue>10</issue>
          ),
          <volume>1038</volume>
          (
          <year>1997</year>
          ). https://doi.org/10.1037/
          <fpage>0003</fpage>
          -
          <lpage>066X</lpage>
          .
          <fpage>52</fpage>
          .10.1038
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Engström</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nizamani</surname>
            ,
            <given-names>A.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strannegård</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Generating comprehensible explanations in description logic</article-title>
          .
          <source>In: Informal Proceedings of the 27th International Workshop on Description Logics</source>
          , Vienna, Austria,
          <source>July 17-20</source>
          ,
          <year>2014</year>
          . pp.
          <fpage>530</fpage>
          -
          <lpage>542</lpage>
          (
          <year>2014</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1193</volume>
          /paper_17.pdf
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Flemisch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Langner</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alrabbaa</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dachselt</surname>
          </string-name>
          , R.:
          <article-title>Towards designing a tool for understanding proofs in ontologies through combined node-link diagrams</article-title>
          . In: Ivanova,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Lambrix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Pesquita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Wiens</surname>
          </string-name>
          , V. (eds.) Proceedings of the Fifth International Workshop on Visualization and
          <article-title>Interaction for Ontologies and Linked Data co-located with the 19th International Semantic Web Conference (ISWC</article-title>
          <year>2020</year>
          ), Virtual Conference (originally planned in Athens, Greece),
          <source>November</source>
          <volume>02</volume>
          ,
          <year>2020</year>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2778</volume>
          , pp.
          <fpage>28</fpage>
          -
          <lpage>40</lpage>
          . CEUR-WS.org (
          <year>2020</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2778</volume>
          /paper3.pdf
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Heydasch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The Hagen matrices test (HMT)</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Hagen, Germany (
          <year>2014</year>
          ). https://doi.org/10.13140
          <source>/RG.2.2.31433.75361</source>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Horn</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cattell</surname>
          </string-name>
          , R.:
          <article-title>Refinement and test of the theory of fluid and crystallized general intelligences</article-title>
          .
          <source>Journal of educational psychology 57(5)</source>
          ,
          <fpage>253</fpage>
          -
          <lpage>270</lpage>
          (
          <year>October 1966</year>
          ). https://doi.org/10.1037/h0023816
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Justification Based Explanation in Ontologies</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Manchester, UK (
          <year>2011</year>
          ), https://www.research.manchester.ac.uk/portal/ files/54511395/FULL_TEXT.PDF
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bail</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Toward cognitive support for OWL justifications</article-title>
          .
          <source>Knowledge-Based Systems 53</source>
          ,
          <fpage>66</fpage>
          -
          <lpage>79</lpage>
          (
          <year>2013</year>
          ). https://doi.org/10. 1016/j.knosys.
          <year>2013</year>
          .
          <volume>08</volume>
          .021
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Justification oriented proofs in OWL</article-title>
          .
          <source>In: The Semantic Web - ISWC 2010 - 9th International Semantic Web Conference, ISWC 2010</source>
          , Shanghai, China, November 7-
          <issue>11</issue>
          ,
          <year>2010</year>
          ,
          <string-name>
            <given-names>Revised</given-names>
            <surname>Selected</surname>
          </string-name>
          <string-name>
            <surname>Papers</surname>
          </string-name>
          , Part I. pp.
          <fpage>354</fpage>
          -
          <lpage>369</lpage>
          (
          <year>2010</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -17746-0_
          <fpage>23</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>24. IBM Corp.: IBM SPSS Statistics for Windows [computer software], https://www. ibm.com/products/spss-statistics</mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Janzen</surname>
            ,
            <given-names>H.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obrzut</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marusiak</surname>
            ,
            <given-names>C.W.</given-names>
          </string-name>
          :
          <article-title>Test review: Roid, GH (</article-title>
          <year>2003</year>
          ).
          <article-title>StanfordBinet intelligence scales, (SB: V). Itasca, IL: Riverside Publishing</article-title>
          .
          <source>Canadian Journal of School Psychology</source>
          <volume>19</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>235</fpage>
          -
          <lpage>244</lpage>
          (
          <year>2004</year>
          ). https://doi.org/10.1177/ 082957350401900113
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klinov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stupnikov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Towards reusable explanation services in Protege</article-title>
          . In: Artale,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Glimm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Kontchakov</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          . (eds.)
          <source>Proc. of the 30th Int. Workshop on Description Logics (DL'17)</source>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <source>1879</source>
          (
          <year>2017</year>
          ), http://www.ceur-ws.
          <source>org/</source>
          Vol-1879/paper31.pdf
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krötzsch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simancik</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>The incredible ELK - from polynomial procedures to efficient reasoning with EL ontologies</article-title>
          .
          <source>J. Autom. Reasoning</source>
          <volume>53</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>61</lpage>
          (
          <year>2014</year>
          ). https://doi.org/10.1007/s10817-013-9296-3
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Krawczyk</surname>
            ,
            <given-names>D.C.</given-names>
          </string-name>
          :
          <article-title>The cognition and neuroscience of relational reasoning</article-title>
          .
          <source>Brain Research</source>
          <volume>1428</volume>
          ,
          <fpage>13</fpage>
          -
          <lpage>23</lpage>
          (
          <year>2012</year>
          ). https://doi.org/10.1016/j.brainres.
          <year>2010</year>
          .
          <volume>11</volume>
          . 080
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Kuhn</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The understandability of OWL statements in controlled english</article-title>
          .
          <source>Semantic Web</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <fpage>101</fpage>
          -
          <lpage>115</lpage>
          (
          <year>2013</year>
          ). https://doi.org/10.3233/SW-2012-0063
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>T.A.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Power</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piwek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Measuring the understandability of deduction rules for OWL</article-title>
          .
          <source>In: Proceedings of the First International Workshop on Debugging Ontologies and Ontology Mappings</source>
          ,
          <source>WoDOOM</source>
          <year>2012</year>
          , Galway, Ireland, October 8,
          <year>2012</year>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2012</year>
          ), http://www.ida.liu.se/~patla/conferences/ WoDOOM12/papers/paper4.pdf
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Peñaloza</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sertkaya</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Understanding the complexity of axiom pinpointing in lightweight description logics</article-title>
          .
          <source>Artificial Intelligence</source>
          <volume>250</volume>
          ,
          <fpage>80</fpage>
          -
          <lpage>104</lpage>
          (
          <year>2017</year>
          ). https: //doi.org/10.1016/j.artint.
          <year>2017</year>
          .
          <volume>06</volume>
          .002
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Raven</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raven</surname>
          </string-name>
          , J.:
          <article-title>Raven progressive matrices</article-title>
          .
          <source>In: Handbook of Nonverbal Assessment</source>
          , pp.
          <fpage>223</fpage>
          -
          <lpage>237</lpage>
          . Springer US, Boston, MA (
          <year>2003</year>
          ). https://doi.org/10. 1007/978-1-
          <fpage>4615</fpage>
          -0153-4_
          <fpage>11</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Schiller</surname>
            ,
            <given-names>M.R.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Towards explicative inference for OWL</article-title>
          .
          <source>In: Informal Proceedings of the 26th International Workshop on Description Logics</source>
          , Ulm, Germany,
          <source>July 23 - 26</source>
          ,
          <year>2013</year>
          . pp.
          <fpage>930</fpage>
          -
          <lpage>941</lpage>
          (
          <year>2013</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1014</volume>
          / paper_36.pdf
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Schiller</surname>
            ,
            <given-names>M.R.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schiller</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glimm</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Testing the adequacy of automated explanations of EL subsumptions</article-title>
          .
          <source>In: Proceedings of the 30th International Workshop on Description Logics</source>
          , Montpellier, France,
          <source>July 18-21</source>
          ,
          <year>2017</year>
          . (
          <year>2017</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-1879/paper43.pdf
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Explaining subsumption by optimal interpolation</article-title>
          . In: Alferes,
          <string-name>
            <given-names>J.J.</given-names>
            ,
            <surname>Leite</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.A</surname>
          </string-name>
          . (eds.)
          <source>Proc. of the 9th Eur. Conf. on Logics in Artificial Intelligence (JELIA'04). Lecture Notes in Computer Science</source>
          , vol.
          <volume>3229</volume>
          , pp.
          <fpage>413</fpage>
          -
          <lpage>425</lpage>
          . SpringerVerlag (
          <year>2004</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>540</fpage>
          -30227-8_
          <fpage>35</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Simancik</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kazakov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Consequence-based reasoning beyond horn ontologies</article-title>
          .
          <source>In: IJCAI 2011, Proceedings of the 22nd International Joint Conference on Artificial Intelligence</source>
          , Barcelona, Catalonia, Spain,
          <source>July 16-22</source>
          ,
          <year>2011</year>
          . pp.
          <fpage>1093</fpage>
          -
          <lpage>1098</lpage>
          (
          <year>2011</year>
          ). https://doi.org/10.5591/978-1-
          <fpage>57735</fpage>
          -516-8/
          <fpage>IJCAI11</fpage>
          -187
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Sternberg</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grigorenko</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bundy</surname>
            ,
            <given-names>D.A.</given-names>
          </string-name>
          :
          <article-title>The predictive value of IQ</article-title>
          .
          <source>MerrillPalmer Quarterly</source>
          <volume>47</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2001</year>
          ). https://doi.org/10.1353/mpq.
          <year>2001</year>
          .0005
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Urbina</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Tests of intelligence</article-title>
          . In: Sternberg,
          <string-name>
            <surname>R.J.</surname>
          </string-name>
          , Kaufman, S.B. (eds.) The Cambridge Handbook of Intelligence, p.
          <fpage>20</fpage>
          -
          <lpage>38</lpage>
          . Cambridge Handbooks in Psychology, Cambridge University Press (
          <year>2011</year>
          ). https://doi.org/10.1017/ CBO9780511977244.003
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Young</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keith</surname>
            ,
            <given-names>T.Z.:</given-names>
          </string-name>
          <article-title>An examination of the convergent validity of the ICAR16 and WAIS-IV</article-title>
          .
          <source>Journal of Psychoeducational Assessment</source>
          <volume>38</volume>
          (
          <issue>8</issue>
          ),
          <fpage>1052</fpage>
          -
          <lpage>1059</lpage>
          (
          <year>2020</year>
          ). https://doi.org/10.1177/0734282920943455
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>