<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Human Compliance with Normative Principles in Argumentation: Efects of Naturalness Bias and Graphical Representation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marija Petrović</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Predrag Teovanović</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Danka Purić</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bruno Yun</string-name>
          <email>bruno.yun@univ-lyon1.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Caren Al Anaissy</string-name>
          <email>alanaissy@cril.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sébastien Konieczny</string-name>
          <email>konieczny@cril.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Srdjan Vesic</string-name>
          <email>vesic@cril.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CRIL - CNRS - Univ. Artois</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CRIL Université d'Artois</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FASPER - Faculty of Special Education and Rehabilitation</institution>
          ,
          <addr-line>Visokog Stevana 2, 11000 Belgrade</addr-line>
          ,
          <country country="RS">Serbia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universite Claude Bernard Lyon 1, CNRS, Ecole Centrale de Lyon, INSA Lyon</institution>
          ,
          <addr-line>Université Lumière Lyon 2, LIRIS, UMR5205, 69622 Villeurbanne</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Belgrade, Faculty of Philosophy, Department of Psychology</institution>
          ,
          <addr-line>18-20 Čika Ljubina Street, 11000 Belgrade</addr-line>
          ,
          <country country="RS">Serbia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Belgrade, Faculty of Philosophy, LIRA Lab</institution>
          ,
          <addr-line>18-20 Čika Ljubina Street, 11000 Belgrade</addr-line>
          ,
          <country country="RS">Serbia</country>
        </aff>
      </contrib-group>
      <fpage>16</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>Argumentation theory examines how conclusions are derived or refuted through logical reasoning, playing a crucial role in human interaction and decision-making. In artificial intelligence, computational argumentation leverages formal models to aid in decision-making processes. This paper investigates the influence of argument content (specifically the naturalness bias) and graphical representation on participants' adherence to the simple principles of reinstatement and void precedence principles. We conducted experiments testing three hypotheses related to participants' rationality in evaluating arguments with and without graphical aids and bias-provoking content. Contrary to our expectations, neither the presence of graphical representations nor the type of content significantly impacted participants' compliance with the reinstatement principle. Additionally, the graph did not enhance understanding, suggesting the need for instructional aids. Our findings challenge previous studies and highlight the complexity of factors influencing argument evaluation.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Argumentation</kwd>
        <kwd>Bias</kwd>
        <kwd>Ranking-Based Semantics</kwd>
        <kwd>Human Reasoning</kwd>
        <kwd>Principles</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Argumentation theory is the interdisciplinary study of how conclusions can be supported or
undermined by premises through logical reasoning. It is an essential part of our daily human
reasoning, interaction and communication with one another. Argumentation is often used in
decision-making problems, resolving conflicts of opinion through negotiation and deliberation
processes, and influencing the thoughts of others through persuasion. The complex and dynamic
nature of argumentation is studied in various disciplines.</p>
      <p>
        It stands as an important domain of Artificial Intelligence (AI), especially in knowledge
representation and reasoning. Indeed, the use of AI allows for the development of computational
models for the exchange of arguments among various agents, facilitating the derivation of valid
conclusions from incomplete, inconsistent or conflicting information. Within AI, computational
argumentation focuses on creating formal models that support decision-making through the
construction and evaluation of arguments. Many works in computational arguments are based
on Dung’s seminal work [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] where he proposed the abstract argumentation framework, which is
a general tool for studying relations between the arguments such as attacks (and, more recently,
supports [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ] or sets of attacking arguments [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]).
      </p>
      <p>
        On a given abstract argumentation framework, one can apply many diferent semantics to
compute sets of acceptable arguments (conflict-free, admissible, . . . ) [
        <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
        ] or rank arguments
from the strongest to the weakest (using ranking-based semantics) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Those semantics are
based on many intuitions or normative principles, defined in the literature, which characterise
their behaviour. While some work has studied the link between basic intuitions from the
argumentation theory (e.g., reinstatement) and human reasoning [
        <xref ref-type="bibr" rid="ref10 ref8 ref9">8, 9, 10</xref>
        ], it is still unknown if
most of the principles are intuitive or used by humans.
      </p>
      <p>
        A recent work by Vesic, Yun, and Teovanovic [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] studied some principles for ranking-based
semantics (anonymity, void precedence, maximality, and independence) and showed that the
graphical representation influences the normativity of participant responses. Namely, there
was a higher level of compliance with the principles when the participants were shown the
graphical representation of the arguments. Moreover, they showed that anonymity between
tasks principle was not followed by the participants. In other words, for two diferent sets of
textual arguments with same structure (the graphs are isomorphic), the participants difered in
their evaluation of the arguments, although this efect was dampened for participants who were
shown the graphical representation. This demonstrates that both the content of the arguments
and the graphical representation have a significant efect on the evaluation task.
      </p>
      <p>The latter result opens up several important questions that were not studied before. In
what way does the content of the arguments (e.g., language, bias-provoking content) afect the
principle compliance? Can graphical representations consistently serve as a prescriptive tool
for enhancing rationality in individuals?</p>
      <p>In this exploratory paper, we will focus on a restricted setting. We will study whether
participants comply with the simple reinstatement principle and how this is afected by the
naturalness bias and the graphical representation.</p>
      <p>On the one hand, the simple reinstatement principle states that in a configuration with three
arguments , , and  such that  attacks  and  attacks  (see Figure 1), roughly speaking
“both  and  must be stronger than ”. On the other hand, the naturalness bias (also known
as appeal-to-nature) is a cognitive bias where people have a preference for things perceived as
“natural" over those perceived as “unnatural" or artificial. This bias can influence attitudes and
decisions in various domains, including food, medicine, or lifestyle choices, among others.</p>
      <p>To experimentally check the impact of the content (e.g. bias-provoking text) and the graph,
we set up an experiment featuring three hypotheses. Firstly, we expected participants to behave
more rationally in the absence of the naturalness bias, regardless of graph presence. Next,
we predicted that the graph would make participants more rational (as shown in previous
studies) and we also expected that the efect of the graph would be stronger in the presence of
naturalness bias than in its absence.</p>
      <p>This paper is structured as follows. In Section 2, we recall the computational argumentation
background on extension and ranking-based semantics needed to motivate the principles. In
Section 3, we formulate our aims and hypothesis with respect to the principles, the graph, and
the naturalness bias. In Section 4, we describe the design of our study, the sampling plan, and
the instruments. Lastly, we present our results in Section 5 and conclude in Section 6.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Argumentation Background</title>
      <p>
        We start this section by recalling the definition of an argumentation framework as defined by
Dung in his seminal paper [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] (as an example, see Figure 1).
      </p>
      <p>Definition 1 (Argumentation framework). An argumentation framework is a pair ℱ =
(, ), where  is a finite set of arguments and  ⊆  ×  is a set of binary attacks between
arguments. The set of attackers of  ∈  is () = { ∈  | (, ) ∈ }.</p>
      <p>In the following sub-sections, we explain what is considered as rational with respect to
extension-based semantics and ranking-based semantics.</p>
      <sec id="sec-2-1">
        <title>2.1. Extension-based semantics</title>
        <p>Given an argumentation framework ℱ = (, ), we say that a set  ⊆  is conflict-free if
there is no ,  ∈  such that (, ) ∈ . A set  ⊆  defends  ∈  if for every  ∈ 
such that (, ) ∈ , there exists  ∈  such that (, ) ∈ . A set  ⊆  is admissible if it is
conflict-free and defends every argument in . A set  is preferred if it is a maximal (for set
inclusion) admissible set.</p>
        <p>The preferred semantics is the function that computes all the preferred sets from an
argumentation framework. There are many other acceptability semantics, which we do not introduce in
this paper as the details are not of practical interest for the rest of our cognitive study1.</p>
        <sec id="sec-2-1-1">
          <title>1Please refer to [6] for an introduction to extenion-based semantics.</title>
          <p>Example 1. In Figure 1, there are three admissible sets, which are ∅, {}, and {, }, but there
is only one preferred set which is {, }.</p>
          <p>
            In Example 1, the preferred extension shows that both  and  are better than  (as  and
 are in the preferred set but  is not). In this case, we can consider that  and  should
be equally acceptable. Thus, we define the extension-based simple reinstatement, inspired by
[
            <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
            ], as follows.
          </p>
          <p>Definition 2 (Extension-based simple reinstatement). In a configuration with three
arguments , , and  such that  attacks  and  attacks , the extension-based simple
reinstatement is satisfied if  is equally strong as ,  is stronger than  and  is stronger than
.</p>
          <p>While extension can extract sets of acceptable arguments, an orthogonal approach consists
in ranking-arguments from the stronger to the weakest one.</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Ranking-based semantics</title>
        <p>Another family of argumentation semantics is called ranking-based. They do not calculate
extensions; instead, they rank the arguments from the strongest to the weakest one. In this
paper, we focus on a particular family of ranking-based semantics, called gradual semantics,
which associates to each argument, a number from 0 (weakest) to 1 (strongest). Those scores
naturally induce an order on the set of arguments.</p>
        <p>
          Definition 3 (Gradual semantics). A gradual semantics is a function  that takes as input
any ℱ = (, ) and returns a function ℱ :  → [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]. The notation ℱ () ≤ ℱ ()
means that  is at least as acceptable as  w.r.t.  .
        </p>
        <p>
          For the rest of our study, we do not need particular semantics. However, just for the sake of
illustration, we introduce one semantics, called h-categorizer [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. We refer the reader to the
work of [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] for a comparative study on ranking-based semantics.
        </p>
        <p>Definition 4 ( ℎ-categorizer). The ℎ-categorizer semantics is a ranking-based semantics such
that for every ℱ = (, ), for every argument  ∈  we have
ℎ () =
ℱ</p>
        <p>1
1 + ∑︀∈() ℎ ()
ℱ</p>
        <p>
          It was shown [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] that ℎ-categorizer is well-defined for every argumentation framework (i.e.,
the semantics converge to unique values for each argumentation framework). To illustrate,
the values assigned by ℎ-categorizer to the arguments from Figure 1 are: ℎ () = 1,
ℱ
ℱℎ () = 21 , and ℱℎ () = 23 .
        </p>
        <p>Many principles were defined for characterising the behaviour of ranking-based semantics.
For the purpose of this paper, we do not need to recall them all. We just formalize the
rankingbased simple reinstatement and the void precedence principles.
Definition 5 (Ranking-based simple reinstatement). In a configuration with three
arguments , , and  such that  attacks  and  attacks , the ranking-based simple reinstatement
is satisfied if  is stronger than  and  is stronger than .</p>
        <p>We consider that both extension-based and ranking-based are valid and rational points of
view. In the rest of the paper, we allow the rational agent to act based on either of them. Hence,
we will say that the reinstatement is satisfied if either ranking-based simple reinstatement or
extension-based simple reinstatement is satisfied.</p>
        <p>Definition 6 (Simple reinstatement). In a configuration with three arguments , , and 
such that  attacks  and  attacks , the simple reinstatement is satisfied if  is equal or
stronger than  and  is stronger than .</p>
        <p>
          Void precedence states that a non-attacked argument is stronger than an attacked one.
Definition 7 (Void precedence [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]). We say that a ranking-based semantics  satisfies void
precedence if for every argumentation graph ℱ = (, ) and ,  ∈  such that () = ∅ and
() ̸= ∅ then ℱ () &gt; ℱ ().
        </p>
        <p>Now that we have defined what is considered rational in abstract argumentation, we formally
describe our aims and hypotheses.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Aim and Hypotheses</title>
      <p>In the current study, we examined if people comply with normative principles more readily when
presented with arguments with neutral content in comparison to arguments that contain
biasprovoking content. To do so, we explored people’s response patterns in a simple reinstatement
scenario. As an example of bias-provoking content, we relied on the naturalness bias i.e., the
tendency to prefer natural things over artificial ones, all other things being equal. Furthermore,
we aimed to replicate previous findings that graphical representation of arguments leads to
higher compliance with normative principles i.e., the presence of an argumentation graph
facilitates rational inference.</p>
      <p>We expected participants to comply more with normative principles in the neutral condition
than in the naturalness bias condition, regardless of graph presence (H1). Moreover, we expected
participants to comply more with normative principles in the graph condition relative to the
no-graph condition, regardless of content (H2). Finally, we expected an interaction between the
two factors, i.e., that the efect of the graph will be stronger in the naturalness bias condition
relative to the neutral condition (H3).</p>
      <p>On an exploratory level, we also checked whether people tend to follow extension-based or
ranking-based argumentation semantics more, depending on the rating of diferent arguments
(the non-attacked one and the reinstated one). The extension-based semantics predict that both
arguments will have the same acceptability level, whereas most of the ranking-based semantics
predict that the non-attacked one will be stronger.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Method</title>
      <sec id="sec-4-1">
        <title>4.1. Open science</title>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Study design</title>
        <p>The materials and data to reproduce the findings of this study can be found on the project OSF
page: https://osf.io/kce7q/.</p>
        <p>We employed a 2x2 design with two independent factors: content (neutral vs. bias-provoking)
and graph (present vs. absent), resulting in four experimental groups of participants.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Sampling plan and data exclusions</title>
        <p>The sample size was based on an a priori power analysis. Since we considered several possible
efects, we opted to base the power analysis on the smallest efect size of interest - i.e., a small
interaction efect size (Cohen’s  = .10). For power .80, the apriori analysis indicated we would
need  = 197 per group (788 in total) to detect this efect size. For power .90, for the same
efect size, we would need  = 263 per group (1052 in total). For further details on the power
analysis, see preregistration - https://aspredicted.org/9T9_NG4.</p>
        <p>As per preregistration, we included an attention check (“Please choose Completely agree to
indicate you are paying attention”), and all participants who failed the attention check were
automatically excluded from the sample and did not count towards the final sample size.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Final sample and procedure</title>
        <p>The final sample size consisted of a total of  = 1048 UK and US Prolific participants. All
respondents were compensated for their participation using the standard Prolific 9 £/hour rate.
Most participants were from the UK (85.6%), and 61.8% were female, with an average age of
  = 41.45 ( = 13.71)2. Due to a technical error in the questionnaire setup, there
was an unequal distribution between the four experimental groups (116 for neutral content
without a graph, 398 for neutral content with a graph, 400 for bias-provoking content without
a graph, and 133 for bias-provoking content with a graph).</p>
        <p>
          The questionnaire was administered online via Prolific and hosted on the SoSci Survey
platform [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The participants were randomly allocated first to either the neutral or
biasedcontent condition, and then to either the graph or no-graph condition. The participants then
saw the arguments (and the graph, depending on the condition) and rated them, after which
they answered an additional question on their strategy when rating the arguments. The planned
duration of the study was four minutes, but on average, participants completed it in 100 seconds
( = 43.3).
        </p>
        <p>2Based on the data from  = 1046 participants, since two participants’ demographic Prolific data could not be
matched.</p>
        <p>Consider the arguments below.
• Argument A: It doesn’t matter if you drink natural or lab-produced water.
• Argument B: Natural things are healthier than lab-produced ones, so you should drink natural
water.</p>
        <p>• Argument C: They are chemically identical, so they have the same impact on your health.</p>
        <p>Consider the arguments below.
• Argument A: It’s not going to rain today.
• Argument B: But look at all these clouds, it will rain.</p>
        <p>• Argument C: Those clouds are Cumulus, hence they do not produce rain.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Study instruments</title>
        <p>As mentioned in Section 4.2, we had four experimental groups of participants (neutral vs.
bias-provoking / graph vs. no-graph).</p>
        <p>Each participant was first shown three arguments in their textual forms. Depending on their
group (neutral vs. bias-provoking), the textual content was diferent (see Figures 2 and 3). We
can observe that in both cases, argument B attacks argument A which consists only of a claim,
argument C attacks argument B since C attacks B’ s premise.</p>
        <p>
          The textual content of the arguments was inspired by structured argumentation which allows
for constructing arguments from a knowledge base using a formal language. In this context,
arguments are characterized as structured because they reveal their underlying premises and
conclusions clearly, the connection between them is formally established, and attacks among
them are formally defined. There exist several approaches in structured argumentation for
formalizing arguments such as the ASPIC+ framework [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], deductive argumentation [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] and
the assumption-based framework [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. We represented arguments in their simplest forms which
consists of a premise that supports a conclusion (claim) using an inference rule. In some cases,
the premise can be hidden (e.g., in argument A), hence the argument consists only of a claim.
Based on the internal structure of the arguments, we can have diferent types of attack relations
between the arguments. For example, an argument can attack another argument’ s premise; this
type of attack is called undermining attack (e.g., attack from  to ). Another type of attack,
called rebutting attack, is an attack on the conclusion of an argument (e.g., attack from  to ).
        </p>
        <p>Only for the participants in the graph groups, we displayed the graphical visualisation of the
arguments and attacks (see Figure 1).</p>
        <p>Lastly, the participants were asked to assess the strength of each argument using a 5-point
Likert scale from 1 (Weak) to 5 (Strong), with 3 being “Neutral”.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>Overall, participants complied with simple reinstatement to a low degree (see Figure 4), with
the percentage of participants who followed these principles in either of the groups ranging
from 10.3% to 16.5%.</p>
      <p>We conducted a logistic regression analysis to investigate how well the two factors (content
and graph) and their interaction predict compliance with simple reinstatement. The
goodnessof-fit of the logistic regression model was assessed using a likelihood ratio test, which yielded
a non-significant chi-square statistic,  2(3) = 2.96,  = .398. Additionally, the 2 value
indicated that the model accounts for only 0.3% of the variability in compliance with simple
reinstatement. This suggests that the model with argument content and graphical representation
did not provide a significantly better fit to the data than a model without them, as shown in
Table 1.</p>
      <p>Regarding void precedence, the participants in our sample followed this principle to a larger
degree (see Figure 5) - the lowest percentage in either of the groups was 44.4% while the highest
was 69.8%.</p>
      <p>To explore if the two factors and their interaction predict compliance with void precedence,
we conducted another logistic regression analysis. The goodness-of-fit of the model was assessed
using a likelihood ratio test, which yielded a significant chi-square statistic,  2(3) = 53.66,  &lt;
Predictor 
Intercept -3.78
Content 1.14</p>
      <p>Graph 1.09
Content * Graph -0.60
.001, and 2 value indicated that the model accounts for 5% of the variability in compliance with
void precedence. As shown in Table 2, content influenced rational behavior, while the graph and
interaction terms were non-significant. Presenting arguments with naturalness bias-provoking
content decreased compliance with void precedence by 71% (() = 0.29, 95% =
[0.11, 0.77]).</p>
      <p>Further analyses revealed that the efect of content is due to higher ratings of argument 
((1057) = 2.66,  = .008,  = 0.16) and especially lower ratings of argument  ((1057) =
7.69,  &lt; .001,  = 0.47) in the group presented with bias-provoking content, regardless of
the presence of graph (see Table 3). Ratings for argument  did not difer significantly between
bias-provoking and neutral content groups ( = .28).</p>
      <p>-2.26
-1.24
-0.36
0.18</p>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <sec id="sec-6-1">
        <title>We now summarize and discuss all of our hypotheses and results.</title>
        <p>First, we expected participants to comply more with normative principles in the neutral
condition than in the naturalness bias condition, regardless of graph presence (H1). For this
hypothesis, our results are mixed. The experiment showed that the change in content (i.e.
neutral vs biased) did not impact the degree of compliance with the simple reinstatement
principle. However, the content influenced the degree to which participants complied with the
void precedence principle. Namely, in the presence of arguments incorporating naturalness bias,
the ratings of the non-attacked argument  were very low while the ratings of the non-defended
argument  were high (probably due to the naturalness bias increasing participants’ ratings of
).</p>
        <p>
          Second, we expected participants to comply more with normative principles in the graph
condition relative to the no-graph condition, regardless of content (H2). This hypothesis was
disconfirmed - our experiment showed that the graph did not provide a significantly better fit to
the data. This contrasts the recent results [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] where the presence of the graphical representation
was shown to enhance compliance with principles for graded argumentation semantics. Note,
however, that the participants in that study went through a tutorial and several tasks before
evaluating the strength of the arguments. This hints that a tutorial may be necessary to teach
participants the meaning of the graphical representation and how it is linked to argument
ratings.
        </p>
        <p>Third, we expected an interaction between the two factors, i.e., that the efect of the graph
will be stronger in the naturalness bias condition relative to the neutral condition (H3). Contrary
to our hypothesis, the experiment showed that the interaction efect was not significant in our
model.</p>
        <p>
          In our experiments, compliance with the reinstatement principle was very low (less than 20%).
Our results difer from the existing empirical work on reinstatement [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. However, the task that
participants were presented with is also diferent. While we study the individual satisfaction of
the principle considering all arguments simultaneously, they compared the average ratings of
argument  (before  is introduced, after  is introduced, and then when  is introduced).
        </p>
        <p>Note that according to the average scores of arguments in our experiment,  is stronger than
 and  is stronger than . One of the possible explanations for this is that the argument 
is just a statement and contains no justification. Argument  has a general justification (e.g.,
there are clouds so it will rain) and might be seen as more acceptable for this reason. Finally
the argument  has the strongest hypothesis that fully justifies its conclusion and might seem
dificult to attack (e.g., the cumulus clouds do not produce rain).</p>
        <p>Another reason why compliance with simple reinstatement might be low is that participants’
intuition about the notion of defense diverges significantly from that of the researchers. More
precisely, it might be that the participants misunderstood the meaning of attacks and associated
attacks solely with a reduction in argument  strength, ignoring the increase in strength
brought by reinstatement for argument .</p>
        <p>Importantly, several factors limit the generalizability of our findings. Firstly, due to a
programming error, the four groups in our study were not equal in size, somewhat reducing statistical
power. Next, since participants only responded to one presented scenario, it is dificult to say
how they would respond to other scenarios, be it simple reinstatement scenarios with diferent
content or more complex scenarios. Finally, we presented all arguments at the same time, which
may have an impact on participants’ perception of attacks and, consequently, their ratings of
arguments’ strength.</p>
        <p>
          In future work, we plan to introduce the arguments step-by-step (as it was done by Rahwan
et al. [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]) while monitoring the ratings of all arguments (instead of only ). We believe that
this will give us a better insight on the factors at play behind the participants’ ratings of the
arguments.
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work benefited from the support of the project SATTORI in the framework of the Pavle
Savic - Hubert Curien funding scheme, and from the support of the project AGGREEY
ANR-22CE23-0005 of the French National Research Agency (ANR).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P. M.</given-names>
            <surname>Dung</surname>
          </string-name>
          ,
          <article-title>On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games</article-title>
          ,
          <source>Artif. Intell</source>
          .
          <volume>77</volume>
          (
          <year>1995</year>
          )
          <fpage>321</fpage>
          -
          <lpage>358</lpage>
          . URL: https://doi.org/10.1016/
          <fpage>0004</fpage>
          -
          <lpage>3702</lpage>
          (
          <issue>94</issue>
          )
          <fpage>00041</fpage>
          -
          <lpage>X</lpage>
          . doi:
          <volume>10</volume>
          .1016/
          <fpage>0004</fpage>
          -
          <lpage>3702</lpage>
          (
          <issue>94</issue>
          )
          <fpage>00041</fpage>
          -
          <lpage>X</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cayrol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lagasquie-Schiex</surname>
          </string-name>
          ,
          <article-title>Bipolarity in argumentation graphs: Towards a better understanding</article-title>
          ,
          <source>Int. J. Approx. Reason</source>
          .
          <volume>54</volume>
          (
          <year>2013</year>
          )
          <fpage>876</fpage>
          -
          <lpage>899</lpage>
          . URL: https://doi.org/10.1016/j. ijar.
          <year>2013</year>
          .
          <volume>03</volume>
          .001. doi:
          <volume>10</volume>
          .1016/J.IJAR.
          <year>2013</year>
          .
          <volume>03</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gottifredi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. R.</given-names>
            <surname>Simari</surname>
          </string-name>
          ,
          <article-title>A survey of diferent approaches to support in argumentation systems</article-title>
          ,
          <source>Knowl. Eng. Rev</source>
          .
          <volume>29</volume>
          (
          <year>2014</year>
          )
          <fpage>513</fpage>
          -
          <lpage>550</lpage>
          . URL: https: //doi.org/10.1017/S0269888913000325. doi:
          <volume>10</volume>
          .1017/S0269888913000325.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Yun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vesic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Croitoru</surname>
          </string-name>
          ,
          <article-title>Ranking-based semantics for sets of attacking arguments</article-title>
          ,
          <source>in: The Thirty-Fourth AAAI Conference on Artificial Intelligence</source>
          ,
          <source>AAAI</source>
          <year>2020</year>
          ,
          <source>The ThirtySecond Innovative Applications of Artificial Intelligence Conference</source>
          ,
          <source>IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI</source>
          <year>2020</year>
          , New York, NY, USA, February 7-
          <issue>12</issue>
          ,
          <year>2020</year>
          , AAAI Press,
          <year>2020</year>
          , pp.
          <fpage>3033</fpage>
          -
          <lpage>3040</lpage>
          . URL: https: //doi.org/10.1609/aaai.v34i03.5697. doi:
          <volume>10</volume>
          .1609/AAAI.V34I03.5697.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Parsons</surname>
          </string-name>
          ,
          <article-title>A generalization of dung's abstract framework for argumentation: Arguing with sets of attacking arguments</article-title>
          , in: N.
          <string-name>
            <surname>Maudet</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Parsons</surname>
          </string-name>
          , I. Rahwan (Eds.),
          <article-title>Argumentation in Multi-Agent Systems</article-title>
          , Third International Workshop, ArgMAS 2006, Hakodate, Japan, May 8,
          <year>2006</year>
          ,
          <string-name>
            <given-names>Revised</given-names>
            <surname>Selected</surname>
          </string-name>
          and Invited Papers, volume
          <volume>4766</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2006</year>
          , pp.
          <fpage>54</fpage>
          -
          <lpage>73</lpage>
          . URL: https://doi.org/10. 1007/978-3-
          <fpage>540</fpage>
          -75526-
          <issue>5</issue>
          _4. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>540</fpage>
          -75526-5\_4.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Baroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Caminada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Giacomin</surname>
          </string-name>
          ,
          <article-title>An introduction to argumentation semantics</article-title>
          ,
          <source>Knowl. Eng. Rev</source>
          .
          <volume>26</volume>
          (
          <year>2011</year>
          )
          <fpage>365</fpage>
          -
          <lpage>410</lpage>
          . URL: https://doi.org/10.1017/S0269888911000166. doi:
          <volume>10</volume>
          .1017/S0269888911000166.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Amgoud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ben-Naim</surname>
          </string-name>
          ,
          <article-title>Ranking-based semantics for argumentation frameworks</article-title>
          , in: W. Liu,
          <string-name>
            <given-names>V. S.</given-names>
            <surname>Subrahmanian</surname>
          </string-name>
          , J. Wijsen (Eds.),
          <source>Scalable Uncertainty Management - 7th International Conference, SUM 2013</source>
          , Washington, DC, USA, September
          <volume>16</volume>
          -
          <issue>18</issue>
          ,
          <year>2013</year>
          . Proceedings, volume
          <volume>8078</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2013</year>
          , pp.
          <fpage>134</fpage>
          -
          <lpage>147</lpage>
          . URL: https: //doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -40381-1_
          <fpage>11</fpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -40381-1\_
          <fpage>11</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cramer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Guillaume</surname>
          </string-name>
          ,
          <article-title>Empirical study on human evaluation of complex argumentation frameworks</article-title>
          , in: F. Calimeri,
          <string-name>
            <given-names>N.</given-names>
            <surname>Leone</surname>
          </string-name>
          , M. Manna (Eds.),
          <source>Logics in Artificial Intelligence - 16th European Conference, JELIA</source>
          <year>2019</year>
          , Rende, Italy, May 7-
          <issue>11</issue>
          ,
          <year>2019</year>
          , Proceedings, volume
          <volume>11468</volume>
          of Lecture Notes in Computer Science, Springer,
          <year>2019</year>
          , pp.
          <fpage>102</fpage>
          -
          <lpage>115</lpage>
          . URL: https://doi. org/10.1007/978-3-
          <fpage>030</fpage>
          -19570-
          <issue>0</issue>
          _7. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -19570-0\_7.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cramer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Guillaume</surname>
          </string-name>
          ,
          <article-title>Empirical cognitive study on abstract argumentation semantics</article-title>
          , in: S. Modgil,
          <string-name>
            <given-names>K.</given-names>
            <surname>Budzynska</surname>
          </string-name>
          , J. Lawrence (Eds.),
          <source>Computational Models of Argument - Proceedings of COMMA</source>
          <year>2018</year>
          , Warsaw, Poland,
          <fpage>12</fpage>
          -14
          <source>September</source>
          <year>2018</year>
          , volume
          <volume>305</volume>
          <source>of Frontiers in Artificial Intelligence and Applications</source>
          , IOS Press,
          <year>2018</year>
          , pp.
          <fpage>413</fpage>
          -
          <lpage>424</lpage>
          . URL: https: //doi.org/10.3233/978-1-
          <fpage>61499</fpage>
          -906-5-413. doi:
          <volume>10</volume>
          .3233/978-1-
          <fpage>61499</fpage>
          -906-5-413.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Bezou Vrakatseli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Prakken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Janssen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Amgoud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Booth</surname>
          </string-name>
          , et al.,
          <article-title>New experiments on reinstatement and gradual acceptability of arguments</article-title>
          ,
          <source>in: Proceedings of the 19th International Workshop on Nonmonotonic Reasoning</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>109</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vesic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Teovanovic</surname>
          </string-name>
          ,
          <article-title>Graphical representation enhances human compliance with principles for graded argumentation semantics</article-title>
          , in: P.
          <string-name>
            <surname>Faliszewski</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Mascardi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Pelachaud</surname>
          </string-name>
          , M. E. Taylor (Eds.),
          <source>21st International Conference on Autonomous Agents and Multiagent Systems, AAMAS</source>
          <year>2022</year>
          , Auckland, New Zealand, May 9-
          <issue>13</issue>
          ,
          <year>2022</year>
          , International Foundation for Autonomous
          <source>Agents and Multiagent Systems (IFAAMAS)</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>1319</fpage>
          -
          <lpage>1327</lpage>
          . URL: https://www.ifaamas.org/Proceedings/aamas2022/pdfs/p1319.pdf.
          <source>doi:10.5555/3535850</source>
          .3535997.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L. van der</given-names>
            <surname>Torre</surname>
          </string-name>
          , S. Vesic,
          <article-title>The principle-based approach to abstract argumentation semantics</article-title>
          ,
          <source>FLAP</source>
          <volume>4</volume>
          (
          <year>2017</year>
          ). URL: http://www.collegepublications.co.uk/downloads/ifcolog00017. pdf.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>P.</given-names>
            <surname>Baroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Giacomin</surname>
          </string-name>
          ,
          <article-title>On principle-based evaluation of extension-based argumentation semantics</article-title>
          ,
          <source>Artif. Intell</source>
          .
          <volume>171</volume>
          (
          <year>2007</year>
          )
          <fpage>675</fpage>
          -
          <lpage>700</lpage>
          . URL: https://doi.org/10.1016/j.artint.
          <year>2007</year>
          .
          <volume>04</volume>
          .004. doi:
          <volume>10</volume>
          .1016/J.ARTINT.
          <year>2007</year>
          .
          <volume>04</volume>
          .004.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Besnard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hunter</surname>
          </string-name>
          ,
          <article-title>A logic-based theory of deductive arguments</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>128</volume>
          (
          <year>2001</year>
          )
          <fpage>203</fpage>
          -
          <lpage>235</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>E.</given-names>
            <surname>Bonzon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Delobelle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Konieczny</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Maudet</surname>
          </string-name>
          ,
          <article-title>A comparative study of ranking-based semantics for abstract argumentation</article-title>
          , in: D.
          <string-name>
            <surname>Schuurmans</surname>
            ,
            <given-names>M. P.</given-names>
          </string-name>
          Wellman (Eds.),
          <source>Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17</source>
          ,
          <year>2016</year>
          , Phoenix, Arizona, USA, AAAI Press,
          <year>2016</year>
          , pp.
          <fpage>914</fpage>
          -
          <lpage>920</lpage>
          . URL: https://doi.org/10.1609/aaai.v30i1.10116. doi:
          <volume>10</volume>
          .1609/AAAI.V30I1.10116.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>F.</given-names>
            <surname>Pu</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. L.</surname>
          </string-name>
          <year>andY</year>
          . Zhang, G. Luo,
          <article-title>Argument ranking with categoriser function</article-title>
          , in: International Knowledge Science, Engineering and Management Conference KSEM'
          <volume>14</volume>
          ,
          <year>2014</year>
          , pp.
          <fpage>290</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Leiner</surname>
          </string-name>
          ,
          <source>Sosci survey (version 3.2</source>
          . 31)[computer software],
          <string-name>
            <surname>München: SoSci Survey GmbH</surname>
          </string-name>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Modgil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Prakken</surname>
          </string-name>
          ,
          <article-title>A general account of argumentation with preferences</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>195</volume>
          (
          <year>2013</year>
          )
          <fpage>361</fpage>
          -
          <lpage>397</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>P.</given-names>
            <surname>Besnard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hunter</surname>
          </string-name>
          ,
          <article-title>A logic-based theory of deductive arguments</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>128</volume>
          (
          <year>2001</year>
          )
          <fpage>203</fpage>
          -
          <lpage>235</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>P. M. Dung</surname>
            ,
            <given-names>R. A.</given-names>
          </string-name>
          <string-name>
            <surname>Kowalski</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Toni</surname>
          </string-name>
          ,
          <article-title>Assumption-based argumentation</article-title>
          ,
          <source>Argumentation in artificial intelligence</source>
          (
          <year>2009</year>
          )
          <fpage>199</fpage>
          -
          <lpage>218</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>I.</given-names>
            <surname>Rahwan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. I.</given-names>
            <surname>Madakkatel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bonnefon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. N.</given-names>
            <surname>Awan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Abdallah</surname>
          </string-name>
          ,
          <article-title>Behavioral experiments for assessing the abstract argumentation semantics of reinstatement, Cogn</article-title>
          . Sci.
          <volume>34</volume>
          (
          <year>2010</year>
          )
          <fpage>1483</fpage>
          -
          <lpage>1502</lpage>
          . URL: https://doi.org/10.1111/j.1551-
          <fpage>6709</fpage>
          .
          <year>2010</year>
          .
          <volume>01123</volume>
          .x. doi:
          <volume>10</volume>
          .1111/J.
          <fpage>1551</fpage>
          -
          <lpage>6709</lpage>
          .
          <year>2010</year>
          .01123.X.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>