Towards an Argument Scheme Classification for
Ethical Reasoning
Elfia Bezou-Vrakatseli1 , Oana Cocarascu1 and Sanjay Modgil1
1
    Department of Informatics, King’s College London, UK


                                         Abstract
                                         This paper describes progress towards realising the long-term research goal of supporting dialogue
                                         between humans and between humans and AI systems so as to enable transparent and rational joint
                                         reasoning, in particular about matters of ethical significance. Key to realising this goal is this paper’s
                                         proposed exploration and analysis of natural language moral debates, via argument schemes and critical
                                         questions. We believe this to be an important first step in identifying schemes and scheme taxonomies,
                                         specialised for ethical reasoning, and that can support both the natural language processing needed to
                                         support human-AI dialogue, as well as for scaffolding human-human dialogue.

                                         Keywords
                                         Dialogue, argument schemes, argument mining, ethics


1. Introduction & Motivation
Recent highly publicised successes in Artificial Intelligence (AI) applications have in large part
been due to advances in machine learning (notably deep learning) [1]. However, symbolic ap-
proaches to AI will necessarily play a role in facilitating inter-agent (in particular human-AI and
human-human) communication which is inherently a symbolic activity. In particular, reasoning
in the presence of uncertainty, conflict, and disagreement typically benefits from multiple agents
engaged in information sharing and joint reasoning; that is, in dialogical exchanges normatively
governed by prescriptions encoded in logics for non-monotonic reasoning. One of the most
promising paradigms for facilitating such dialogues is through argumentation-based charac-
terisations of non-monotonic (nm) inference relations in terms of the exchange of arguments
constructed from a given static belief base. These ‘monological’ characterisations can then be
generalised to dialogical models in which agents exchange locutions (not limited to arguments),
such that evaluation of a dialogical exchange in favour of a claim equates with that claim being
a non-monotonic inference from the information shared during the course of the dialogue [2].
As dialogues consist of informational exchanges, they enable the understanding of the parties
involved, not only as a direct result of the information each utterance conveys, but also as a
means of exploring the reasoning of the interlocutors. As a result, dialogues also facilitate joint
reasoning. Overall, the communication and joint reasoning of AI systems and humans leverages
the strengths of AI along with those of human reasoning.

CMNA ’22: Workshop on Computational Models of Natural Argument
Envelope-Open elfia.bezou_vrakatseli@kcl.ac.uk (E. Bezou-Vrakatseli); oana.cocarascu@kcl.ac.uk (O. Cocarascu);
sanjay.modgil@kcl.ac.uk (S. Modgil)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)


                                                                                                           1
Elfia Bezou-Vrakatseli et al. CEUR Workshop Proceedings                                            1–5


   Our long-term research goal is guiding humans and AI systems alike in building and chal-
lenging arguments related to ethical issues as ethical reasoning is of crucial importance for
decision-making systems that can impact human lives [3]. Moreover, enabling joint human
and AI reasoning can help solve the value alignment problem [2] and help ensure the ethical
behaviour, in accordance with human values, of agents as shown in [4]. In order to support
dialogue between humans and between humans and AI systems that can enable transparent and
rational joint reasoning, we need to draw from recent advances in natural language processing
(NLP), and in particular argument mining [5], an area of research which focuses on extracting
arguments and their relations from text (e.g. relations between premises and conclusions).
   To this end, we begin by analysing natural language (NL) texts using argument schemes and
critical questions. In particular, we focus on annotating moral debates using two argument
scheme classifications: Walton [6] and Wagemans [7]. As the existing vast variety of taxonomies
and argument schemes is one of the biggest problems in the literature, we purport to reconcile
the two most used classifications and develop a theoretically well-founded, as well as practically
useful, hybrid classification that can be applied to ethical debates. Said reconciliation leverages
the strengths of both, while identifying the schemes particular to ethical reasoning. The novelty
of our research can be found in the attempt to go beyond standard argument mining techniques
(to determine the relation between premise and conclusion and identify support/attack relations
between arguments) by making use of informal logic. In particular, we make use of argument
schemes and critical questions that offer a semantically richer approach to argument classifica-
tion, as premises support a conclusion by virtue of instantiating a scheme and support/attack
relations are instigated in response to critical questions.


2. Background & Related Work
Argument schemes represent stereotypical patterns of reasoning and consist of a set of premises
and a conclusion. Walton proposed over 60 argument schemes with corresponding sets of critical
questions which are used to evaluate the strength of an argument [6] . For example, Walton’s
representation of the argument from positive consequences scheme is defined by a Premise: “If 𝐴
is brought about, good consequences will (plausibly) occur” and Conclusion: “Therefore, 𝐴 should
be brought about”, with the following critical questions (CQs): CQ1: “How strong is the likelihood
that the cited consequences will (may, must) occur?”; CQ2: “What evidence supports the claim that
the cited consequences will occur and is it sufficient to support the strength of the claim adequetly?”;
“CQ3: Are there opposite consequences (bad as opposed to good) that should be taken into account?”.
   Wagemans proposed the Periodic Table of Arguments (PTA) [7] (see Figure 1 for the first
iteration of the table), which is based on an a priori constrained set of possible combinations
between different characterisations of argument: subject vs predicate arguments, first vs second
order arguments, and argument substance. Subject arguments are arguments whose premise
and conclusion have the same predicate but different subject (e.g. “Abortion should be illegal,
because murder is illegal.”), whereas Predicate arguments are arguments whose premise and
conclusion share the same subject but have different predicates (e.g. “The death penalty should
be abolished, because the death penalty carries the risk of ruining innocent people lives.”). First-
order arguments contain simple statements which cannot be split further while Second-order


                                                   2
Elfia Bezou-Vrakatseli et al. CEUR Workshop Proceedings                                                   1–5


Figure 1: The first Periodic Table of Arguments with different characterisations of arguments: subject
vs predicate, 1st vs 2nd order, and combinations of three types of propositions: F(act), V(alue), P(olicy) [7].


arguments are epistemological in nature and, containing at least one complex statement whose
subject can be broken down, they can embed first order arguments (e.g., “We should wear masks
in public because WHO suggests it.”). Finally, the argument’s statements are categorised as
propositions of policy (if they express that an act should be performed), of value (if they contain
an evaluative judgment based on a definition or assessment criteria, such as ethical judgements),
or of fact (if their veracity can be verified empirically). We note that the two taxonomies are not
incompatible and the PTA can incorporate existing schemes, for example Walton’s argument
from sign corresponds to 1-pre-FF in the PTA, as can be seen in Figure 1.
   Several works have focused on classifying arguments based on the argument scheme they
instantiate using features extracted from text [8, 9]. [10] proposed guidelines for annotating
argument schemes using a taxonomic hierarchy of schemes and the semantic properties of
premises/claims, while [11] advocated for the creation of argument scheme templates represent-
ing clusters of schemes. Visser et al. [12] annotated election debates using Walton’s schemes
and Wagemans’s PTA and provided an overview of the correspondence between the results
obtained with the two taxonomies. In contrast, our aim is to reconcile the two approaches of
classifying schemes, leveraging the strengths of both in order to develop a hybrid classification
that can be applied to ethical debates so as to provide transparent and rational reasoning.


3. From Argument Classification to Reconciling Taxonomies
We make first steps towards reconciling the two taxonomies by classifying NL arguments
using Walton and Wagemans, following the method of Visser et al. [12] to manually annotate
arguments, but we focus on ethical debates (e.g. “Pro-life vs Pro-choice: Should abortion be
legal?”, “Should an Artificial General Intelligence be created?”, “Should all humans be vegans?”).
The most common schemes identified are: argument from consequences, argument from values,
argument from analogy, argument from example, practical reasoning, while the majority of
arguments we identify using Wagemans’ classification are first-order, predicate arguments.
Classifying an argument from the debate Pro-life vs Pro-choice. Consider argument 𝐴:


                                                      3
Elfia Bezou-Vrakatseli et al. CEUR Workshop Proceedings                                          1–5


Access to legal abortion improves the health and safety of pregnant people. In our analysis, we
first determine that argument 𝐴 is an enthymeme and assume the complete argument to be
𝐴′ : Access to legal abortion improves the health and safety of pregnant people, so pregnant people
should have the right to choose abortion.
   To annotate with Walton’s taxonomy, we use the Argument Scheme Key (ASK), a series of
disjunctive choices based on the distinctive features of argument schemes which also groups
scheme types that share particular characteristics [12]. The ASK path used to classify argument
𝐴′ is as follows, with the answer in brackets: Argument relies on a source’s opinion or character
(No); Conclusion is about a course of action (Yes); Argument focuses on the outcome of the action
(Yes); Conclusion promotes a positive outcome (Yes); Course of action assists someone else (No);
Course of action promotes a goal (Yes), resulting in an argument from positive consequences.
   To annotate with Wagemans’ taxonomy, we consider the three characterisations of argument.
In particular, we determine that 𝐴′ is a first-order, predicate argument as the premise and the
conclusion share the same subject (i.e. “access to legal abortion”). Note that 𝐴′ needs to be
rephrased slightly in order to match the definition of a predicate argument. Lastly, we observe
that the conclusion of 𝐴′ is a proposition of policy (“pregnant people should have the right …”),
while the premise is a fact (“legal access improves the health and safety of pregnant people”),
thus the argument is of type 1-pre-PF, equivalent to a pragmatic argument.
   This example offers insights into our approach towards a hybrid taxonomy: firstly, observing
the co-occurrences of argument schemes in both classification systems allows us to detect
correspondences between the two in order to reconcile them. For example, Walton’s argument
from positive consequences is often classified as 1-pre-PF using Wagemans’ PTA, which was
also observed in [12]. Secondly, comparing and contrasting the annotation guidelines of each
taxonomy helps us reflect on them and their usefulness. For instance, deciding if an argument
is first or second order is a criterion in both taxonomies, which indicates that this distinction is
necessary and should be included in the criteria of classification in our hybrid taxonomy.
   We also take inspiration from [13] that developed the hybrid scheme argument from action,
which (partly) incorporates the schemes argument from positive/negative consequences, argument
from positive/negative values, and practical reasoning. Grouping schemes in this manner can be
a helpful tool in decreasing the number of argument schemes considered and a step towards
developing our hybrid classification. Thus, we also consider argument 𝐴′ as an argument from
action, although the scheme is not part of the two taxonomies analysed.
Linking arguments using critical questions. Consider argument 𝐴: A vegan society would
cause the least harm to wildlife, so all humans should go vegan and argument 𝐵: The risk of death of
wildlife increases during the transport of food, especially when the vegan food travels for thousands
of miles. Argument 𝐴 is an argument from action which has 16 critical questions [13], one of
them being CQ: “Assuming the circumstances, does the action have the stated consequences?”. We
observe that argument 𝐵 can serve as an answer to the CQ of argument 𝐴 and can be seen as a
counter-argument to it. In addition, argument 𝐵 is as an argument from danger.
   An important step in reconciling Walton’s argument schemes and Wagemans’ PTA is to
identify the strengths of the two taxomies. The former is more comprehensive, while the latter
is more practically useful and can be seen as an intermediate between the semantic detail of
Walton and the relation between premise-conclusion used in argument mining approaches.


                                                 4
Elfia Bezou-Vrakatseli et al. CEUR Workshop Proceedings                                             1–5


Critical questions represent a key aspect of argument schemes, as they can be pointers to
counter-arguments or supporting arguments, since the argument answering a critical question
attacks/supports the original argument by virtue of the critical question; we aim to include
them in our hybrid classification and apply them to ethical debates. Part of our reconciliation
process is also to decrease the number of argument schemes. Indeed, the next step of our study
will focus on exploring whether other schemes can be grouped together, similarly to argument
from action [13]. The clustering nature of the ASK algorithm along with the criteria of the PTA
can be used to create a reduced, but still broad, number of argument types.


4. Conclusion
This paper describes an initial step towards realising the long-term research goal of supporting
dialogue between humans and between humans and AI systems. We focus on argument
annotation using Walton’s argument schemes and critical questions as well as Wagemans’
Periodic Table of Argument, in order to develop a new, evolved framework, specialised in
ethical reasoning. We believe this is an important first step in identifying schemes and scheme
taxonomies specialised for ethical reasoning and that can support dialogical scaffolding of
both human and artificial agent reasoning. Our approach goes beyond standard annotation
approaches for argument mining and proposes a semantically richer approach to argument
classification through tools of argumentation.

References
 [1] A. Rogers, O. Kovaleva, A. Rumshisky, A primer in BERTology: What we know about how BERT
     works, Transactions of the Association for Computational Linguistics 8 (2020) 842–866.
 [2] S. Modgil, Dialogical scaffolding for human and artificial agent reasoning, in: Proceedings of the
     5th International Workshop on Artificial Intelligence and Cognition, AIC, 2017, pp. 58–71.
 [3] A. Matthias, The responsibility gap: Ascribing responsibility for the actions of learning automata,
     Ethics and information technology 6 (2004) 175–183.
 [4] D. Hadfield-Menell, A. D. Dragan, P. Abbeel, S. Russell, Cooperative inverse reinforcement learning,
     CoRR abs/1606.03137 (2016).
 [5] J. Lawrence, C. Reed, Argument mining: A survey, Computational Linguistics 45 (2019) 765–818.
 [6] D. Walton, C. Reed, F. Macagno, Argumentation schemes, Cambridge University Press, 2008.
 [7] J. H. M. Wagemans, Constructing a periodic table of arguments, 2016.
 [8] V. W. Feng, G. Hirst, Classifying arguments by scheme, in: The 49th Annual Meeting of the
     Association for Computational Linguistics: Human Language Technologies, 2011, pp. 987–996.
 [9] J. Lawrence, C. Reed, Argument mining using argumentation scheme structures, in: Proceedings
     of Computational Models of Argument (COMMA), 2016, pp. 379–390.
[10] E. Musi, D. Ghosh, S. Muresan, Towards feasible guidelines for the annotation of argument schemes,
     in: Proceedings of the Third Workshop on Argument Mining, ArgMining@ACL, 2016, pp. 82–93.
[11] D. Liga, M. Palmirani, Argumentation schemes as templates? Combining bottom-up and top-down
     knowledge representation, in: Proceedings of the 20th CMNA workshop, 2020, pp. 51–56.
[12] J. Visser, J. Lawrence, C. Reed, J. Wagemans, D. Walton, Annotating argument schemes, Argumen-
     tation 35 (2021) 101–139.
[13] K. Atkinson, T. J. M. Bench-Capon, Action-based alternating transition systems for arguments
     about action, in: Proceedings of the 32nd AAAI, 2007, pp. 24–29.


                                                   5