=Paper= {{Paper |id=Vol-2346/paper1 |storemode=property |title=On the End-to-End Argument Validation System based on Communicative Discourse Trees |pdfUrl=https://ceur-ws.org/Vol-2346/paper1.pdf |volume=Vol-2346 |authors=Boris Galitsky,Dmitry Ilvovsky |dblpUrl=https://dblp.org/rec/conf/persuasive/GalitskyI19 }} ==On the End-to-End Argument Validation System based on Communicative Discourse Trees== https://ceur-ws.org/Vol-2346/paper1.pdf

On the End-to-End Argument Validation System based
on Communicative Discourse Trees

Boris Galitsky1 and Dmitry Ilvovsky2
1
Oracle Corp. Redwood Shores, CA, USA
boris.galitsky@oracle.com
2
Higher School of Economics Moscow Russia
dilvovsky@hse.ru

Abstract. We formulate a problem of an assessment of argumentation validity
based on rhetorical analysis of text. Argumentation structure can be detected in
text in the form of discourse trees extended with edge labels for communicative
actions. Extracted argumentation structure is represented as a defeasible logic
program and is subject to dialectical analysis to establish the validity of the
arguments for the main claim being communicated. We evaluate the accuracy
of argument mining and then argument validation as well as an overall
performance of an end-to-end argumentation system.

1 Introduction

In this study we focus on validating claims of human agent expressed in text. In
non-trivial cases, claim validation relies on an analysis of arguments. When domain
knowledge is available and formalized, truthfulness of a claim can be validated
directly. However, in most text analysis environments such knowledge is unavailable
and other implicit means need to come into play, such as writing style and writing
logic, in particular, used argumentation patterns. In this study we employ the
discourse analysis in our end-to-end argument validation system for texts and explore
which discourse features can be leveraged for argumentation validity analysis.
When an author attempts to provide an argument for something, a number of
argumentation patterns can be employed. The basic points of argumentation are
reflected in the rhetorical structure of text where an argument is present (Moens et al.,
2007). We select the Rhetoric Structure Theory (RST, in Mann and Thompson 1988)
as a means to represent discourse features associated with logical argumentation.
Nowadays, the performance of both rhetoric parsers and argumentation reasoners has
dramatically improved (Feng and Hirst 2014). Taking into account the discourse
structure of conflicting dialogs, one can judge on the authenticity and validity of these
dialogs in terms of its argumentation. In this work we will evaluate the combined
argument validity assessment system that includes both the discourse structure
extraction and reasoning about it with the purpose of the validation of an agent’s
claim. Either approach to argument detection from text or to reasoning about
2

formalized arguments has been undertaken (Galitsky and Pampapathi 2003,
Symeonidis et al., 2007), but not the whole argument assessment system.
Most of the modern techniques treat computational argumentation as specific
discourse structures and perform detection of arguments of various sorts in text, such as
classifying a text paragraph as argumentative or non-argumentative (Moens et al.,
2007). A number of systems recognize components and structures of logical arguments
(Sardianos et al., 2015). However, these systems do not rely on discourse trees (DTs);
they only extract arguments and do not apply logical means to evaluate it. At the same
time, a broad corpus of research deals with logical arguments irrespectively of how they
may occur in natural language (Bondarenko et al., 1997). A number of studies addressed
argument quality in logic and argumentation theory (van Eemeren et al., 1996; Damer,
2009), however the number of systems that assess the validity of arguments in text is
very limited (Cabrio and Villata, 2012). Most argument mining systems are either
classifiers which recognize certain forms of logical arguments in text, or reasoners over
the logical representation of arguments (Amgoud et al., 2015).
To address this shortcoming, in this project, we build an end-to-end argumentation
system, augmenting an argument extraction from text with its logical analysis. To
represent the linguistic features of text, we use the following sources:
1) Rhetoric relations between the parts of the sentences, obtained as a discourse tree
(DT). Discourse trees encode rhetorical relations such as Cause, Contrast, Condition,
Attribution which are correlated with argumentation attack relation.
2) Speech acts and communicative actions, obtained as verbs from the VerbNet
resource.
To assess the logical validity of an extracted argument, we apply the Defeasible Logic
Program (DeLP; in Garcia and Simari 2004), part of which is built on the fly from facts
and clauses extracted from these sources. We integrate argumentation detection and
validation components into a decision support system that can be deployed, for
example, in the customer relationship management (CRM) domain. To evaluate our
approach to extraction and reasoning about argumentation, we chose the dispute
resolution / customer complaint validation task because an argumenation analysis plays
an essential role in it.

2 Rhetorical Representation of Argumentation

We start with a political domain and give an example of conflicting agents
providing their interpretation of certain events. These agents provide argumentation
for their claims; we will observe how formed rhetoric structures correlate with their
argumentation patterns. We focus on the Malaysia Airlines Flight 17 example with
the agents exchanging arguments: Dutch investigators, The Investigative Committee
of the Russian Federation, and the self-proclaimed Donetsk People's Republic. It is a
controversial conflict where each agent attempts to blame its opponent. To sound
more convincing, each agent postulates its claim in a way to attack the claims of its
opponents, matching their argumentation styles and trying to defeat their claims.
3

“Dutch accident investigators say that strong evidence points to pro-Russian rebels
as being fully responsible for shooting down plane. The report indicates where the
missile was fired from and identifies who was in control of the territory and pins the
downing of MH17 on the pro-Russian rebels.” (Fig. 1a).
“The Investigative Committee of the Russian Federation believes that the plane
was hit by a missile, which could not be produced in Russia. The committee cites an
investigation that established the type of the missile and disagrees with Dutch
accident investigators.”(Fig. 1b)
“Rebels, the self-proclaimed Donetsk People's Republic, deny that they controlled
the territory from which the missile was allegedly fired. They confirm that it became
possible only after three months after the tragedy to say if rebels controlled one or
another town and the claim of Dutch accident investigators is flawed”(Fig. 1c).
To show the structure of arguments one needs to merge discourse relations with
information from speech acts. We need to know the discourse structure of interactions
between agents, and what kinds of interactions they are. For argument identification,
we do not need to know the domain of interaction (here, aviation), the subjects of
these interaction, what are the entities, but we need to take into account mental,
domain-independent relations between them. We accomplish this by introducing the
concept of Communicative Discourse Tree (CDT).
CDT is a DT with labels for edges that are the VerbNet expressions for verbs
(which are communicative actions, (CA, Galitsky and Kuznetsov 2008)). Arguments
of verbs are substituted from text according to VerbNet frames (Kipper et al., 2008).
The first and possibly second argument is instantiated by agents. The consecutive
arguments are instantiated by noun or verb phrases which are the subjects of CA. For
example, the nucleus node for elaboration relation (on the left of Fig. 1a) is labeled
with say(Dutch, evidence), and the satellite is labeled with responsible(rebels,
shooting_down). These labels are not intended to express that the subjects of
Elementary Discourse Units (EDUs) are evidence and shooting_down but instead are
intended for matching this CDT with others for the purpose of finding similarity
between them.
4

Fig. 1a The claim of the first agent, Dutch accident investigators

Notice that in the CDTs for three paragraphs expressing the views of conflicting
parties (Figs 1a, 2b and 2c), communicative actions with their subjects contain the
main claims of the respective party, and the DTs without these labels contain
information on how these claims are logically packaged. To summarize, a typical
CDT for a text with argumentation includes rhetoric relations other than Elaboration
and Join, and a substantial number of communicative actions. However, these rules
are complex enough so that the structure of CDT matters and tree-specific learning is
required (Galitsky et al., 2015).

Fig. 1b The claim of the second agent, the Committee

Fig. 1c The claim of the third agent, the rebels

3 Detecting Argumentation in Communicative Discourse Trees

Argumentation analysis needs a systematic approach to learn associated discourse
structures. The features of CDTs could be represented in a numerical space so that
argumentation detection can be conducted; however, structural information on DTs
would not be leveraged. Also, features of argumentation can potentially be measured
in terms of maximal common sub-DTs, but such nearest neighbor learning is
5

computationally intensive and too sensitive to errors in DT construction. Therefore, a
CDT-kernel learning approach is selected which applies a support vector machine
(SVM) learning to the feature space of all sub-CDTs of the CDT for a given text
where an argument is being detected.
Tree Kernel (TK) learning for strings, parse trees and parse thickets is a well-
established research area nowadays. The CD-TK counts the number of common sub-
trees as the discourse similarity measure between two DTs. In this study, we extend
the TK definition for the CDT, augmenting DT kernel by the information on CAs.
TK-based approaches are not very sensitive to errors in parsing (syntactic and
rhetoric) because erroneous sub-trees are mostly random and will unlikely be
common among different elements of a training set.
A CDT can be represented by a vector V of integer counts of each sub-tree type
(without taking into account its ancestors):
V(𝑇) = (# 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓 𝑡𝑦𝑝𝑒 1, … , # 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓 𝑡𝑦𝑝𝑒 𝐼, … , # 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓
𝑡𝑦𝑝𝑒 𝑛). Given two tree segments CDT1 and CDT2 , the tree kernel function is defined:
𝐾 (CDT1, CDT2) = = Σi V(CDT1)[i], V(CDT1)[i] =
Σn1Σn2 Σi Ii(n1)* Ii(n2), where 𝑛1∈𝑁1 , n2∈𝑁2 and 𝑁1 and N2 are the sets of all
nodes in CDT1 and CDT2 , respectively; 𝐼i(𝑛) is the indicator function:
𝐼i(𝑛) = {1 iff a subtree of type 𝑖 occurs with a root at a node; 0 otherwise}. Further
details for using TK for paragraph-level and discourse analysis are available in
(Galitsky 2017).
Only the arcs of the same type of rhetoric relations (presentation relation, such
as antithesis, subject matter relation, such as condition, and multinuclear relation,
such as List) can be matched when computing common sub-trees. We use N for a
nucleus or situations presented by this nucleus, and S for a satellite or situations
presented by this satellite. Situations are propositions, completed actions or actions in
progress, and communicative actions and states (including beliefs, desires, approve,
explain, reconcile and others). Hence we have the following expression for RST-
based generalization ‘^’ for two texts text1 and text2 :
text1 ^ text2 = ∪i,j (rstRelation1i, (…,…) ^ rstRelation2j (…,…)), where I ∈ (RST
relations in text1), j ∈ (RST relations in text2). Further, for a pair of RST relations
their generalization looks as follows: rstRelation1(N1, S1) ^ rstRelation2 (N2, S2) =
(rstRelation1^ rstRelation2 )( N1^N2, S1^S2).
We define CA as a function of the form verb (agent, subject, cause), where verb
characterizes some type of interaction between involved agents (e.g., explain,
confirm, remind, disagree, deny, etc.), subject refers to the information transmitted or
object described, and cause refers to the motivation or explanation for the subject. To
handle meaning of words expressing the subjects of CAs, we apply word2vec models
(Mikolov et al., 2015).
We combined Stanford NLP parsing, coreferences, entity extraction, DT
construction (discourse parser, Surdeanu et al., 2016 and Joty et al., 2013), VerbNet
and Tree Kernel builder into one system available at
https://github.com/bgalitsky/relevance-based-on-parse-trees.
6

4 Claim Validation via Dialectical Analysis

To convince an addressee, a message needs to include an argument and its structure
needs to be valid. Once an argumentation structure extracted from text is represented
via CDT, we need to verify that the main point (target claim) communicated by the
author is not logically attacked by her other claims. To assess the validity of the
argumentation, a Defeasible Logic Programming (DeLP) approach is selected. It is an
argumentative framework based on logic programming (García and Simari, 2004;
Alsinet et al., 2008).
A DeLP is a set of facts, strict rules Π of the form (A:-B), and a set of defeasible
rules Δ of the form A-