=Paper=
{{Paper
|id=Vol-2346/paper1
|storemode=property
|title=On the End-to-End Argument Validation System based on Communicative Discourse Trees
|pdfUrl=https://ceur-ws.org/Vol-2346/paper1.pdf
|volume=Vol-2346
|authors=Boris Galitsky,Dmitry Ilvovsky
|dblpUrl=https://dblp.org/rec/conf/persuasive/GalitskyI19
}}
==On the End-to-End Argument Validation System based on Communicative Discourse Trees==
On the End-to-End Argument Validation System based on Communicative Discourse Trees Boris Galitsky1 and Dmitry Ilvovsky2 1 Oracle Corp. Redwood Shores, CA, USA boris.galitsky@oracle.com 2 Higher School of Economics Moscow Russia dilvovsky@hse.ru Abstract. We formulate a problem of an assessment of argumentation validity based on rhetorical analysis of text. Argumentation structure can be detected in text in the form of discourse trees extended with edge labels for communicative actions. Extracted argumentation structure is represented as a defeasible logic program and is subject to dialectical analysis to establish the validity of the arguments for the main claim being communicated. We evaluate the accuracy of argument mining and then argument validation as well as an overall performance of an end-to-end argumentation system. 1 Introduction In this study we focus on validating claims of human agent expressed in text. In non-trivial cases, claim validation relies on an analysis of arguments. When domain knowledge is available and formalized, truthfulness of a claim can be validated directly. However, in most text analysis environments such knowledge is unavailable and other implicit means need to come into play, such as writing style and writing logic, in particular, used argumentation patterns. In this study we employ the discourse analysis in our end-to-end argument validation system for texts and explore which discourse features can be leveraged for argumentation validity analysis. When an author attempts to provide an argument for something, a number of argumentation patterns can be employed. The basic points of argumentation are reflected in the rhetorical structure of text where an argument is present (Moens et al., 2007). We select the Rhetoric Structure Theory (RST, in Mann and Thompson 1988) as a means to represent discourse features associated with logical argumentation. Nowadays, the performance of both rhetoric parsers and argumentation reasoners has dramatically improved (Feng and Hirst 2014). Taking into account the discourse structure of conflicting dialogs, one can judge on the authenticity and validity of these dialogs in terms of its argumentation. In this work we will evaluate the combined argument validity assessment system that includes both the discourse structure extraction and reasoning about it with the purpose of the validation of an agent’s claim. Either approach to argument detection from text or to reasoning about 2 formalized arguments has been undertaken (Galitsky and Pampapathi 2003, Symeonidis et al., 2007), but not the whole argument assessment system. Most of the modern techniques treat computational argumentation as specific discourse structures and perform detection of arguments of various sorts in text, such as classifying a text paragraph as argumentative or non-argumentative (Moens et al., 2007). A number of systems recognize components and structures of logical arguments (Sardianos et al., 2015). However, these systems do not rely on discourse trees (DTs); they only extract arguments and do not apply logical means to evaluate it. At the same time, a broad corpus of research deals with logical arguments irrespectively of how they may occur in natural language (Bondarenko et al., 1997). A number of studies addressed argument quality in logic and argumentation theory (van Eemeren et al., 1996; Damer, 2009), however the number of systems that assess the validity of arguments in text is very limited (Cabrio and Villata, 2012). Most argument mining systems are either classifiers which recognize certain forms of logical arguments in text, or reasoners over the logical representation of arguments (Amgoud et al., 2015). To address this shortcoming, in this project, we build an end-to-end argumentation system, augmenting an argument extraction from text with its logical analysis. To represent the linguistic features of text, we use the following sources: 1) Rhetoric relations between the parts of the sentences, obtained as a discourse tree (DT). Discourse trees encode rhetorical relations such as Cause, Contrast, Condition, Attribution which are correlated with argumentation attack relation. 2) Speech acts and communicative actions, obtained as verbs from the VerbNet resource. To assess the logical validity of an extracted argument, we apply the Defeasible Logic Program (DeLP; in Garcia and Simari 2004), part of which is built on the fly from facts and clauses extracted from these sources. We integrate argumentation detection and validation components into a decision support system that can be deployed, for example, in the customer relationship management (CRM) domain. To evaluate our approach to extraction and reasoning about argumentation, we chose the dispute resolution / customer complaint validation task because an argumenation analysis plays an essential role in it. 2 Rhetorical Representation of Argumentation We start with a political domain and give an example of conflicting agents providing their interpretation of certain events. These agents provide argumentation for their claims; we will observe how formed rhetoric structures correlate with their argumentation patterns. We focus on the Malaysia Airlines Flight 17 example with the agents exchanging arguments: Dutch investigators, The Investigative Committee of the Russian Federation, and the self-proclaimed Donetsk People's Republic. It is a controversial conflict where each agent attempts to blame its opponent. To sound more convincing, each agent postulates its claim in a way to attack the claims of its opponents, matching their argumentation styles and trying to defeat their claims. 3 “Dutch accident investigators say that strong evidence points to pro-Russian rebels as being fully responsible for shooting down plane. The report indicates where the missile was fired from and identifies who was in control of the territory and pins the downing of MH17 on the pro-Russian rebels.” (Fig. 1a). “The Investigative Committee of the Russian Federation believes that the plane was hit by a missile, which could not be produced in Russia. The committee cites an investigation that established the type of the missile and disagrees with Dutch accident investigators.”(Fig. 1b) “Rebels, the self-proclaimed Donetsk People's Republic, deny that they controlled the territory from which the missile was allegedly fired. They confirm that it became possible only after three months after the tragedy to say if rebels controlled one or another town and the claim of Dutch accident investigators is flawed”(Fig. 1c). To show the structure of arguments one needs to merge discourse relations with information from speech acts. We need to know the discourse structure of interactions between agents, and what kinds of interactions they are. For argument identification, we do not need to know the domain of interaction (here, aviation), the subjects of these interaction, what are the entities, but we need to take into account mental, domain-independent relations between them. We accomplish this by introducing the concept of Communicative Discourse Tree (CDT). CDT is a DT with labels for edges that are the VerbNet expressions for verbs (which are communicative actions, (CA, Galitsky and Kuznetsov 2008)). Arguments of verbs are substituted from text according to VerbNet frames (Kipper et al., 2008). The first and possibly second argument is instantiated by agents. The consecutive arguments are instantiated by noun or verb phrases which are the subjects of CA. For example, the nucleus node for elaboration relation (on the left of Fig. 1a) is labeled with say(Dutch, evidence), and the satellite is labeled with responsible(rebels, shooting_down). These labels are not intended to express that the subjects of Elementary Discourse Units (EDUs) are evidence and shooting_down but instead are intended for matching this CDT with others for the purpose of finding similarity between them. 4 Fig. 1a The claim of the first agent, Dutch accident investigators Notice that in the CDTs for three paragraphs expressing the views of conflicting parties (Figs 1a, 2b and 2c), communicative actions with their subjects contain the main claims of the respective party, and the DTs without these labels contain information on how these claims are logically packaged. To summarize, a typical CDT for a text with argumentation includes rhetoric relations other than Elaboration and Join, and a substantial number of communicative actions. However, these rules are complex enough so that the structure of CDT matters and tree-specific learning is required (Galitsky et al., 2015). Fig. 1b The claim of the second agent, the Committee Fig. 1c The claim of the third agent, the rebels 3 Detecting Argumentation in Communicative Discourse Trees Argumentation analysis needs a systematic approach to learn associated discourse structures. The features of CDTs could be represented in a numerical space so that argumentation detection can be conducted; however, structural information on DTs would not be leveraged. Also, features of argumentation can potentially be measured in terms of maximal common sub-DTs, but such nearest neighbor learning is 5 computationally intensive and too sensitive to errors in DT construction. Therefore, a CDT-kernel learning approach is selected which applies a support vector machine (SVM) learning to the feature space of all sub-CDTs of the CDT for a given text where an argument is being detected. Tree Kernel (TK) learning for strings, parse trees and parse thickets is a well- established research area nowadays. The CD-TK counts the number of common sub- trees as the discourse similarity measure between two DTs. In this study, we extend the TK definition for the CDT, augmenting DT kernel by the information on CAs. TK-based approaches are not very sensitive to errors in parsing (syntactic and rhetoric) because erroneous sub-trees are mostly random and will unlikely be common among different elements of a training set. A CDT can be represented by a vector V of integer counts of each sub-tree type (without taking into account its ancestors): V(𝑇) = (# 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓 𝑡𝑦𝑝𝑒 1, … , # 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓 𝑡𝑦𝑝𝑒 𝐼, … , # 𝑜𝑓 𝑠𝑢𝑏𝑡𝑟𝑒𝑒𝑠 𝑜𝑓 𝑡𝑦𝑝𝑒 𝑛). Given two tree segments CDT1 and CDT2 , the tree kernel function is defined: 𝐾 (CDT1, CDT2) == Σi V(CDT1)[i], V(CDT1)[i] = Σn1Σn2 Σi Ii(n1)* Ii(n2), where 𝑛1∈𝑁1 , n2∈𝑁2 and 𝑁1 and N2 are the sets of all nodes in CDT1 and CDT2 , respectively; 𝐼i(𝑛) is the indicator function: 𝐼i(𝑛) = {1 iff a subtree of type 𝑖 occurs with a root at a node; 0 otherwise}. Further details for using TK for paragraph-level and discourse analysis are available in (Galitsky 2017). Only the arcs of the same type of rhetoric relations (presentation relation, such as antithesis, subject matter relation, such as condition, and multinuclear relation, such as List) can be matched when computing common sub-trees. We use N for a nucleus or situations presented by this nucleus, and S for a satellite or situations presented by this satellite. Situations are propositions, completed actions or actions in progress, and communicative actions and states (including beliefs, desires, approve, explain, reconcile and others). Hence we have the following expression for RST- based generalization ‘^’ for two texts text1 and text2 : text1 ^ text2 = ∪i,j (rstRelation1i, (…,…) ^ rstRelation2j (…,…)), where I ∈ (RST relations in text1), j ∈ (RST relations in text2). Further, for a pair of RST relations their generalization looks as follows: rstRelation1(N1, S1) ^ rstRelation2 (N2, S2) = (rstRelation1^ rstRelation2 )( N1^N2, S1^S2). We define CA as a function of the form verb (agent, subject, cause), where verb characterizes some type of interaction between involved agents (e.g., explain, confirm, remind, disagree, deny, etc.), subject refers to the information transmitted or object described, and cause refers to the motivation or explanation for the subject. To handle meaning of words expressing the subjects of CAs, we apply word2vec models (Mikolov et al., 2015). We combined Stanford NLP parsing, coreferences, entity extraction, DT construction (discourse parser, Surdeanu et al., 2016 and Joty et al., 2013), VerbNet and Tree Kernel builder into one system available at https://github.com/bgalitsky/relevance-based-on-parse-trees. 6 4 Claim Validation via Dialectical Analysis To convince an addressee, a message needs to include an argument and its structure needs to be valid. Once an argumentation structure extracted from text is represented via CDT, we need to verify that the main point (target claim) communicated by the author is not logically attacked by her other claims. To assess the validity of the argumentation, a Defeasible Logic Programming (DeLP) approach is selected. It is an argumentative framework based on logic programming (García and Simari, 2004; Alsinet et al., 2008). A DeLP is a set of facts, strict rules Π of the form (A:-B), and a set of defeasible rules Δ of the form A-