=Paper=
{{Paper
|id=Vol-2672/paper_1
|storemode=property
|title=How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs
|pdfUrl=https://ceur-ws.org/Vol-2672/paper_1.pdf
|volume=Vol-2672
|authors=Markus Brenneis,Maike Behrendt,Stefan Harmeling,Martin Mauve
|dblpUrl=https://dblp.org/rec/conf/comma/BrenneisBHM20
}}
==How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs==
2 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs Markus BRENNEIS a,1 , Maike BEHRENDT a Stefan HARMELING a and Martin MAUVE a a Department of Computer Science, University of Düsseldorf, Germany Abstract. When exchanging arguments with other people, it is interesting to know who of the others has the most similar opinion to oneself. In this paper, we sug- gest using weighted argumentation graphs that can model the relative importance of arguments and certainty of statements. We present a pseudometric to calculate the distance between two weighted argumentation graphs, which is useful for ap- plications like recommender systems, consensus building, and finding representa- tives. We propose a list of desiderata which should be fulfilled by a metric for those applications and prove that our pseudometric fulfills these desiderata. Keywords. argumentation graphs, online argumentation, metric 1. Introduction In real-world discussions, people exchange arguments on a dedicated issue, such as im- proving the course of study [11], the distribution of funds [7], or which party to vote for at the next general election. In all those cases, participants discuss positions like “there should be a universal basic income” or “special math courses should be introduced”, state their pro and contra arguments and attack other people’s arguments. Each individual participant of an argumentation has a personal view on the argu- ments and their relative importance: Users can decide for themselves which arguments they consider more convincing, thus which arguments they agree to and how much they agree with a statement. They may consider some positions more important than others. Based on those individual views, there are useful applications for measuring the similarity or distance between two users: Clustering can be used to find representatives for a group of people with similar argumentation behavior or for finding a consensus. Another application is opinion polling, where one wants to find out why two persons or organizations come to different conclusions. What is more, collaborative filtering, which needs some definition of distance between users, can be used for pre-filtering arguments in applications like Kialo2 . Copyright c 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). 1 Corresponding Author: Markus Brenneis, Heinrich-Heine-Universität, Universitätsstraße 1, 40225 Düsseldorf, Germany; E-mail: markus.brenneis@uni-duesseldorf.de. 2 https://kialo.com/ Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 3 In this paper, we propose solutions to the two main challenges to achieving the goal of comparing the argumentations of two users: We define weighted argumentation graphs which are a suitable representation of argumentation covering the mentioned aspects, including importance of arguments and agreement with statements. Secondly, we suggest a pseudometric for calculating the distance between two weighted argumentation graphs, which considers the specific structure of argumentation graphs (e.g., opinions deeper in a graph are less important). We contribute a list of useful desiderata for a metric which compares argumentations, and prove that our pseudometric fulfills those properties. In the following chapter, we present our definition of weighted argumentation graphs. The third chapter introduces our pseudometric and desiderta for a useful metric. Finally, we discuss some limitations of our pseudometric and take a look at related work. 2. Definition of Weighted Argumentation Graphs To be able to determine the similarity of real-world argumentations, there has to be a suitable representation of them. This representation should be able to capture all aspects mentioned in the introduction, and it should be as simple as possible. Therefore, the following definition is based on the IBIS model [12], which has been successfully tested with users without background in argumentation theory using our D-BAS system [11]. For the application purposes described in the introduction, the model should be able to represent the known opinions and arguments of a person as close as possible. Thus, we use statements, not arguments, as atomic elements in our definition, which then can be composed to arguments which can support or attack another statement. Note, though, that this definition can be translated to classical abstract argumentation frameworks based on Dung’s definition [5], e.g. using DABASCO [14]. Let S be the (finite) set of all statements, with the special statement I ∈ S. The set A ⊆ S \ {I} × S is a set of arguments. For an argument a = (s1 , s2 ) ∈ A, s1 is called premise, s2 conclusion of a. Let s ∈ S, then a→s := {(t, u) ∈ A | u = s} is the set of arguments with conclusion s. Note that I is excluded to be a premise since it is the issue in the IBIS model. We refer to I as “personal well-being”, which allows us to interpret an edge like (b, I) in Figure 1 as “My personal well-being improves, because more wind power plants will be built.” The premises of arguments with conclusion I are called positions. Positions are actionable items like “A wall between Austria and Germany should be built,” and play an important role in real-world argumentation, e.g. decision-making problems [7]. Definition 1 (Argumentation Graph). An argumentation graph is a directed, weakly con- nected graph G = (S, A), A ⊆ S \ {I} × S, where the statements are nodes and the argu- ments edges, and there is exactly one I ∈ S which has no outgoing edges. Note that this model does not include different relations for attack and support, as known from bipolar argumentation frameworks. Whether an argument is supportive are not, is up to a person’s interpretation of the natural language representation of the argu- ments. The purpose of the model is solely to capture the hierarchy of statement, which we later need for our metric; bipolarity would add unnecessary complexity in this paper. Every person can have a personal view with personal attitudes on a common argu- mentation graph G, as depicted in G0 in Figure 1. To get an intuition for our next defini- 4 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs G I G0 on G 0 I 1 2 3 3 a A wall between Austria and Germany should be built. a b −0.2 a 0.4 b a1 The wall stops illegal immigration of cows. 0.2 0.8 a11 Cows can come via Switzerland. a2 The wall is expensive. a1 a2 b1 0.3 a1 0.5 a2 b More wind power plants should be built. b1 Wind power is a renewable energy source. a11 Figure 1. Example for an argumentation graph G and a weighted argumentation graph G0 on G with positions a and b and concrete examples for each statement. Edges with weight 0 and nodes with rating 0 are not drawn. Statement ratings are next to nodes, values for relative argument importance next to edges. tion, let us have a look at what we can deduce about Alice’s attitudes from her graph G0 : Alice strongly accepts position b (rating .4) and is slightly against a (rating −.2). She accepts the statement a2 more than statement a1 (rating .5 > rating .3). The counterargu- ment (a2 , a) “No wall should be built, because a wall is expensive.” is far more important for her than the argument (a1 , a) (relative importance .8 > .2). Furthermore, it makes sense to sort the positions: She considers building more wind power plants (b) more important than building a wall (a, relative importance 23 > 31 ). So when comparing her attitudes with someone else’s attitudes, she would consider a contrary opinion on b more severe than a different opinion on a. For ordinary statements, which are not positions, having an importance does not make sense: One cannot say that “The wall stops illegal immigration of cows” is twice as important as “Cows can come via Switzerland” (important regarding what?); one can only say that the arguments regarding building a wall which are built by those statements are of differing importance. We will use real numbers to represent those weights and ratings. Definition 2 (Weighted Argumentation Graph). Let G = (S, A) be an argumentation graph. A weighted argumentation graph G0 on G is a quadruple (S, A, r, w) with func- tions r and w. r : S → [−0.5, 0.5] assigns an agreement score (rating) to every statement, where negative values mean disagreement, 0 no opinion/don’t care, and positive values agreement. w : A → [0, 1] assigns an importance weight to each argument. The value in- dicates the importance of that argument relative to other arguments with the same con- clusion. The value 0 means that the argument is not used (i.e. has no relevance), and 1 means that the argument is the only relevant argument for the conclusion. The following conditions must hold: ∀s ∈ S ∑ w(a) ∈ {0, 1} (1) a∈A→s r(I) = 0 (2) Formula 1 means that the sum of weights of arguments with the same conclusion is 1 if there is an argument with positive weight (cf. (a1 , a) and (a2 , a) in Figure 1); the sum is 0 iff no argument for a common conclusion has a weight (cf. w(b1 , b) = 0 in Figure 1). This assures that w represents relative, not absolute importance. To simplify notation, we write w(·, ·) instead of w((·, ·)). If the underlying argumentation graph G happens to be a directed tree, we call G0 a weighted argumentation tree. w and r can be represented as matrix or vector, respectively, where undefined values are set to the default value 0. For the example in Figure 1, one gets: Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 5 0 0 00000 0 0.33 0 0 0 0 0 0 −0.2 0.67 0 0 0 0 0 0 0.4 0 0.2 0 0 0 0 0 , w= r= 0.3 (3) 0 0.8 0 0 0 0 0 0.5 0 0 0 0 0 0 0 0 0 0 00000 0 The entry in row i column j of w is the weight of the argument with premise i and conclusion j. The first column and first row must refer to I as premise or conclusion, respectively. Because of Formula 1, the column sum is always 0 or 1. If we draw or talk about a weighted argumentation graph, “non-existing” edges a are edges with w(a) = 0 (argument with no importance), and “non-existing” nodes s are nodes with r(s) = 0 (neutral statement). An example is G0 shown in Figure 1. The importance of a position p is represented as weight of the “argument” (p, I) leading to the “personal well-being” I. An application could obtain weights and ratings from a user, for example, by asking them to mark statements which are considered more important, or sorting arguments by relevance, which we do in our deliberate system [4]. 3. Proposal of a Pseudometric for Weighted Argumentation Graphs We now propose a pseudometric for calculating a distance between two weighted argu- mentation graphs, and prove several properties we expect of a function which compares two argumentations. The goal of the metric is to indicated how close the opinions and used arguments of two persons are, considering graph structure and individual assess- ments of importance; we do not want to compare argumentations on abstract levels like consistency, number of arguments used, or if other person’s arguments are countered. 3.1. The Pseudometric We define a distance measure of two weighted argumentation graphs G1 = (S, A, r1 , w1 ), G2 = (S, A, r2 , w2 ) on G = (S, A) as: ∞ dG (G1 , G2 ) = (1 − α) ∑ α i kwi1 [:,1] r1 − wi2 [:,1] r2 k1 (4) i=1 where α ∈ (0, 1) determines the influence of opinions deeper in the graph: A lower α emphasizes opinions on statements r(s), a higher value the similarity of the argumen- tation underneath a statement s. w[:,1] denotes the first column of the weight matrix w, α i is the i-th power of the scalar α, wi the i-th power of the square matrix w, and the Hadamard (entrywise) product. The i-th summand calculates the contribution of the paths with length i ending at I. We drop the index G if the underlying argumentation graph is clear from the context. The intuition behind this distance measure becomes clearer when rephrasing it for the special case of argumentation trees (which have no cycles or re-used statements, thus unique paths from each statement to I). In case of argumentation trees T1 and T2 on T , the distance dG is equivalent to: 6 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs dT (T1 , T2 ) = (1 − α) ∑ α depth(s) r1 (s) ∏ w1 (a) − r2 (s) ∏ w2 (a) (5) s∈S a∈ρI→s a∈ρI→s ρs1 →s2 is the sequence of all arguments (edges) on the path from s1 to s2 ∈ S (where s2 is deeper in the tree). depth(s) = |ρI→s | is the length of the path from I to s, i.e. the number of arguments; depth(I) = 0. The terms in the absolute value measure the similarity of the opinions of a statement s as difference of their ratings, scaled with the product of the “importances” of the ar- guments leading to s. Hence, statements which are deeper in the argumentation tree get a smaller weight, and the overall relevance of an argumentation branch is limited to its importance. To see how the calculation works and that the results match intuition, let us calculate the distance between the graphs G0 (Figure 1), T2 , and T3 (Figure 2) for α = 0.5. We can expect that T2 is closer to T3 than to G0 , because the opinions on the statements a and b match and only the weights are different. The results confirm the expectation: 1 2 d(T2 , G0 ) =(1 − α) α 1 0.5 · 0.6 − (−0.2) · + α 1 0.5 · 0.4 − 0.4 · 3 3 (6) 1 1 + α 0 · 0 · 0.6 − 0.3 · 0.2 · + α 2 0 · 0 · 0.6 − 0.5 · 0.8 · 2 = 0.194 3 3 d(T2 , T3 ) =(1 − α)(α 1 |0.5 · 0.6 − 0.5 · 0.3| + α 1 |0.5 · 0.4 − 0.5 · 0.7| = 0.075 (7) Note that the value of dG and dT is in [0, 1). If d is the depth of T , the maximum value of dT (T1 , T2 ) is α(1 − α d ).3 The maximum value of dG is lim (1 − α) ∑ni=1 α i = α. n→∞ Theorem 1. Let G be an argumentation graph. dG is a pseudometric, i.e. has the follow- ing properties for all weighted argumentation graphs G1 , G2 , G3 on G: (i) dG (G1 , G1 ) = 0 (ii) dG (G1 , G2 ) = dG (G2 , G1 ) (symmetry) (iii) dG (G1 , G3 ) ≤ dG (G1 , G2 ) + dG (G2 , G3 ) (triangle inequality) Proof. dG converges: ∑i α i is geometric series, which converges for α ∈ (0, 1). The value of the L1 norm cannot be greater than 1 because for each column sum σ of the wi , we always have 0 ≤ σ ≤ 1. (i) holds because the same values are subtracted in the L1 norm. (ii) is given since the L1 norm is symmetric. As each summand fulfills the triangle inequality, dG also fulfills (iii). However, dG is not a metric, because dG (G1 , G2 ) can be 0 even if G1 is not equal to G2 : Consider G1 where all statements are agreed to and every argument weight is 0, and G2 where all statements have a rating of 0. Because weights and ratings are multiplied, the distance is 0 though the weighted argumentation graphs are different. 3 Remember that r (root(T )) = r (root(T )) because root(T ) = I, and r(I) := 0. 1 2 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 7 3.2. Desiderata for a Metric for Weighted Argumentation Trees Our pseudometric is only one of many possible metrics for the applications described. We present a list of intuitive desiderata which, as we think, should be fulfilled by any metric comparing two person’s argumentations, and thus should be considered when constructing alternative metric proposals. It is, however, hard to capture intuitive prop- erties in graphs which can contain circular references and re-used statements. Therefore, we focus on argumentation trees in this section. Field experiments like [11] have also shown that users seldom create cycles or re-use statements in different branches in real discussions. After each desideratum, we prove that our pseudometric (Formula 5) fulfills it for weighted argumentation trees. We consider those properties important in many real- world application domains of a metric, albeit not everywhere, as pointed out in Section 5. For each desideratum, we indicate why we think it is intuitive. Most desiderata are followed by a visual example making the choice of variable names clearer. Note that each tree in the examples is considered to be part of a bigger weighted argumentation tree, i.e. not all existing nodes and edges are drawn, and irrelevant statement ratings and argument weights are left out. Desideratum 1 (Proportionally bigger overlap is better). Consider trees T1 , T2 , T3 , where T2 is like T1 , but uses one additional argument for a statement s, and T3 is like T2 , but uses one additional argument for s. Although T2 and T1 differ in only one argument, and T3 and T2 differ in only one argument, we expect d(T1 , T2 ) > d(T2 , T3 ) because T2 and T3 have a greater overlap regarding the used arguments. More formally: For every statement s in a tree T which only has leaves s1 , . . . , sn (n > 2) as premises for s with r(s1 ) = · · · = r(sn ) 6= 0 and w(s1 , s) = · · · = w(sn , s) and ∀a ∈ ρI→s : w(a) 6= 0, consider the trees Tk , n ≥ k > 0, which only contain s1 , . . . , sk . Then, given k1 < k2 < k3 , we want to have kk21 < kk23 =⇒ d(Tk1 , Tk2 ) > d(Tk2 , Tk3 ), i.e. if the relative overlap of the number of arguments used is greater, the distance is smaller. Like- wise, we demand kk21 > kk23 =⇒ d(Tk1 , Tk2 ) < d(Tk2 , Tk3 ), and kk12 = kk32 =⇒ d(Tk1 , Tk2 ) = d(Tk2 , Tk3 ). We require ∀a ∈ ρI→s : w(a) 6= 0 (i.e. no argument (edge) with weight 0 along the path from I to s), because if a user gives an argument a weight of 0, they say the premise is not related to the conclusion, thus not related to the topic of the discussion. This means that the user actually does not care about the opinions underneath that argument, which may be treated as if no opinion has been given. In the following example, we have k1 = 1, k2 = 2, and k3 = 3, thus 12 < 23 =⇒ dT (T1 , T2 ) > dT (T2 , T3 ): T1 s T2 s T3 s 1 1 1 1 1 1 k1 k2 k2 k3 k3 k3 r s1 r s1 r s2 r s1 r s2 r s3 Proof. Only the argument weights for (si , s) (namely k11 , k12 , k13 , respectively, as used in (8) = (9)) are different and contribute to the sum, all other summands are zero. The common weight products and common values for r are summarized as wr for readability and are factored out. For (10) > (11), remember that kk12 < kk32 . 8 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 0 dT (Tk1 , Tk2 ) = wr (1 − α) ∑ α depth(s ) |wk1 (s0 , s) − wk2 (s0 , s)| (8) s0 ∈{s1 ,...,sn } 1 depth(s0 ) 1 1 = wr (1 − α) ∑ α k1 · − + (k2 − k1 ) · (9) s0 ∈{s1 ,...,sn } k1 k2 k2 depth(s0 ) k1 = wr (1 − α) ∑ α 2 − 2 (10) s0 ∈{s1 ,...,sn } k2 depth(s0 ) k2 > wr (1 − α) ∑ α 2 − 2 = dT (Tk2 , Tk3 ) (11) s0 ∈{s ,...,s } k3 1 n The other cases are proven by replacing “>” with “<” or “=”, respectively. Desideratum 2 (Contrary opinion is worse than no opinion). Consider trees T1 , T2 , T3 , where all trees are identical, but T1 has no opinion on a statement s, T2 a positive opinion on s and T3 a negative opinion on s. As we definitely know that T2 and T3 disagree on s, we want to have d(T2 , T3 ) > d(T1 , T2 ). Formally: For any statement s in a tree T with ∀a ∈ ρI→s : w(a) 6= 0, let T + be like T but with r+ (s) = q > 0, T − like T with r− (s) = p < 0, and T 0 like T with r0 (s) = 0. Then d(T + , T − ) > d(T + , T 0 ). T0 0 s T+ q s T− p s Proof. The only positive summand is the summand for s (which has rating 0, q or p for T 0 , T + , and T − , respectively). The argument weights and the α term are common to all summands, can be factored out, and are summarized as wα . dT (T + , T − ) = wα · |q − p| = wα · (q + p) > wα · |q − 0| = dT (T + , T 0 ) (12) Desideratum 3 (Deviation in deeper parts has less influence than deviation in higher parts). Consider the trees T1 , T2 , T3 , where T1 has an argument (sA , s) with no chil- dren and ∀a ∈ ρI→s : w(a) 6= 0, T2 is constructed from T1 by adding a new statement sB and argument (sB , s) with w2 (sB , s) = w2 (sA , s) = w1 (s2A ,s) , and T3 is constructed from T1 by adding a new statement sA1 and argument (sA1 , sA ) with w3 (sA1 , sA ) = 1. If r(sA ) = r(sB ) = r(sA1 ) 6= 0, then we want to have d(T1 , T2 ) > d(T1 , T3 ), because argu- ments deeper in the tree should have a smaller influence since we consider them less important for the overall opinion. We require r(sA ) = r(sB ) = r(sA1 ) 6= 0 because adding a statement with “don’t care” opinion should not actually change the distance. We want equality because this desider- atum should cover only differences in the depth of the statements, not their rating. T1 s T2 s T3 s w w w 2 2 w r sA r sA r sB r sA 1 r sA1 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 9 Proof. The only differences are the summands including s, sA and sA1 . Let r := r(sA ) = r(sB ) = r(sA1 ) 6= 0 and w := w1 (sA , s). As before, we summarize common values for weights and α as wα . w w dT (T1 , T2 ) = wα · rw − r + 0−r = wα · |rw| (13) 2 2 > wα · α|rw| = wα · (α · |r · 0 · w − r · 1 · w|) = dT (T1 , T3 ) (14) Desideratum 4 (Influence of deeper parts depends on weights in higher parts). Consider trees T1 , T2 , T3 , where all trees are identical, have statements sA and sB with conclusion s, and w(sA , s) > w(sB , s), but T1 has an additional argument for sA and T3 has an additional argument for sB . Although the difference in both cases is only one argument, we expect d(T1 , T2 ) > d(T3 , T2 ) because (sA , s) has a larger weight. Formally: Let T2 be a weighted argumentation tree with arguments (sA , s), (sB , s) and w(sA , s) > w(sB , s), no premises for sA and sB and ∀a ∈ ρI→s : w(a) 6= 0. T1 is constructed from T2 by adding (sA1 , sA ) and T3 from T2 by adding (sB1 , sB ), each with a weight of 1 and r1 (sA1 ) = r3 (sB1 ) 6= 0. Then dT (T1 , T2 ) > dT (T3 , T2 ). T1 s T2 s T3 s w(sA , s) w(sB , s) w(sA , s) w(sB , s) w(sA , s) w(sB , s) sA sB sA sB sA sB 1 1 r sA1 r sB1 Proof. Let r := r1 (sA1 ) = r3 (sB1 ) 6= 0. Only the summand which includes sA1 or sB1 , respectively, contributes a value greater than 0. dT (T1 , T2 ) = wα · |w(sA , s) · r − 0| > wα · |0 − w(sB , s) · r| = dT (T2 , T3 ) (15) Desideratum 5 (Weights of arguments have influence even if they are the only differ- ence). Consider trees T1 , T2 , where all trees are identical and have the arguments (sA , s) and (sB , s), but the weights are different: w1 (sA , s) 6= w2 (sA , s) and w1 (sB , s) 6= w2 (sB , s). We want to have d(T1 , T2 ) > 0 if there exists a statement s0 below sA (or sA itself) with r1 (s0 ) = r2 (s0 ) 6= 0 and ∀a ∈ ρI→s0 : w(a) 6= 0. We demand r1 (s0 ) = r2 (s0 ) 6= 0 because it makes sense if weights leading only to statements which are rated as “don’t care” are ignored. In this example, we have sA = s0 : T1 s T2 s w1 (sA , s) w1 (sB , s) w2 (sA , s) w2 (sB , s) r sA sB r sA sB 10 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs Proof. It is enough to show that there is at least one summand greater than 0. |w1 (sA , s) − w2 (sA , s)| > 0 (16) 0 =⇒ α depth(s ) (1 − α)|w1 (sA , s) − w2 (sA , s)| > 0 (17) depth(s0 ) =⇒ α (1 − α)r1 (s0 ) ∏ w1 (sA0 )|w1 (sA , s) − w2 (sA , s)| > 0 (18) sA0 ∈ρs0 →I \(sA ,s) 0 =⇒ α depth(s ) (1 − α) r1 (s0 ) ∏ w1 (sA0 ) − r2 (s0 ) ∏ w2 (sA0 ) > 0 (19) sA0 ∈ρs0 →I sA0 ∈ρs0 →I For (18) =⇒ (19), remember that all weights in ρs0 →I \ (sA , s) and all ratings are the same for T1 and T2 , e.g. r1 (s0 ) = r2 (s0 ). Desideratum 6 (Symmetry regarding negation of opinion). Let T1 , T2 be any weighted argumentation trees, and T3 , T4 , respectively, the same trees, but the opinion for each statement is negated, i.e. r3 (s) = −r1 (s) and r4 (s) = −r2 (s) for all s ∈ S. We expect that a metric is symmetric regarding negation, i.e. d(T1 , T2 ) = d(T3 , T4 ). Proof. This holds because |r1 (s) − r2 (s)| = |(−r3 (s)) − (−r4 (s))| = |r3 (s) − r4 (s)|. Desideratum 7 (Trade-off between argument weights and agreement). Consider trees T1 , T2 , T3 which are nearly identical and have leaf statements sA and sB with com- mon conclusion s and ∀a ∈ ρI→s : w(a) 6= 0. We have r1 (sA ) = r2 (sA ) = 0.5, r3 (sA ) = −0.5, r1 (sB ) = r2 (sB ) = r3 (sB ) = 0, and w1 (sA , s) > w2 (sA , s). Furthermore, w2 (sB , s) = w1 (sB , s) + w1 (sA , s) − w2 (sA , s), w3 (sB , s) = w1 (sB , s) + w1 (sA , s) − w3 (sA , s), i.e. sB is neutral and “collects” remaining weight such that the sum is 1. If w1 (sA , s) − w2 (sA , s) > w2 (sA , s) + w3 (sA , s), although T1 and T2 have the same opinion on sA , we want to have d(T1 , T2 ) > d(T2 , T3 ), because both T2 and T3 do not care much about their (different) opinions on sA . Likewise, if w1 (sA , s) − w2 (sA , s) < w2 (sA , s) + w3 (sA , s), we expect d(T1 , T2 ) < d(T2 , T3 ) because the weights w1 and w2 are closer to each other and give a greater weight for opposing opinions on sA . The following example trees depict the first case with concrete weight values. Be- cause the different opinions on statement sA are underneath an argument edge with small weight, we want to have d(T1 , T2 ) > d(T2 , T3 ). T1 0 s T2 0 s T3 0 s 0.9 0.1 0.1 0.9 0.2 0.8 0.5 sA 0 sB 0.5 sA 0 sB −0.5 sA 0 sB Proof. We proof the first case. For (20) = (21), remember that w1 (sA , s) > w2 (sA , s). d(T1 , T2 ) = wα · |0.5 · w1 (sA , s) − 0.5 · w2 (sA , s)| (20) = wα · 0.5 · (w1 (sA , s) − w2 (sA , s)) (21) > wα · 0.5 · (w2 (sA , s) + w3 (sA , s)) (22) = wα · |0.5 · w2 (sA , s) − (−0.5) · w3 (sA , s)| = d(T2 , T3 ) (23) Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 11 The other case follows by replacing “>” with “<”. Desideratum 8 (Trade-off between statement ratings and agreement). Consider trees T1 , T2 , T3 which are nearly identical and have a statement s and ∀a ∈ ρI→s : w(a) 6= 0. We have r1 (s) > r2 (s) > 0 > r3 (s) such that |r1 (s) − r2 (s)| > |r2 (s) − r3 (s)|. Although T1 and T2 have the same positive opinion on s, we want to have d(T1 , T2 ) > d(T2 , T3 ), because both T2 and T3 have a weak opinion on s. Likewise, if |r1 (s) − r2 (s)| < |r2 (s) − r3 (s)|, we expect d(T1 , T2 ) < d(T2 , T3 ) because the ratings r1 (s) and r2 (s) are closer to each other than r2 (s) and r3 (s). The first case, d(T1 , T2 ) > d(T2 , T3 ), is shown in the following example: T1 0.4 s T2 0.1 s T3 −0.1 s Proof. Only the summand for s contributes to the distance, all other summands are 0. Remember that all weights on the path to s are positive. d(T1 , T2 ) = wα |r1 (s) − r2 (s)| > wα |r2 (s) − r3 (s)| = d(T2 , T3 ) (24) The other case follows by replacing “>” with “<”. Desideratum 9 (Weights limit the influence of a path). Consider graphs T1 , T2 which are nearly identical, have an argument (sA , s) with w = w1 (sA , s) = w2 (sA , s) and only the ratings and weights below (and including) sA may differ in any way. No matter how those values are chosen, we want to have d(T1 , T2 ) ≤ w, i.e. the maximum influence of the differences below sA is limited by the weight of sA to its conclusion. Proof. It is enough to consider paths which include a, since the summands for all other parts are 0. Let Sa be the set of all statements which have an argument leading to a, 0 including a itself. We abbreviate α depth(s ) (1 − α) as α(s0 ). dT (T1 , T2 ) = ∑ α(s0 ) r1 (s0 ) ∏ w1 (sA0 ) − r2 (s0 ) ∏ w2 (sA0 ) (25) s0 ∈Sa sA0 ∈ρS→I sA0 ∈ρS→I = w ∑ α(s0 ) r1 (s0 ) ∏ w1 (sA0 ) − r2 (s0 ) ∏ w2 (sA0 ) s0 ∈Sa sA0 ∈ρS→I \(sA ,s) sA0 ∈ρS→I \(sA ,s) (26) As the factor after w is in [0, 1], we get dT (T1 , T2 ) ≤ w. 4. Limitations Although the proposed pseudometric fulfills several intuitive desiderata, there are some limitations which we will discuss in the following. If a weight of an argument is 0, the proposed pseudometric ignores all weights which are underneath this argument. When comparing how similar people argue for or against 12 Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs T1 0 I T2 0 I T3 0 I T30 0 I 0.1 0.9 0.6 0.4 0.3 0.7 0.15 0.35 0.5 0.5 a 0.5 b 0.5 a 0.5 b 0.5 a 0.5 b 0.5 a 0.5 b 0.5 c Figure 2. Possible unexpected change in order if an unrelated opinion is added: 0.05 = d(T1 , T3 ) < d(T2 , T3 ) = 0.075, but 0.1375 = d(T1 , T30 ) > d(T2 , T30 ) = 0.125 with α = 0.5 the top-level positions, this is okay, but if also the way how arguments which are not supported are attacked should influence the distance, the metric has to be extended. Moreover, ordering can be changed by adding an unrelated opinion. Consider trees T1 , T2 , T3 with d(T1 , T3 ) < d(T2 , T3 ). At first sight, it might seem unexpected that this order can be changed to d(T1 , T30 ) > d(T2 , T30 ) by adding a new position c to T3 and keeping the relative weights of the other positions. Depending on the application context, e.g. a voting advice application (VAA), this might be unwanted. An example is depicted in Figure 2. This is due to the normalization of the argument weights. Although even end-user friendly systems like D-BAS support for undercuts, i.e. ar- guments that have an argument as conclusion, undercuts are currently not explicitly mod- eled in our model. This is no big problem, because in many applications, for instance, in a VAA, arguments can be preselected such that no undercut attack is necessary (since the arguments make sense), or rephrased such that the premise is attacked. For example, con- sider the argument “We should build more nuclear power plants because cats are cute” and the attack “Cats and nuclear power plants are unrelated”. Though this is technically an undercut, a user interface may present this as an attack on “Cats are cute”. 5. Related Work Calculating the distance between argumentation graphs to compare how similar the atti- tudes of two agents are has already been used in other systems. The Carneades opinion formation and polling tool presented in [9] is able to compare one’s argumentation with the argumentation of other entities like organizations. This comparison is simply done by counting the number of statements where the agreement/disagreement is the same. This approach is much simpler than our proposal, but uses neither weights nor ratings and violates i.a., Desideratum 3. The mobile application described in [1] also bases on IBIS and extends it with an agreement value for each argument in the argumentation tree. This information is used, for opinion prediction using collaborative filtering. In contrast to our work, the idea of relative argument importance in combination with statement rating is not present. Another application of calculating the similarity of weighted trees is match-making of agents, which are represented by weighted trees. In [3], a recursive similarity measure for this application is proposed. Its parameter N serves a similar purpose to our parameter α. They also give examples which are similar to our desiderata, e.g. Example 4 is like our Desideratum 2. Nodes, however, do not have a weight, and some desiderata are not fulfilled; for instance, Desideratum 5 is explicitly not demanded in their Example 2. There are already other definitions of weighted argumentation graphs based upon Dung’s definition of argumentation frameworks, but many lack the differentiation be- tween argument relevance and statement ratings (e.g. [2,8,10,6]), which we think is im- Brenneis et al. / How Much Do I Argue Like You? Towards a Metric on Weighted Argumentation Graphs 13 portant since argument weights limit how much a branch of an argumentation is rele- vant, whereas statement ratings are only relevant for the single statement. Furthermore, in most cases, there is a global assignment of values in the graph and no user-specific views, which is why a strength of 0 for attacks would be meaningless in the model pre- sented in [13]. Note that most related work in this field is concerned about evaluating consistency or calculating extensions, whereas our main goal is comparing the attitudes of agents, not caring about whether an agents’ attitude is logically consistent or not. 6. Conclusion and Future Work In this paper we proposed a pseudometric to calculate the distance between two argu- mentation graphs representing the attitudes of different persons, and several desiderata which should be considered when proposing other metrics for the same purpose, and are fulfilled by our pseudometric. Possible next steps include developing other sensible metrics and comparing them regarding theoretical properties and practicality. In future work, we want to check if the desiderata are not only intuitive for experts in argumentation theory, but are following the intuition of untrained humans. We also want to test the metric in a VAA to compare the argumentation of voters with those of parties. Thereby we see whether the results of the metric are accepted in an application context. References [1] Althuniyan, N., Sirrianni, J.W., Rahman, M.M., Liu, X.F.: Design of mobile service of intelligent large- scale cyber argumentation for analysis and prediction of collective opinions. In: International Confer- ence on AI and Mobile Services. pp. 135–149. Springer (2019) [2] Amgoud, L., Ben-Naim, J., Doder, D., Vesic, S.: Acceptability semantics for weighted argumentation frameworks. In: IJCAI. vol. 2017, pp. 56–62 (2017) [3] Bhavsar, V.C., Boley, H., Yang, L.: A weighted-tree similarity algorithm for multi-agent systems in e-business environments. Computational Intelligence 20(4), 584–602 (2004) [4] Brenneis, M., Mauve, M.: deliberate – online argumentation with collaborative filtering. In: COMMA (2020), to appear [5] Dung, P.M.: On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games. Artificial Intelligence 77(2), 321–357 (1995) [6] Dunne, P.E., Hunter, A., McBurney, P., Parsons, S., Wooldridge, M.: Weighted argument systems: Basic definitions, algorithms, and complexity results. Artificial Intelligence 175(2), 457–486 (2011) [7] Ebbinghaus, B.: Decision making with argumentation graphs. arXiv preprint arXiv:1908.03357 (2019) [8] Gordon, T.F., Walton, D.: Formalizing balancing arguments. In: COMMA. pp. 327–338 (2016) [9] Gordon, T.F.: Structured consultation with argument graphs. From Knowledge Representation to Argu- mentation in AI. A Festschrift in Honour of Trevor Bench-Capon on the Occasion of his 60th Birthday pp. 115–133 (2013) [10] Hunter, A.: A probabilistic approach to modelling uncertain logical arguments. International Journal of Approximate Reasoning 54(1), 47–81 (2013) [11] Krauthoff, T., Meter, C., Mauve, M.: Dialog-Based Online Argumentation: Findings from a Field Ex- periment. In: Proceedings of the 1st Workshop on Advances in Argumentation in Artificial Intelligence. pp. 85–99 (November 2017) [12] Kunz, W., Rittel, H.W.J.: Issues as elements of information systems, vol. 131. Citeseer (1970) [13] Martınez, D.C., Garcıa, A.J., Simari, G.R.: An abstract argumentation framework with varied-strength attacks. In: Proceedings of the Eleventh International Conference on Principles of Knowledge Repre- sentation and Reasoning (KR’08). pp. 135–144 (2008) [14] Neugebauer, D.: Dabasco: Generating af, adf, and aspic+ instances from real-world discussions. In: COMMA. pp. 469–470 (2018)