Fostering Explainable Online Review Assessment Through Computational Argumentation Atefeh Keshavarzi Zafarghandi, Davide Ceolin Human-Centered Data Analytics, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands Abstract Explainable methods have received increased attention within artificial intelligence. Wherever an automated system makes a decision an explanation is required to convince a user about the decision. Fur- thermore, online information quality assessment is crucial to help users navigate information. However, explaining the assessment of online information had not been clarified well. The current work provides explanations to a user about the assessment of online information and specific, provides explanations for the quality assessments of online reviews. We construct an abstract argumentation framework (AF) based on a set of given reviews. We consider the grounded semantics of AFs to assess each topic. Then, we discuss the question of why a score can be assigned to a topic of a product. Furthermore, we indicate a proper score of a review based on the scores of topics within the review in question. We also collect arguments that can support the chosen score of a review. Keywords Online reviews, Abstract Argumentation frameworks, Explainable Artificial Intelligence 1. Introduction Argumentation is one research area that is frequently mentioned in explainable AI [1]. To our knowledge, the role of argumentation formalisms in the sense of explanation for the assessment of the quality of reviews has not been investigated in depth. In this work we aim to cover this gap, i.e., to use an argumentation formalism not only for assessing the quality of online information but also as a means to explain why a given score is assigned by a system to a topic or a review. This is particularly compelling because online reviews are available in large amounts, but for them to be beneficial to users, their quality needs to be determined. Given the volume of the information at stake here, an automated approach is necessary to address the problem properly. Given that the result of this automated assessment is meant to be used by humans, the ability to explain the assessment process is likewise crucial. Computational argumentation is, in our opinion, a promising methodology in this sense. Abstract argumentation frameworks (AFs for short), as introduced by Dung [2] is a directed graph in which nodes represent arguments and edges denote attack relation among arguments. In this work, we use AFs as a means of modeling and evaluating reviews and as an explanation for a chosen decision. 1st International Workshop on Argumentation for eXplainable AI (ArgXAI, co-located with COMMA ’22), September 12, 2022, Cardiff, UK $ akz@cwi.nl (A. Keshavarzi Zafarghandi); davide.ceolin@cwi.nl (D. Ceolin)  0000-0002-5806-1012 (A. Keshavarzi Zafarghandi); 0000-0002-3357-9130 (D. Ceolin) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) 1 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 In the work presented in this paper, we aim to model a set of reviews with AFs and use the grounded semantics of AFs as a means of evaluation and explanation. That is, this model allows reasoning on the ‘acceptability of topics within reviews’ when reviews are conflicting with each other. Such conflicts are solved by defining a weight for topics within reviews, which extends and generalizes a review weight we defined in previous work [3]. To this end, we first consider each set of reviews containing the same topic with the same score as a single argument. Then, we introduce an attack relation if the weight of arguments is not the same. We first use the grounded semantics of AFs to assess the score of each topic, and secondly to explain why a system chooses a specific score for a topic in question. That is, we aim at providing the final user with an explanation of why a specific score for a topic in question is acceptable (or trustable). In other words, we aim to indicate what the score of a topic about a target/product is based on the set of reviews. Some specific features of our work are as follows: 1. We consider all topics within a review, instead of only picking the most important one. Thus, we do not miss any information presented in a review. 2. By our method, after the assessment, a user can ask about the score (strength) of topics of a product, instead of just asking about the scores of the reviews. 3. Our method is solid enough to explain to a user the reason of assigning a score to a topic. Also, explaining why an argument does not have a roll in the assessment of the score of a topic. 4. We also accumulate the scores of topics within a review to assess a review score. Further- more, we present an explanation for a review score from a machine point of view, which is a subset of the grounded semantics of AFs. Thus, this explanation is beyond any doubt. 5. For indicating the scores of topics about a product within a set of reviews, based on a system’s points of view, we do not need to consider any generalization of AFs. Since the weight of each argument is used to indicate the direction of the attack relation. The rest of the paper is structured as follows. Section 2 presents related work, and Section 3, introduces abstract argumentation frameworks. Section 4 presents a model to formally represent and reason on reviews. Section 5 concludes and outlines future work directions. 2. Related Work Explainable artificial intelligence (XAI) has received increased attention to explain decisions of automated systems [4]. Several machine learning methods are used to support decision-making. However, these methods are required to convince a user about the machine decision. In other words, an AI system needs to explain its decision to a user. Argumentation theory can help the process of explanation, (see [1, 5] for a survey). Some argumentation frameworks with respect to their use in support of explainable artificial intelligence (XAI) are presented in [6]. In [7] a new type of argumentation semantics is presented for AFs for capturing explanation. Furthermore, argumentation is used to explain why and/or whether a certain argument can be accepted under certain semantics [8, 9, 10, 11]. 2 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 On the other hand, a significant amount of research investigates the quality of reviews. In [3] a generalization of abstract argumentation framework is used to assess the quality of the reviews on a product, extending and combining preferred argumentation frameworks [12, 13] and valued- based argumentation frameworks [14, 15] to model and evaluate a set of reviews. Furthermore, to analyze product reviews, different methods for generating probability distributions over constellations of arguments graph have been presented in [16, 17]. In [17] authors assume that an agent(s) specifies a belief in the acceptability status of arguments. However, in [16] the authors propose a scoring method for identifying the probability distribution for a review. After the extraction of support and attack relations between reviews, a set of reviews is modeled by an abstract dialectical framework [18] which generalizes AFs [2]. 3. Background: Argumentation Formalisms We start the preliminaries of our work by recalling the basic notion of Dung’s abstract argu- mentation frameworks (AFs) [2]. Definition 1. [2] An abstract argumentation framework (AF) is a pair (𝐴, 𝑅) in which 𝐴 is a set of arguments and 𝑅 ⊆ 𝐴 × 𝐴 is a binary relation representing attacks among arguments. Let 𝐹 = (𝐴, 𝑅) be a given AF. For each 𝑎, 𝑏 ∈ 𝐴, the relation (𝑎, 𝑏) ∈ 𝑅 is used to represent that 𝑎 is an argument attacking the argument 𝑏. An argument 𝑎 ∈ 𝐴 is, on the other hand, defended by a set 𝑆 ⊆ 𝐴 of arguments (alternatively, the argument is acceptable with respect to 𝑆) (in 𝐹 ) if for each argument 𝑐 ∈ 𝐴, it holds that if (𝑐, 𝑎) ∈ 𝑅, then there is a 𝑠 ∈ 𝑆 such that (𝑠, 𝑐) ∈ 𝑅 (𝑠 is called a defender of 𝑎). Different extension-based semantics of AFs present which sets of arguments in a given AF can be accepted jointly [2]. We only recall grounded semantics here,1 because it is proven in [2] that 1. every AF has a unique grounded extension, 2. there is no doubt on the acceptance of the arguments in the grounded extension, 3. in any acyclic finite AF all sets of semantics coincide, 4. and in this work we only have acyclic frameworks. Set 𝑆 ⊆ 𝐴 is called a conflict-free set (extension) (in 𝐹 ) if there is no 𝑎, 𝑏 ∈ 𝑆 such that (𝑎, 𝑏) ∈ 𝑅. The characteristic function 𝐹 : 2𝐴 ↦→ 2𝐴 is defined as 𝐹 (𝑆) = {𝑎 | 𝑎 is defended by 𝑆}. A set 𝑆 ∈ cf(𝐹 ) is the grounded extension in 𝐹 if 𝑆 is a unique fixed point of 𝐹 𝑛 (∅). Example 1. Let 𝐹 = ({𝑎, 𝑏, 𝑐}, {(𝑎, 𝑏), (𝑏, 𝑐)}) be an AF. In 𝐹 , (𝑎, 𝑏) means that argument 𝑎 attacks 𝑏, and (𝑏, 𝑐) means that 𝑏 attacks 𝑐. Here, argument 𝑐 is defended by set {𝑎} (alternatively, 𝑐 is acceptable with respect to {𝑎}), since 𝑎 attacks the attacker of 𝑐, namely 𝑏. The set of conflict-free sets of 𝐹 is cf(𝐹 ) = {∅, {𝑎}, {𝑏}, {𝑐}, {𝑎, 𝑐}}. A unique grounded extension of 𝐹 is {𝑎, 𝑐}. The intuition is that 𝑎 is not attacked by any argument, thus no one has any doubt about accepting argument 𝑎. Argument 𝑐 is attacked by 𝑏, however, it is defended by 𝑎 which was accepted by everyone. 1 The reader interested in semantics of AFs can see [2]. 3 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 4. Modeling Reviews with Formal Argumentation Here, we first model reviews in a formal manner by means of abstract argumentation frameworks (AFs), and then show how to use AFs to evaluate the score of a topic of a product based on given reviews (see Section 4.2). To this end, we collect the reviews with the same score containing the same topic and we consider them as a single argument. After indicating the attack relations between arguments, we construct an AF. Then, we use the grounded semantics of AFs for indicating the most trusted arguments and for a reason of explaining why a score is assigned to a topic and a review in our automated method. 4.1. Formal Modeling of Reviews Let 𝑡 be a product (target) and let {r𝑖 (𝑡)} be a set of reviews of 𝑡 (alternatively, {r𝑡𝑖 }). Each review r𝑡𝑖 consists of a numerical score 𝑠𝑐(r𝑡𝑖 ) (e.g., in 5-level Likert scale) and a textual description. The description characterizes the product focusing to specific topics. E.g., in the case of a pair of shoes, the topics can be ‘sole’, ‘upper’ or ‘comfort’. We represent the list of all topics relevant to the product 𝑡 as 𝒯𝑡 , where 𝒯𝑡 = {𝜑1 , . . . , 𝜑𝑛 }. Each review r𝑡𝑖 contains a finite set of topics 𝒯r𝑡𝑖 ⊆ 𝒯𝑡 . E.g., review ‘r𝑡1 : the color of the shoes are not my favorite but I use it for a long time and they are still looking good’ contains two topics, i.e., 𝒯r𝑡1 = {𝜑1 = color, 𝜑3 = quality}. The review score represents the actual reviewer’s opinion on the product, while the text aims at motivating such judgment. An agreeing score of two reviews containing the same topic about the same product indicates a support between reviews while disagreeing score indicates a conflict. Our goal is to identify the score of each topic of a product, based on the set of given reviews, i.e., which score is proper for a topic in question. We address this by constructing an AF based on a set of given reviews. Then, we use the grounded semantics of AFs to indicate a score to a topic in question and explain why it is the case. In the end, we further update the scores of reviews based on their topics. We also, prepare an explanation for the updated score of the reviews by using the grounded semantics of AFs. In review r𝑖𝑗 in Table 1, index 𝑖 indicates the product and 𝑗 indicates the reviewer. Reviews can support one another if they have the same score and they have a common topic. If a set of reviews has the same score and has a common topic, then it means that these reviews support one another. Thus, we consider them as a single argument. In Table 1, topic ’𝜑 : quality’, indicated by a topic detection, is a common topic between r21 and r22 , however, these reviews do not have the same score. Reviewer r21 gave a score of 4 out of 5 to this product. The last sentence in r21 presents the main reason why the reviewer gave a score of 4 out of 5 to this product because she/he was satisfied with the quality of this product. However, the reviewer of r22 is not as satisfied as the reviewer of r21 with the quality of this product. Thus, there is an attack relation between reviews r21 and r22 , because 𝑠𝑐(r21 ) ̸= 𝑠𝑐(r22 ). Reviews can be classified based on their topics. Assume that 𝜑 is a topic of a set of reviews 𝐸 = {r𝑡1 , . . . , r𝑡𝑗 }, if there is 𝐸 ′ ⊆ 𝐸, such that for any r𝑡𝑖 , r𝑡𝑗 ∈ 𝐸 ′ , it holds that 𝑠𝑐(r𝑡𝑖 ) = 𝑠𝑐(r𝑡𝑗 ), then we say that reviews in 𝐸 ′ support one another and we consider all reviews as a single argument. This leads to the classification of 𝐸 based on the scores of reviews. Between arguments with different scores, there is a symmetric attack relation. If there is no intersection 4 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 Id Score Review Text r11 ***** Comfortable r12 * Waaaay too BIG r21 **** Fit perfectly. I bought dark grey, and they didn’t fade r22 *** They fit great. But they fade bad Table 1 Example reviews. r11 and r12 are about the same product, and r21 and r22 are. The reviews, pairwise, disagree on the scores given. The topics tackled, though, are slightly different. between topics within r𝑡𝑖 and r𝑡𝑗 , then there is no relation between these two reviews. 4.2. Modeling Reviews with AFs Let {r𝑡1 , . . . , r𝑡𝑛 } be a finite set of reviews on product 𝑡. Let 𝒯𝑡 = {𝜑𝑖 } be the set of topics relevant to the product 𝑡. Each review r𝑡𝑖 consists of a numerical score 𝑠𝑐(r𝑡𝑖 ) (e.g., in 5-level Likert scale) and a textual description. Let 𝒯𝑡,r𝑡𝑖 be the set of all topics presented in review r𝑡𝑖 . Different topics have different importance in each review, to indicate this importance we introduce a weight function in Definition 2 which shows the initial weight of topic 𝜑 in review r𝑡𝑖 with score 𝑠𝑐(r𝑡𝑖 ). Definition 2. Let r𝑡𝑖 be a review, let 𝜑 be a topic in r𝑡𝑖 , and let 𝑠𝑐(r𝑡𝑖 ) be a score of r𝑡𝑖 . 𝑤(𝑠𝑐(r𝑡𝑖 ), 𝜑, r𝑡𝑖 ) is called the initial weight of 𝜑 in review r𝑡𝑖 and score 𝑠𝑐(r𝑡𝑖 ), where 𝑤 is a value in [0, ∞] such that 𝑤 is computed by aggregating one or multiple factors meeting the following criteria: 1. at least one of such factors is the result of an abstraction function computed on the review itself. Such abstraction functions should be computable over any review and allow establishing a total order of reviews. Example of such abstraction functions include, for instance, readability scores (e.g., Dale-Chall readability [19]) and complexity measures (e.g., Kolmogorov complexity [20]); 2. optional factors can be computed as abstractions over the combination of the review r𝑡𝑖 and/or its topic 𝜑. We aim to use a formalism of argumentation, i.e., abstract argumentation frameworks (AFs), to assess the score of the topics within reviews of a product 𝑡. To this end, we construct an AF based on a set of reviews. Our method associates each argument to the set of reviews with the same score about a common topic. That is, we consider all reviews with the same score that presents the same topic as a single argument. Formally, let [𝜑]𝑘 = {r𝑡𝑖 | r𝑡𝑖 contains topic 𝜑 and 𝑠𝑐(r𝑡𝑖 ) = 𝑘}, i.e., [𝜑]𝑖 collect all the reviews that contains topic 𝜑 and have score 𝑘. We consider [𝜑]𝑘 as a single argument, that contains all the reviews that have topic 𝜑 and score 𝑘. We are interested in evaluating the weight of [𝜑]𝑘 as a single argument. To this end, for each 𝑘 with 𝑘 ∈ {1, 2, 3, 4, 5}, we sum up the initial weights of topic 𝜑 in review r𝑡𝑖 and score 𝑘. Note that in Definition 3 if 𝜑 is not a topic of r𝑡𝑖 then 𝑤(𝑘, 𝜑, r𝑡𝑖 ) = 0. 5 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 Definition 3. Let 𝜑 be a topic of a product 𝑡, let 𝑘 be a natural number between 1 and 5. The weight of topic 𝜑 with respect to score 𝑘, alternatively, the weight of argument [𝜑]𝑘 is as follows: 𝑤([𝜑]𝑘 ) = Σ𝑛𝑖=1 𝑤(𝑘, 𝜑, r𝑡𝑖 ) where 𝑛 is the number of reviews about product 𝑡, and 𝑤(𝑘, 𝜑, r𝑡𝑖 ) is the initial weight 𝜑 in review r𝑡𝑖 and score 𝑘 as introduced in Definition 2. To construct an AF based on a set of reviews, we consider an attack relation from an argument [𝜑]𝑖 to [𝜑]𝑗 iff 𝑤([𝜑]𝑖 ) > 𝑤([𝜑]𝑗 ). Definition 4. Let 𝑡 be a product, let 𝒯𝑡 = {𝜑𝑖 } be a set of topics, and let 𝑖 be a natural number between 1 and 5. For each 𝑖, with 𝑖 ∈ {1.2, 3, 4, 5}, and for each 𝜑 ∈ 𝒯𝑡 , we introduce an argument 𝑎𝑖,𝜑 = [𝜑]𝑖 . Furthermore, 𝑤(𝑎𝑖,𝜑 ) is the weight of argument 𝑎𝑖,𝜑 , equal to 𝑤([𝜑]𝑖 ), as introduced in Definition 3. An AF constructed based on topics is 𝐹 = (𝐴, 𝑅) where, • 𝐴 = {𝑎𝑖,𝜑 } • 𝑅 = {(𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ) | 𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ∈ 𝐴 and 𝑤(𝑎𝑖,𝜑 ) > 𝑤(𝑎𝑗,𝜑 )}. An AF, constructed based on topics of a product, is a directed graph. Each node, indicated by 𝑎𝑖,𝜑 contains all sets of reviews that contain topic 𝜑 with score 𝑖. The reason for collecting all such reviews is that if two reviews contain the same topic 𝜑 and give the same score to the product, then their content support one another. Thus, we accumulate them in a single node and we consider them as one argument. After indicating the set of arguments from a given set of reviews, we designate attack relations. If two arguments give different scores to product 𝑡, but contain the same topic, it means that there may exist a conflict between these two arguments with respect to the topic in question. We consider an attack relation between 𝑎𝑖,𝜑 and 𝑎𝑗,𝜑 when their weights are not the same. In order to indicate the direction of the attack relation between two arguments we consider the weight of the topic with respect to the score, presented in Definition 3. Note that in Definition 4 we only consider relations among arguments with the same topics. Thus, for 𝜑, 𝜑′ ∈ 𝒯𝑡 , if 𝜑 ̸= 𝜑′ , then there is no relation between any arguments of 𝑎𝑖,𝜑 and 𝑎𝑗,𝜑′ , for 𝑖, 𝑗 ∈ {1, 2, 3, 4, 5}. That is, the associated graph to AF 𝐹 , constructed based on the topics of a product, is a forest, presented formally in Lemma 1. Note that in graph theory a graph is called connected if for every pair of vertices 𝑎 and 𝑏, there is a path between 𝑎 and 𝑏. Furthermore, a connected component is a maximal connected subgraph of an undirected graph. Definition 5. Let 𝑡 be a product, and let 𝒯𝑡 = {𝜑𝑖 } be a set of topics of 𝑡. Let 𝐹 = (𝐴, 𝑅) be an AF constructed based on topics. Let 𝜑 ∈ 𝒯𝑡 , set 𝑎𝜑 = {𝑎𝑖,𝜑 } is called a component contains 𝜑 in 𝐹 . Furthermore, component 𝑎𝜑 is called connected component iff for every 𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ∈ 𝑎𝜑 it holds that (𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ) ∈ 𝑅. Lemma 1. Let 𝐹 be an AF, constructed based on topics of product 𝑡. If the reviews of the product 𝑡 contain at least two topics, then the graph associated with 𝐹 is disconnected. Furthermore, If 𝑚 is the number of topics presented in the reviews, i.e., |𝒯𝑡 | = 𝑚, then the associated graph of 𝐹 contains at least 𝑚 connected component. 6 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 Proof. If the reviews contain more than one topic, for instance 𝜑 and 𝜑′ , then it is clear that there is no relation between arguments containing 𝜑 and 𝜑′ . Thus, the associated graph is disconnected. That is, there is no link between the arguments of components 𝑎𝜑 and 𝑎𝜑′ . Thus, if we have at least two topics in the set of reviews, then the associated graph is disconnected. We now show that the associated graph contains at least 𝑚 connected components if the set of reviews contains 𝑚 number of different topics, by induction on 𝑚. Base case: assume that 𝑚 = 1, that is all reviews contain the single topic 𝜑. If all reviews give the same score to the product, then we only have one argument in the AF constructed based on these reviews. Thus, we have exactly one component. Note that if we have different arguments with the same weight, i.e., |𝑎𝜑 | > 1 and 𝑤(𝑎𝑖,𝜑 ) = 𝑤(𝑎𝑗,𝜑 ) for every 𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ∈ 𝑎𝜑 , then there are more than one connected component. Inductive hypothesis: Assume that 𝑚 = 𝑗, then the associated graph contains at least 𝑗 connected components. Inductive step: Assume that 𝑚 = 𝑗 + 1, then we have to show that the associated graph contains at least 𝑗 + 1 components. Let 𝐾 be 𝑗 different topics of 𝑡, i.e., 𝐾 ⊆ 𝒯𝑡 and |𝐾| = 𝑗. By the inductive hypothesis, the associated graph contains at least 𝑗 connected components. Let 𝜑 be a topic such that 𝜑 ̸∈ 𝐾, and let 𝜑′ ∈ 𝐾. Thus, there is no relation between arguments of 𝑎𝜑 and 𝑎𝜑′ . Thus, if 𝑚 = 𝑗 + 1 the associated graph has at least 𝑗 + 1 connected component. An AF 𝐹 = (𝐴, 𝑅) is called acyclic (or well-founded) if there is no infinite sequence of arguments 𝑎1 , . . . , 𝑎𝑖 , . . . such that (𝑎𝑖+1 , 𝑎𝑖 ) ∈ 𝑅. By Definition 4, in an AF 𝐹 = (𝐴, 𝑅), it holds that (𝑎𝑖,𝜑 , 𝑎𝑗,𝜑 ) ∈ 𝑅 iff 𝑤(𝑎𝑖,𝜑 ) > 𝑤(𝑎𝑗,𝜑 ). Furthermore, because of the transitive property of relation < in real numbers, any AF constructed based on a set of reviews is acyclic. It is proven in [2] that in any acyclic AF all sets of semantics coincide. Thus, in the following of this work, to assess the topics, presented in reviews, we focus on the grounded extension of the constructed AF based on a given set of reviews. Corollary 1. Let 𝐹 be an AF, constructed based on topics of the product 𝑡. Every connected component in the associated graph of 𝐹 is acyclic. Proof. This corollary is the direct result of the fact that 𝐹 is acyclic. Thus, any connected component of 𝐹 is also acyclic. Since, by Corollary 1 every connected component in 𝐹 is acyclic, every component has an initial argument, i.e., an argument that does not have any parents. Proposition 1 is the direct result of Definition 4. Proposition 1. Let 𝐹 be an AF constructed based on a set of topics of the product 𝑡. Let 𝑎𝑖,𝜑 be an initial argument of 𝐹 . It holds that 𝑤(𝑎𝑖,𝜑 ) is maximum among the weights of other arguments. Lemma 2. Let 𝐹 = (𝐴, 𝑅) be an AF constructed based on a set of topics of the product 𝑡. Let 𝐼 be the set of initial arguments of 𝐹 , i.e., 𝐼 = {𝑏 | there is no 𝑎 ∈ 𝐴 such that (𝑎, 𝑏) ∈ 𝑅}. The grounded extension of 𝐹 is none empty and it is equal with 𝐼. Proof. Since every AF constructed based on topics is acyclic and since each acyclic AF has a none empty grounded extension, 𝐹 has a none empty grounded extension. By the definition 7 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 of grounded semantics, every initial argument is in the grounded extension, i.e., 𝐼 ⊆ grd(𝐹 ). It remains to show that grd(𝐹 ) ⊆ 𝐼. Toward a contradiction, assume that grd(𝐹 ) ̸⊆ 𝐼. That is, there exists an argument in grd(𝐹 ) which is not an initial argument. Assume that 𝑎𝑖,𝜑 is an argument such that 𝑎𝑖,𝜑 ∈ grd(𝐹 ) but 𝑎𝑖,𝜑 ̸∈ 𝐼. Since grd(𝐹 ) ̸= ∅ and 𝐼 ⊆ grd(𝐹 ), there exists 𝑎𝑗,𝜑 ∈ grd(𝐹 ) ∩ 𝐼. By Proposition 1, 𝑤(𝑎𝑗,𝜑 ) is the maximum among the weights of other arguments, in specific, 𝑤(𝑎𝑗,𝜑 ) > 𝑤(𝑎𝑖,𝜑 ). Thus, by Definition 4, (𝑎𝑗,𝜑 , 𝑎𝑖,𝜑 ) ∈ 𝑅. That is, 𝑎𝑖,𝜑 is attacked by 𝑎𝑗,𝜑 in 𝐹 . This is a contradiction of the assumption that 𝑎𝑖,𝜑 ∈ grd(𝐹 ). Thus, the assumption that there exists an argument in the grounded extension which is not an initial argument is wrong. Hence, grd(𝐹 ) = 𝐼. 4.3. What is an AI system explanation of the score of a topic? In this section, we explain how AFs as an AI system can be used to indicate the score of each topic of the product 𝑡, based on the set of reviews. Proposition 2. Let 𝐹 = (𝐴, 𝑅) be the constructed AF based on topics of product 𝑡. Let 𝜑 be a topic of product 𝑡. There exists 𝑖 ∈ {1, 2, 3, 4, 5} such that 𝑎𝑖,𝜑 is in the grounded extension of 𝐹 . Proof. Let 𝐹 be an AF, constructed based on topics of the product 𝑡. Let 𝜑 be a topic and let 𝑎𝜑 be the connected component of 𝜑, as introduced in Definition 5. By Corollary 1, every connected component in the associated graph of 𝐹 is acyclic. Thus, for each 𝜑, 𝑎𝜑 contains an initial argument. By Lemma 2, the grounded extension of 𝐹 and the set of initial arguments of 𝐹 coincide. Thus, for each 𝜑 an argument of 𝑎𝜑 is in the grounded extension. Hence, for any 𝜑, there exists 𝑖 such that 𝑎𝑖,𝜑 is in the grounded extension of 𝐹 . We now define basic explanation in terms of functions. The function 𝑠𝑐𝐴𝐼 (−) is a unary func- tion that takes a topic as an input and returns an appropriate score to that topic by considering all the reviews containing that topic, presented in Definition 6. Definition 6. (Score of a topic) Let 𝐹 = (𝐴, 𝑅) be an AF constructed based on topics of product 𝑡. Let 𝜑 be a topic of product 𝑡. The score 𝜑 based on an AI system is denoted by 𝑠𝑐𝐴𝐼 (𝜑), defined as follows: Σ𝑎𝑖,𝜑 ∈grd(𝐹 ) 𝑖 𝑠𝑐𝐴𝐼 (𝜑) = round( ) |{𝑖 | 𝑎𝑖,𝜑 ∈ grd(𝐹 )}| Σ𝑎 ∈grd(𝐹 ) 𝑖 In 𝑠𝑐𝐴𝐼 (𝜑) the output of the function round is the nearest integer to |{𝑖 | 𝑎𝑖,𝜑 𝑖,𝜑 ∈grd(𝐹 )}| . Note that by Proposition 2 for each topic 𝜑 there exists at least an 𝑖 such that 𝑎𝑖,𝜑 ∈ grd(𝐹 ). Hence, Definition 6 is well-defined. Note that for each 𝜑, if component 𝑎𝜑 is connected, then there exists exactly one 𝑖 such that 𝑎𝑖,𝜑 ∈ grd(𝐹 ). Intuitively, for a 𝜑, if component 𝑎𝜑 is connected and 𝑎𝑗,𝜑 is an initial argument, then it holds that 𝑠𝑐𝐴𝐼 (𝜑) = 𝑗. In this case, the choice of machine, i.e., 𝑠𝑐𝐴𝐼 (𝜑) = 𝑗 can be explained that the score of topic 𝜑 is 𝑗 because argument 𝑎𝑗,𝜑 is an initial argument, i.e., 𝑤(𝑎𝑗,𝜑 ) is maximum among all other arguments in component 𝑎𝜑 . The highest weight of 𝑎𝑗,𝜑 among the arguments of 𝑎𝜑 means that sum of the weights of reviews containing 𝜑 with score 𝑗 is the highest weight. 8 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 Furthermore, in the following, we try to explain to a user why from a machine point of view topic 𝜑 has the score sc𝐴𝐼 (𝜑). To this end, we define an explanation function, denoted by Exp(𝜑, sc𝐴𝐼 (𝜑)) in Definition 7. Function 𝐸𝑥𝑝(𝜑, sc𝐴𝐼 (𝜑)) is a binary function that takes a topic 𝜑 and its machine score sc𝐴𝐼 (𝜑) as inputs and returns the set of arguments that their scores have a role in the machine decision in Definition 6. Definition 7. (Explanation of a score of a topic) Let 𝐹 be an AF, constructed based on topics {𝜑𝑖 } of product 𝑡. Let 𝜑 be a topic and let sc𝐴𝐼 (𝜑) be the score of 𝜑 from a machine point of view, as introduced in Definition 6. An explanation of why the score of topic 𝜑 is sc𝐴𝐼 (𝜑) is as follows: Exp(𝜑, sc𝐴𝐼 (𝜑)) = {𝑎𝑖,𝜑 | 𝑎𝑖,𝜑 ∈ grd(𝐹 )} Let 𝜑 be a topic, Exp(𝜑, sc𝐴𝐼 (𝜑)) collects all 𝑎𝑖,𝜑 that is in the grounded extension of 𝐹 , i.e., the set of arguments, with respect to topic 𝜑, that the acceptance of them are beyond of any doubts. Assume that sc𝐴𝐼 (𝜑) = 𝑖, the notation ¬sc𝐴𝐼 (𝜑) is a number 𝑗 such that 𝑗 ̸= 𝑖. Furthermore, the function NotDef(𝜑) collects all the arguments that contain topic 𝜑 but do not have any role in the computation of the score of topic 𝜑, from a machine point of view. In other words, NotDef(𝜑) contains all arguments which are attacked by Exp(𝜑, sc𝐴𝐼 (𝜑)). Definition 8. Let 𝐹 be an AF, constructed based on topics {𝜑𝑖 } of product 𝑡. Let 𝜑 be a topic and let sc𝐴𝐼 (𝜑) be the score of 𝜑 from a machine point of view, as introduced in Definition 6. Function NotDef(𝜑) is a unary function that takes a topic as an input and returns the set of arguments that do not have any role in the evaluation of sc𝐴𝐼 (𝜑). NotDef(𝜑) = {𝑎𝑖,𝜑 | 𝑎𝑖,𝜑 ̸∈ grd(𝐹 )} For each topic 𝜑, Exp(𝜑, sc𝐴𝐼 (𝜑)) and NotDef(𝜑) are disjoint functions, i.e., Exp(𝜑, sc𝐴𝐼 (𝜑))∪ NotDef(𝜑) = 𝑎𝜑 . For each 𝜑, if 𝑎𝑖,𝜑 ̸∈ Exp(𝜑, sc𝐴𝐼 (𝜑)), then 𝑎𝑗,𝜑 ̸∈ grd(𝐹 ), that is, there exists 𝑎𝑖,𝜑 ∈ grd(𝐹 ) such that 𝑎𝑖,𝜑 attack 𝑎𝑗,𝜑 . That is, 𝑤(𝑎𝑖,𝜑 ) > 𝑤(𝑎𝑗,𝜑 ), thus, it is reasonable that the machine neither consider the score of 𝑎𝑗,𝜑 in the evaluation of sc𝐴𝐼 (𝜑) nor argument 𝑎𝑗,𝜑 in Exp(𝜑, sc𝐴𝐼 (𝜑)). Furthermore, the function SC𝐴𝐼 (−) gives a score to a review r𝑡𝑖 based on the score of each topic within r𝑡𝑖 . That is, SC𝐴𝐼 revise the score of each review given by a reviewer. Definition 9. (Score of a review) Let r𝑡𝑖 be a review of product 𝑡. Let 𝒯𝑡,r𝑡𝑖 be a set of topics presented in review r𝑡𝑖 . The score of the review r𝑡𝑖 based on a machine assessment is the output of function SC𝐴𝐼 (r𝑡𝑖 ), evaluated as follows: Σ𝜑∈𝒯𝑡,r𝑡𝑖 sc𝐴𝐼 (𝜑) SC𝐴𝐼 (r𝑡𝑖 ) = round( ) |𝒯𝑡,r𝑡𝑖 | The score of review r𝑡𝑖 , from a machine point of view, is the round of the average of scores of the topics presented in r𝑡𝑖 , where the score of each topic from a machine point of view is presented in Definition 6. In Definition 10 grounded semantics of AFs are implicitly used to explain the reasons for the score given to a review from a machine’s point of view. 9 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 Definition 10. (Explanation of a score of a review) Let r𝑡𝑖 be a review of product 𝑡. An explanation of why the score SC(r𝑡𝑖 ) is given to a review r𝑡𝑖 is as follows: ⋃︁ Exp(r𝑡𝑖 , SC(r𝑡𝑖 )) = Exp(𝜑, sc𝐴𝐼 (𝜑)) 𝜑∈𝒯𝑡,r𝑡𝑖 In Definition 10, function Exp(r𝑡𝑖 , SC(r𝑡𝑖 )) collects all the explanations for all the topics within r𝑡𝑖 . In other words, this function collects all the initial arguments of components 𝑎𝜑 , where 𝜑 ∈ 𝒯𝑡,r𝑡𝑖 . That is, Exp(r𝑡𝑖 , SC(r𝑡𝑖 )) ⊆ grd(𝐹 ), thus, there is no doubt on the acceptance of these arguments. 5. Conclusion and Future Work We present a method to consider reviews containing a comment topic with the same score as a single argument. We aggregate the initial weights of the reviews in an argument to indicate the main weight of the arguments. We introduce an attack relation between arguments by considering the weight of arguments. Then, we use the notion of grounded semantics of AFs to evaluate the most trusted arguments. Since each argument 𝑎𝑖,𝜑 in the constructed AF is attached by 𝑎𝑗,𝜑 if 𝑗 ̸= 𝑖 and 𝑤(𝑎𝑗,𝜑 ) > 𝑤(𝑎𝑖,𝜑 ), we evaluate the score of topics in the grounded extension. Next, we introduce a function to explain the reason for choosing the associated score of a topic by a system. Then, we present a function to accumulate the scores of topics within a review to assign a score to a review from a machine point of view. In the next step, as an explanation method, we also introduce a function to give a review and its score from a system point of view as inputs and returns all the arguments that have an effect on the assessment of a review in question from a system point of view. In our approach, we identify an attack relation between arguments if they are about the same topic but their weights are different. As future work, we are eager to present relations among arguments that do not have a common topic (we are eager to consider some other features of arguments for presenting relations among them). Furthermore, we are interested in using the non-monotonic nature of argumentation theory for working on a temporal way of reasoning instead of considering a fixed set of reviews. That is, we aim to show how we can evaluate the quality of reviews if we consider the order of time of presenting of reviews. To have further human-machine interaction we aim to consider user preferences over the topics of a product to evaluate the score of a review. It is a possible topic for future work to use the generalizations of AFs, namely, valued-based AFs [14, 15] or ADFs [21], for modeling and assessing the quality of the set of the reviews. Note that the tasks of argument mining from a given KR and automatically constructing of the associated AF are not within the scope of this work because we focus on studying the use of the grounded semantics of AFs as a means to assess product reviews. Given that this shows to be a promising direction, future work will focus on optimizing the task of AF construction by combining human and automated computation. Acknowledgments This research has been supported by the Centrum Wiskunde & Informat- ica (CWI) and Supported by the Netherlands eScience Center project “The Eye of the Beholder” (project nr. 027.020.𝐺15). 10 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 References [1] K. Cyras, A. Rago, E. Albini, P. Baroni, F. Toni, Argumentative XAI: A survey, in: IJCAI, ijcai.org, 2021, pp. 4392–4399. [2] P. M. Dung, On the acceptability of arguments and its fundamental role in nonmonotonic reasoning, logic programming and n-person games, Artificial Intelligence 77 (1995) 321– 357. [3] D. Ceolin, G. Primiero, J. Wielemaker, M. Soprano, Assessing the quality of online reviews using formal argumentation theory, in: ICWE, Springer, 2021, pp. 71–87. [4] C. Lacave, F. J. Díez, A review of explanation methods for heuristic expert systems, Knowl. Eng. Rev. 19 (2004) 133–146. [5] A. Vassiliades, N. Bassiliades, T. Patkos, Argumentation and explainable artificial intelli- gence: a survey, The Knowledge Engineering Review 36 (2021). [6] M. Demollin, Q. Shaheen, K. Budzynska, C. Sierra, Argumentation theoretical frameworks for explainable artificial intelligence, in: 2nd Workshop on Interactive Natural Language Technology for Explainable Artificial Intelligence, NL4XAI@INGL 2020, Dublin, Ireland, December 18, 2020, Association for Computational Linguistics, 2020, pp. 44–49. [7] X. Fan, F. Toni, On computing explanations in abstract argumentation, in: ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic - Including Prestigious Applications of Intelligent Systems (PAIS 2014), volume 263 of Frontiers in Artificial Intelligence and Applications, IOS Press, 2014, pp. 1005–1006. [8] A. Borg, F. Bex, A basic framework for explanations in argumentation, IEEE Intell. Syst. 36 (2021) 25–35. [9] Z. G. Saribatur, J. P. Wallner, S. Woltran, Explaining non-acceptability in abstract argu- mentation, in: ECAI, volume 325 of Frontiers in Artificial Intelligence and Applications, IOS Press, 2020, pp. 881–888. [10] X. Fan, F. Toni, On explanations for non-acceptable arguments, in: TAFA, volume 9524 of Lecture Notes in Computer Science, Springer, 2015, pp. 112–127. [11] X. Fan, F. Toni, On computing explanations in argumentation, in: AAAI, AAAI Press, 2015, pp. 1496–1502. [12] L. Amgoud, S. Vesic, Two roles of preferences in argumentation frameworks, in: ECSQARU, volume 6717, Springer, 2011, pp. 86–97. [13] S. Modgil, Reasoning about preferences in argumentation frameworks, Artif. Intell. 173 (2009) 901–934. [14] T. J. M. Bench-Capon, Value-based argumentation frameworks, in: NMR, 2002, pp. 443–454. [15] T. J. M. Bench-Capon, Persuasion in practical argument using value-based argumentation frameworks, J. Log. Comput. 13 (2003) 429–448. [16] K. Noor, A. Hunter, Analysing product reviews using probabilistic argumentation, in: COMMA, volume 326, IOS Press, 2020, pp. 295–306. [17] A. Hunter, K. Noor, Aggregation of perspectives using the constellations approach to probabilistic argumentation, in: AAAI, AAAI Press, 2020, pp. 2846–2853. [18] G. Brewka, S. Ellmauthaler, H. Strass, J. P. Wallner, S. Woltran, Abstract dialectical frameworks: An overview, in: Handbook of Formal Argumentation, College Publications, 2018, pp. 237–285. 11 Atefeh Keshavarzi Zafarghandi et al. CEUR Workshop Proceedings 1–12 [19] J. S. Chall, E. Dale, Readability revisited: The new Dale-Chall readability formula, Brookline Books, 1995. [20] M. Li, P. Vitányi, et al., An introduction to Kolmogorov complexity and its applications, volume 3, Springer, 2008. [21] G. Brewka, S. Woltran, Abstract dialectical frameworks, in: Proceedings of the Twelfth International Conference on the Principles of Knowledge Representation and Reasoning (KR 2010), 2010, pp. 102–111. 12