<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Sentimental Agents: Exploring Deliberation, Cognitive Biases, and Decision-making in LLM-based Multiagent Systems</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Elizabeth</forename><forename type="middle">A</forename><surname>Ondula</surname></persName>
							<email>ondula@usc.edu</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Southern California</orgName>
								<address>
									<settlement>Los Angeles</settlement>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Daniele</forename><surname>Orner</surname></persName>
							<email>daniele@braveventurelabs.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Brave Venture Labs</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nick</forename><forename type="middle">Mumero</forename><surname>Mwangi</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Brave Venture Labs</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Casandra</forename><surname>Rusti</surname></persName>
							<email>rusti@usc.edu</email>
							<affiliation key="aff1">
								<orgName type="institution">University of Southern California</orgName>
								<address>
									<settlement>Los Angeles</settlement>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Sentimental Agents: Exploring Deliberation, Cognitive Biases, and Decision-making in LLM-based Multiagent Systems</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">D8EA20FA1074AFF09D14B104F22026BE</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T18:56+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Multi-Agent Systems</term>
					<term>Large Language Models</term>
					<term>Sentiment Analysis</term>
					<term>Cognitive Biases</term>
					<term>Decision-Making</term>
					<term>Opinion Dynamics</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>How does sentiment affect deliberative opinion dynamics in multi-agent systems using Large Language Models (LLMs)? In this paper, we introduce Sentimental Agents, a framework designed to study collaborative decision-making in a society of agents, each equipped with a distinct Mental Model of Self. We propose a method to integrate sentiment analysis and a non-Bayesian update mechanism, to analyze and interpret agents' beliefs and interactions systematically. This method allows us to observe the volatility of the sentiment associated with different agent statements, as well as the change in opinion throughout the agents' conversation. We further use it to model and compare collaborative decision-making approaches. We situate these agents in a simulated Human Resource recruiting environment as a case study to evaluate a candidate's fit for a role. We present a set of metrics to assess the quality of the agents' output. Finally, we explore cognitive biases in the agents' individual and collective opinion formation, a fundamental step to enhance decision-making capabilities and mitigate distortions in the system and the agents' collective reasoning.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Multi-agent systems (MAS), composed of interactive agents have been pivotal in modeling social phenomena, decision-making processes and collaborative tasks. Large Language Models (LLMs) such as GPT-4 <ref type="bibr" target="#b0">[1]</ref> have opened new possibilities for exploring complex social dynamics through the simulation of linguistic interactions among agents. These models can provide the necessary capabilities for simulating communication scenarios. Integrating LLMs into MAS facilitates the study of conversations and interaction patterns in a more detailed manner.</p><p>LLMs have demonstrated exceptional performance in generating text that embodies sentiment and in executing sentiment analysis tasks <ref type="bibr" target="#b1">[2]</ref>. However, the effect of sentiment on deliberative opinion dynamics within an artificial society of agents is a domain that has not yet been fully explored. Traditional agent models may not adequately account for the influence of behavioral states like sentiment and cognitive biases on the decisionmaking process. Our work adopts a nuanced approach to understanding how the output of LLM agents influences one another within these frameworks.</p><p>We introduce Sentimental Agents, a framework designed to study and analyze collaborative decision processes. These agents are not only equipped with language capabilities but also possess a unique Mental Model of Self. This allows them to process and exhibit behaviors that can offer a comprehensive view of how opinions are formed and evolve in a multi-agent setting.</p><p>Our system is designed primarily to observe and describe agents' behavior, rather than to design or direct it. We do not currently include objectives, reward functions, utility metrics or payoffs in our model. The focus is on the natural evolution of interactions among agents without imposing external incentives or goals. Our study concentrates on non-strategic interactions. Unlike strategic agents, which model the behavior of others and act based on these predictions, our non-strategic agents do not possess such models. This distinction is crucial as it means our agents are not engaging in behaviors such as scheming or deceiving to achieve a specific objective. If LLM-based multi-agent systems are ultimately to be used to support decision-making, it is critical to understand and explain how their decisions are made. This is especially true in the hypothetical case of such systems being designed to evaluate, rank or recommend humans. At present, there are no unified solutions that can system- atically analyze the opinions and interactions of these agents, and the potential correlation between the two. To remediate this, we make the following key contributions:</p><p>• We develop a framework, Sentimental Agents [],</p><p>to explore and study collective decision-making processes in a society of agents. • We propose using sentiment analysis as a method to quantify content generated by LLM-based agents for evaluation and recommendation tasks. • We propose a method to apply a non-Bayesian model for opinion dynamics within a multi-agent system. This offers a perspective on how opinions are formed and altered in a sentiment-driven environment.</p><p>Additionally, this work introduces metrics for assessing the quality of conversation and decision-making in language model-based multi-agent systems. These metrics, namely nuance, platitudinal score, drift, and defensibility, offer a toolkit for evaluating the effectiveness of such systems in diverse scenarios. Furthermore, we evaluate cognitive biases including negativity, positivity, and saliency biases. This assessment offers valuable insights into the cognitive influences and tendencies within multiagent decision-making processes. Finally, the framework is applied in a simulated Human Resource recruiting environment, serving as a practical case study. This application not only validates the theoretical model but also highlights the practical potential of the approach in real-world settings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Works</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Multi-Agent Collaboration</head><p>In the study of multi-agent systems, understanding how agents collaborate to achieve collective objectives is essential. One interesting approach, explored <ref type="bibr" target="#b2">[3]</ref> examines the use of LLMs in multi-agent settings with a focus on "Theory of Mind" (TOM), which is the ability of an agent to understand and predict the mental states and intentions of others. Although crucial for collaboration, our focus looks more at how agents make decisions rather than understanding others' mental states. Other studies, like those of <ref type="bibr" target="#b3">[4]</ref> look at how agents can debate and make collective decisions using a method known as gradual semantics, where agents exchange arguments and progressively update their opinions to reach a shared decision. Our approach is different in that it explains the agent interactions and decision processes leveraging a mental model of self and sentiment tracking. Further, our agents don't have access to other agents' memories.</p><p>[5] explores how agents coordinate in complex tasks that necessitate both working together on the same task (cooperation) and dividing the task into smaller parts to be done individually (divide-and-conquer). This study highlights the need for flexible strategies to manage tasks that require both joint and individual efforts, differing from our work which doesn't focus on specific task coordination but rather on general deliberations on various topics.</p><p>Similarly, <ref type="bibr" target="#b5">[6]</ref> demonstrate the potential of collaborative mechanisms with LLMs in enhancing social interactions among agents, providing valuable insights into how these technologies can foster collaborative intelligence within multi-agent settings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">LLM-based Multi-Agent Frameworks</head><p>An LLM-based agent is defined as an AI system comprising three core components: the brain, perception, and action modules <ref type="bibr" target="#b6">[7]</ref>. The brain module stores knowledge and memories, facilitating information processing and decision-making, essential for reasoning and handling new tasks. The perception module extends the agent's sensory capabilities to include textual, auditory, and vi-sual modalities. This enhances its understanding of the environment. The perception module extends the agent's sensory capabilities to include textual, auditory, and visual modalities. This enhances its understanding of the environment. The action module enables the agent to perform physical tasks and interact with its environment.</p><p>In terms of operating mechanism, the agent use natural language for communication, with the brain processing information from the perception module to form strategies and make decisions. In our work, we introduce the concept of a Mental Model of Self (MMS). This concept has been discussed in social psychology <ref type="bibr" target="#b7">[8]</ref>. It refers to an integrated theory and understanding that an agent forms to organize and make sense of one's self-knowledge, experiences and memories into broader principles that can guide anticipation of future behaviors and consequences.</p><p>In our implementation, it serves as an important organizational function in making sense of self-knowledge.We summarize and show differences between the Sentimental Agents framework and prior works in Table <ref type="table" target="#tab_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Non-strategic Multi-Agent Systems</head><p>Opinion dynamics has been extensively explored for over six decades, predominantly in the fields of sociology and psychology. It delves into the mechanisms and principles that dictate the formation and alteration of individual opinions under the influence of others. This involves examining a range of models and frameworks to comprehend collective behaviors and the process of consensus formation <ref type="bibr" target="#b8">[9]</ref>. Our work focuses on a nonstrategic model within opinion dynamics, meaning the model does not incorporate game theory principles, nor does it involve agents optimizing specific utilities. Non-Bayesian updating, in this context, signifies a process wherein opinions are modified not based on a factual or probabilistic framework that converts prior probabilities into posterior probabilities. Instead, this approach entails agents updating their opinions influenced by the views of others, without basing these on an unknown state of nature. The updating mechanism in such models can be either synchronous, where all agents update their opinions simultaneously, or asynchronous, where updates occur at different times. A recent survey categorizes and discusses various models prevalent in existing literature <ref type="bibr" target="#b9">[10]</ref>.</p><p>We further use Sentiment Analysis to investigate opinions which manifest as either positive or negative <ref type="bibr" target="#b10">[11]</ref>. Studies have shown that generative models, such as Large Language Models (LLMs), are capable of producing text, which can include opinions with specific sentiments, depending on their application <ref type="bibr" target="#b11">[12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Evaluating LLM-based Systems</head><p>Evaluation for LLMs is emerging as a discipline to assess the performance of different of AI systems. Currently, for LLMs, there is no single benchmark or protocol that emerges as universally superior. This reflects the diversity of tasks and model capabilities. <ref type="bibr" target="#b1">[2]</ref> provides an exhaustive summary and discussion based on existing works. This work covers evaluation tasks, methods and benchmarks that are crucial for assessing the performance of LLMs. In our work, we adopt a nuanced approach to evaluation. We define specific metrics to assess the conversation quality. These metrics include nuance, platitudinal score, drift and defensibility scores, which are detailed in Section 5.6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Preliminaries</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Conversation protocols</head><p>Consider a conversational simulation system with a set of agents denoted as ℳ = {𝑚 1 , 𝑚 2 , … , 𝑚 𝑛 }. Each agent 𝑚 𝑖 ∈ ℳ is initialized with a Mental Model of Self (MMS) and a memory component for storing an opinion log. In this system, the engagement among agents in each round 𝑡 is ordered with equal participation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 1. Argument (𝐴) is a component of an opinion that contributes to its overall sentiment.</head><p>For each argument 𝐴 a sentiment value 𝑆 𝐴 is assigned, mapping the argument to a spectrum of sentiment values (positive, negative, neutral, and their intensities):</p><formula xml:id="formula_0">𝑆 𝐴 = 𝑓 (𝐴)<label>(1)</label></formula><p>where 𝑓 ∶ 𝐴 ↦ 𝑆 𝐴 is the sentiment mapping function for arguments.</p><p>Definition 2. 𝑂(𝑚 𝑖 , 𝑡) is the opinion 𝑚 𝑖 in a given round 𝑡 is a set of arguments 𝐴.</p><p>The sentiment of an opinion 𝑆 𝑂 is the average of the sentiment values 𝑆 𝐴 of all its arguments:</p><formula xml:id="formula_1">𝑆 𝑂 (𝑚 𝑖 , 𝑡) = 1 |𝑂(𝑚 𝑖 , 𝑡)| ∑ 𝐴∈𝑂(𝑚 𝑖 ,𝑡) 𝑆 𝐴<label>(2)</label></formula><p>The Ordered Engagement in the system is represented by a function 𝐸 ∶ ℳ × 𝑡 → 𝑚 𝑖 , which establishes the speaking order of agents in each round 𝑡. Under this model, each agent 𝑚 𝑖 contributes exactly one opinion per round. The collective state of opinions at any given round 𝑡 is represented as a vector:</p><formula xml:id="formula_2">𝑋 𝐸 (𝑡) = [𝑂(𝑚 1 , 𝑡), 𝑂(𝑚 2 , 𝑡), … , 𝑂(𝑚 𝑛 , 𝑡)]<label>(3)</label></formula><p>In each conversation round 𝑡, the sentiment value 𝑆 𝑂 𝑚 𝑖 , 𝑡 for each agent 𝑚 𝑖 is updated to reflect the sentiment of the</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Related Work</head><p>Sentiment Analysis Engagement type Memory Decision module Bias Evaluation <ref type="bibr" target="#b12">[13]</ref> No Ordered Belief No Confirmation bias <ref type="bibr" target="#b13">[14]</ref> No Ordered Store/Retrieve Yes No <ref type="bibr" target="#b14">[15]</ref> No Ordered Internal critic Yes Fact-checking <ref type="bibr" target="#b15">[16]</ref> No Varies Chat history Yes No <ref type="bibr" target="#b16">[17]</ref> No Ordered Specialized roles Yes No <ref type="bibr" target="#b17">[18]</ref> No Ordered User-driven Yes User-preference <ref type="bibr" target="#b18">[19]</ref> No Ordered Rationale analysis Yes Credibility check <ref type="bibr" target="#b19">[20]</ref> No newly formed opinion. This process considers the sentiment values 𝑆 𝐴 of the arguments within the opinion 𝑂.</p><p>The sentiment update is executed using a Non-Bayesian method, mathematically represented by:</p><formula xml:id="formula_3">𝑆 𝑖 𝑂 (𝑡) = 𝛼 ⋅ ( 1 |𝐴| ∑ 𝐴∈𝑂(𝑚 𝑖 ,𝑡) 𝑆 𝐴 ) + (1 − 𝛼) ⋅ 𝑆 𝑖 𝑂 (𝑡 − 1)<label>(4)</label></formula><p>Here, 𝑆 𝑖 𝑂 (𝑡) represents the average sentiment of all the arguments expressed by agent 𝑚 𝑖 at round 𝑡, with each argument 𝐴 having its sentiment value 𝑆 𝐴 = 𝑓 (𝐴). The parameter 𝛼 is a weighting factor that determines the influence of the new opinion's average sentiment on the agent's updated sentiment.</p><p>The change in sentiment Δ𝑆 𝑖 𝑂 (𝑡) for agent 𝑚 𝑖 is then calculated as the absolute difference between the updated sentiment value 𝑆 𝑖 𝑂 (𝑡) at round 𝑡 and the agent's previous sentiment value 𝑆 𝑖 𝑂 (𝑡 − 1) at round 𝑡 − 1:</p><formula xml:id="formula_4">Δ𝑆 𝑖 𝑂 (𝑡) = |𝑆 𝑖 𝑂 (𝑡) − 𝑆 𝑖 𝑂 (𝑡 − 1)| (5)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Collective decision protocols</head><p>When the conversation ends we take the total sentiment. We have the final sentiment score and we have the average of the 𝑆 0 for the gut feeling protocol Definition 3. Borda Count Protocol: A method to collectively rank a list of items, given each individual's order of preference.</p><p>Given 𝑛, the number of items, each agent 𝑚 𝑖 ranks these items. The point assignment for an item 𝑗 by agent 𝑚 𝑖 is 𝑃 𝑚 𝑖 ,𝑗 , with the top-ranked item receiving 𝑛 points and the last receiving 1 point. The total points for each item 𝑗 is calculated as 𝑇 𝑗 = ∑ |ℳ| 𝑖=1 𝑃 𝑚 𝑖 ,𝑗 , and items are ranked in descending order of their total points 𝑇 𝑗 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 4. Tiered List Protocol: A method to collectively classify a list of items in 3 tiers, given the items that each individual can't accept, and the items they like the most.</head><p>The Valence 𝑉 𝑗 for each item 𝑗 is determined based on the sentiment of opinion 𝑆 𝑂 . For 𝑆 𝑂 &lt; −0.5, 𝑉 𝑗 = −1; for −0.5 ≤ 𝑆 𝑂 ≤ 0.5, 𝑉 𝑗 = 0; and for 𝑆 𝑂 &gt; 0.5, 𝑉 𝑗 = 1. Items are classified into three tiers according to 𝑉 𝑗 : Tier 1 for 𝑉 𝑗 = 1, Tier 2 for 𝑉 𝑗 = 0, and Tier 3 for 𝑉 𝑗 = −1.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 5. Gut-feeling List Protocol: A method to collectively rank a list of items based on the confidence of individuals' feeling toward each item.</head><p>The volatility 𝜈 𝑚 𝑖 ,𝑗 of agent 𝑚 𝑖 's sentiment towards item 𝑗 over several rounds is calculated. Conviction 𝐼 𝑚 𝑖 ,𝑗 is derived as a function of both volatility 𝜈 𝑚 𝑖 ,𝑗 and the final sentiment score 𝑆 𝑚 𝑖 ,𝑗 for item 𝑗. The Gut-feeling list is then generated using a Borda count based on 𝐼 𝑚 𝑖 ,𝑗 for each item across all agents, and items are ranked based on the total Conviction points 𝑇 𝐼 𝑗 = ∑ |ℳ| 𝑖=1 𝐼 𝑚 𝑖 ,𝑗 in descending order.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Applying the Framework</head><p>Our framework is applied to a simulated environment inspired by Human Resource recruiting to evaluate the effectiveness of Sentimental Agents. These agents are designed to generate opinions reflecting their unique expertise, contributing to collective decision-making. The simulation explores opinion formation and decision-making processes within an LLM-based multi-agent setting, mirroring real-world HR recruitment where employers assess candidates through discussions with various experts. In this context, LLM-based agents are expected to engage in conversation and form diverse opinions that influence their decision-making in a simulated recruiting scenario.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Configuration</head><p>In the HR recruiting simulation, advisor agents analyze candidates' CVs and engage in discussions to provide opinions about each candidate. These agents, with expertise in roles like Chief Financial Officer (CFO), Vice President of Engineering, and Recycling Plant Manager, evaluate profiles and generate text reports. They also score candidates and, through collective decision-making protocols like the Borda Count, rank candidates or select the top performers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.1.">Dataset</head><p>We sourced our dataset from the study conducted by <ref type="bibr" target="#b20">[21]</ref>. This dataset is a collection of resumes represented in a multi-label format. To facilitate easy access and integration of this dataset into our framework, we have developed a script that automates the process of downloading and parsing the data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Sentimental Agents Framework</head><p>The system design as shown in Fig <ref type="figure" target="#fig_0">1</ref>, consists of 7 modules. We describe each of them here.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Brief Module</head><p>The module provides a configuration interface for system initialization with four components: input type, output type, task type, and context. It handles single and multiple item formats for input and output and requires user-defined context specifying task object and subject, with optional Knowledge base integration. Predefined rules in the module automatically associate Input, Output, and Task Types. The logic enforces specific task types Evaluate, Score, Classify for single-item inputs and broader tasks for multi-item inputs. For rank tasks, the output is structured as a list to match task requirements. This design ensures alignment between input/output formats and system functionality.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Agent Initialization</head><p>The Agent Initialization module includes two main elements: Mental Model of Self (MMS) and Memory with qualitative and quantitative opinion logs. It configures agent interaction types for opinion formation as dynamic or independent. The module requires user input to set the number of agents and their expertise, which informs the creation of detailed agent profiles, including priorities, objectives, and evaluation criteria. Figure <ref type="figure" target="#fig_1">3</ref> shows an instance of an MMS. Key parameters include the tolerance level, affecting opinion change propensity, and the drift metric, which tracks MMS variability. Strategies for maintaining agent consistency involve controlled character prompts and setting MMS prompt temperature. The opinion formation, generated via boolean input, influences the nature of agents' decision-making processes. Each agent's opinion log is stored in a central memory system, ensuring decision-making is based on comprehensive and transparent data. Figure <ref type="figure">2</ref> shows agent initialization prompt.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 2: Series of prompts used to create a group of agents</head><p>Mental Model of Self, for one instance of the system</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3.">Opinion Dynamics Module</head><p>This module coordinates agent conversations and decision-making, consisting of conversation and decisionmaking protocols. It focuses on: defining the number of agents, engagement type and stopping mechanism. The current implementation employs ordered engagement with equal participation. For the stop mechanism, the module uses a non-strategic approach, differentiating from strategic interactions. In this non-strategic context, conversations conclude based on non-Bayesian updating as shown in Algorithm 1 , where they end once agents' opinions reach stability. This contrasts with strategic interactions, which involve different mechanisms like rewards or objectives.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.4.">Conversation Module</head><p>This module analyzes agents' statements in conversations, comprising four components: Argumentation, which breaks down statements into arguments for qualitative logging; Sentiment Analysis, which evaluates and quantitatively logs the sentiment of each argument; Opinion Change, using non-Bayesian updating to monitor sentiment shifts; and Conversation Trends, gauging significant changes across rounds to infer opinion stabilization and conversation conclusion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.5.">Decision Module</head><p>The Decision Making Protocol module is designed to accommodate various decision-making protocols, including Borda Count, Tiered List, and Gut Feeling List, as detailed in the preliminaries (Section 3). It operates by capturing the final sentiment of each agent and the average sentiment throughout the conversation. The functionality and outcomes of these different decision-making processes are further explored and discussed in the results (Section 6).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.6.">Evaluation and Cognitive Bias Modules</head><p>This module evaluates the quality of conversations through various metrics.</p><p>• Nuance: Examines the diversity of themes and perspectives, quantified by the number of topics identified within individual statements or the entire conversation. • Platitudinal Score: Calculated using cosine similarity, it measures the uniqueness of outcomes in the conversation rounds, with higher scores indicating less similarity between different runs. • Drift: Assesses the stability of each agent's Mental Model of Self, monitoring the relevance of results to the advisors' profiles and checking for consistency throughout the conversation. • Defensibility: Evaluates the strength and evidence backing of the agents' arguments, ensuring they are well-supported and referenceable.</p><p>In this research, we examine three cognitive biases: negativity, positivity, and saliency. Negativity bias might lead agents to give undue weight to adverse opinions <ref type="bibr" target="#b21">[22]</ref> [23], while positivity bias could result in an overemphasis on favorable views <ref type="bibr" target="#b23">[24]</ref>  <ref type="bibr" target="#b24">[25]</ref>. Saliency bias, on the other hand, might cause agents to focus on the most prominent or emotionally striking aspects of an opinion, potentially overshadowing other relevant information <ref type="bibr" target="#b25">[26]</ref>. Increment 𝑡 12: end for</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Algorithm 1 Non-Bayesian Updating</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Experiments</head><p>In this study, we aim to investigate the dynamics of sentiment and opinion formation in an LLM-based multiagent system. We focus on understanding how agents' opinions evolve through deliberation, and how sentiment influences their decision-making processes. Our research questions are as follows:</p><p>1. How do agents' opinions change as a result of deliberating with each other, and can we quantify these changes? 2. Do agents adopt each other's arguments during the deliberation process, and can we observe this in qualitative results? 3. Does the sentiment of an argument (valence, arousal) affect its adoption by other agents? 4. Do agents exhibit cognitive biases in their opinion formation, and how can we identify and mitigate these biases?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1.">Experimental Setting</head><p>In our experiments, we conducted the simulation with 3 agents and 10 candidates. We used the data set within the simulation environment described in Section 4. For the LLM, we used the gpt-3.5-turbo-0613 version of ChatGPT <ref type="bibr">[27]</ref>. For the result shown, the language model parameters were set as alpha 𝛼 = 0.5, tolerance = 0.00001, and temperature = 1.5.</p><p>(a) </p><formula xml:id="formula_5">(b) (c) (d) (e) (f) (g) (h) (i) (j)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.1.">Evaluation Metrics</head><p>The non-Bayesian updating data from the simulation, shown in Figure <ref type="figure">4</ref>, reveals sentiment fluctuations among agents. For instance, Figure <ref type="figure">5a</ref> shows the VP of Engineering exhibiting the most dramatic change, especially in the final round. This volatility, captured by sentiment and change metrics, highlights the dynamic nature of opinion formation in multi-agent conversations and suggests that agents' opinions evolve and respond to the unfolding discourse, emphasizing the effectiveness of non-Bayesian updating in capturing real-time perspective shifts. 1  Platitudinal score. The inter-agent similarities heatmap shown in Figure <ref type="figure">5</ref> reveals a contrast in sentiment alignment among the agents. This divergence contributes to an overall lower platitudinal score for this specific run for the given candidate. Such diversity in sentiment, as captured by the platitudinal metric, underscores the variation in decision-making approaches within the agent group, emphasizing the balance between consensus and individual thought in the simulation outcomes. 1 For brevity, we only show results for 5 candidates, but the experiment was conducted with 10 candidates for the platitudinal scores, Drift scores. In Table <ref type="table">3</ref> it is observed that the CFO agent generally exhibits moderate drift, while the VP of Engineering (VPE) and the Recycling Plant Manager (RPM) show higher drift values, suggesting a more dynamic adaptation of their MMS in response to the conversation. This variability in drift signifies the agents' differing levels of adaptability and potential reevaluation of their initial stances (LDA) to extract topics from text statements. The data is preprocessed by tokenization and removal of stop words and unwanted words. A dictionary and corpus are constructed using the Gensim library. The LDA model identifies 5 topics, with the top 10 words per topic being most significant. Figure <ref type="figure" target="#fig_4">6</ref> and ?? show the number of unique words per topic and word clouds for each candidate, respectively. Defensibility Scores Candidate resumes are processed through the Langchain embedding 2 and transformed into a format suitable for detailed analysis. The llama index libraries, VectorStoreIndex and ServiceContext, are used to create an indexed repository of the vectorized documents. This index serves as a searchable database, allowing efficient retrieval of text segments that are contextually similar to a given input. When evaluating agents' arguments, the indexed space is searched to find text segments from the resumes that closely match the argument. The similarity between an agent's argument and the retrieved text is quantified as a score, with higher scores indicating stronger support for the argument. If no relevant text is found, a score of zero is assigned, suggesting an unsupported argument.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Candidate</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.2.">Cognitive Bias Testing</head><p>We hypothesize that agents' updates in sentiment during conversational rounds might be influenced by their peers' positive, negative, or prominent opinions. To investigate this, we chart each agent's sentiment change from the second round onwards, against the recent sentiments of other agents. This analysis reveals the correlation between an agent's changing sentiment and the influence of peer opinions.</p><p>We apply Ordinary Least Squares (OLS) regression to analyze nega tive and positive sentiments separately, setting the y-intercept at zero to indicate that neutral peer statements might not impact an agent's sentiment. Analyzing the regression's strength (𝑅 2 ) and the slope, as 2 https://github.com/langchain-ai/langchain well as the data point distribution, provides insights into the cognitive tendencies of the agents. Additionally, by adjusting our three parameters, alpha, tolerance, and temperature, we aim to better understand how these factors affect agents' cognitive biases. This study offers important insights into the decision-making processes in multiagent systems, particularly in sentiment-influenced contexts.</p><p>In our sensitivity analysis, we varied key parameters: setting alpha to 0.3, 0.5, and 0.7; tolerance to 0.001, 0.005, and 0.0001; and temperature to 0.7, 1, and 1.5, to evaluate their impact on sentiment changes. The outcome, depicted in Figure <ref type="figure" target="#fig_5">7</ref> for ten random candidates, provides insight into negativity and positivity biases through the slopes of the OLS regressions. Our findings on this variation of model parameters show a modest positivity bias, evidenced by the positive slope being approximately 29% steeper than its negative counterpart. A slight positivity or negativity bias trend persisted across varied parameter settings, with some scenarios, notably alpha = 0.3, tolerance = 0.005, and temperature = 1.5, showing a more pronounced positivity bias with a slope more than twice as steep on the positive side than on the negative side.</p><p>The absence of saliency bias was noted in all experiments, as indicated by slopes remaining below 1. Linear regression was determined as the most suitable model based on our evaluation of the 𝑅 2 values. Notably in the shown experiment, agents displayed a tendency towards expressing stronger negative sentiments, with the most negative reaching -0.76, compared to a maximum positive sentiment of 0.62. This inclination towards stronger negative expressions was marked in most scenarios. Additionally, the alpha parameter was observed to significantly influence sentiment ranges, with lower alpha values yielding more constrained ranges.</p><p>For future studies, we aim to extend our examination of the cognitive bias to larger candidate sample sizes. This expansion will enable us to deepen our understanding of how parameter tuning influences cognitive biases and decision-making processes within our framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2.3.">Collective decision-making</head><p>The decision-making data reveals diverse agent preferences, as evidenced by the variation in candidate ranks across Borda Count, Tier, and Conviction. We use the average sentiment score, 𝑆 𝑖 𝑂 (𝑡), from equation 4, where 𝑡 is the last round, as the basis for collective decision-making. While some candidates consistently rank higher or lower, suggesting a consensus on their suitability, discrepancies in ranks among agents could reflect unique valuations of candidate qualities.</p><p>Table <ref type="table" target="#tab_2">4</ref> shows the overall sentiment scores. The CFO shows the highest sentiment score, of 0.46 towards Melissa Baldwin, indicating a strong positive inclination.  The sentiment volatility of the agents, as shown in Table 7, was mostly moderate, indicating strong conviction in their opinions. However, there were instances of high volatility, such as the CFO's sentiment towards Kimberly Carr and Mikayla Garrison, and the VPE's sentiment towards Kimberly, Joshua, and James. The RPM's sentiment was volatile towards Mikayla and Melissa Baldwin. The agents' conviction in their opinions is calculated by dividing the final sentiment by the volatility, with higher values indicating stronger intuition about a candidate's suitability for the role. Gut Feeling Rank for each candidate is a revised ranking that takes into account an agent's conviction in its own sentiment. In Table <ref type="table">9</ref>, the Gut Feeling of the RPM toward Joshua is still to rank him in the first place. But the CFO revises its ranking of Kimberley, from the 9th place to the 8th place. The more generous ranking can be interpreted as a result of the "acknowledgement" of the RPM agent that it is not sure of its opinion toward Kimberly.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sentiment</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusion</head><p>In this paper, we introduce Sentimental Agents, LLM-based agents that generate opinions for collective decision-making within conversational settings. Our pro- </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 9</head><p>Gut feeling for each candidate (Ranking of candidates that combines both the Sentiment Score, and the Conviction an agent has in this sentiment posed framework integrates a non-Bayesian updating mechanism to track sentiment volatility and opinion evolution. In a simulated HR recruiting scenario, we assess these agents' decision-making abilities, noting their diverse opinions and preference shifts over multiple rounds.</p><p>The findings suggest model parameters, such as alpha and tolerance, significantly influence sentiment expression and thus cognitive bias within the system. This research offers a foundation for advanced tool development applicable to domains such as HR recruiting, medical diagnostics, or educational domains.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: The Sentimental Agents framework consists of 7 modules: The Brief, Agent Initialization, Opinion Dynamics, Conversation, Decision, Cognitive Bias and Evaluation Modules.</figDesc><graphic coords="2,89.29,84.19,416.68,139.08" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: An instance of an agent's Mental Model of Self in a simulated HR environment. In this case, the agent took a Job Title as input, and generated a Description, Priorities, and Evaluation Criteria for a given Job Description.</figDesc><graphic coords="6,109.63,84.18,162.69,139.42" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>1: for each round 𝑡 do 2 : 5 5 :</head><label>255</label><figDesc>for each agent 𝑚 𝑖 ∈ ℳ do Δ𝑆 𝑖 𝑂 (𝑡) = |𝑆 𝑖 𝑂 (𝑡) − 𝑆 𝑖 𝑂 (𝑡 − 1)| Equation 4 𝑆 𝑖 𝑂 (𝑡) = 𝛼 ⋅𝑆 𝑂 (𝑚 𝑖 , 𝑡)+(1−𝛼)⋅𝑆 𝑖 𝑂 (𝑡 − 1) if all Δ𝑆 𝑖 𝑂 (𝑡) &lt; threshold for each 𝑚 𝑖 ∈ ℳ or 𝑡 = max_rounds then</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :Figure 5 :</head><label>45</label><figDesc>Figure 4: Sentiment change, and corresponding Opinion Change in conversations for two different candidates. Each conversation stops after five rounds.</figDesc><graphic coords="7,89.29,261.51,79.17,79.17" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: The nuance score for each candidate by showing the number of unique words used across all topics.</figDesc><graphic coords="8,119.80,181.86,142.35,113.88" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 7 :</head><label>7</label><figDesc>Figure 7: The results of the cognitive bias testing using a sample of ten candidates and model parameters of alpha = 0.5, tolerance = 0.00001, and temperature = 1.5.</figDesc><graphic coords="9,89.29,84.19,203.36,139.25" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 A</head><label>1</label><figDesc></figDesc><table><row><cell></cell><cell>Ordered</cell><cell cols="2">Dynamic Memory No</cell><cell>Opinion classifier</cell></row><row><cell>Sentimental Agents Yes</cell><cell>Ordered</cell><cell>Opinion logs</cell><cell>Yes</cell><cell>Cognitive bias</cell></row></table><note>comparison of different language model-based multi-agent frameworks.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 4</head><label>4</label><figDesc>Final Sentiment Scores for each candidate</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Final Sentiment Scores</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>Candidate</cell><cell>CFO VPE</cell><cell>RPM</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Kimberly Carr</cell><cell>-0.51 -0.13 0.26</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Melissa Morgan</cell><cell>-0.74 -0.21 -0.46</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Mikayla Garrison 0.69</cell><cell>0.38</cell><cell>0.59</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Emily Marshall</cell><cell>0.29</cell><cell>-0.29 0.05</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>Justin Davis</cell><cell>-0.01 -0.12 0.37</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Tamara Brown</cell><cell>0.11</cell><cell>0.36</cell><cell>0.20</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Taylor Mahoney</cell><cell>-0.49 -0.55 -0.41</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Joshua Alvarado</cell><cell>0.28</cell><cell>0.58</cell><cell>0.25</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Melissa Baldwin</cell><cell>0.46</cell><cell>0.34</cell><cell>0.17</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">James Wallace</cell><cell>-0.30 0.11</cell><cell>-0.30</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Candidates Rank</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>Candidate</cell><cell>CFO VPE RPM Borda Count Rank</cell></row><row><cell cols="4">Candidate Borda Count Tiered Gut-Feeling Kimberly 9 3 10 Melissa 10 3 8 Mikayla 2 2 1 Emily 3 2 3 Justin 5 2 5 Tamara 6 2 6 Taylor 8 3 7 Joshua 4 2 4 Melissa 1 2 2</cell><cell cols="2">Kimberly Carr Melissa Morgan Mikayla Garrison 2 9 10 Emily Marshall 3 Justin Davis 5 Tamara Brown 6 Taylor Mahoney 8 Joshua Alvarado 4 Melissa Baldwin 1 James Wallace 7</cell><cell>5 6 9 8 4 7 10 3 1 2</cell><cell>3 10 4 6 2 7 9 1 5 8</cell><cell>4 2 7 4 8 3 1 9 10 4</cell></row><row><cell>James</cell><cell>7</cell><cell>2</cell><cell>9</cell><cell>Table 5</cell></row><row><cell cols="2">Table 3 Candidate Ranking Metrics</cell><cell></cell><cell></cell><cell cols="2">Rank of each candidate, including the final Rank taking into account each agent's individual rankings (calculated through Borda count)</cell></row><row><cell cols="4">In contrast, the CFO's lowest sentiment score is -0.74</cell><cell cols="2">to one candidate. Consequently, no candidate was clas-</cell></row><row><cell cols="4">towards Melissa Morgan, signaling a significant negative</cell><cell cols="2">sified as Tier 1, with most classified as Tier 2, except for</cell></row><row><cell cols="4">view. Similarly, the VPE aligns with the CFO in favor-</cell><cell cols="2">the three candidates with negative valence, who were</cell></row><row><cell cols="4">ing Melissa Baldwin with the highest score of 0.34, but</cell><cell>classified as Tier 3.</cell></row><row><cell cols="4">diverges in its lowest sentiment, which is directed to-</cell><cell></cell></row><row><cell cols="4">wards Taylor Mahoney with a score of -0.55. The RPM,</cell><cell></cell><cell>Valence</cell></row><row><cell cols="4">on the other hand, exhibits the most positive sentiment</cell><cell>Candidate</cell><cell>CFO VPE RPM</cell></row><row><cell cols="4">towards Mikayla Garrison with a score of 0.59, while</cell><cell cols="2">Kimberly Carr</cell><cell>-1</cell><cell>0</cell><cell>0</cell></row><row><cell cols="4">sharing the CFO's negative sentiment towards Melissa</cell><cell cols="2">Melissa Morgan</cell><cell>-1</cell><cell>0</cell><cell>0</cell></row><row><cell cols="4">Morgan, albeit at a less intense level of -0.46. The sen-timent scores from the conversations directly influence the ranking of candidates as shown in Table 5. Applying the Borda Count method to the combined rankings yields a collective decision. Although individual agents might rank candidates differently based on their interactions,</cell><cell cols="2">Mikayla Garrison 0 Emily Marshall 0 Justin Davis 0 Tamara Brown 0 Taylor Mahoney 0 Joshua Alvarado 0 Melissa Baldwin 0</cell><cell>0 0 0 0 -1 0 0</cell><cell>0 0 0 0 0 0 0</cell></row><row><cell cols="4">the aggregated results provide a more comprehensive</cell><cell cols="2">James Wallace</cell><cell>0</cell><cell>0</cell><cell>0</cell></row><row><cell cols="4">assessment. This approach demonstrates how sentiment</cell><cell></cell></row><row><cell cols="4">analysis combined with a voting system could inform</cell><cell></cell></row><row><cell cols="3">hiring decisions in a multi-agent setting.</cell><cell></cell><cell></cell></row><row><cell cols="4">The intensity of an agent's final sentiment score deter-</cell><cell></cell></row><row><cell cols="4">mines the valence score. In Table 6, this occurred only</cell><cell></cell></row><row><cell cols="4">three times: the CFO attributed a negative valence to two</cell><cell></cell></row><row><cell cols="4">candidates, while the VPE attributed a negative valence</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 6</head><label>6</label><figDesc>Valence for each candidate</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 7</head><label>7</label><figDesc>Sentiment Volatility for each candidate</figDesc><table><row><cell></cell><cell cols="2">Volatility</cell><cell></cell></row><row><cell>Candidate</cell><cell cols="2">CFO VPE</cell><cell>RPM</cell></row><row><cell>Kimberly Carr</cell><cell>0.58</cell><cell>0.64</cell><cell>0.40</cell></row><row><cell>Melissa Morgan</cell><cell>0.16</cell><cell>0.32</cell><cell>0.47</cell></row><row><cell cols="2">Mikayla Garrison 0.31</cell><cell cols="2">-0.47 0.17</cell></row><row><cell>Emily Marshall</cell><cell>0.17</cell><cell>0.42</cell><cell>0.21</cell></row><row><cell>Justin Davis</cell><cell>0.49</cell><cell>0.16</cell><cell>0.34</cell></row><row><cell>Tamara Brown</cell><cell cols="3">-0.06 -0.27 -0.16</cell></row><row><cell>Taylor Mahoney</cell><cell>0.13</cell><cell>0.26</cell><cell>0.17</cell></row><row><cell>Joshua Alvarado</cell><cell>0.09</cell><cell>0.09</cell><cell>0.42</cell></row><row><cell>Melissa Baldwin</cell><cell>0.14</cell><cell>0.46</cell><cell>0.61</cell></row><row><cell>James Wallace</cell><cell>0.39</cell><cell>0.54</cell><cell>0.39</cell></row><row><cell cols="2">Conviction</cell><cell></cell><cell></cell></row><row><cell>Candidate</cell><cell cols="2">CFO VPE</cell><cell>RPM</cell></row><row><cell>Kimberly Carr</cell><cell cols="3">-0.30 -.0.08 0.11</cell></row><row><cell>Melissa Morgan</cell><cell cols="2">-0.12 -0.07</cell><cell>-0.22</cell></row><row><cell cols="2">Mikayla Garrison 0.21</cell><cell>-0.18</cell><cell>0.10</cell></row><row><cell>Emily Marshall</cell><cell>0.05</cell><cell>-0.12</cell><cell>0.01</cell></row><row><cell>Justin Davis</cell><cell>0.00</cell><cell>-0.02</cell><cell>0.13</cell></row><row><cell>Tamara Brown</cell><cell cols="2">-0.01 -0.10</cell><cell>-0.03</cell></row><row><cell>Taylor Mahoney</cell><cell cols="2">-0.06 -0.14</cell><cell>-0.07</cell></row><row><cell>Joshua Alvarado</cell><cell>0.02</cell><cell>0.05</cell><cell>0.10</cell></row><row><cell>Melissa Baldwin</cell><cell>0.06</cell><cell>0.16</cell><cell>0.10</cell></row><row><cell>James Wallace</cell><cell cols="2">-0.12 0.06</cell><cell>-0.12</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 8</head><label>8</label><figDesc>Conviction for each candidate (the Sentiment Score of given by agents to a candidate, taking into account the Sentiment Volatility during a conversation about this candidate)</figDesc><table /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Gpt-4 technical report</title>
		<author>
			<persName><forename type="first">R</forename><surname>Openai</surname></persName>
		</author>
		<idno>arxiv 2303.08774</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page">13</biblScope>
		</imprint>
	</monogr>
	<note>View in Article 2</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Yi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2307.03109</idno>
		<ptr target="https://arxiv.org/abs/2307.03109" />
		<title level="m">A survey on evaluation of large language models</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">Q</forename><surname>Chong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Stepputtis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Campbell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hughes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sycara</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2310.10701</idno>
		<title level="m">Theory of mind for multi-agent collaboration via large language models</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Multiagent dynamics of gradual argumentation semantics</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">D</forename><surname>De Tarlé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Bonzon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Maudet</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS</title>
				<imprint>
			<date type="published" when="2022">2022. 2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Too many cooks: Bayesian inference for coordinating multi-agent collaboration</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Evans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Tenenbaum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">C</forename><surname>Parkes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kleiman-Weiner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Topics in Cognitive Science</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="414" to="432" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View</title>
		<author>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Deng</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2310.02124[cs" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">Z</forename><surname>Xi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Hong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Zhou</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2309.07864</idno>
		<title level="m">The rise and potential of large language model based agents: A survey</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><surname>Hart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fegley</surname></persName>
		</author>
		<title level="m">Social imitation and the emergence of a mental model of self</title>
				<imprint>
			<date type="published" when="1994">1994</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Opinion Dynamics: Models, Extensions and External Effects</title>
		<author>
			<persName><forename type="first">A</forename><surname>Sîrbu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Loreto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">D P</forename><surname>Servedio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Tria</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-319-25658-0_17</idno>
		<ptr target="Understand-ingComplexSystems" />
	</analytic>
	<monogr>
		<title level="m">Participatory Sensing, Opinions and Collective Awareness</title>
				<editor>
			<persName><forename type="first">V</forename><surname>Loreto</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Haklay</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Hotho</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><forename type="middle">D</forename><surname>Servedio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Stumme</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Theunis</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Tria</surname></persName>
		</editor>
		<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="363" to="401" />
		</imprint>
	</monogr>
	<note>series Title:</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A survey on nonstrategic models of opinion dynamics</title>
		<author>
			<persName><forename type="first">M</forename><surname>Grabisch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rusinowska</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Games</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">65</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><surname>Cambria</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bandyopadhyay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Feraco</surname></persName>
		</author>
		<title level="m">A practical guide to sentiment analysis</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Synthetic text generation for sentiment analysis</title>
		<author>
			<persName><forename type="first">U</forename><surname>Maqsud</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</title>
				<meeting>the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="156" to="161" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Betz</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2104.06737</idno>
		<title level="m">Natural-language multi-agent simulations of argumentative opinion dynamics</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Sun</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2310.06500</idno>
		<title level="m">Metaagents: Simulating interactions of human behaviors for llm-based taskoriented coordination via collaborative generative agents</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Du</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Torralba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Tenenbaum</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Mordatch</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2305.14325</idno>
		<title level="m">Improving factuality and reasoning in language models through multiagent debate</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">C.-M</forename><surname>Chan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Su</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Xue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Liu</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2308.07201</idno>
		<title level="m">Chateval: Towards better llm-based evaluators through multi-agent debate</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sesay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Karlsson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shi</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2309.17288</idno>
		<title level="m">Autoagents: A framework for automatic agent generation</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">ChoiceMates: Supporting Unfamiliar Online Decision-Making with Multi-Agent Conversational Interactions</title>
		<author>
			<persName><forename type="first">J</forename><surname>Park</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Min</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kim</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2310.01331" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">X</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Wang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.01876</idno>
		<ptr target="http://arxiv.org/abs/2311.01876" />
		<title level="m">Sentiment Analysis through LLM Negotiations</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Simulating Opinion Dynamics with Networks of LLM-based Agents</title>
		<author>
			<persName><forename type="first">Y.-S</forename><surname>Chuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Harlalka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Suresh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hawkins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Shah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">T</forename><surname>Rogers</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2311.09618" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note>physics</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Skills prediction based on multi-label resume classification using cnn with model predictions explanation</title>
		<author>
			<persName><forename type="first">K</forename><surname>Jiechieu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tsopze</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00521-020-05302-x</idno>
		<ptr target="https://doi.org/10.1007/s00521-020-05302-x.doi:10.1007/s00521-020-05302-x" />
	</analytic>
	<monogr>
		<title level="j">Neural Computing and Applications</title>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Negative information weighs more heavily on the brain: the negativity bias in evaluative categorizations</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">A</forename><surname>Ito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">T</forename><surname>Larsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">K</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">T</forename><surname>Cacioppo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of personality and social psychology</title>
		<imprint>
			<biblScope unit="volume">75</biblScope>
			<biblScope unit="page">887</biblScope>
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Negativity bias, negativity dominance, and contagion</title>
		<author>
			<persName><forename type="first">P</forename><surname>Rozin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">B</forename><surname>Royzman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Personality and social psychology review</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="296" to="320" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">The Pollyanna principle: Selectivity in language, memory, and thought</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">W</forename><surname>Matlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Stang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1978">1978</date>
			<publisher>Schenkman Publishing Company</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Human language reveals a universal positivity bias</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">S</forename><surname>Dodds</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">M</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Desu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Frank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Reagan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Mitchell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">D</forename><surname>Harris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">M</forename><surname>Kloumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Bagrow</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the national academy of sciences</title>
		<imprint>
			<biblScope unit="volume">112</biblScope>
			<biblScope unit="page" from="2389" to="2394" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">The social perceptual salience effect</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>Inderbitzin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Betella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lanatá</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Scilingo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">U</forename><surname>Bernardet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">F</forename><surname>Verschure</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of experimental psychology: human perception and performance</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="page">62</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
