<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Community-based Stance Detection</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Emanuele</forename><surname>Brugnoli</surname></persName>
							<email>emanuele.brugnoli@sony.com</email>
							<affiliation key="aff0">
								<orgName type="department">Sony Computer Science Laboratories Rome</orgName>
								<orgName type="institution">Joint Initiative CREF-SONY</orgName>
								<address>
									<addrLine>Piazza del Viminale 1</addrLine>
									<postCode>00184</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Centro Studi e Ricerche Enrico Fermi (CREF)</orgName>
								<address>
									<addrLine>Piazza del Viminale 1</addrLine>
									<postCode>00184</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Dipartimento di Fisica</orgName>
								<orgName type="institution">Sapienza Università di Roma</orgName>
								<address>
									<addrLine>P.le A. Moro 2</addrLine>
									<postCode>00185</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Donald</forename><surname>Ruggiero</surname></persName>
							<email>donaldruggiero.losardo@sony.com</email>
						</author>
						<author>
							<persName><forename type="first">Lo</forename><surname>Sardo</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Sony Computer Science Laboratories Rome</orgName>
								<orgName type="institution">Joint Initiative CREF-SONY</orgName>
								<address>
									<addrLine>Piazza del Viminale 1</addrLine>
									<postCode>00184</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Centro Studi e Ricerche Enrico Fermi (CREF)</orgName>
								<address>
									<addrLine>Piazza del Viminale 1</addrLine>
									<postCode>00184</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Dipartimento di Fisica</orgName>
								<orgName type="institution">Sapienza Università di Roma</orgName>
								<address>
									<addrLine>P.le A. Moro 2</addrLine>
									<postCode>00185</postCode>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Lo</surname></persName>
						</author>
						<author>
							<affiliation key="aff3">
								<orgName type="department">Tenth Italian Conference on Computational Linguistics</orgName>
								<address>
									<addrLine>Dec 04 -06</addrLine>
									<postCode>2024</postCode>
									<settlement>Pisa</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Community-based Stance Detection</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">E76F4A48C17F3A36677D80BCAC808C7B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:38+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Stance Detection</term>
					<term>Polarisation</term>
					<term>Social Networks</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Stance detection is a critical task in understanding the alignment or opposition of statements within social discourse. In this study, we present a novel stance detection model that labels claim-perspective pairs as either aligned or opposed. The primary innovation of our work lies in our training technique, which leverages social network data from X (formerly Twitter). Our dataset comprises tweets from opinion leaders, political entities and news outlets, along with their followers' interactions through retweets and quotes. By reconstructing politically aligned communities based on retweet interactions, treated as endorsements, we check these communities against common knowledge representations of the political landscape. Our training dataset consists of tweet/quote pairs where the tweet comes from a political entity and the quote either originates from a follower who exclusively retweets that political entity (treated as aligned) or from a user who exclusively retweets a political entity from an opposing ideological community (treated as opposed). This curated subset is used to train an Italian language model based on the RoBERTa architecture, achieving an accuracy of approximately 85%. We then apply our model to label all tweet/quote pairs in the dataset, analyzing its out-of-sample predictions. This work not only demonstrates the efficacy of our stance detection model but also highlights the utility of social network structures in training robust NLP models. Our approach offers a scalable and accurate method for understanding political discourse and the alignment of social media statements.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Stance detection is a critical task within the domain of natural language processing (NLP). It involves identifying the position or attitude expressed in a piece of text towards a specific topic, claim, or entity <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. Traditionally, stances are classified into three primary categories: favor, against, and neutral. This classification enables a detailed description of textual data, facilitating a deeper insight into public opinion and discourse dynamics.</p><p>In recent years, the proliferation of digital communication platforms such as social media, forums, and online news outlets has resulted in an unprecedented volume of user-generated content. This surge underscores the necessity for automated systems capable of efficiently analyzing and interpreting these vast text corpora. Stance detection addresses this need by providing tools that can systematically assess opinions and reactions embedded within texts, thus offering valuable applications across various fields including social media analysis <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, search engines <ref type="bibr" target="#b4">[5]</ref>, and linguistics <ref type="bibr" target="#b5">[6]</ref>.</p><p>According to the last report of World Economic Fo-rum <ref type="bibr" target="#b6">[7]</ref>, the increase in societal polarization features among the top three risks for democratic societies. While a macroscopic increase of polarization has been observed, an understanding of the microscopic pathways though which it develops is still an open field of research. Through stance detection it would be possible to reconstruct these pathways down to the individual text-comment pairs. Stance detection, has been explored across various fields with differing definitions and applications. Du Bois introduces the concept of the stance triangle, where stance-taking involves evaluating objects, positioning subjects, and aligning with others in dialogic interactions, emphasizing the sociocognitive aspects and intersubjectivity in discourse <ref type="bibr" target="#b5">[6]</ref>. Sayah and Hashemi focus on academic writing, analyzing stance and engagement features like hedges, self-mention, and appeals to shared knowledge to understand communicative styles and interpersonal strategies <ref type="bibr" target="#b7">[8]</ref>. <ref type="bibr">Küçük</ref> and Can define stance detection as the classification of an author's position towards a target (favor, against, or neutral), highlighting its importance in sentiment analysis, misinformation detection, and argument mining <ref type="bibr" target="#b8">[9]</ref>. These diverse approaches underscore the multifaceted nature of stance detection and its applications in enhancing the understanding of social discourse, academic rhetoric, and online content analysis. For a review of the recent developments of the field we refer to Alturayeif et al. <ref type="bibr" target="#b1">[2]</ref> and AlDayel et al. <ref type="bibr" target="#b2">[3]</ref>.</p><p>In this work, we propose a novel approach to training stance detection models by leveraging the interactions within highly polarized communities. Our method utilizes tweet/quote pairs from the Italian political debate to construct a robust training set. We operate under the assumption that users who predominantly retweet a particular political profile are likely in agreement with the statements made by that profile. We restricted our analysis to retweet since this form of communication primarily aligns with the endorsement hypothesis <ref type="bibr" target="#b9">[10]</ref>. Namely, being a simple re-posting of a tweet, retweeting is commonly thought to express agreement with the claim of the tweet <ref type="bibr" target="#b10">[11]</ref>. Further, though retweets might be used with other purposes such as those described by Marsili <ref type="bibr" target="#b11">[12]</ref>, the repeated nature of the interaction we observe in our networks reduces the probability that the activity falls outside of the endorsement behavior.</p><p>Conversely, while quoting a tweet works similarly to retweeting, the function allows users to add their own comments above the tweet. This makes this form of communication controversial regarding the endorsement hypothesis, as agreement or disagreement with the tweet depends on the stance of the added comment. On the other hand, the information social media users see, consume, and share through their news feed heavily depends on the political leaning of their early connections <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref>. In other words, while algorithms are highly influential in determining what people see and shaping their onplatform experiences <ref type="bibr" target="#b14">[15]</ref>, there is significant ideological segregation in political news exposure <ref type="bibr" target="#b15">[16]</ref>. It is therefore reasonable to expect that users who almost exclusively retweet a political entity (party, leader, or both) use quote tweets to express agreement with statements posted by that entity and disagreement with statements posted by political entities ideologically distant from their preferred one. Additionally, the quote interaction perfectly encapsulates the stance triangle described by Du Bois <ref type="bibr" target="#b5">[6]</ref>.</p><p>In order to correctly assess political opposition we construct a retweet network and use the Louvain community detection algorithm <ref type="bibr" target="#b16">[17]</ref> to characterize leaders and, through label propagation, the followers that align with their views.</p><p>Through these community labels we construct a dataset of claim-perspective couples by annotating tweetquote pairs from profiles that clearly express political alignment as favor and annotating tweet-quote pairs in which the profiles come from different communities as against. Finally, we use a pretrained BERT model for Italian language and fine-tune it to the classification task.</p><p>This methodology aims to enhance the accuracy of stance detection models by incorporating real-world patterns of agreement and disagreement observed in polarized online environments. Further, it enables an unsupervised training paradigm that can be scaled to very large datasets.</p><p>In the following sections, we will outline the data gathering approach used for the dataset. Subsequently, we will describe the community detection methods employed to identify leaders and users within the Italian political discourse. We will then discuss the model architecture and its training process. In the results section, we will evaluate the model's performance and present our findings. Finally, the conclusion will address potential future developments, the implications of our work, and its limitations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Results</head><p>In this study, we focus on a comprehensive set of Italian opinion leaders active on Twitter/X, including the official profiles of major news media outlets as well as prominent politicians and political parties. The profiles of news media outlets are further classified according to assessments provided by NewsGuard, which categorize them as either questionable or reliable sources. This classification is crucial for evaluating the quality of the information these outlets disseminate, particularly regarding their reputation for spreading misinformation. For the selected leaders, we collected all tweets produced from January 2018 to December 2022. The general public (followers) is identified based on their RTs to the content produced by these leaders. See Materials and Methods for details on the data collection process. Using this node configuration, we construct a bipartite network with two layers: leaders and followers, where the links represent the number of RTs by the latter of tweets made by the former. If a group of followers retweets tweets from two different leaders, it indicates that these leaders are likely communicating similar messages or viewpoints. To analyze these relationships more deeply, we perform a monopartite projection onto the leader layer. This projection, detailed in Materials and Methods, simplifies the network by concentrating solely on the leaders and the connections between them that are inferred from their shared followers. Panel (A) of Figure <ref type="figure" target="#fig_0">1</ref> shows the RT network of leaders aggregated in terms of communities identified through an optimized version of the Louvain algorithm <ref type="bibr" target="#b16">[17]</ref>. The a posteriori analysis of the political leaders in each group reveals that the clustering algorithm effectively identified communities that align with the political affiliations of the leaders in each cluster <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b18">19]</ref>. Specifically, the Left-leaning community includes political entities such as +Europa, Azione, Enrico Letta, and Nicola Fratoianni; the Right-leaning community features leaders from FdI, FI, and Lega; and the Five Star Movement (M5S) community includes key figures like Giuseppe Conte and Luigi Di Maio. An interesting observation from the network configuration is the clustering of questionable news sources. These profiles consistently group within the same com- munity, suggesting a potential alignment or affinity with specific political leanings or ideologies.</p><p>Leveraging the political bias of followers in our Twitter network, we build a very large dataset of tweet-quote pairs, each annotated with the corresponding stance (favor or against), as better described in Materials and Methods. Since this method assigns the stance to each pair in an unsupervised manner, to ensure that our approach is performing correctly, we randomly selected 500 pairs (250 favor and 250 against) and manually annotated their stance. We then compared the results of the automatic annotation with the manual annotation. The results, shown in Appendix -Table <ref type="table">3</ref>, indicate a high level of accuracy in favor and against classifications, with a small number of neutral cases. The dataset serves as training set for fine-tuning UmBERTo <ref type="bibr" target="#b19">[20]</ref>, an Italian language model based on the RoBERTa architecture <ref type="bibr" target="#b20">[21]</ref>, to assign stance labels to claim-perspective pairs. The fine-tuning process is performed using 5-fold cross-validation. The optimal performance for each fold is assessed by measuring the accuracy, i.e., the ratio of correctly predicted instances (both true favor and true against) to the total number of instances. The best-trained models from each fold demonstrate nearly identical performance, as shown by the average accuracy and F1-scores reported in the following table. The best model from fold 3 is identified Table <ref type="table">1</ref> Average performance of the best models from each fold on the training set and the test set. The table reports the mean and standard deviation (SD) for each metric considered: Accuracy for the overall model, and F1-score for each individual class.</p><p>as the highest performing and is therefore used in the following analyses. The corresponding confusion matrices for both the training and test sets are provided in Appendix -Table <ref type="table" target="#tab_3">5</ref>.</p><p>Given the imbalance in the label distribution of the claim-perspective dataset, we use 41, 347 pairs -each annotated as favor and previously removed to create a balanced training set -as an additional test set to evaluate the model's performance. The model achieves an accuracy of 83.6% when predicting the stance of these pairs.</p><p>The model is then applied to classify all the collected tweet-quote pairs based on their stance. Thus, following the same procedure used to construct the RT network of leaders, we develop the stance network and analyze its community structure. In this case, the weight of a link in the bipartite follower-leader network represents the positive difference between the number of favoring and against quotes from a follower on the leader's tweets. Panel (B) of Figure <ref type="figure" target="#fig_0">1</ref> shows the stance network of leaders aggregated in terms of communities identified through the Louvain algorithm. The node positions in this representation are the same as those in the RT network, providing a consistent framework for comparison. More formally, to evaluate the differences in clustering assignments between nodes present in both the retweet network and the stance network, we perform a clustering comparison. Namely, we use the contingency table <ref type="bibr" target="#b21">[22]</ref> associated with both the representations to compute community overlap. Figure <ref type="figure" target="#fig_2">2</ref> shows the comparison results broken down by source type: political entities and news outlets. While clusters C and D of the stance network primarily align with clusters 2 and 3 of the RT network, respectively, clusters A and B of the stance network mainly represent a refinement of cluster 1 from the RT network. This suggests that even in the stance network, the emerging communities align with the political affiliations of the leaders within each cluster. Although the tweet-quote pairs used to train the model include only tweets from political entities, the result is significant. The training set does not include pairs where the quote comes from a follower who exclusively retweets political entities from the same ideological community as the tweet's author. This demonstrates the model's ability to reconstruct communities through precise classification of textual pairs.</p><p>The contingency table for news outlets, while displaying less pronounced patterns overall, still demonstrate clear coherence in classification between the retweet network and the stance network. This is particularly remarkable considering that these profiles were not included in the model's training set. The recovery of the retweet network's community structure within the stance network suggests that the model successfully generalizes across profiles with differing linguistic constraints, with only a minimal loss in accuracy, while still allowing for the reconstruction of group affiliations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Discussion</head><p>Stance detection remains a vital yet challenging area in natural language processing (NLP), traditionally limited by the constraints of supervised learning. The availability of large language corpora, where interaction networks can be reconstructed, offers a novel approach that incorporates the social and dynamic aspects of stance, as outlined by Du Bois in his work on the stance triangle <ref type="bibr" target="#b5">[6]</ref>.</p><p>Our model addresses a more complex task compared to other state-of-the-art models. While existing models typically classify a user's stance on specific topics, our model classifies claim-perspective pairs into favor and against categories. This requires a deeper analysis of the relational stance between multiple interacting users and their statements.</p><p>Despite this increased complexity, our model achieved results comparable to those of existing state-of-the-art models <ref type="bibr" target="#b22">[23,</ref><ref type="bibr" target="#b23">24]</ref>. This success supports the hypothesis that in-group/out-group determinants, well-documented in opinion dynamics, significantly explain the variation in behaviors <ref type="bibr" target="#b24">[25]</ref>.</p><p>Moreover, our model's ability to reconstruct communities based on the accurate classification of textual pairs (as shown in Figure <ref type="figure" target="#fig_2">2</ref>) underscores its potential for community reconstruction in scenarios where the interaction network is not provided.</p><p>Importantly, this approach also opens avenues for studying network dynamics based on the probability of agreement between account pairs. This has significant implications for understanding and potentially mitigating coordinated attacks, such as disinformation campaigns and political propaganda. By identifying patterns of agreement and disagreement, we can better detect and analyze the strategies behind these coordinated efforts, enhancing our ability to safeguard democratic processes and public discourse.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Materials and Methods</head><p>Data Collection. Our dataset comprises approximatively 15 million tweets collected by monitoring the activity of 583 profiles that reflect Italian online social dialogue (e.g., La Repubblica, Il Corriere della Sera, Il Giornale). Profiles were selected based on the list of news sites monitored by NewsGuard, a news rating agency dedicated to assigning reliability scores. According to NewsGuard, this list covers approximately 95% of online engagement with news, providing near-comprehensive coverage of news-related dialogue <ref type="bibr" target="#b25">[26]</ref>.</p><p>Additionally, we included Italian political entities in the list of profiles. This inclusion encompasses all major political parties and their leaders (e.g., Giorgia Meloni and Fratelli d'Italia, Elly Schlein and PD, Giuseppe Conte and M5S). For a complete list of the monitored political profiles see Appendix -Table <ref type="table" target="#tab_2">4</ref>.</p><p>For each monitored profile, we collected all tweets from January 2018 to December 2022 using the Twitter/X API before the limitations introduced by the new management <ref type="foot" target="#foot_0">1</ref> . We also gathered all retweets (RTs) and quotes (QTs) of this content within the same time frame, limited to those tweets that gained at least 20 RTs or 10 QTs. The following table provides a detailed breakdown of the data matching these criteria. Community Detection. In order to reconstruct the discourse communities from the twitter activity we built a retweet network. In the context of the data collection strategy previously described, most RTs are from a nonmonitored user (a follower) to one of the users monitored (a leader), excluding a few RTs from one leader to another (45, 299). We can therefore consider this network as a bipartite network, i.e. a network where all links are from one node type to another, with 367 leaders and 934, 394 followers, connected through links with a weight 𝑤 𝑥𝑖 equal to the number of RTs from the follower 𝑥 to the leader 𝑖.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Category</head><p>To identify communities among leaders we assume that leaders with the same readership are more likely to be in the same political community. We therefore constructed a monopartite network by projecting on the leader layer, i.e. we construct a network from the set of all length two paths assigning weights that are the product of the path's links.</p><p>We used the Bipartite Weighted Configuration Model (BiWCM) to statistically validate our bipartite projection <ref type="bibr" target="#b26">[27]</ref>. BiWCM accounts for weighted interactions and preserves the strength of nodes in both layers, ensuring that our observed co-occurrences are not due to random chance but represent genuine structural patterns in the data. In order to find political communities in the network, we applied the Louvain algorithm 1000 times and selected the solution that minimized modularity, i.e., the strength of division of the network into clusters, with higher values indicating a structure where more edges lie within communities than would be expected by chance <ref type="bibr" target="#b27">[28]</ref>.</p><p>The same procedure was followed to construct the stance network and study its community structure. In this case, the weight of a link in the bipartite followerleader network indicates the fraction of favoring quotes from the follower to the leader's tweets. Claim-Perspective Pairs Selection. To construct a dataset of claim-perspective text pairs annotated with the corresponding stance (favor if the perspective supports the claim, against otherwise), we first identified users who clearly expressed an (almost) absolute preference for a single political entity through their retweet activity. Specifically, for each follower, we calculated the distribution of their RTs across the political entities defined in Table <ref type="table" target="#tab_2">4</ref>. Then, we filtered those who allocated at least 80% of their RTs to a single political entity. Some users, although meeting the previous requirement, may not have had a sufficient level of retweet activity during the analyzed period to be considered inclined towards a particular political entity. For example, a user who has only given one retweet to the set of political profiles would appear totally inclined towards a particular entity. To reduce the uncertainty arising from the indiscriminate inclusion of all profiles satisfying the high retweet activ-ity requirement for a single political entity, we calculated for each follower 𝑥 the total number of retweets of content produced by the set of political entities 𝒫 defined in Table <ref type="table" target="#tab_2">4</ref> and excluded the bottom 80% of the resulting distribution (i.e., we imposed |RT 𝑥 (𝒫 )| &gt; 7). For the remaining users, we then assigned the label favor to those quotes of tweets from their preferred political entity and the label against to those quotes of tweets from entities belonging to other political communities, as determined by the community detection analysis. This procedure resulted in the creation of a dataset containing 243, 277 unique claim-perspective (tweet-quote) pairs, each annotated with the corresponding stance. Since the label distribution of the dataset was unbalanced towards favor (specifically, 142, 312 favor and 100, 965 against), we randomly removed 41, 347 favor pairs to obtain a balanced training set for the stance model. The removed pairs were later used as additional test set to evaluate the model's accuracy. Stance model. We initialized our model starting from UmBERTo <ref type="bibr" target="#b19">[20]</ref>, an Italian language model based on the RoBERTa architecture <ref type="bibr" target="#b20">[21]</ref>. Specifically, we relied on the cased version trained using SentencePiece tokenizer and Whole Word Masking on a large corpus, encompassing around 70 GB of text. This makes it highly effective for various natural language processing tasks in Italian, as it leverages a vast and diverse dataset to understand the nuances of the language <ref type="bibr" target="#b28">[29,</ref><ref type="bibr" target="#b29">30]</ref>. The pretrained model was then fine-tuned on the constructed dataset of tweetquote pairs to create a tool capable of inferring the stance of claim-perspective text pairs: favor if the perspective agrees with the claim, and against otherwise. To input the text pairs into the pretrained model, we utilized Um-BERTo's special tokens. Specifically, we concatenated the tweet and quote as &lt;s&gt; + tweet + &lt;/s&gt;&lt;/s&gt; + quote + &lt;/s&gt;, where &lt;s&gt;, &lt;/s&gt;&lt;/s&gt;, and &lt;/s&gt; represent the start, separation, and end tokens, respectively. Since we set max_seq_length = 256, which limits the total number of tokens that can be processed by the model, in cases where the concatenated strings exceeded this limit, the longer text between the tweet and the quote was truncated. This ensures that the input remains within the model's processing capacity while preserving as much information as possible from both texts. Conversely, shorter concatenated strings were padded using the special token &lt;pad&gt; until they reached the 256-token limit. Tweets and quotes were preprocessed before being concatenated by removing URLs, mentions, non-UTF-8 characters, line breaks, and tabs.</p><p>The pretrained UmBERTo model was imported into Python from the HugginFace Transformers library <ref type="bibr" target="#b30">[31]</ref> as a model for sequence classification. The fine-tuning procedure enabled the model to output the probability dis-tribution over the stance labels by minimizing the crossentropy loss between the predicted labels and the true labels, effectively learning to classify the stance of claimperspective pairs. We chose to perform 5-fold crossvalidation to ensure the reliability of the results <ref type="bibr" target="#b31">[32]</ref>. Namely, the data was first partitioned into 5 equally (or nearly equally) sized segments or folds. Subsequently 5 iterations of training and testing are performed such that within each iteration a different fold of the data is heldout for testing while the remaining 4 folds are used for learning. Thus, for each training-test split, we fine-tuned the UmBERTo model for 4 epochs using a batch size of 64 (for both training and testing) and an improved version of the Adam optimizer <ref type="bibr" target="#b32">[33]</ref> with a learning rate of 5𝑒 − 5 and a weight decay of 0.01 for regularization. The chosen hyperparameters are among those recommended in the literature <ref type="bibr" target="#b33">[34,</ref><ref type="bibr" target="#b20">21]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>This study introduces a novel stance detection model that significantly advances the understanding of alignment and opposition in social discourse. By leveraging social network data from X (formerly Twitter), we developed a robust training technique that utilizes interactions within politically aligned communities. Our approach involved curating a dataset of tweet/quote pairs, where the quotes are derived from users' interactions with leaders and politicians. This dataset facilitated the training of a BERT model, which achieved a state of the art accuracy of approximately 85%.</p><p>Our findings underscore the efficacy of using social network structures to train NLP models, demonstrating that retweet interactions can serve as reliable indicators of political alignment. This methodology not only enhances the scalability of stance detection but also offers a nuanced understanding of political discourse on social media platforms. By reconstructing and validating politically aligned communities through expert knowledge, our model provides a robust framework for analyzing the alignment of social media statements.</p><p>The implications of this work extend beyond stance detection, offering potential applications in monitoring political sentiment, identifying misinformation, and understanding public opinion dynamics. Future research could explore the integration of additional social network features and exploring the capacity of the model to generalize to other domains, interaction types and understanding how stance propagates within networks.</p><p>Additionally, investigating the role of specific linguistic markers like adverbs across different languages and cultures can reveal universal and language-specific determinants of stance.</p><p>While our model shows promising results, it also relies heavily on the assumption that retweets are mainly a form of endorsement, and that quotes within one's own political community are all in agreement and that outside of one's political community they are all in disagreement. While the high level of polarization observed in these networks support the validity of these assumptions, it also restricts the applicability of the model to domains where polarization is evident and these assumptions are valid. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Automatic</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 3</head><p>Comparison between manual and automatic annotation for 500 randomly selected tweet-quote pairs. The F1 score for the Favor category is 0.86, and for the Against category, it is 0.86 as well. These results indicate a strong agreement between manual and automatic annotation methods, especially considering that the unsupervised stance classification method does not account for labels other than Favor and Against, while some contents were manually classified as Neutral. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Political entity</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Projection of the follower-leader bipartite network onto the layer of leaders. In both (A) and (B), the edges represent connections between leaders based on follower activity. (A) The edge weights are derived from the number of shared followers who retweeted content from both leaders. (B) The edge weights are based on the positive difference between favoring and against quote tweets made by shared followers on the content produced by the two leaders. In these visualizations, the node positions remain constant, providing a consistent framework for comparison. Node colors refer to communities as a result of running an optimized version of the Louvain algorithm. Nodes frame colors refer to the different types of leaders: political entities (azure), questionable news sources (dark red), and reliable news sources (dark blue).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>(10 −5 ) 0.863 (10 −5 ) 0.864 (10 −5 ) Test 0.846 (10 −6 ) 0.846 (10 −6 ) 0.846 (10 −5 )</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Contingency table associated with retweet network and stance network. Data is broken down by source type: political entities and news outlets.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 2</head><label>2</label><figDesc>Breakdown of the dataset.</figDesc><table><row><cell></cell><cell>Profiles</cell><cell>Tweets</cell><cell>RTs</cell><cell>QTs</cell></row><row><cell>News</cell><cell cols="4">329 279, 793 16, 365, 178 3, 587, 830</cell></row><row><cell>Politics</cell><cell cols="4">38 101, 017 15, 385, 363 2, 388, 621</cell></row><row><cell>TOTAL</cell><cell cols="4">367 380, 810 31, 750, 541 5, 976, 451</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 4</head><label>4</label><figDesc>List of Twitter profiles related to the main political entities active in Italy during the five-year period 2018-2022.</figDesc><table><row><cell></cell><cell></cell><cell cols="3">Twitter profiles</cell></row><row><cell>+Europa</cell><cell></cell><cell cols="3">piu_europa, emmabonino</cell></row><row><cell cols="2">Articolo Uno</cell><cell cols="3">articolounodp, robersperanza</cell></row><row><cell>Azione</cell><cell></cell><cell cols="3">azione_it, carlocalenda</cell></row><row><cell cols="2">Cambiamo!</cell><cell cols="2">giovannitoti</cell></row><row><cell cols="2">Coraggio Italia</cell><cell cols="3">coraggio_italia, luigibrugnaro</cell></row><row><cell cols="5">Democrazia e Autonomia movimentodema</cell></row><row><cell cols="2">Europa Verde</cell><cell cols="3">europaverde_it, angelobonelli1</cell></row><row><cell>FdI</cell><cell></cell><cell cols="3">giorgiameloni, fratelliditalia</cell></row><row><cell>FI</cell><cell></cell><cell cols="3">forza_italia, berlusconi</cell></row><row><cell>ItalExit</cell><cell></cell><cell cols="2">gparagone</cell></row><row><cell>IV</cell><cell></cell><cell cols="3">italiaviva, matteorenzi</cell></row><row><cell>Lega</cell><cell></cell><cell cols="3">legasalvini, matteosalvinimi</cell></row><row><cell>M5S</cell><cell></cell><cell cols="3">giuseppeconteit, mov5stelle, luigidimaio</cell></row><row><cell cols="2">ManifestA</cell><cell cols="2">manifesta_it</cell></row><row><cell>NcI</cell><cell></cell><cell cols="2">maurizio_lupi</cell></row><row><cell>PD</cell><cell></cell><cell cols="3">pdnetwork, enricoletta, sbonaccini, ellyesse</cell></row><row><cell cols="2">Potere al Popolo</cell><cell cols="3">potere_alpopolo</cell></row><row><cell cols="2">Rifondazione comunista</cell><cell cols="2">direzioneprc</cell></row><row><cell>SI</cell><cell></cell><cell cols="3">si_sinistra, nfratoianni</cell></row><row><cell cols="2">Unione di Centro</cell><cell cols="2">antoniodepoli</cell></row><row><cell cols="2">Unione Popolare</cell><cell cols="3">unione_popolare, demagistris</cell></row><row><cell></cell><cell cols="3">Predicted</cell></row><row><cell></cell><cell cols="3">Favor Against</cell><cell>Σ</cell></row><row><cell>Actual</cell><cell>Favor 70, 690 Against 10, 517</cell><cell></cell><cell>10, 082 70, 255</cell><cell>80, 772 80, 722</cell></row><row><cell></cell><cell>Σ 81, 207</cell><cell></cell><cell>80, 337</cell><cell>161, 544</cell></row><row><cell></cell><cell cols="3">(a) training set</cell></row><row><cell></cell><cell cols="3">Predicted</cell></row><row><cell></cell><cell cols="3">Favor Against</cell><cell>Σ</cell></row><row><cell>Actual</cell><cell cols="2">Favor 16, 929 Against 2, 740</cell><cell>3, 264 17, 453</cell><cell>20, 193 20, 193</cell></row><row><cell></cell><cell cols="2">Σ 19, 669</cell><cell>20, 717</cell><cell>40, 386</cell></row><row><cell></cell><cell cols="2">(b) test set</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 5</head><label>5</label><figDesc>Confusion matrices for both the (a) training and (b) test sets.</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://twitter.com/XDevelopers/status/1621026986784337922</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We extend our deepest gratitude to Vittorio Loreto, the director of the Sony Computer Science Laboratories (CSL) and Professor at La Sapienza University of Rome, for his invaluable support and sponsorship of this research. His guidance was pivotal for the successful completion of our study. We also thank the anonymous reviewers for their insightful suggestions, which have greatly contributed to enhancing the quality of this work.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Stance detection: Concepts, approaches, resources, and outstanding issues</title>
		<author>
			<persName><forename type="first">D</forename><surname>Küçük</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Can</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="2673" to="2676" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A systematic review of machine learning techniques for stance detection and its applications</title>
		<author>
			<persName><forename type="first">N</forename><surname>Alturayeif</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Luqman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ahmed</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Neural Computing and Applications</title>
		<imprint>
			<biblScope unit="volume">35</biblScope>
			<biblScope unit="page" from="5113" to="5144" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">It is more than what you say!: Leveraging user online activity for improved stance detection</title>
		<author>
			<persName><forename type="first">A</forename><surname>Aldayel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Magdy</surname></persName>
		</author>
		<ptr target="https://2019.ic2s2.org/" />
	</analytic>
	<monogr>
		<title level="m">International Conference on Computational Social Science</title>
				<imprint>
			<date type="published" when="2019">2019. IC2S2 2019</date>
			<biblScope unit="page" from="20" to="27" />
		</imprint>
	</monogr>
	<note>Conference date</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Automatic stance detection for twitter data</title>
		<author>
			<persName><forename type="first">A</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mehta</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2022 1st International Conference on Informatics (ICI), IEEE</title>
				<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="223" to="225" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Explainable cross-topic stance detection for search results</title>
		<author>
			<persName><forename type="first">T</forename><surname>Draws</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Natesan Ramamurthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Baldini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dhurandhar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Padhi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Timmermans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Tintarev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2023 Conference on Human Information Interaction and Retrieval</title>
				<meeting>the 2023 Conference on Human Information Interaction and Retrieval</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="221" to="235" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The stance triangle, Stancetaking in discourse: Subjectivity, evaluation</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Bois</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">interaction</title>
		<imprint>
			<biblScope unit="volume">164</biblScope>
			<biblScope unit="page" from="139" to="182" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<ptr target="https://www.weforum.org/publications/global-risks-report-2024/" />
		<title level="m">World Economic Forum, Global Risks Report 2024</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
		<respStmt>
			<orgName>World Economic Forum</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical Report</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Exploring stance and engagement features in discourse analysis papers</title>
		<author>
			<persName><forename type="first">L</forename><surname>Sayah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Hashemi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Theory &amp; Practice in Language Studies (TPLS)</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Stance detection: A survey</title>
		<author>
			<persName><forename type="first">D</forename><surname>Küçük</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Can</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys (CSUR)</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="page" from="1" to="37" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Extracting significant signal of news consumption from social networks: the case of Twitter in Italian political elections</title>
		<author>
			<persName><forename type="first">C</forename><surname>Becatti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Caldarelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Lambiotte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Saracco</surname></persName>
		</author>
		<idno type="DOI">10.1057/s41599-019-0300-3</idno>
	</analytic>
	<monogr>
		<title level="j">Palgrave Communications</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Tweet, tweet, retweet: Conversational aspects of retweeting on twitter</title>
		<author>
			<persName><forename type="first">D</forename><surname>Boyd</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Golder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lotan</surname></persName>
		</author>
		<idno type="DOI">10.1109/HICSS.2010.412</idno>
	</analytic>
	<monogr>
		<title level="m">2010 43rd Hawaii International Conference on System Sciences</title>
				<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Retweeting: Its linguistic and epistemic value</title>
		<author>
			<persName><forename type="first">N</forename><surname>Marsili</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Synthese</title>
		<imprint>
			<biblScope unit="volume">198</biblScope>
			<biblScope unit="page" from="10457" to="10483" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Neutral bots probe political bias on social media</title>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Pacheco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K.-C</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Menczer</surname></persName>
		</author>
		<idno type="DOI">10.1038/s41467-021-25738-6</idno>
	</analytic>
	<monogr>
		<title level="j">Nature Communications</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Like-minded sources on facebook are prevalent but not polarizing</title>
		<author>
			<persName><forename type="first">B</forename><surname>Nyhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Settle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Thorson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Wojcieszak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Barberá</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Allcott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Crespo-Tenorio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dimmery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Freelon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gentzkow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>González-Bailón</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Guess</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kennedy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">M</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lazer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Malhotra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Moehler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Thomas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Tromble</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">V</forename><surname>Rivera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wilkins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">K</forename><surname>De Jonge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Franco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Mason</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">J</forename><surname>Stroud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Tucker</surname></persName>
		</author>
		<idno type="DOI">10.1038/s41586-023-06297-w</idno>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<imprint>
			<biblScope unit="volume">620</biblScope>
			<biblScope unit="page" from="137" to="144" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Crossplatform impact of social media algorithmic adjustments on public discourse</title>
		<author>
			<persName><forename type="first">P</forename><surname>Gravino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Lo Sardo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Brugnoli</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2405.00008</idno>
	</analytic>
	<monogr>
		<title level="j">ArXiv</title>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Asymmetric ideological segregation in exposure to political news on facebook</title>
		<author>
			<persName><forename type="first">S</forename><surname>González-Bailón</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lazer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Barberá</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Allcott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Crespo-Tenorio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Freelon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gentzkow</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Guess</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Iyengar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">M</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Malhotra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Moehler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Nyhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">V</forename><surname>Rivera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Settle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Thorson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Tromble</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wilkins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Wojcieszak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Kiewiet De Jonge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Franco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Mason</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Stroud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Tucker</surname></persName>
		</author>
		<idno type="DOI">10.1126/science.ade7138</idno>
	</analytic>
	<monogr>
		<title level="j">Science</title>
		<imprint>
			<biblScope unit="volume">381</biblScope>
			<biblScope unit="page" from="392" to="398" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Fast unfolding of communities in large networks</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">D</forename><surname>Blondel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-L</forename><surname>Guillame</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Lambiotte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lefebvre</surname></persName>
		</author>
		<idno type="DOI">10.1088/1742-5468/2008/10/P10008</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Statistical Mechanics: Theory and Experiment</title>
		<imprint>
			<biblScope unit="volume">10008</biblScope>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Fine-grained clustering of social media: How moral triggers drive preferences and consensus</title>
		<author>
			<persName><forename type="first">E</forename><surname>Brugnoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Gravino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Lo Sardo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Loreto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Prevedello</surname></persName>
		</author>
		<idno type="DOI">10.5220/0012595000003636</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th International Conference on Agents and Artificial Intelligence, ICAART 2024</title>
				<editor>
			<persName><forename type="first">A</forename><forename type="middle">P</forename><surname>Rocha</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Steels</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><forename type="middle">J</forename><surname>Van Den Herik</surname></persName>
		</editor>
		<meeting>the 16th International Conference on Agents and Artificial Intelligence, ICAART 2024<address><addrLine>Rome, Italy</addrLine></address></meeting>
		<imprint>
			<publisher>SCITEPRESS</publisher>
			<date type="published" when="2024">February 24-26, 2024. 2024</date>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="1405" to="1412" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Entropy-based detection of twitter echo chambers</title>
		<author>
			<persName><forename type="first">M</forename><surname>Pratelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Saracco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Petrocchi</surname></persName>
		</author>
		<idno type="DOI">10.1093/pnasnexus/pgae177</idno>
	</analytic>
	<monogr>
		<title level="j">PNAS Nexus</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page">e177</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Umberto: an italian language model trained with whole word masking</title>
		<author>
			<persName><forename type="first">L</forename><surname>Parisi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Francia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Magnani</surname></persName>
		</author>
		<ptr target="https://github.com/musixmatchresearch/umberto" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Du</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Joshi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Levy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Stoyanov</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1907.11692</idno>
		<title level="m">Roberta: A robustly optimized bert pretraining approach</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Analysis of contingency tables under cluster sampling</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Brier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biometrika</title>
		<imprint>
			<biblScope unit="volume">67</biblScope>
			<biblScope unit="page" from="591" to="596" />
			<date type="published" when="1980">1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Embeddings-based clustering for target specific stances: The case of a polarized turkey</title>
		<author>
			<persName><forename type="first">A</forename><surname>Rashed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kutlu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Darwish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Elsayed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bayrak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International AAAI Conference on web and social media</title>
				<meeting>the International AAAI Conference on web and social media</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page" from="537" to="548" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Qiao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Yan</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2301.01123</idno>
		<title level="m">Mgtab: A multi-relational graphbased twitter account detection benchmark</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Outgroup animosity drives engagement on social media</title>
		<author>
			<persName><forename type="first">S</forename><surname>Rathje</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">J</forename><surname>Van Bavel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Van Der Linden</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the National Academy of Sciences</title>
		<imprint>
			<biblScope unit="volume">118</biblScope>
			<biblScope unit="page">e2024292118</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<author>
			<persName><surname>Newsguardtech</surname></persName>
		</author>
		<ptr target="https://www.newsguardtech.com/wp-content/uploads/2022/01/NewsGuard-Social-Impact-Report-1.21.22.pdf" />
		<title level="m">Social impact report 2021</title>
				<imprint>
			<date type="published" when="2022-11-27">2022. Nov 27, 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Inferring comparative advantage via entropy maximization</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bruno</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mazzilli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Patelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Squartini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Saracco</surname></persName>
		</author>
		<idno type="DOI">10.1088/2632-072X/ad1411</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Physics: Complexity</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page">45011</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Finding and evaluating community structure in networks</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E J</forename><surname>Newman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Girvan</surname></persName>
		</author>
		<idno type="DOI">10.1103/PhysRevE.69.026113</idno>
	</analytic>
	<monogr>
		<title level="j">Phys. Rev. E</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="page">26113</biblScope>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">FEEL-IT: Emotion and sentiment classification for the Italian language</title>
		<author>
			<persName><forename type="first">F</forename><surname>Bianchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Nozza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hovy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics</title>
				<editor>
			<persName><forename type="first">O</forename><forename type="middle">De</forename><surname>Clercq</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Balahur</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Sedoc</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Barriere</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Tafreshi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Buechel</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Hoste</surname></persName>
		</editor>
		<meeting>the Eleventh Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="76" to="83" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">How &quot;bertology&quot; changed the stateof-the-art also for italian nlp</title>
		<author>
			<persName><forename type="first">F</forename><surname>Tamburini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020</title>
				<meeting>the Seventh Italian Conference on Computational Linguistics, CLiC-it 2020</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Wolf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Debut</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Sanh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chaumond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Delangue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cistac</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Rault</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Louf</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Funtowicz</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1910.03771</idno>
		<title level="m">Huggingface&apos;s transformers: State-ofthe-art natural language processing</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Refaeilzadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Tang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Liu</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-0-387-39940-9_565</idno>
		<title level="m">Cross-Validation</title>
				<meeting><address><addrLine>Boston, MA</addrLine></address></meeting>
		<imprint>
			<publisher>Springer US</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="532" to="538" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<monogr>
		<author>
			<persName><forename type="first">I</forename><surname>Loshchilov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Hutter</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1711.05101</idno>
		<title level="m">Decoupled weight decay regularization</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-W</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Toutanova</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1810.04805</idno>
		<title level="m">Bert: Pre-training of deep bidirectional transformers for language understanding</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
