1. Introduction

The Journal of Politics (2024). doi:10.1086/730737. [32] Y. Krak

10.1007/978-3-031-21101-0_24

Detection of Web Propaganda Patterns by Transformer Neural Networks: Improving Efficiency via Dataset Balancing

0 Khmelnytskyi National University , 11, Instytuts'ka str., Khmelnytskyi, 29016 , Ukraine

2025

3496 0000 0001

In the paper, a proposed approach for improving efficiency of web propaganda patterns detection by transformer neural networks is presented. Approach consists of sequential use of three developed methods: method for dataset balancing, method for fine-tuning individual binary neural network models and method for detecting web propaganda patterns. Compared to existing analogues, the use of proposed approach allowed achieving an efficiency increase of 0.1 by F1 metric when detecting propaganda patterns in web texts using transformer neural networks due to dataset balancing optimization. Analyzing the impact of parameter that determines proportion of texts without web propaganda patterns allows assessing how the models ability to distinguish propaganda patterns from neutral texts and texts with other propaganda patterns. This allows finding the optimal ratio of dataset classes to increase the overall effectiveness for detecting web propaganda patterns. Conducted research has established that the highest results were achieved when forming the training dataset with a percentage of texts without patterns of 30% using the RoBERTa neural network, and was achieved 0.725 by F1 metric. Proposed approach ensures the determination of the optimal ratio between text sets with propaganda patterns and neutral text set, which improving the generalization ability of models and reduce their bias.

eol>web propaganda patterns dataset balancing BERT RoBERTa NLP transformer neural network 1

1. Introduction

In the modern information environment, propaganda content plays a significant role in shaping public opinion, political views, and social behavior [ 1 ]. Social networks have become a key space for disseminating information, but at the same time they are also a tool for manipulative influence [ 2 ]. Algorithmic content distribution, personalized news feeds, and automated recommendation systems contribute to the rapid spread of manipulative messages, which makes it difficult to detect web propaganda patterns using traditional methods [ 3, 4 ]. Since manipulative content can have subtle linguistic markers and adapt to the context [ 5 ], its identification requires the use of context-oriented language models, in particular transformers [ 6 ]. Significant progress in the field of automatic text analysis has made it possible to use neural networks to detect manipulations, but the accuracy of such models largely depends on the training sample. The balance of the sample affects the model's ability to recognize manipulative patterns and distinguish them from neutral or unintentional influence [ 7 ].

The research is closely related to the UN Sustainable Development Goals, as it contributes to the formation of quality education (SDG No. 4) through the development of media literacy and critical thinking [ 8 ]. This allows society to more effectively recognize manipulative content and make informed decisions, which is consistent with the principles of ensuring access to reliable information. In addition, methods for detecting web propaganda patterns in text messages play an important role in maintaining peace, justice and strengthening democratic institutions (SDG No. 16) [ 9, 10 ]. They help combat disinformation, increase the level of transparency of governance and contribute to reducing the impact of manipulation in society, which is a key factor in the sustainable development of the information space [ 11 ].

The aim of paper is to improve the efficiency of detecting propaganda patterns in web texts using transformative neural networks by optimizing the dataset balancing. Research is aimed at reducing the impact of class imbalance, increasing the accuracy of classification and improving the generalization ability of model.

The main paper contribution is created methodology that includes method for fine-tuning individual binary neural network models to detect propaganda patterns, method for balancing the dataset, and method for detecting web propaganda patterns. The paper also provides an analysis of the impact of the balance of the training sample on the effectiveness of models for detecting manipulative patterns in social media. An experimental study of the performance of the BERT and RoBERTa transformative language models depending on the distribution of training examples between classes was conducted. The results obtained contribute to a deeper understanding of the role of the training sample in improving algorithms for detecting manipulative texts and can be used to increase the reliability of automated systems for analyzing the information space.

2. Related Works

The issue of automated detection of web propaganda patterns in social media has widely attracted the attention of researchers.

The research [ 12 ] considers a multimodal and multilingual dataset of propaganda patterns PPN (Propagandist Pseudo-News), which contains news texts collected from web resources that expert organizations have classified as containing manipulation patterns. The study analyzes various NLP approaches that allow identifying the characteristic features that annotators have highlighted and comparing them with the results of automated classification. For this purpose, the following methods are used: VAGO to determine the level of subjectivity and vagueness of statements, TF-IDF as a basic analysis tool, as well as four classification algorithms – two RoBERTa models, CATS, which focuses on syntactic features, and XGBoost, which combines semantic and syntactic features.

In [ 13 ] two architectures for classifying propaganda patterns were analyzed: one involved the use of data augmentation (EDA) methods, and the other worked without them. The models using EDA showed a 3% improvement in F1-measure, reaching 57.57% on the test set. A significant increase in accuracy was observed for manipulation patterns such as "Appeal_to_fear-prejudice", "Exaggeration, Minimisation" and "Repetition", while for individual techniques, in particular "Doubt" and "Flag-Waving", a slight decrease in results was noted. "Causal_Oversimplification" and "Thought-terminating_Cliches" showed the most noticeable improvement. Determination of optimal parameters for classification was carried out by analyzing the number of epochs, the length of text fragments and the learning rate. This allowed the authors to achieve an F1-measure of 44% in the sentiment detection task and 57% in the classification of manipulation patterns.

The authors of [ 14 ] used the RoBERTa language model to detect propaganda patterns in news articles. The model was evaluated on the SemEval-2020 Task 11 reference dataset, which confirmed its effectiveness in recognizing complex manipulation patterns in text. Compared to baseline model, RoBERTa achieved an F1-measure of 60.2%, demonstrating its higher accuracy.

In [ 15 ] the multilingual set of propaganda patterns was created by translating the PTC and WANLP corpora, supplemented with SemEval23 data. Three models were proposed: MultiPropBaseline (an ensemble of GPT-2, mBART and XLM-RoBERTa), MultiProp-ML (meta-learning for languages with minimal data) and MultiProp-Chunk (processing long texts exceeding the token limit). As a result of the experiments, the F1 score for the Polish language was 62.5%.

The study [ 16 ] indicates the ambiguity in the ability of LLMs to recognize propaganda patterns in news texts. Experiments conducted on the annotated SemEval2020 Task 11 corpora demonstrated maximum Recall values of 64.53% and Precision of 81.82%. At the same time, none of the models was able to exceed the baseline F1 score, which was approximately 50%. The highest achieved F1 score was only 20%, which is significantly inferior to the baseline and indicates the limitations of generative models in ensuring reproducibility.

In [ 17 ] emphasize that most previous studies focused on linguistic features to detect manipulation patterns in texts. Therefore, authors propose the method based on meta-learning that allows for automatic identification of semantic manipulation patterns at sentence level in news materials. For this, multi-task learning is used, aimed at detecting semantic contradictions. Proposed approach combines CRF, BiLSTM and pre-trained language models, which provides an F1-measure of 61% for multilingual data and 68.8% for monolingual.

The authors of [ 18 ] evaluate the possibility of using large language models (LLMs), in particular OpenAI GPT-3.5-turbo, to detect propaganda in news articles. The analysis is based on 18 propaganda techniques identified by Martino et al., and covers materials from Russia Today and the SemEval-2020 Task 11 corpus. Using a specially designed prompt, the model determines the presence of propaganda techniques and classifies articles. Qualitative analysis of results allows us to assess effectiveness of LLMs in this task and optimal prompt parameters.

The application of machine learning models to identify manipulation patterns in text content is considered in the study [ 19 ]. Among the analyzed approaches, the Stacking Classifier, which uses feature processing methods, in particular Word2Vec and TF-IDF, demonstrates high adaptability and accuracy. Comparative analysis shows that this model outperforms others, such as Naive Bayes, SVM, KNN, Logistic Regression and Random Forest. The implementation of feature engineering significantly improves the results, which is confirmed by the increase in Accuracy, Precision and F1measure.

The study [ 20 ] considers the application of machine learning methods to detect types of propaganda in the text content of social networks. The authors used data obtained through the social network API to evaluate the effectiveness of various models. The results of the study showed that neural networks, in particular the LSTM architecture, have high accuracy in this task, reaching 77.15%. It is noted that the further implementation of more modern models, such as BERT, can contribute to even better results in future studies.

Paper [21] proposes an ensemble model for identifying manipulation patterns in texts obtained from memes. The authors consider the use of modern pre-trained language models, as well as optimization methods, in particular data augmentation and combining multiple models. The model evaluation was carried out on the SemEval-2021 Task 6 dataset, and the results showed that proposed approach allows achieving an F1-micro measure of 60.4% on the test set.

Authors of [22] used a two-stage process to determine the optimal threshold for classifying manipulation patterns to assess the effectiveness of the model. First, experiments were conducted with macrothresholds in the range from 0.1 to 0.9, the threshold with the highest F1 score was selected, after which microthresholds were added for further optimization. The XLM-RoBERTa models were trained using the Adam optimizer, and early termination was used to prevent overtraining. The Accuracy, Precision, Recall, and F1-measure metrics were used to assess performance at each stage.

From above reviews of scientific publications, it is clear that the issue of balancing datasets in existing methodologies was considered only from the perspective of creating synthetic samples, and the issue of the influence of the number of texts without manifestations of propaganda patterns was not considered at all. Therefore, our study is relevant and aims to eliminate this drawback by analyzing the influence of the number of texts without propaganda patterns on the effectiveness of transformer models.

The paper aims to determine the optimal ratio between texts with propaganda patterns and neutral texts, which will improve the generalizability of the models and reduce their bias.

3. Methodology

To solve the problem of detecting web propaganda patterns, it is first necessary to fine-tune the neural networks to detect each of the web propaganda patterns. Accordingly, this can be formalized as the problem of training a set of individual binary neural network models NN, where each model nni corresponds to a certain propaganda pattern pi from the set of propaganda patterns P:

P={ p1 , p2 , … , pk }, (1) where pi – i-th propaganda pattern, k – number of unique propaganda patterns, i=1..k. Within the scope of the study, k=10, and the set P acquires the following elements:  p1=”Loaded Language”;  p2=”Glittering Generalities”;  p3 =”Euphoria”;  p4=”Appeal to Fear”;  p5=”FUD”;  p6=”Bandwagon”;  p7=”Thought-Terminating Cliche”;  p8=”Whataboutism”;  p9=”Cherry Picking”;  p10=”Straw Man”.

This set of propaganda patterns is linked to the existing data source presented within the framework of UNLP 2025 [23], dedicated to the competition for detecting manipulative propaganda patterns in the Ukrainian-language media space [24].

Accordingly, {NN} will take the form:

NN ={nn1 , nn2 , … , nnk }, (2) where nni – i-th neural network for i-th propaganda pattern.

Approach for detection of web propaganda patterns by transformer neural networks consists of sequential use of three developed methods: method for dataset balancing, method for fine-tuning individual binary neural network models and method for detecting web propaganda patterns (Figure 1).

Proposed approach ensures the determination of the optimal ratio between text sets with propaganda patterns and neutral text set, which improving the generalization ability of models and reduce their bias. This improves the efficiency of detecting propaganda patterns in web texts using transformer neural networks through optimizing the dataset balancing.

3.1. Method for Dataset Balancing

Method for dataset balancing is designed to transform the general set of data in the input dataset into 2 datasets (training dataset and validation dataset), which will allow to increase the accuracy of detecting propaganda patterns in web texts. Scheme of training dataset prepare is shown in Figure 2.

Percent of texts without manipulation patterns m – the studied parameter for analyzing the influence of the balance of the training sample on the effectiveness of models for detecting manipulative propaganda patterns in social media. This parameter has an impact on the formation of the training dataset.

In addition to the training dataset, a validation dataset is constructed, which consists equally of all types of web propaganda patterns and texts without propaganda. This allows determining whether the model does not confuse patterns with each other and whether it is able to detect them independently of each other, which is critically important for the multi-label classification problem. Accordingly, the result of the method of dataset balancing will be 2 datasets: training dataset and validation dataset. Schematically, their composition is shown in Figure 3.

Document

count

It is worth noting that the base dataset is annotated at the fragment level, and the training dataset and validation dataset contain not the full text, but fragments (sentences that are marked as propaganda patterns).

The dataset contains annotated data at the fragment level that determine the presence of manipulative influence patterns from the set P. A typical text of the dataset from the category "propaganda patterns" can have either one or several labels. A typical text of the dataset from the category "without propaganda patterns" does not contain any web propaganda patterns from the set P. According to the marked data, the number of documents corresponding to the patterns p1 – p10 has the distribution shown in Table 1.

This approach to dataset generation allows us to assess the impact of sample balancing on the quality of propaganda pattern detection, as well as to avoid the dominance of the most common classes in the training set [25]. Using separate binary models for each pattern allows us to model them independently, which is important in problems with class intersection, when one text may contain several types of manipulation. This allows us to investigate how each pattern is separated within the data corpus and how it is affected by the imbalance of the training sample.

3.2. Method for Fine-Tuning Individual Binary Neural Network Model for Propaganda Patterns Detection

As can be seen from Table 1, the data have an uneven distribution, so using a single multi-class neural network model will not allow to obtain high results. A multi-class model tends to dominate widely represented classes, which leads to a decrease in accuracy for poorly represented classes. As a result, the model may simply ignore small categories, which will lead to a significant imbalance in predictions. In addition, multi-class classification assumes that the text belongs to only one class [26], which contradicts the nature of the task, where 1 text can have several labels corresponding to certain web propaganda patterns. Accordingly, using separate binary models for each pi pattern allows to train each model separately without the influence of the imbalance of other classes to take into account texts with several patterns, since each model from NN set works independently and does not limit the choice to only one class.

To investigate the impact of the balance of the training sample on the detection of web propaganda patterns using a set of individual binary neural network models NN, it is necessary to first present a method for obtaining a typical individual binary neural network model nni for detecting propaganda pattern pi, the scheme of which is shown in Figure 4.

The input data of the method are prepared datasets for training and validation and pre-trained model nn. On Step 1, Fine-Tuning of typical nni on training DataSet, formed by method of datasets balancing, is performed. Fine-Tuning within the framework of the study will be carried out for individual binary neural network model BERT [27] and RoBERTa [28] with «HuggingFace» library [29].

Accordingly, on Step 2, evaluation of individual binary neural network model nni is performed, for evaluations both training dataset and validation dataset, which were formed by method of datasets balancing, will be used. Evaluation of models will be carried out by metrics Accuracy, Precision, Recall and F1. On Step 3, save of validated nni is performed. Accordingly, output data is fine-tuned model nni with metrics.

As pre-trained model nn, the use of BERT-like architectures is proposed, since these models can be applied to the analysis of Ukrainian texts even in the absence of large volumes of marked-up data [30, 31]. This feature is associated with pre-training on large text corpora, which allows these models to form universal language representations that can be refined on specific datasets to detect propaganda patterns. Fine-tuning allows you to adapt the model to the specifics of manipulative discourse, in particular in the Ukrainian language environment, which contains both unique stylistic and syntactic features.

3.3. Method for Web Propaganda Patterns Detection

After forming datasets and training a set of individual binary neural network models NN, detection of web propaganda patterns occurs. Scheme of method of web propaganda patterns detection by transformer neural networks is shown in Figure 5.

Input data of the method detection of web propaganda patterns by transformer neural networks are fine-tuned models NN, web content for analysis and threshold t.

On Step 1, preprocessing of web content for analysis occurs, which includes of splitting into sentences, after which tokenization is performed [32, 33]. The result of web content splitting for analysis will be the representation (3):

S={s1 , s2 , … , sn }, (3) where sj – j-th sentence in web content for analysis, n – count of sentence.

Step 2 performs web content labeling by each of nni . Each sentence sj is evaluated separately by each of nni, and if the output value of the neural network model nni for sentence j exceeds the given threshold t –propaganda pattern pi is considered to be manifested in sentence j. Accordingly, each sentence will be given a subset PPj of the elements of the set P:

S score=

PP j⊆ P , PP j={ pi|scorei , j>t }, (4) where scorei,j – the output value nni of the model for j-th sentence in {S}.

At Step 3, the formation of output view takes place, which is performed according to rules:  if there are already manifestations of other propaganda patterns for sentence sj, then such propaganda patterns are considered manifested in the text, however, the maximum value max_scorej will be displayed with highlighting: max¿= max scorei , j , (5)

pi⊆ PPj  if there are multiple sentences with the pi propaganda pattern, the overall score of the manifestation in web content for analysis is calculated as the arithmetic mean:

1 (6) ¿ SSi∨¿ ∑ scorei , j , SSi={s j∨ pi∈ PP j }¿

sj⊆ SSi where SSi, – a set of sentences in which is found.

Output data of the proposed method are probabilities of each of propaganda patterns in web content and highlighted sentences in which identified patterns are most evident [34].

The proposed in sections 3.1 – 3.2 methods are investigated experimentally in section 4.

4. Experiment

In accordance with purpose of research, problem of improving efficiency via dataset balancing arises, which can be mathematically represented as a problem of maximizing the F1 metric: m¿=arg max f ( m ) , (7)

m where f(m) – the value of the F1 metric of the nni model obtained after fine-tuning on the dataset with the selected percentage value m.

The solution of the optimization problem will be carried out experimentally, changing the % of non-propaganda texts in the Training DataSet from 10% to 70% in steps of 20%.

For the experimental part, specialized software was created, consisting of 2 modules: a training module (without a graphical user interface) and a neural network validation module (the application is shown in Figure 6). The Python language, PyTorch libraries [35], transformers [36], datasets [37] were used to develop the training module. The PySide6 libraries [38], transformers, PyTorch were used to develop the validation module.

Propaganda patterns m % p1 p2 p3 p4 p5 p6 p7 p8 p9 p10

Target

Accordingly, the created test software obtained the results shown in Section 5.

5. Results

After filling the Training DataSet using the method described in section 3.1, data sets were obtained, the quantitative distributions of which are given in Table 2.

The Precision (P), Recall (R), F1 metrics for fine-tuned individual binary neural network models at different percentage values of the parameter m on the test sample (20% of the Training DataSet, which did not participate in training) are given in Table 3. The Precision (P), Recall (R), F1 metrics for fine-tuned individual binary neural network models at different percentage values of the parameter m on the training sample (80% of the Training DataSet, which participated in training) are given in Table 4.

Target B E R T

Propagand a patterns p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p1 p2 p3 p4 p5 p6 p7 p8 p9 p10

The Precision (P), Recall (R), and F1 metrics for fine-tuned individual binary neural network models at different percentage values of parameter m on validation dataset are given in Table 5.

Comparisons by the Accuracy metric (average value) for fine-tuned individual binary neural network models of the BERT and RoBERTa architectures at different percentage values of the parameter m on the Validation DataSet are shown in Figure 7.

Comparisons by F1 metric (average value) for fine-tuned individual binary neural network models of the BERT and RoBERTa architectures at different percentage values of the parameter m on the Validation DataSet are shown in Figure 8.

It is also worth providing a table comparing the obtained results with the data of existing studies (Table 6).

Therefore, the problem of improving efficiency via dataset balancing, given in the form (7), has a solution m* = 30. An analysis of the obtained results is given in Section 6.

6. Discusion

In the presented results of testing models (Table 3) for detecting manipulative propaganda patterns on the test sample (20% of the training sample) with different percentages of texts without manipulations m (10%, 30%, 50%, 70%), one can observe a clear trend towards improving the performance of models with an increase in the value of the parameter m, i.e. with an increase in the percentage of texts without propaganda patterns in the training sample. For fine-tuned models based on BERT, it is seen that Precision, Recall and F1-measure for each category of propaganda patterns gradually increase from m=10% to m=70%. For example, for the “Loaded Language” category, the F1-measure increases from 0.498 at m = 10% to about 0.798 at m = 70%, which indicates a significant improvement in the model’s ability to distinguish target and non-target examples with an increase in the proportion of text samples without manipulations.

Comparing the performance of models for different propaganda patterns shows that some categories, such as “Glittering Generalities” and “Bandwagon”, “Cherry Picking”, have consistently high F1-measures as m increases, indicating that the characteristic features of these patterns are easier to separate with balanced training. In contrast, other categories, such as “Straw Man” and “Thought-Terminating Cliche”, show relatively lower performance, which may be due to greater variability or subtlety of the linguistic features characterizing these patterns.

Similar analysis for RoBERTa-based models shows similar trends, with the overall performance being slightly higher compared to BERT models. This is explained by the more robust pre-training and optimized architecture of RoBERTa, which allows the model to generalize information better. The improvement in the evaluation indicators with an increase in the proportion of unmanipulated text samples highlights the importance of balancing the dataset to overcome the problem of class imbalance, which, in turn, contributes to more reliable and stable detection of propaganda patterns by individual binary neural networks. For the BERT neural network, on average, for the F1 metric, the delta between m = 10% and m = 30% is +0.048, between m = 30% and m = 50%, the delta is 0.063, and between m = 50% and m = 70%, the delta is 0.091. At the same time, for the RoBERTa neural network, delta of +0.0632 is observed between m = 10% and m = 30%, a delta of 0.056 is observed between m = 30% and m = 50%, and delta of 0.082 is observed between m = 50% and m = 70%.

The analysis of the data from Table 4 indicates the ability of neural network models to remember, and here, naturally, as in Table 3, there is a tendency for metrics to increase with increasing parameter m. For the BERT neural network, on average, for the F1 metric, there is a delta between m = 10% and m = 30% of +0.049, between m = 30% and m = 50%, there is delta of 0.02, and between m = 50% and m = 70%, there is delta of 0.031. At the same time, for the RoBERTa neural network, a delta of +0.028 is observed between m = 10% and m = 30%, delta of 0.022 is observed between m = 30% and m = 50%, and delta of 0.024 is observed between m = 50% and m = 70%. Accordingly, RoBERTa demonstrates a gradual increase in metrics, which indicates stable generalization due to the optimized architecture. BERT demonstrates somewhat jumpy increases, which may be due to the lower flexibility of its architecture in adapting to changes in the proportion of text samples without propaganda patterns.

For the RoBERTa neural network, when detecting manipulation patterns “Glittering Generalities”, “Appeal to Fear”, “FUD”, “Bandwagon”, “Whataboutism”, an F1 value of more than 0.95 is observed. For the BERT neural network, a value above 0.95 is observed only for “FUD”. In general, the use of different values of the parameter m affects the ability of neural networks to remember the features of the training set. However, the metrics calculated on the training data allow us to assess how well the model remembered this data, but do not give a complete picture of its ability to generalize new information.

The most relevant estimates of the experiment are given in Table 5, since here the model was validated on data that did not participate in training, and which contain equally represented propaganda patterns and texts without such patterns.

According to Table 5 and Figures 7 and 8, at the parameter m=30% the metrics demonstrate the highest result, where the average value of the Accuracy metric is 0.733 for the RoBERTa neural network, and 0.704 for the BERT architecture. The F1 metric for RoBERTa is 0.725, and for the BERT architecture – 0.693. This suggests that the initial addition of data allows to increase the metrics, but then the effect is smoothed out or even worsened due to overloading with less useful information, such as texts without propaganda patterns. Accordingly, while neural networks show a tendency to better distinguish propaganda patterns at higher values of m during training, testing on a balanced validation set refutes the hypothesis that the higher the resolution of the training data, the better the generalization ability of the neural network model. It is possible that as the proportion of m increases, the models are overtrained due to the lack of unique values inherent in each of the web propaganda patterns. The conclusion that the m*=30 found is also confirmed by the minimum mean deviation between the test data for m=30 (Table 3 and Table 5) for both the BERT architecture neural network (0.05) and RoBERTa (0.06).

The comparison with analogues is carried out in Table 6, and for the purity of the comparison of the developed approach and existing analogues, the F1 value was taken specifically on the validation data. Accordingly, the highest F1 indicator for the RoBERTa architecture at m=30% is 0.725, which is 0.1 higher than the analogue described in [ 15 ]. Therefore, the task of improving the efficiency of detecting propaganda patterns in web texts using transformative neural networks through optimizing the balancing of the dataset has been fully implemented and experimentally proven.

However, the proposed approach has limitations. In this study, an approach at the sentence level was used. This may have an impact on the quality of detecting propaganda patterns, which may work at the level of paragraphs or even entire texts, rather than individual sentences. Also, a single sentence may be neutral in itself, but in the context of propaganda text its meaning changes. These issues will be addressed in further research. There are also limitations at the level of the data source. The manual labeling used in the dataset may contain subjective judgments, which affects the training of the model.

Conclusions

In addition to the training dataset, consisting of texts with target propaganda pattern in the target category, as well as texts without any propaganda patterns and texts with other propaganda patterns, without target, a validation dataset was built, which consists equally of all types of web propaganda patterns and texts without propaganda. This allows us to determine whether the model does not confuse patterns with each other and is able to detect them independently of each other, which is critically important for the patterns detection task.

An analysis of the impact of the balance of the training sample on the effectiveness of propaganda pattern detection models in social media was performed, which showed that of the considered options for forming the training dataset with different percentages of texts without manipulations (10%, 30%, 50% and 70%), the highest results were achieved at 30% using the RoBERTa neural network, and are 0.725 according to the F1 metric. The results obtained contribute to a deeper understanding of the role of training sample balancing in improving propaganda pattern detection algorithms and can be used to increase the reliability of automated information space analysis systems.

Building a validation dataset that contains an equal number of texts with all types of propaganda patterns, as well as neutral texts, provides a fair assessment of the performance of the models. This prevents bias towards the most represented classes and allows for more accurate performance metrics for each individual pattern. In addition, this approach allows for the identification of potential relationships between different types of manipulation, since texts can contain multiple patterns at the same time.

Analyzing the impact of parameter that determines proportion of texts without web propaganda patterns allows assessing how the models ability to distinguish propaganda patterns from neutral texts and texts with other propaganda patterns. This allows finding the optimal ratio of dataset classes to increase the overall effectiveness for detecting web propaganda patterns.

The proposed approach has the limitation of analyzing at the sentence level, which may not take into account the broader context of propaganda patterns at the paragraph or whole text level. In addition, the use of manual data labeling may contain subjective judgments, which affects the training of the model.

Acknowledgements

This study was conducted using dataset made available by UNLP 2025 Shared Task initiative (GitHub repository) [24]. Authors are grateful to organizers and contributors for compiling and sharing this resource, which supports ongoing research in propaganda detection techniques.

Declaration on Generative AI

During the preparation of this work, the authors used Grammarly in order to: Grammar and spelling check. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the publication’s content.

[1]

B. M.

Almotairy ,

Abdullah ,

D. H.

Alahmadi , Dataset for Detecting and Characterizing Arab Computation Propaganda on X, Data in Brief . ( 2024 ) 110089 . doi: 10 .1016/j.dib. 2024 . 110089 .

[2]

Horák ,

Sabol ,

Herman ,

Baisa , Recognition of propaganda techniques in newspaper texts: Fusion of content and style analysis , Expert Systems with Applications . ( 2024 ) 124085 . doi: 10 .1016/j.eswa. 2024 . 124085 .

[3]

Moore ,

Colley , Two International Propaganda Models: Comparing RT and CGTN's 2020 US Election Coverage , Journalism Practice. ( 2022 ) 1 - 23 . doi: 10 .1080/17512786. 2022 . 2086157 .

[4]

Bösch , T. Divon, The sound of disinformation: TikTok, computational propaganda , and the invasion of Ukraine, New Media & Society. 26.9 ( 2024 ) 5081 - 5106 . doi: 10 .1177/14614448241251804.

[5]

Krak ,

Molchanova ,

Didur ,

Sobko ,

Mazurets ,

Barmak , Method of semantic features estimation for political propaganda techniques detection using transformer neural networks , CEUR Workshop Proceedings , 3917 ( 2025 ) 286 - 297 . URL: https://ceur-ws. org/ Vol- 3917 /paper56.pdf

[6]

Karas , T. I. Profant , Propaganda Model in Slovak Media Space: A Case Study, Hum. Aff . 35 ( 1 ). ( 2024 ) 36 - 60 . doi: 10 .1515/humaff-2024-0062.

[7]

Alam ,

M. R.

Biswas ,

Shah ,

Zaghouani , G. Mikros, Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-agent LLMs , In: Lecture Notes in Computer Science , Springer Nature Singapore, 2024 , p. 380 - 390 . doi: 10 .1007/ 978 -981-96-0576-7_ 28 .

[8]

Yumnam ,

Gyanendra ,

C. I.

Singh , A systematic bibliometric review of the global research dynamics of United Nations Sustainable Development Goals 2030, Sustain . Futures 7 ( 2024 ) 100192 . doi: 10 .1016/j.sftr. 2024 . 100192 .

[9]

Işık ,

Ongan ,

Ozdemir ,

Yan , O. Demir, The sustainable development goals: Theory and a holistic evidence from the USA , Gondwana Research. 132 ( 2024 ) 259 - 274 . doi: 10 .1016/j.gr. 2024 . 04 .014.

[10]

Sorooshian , The sustainable development goals of the United Nations: A comparative midterm research review , Journal of Cleaner Production . 453 ( 2024 ) 142272 . doi: 10 .1016/j.jclepro. 2024 . 142272 .

[11]

Raman ,

Kumar Nair ,

Nedungadi ,

Kumar Sahu ,

Kowalski ,

Ramanathan ,

Achuthan , Fake news research trends, linkages to generative artificial intelligence and sustainable development goals , Heliyon ( 2024 ) e24727 . doi: 10 .1016/j.heliyon. 2024 .e24727.

[12]

Faye ,

Icard ,

Casanova ,

Chanson ,

Maine ,

Bancilhon ,

Égré , Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification , arXiv preprint arXiv:2402.03780 ( 2024 ). URL: https://arxiv.org/abs/2402.03780.

[13]

Li ,

Liu ,

Lu ,

Shi ,

Wen , Span identification and technique classification of propaganda in news articles , Complex & Intelligent Systems . ( 2021 ). doi: 10 .1007/s40747-021- 00393-y.

[14]

Abdullah ,

Altiti ,

Obiedat , Detecting Propaganda Techniques in English News Articles using Pre-trained Transformers , 13th International Conference on Information and Communication Systems ( 2022 ) 301 - 308 . doi: 10 .1109/ICICS55353. 2022 . 9811117 .

[15]

Aldabbas ,

Ashraf ,

Sifa , L. Flek, MultiProp Framework: Ensemble Models for Enhanced Cross-Lingual Propaganda Detection in Social Media and News using Data Augmentation, Text Segmentation, and Meta-Learning , Proc. 1st Workshop NLP Lang. Using Arab. Script ( 2025 ) 7 - 22 . URL: https://aclanthology.org/ 2025 .abjadnlp- 1 .2.

[16]

Szwoch ,

Staszkow ,

Rzepka ,

Araki , Limitations of Large Language Models in Propaganda Detection Task, Applied Sciences. 14.10 ( 2024 ) 4330 . doi: 10 .3390/app14104330.

[17]

P. N.

Ahmad ,

Yuanchao ,

Aurangzeb ,

M. S.

Anwar , Q. M. u. Haq, Semantic web-based propaganda text detection from social media using meta-learning , Service Oriented Computing and Applications . ( 2024 ). doi: 10 .1007/s11761-024-00422-x.

[18]

D. G.

Jones , Detecting Propaganda in News Articles Using Large Language Models, Engineering 2 .1 ( 2024 ) 01 - 12 . doi: 10 .33140/eoa.01.02.10.

[19]

A. A.

Mustafa ,

C.-Y.

Lin ,

Kakinaka , Detecting market pattern changes: A machine learning approach , Finance Research Letters. ( 2021 ) 102621 . doi: 10 .1016/j.frl. 2021 . 102621 .

[20] A. M. U. D. Khanday , Q. R.

Khan , S. T.

Rabani , M. A.

Wani , M. ELAffendi, Propaganda Identification on Twitter Platform During COVID-19 Pandemic Using

LSTM

, In: Advances in