<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Neural network detection of digital fatigue and burnout with interpretable thematic segmentation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olexander Mazurets</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roman Vit</string-name>
          <email>vit.roman.vit@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maryna Molchanova</string-name>
          <email>m.o.molchanova@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olena Sobko</string-name>
          <email>olenasobko.ua@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adam Wierzbicki</string-name>
          <email>adamw@pjwstk.edu.pl</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dmytro Chumachenko</string-name>
          <email>dichumachenko@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Polish-Japanese Academy of Information Technology</institution>
          ,
          <addr-line>Koszykowa 86 str. 02-008, Warsaw</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Waterloo</institution>
          ,
          <addr-line>Waterloo, ON N2L 3G1</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>The rapid expansion of remote work and digital communication has intensified the prevalence of digital fatigue and professional burnout, yet existing automated detection methods often lack the interpretability required for clinical and organizational trust. A significant gap remains in efectively distinguishing between topical discussions of fatigue and the actual psycho-emotional state of the user. In this work, we propose a novel interpretable approach combining thematic segmentation, communication object identification, and a BERT-based neural network to detect digital fatigue with high contextual sensitivity. On validation data, the proposed model achieved an Accuracy of 0.83, Precision of 0.87, Recall of 0.88, and an 1-score of 0.87. The study demonstrates that integrating thematic analysis with deep learning allows for a multi-level assessment of cognitive load, enabling the identification of both local overload centers and overall fatigue levels. This approach directly contributes to the Sustainable Development Goals by promoting mental well-being (SDG #3) and decent work environments (SDG #8) through healthier digital practices.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Digital fatigue</kwd>
        <kwd>digital burnout</kwd>
        <kwd>communication object</kwd>
        <kwd>neural network</kwd>
        <kwd>interpretable thematic segmentation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        With the rise of remote work, digital communication, and online education, more and more people are
experiencing digital fatigue [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and burnout [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This is a condition where mental resources are depleted
due to the constant demand to be “connected,” to respond, and to adapt to the dynamics of the digital
environment [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The ethical dimension of the application of artificial intelligence for decision-making
in the field of medical law is also a relevant topic for research [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        This problem has become especially relevant after the global changes caused by the COVID-19
pandemic [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], when remote work and learning have become the norm from the exception [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The lack
of physical boundaries between personal and professional life, prolonged interaction through screens,
high intensity information flows, and the loss of familiar social contacts create ideal conditions for the
development of chronic stress, mental exhaustion, and a deterioration in the quality of life [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        The relevance of addressing digital fatigue is also emphasized within the framework of the United
Nations Sustainable Development Goals, as it is directly connected with the promotion of mental
health and well-being under SDG #3 and the advancement of decent work and sustainable economic
growth under SDG #8. In this context, the development of interpretable neural network approaches to
monitoring digital fatigue not only contributes to individual psychological resilience, but also supports
the creation of healthier, more sustainable, and human-centered digital ecosystems that are aligned
with global development priorities [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>The aim of the research is to develop and test a neural network interpretable approach to automated
detection of digital fatigue and professional burnout through thematic segmentation of text messages
and analysis of communication objects, which allows to detect hidden patterns of psycho-emotional
exhaustion in the digital environment.</p>
      <p>The main contributions of the paper are:
• A new multi-stage framework for detecting digital fatigue is proposed, which combines thematic
segmentation, communication object analysis, and fatigue index modeling at both the segment
and profile levels.
• The Neural Network Detection of Digital Fatigue and Burnout Using Thematic Segmentation and
Communication Object Analysis method is implemented, which allows for targeted assessment
of fatigue within the content of communication.
• A multi-indicator fatigue assessment model is developed, which allows for calculating both
localized and aggregated levels of digital fatigue across all user profiles.
• A visualization module is created to interpret user-specific digital fatigue maps, which will allow
for identifying problematic communication segments.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related works</title>
      <p>The literature review in this area aims to identify key determinants of digital fatigue, its manifestations
in the work context, as well as existing methodologies for its detection.</p>
      <p>
        The article [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is devoted to a comprehensive analysis of the phenomenon of digital fatigue as a
current challenge for the professional environment. The authors conducted a review of the scientific
literature for the period 2010–2025, focusing on identifying key factors, consequences and strategies
for overcoming digital fatigue among employees. The study finds that excessive use of digital tools
leads to cognitive overload, increased stress levels and reduced productivity. The paper focuses on the
complexity of the interaction of synchronous and asynchronous communication formats, as well as on
the blurring of boundaries between professional and personal life. The authors emphasize the need
to implement contextualized organizational approaches to digital communication and adhere to the
principles of digital well-being.
      </p>
      <p>
        The study [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] focused on the growing threat of digital fatigue among young people caused by
excessive use of screens in everyday life. Particular attention was paid to Computer Vision Syndrome
(CVS) and its associated postural strain in the 18–35 age group. Based on a survey of 160 respondents
and analysis of working postures using the OWAS system, a high prevalence of symptoms of visual
and musculoskeletal discomfort was found, including dry and burning eyes, headaches, neck stifness
and shoulder tension. A significant proportion of participants demonstrated a medium to high risk of
postural strain, which is exacerbated by poor ergonomics and low awareness of digital hygiene.
      </p>
      <p>
        The authors conducted a study on the use of natural language processing and machine learning
methods to identify burnout indicators based on text content [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. For the analysis, a corpus of
13,568 anonymized text messages obtained from the social platform Reddit was formed, among which
352 messages were identified as related to burnout and 979 to depression. Ensemble approaches to
classification based on subreddit-based data separation strategies and random batching were proposed
and implemented. The results obtained demonstrate that ensemble models significantly outperform
basic classifiers in terms of balanced Accuracy (0.93), test 1-score (0.43), and test Recall (0.93). The
study confirms the efectiveness of using NLP methods for early detection of symptoms of professional
burnout using text data, which opens up prospects for the further development of automated systems
for monitoring psycho-emotional state.
      </p>
      <p>
        A stress detection methodology for preventing burnout based on speech and written expression
analysis using natural language processing methods is presented in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. The authors combine
knowledge from psychology and data science to create a knowledge base that allows comparing speech
characteristics with objective stress indicators, such as heart rate variability, cortisol levels, and blood
pressure. The tool operates autonomously and passively, identifying both cognitive and emotional
manifestations of stress [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Its accuracy, confirmed by biomedical data, reaches 83% according to the
1-measure metric. The article emphasizes the importance of an interdisciplinary approach and the
potential for implementing this technology in a professional environment for monitoring employee
well-being with the aim of early detection and prevention of burnout.
      </p>
      <p>
        The article [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is devoted to studying the phenomenon of emotional burnout among higher education
students during the COVID-19 pandemic, which arose due to sharp changes in the learning format and
social environment. Based on a qualitative narrative review of 38 peer-reviewed scientific publications,
the authors analyze the main factors that have caused a decrease in academic motivation, engagement
and success. Particular attention is paid to the impact of financial instability, mental health problems,
social isolation and digital fatigue from distance learning. At the same time, the study considers
university strategies to mitigate the negative consequences, in particular, the introduction of flexible
academic policies, hybrid learning models and psychological support. The authors also note the role of
artificial intelligence – chatbots and teaching assistants, as scalable tools for emotional and academic
assistance in the online environment.
      </p>
      <p>The analysis of scientific sources indicates the growing relevance of research on digital fatigue and
professional burnout as complex multifactorial phenomena manifested in cognitive, physiological and
behavioral symptoms. Modern approaches to detecting these states are based on a combination of
natural language processing methods, postural load analysis, biophysiological monitoring and
cognitiveafective diagnostics. The scientific literature shows a trend towards interdisciplinarity: the combination
of data from digital behavior, language patterns and psycho-emotional indicators allows to expand
the possibilities of early detection of burnout and digital overload in educational and professional
environments. At the same time, there is a lack of research that would combine thematic segmentation
of communications with the analysis of digital interaction objects for the purposes of automated fatigue
detection.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Problem statement</title>
      <p>Despite the growing interest in detecting digital fatigue using natural language processing methods,
there is a conflict between the need for accurate recognition of psycho-emotional states and the linguistic
ambiguity of texts, where the topic of digital fatigue and burnout may not correspond to the real internal
state of the author. This complicates automated classification and requires a deeper analysis of speech
patterns, context and author intentions, which goes beyond traditional thematic or emotional modeling.</p>
      <p>On the one hand, human speech is an important source of detecting psycho-emotional states, in
particular digital fatigue and burnout, however, on the other hand, the presence of semantic topics
related to fatigue or negative events is not yet a direct indicator of digital fatigue as a psychological
phenomenon.</p>
      <p>This creates methodological complexity: the text may contain signs of emotional exhaustion without
explicit mention of the digital context, or, conversely, may contain mentions of fatigue, but only
descriptively, without the presence of symptoms (for example, information messages, news, discussion
of the topic). Thus, there is a need to distinguish the thematic content of the message from latent
psychological markers of the human condition. Therefore, it is necessary to develop a method that
would take into account all the above-described aspects and be interpretable, as well as create software
to study its efectiveness.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Method design</title>
      <p>The proposed method for neural network detection of digital fatigue and burnout using interpretable
thematic segmentation and communication object analysis can be schematically presented as a sequential
execution of the stages shown in Figure 1. The input data are all entries (posts, comments, notes,
etc.) of a certain author or group of authors. Each entry in this block is marked with an index
(1, 2, . . . ) and forms a set of text content, from which an “author profile” is then
generated. The profile refers to metadata and summary characteristics: timestamps, length of entries,
general topic, frequency of updates.</p>
      <p>
        At stage 1, all records undergo thematic analysis using thematic modeling models [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. The result is
that each record belongs to one of the  identified topics. The LDA thematic modeling algorithm [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]
was used in the study. Communication segments mean conditional groups of texts united by a common
topic ( 1,  2, . . .  ). Each record is mapped to the topic that best describes its content
with “arrows”. This stage allows you to break the entire text stream into semantic blocks, preparing the
data for a deeper analysis.
      </p>
      <p>
        At stage 2, the search for target communication objects takes place. Target communication objects
are a combined set of keywords found by diferent methods (TF-IDF [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], YAKE! [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and Dispersive
Estimation [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]) without repetitions and a set of NER [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], grouped by lemmatization. In this study,
the choice of approach to identifying target communication objects is due to the need to ensure high
semantic sensitivity and resistance to thematic shifts inherent in diferent styles of digital communication.
Combining statistical, linguistic and dispersion features allows us to obtain a representative set of
keywords relevant to the context of each thematic segment. In addition, supplementing automatically
detected key terms with named entities that have undergone the lemmatization procedure allows us to
increase the conceptual integrity of the resulting set of objects, ensuring more accurate identification of
semantic cores of digital interaction.
      </p>
      <p>
        At stage 3, neural network detection of digital fatigue and burnout occurs using the
“BERTForSequenceClassification” neural network [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Such a determination occurs both in the general set of text
representations of the author’s digital profile and within each of the identified topics. The choice of
the “BERTForSequenceClassification” model [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] is due to its ability to form contextualized vector
representations of text sequences, which ensures high quality classification even with limited input
length. Due to pre-training on a large corpus of texts, the BERT architecture efectively takes into
account semantic and syntactic relationships between tokens, which is critical for detecting complex
cognitive and emotional states, such as digital fatigue and burnout.
      </p>
      <p>
        The output data is a digital fatigue map by profile. The results are aggregated into a two-level
visualization [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. By segments, for each topic, a digital fatigue indicator and a graphic interpretation
in the form of a word cloud (target objects of communication) are displayed that characterize the
communication segment. Across the entire profile: a generalized indicator with the appropriate level
and visualization is formed, which allows you to quickly assess the overall state of the authors “digital
health”.
      </p>
      <p>A structured approach to detecting digital fatigue and professional burnout based on a neural network
is proposed, which combines thematic segmentation of text records and analysis of target objects of
communication. The methodology includes the sequential application of thematic modeling, semantic
keyword extraction, and contextualized classification using the BERT architecture, which provides a
multi-level assessment of cognitive load and emotional exhaustion in the digital environment.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experiment</title>
      <sec id="sec-5-1">
        <title>5.1. Experimental datasets</title>
        <p>
          The “Healthcare Workers Burnout Tweets” dataset [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] on Kaggle [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] contains a collection of public
tweets collected from the profiles of healthcare workers, mostly nurses, who are expressing their
experiences working at the epicenter of the COVID-19 pandemic. Each entry is represented by the
main tweet attributes: text content, publication timestamp, user metadata, and a label indicating the
presence or absence of burnout.
        </p>
        <p>
          The “Mental Health Social Media” dataset [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] from Kaggle is a corpus of English-language tweets
collected using the oficial Twitter API [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] in the first half of 2019. It consists of about 20k records,
each of which contains: a unique tweet and user ID, as well as a publication timestamp, the message
text, cleaned of unnecessary characters and filtered by language (only English tweets), and interaction
metadata (number of retweets, likes, hashtags and mentions by other users). The dataset also contains
features: tone and subjectivity, frequency of use of personal pronouns, service words and emojis, tweet
length (number of words/characters), lexical indicators (readability level, distribution of speech parts,
etc.).
        </p>
        <p>
          Each category contains an identical number of images, ensuring balanced representation of the target
classes and minimizing model bias throughout the training process [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. Therefore, the “Healthcare
Workers Burnout Tweets” dataset will be used to train the BERT neural network model for the digital
fatigue detection method, and the “Mental Health Social Media” dataset will be used to validate the
digital fatigue detection method with deep learning neural network models.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Experiment and setup description</title>
        <p>
          The software architecture is implemented in the Google Colab cloud environment [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] and includes
three functional modules: (1) topic modeling, (2) detection of target communication objects, and (3)
classification of digital fatigue using deep learning. Thematic analysis was implemented using the
LDA algorithm, which was applied to previously cleaned and lemmatized records. Cleaning included
noise removal, tokenization, normalization, and stop-word removal, after which a corpus dictionary
was formed. The optimal number of topics was determined using model coherence. To construct
a set of target communication objects, a combined approach was implemented that integrates the
results of three independent keyword detection methods – TF-IDF, YAKE!, and variance analysis [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ].
Named entities were determined using the Stanza library, which allows for linguistically accurate entity
recognition with subsequent lemmatization. All the obtained objects were combined into a single set,
cleaned of repetitions, which guarantees consistency between thematic afiliation and the semantic
core of each segment.
        </p>
        <p>
          Neural network detection of digital fatigue was carried out based on the
“BERTForSequenceClassification” model, implemented in the “HuggingFace Transformers” framework [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. Input text data
was fed to the built-in tokenizer, which converted them into tokens sequences, trimmed them to a
maximum length of 32 tokens and supplemented them with special characters [PAD] to a fixed size.
The “Healthcare Workers Burnout Tweets” corpus was used for training, it was stratified into training
(80%) and validation (20%) parts.
        </p>
        <p>
          An independent “Mental Health Social Media” set was used for testing the model, which allowed
for validation on new data, since it is possible to select messages within the selected user ID. The
classification results were aggregated both at the level of individual thematic segments and within the
overall digital profile of the author. The results were visualized using the “matplotlib” [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], “seaborn” [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ]
and “wordcloud” [
          <xref ref-type="bibr" rid="ref34">34</xref>
          ] libraries, which allow building interactive digital fatigue maps and semantic
clouds of communication objects.
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Results and discussion</title>
      <p>
        As a result of training the BERT neural network architecture, the results according to the metrics shown
in Table 1 were obtained. At the same time, the confusion matrix [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] showed the result shown in
Figure 2.
      </p>
      <p>The classification results performed using the BERT neural network for digital fatigue detection
demonstrate a balance between two classes – the presence of digital fatigue (class 1) and its absence
(class 0). The model shows relatively equal performance for both classes, which indicates its ability to
accurately distinguish between these categories.</p>
      <p>In the case of positive predictions (class 1), the model correctly identified 175 cases of digital fatigue
as the presence of this condition (True Positives, TP). This indicates that the model does a good job of
detecting digital fatigue, as most of the cases were correctly classified. However, there were also errors:
13 cases of digital fatigue were incorrectly classified as the absence of fatigue (False Negatives, FN),
which indicates the model’s possible dificulties in identifying some cases of fatigue. This may be due
to the fact that the text messages in these cases had more vague or less obvious signs of fatigue.</p>
      <p>Regarding negative predictions (class 0), the model also demonstrated good discrimination. It correctly
classified 171 cases of absence of digital fatigue as the absence of this condition (True Negatives, TN),
which indicates that the model is able to clearly determine when fatigue is absent. At the same time,
there were 17 cases where the model incorrectly classified the absence of digital fatigue as the presence
of fatigue (False Positives, FP). These errors may be related to the presence of similar patterns or
characteristics in the texts that were incorrectly interpreted as signs of digital fatigue. The created
method with a trained neural network model was tested on the training dataset (the “burnout” category).
The results are shown in Table 2.</p>
      <p>According to Table 2, LDA identified 5 communication segments within which the proportion of
texts with burnout is over 94%. However, this is fully correlated with confusion matrix and obtained
accuracy, since texts marked as containing burnout were analyzed.</p>
      <p>The next experiment selected a user for whom a digital profile was built in the form of his posts
(“Mental Health Social Media” dataset). After applying thematic modeling, two communication segments
were extracted. According to the identified communication segments, target communication objects
were identified and the digital fatigue and burnout index was determined for the identified segments
(Figure 3).</p>
      <p>The first theme contains words that are likely to be related to problems that may be related to digital
fatigue or burnout, such as: “therapy”, “treatment”, “disorders”, “depressive”, “work”, “boss”. These words
indicate stress, anxiety, fatigue, as well as interaction with work responsibilities, which are important
aspects for a theme describing digital fatigue. For example, the words “therapy” and “treatment” may
indicate the search for solutions or support in dealing with this problem, while “depressive” and
“disorders” link the problem to psychological aspects.</p>
      <p>At the same time, words like “work” and “boss” indicate potential stressors in the workplace, which
are relevant in the context of digital fatigue, since people working in conditions of constant digital
interaction often feel overloaded due to high demands, in particular from superiors.</p>
      <p>The second theme is more general, and includes words that are mainly related to life, household
chores and family interactions: “talk”, “business”, “article”, “help”, “life”, “family”, “wife”, “home”. This
may be less specific to digital fatigue, as these words are more related to the social aspects of life.
However, some of them, in particular “help”, “business” or “life”, can be interpreted as a connection
to stress or dificulties that can arise due to overload in work or personal life, which can also lead to
emotional exhaustion. For the first theme, the high percentage of burnout (78%) seems logical, since
the words from this theme are indeed more related to stressful situations that can cause digital fatigue:
work, depression, therapy, etc. The high frequency of such words indicates that this is a theme that
covers most of the factors that can lead to burnout.</p>
      <p>The second theme, with a burnout rate of 24%, contains words that are more related to social aspects
of life, which may be less closely related to digital fatigue or stress caused by excessive use of technology.
For example, the words “family”, “home”, “wife” have a lower connection with digital fatigue, which
justifies the lower percentage.</p>
      <p>The proposed method has confirmed its viability as a tool for automated monitoring of
psychoemotional state based on text analysis. Its application contributes to improving the quality of digital
fatigue diagnostics, opens up opportunities for integration into systems for adapting digital services to
the mental state of users, and creates a basis for further research in the field of personalized interfaces
and digital hygiene.</p>
      <p>The proposed approach has a number of limitations, including working only with English-language
data and limiting the length of the input sequence to 32 tokens. Further research will be aimed at
improving the method, which involves adapting it to multilingual analysis, taking into account the
peculiarities of the syntax and vocabulary of the Ukrainian language, by using multilingual models.
Also, a separate direction is to take into account the temporal dynamics of the user’s digital activity.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>This paper presents a new approach to the automated detection of digital fatigue and professional
burnout based on text analysis, which uniquely combines interpretable thematic segmentation of content
with the identification of target objects of communication and subsequent neural network classification
using the BERT architecture. The proposed method allows for a comprehensive, multi-level assessment
of the user’s cognitive and emotional load in the digital environment, providing both contextual and
semantic sensitivity to signs of digital fatigue. The neural network trained to detect digital fatigue
achieved robust metrics on validation data, with an Accuracy of 0.83, Precision of 0.87, Recall of 0.88,
and an 1-score of 0.87.</p>
      <p>Unlike traditional models that operate with a global text profile or aggregated metadata, this paper
implements a method of thematic classification of text records, which allows for the isolation of
semantic segments of communication. Further analysis within each segment is carried out through the
identification of target communication objects, which provides a contextualized interpretation of digital
activity. In contrast to unidimensional assessments focused mainly on cognitive or visual symptoms
(e.g., computer vision syndrome), a formalized model for assessing digital fatigue is proposed, which
includes a number of indicators – at the topic, post, and user profile levels. Such a model allows one to
identify both local centers of overload and the overall level of digital fatigue.</p>
      <p>However, the current study presents certain limitations. The model was trained and validated
exclusively on English-language data, and the input sequence length was restricted to 32 tokens to
optimize processing eficiency, potentially limiting the analysis of longer, more complex narratives.
Future research will address these constraints by adapting the methodology for multilingual analysis,
specifically incorporating the syntax and vocabulary of the Ukrainian language through multilingual
models. Additionally, a significant direction for future work is the integration of temporal dynamics
into the user’s digital activity profile to monitor the progression of fatigue over time.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>The authors are grateful to Prof. Tetiana Hovorushchenko, Prof. Olexander Barmak, Prof. Iurii Krak
and other Program Committee members for organizing and conducting the workshop ExplAI-2025:
Advanced AI in Explainability and Ethics for the Sustainable Development Goals.</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Supriyadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sulistiasih</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Rahmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fahrudin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Pramono</surname>
          </string-name>
          ,
          <article-title>The impact of digital fatigue on employee productivity and well-being: A scoping literature review</article-title>
          ,
          <source>Environment and Social Psychology</source>
          <volume>10</volume>
          (
          <year>2025</year>
          ). doi:
          <volume>10</volume>
          .59429/esp.v10i2.
          <fpage>3420</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Rahmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fahrudin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Supriyadi</surname>
          </string-name>
          , E. Herlina,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosilawati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Ningrum</surname>
          </string-name>
          ,
          <article-title>Technostress and cognitive fatigue: Reducing digital strain for improved employee well-being: A literature review</article-title>
          ,
          <source>Multidisciplinary Reviews</source>
          <volume>8</volume>
          (
          <year>2025</year>
          )
          <article-title>2025380</article-title>
          . doi:
          <volume>10</volume>
          .31893/multirev.2025380.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>W.</given-names>
            <surname>Maetzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>Guedes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. N.</given-names>
            <surname>Emmert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kudelka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. L.</given-names>
            <surname>Hildesheim</surname>
          </string-name>
          , E. Paulides,
          <string-name>
            <given-names>H.</given-names>
            <surname>Connolly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Davies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dilda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ahmaniemi</surname>
          </string-name>
          , et al.,
          <article-title>Fatigue-related changes of daily function: Most promising measures for the digital age</article-title>
          ,
          <source>Digital Biomarkers</source>
          (
          <year>2024</year>
          )
          <fpage>30</fpage>
          -
          <lpage>39</lpage>
          . doi:
          <volume>10</volume>
          .1159/000536568.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hovorushchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Herts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hnatchuk</surname>
          </string-name>
          ,
          <article-title>Concept of intelligent decision support system in the legal regulation of the surrogate motherhood</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>2488</volume>
          ,
          <year>2019</year>
          , pp.
          <fpage>57</fpage>
          -
          <lpage>68</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2488</volume>
          /paper5.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Anand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ahuja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Thakur</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Mondal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Kohli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkateshan</surname>
          </string-name>
          ,
          <article-title>Digital burnout: Covid-19 lockdown mediates excessive technology use stress</article-title>
          ,
          <source>World Social Psychiatry</source>
          <volume>2</volume>
          (
          <year>2020</year>
          )
          <article-title>171</article-title>
          . doi:
          <volume>10</volume>
          .4103/wsp.wsp_
          <volume>21</volume>
          _
          <fpage>20</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Sarangal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nargotra</surname>
          </string-name>
          ,
          <article-title>Digital fatigue among students in current covid-19 pandemic: A study of higher education</article-title>
          ,
          <source>Gurukul Business Review</source>
          <volume>18</volume>
          (
          <year>2022</year>
          ).
          <source>doi:10.48205/gbr.v18.5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Byrne</surname>
          </string-name>
          ,
          <article-title>Special topic on reducing technology related stress and burnout: Digital compassion fatigue as an emerging phenomenon for registered nurses experiencing techno-stress, Applied Clinical Informatics (</article-title>
          <year>2025</year>
          ). doi:
          <volume>10</volume>
          .1055/a-2564-8809.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Radiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Kovalchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Slobodzian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Manziuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Krak</surname>
          </string-name>
          ,
          <article-title>Human-in-the-loop approach based on MRI and ECG for healthcare diagnosis</article-title>
          ,
          <source>in: Proceedings of the 5th International Conference on Informatics &amp; Data-Driven Medicine (IDDM</source>
          <year>2022</year>
          ), volume
          <volume>3302</volume>
          , CEUR-WS.org, Aachen,
          <year>2022</year>
          , pp.
          <fpage>9</fpage>
          -
          <lpage>20</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3302</volume>
          /paper1.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Supriyadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sulistiasih</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Rahmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fahrudin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Pramono</surname>
          </string-name>
          ,
          <article-title>The impact of digital fatigue on employee productivity and well-being: A scoping literature review</article-title>
          ,
          <source>Environment and Social Psychology</source>
          <volume>10</volume>
          (
          <year>2025</year>
          ). doi:
          <volume>10</volume>
          .59429/esp.v10i2.
          <fpage>3420</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bagaji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rao</surname>
          </string-name>
          ,
          <article-title>Digital fatigue in the age of screens: eye and postural strain among 18-35-year-old screen users</article-title>
          ,
          <source>International Journal of Research - GRANTHAALAYAH 13</source>
          (
          <year>2025</year>
          ). doi:
          <volume>10</volume>
          .29121/ granthaalayah.v13.
          <year>i5</year>
          .
          <year>2025</year>
          .
          <volume>6191</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Merhbene</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Nath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Puttick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kurpicz-Briki</surname>
          </string-name>
          ,
          <article-title>Burnoutensemble: Augmented intelligence to detect indications for burnout in clinical psychology</article-title>
          ,
          <source>Frontiers in Big Data</source>
          <volume>5</volume>
          (
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          . 3389/fdata.
          <year>2022</year>
          .
          <volume>863100</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mendula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gabrielli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Finazzi</surname>
          </string-name>
          , C. Dompe',
          <string-name>
            <given-names>M.</given-names>
            <surname>Delucis</surname>
          </string-name>
          ,
          <article-title>Unveiling mental health insights: A novel nlp tool for stress detection through writing and speaking analysis to prevent burnout</article-title>
          , in: AHFE International,
          <year>2024</year>
          , pp.
          <fpage>167</fpage>
          -
          <lpage>174</lpage>
          . doi:
          <volume>10</volume>
          .54941/ahfe1004653.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>O.</given-names>
            <surname>Kalyta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Radiuk</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Krak</surname>
          </string-name>
          ,
          <article-title>Facial emotion recognition for photo and video surveillance based on machine learning and visual analytics</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>13</volume>
          (
          <year>2023</year>
          )
          <article-title>9890</article-title>
          . doi:
          <volume>10</volume>
          .3390/ app13179890.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Deep</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Student burnout and mental health in higher education during covid-19: Online learning fatigue, institutional support, and the role of artificial intelligence</article-title>
          ,
          <source>Higher Education Studies</source>
          <volume>15</volume>
          (
          <year>2025</year>
          )
          <article-title>381</article-title>
          . doi:
          <volume>10</volume>
          .5539/hes.v15n2p381.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>I.</given-names>
            <surname>Krak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Didur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Molchanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Mazurets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Sobko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Zalutska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <article-title>Method for political propaganda detection in internet content using recurrent neural network models ensemble</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>3806</volume>
          ,
          <year>2024</year>
          , pp.
          <fpage>312</fpage>
          -
          <lpage>324</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-3806/S_36_Krak.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J. H.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Ostwald</surname>
          </string-name>
          ,
          <article-title>Latent dirichlet allocation (lda) topic models for space syntax studies on spatial experience</article-title>
          ,
          <source>City, Territory and Architecture</source>
          <volume>11</volume>
          (
          <year>2024</year>
          ).
          <source>doi:10.1186/s40410-023-00223-3.</source>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C. A. N.</given-names>
            <surname>Agustina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Novita</surname>
          </string-name>
          , Mustakim,
          <string-name>
            <surname>N. E. Rozanda,</surname>
          </string-name>
          <article-title>The implementation of tf-idf and word2vec on booster vaccine sentiment analysis using support vector machine algorithm</article-title>
          ,
          <source>in: Procedia Computer Science</source>
          , volume
          <volume>234</volume>
          ,
          <year>2024</year>
          , pp.
          <fpage>156</fpage>
          -
          <lpage>163</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.procs.
          <year>2024</year>
          .
          <volume>02</volume>
          .162.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chadha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Tewari</surname>
          </string-name>
          ,
          <article-title>A natural language processing model on bert and yake technique for keyword extraction on sustainability reports</article-title>
          ,
          <source>IEEE Access 1</source>
          (
          <year>2024</year>
          ). doi:
          <volume>10</volume>
          .1109/access.
          <year>2024</year>
          .
          <volume>3352742</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Krak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <string-name>
            <surname>O. Mazurets,</surname>
          </string-name>
          <article-title>The practice implementation of the information technology for automated definition of semantic terms sets in the content of educational materials</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>2139</volume>
          ,
          <year>2018</year>
          , pp.
          <fpage>245</fpage>
          -
          <lpage>254</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2139</volume>
          /
          <fpage>245</fpage>
          -
          <lpage>254</lpage>
          .pdf.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>I.</given-names>
            <surname>Kalia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>Domain adaptation for ner using mbert</article-title>
          ,
          <source>in: Lecture Notes in Networks and Systems</source>
          , Springer Nature Singapore,
          <year>2024</year>
          , pp.
          <fpage>171</fpage>
          -
          <lpage>181</lpage>
          . doi:
          <volume>10</volume>
          .1007/
          <fpage>978</fpage>
          -981-97-6992-6_
          <fpage>14</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>T.</given-names>
            <surname>Iazykova</surname>
          </string-name>
          ,
          <article-title>Bert-for-sequence-</article-title>
          <string-name>
            <surname>classification</surname>
          </string-name>
          ,
          <year>2025</year>
          . URL: https://pypi.org/project/ bert
          <article-title>-for-sequence-classification/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <article-title>Bert-sirna: sirna target prediction based on bert pre-trained interpretable model</article-title>
          ,
          <source>Gene</source>
          <volume>910</volume>
          (
          <year>2024</year>
          )
          <article-title>148330</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.gene.
          <year>2024</year>
          .
          <volume>148330</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>O.</given-names>
            <surname>Kovalchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Slobodzian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Sobko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Molchanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Mazurets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          , I. Krak,
          <string-name>
            <given-names>N.</given-names>
            <surname>Savina</surname>
          </string-name>
          ,
          <article-title>Visual analytics-based method for sentiment analysis of covid-19 ukrainian tweets</article-title>
          ,
          <source>in: Lecture Notes on Data Engineering and Communications Technologies</source>
          , volume
          <volume>149</volume>
          ,
          <year>2023</year>
          , pp.
          <fpage>591</fpage>
          -
          <lpage>607</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -16203-9_
          <fpage>33</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>M. NG</surname>
          </string-name>
          , Healthcare workers burnout,
          <year>2025</year>
          . URL: https://www.kaggle.com/datasets/mindyng/ healthcareworkersburnout/data.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>A.</given-names>
            <surname>Goldbloom</surname>
          </string-name>
          , Kaggle,
          <year>2025</year>
          . URL: https://www.kaggle.com/.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>InFamousCoder</surname>
          </string-name>
          , Mental health social media,
          <year>2025</year>
          . URL: https://www.kaggle.com/datasets/ infamouscoder/mental
          <article-title>-health-social-media.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>X.</given-names>
            <surname>Corp</surname>
          </string-name>
          .,
          <source>Twitter api documentation</source>
          ,
          <year>2025</year>
          . URL: https://developer.x.com/en/docs/x-api.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>O.</given-names>
            <surname>Sobko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Mazurets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Molchanova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Krak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <article-title>Method for analysis and formation of representative text datasets</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>3899</volume>
          ,
          <year>2024</year>
          , pp.
          <fpage>84</fpage>
          -
          <lpage>98</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3899</volume>
          /paper9.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>G. LLC.</given-names>
            ,
            <surname>Google</surname>
          </string-name>
          <string-name>
            <surname>colab</surname>
          </string-name>
          ,
          <year>2025</year>
          . URL: https://colab.research.google.com/.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>O.</given-names>
            <surname>Barmak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Mazurets</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Krak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kulias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Smolarz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Azarova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gromaszek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Smailova</surname>
          </string-name>
          ,
          <article-title>Information technology for creation of semantic structure of educational materials</article-title>
          ,
          <source>in: Proceedings of SPIE - The International Society for Optical Engineering</source>
          , volume
          <volume>11176</volume>
          ,
          <year>2019</year>
          , p.
          <fpage>1117623</fpage>
          . doi:
          <volume>10</volume>
          .1117/12.2537064.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wolf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Debut</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sanh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chaumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Delangue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cistac</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rault</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Louf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Funtowicz</surname>
          </string-name>
          , et al.,
          <article-title>Transformers: State-of-the-Art Natural Language Processing</article-title>
          ,
          <source>in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Association for Computational Linguistics</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>38</fpage>
          -
          <lpage>45</lpage>
          . URL: https: //www.aclweb.org/anthology/2020.emnlp-demos.6. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2020</year>
          .emnlp-demos.
          <volume>6</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Hunter</surname>
          </string-name>
          ,
          <article-title>Matplotlib: A 2d graphics environment</article-title>
          ,
          <source>Computing in Science &amp; Engineering</source>
          <volume>9</volume>
          (
          <year>2007</year>
          )
          <fpage>90</fpage>
          -
          <lpage>95</lpage>
          . doi:
          <volume>10</volume>
          .1109/
          <string-name>
            <surname>MCSE</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <volume>55</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <surname>M. L. Waskom</surname>
          </string-name>
          ,
          <article-title>seaborn: statistical data visualization</article-title>
          ,
          <source>Journal of Open Source Software</source>
          <volume>6</volume>
          (
          <year>2021</year>
          )
          <article-title>3021</article-title>
          . URL: https://doi.org/10.21105/joss.03021. doi:
          <volume>10</volume>
          .21105/joss.03021.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mueller</surname>
          </string-name>
          , Wordcloud,
          <year>2025</year>
          . URL: https://pypi.org/project/wordcloud/.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>S.</given-names>
            <surname>Yang</surname>
          </string-name>
          , G. Berdine, Confusion matrix,
          <source>The Southwest Respiratory and Critical Care Chronicles</source>
          <volume>12</volume>
          (
          <year>2024</year>
          )
          <fpage>75</fpage>
          -
          <lpage>79</lpage>
          . doi:
          <volume>10</volume>
          .12746/swrccc.v12i53.
          <fpage>1391</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>