<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Synthesis of model features for fake news detection using large language models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adam Wierzbicki</string-name>
          <email>adamw@pjwstk.edu.pl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrii Shupta</string-name>
          <email>andrii.shupta@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olexander Barmak</string-name>
          <email>alexander.barmak@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Khmelnytskyi National University</institution>
          ,
          <addr-line>11, Instytuts'ka str., Khmelnytskyi, 29016</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Polish-Japanese Academy of Information Technology</institution>
          ,
          <addr-line>Koszykowa 86 str. 02-008, Warsaw</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In recent years, the issue of fake news has become exceedingly pertinent due to its rapid dissemination through social media and online platforms. Detecting fake news requires the utilization of various methods and resources. This work proposes an approach to building a model for fake news detection using generative artificial intelligence and natural language processing. The main focus of the proposed approach is the synthesis of model features for fake news detection using generative artificial intelligence and Natural Language Processing. The use of this proposed approach not only facilitates the detection of fake news but also renders the detection process transparent and customizable by the user. Additionally, through the proposed method, users gain the ability to expand features and train the system to adapt to new types of fake news and their variations. Experimental results, presented both qualitatively through visual analytics and quantitatively through statistical indicators, convincingly demonstrate the effectiveness of the proposed approach in detecting fake news with satisfactory accuracy levels reaching 90% and provide users with sufficient interpretability of the obtained decisions. Overall, this research aims to create an approach for detecting fake news that may have a significant impact on addressing this issue in contemporary society.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Fake news</kwd>
        <kwd>Fake news detection</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>LLM 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In modern society, the spread of fake news has become a serious challenge due to its rapid
dissemination on social media and other online platforms. Dissemination of false or
deceptive information can significantly influence individuals' perceptions and beliefs.
Therefore, detecting and combating fake news has become an important task that requires
the application of various methods and strategies to accurately separate true information
from manipulative content.</p>
      <p>
        Social media has become the primary source of news, especially among the youth [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
However, with the increasing popularity of this method of information consumption, the
prevalence of misinformation, including false information and unsubstantiated claims, is
also growing. Unfortunately, many of these platforms lack reliable mechanisms for verifying
users and their publications, which contributes to the spread of false information. This
misinformation may contain propaganda aimed against individuals, society, organizations,
or political parties. Given the vast amount of content on social media platforms, detecting
all instances of fake news becomes challenging, underscoring the relevance of automated
machine learning classifiers.
      </p>
      <p>
        Various social media platforms exist today where users can post and share news online.
However, many of these platforms lack reliable mechanisms for verifying users and their
publications, which contributes to the spread of false information [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This misinformation
may contain propaganda aimed against individuals, society, organizations, or political
parties. Given the vast amount of content on social media platforms, detecting all instances
of fake news becomes challenging, underscoring the relevance of automated machine
learning classifiers.
      </p>
      <p>
        Typically, methods for detecting fake news are trained on data available at the time of
training, and this data may become outdated in the future, hence new approaches such as
the use of Large Linguistic Models (LLM) have emerged [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Building on previous experience [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], this research proposes an approach to synthesizing
models for detecting fake news based on transparent and interpretable features obtained
using generative artificial intelligence and Natural Language Processing from news texts.
This approach allows the system to be adaptive to changes in fake news and to expand its
capabilities for detecting new forms of misinformation. The research proposes an
integrated approach that combines the effectiveness and power of Large Linguistic Models
(LLM) with the transparency and interpretability of machine learning models. This allows
for the synthesis of model features for detecting fake news using responsible and
understandable AI principles based on ethics and transparency.
      </p>
      <p>The main contributions of this work are:
•
•
a method for synthesizing model features for detecting fake news using generative
artificial intelligence and Natural Language Processing
obtaining features for detecting fake news using the proposed method and
successful validation of the obtained model, which demonstrated its ability to detect
fake news qualitatively through visual analytics and quantitatively through
statistical indicators (accuracy exceeding 90%)</p>
      <p>The article is structured as follows. In the Related Work section, research related to fake
news detection is reviewed, their advantages and disadvantages are analyzed, and the
research objective is formulated. The Methods and Materials section presents the
theoretical foundations of the proposed approach to feature synthesis using generative
artificial intelligence and natural language processing. The Results and Discussion section
presents the experimental results under the proposed approach and discusses them.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        This section discusses research dedicated to detecting fake news conducted using various
methods and techniques. In the work [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], a new method for detecting fake news was
presented, based on combining different features, including textual and user-based, and
utilizing deep learning models. Algorithms such as convolutional neural network (CNN)
extensions to graphs allow for the combination of dissimilar data types, resulting in an
accuracy of 92.7%.
      </p>
      <p>
        Another study [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] focuses on using linguistic features. They utilized a dataset consisting
of two sets containing an equal number of true and fake news articles with a political theme.
Text fields were used to extract linguistic and stylistic features and build bag of words TF
and BOW TF-IDF vectors. Various machine learning models, including bagging and boosting
methods, were then applied, achieving the highest level of accuracy.
      </p>
      <p>
        In the study [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], two machine learning algorithms using character and word n-gram
analysis to detect fake news were evaluated. Experimental results showed that character
ngram analysis combined with Term-Frequency-Inverted Document Frequency (TF-IDF)
yielded better results, with accuracy reaching 96%.
      </p>
      <p>
        The work [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposes a model based on theoretical principles for detecting fake news,
examining news content at various levels, including lexicon, syntax, semantics, and
discourse. Recognized theories in the fields of social and forensic psychology are used to
represent news at each level and conduct fake news detection within the trained machine
learning model. As an interdisciplinary study, their work aims to study potential patterns in
fake news, improve interpretability in creating fake news features, and investigate
relationships between fake news, deception/disinformation, and strategies aimed at
increasing views.
      </p>
      <p>Based on the analysis of related works, various shortcomings in approaches can be
identified. One of them is the insufficient quality of the data on which the model is based. If
the model is trained on incorrect or inadequate data, it may misclassify news. Another factor
is the speed of news dissemination on the Internet. Fake news can quickly gain popularity
and spread faster than any model can detect them. It is also important to consider that fake
news may contain some truthful information, making their detection more challenging.
Another reason is the evolution of technologies and approaches to creating fake news. Over
time, new technologies emerge, allowing for the creation of more convincing fake news, and
models created to detect previous versions of fake news may be ineffective.</p>
      <p>Therefore, the main goal of this work is to develop and investigate an approach that
allows for the detection and synthesis of features for machine learning models using
artificial intelligence and natural language processing. The mentioned approach aims at the
ability to detect fake news, train on new data, and improve accuracy of analysis. Considering
the increasing importance of trust in the results of artificial intelligence algorithms,
emphasis is placed on creating mechanisms that ensure the responsibility and effectiveness
of analytical models.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methods and materials</title>
      <p>In order to move from subjective to objective evaluation of news for fakeness, it is necessary
to propose an approach to the synthesis of models that would contain objective and such
that can be calculated, indicators indicating the presence or absence of features of fake
news. In addition, such an approach is necessary for the possibility of updating the model,
since the mechanisms of creating fake news are constantly being improved.</p>
      <p>Based on the fact that the essence of the proposed approach is the analysis of texts
without a clear understanding of which features may indicate misinformation, it is proposed
to use the capabilities of Large Linguistic Models (LLMs) in addition to empirical (linguistic,
psychological, etc.) analysis. That is, a mechanism is needed that will allow the parameters
from the analysis to be "determined". That is, by linguistic and logical analysis with the help
of LLMs, including, identifying key factors that can influence the identification of fake news.
The main efforts are focused on natural language searches, taking into account Internet
resources, articles and examples, with the aim of understanding how they can display
features of fake news.</p>
      <p>
        In order to have such a tool, we used a new aspect - Prompt Engineering. Hint
engineering is a groundbreaking approach that is revolutionizing the field of artificial
intelligence and natural language processing. This is an important element that provides
hints for managing Language Models (LMs) and Large Language Models (LLMs) such as
GPT-X [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and LLama [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Cue engineering provides specific input-output pairs for LLMs,
increasing their efficiency, accuracy, and safety in various tasks.
      </p>
      <p>
        Developers of LLMs (eg ChatGPT (OpenAI) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]) provide helpful guidance on how to best
use prompt engineering. After analyzing them, it is possible to develop application
mechanisms for building prompts-requests for identifying the necessary features in the
texts.
      </p>
      <p>In AI speech generation systems, prompts gather information about the user's speech
and create new speech samples that were not previously known. Query elements can
include natural language, code, and multimodal prompts that consider images, videos, and
other media. In our case, at the stage of synthesis of the model and its initial parameters, it
is suggested that only the news text, or part of it, was used as an input element.</p>
      <p>Therefore, in addition to identifying features of fake news by empirical means (linguistic,
psychological, etc.), it is worth applying the capabilities of LLMs. Next, we will consider how
it is proposed to be implemented.</p>
      <sec id="sec-3-1">
        <title>3.1. Preliminary recommendations for the use of the mechanism of engineering prompts (Prompt Engineering) to detect features of fake news</title>
        <p>Let's define a technique (a set of interconnected methods) for detecting features of fake
news by means of hint engineering using LLMs.</p>
        <p>
          Before writing prompts, you need to determine an effective approach for their creation
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. Prompts should have several key characteristics to increase their effectiveness
specifically for detecting features of fakeness. It is important to specify the so-called
instructions before (at the beginning of) the request. It is also important to correctly define
the desired response format. That is:
•
•
•
        </p>
        <p>Instructions should be placed at the beginning of the request. Special characters
must be used to separate instruction and context sections.</p>
        <p>Instructions should be placed at the beginning of the request. Special characters
must be used to separate instruction and context sections.</p>
        <p>Instructions should be specific, descriptive, and as detailed as possible regarding the
desired context, outcome, scope, format, style, etc.</p>
        <p>You need to determine the desired output format with the help of examples.</p>
        <p>Next, in the dialogue mode with ChatGPT, using the experience gained from analyzing
articles on the Internet, the main (often mentioned) criteria of fakeness are selected. Asking
questions like:</p>
        <sec id="sec-3-1-1">
          <title>1. what is most important in defining news?</title>
          <p>2. what does this or that parameter mean?
3. how does it affect fakeness?</p>
          <p>In the given way, you can get an initial set of basic criteria by which you can then try to
create and test fakeness classifiers.</p>
          <p>
            You can use both basic models (for example, gpt-3.5 [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ]) and more advanced models
(for example, gpt-4 model [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ]), which will allow adding to the analysis, in addition to texts,
images, audio-video files and fact-checking in real time.
          </p>
          <p>In fig. 1 shows an example of the initial prompt for the problem under consideration.</p>
          <p>Next, for each found parameter, you need to understand the possibility of their
determination in numerical form. It can be, for example, a fragment of program code using
the methods of the corresponding NLP libraries, etc.</p>
          <p>By analyzing different sources, you can get an understanding of what fake news often
uses:</p>
          <p>
            Paraphrasing to change the form of statements and avoid direct quotation. For example,
in the article [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ], the authors indicate that more than 70% of fake news contain
paraphrasing elements.
          </p>
          <p>Subjective words. Fake news tries to avoid explicit expression of the author's opinion
and emotional judgments. Using ChatGPT, it is possible to emulate such an approach and
determine how subjective words can affect the degree of trust in news. Phrases with
subjectivity, such as "my own opinion," can serve as features of fake news.</p>
          <p>Expressions that can cause a feeling of alienation or underestimation of human
individuality. This may include using language that objectifies or creates the impression of
hostility towards specific groups of people.</p>
          <p>Headings that do not reflect the true content of the text. Using the LLM model, one can
compare heading and content structures, paying attention to similarities.</p>
          <p>Atypical or obscene speech can influence the definition of fake news. By analyzing
atypical speech, it is possible to conduct a comparative analysis of fake news and real news
and understand how it affects.</p>
          <p>Tone of the text (emotional shades in the sentences). Fake news can use a tone aimed at
creating negative emotions.</p>
          <p>In addition to the above, it is also possible to aggregate various factors. It is possible to
use the results of fact-checking to assess the reliability of the information provided.</p>
          <p>To build a dynamic system, it is important to generate new parameters and use
combinations of existing models. The user should be able to add parameters to the existing
ones and form new prompts, which will be taken into account as additional weight
coefficients for determining fake news.</p>
          <p>So, the above are preliminary recommendations for using the proposed method of
synthesizing model parameters to detect features inherent in fake news. Next, we present
the main steps of the proposed method of model synthesis by means of generative artificial
intelligence. Separately, we will repeat that it is possible to identify features empirically,
using the approaches of linguistic and psychological analysis of texts. It is worth combining
these approaches.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. The main steps of the method</title>
        <p>We will present the main steps of the proposed method of synthesis of model features by
means of generative artificial intelligence.</p>
        <p>Formally, you need to find the mapping:
 →  ,
(1)
where T is a set of news texts, O is a set of objective features that can be uniquely
determined from the text and presented in numerical form immediately, or later, using the
tools of NLP libraries.</p>
        <p>The following is a sequence of steps for obtaining a display (1).
1. Input information. News or news that cause suspicion of fakeness.
2. The user asks a question in an arbitrary form, what is suspicious in the news.
3. The user receives an answer, possibly from several elements that can affect the
fakeness of the news.
4. The user selects a parameter, and creates or adds to an existing prompt, for example,
in the form:</p>
        <p>Additionally, add THIS parameter to calculate fake news ratio from 0 to 1. The more of
THIS we have in the text, the more we consider this news as a fake. Output explanation.</p>
        <p>It is important to describe what this parameter does so that the system correctly forms
and adds it.
5. The system converts an arbitrary format to a new parameter of an existing input
prompt:</p>
        <sec id="sec-3-2-1">
          <title>THIS_parameter: this will detect how much it is a fake news {</title>
          <p>“THIS_parameter_ratio_and_result”,
}</p>
          <p>The system gives the final result taking into account the free parameters.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>6. Output information. The initial information of the method is:</title>
          <p>•
•</p>
          <p>A prompt to a specific LLM that outputs the value of a feature in numeric form. Such
a prompt can be used in the following methods to determine a specific feature vector
element for a machine learning model.</p>
          <p>A prompt to a particular LLM that outputs the result verbally, but the researcher
understands how it can be digitized using NLP libraries or otherwise.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. An example of using the method</title>
        <p>
          We will give an example of using the method. The input information (original taken from
article [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]) is: "Bill Gates will send money to everyone who clicks on the link he drops." It
is obvious that this is a fake, and the picture and message are not real (picture 2 shows an
example from a fake news site).
        </p>
        <p>1. The user asks ChatGPT a question in an arbitrary form, about what is suspicious in
the news (Fig. 3).
2. The user receives an answer from several elements that can affect the fakeness of
the news (Fig. 4).
3. The user selects a parameter and adds a request to the system in the form:
“
}
”
…
{
“Additionally, add unusual_content parameter to calculate fake news ratio from 0 to 1.
The more of unusual content we have in the text, the more we consider this news as a
fake. Output explanation.”
4. The system converts an arbitrary format to a new parameter of an existing input
prompt:
unusual_content_parameter: this will detect how much it is a fake news depending
if the content is expected for parties to work with.
5. The system gives the final result taking into account the free parameters:
"...The news is fake. Bill Gates would not ask for personal information for a reward…
There was no mention of such messages, and he has 1 official account” etc.</p>
        <p>Therefore, the results obtained using the given method of synthesis of model parameters
by means of generative artificial intelligence, together with the results obtained by the
traditional method of identifying features (empirically, using the approaches of linguistic
and psychological analysis of texts) will serve as input information for the method of
forming a vector of features for machine learning models for detection of fake news.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and discussion</title>
      <sec id="sec-4-1">
        <title>4.1. Synthesis of features</title>
        <p>During interactive sessions with ChatGPT, six signature features were identified that show
promising potential in identifying suspicious content, particularly in the area of fake news.
These features were observed and refined during natural language interactions, which
allowed us to form a preliminary set of criteria for the effective detection of deceptive or
misleading information in text format. This section presents research findings, revealing the
importance of each identified feature and their overall impact on improving text-based fake
news detection capabilities. The following discussion follows from these findings and
clarifies the significance of the features found in the broader plan of detecting fake news.
1. The paraphrase rate is a key parameter in our analysis, providing information about
the degree of paraphrasing in a given text. During our studies of various texts, we
found cases where certain information was presented in a way that deviated
significantly from its original statement.</p>
        <p>Figure 5 shows the idea of the algorithm that was used to implement the code.
2. Subjective Words Ratio is calculated by determining the proportion of subjective
words within a given text. The significance of this ratio lies in assessing the
subjectivity of the language used, which can impact the overall tone and bias of the
text.
3. The Header-Summary Similarity Ratio is a metric used to evaluate the similarity
between the header (title) and the summary (abstract) of a document. It measures
how closely related the content of the summary is to the main topic or theme
indicated by the header.
4. The Unusual Inappropriate Language Ratio is a parameter designed to quantify the
presence of language that is either uncommon or inappropriate within a given text.
Detecting such language is essential for assessing the overall tone and potential
biases in the content
5. The Average Sentiment Ratio is a parameter that measures the overall sentiment
expressed in a given text. Sentiment analysis helps assess whether the text conveys
positive, negative, or neutral emotions. Calculating this ratio is essential for
understanding the emotional tone of the content.</p>
        <p>Including the Average Sentiment Ratio in our analysis is crucial for understanding the
emotional context of the text. By considering sentiment, we gain insights into the overall
tone, which can be valuable for evaluating the potential bias, subjectivity, and the
persuasive nature of the content. This parameter contributes to a more comprehensive
assessment of the text's characteristics in the context of fake news detection.
6. the next parameter Fact Checking Ratio we were able to determine only with the
help of LLM, since LLM have access to the context and understand whether the facts
really correspond to reality.</p>
        <p>Without using LLM, we would need to use external resources or databases for validation.
This step involves queries to reliable fact-checking databases or APIs to cross-check
information. At this stage, we are satisfied with obtaining information using the LLM.</p>
        <p>There were also intermediate and additional indicators that had a smaller impact on the
results. Among them were such as dehumanizing language ratio, awkward text ratio and a
separate definition of positive or negative sentiments. These parameters can be included to
process, for example, tweets, where they can play a greater role in determining the "mood"
of commenters.</p>
        <p>A number of experiments were conducted to test the proposed approach and evaluate
the validity of the feature vector. Below are their results and discussion. A description of the
dataset used in the experiments is given. The result of the application of visual analytics to
assess the ability of the proposed features of fake news texts to be divided into two classes
is given. Visual and numerical (statistical metrics) results of classifier training (using SVM)
are given. The discussion was carried out and the prospects of the proposed approach were
given.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Dataset</title>
        <p>
          The dataset [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] has over 20000 true and fake news labeled and categorized. It is very
popular among the data science community and has been used in many articles and works.
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Visual analysis with MDS</title>
        <p>The results of applying the MDS method to input data (generalized features of news texts)
in 2-dimensional space are shown in Figure 10.</p>
        <p>As can be seen from Figure 10, the result is satisfactory, the classification was successful
for a larger number of texts from the training set. Analysis of a small number of misclassified
texts showed that there are true articles written with poorer text quality and vice versa.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Classification with SVM</title>
        <p>After calculating the MDS, we can pass a value to the train_test_split method to split the data
into training and test samples. Using SVM methods from the scikit-learn library, we
obtained the following results:</p>
        <p>After the number of news articles went over 2000, the results became consistent and we
could consider it average for the whole dataset.</p>
        <p>The obtained numerical results show the high accuracy of the proposed approach for
determining fake news. The given values of the statistical metrics are either in the range or
even better than the published modern results of other researchers.</p>
        <p>
          Also, in Figure 11 and 12, we can outline the decision boundary [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. The boundary is
determined by the support vectors, which are the data points closest to the hyperplane. The
SVM then uses this decision boundary to classify new, unlabeled data points based on which
side of the boundary they fall on.
        </p>
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Limitations</title>
        <p>The main limitation of the proposed approach can be attributed to:
•
lack of high-quality annotated training datasets (especially for the Ukrainian
language) for successful classifier training and
•
lack of generalized text features (to identify more hidden ways of creating fake
news).</p>
        <p>These limitations are not critical because the proposed model parameter synthesis
method is capable of incorporating new datasets and new generalized features for
retraining.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The paper proposes an approach to the synthesis of features of models for detecting fake
news using natural language processing techniques and machine learning algorithms. A
thorough review of related works was conducted to ensure the novelty and effectiveness of
the proposed approach. Six different parameters (common text features) were used for text
modeling, and multidimensional scaling (MDS) was applied to obtain visual analytics as one
of the criteria for evaluating the quality of the proposed approach. A support vector machine
(SVM) classifier was trained to classify text into fake and non-fake news categories. The
research results show that the proposed approach is in the same range or surpasses the
existing methods in terms of accuracy (overall accuracy is more than 90%).</p>
      <p>Limitations of the proposed approach include the lack of high-quality annotated datasets
(especially for the Ukrainian language) to successfully train the classifier and the
insufficiency of generalized text features (to detect more hidden ways of creating fake
news). These limitations are not critical because the proposed model parameter synthesis
method is capable of incorporating new datasets and new generalized features for
retraining.</p>
      <p>Future improvements of the approach will be aimed at increasing the accuracy of
detecting fake news and achieving greater interpretability and understanding of the
classification results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Shreya</given-names>
            <surname>Ghosh</surname>
          </string-name>
          , Prasenjit Mitra,
          <article-title>Catching Lies in the Act: A Framework for Early Misinformation Detection on Social Media</article-title>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .1145/3603163.3609057.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Qin</given-names>
            <surname>Zhang</surname>
          </string-name>
          <article-title>a, Zhiwei Guo a, Yanyan Zhu b</article-title>
          , Pandi Vijayakumar c, Aniello Castiglione d, Brij B.
          <string-name>
            <surname>Gupta</surname>
          </string-name>
          ,
          <article-title>A Deep Learning-based Fast Fake News Detection Model for CyberPhysical Social Services</article-title>
          ,
          <year>2023</year>
          . URL: https://www.sciencedirect.com/science/article/abs/pii/S0167865523000569?via% 3Dihub
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Canyu</given-names>
            <surname>Chen</surname>
          </string-name>
          , Kai Shu,
          <string-name>
            <surname>Can</surname>
            <given-names>LLM</given-names>
          </string-name>
          -Generated Misinformation Be Detected?.
          <source>Canyu Chen</source>
          ,
          <string-name>
            <given-names>Kai</given-names>
            <surname>Shu</surname>
          </string-name>
          .
          <year>2023</year>
          . URL: https://arxiv.org/abs/2309.13788
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Krak</surname>
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barmak</surname>
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manziuk</surname>
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kulias</surname>
            <given-names>A</given-names>
          </string-name>
          .
          <article-title>Data Classification Based on the Features Reduction and Piecewise Linear Separation</article-title>
          ,
          <source>Advances in Intelligent Systems and Computing</source>
          Volume
          <volume>1072</volume>
          , Pp.
          <fpage>282</fpage>
          -
          <lpage>289</lpage>
          ,
          <year>2020</year>
          , DOI: 10.1007/978-3-
          <fpage>030</fpage>
          -33585-4_
          <fpage>28</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Federico</given-names>
            <surname>Monti</surname>
          </string-name>
          , Fabrizio Frasca, Davide Eynard, Damon Mannion, Michael M.
          <article-title>Bronstein: Fake News Detection on Social Media using Geometric Deep Learning</article-title>
          , https://doi.org/10.48550/arXiv.
          <year>1902</year>
          .06673
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Mayank</given-names>
            <surname>Kumar</surname>
          </string-name>
          <article-title>Jain; Dinesh Gopalani; Yogesh Kumar Meena; Rajesh Kumar: Machine Learning based Fake News Detection using linguistic features and word vector features</article-title>
          , https://ieeexplore.ieee.org/document/9376576
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Hnin</given-names>
            <surname>Ei</surname>
          </string-name>
          <string-name>
            <given-names>Wynne</given-names>
            , Zar Zar Wint:
            <surname>Content Based Fake News Detection Using N-Gram</surname>
          </string-name>
          <string-name>
            <surname>Models</surname>
          </string-name>
          : https://dl.acm.org/doi/10.1145/3366030.3366116
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Xinyi</given-names>
            <surname>Zhou</surname>
          </string-name>
          , Atishay Jain ,
          <string-name>
            <surname>Vir</surname>
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Phoha</surname>
          </string-name>
          , Reza Zafarani,
          <article-title>Fake News Early Detection: A Theory-driven Model</article-title>
          , https://dl.acm.org/doi/10.1145/3377478
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>GPT-models</surname>
            <given-names>list</given-names>
          </string-name>
          , URL: https://platform.openai.com/docs/models
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>LLaMa</surname>
            <given-names>LLM</given-names>
          </string-name>
          , Meta, URL: https://llama.meta.com
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Prompt</surname>
            <given-names>engineering</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Open</surname>
            <given-names>AI</given-names>
          </string-name>
          , URL: https://platform.openai.com/docs/guides/prompt-engineering.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <article-title>Best practices for prompt engineering with the OpenAI API</article-title>
          , URL: https://help.openai.com/en/articles/6654000-best
          <article-title>-practices-for-promptengineering-with-openai-api</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>[13] GPT-3</source>
          .5, URL: https://platform.openai.com/docs/models/gpt-3-5
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[14] GPT-4</source>
          , URL: https:/ /platform.openai.com/docs/models/gpt-4
          <string-name>
            <surname>-</surname>
          </string-name>
          and-gpt-4-turbo
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <article-title>How to Identify Fake News</article-title>
          , URL: https://www.kaspersky.com/resourcecenter/preemptive-safety/
          <article-title>how-to-identify-fake-news.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Fake</surname>
            <given-names>News</given-names>
          </string-name>
          : Examples, URL: https://library-nd.libguides.com/fakenews/examples
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Clement</surname>
            <given-names>Bisaillon</given-names>
          </string-name>
          ,
          <source>Fake and real news dataset</source>
          ,
          <year>2019</year>
          , https://www.kaggle.com/datasets/clmentbisaillon/fake-and
          <article-title>-real-news-dataset</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>SVM</given-names>
            <surname>Decision Boundary</surname>
          </string-name>
          , https://scikitlearn.org/0.18/auto_examples/svm/plot_iris.html
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>