<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>October</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Detecting fake news using Twitter social information</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jesús M. Fraile-Hernández</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Álvaro Rodrigo</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Centeno</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>20</volume>
      <issue>2024</issue>
      <fpage>19</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>In this paper, the aim is to study whether social information can provide useful information when classifying news. For this purpose, a set of news items in Spanish has been extended with social information. Subsequently, a classifier model has been proposed to carry out this task, mixing the social information previously extracted with the textual information of the news item. Finally, we have studied which social features are the most relevant in this task.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Social information</kwd>
        <kwd>Classifying news</kwd>
        <kwd>Classifier model</kwd>
        <kwd>Social features</kwd>
        <kwd>Fake news detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Due to the increase in communication channels in recent decades, users have access to an immense
amount of information almost instantaneously. However, it is relatively easy to fall for hoaxes or
misinformation on social media.</p>
      <p>
        Traditional models of fake news detection focus on detecting the linguistic characteristics of the news.
Subsequently, in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], pre-trained embeddings were used along with LSTM. Finally, with the emergence
of contextual models, [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] leveraged the pre-trained BERT model, to perform transferred learning and
identify the veracity of news.
      </p>
      <p>
        However, due to the dificulty even for a human to discern between true and false news, sometimes the
textual information in the news is not enough. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] it is proposed at a theoretical level the possibility
of creating a hybrid approach that incorporates the linguistic characteristics of the news and an analysis
of the networks that are formed around that news. In [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] the author uses diferent features to identify
fake news in popular Twitter threads. In [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] fake news is detected using only the extracted textual
information. Regarding hybrid models, the CSI model proposed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] performs a characterisation in
three modules: capturing, scoring and integrating. In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], a news detection model is proposed that
considers the association of user interactions, the editor’s bias and the users’ stance towards the news.
      </p>
      <p>The aim of this work is to study whether social information can provide useful information for
the detection of fake news. To this end, social information has been collected from Twitter to extend
FakeDeS, a relevant corpus of news in Spanish, and a model has been designed to include textual and
social information. Furthermore, we intend to study which social features are the most relevant for
news classification.</p>
      <p>The rest of this paper is structured as follows: Section 2 describes the datasets to be used along with
the task to be solved. Section 3 describes the methodology followed including the extraction of social
information from Twitter along with the models proposed based on the data they use. Section 4 includes
the evaluation metrics used. Section 5 then presents the results, which will be discussed in Section 6.
Finally, conclusions and future work are given in Section 7.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset and task</title>
      <p>
        The dataset we will work with is the Spanish Fake News Corpus (FakeDeS) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], which contains
publications in Spanish about diferent events that were collected from November 2020 to March 2021. Each of
these publications is labelled as true or false. Newspaper websites and fact-checking websites were
mainly used to collect the information.
      </p>
      <p>The dataset is divided into 3 files with a total of 1543 news items. Because of the methodology used,
it has been decided to merge the training and development files to obtain what we will call the training
set. Each of the news items contains information such as the topic, the name of the source, the headline,
the text and the link to the news item.</p>
      <p>The training set has a total of 971 news items, of which 480 are false and 491 are true. On the other
hand, the test set consists of 572 news items, half of which are true and half of which are false. Therefore,
we are dealing with balanced data sets.</p>
      <p>The topics covered in the training corpus are: politics, entertainment, sport, society, science, health,
economy, security and education.</p>
      <p>It should be noted that the test set has news related to Covid-19, while the training set does not
present any news related to this topic (the most similar are the health news, but in no case do they
mention Covid-19). Therefore, the models that are proposed will have to correctly classify this topic
without having seen it in the training.</p>
      <p>
        In IberLEF 2021, a shared task was proposed whose objective was to classify a series of news items
as true or false. To do so, the FakeDeS corpus described above was used. A report was published in
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], which collected the most important characteristics of the best-performing models. The results of
this task by the diferent participants can be seen in Figure 1. Among the approaches used to solve
it, the participants of the GDUFS team, the team that achieved the best accuracy, used a BERT model
and sample memory with an attention mechanism. The method consisted of taking the first and last
segments of the texts and feeding them into a BERT system, obtaining two embeddings (head and tail).
In addition, there is a matrix called ‘sample memory’, which is obtained by taking a random sample
of the head and tail embeddings; this matrix is used in an attention mechanism with the rest of the
texts. In contrast to the GDUFS_DM approach, the participants of team Haha, the second-placed team,
employed feature selection with a weighted tf-idf and a multilayer perceptron. This model not only
analysed the content of the news item, but also combined information such as the publisher of the news
item or the topic of the news item.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>This section describes the methodology used to extract social information from Twitter users. In
addition, the models trained according to the type of data they use are presented.</p>
      <sec id="sec-3-1">
        <title>3.1. Social information extraction</title>
        <p>The main objective of this work is to study the information provided by social information when
detecting fake news, and as mentioned in Chapter 1, there is no corpus in Spanish that contains this
information. This is why we decided to extract this information from the social network Twitter, using
the API provided by the platform.</p>
        <p>For each news item, we searched for those tweets that contained the headline of the news item or
the link to it. To solve the problem of the maximum length of the queries, special characters have been
eliminated from the news headlines.</p>
        <p>
          According to [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] there is a series of metadata of the tweets that allow extracting information
about whether the user may be prone to the propagation of fake news or the tweet may contain
untruthful information. Therefore, it has been decided to extract the following metadata from each of
the tweets.
        </p>
        <p>• Tweet. Text of the tweet, id of the author, id of the tweet, number of retweets, number of replies
to the tweet, number of likes, number of citations of the tweet.
• User. username (str), user creation date (date) ISO 8601, verified user (bool), number of followers
(int), number of followed (int), number of tweets (int), number of times listed (int).
We have managed to extract posts from 41.67% of the total number of news items. Of these, the
distribution of the number of tweets collected per news item shows a high concentration in the (0, 200)
interval, representing 86% of the news items. Within this interval, it is observed that true news tends
to receive more interaction. However, as the number of tweets about a news item increases, it is evident
that fake news receives a greater number of interactions. This trend can be seen in the violin diagram
presented in Figure 2.</p>
        <p>It is worth noting that, although the news is written in Spanish, there are tweets in English or French
that talk about the news. This is especially true for news related to Covid-19.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Textual models</title>
        <p>In this section, the textual methods used for the binary classification of the news items will be presented.
The full text of the news item has been used, so it has had to be preprocessed. For the non-contextual
models, urls, emoticons or non-textual expressions, stopwords, the text has been converted to lowercase
and the processes of lemmatisation and stemming have been applied. However, for the contextual
models, only the urls have been eliminated.</p>
        <p>Subsequently, 5 diferent approaches have been used.</p>
        <sec id="sec-3-2-1">
          <title>1. Vector space model based on bags of words (BoW).</title>
          <p>2. Vector space model using a weighted tf-idf.
3. Bigram counting.
4. Neural Networks and deep learning.</p>
          <p>5. Contextual models.</p>
          <p>
            For approaches 1, 2 and 3, Naive Bayes, SVM, Logistic Regression, Decision Trees and Random Forest
models have been trained. For approach 4, multilayer perceptrons with input the tf-idf weight vector,
multilayer perceptrons and convolutional networks with an embedding layer and multilayer perceptrons,
convolutional networks, LSTM, GRU and bidirectional networks with a pre-trained embedding layer.
Finally, for approach 5, the BETO model has been selected: Spanish BERT [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ] with a final classification
layer with two neurons. This model is a BERT model trained with the whole-word masking technique
on a large corpus of more than three billion Spanish words.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Models with social information</title>
        <p>The methods that use only the social information of the news collected use the following metadata for
each published tweet: number of retweets, number of replies, number of likes of the tweet, number of
quotes of the tweet, verified user, number of followers, number of followed, number of tweets of the
author, number of times the author has been listed. Then, in order to record the impact of the news
item on social networks, the number of tweets collected for this news item is added.</p>
        <p>To represent all the tweets that talk about a certain news item, an average of the previous
characteristics of each tweet has been calculated. Finally, the standard deviation of each characteristic was added.
In this way, a data matrix with 20 columns is obtained (where the column relating to the deviation of
the number of tweets of the news item is always 0).</p>
        <p>Once the feature matrix has been obtained, diferent learning models have been used with diferent
hyperparameter explorations such as Decision Trees, Random Forest, SVM, Gradient Boosting, Adaptive
Boosting, MLP,...</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Hybrid model</title>
        <p>A hybrid model has been developed that seeks to take advantage of both the textual information
provided by the text of the news item and the social information extracted from the Twitter data (both
the non-textual information of the section and the text of the tweets collected).</p>
        <p>In this model, for each news item, a specialised model is used to classify the news using social
information. For this purpose, the best model from the previous subsection (Random Forest) is selected.
With this model, for each news item, the probabilities of being true or false are extracted using as input
the corresponding row of the matrix of social characteristics with standard deviation described in that
section. In the event that no tweets could be extracted from a news item, the output would be a vector
of two zeros.</p>
        <p>
          In parallel, the text of the news item is processed using the BETO: Spanish BERT model [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. The
output is a vector of dimension 768.
        </p>
        <p>
          In parallel to these two processes, for each news item with tweets collected, the text of each tweet is
pre-processed (eliminating URLs and tokenising) and subsequently processed using the pre-trained
XLM-roBERTa-base model [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. This transformer model has been trained on a corpus of about 198
million tweets in 8 diferent languages (Spanish, Arabic, English, French, German, Hindi, Portuguese
and Italian) and is specialised in sentiment classification (positive, negative or neutral). In our case, the
last layer of the model will be removed, obtaining as output a vector of length 768 that will represent
the most relevant features of the text of the tweet.
        </p>
        <p>For each available tweet, the previous process has been carried out, obtaining a vector of length 768.
Finally, an average of all the vectors of the tweets of the news item has been made to obtain a vector
that represents the tweets of that news item. If the news item had no social information, a vector of
zeros is returned.</p>
        <p>Then, the three vectors are joined to obtain a vector of dimensionality 1538. This flowchart can be
seen in Figure 3</p>
        <p>Once all the news has been processed following the previous diagram, several models have been
trained such as Decision Trees, Random Forest, SVM, Gradient Boosting, Adaptive Boosting, MLP, ...</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>Two diferent methodologies have been used to evaluate the models, a cross-validation and an evaluation
on the test set.</p>
      <sec id="sec-4-1">
        <title>4.1. -fold cross-validation</title>
        <p>Cross-validation is one of the most widely used methods to estimate the prediction error of a model
with a given set of hyperparameters. A -fold (or -fold cross-validation) has been used. This method
divides the data set, in our case the train set together with the development set, into  equal parts
1, . . . , . For each  the model is trained using the other  − 1 parts and the error in predicting the
 data (data never seen by this model) is calculated. By doing this for the  parts we obtain a set of
errors. With these  errors we calculate their mean and variance to obtain a measure of the average
error of that model with those hyperparameters.</p>
        <p>It should be noted that this method requires a fairly large computational cost, since for a
crossvalidation of -folds it would be necessary to train  models. As a general rule, a value of 5 or 10 is
Textual models</p>
        <p>TF-IDF (RF)</p>
        <p>BoW (RF)</p>
        <p>Bigramas (RF)
MLP (Embedding)</p>
        <p>MLP (TF-IDF)</p>
        <p>CNN
BETO
GRU
usually chosen as a good compromise between bias and variance. In our case a 5-fold cross-validation
has been used.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Test set evaluation</title>
        <p>Finally, for the model that has performed best in the previous cross-validations, the test set will
be evaluated. This set will never be seen by the model and will provide a representation of the
generalisability of the model.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Evaluation metrics</title>
        <p>To evaluate the performance of our classification model, we use the F1 metric. The F1 value will be
calculated for both true and false classified news. With this, the value   -  1, or simply  1, will
be calculated as the average between the two previous values.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>In this section the results of the various trained models will be presented. For each approach in the
section 3 the following results will be shown:
• Within the training of a particular approach, the   - 1 value of the best algorithms used will
be shown. The average of the   - 1 values will be reflected using 5-fold cross-validation.
• For each approach, the model with the best   - 1 will be selected during training.
Subsequently, it will be retrained with all data and evaluated on the test set. The  1 ,  1 ,
  - 1 and the Accuracy of the model will be exposed.</p>
      <sec id="sec-5-1">
        <title>5.1. Textual models</title>
        <p>The training results of the methods described in section 3.2 are listed in Table 1.</p>
        <p>It can be seen that the non-neural models stand out from those using neural networks. This could be
due to the fact that the models being used have a large number of parameters to optimise and we have
a rather limited data set. It is worth noting that the use of pre-trained embeddings has resulted in lower
performance than training the embeddings from scratch. Also noteworthy is the poor performance
obtained with recurrent networks, models that have required a large amount of training time and are
commonly used for language processing problems. The best performing approach has been to use a
weighted tf-idf together with a Random Forest model.</p>
        <p>The results of the evaluation of this model on the test set and the results of the teams participating in
IberLEF 2021 are shown in Table 4.
Social information models</p>
        <p>Random Forest
Gradient Boosting</p>
        <p>Adaptive Boosting
Extremely Randomized Trees</p>
        <p>Decision Trees</p>
        <p>K-Nearest
Multilayer Perceptron (MLP)</p>
        <p>SVM</p>
        <p>Passive-Aggressive Classifier
Perceptron with two hidden layers
Linear Discriminant Analysis (LDA)</p>
        <p>Multinomial Naive Bayes
Perceptron with one hidden layer</p>
        <p>Bernouilli Naive Bayes
Quadratic Discriminant Analysis</p>
        <p>Logistic Regression</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Social information models</title>
        <p>The training results of the methods described in section 3.3 are collected in Table 2.</p>
        <p>We can see that the  1 of the models is quite high. Tree-based models occupy the top 5 positions in
the list. In addition, those based on clusters of trees stand out from individual decision trees. The best
performing approach was a Random Forest model. It should be remembered that this model has only
been trained and evaluated with those news items from which it has been possible to extract social
information, so the training and test set is smaller than in the rest of the cases.</p>
        <p>Due to these results, it has been decided to choose the Random Forest classifier for the social
information for the hybrid model, as indicated in section 3.4.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Hybrid model</title>
        <p>The training results of the methods described in section 3.4 are listed in Table 3.</p>
        <p>In view of the training results, any of the first 2 models would be valid for your choice. The rest of the
models have a very similar accuracy to the first three. It has been decided to select logistic regression
over decision trees since it is a simpler algorithm, with a smaller number of hyperparameters and with
a lower computational cost.</p>
        <p>The results of the evaluation of this model on the test set and the results of the teams participating in
IberLEF 2021 are shown in Table 4.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <sec id="sec-6-1">
        <title>This section presents a discussion of the results obtained.</title>
        <p>In view of the results shown in Tables 1 and 3, it can be seen that the approach that obtains the best  1
is a model that uses only textual information, more specifically a Random Forest with a weighted tf-idf.
This approach obtains a higher  1 compared to other types of models that include social information,
so that a priori it could be thought that social information does not provide relevant information.</p>
        <p>However, in Table 4, we can see how on the test set the model that uses only textual information
obtains worse results compared to the hybrid model. This is due to the fact that when using a tf-idf
weight it is possible that there are words in the corpus on which the weight is applied (training news
corpus) that do not exist in the test set. This is why models such as transformer networks pre-trained
Hybrid Model</p>
        <p>Decision Trees
Logistic Regression</p>
        <p>SVM
Linear Discriminant Analysis</p>
        <p>Random Forest</p>
        <p>Gradient Boosting
Passive-Aggressive Classifier
Adaptive Boosting (AdaBoost)</p>
        <p>Extremely Randomized Trees
Quadratic Discriminant Analysis</p>
        <p>Multilayer Perceptron (MLP)</p>
        <p>K-Nearest
Perceptron with three hidden layers
Perceptron with two hidden layers
Perceptron with one hidden layer</p>
        <p>Multinomial Naive Bayes</p>
        <p>Bernoulli Naive Bayes
on large corpora will have more generalisation capacity and, therefore, will be able to obtain better
results.</p>
        <p>Once social information is introduced into the model, a significant increase in results can be seen.
This is due to the fact that on the one hand the text is being processed using transformer models with a
very high generalisation capacity and that the non-textual social information extracted from Twitter is
the same regardless of the subject matter.</p>
        <p>Comparing the models with respect to the best classified in IberLEF 2021, Figure 1, it can be seen
that the hybrid model is the one that best classifies Fake news. This hybrid model obtains the same
Accuracy as the first ranked team.</p>
        <p>
          In addition, a study has been carried out on which social information features are the most relevant
for the model. For this purpose, the importance of the permutation set out in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] has been used. It
can be seen that 8 of the 9 most relevant features only depend on the author’s information and not
on the content or information of the tweet. These 9 features are, in order of importance: listed_count,
following_count_std, followers_count, tweet_count_std, followers_count_std, quote_count_std, verified,
verified_std, tweet_count. In addition, within these characteristics, the information provided by those
obtained from the standard deviation of the set of tweets collected for each news item stands out.
        </p>
        <p>The percentage of importance of the most relevant features used in the logistic regression of the
hybrid model has also been calculated. To calculate the importance of each feature, , the coeficients
of the regression, , have been extracted and the following operation has been carried out  =  .
Finally, the percentage of each of them has been calculated. With this, the most relevant characteristic
for the model, with 10 times more importance over the rest, was the variable that corresponds to the
probability returned by the Random Forest that a news item is true using the social information of the
news item.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions and Future Work</title>
      <p>Throughout the development of this work, it has been observed how the introduction of social
information, combined with textual information, has enabled the classification of news, helping to improve the
performance of the models. This suggests that, when solving a problem, it would be useful to add social
information to the dataset. However, obtaining this information is quite costly both economically and
in terms of time.</p>
      <p>Additionally, the importance of social features in classifier models has been studied, concluding that
author-related features are more important than tweet-related features. The development of a model
that combines all textual and social features achieves similar or better results than models that use only
textual information.</p>
      <p>However, it is crucial to acknowledge several important limitations:
• Impractical Approach: Many of the social signals being harvested are post-facto. While
disinformation might actually be spreading, many features (such as the number of reposts) would
not have stabilized. Thus, while the current approach of augmenting these signals might work
post-facto, it is unlikely to work with live data. Even post-facto, it is unclear whether the approach
will scale.
• Flawed Methodology: The use of balanced training data, and a small set of data at that, is not
meaningful. Particularly, it is unclear how learning from such a small corpus would generalize
when new kinds of disinformation arise. In practice, the distribution of disinformation-carrying
articles compared to genuine ones is far from balanced. Therefore, any realistic methodology
needs to incorporate the ability to handle imbalance and transferability from the learning phase.
Moreover, adversary behavior might change to emulate the features of good articles or at least
stray away from its current behavior, rendering the specific features used for classification
obsolete.
• Too Static and Small Dataset: The dataset used is too static and small, and lacks adequate diversity
to consider any results conclusive. A variety of distinct datasets ought to be used to determine if
the ideas actually work in a more general setting.</p>
      <p>As a line of future work, it would be a good approach not only to study the individual social metadata
of each user, but also to study a social graph of the followers or followers to see the social relationships
that exist between them. Additionally, the dataset should be expanded and diversified, and methods
should be developed to handle imbalanced data and adapt to changing adversary behavior.</p>
      <p>We acknowledge that this work, while preliminary, can trigger useful discussions and provides a
foundation upon which more robust and scalable approaches can be built in the future.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments References</title>
      <p>This work was supported by the HAMiSoN project grant CHIST-ERA-21-OSNEM-002, AEI
PCI2022135026-2 (MCIN/AEI/10.13039/501100011033 and EU “NextGenerationEU”/PRTR).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bharadwaj</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <article-title>Fake news detection with semantic features and text mining</article-title>
          ,
          <source>International Journal on Natural Language Computing (IJNLC)</source>
          Vol
          <volume>8</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Kaliyar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Goswami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Narang</surname>
          </string-name>
          , Fakebert:
          <article-title>Fake news detection in social media with a bert-based deep learning approach</article-title>
          ,
          <source>Multimedia tools and applications 80</source>
          (
          <year>2021</year>
          )
          <fpage>11765</fpage>
          -
          <lpage>11788</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>N. K.</given-names>
            <surname>Conroy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. L.</given-names>
            <surname>Rubin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Automatic deception detection: Methods for finding fake news</article-title>
          ,
          <source>Proceedings of the association for information science and technology 52</source>
          (
          <year>2015</year>
          )
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Buntain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Golbeck</surname>
          </string-name>
          ,
          <article-title>Automatically identifying fake news in popular twitter threads</article-title>
          ,
          <source>in: 2017 IEEE International Conference on Smart Cloud (SmartCloud)</source>
          , IEEE,
          <year>2017</year>
          , pp.
          <fpage>208</fpage>
          -
          <lpage>215</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Albahar</surname>
          </string-name>
          ,
          <article-title>A hybrid model for fake news detection: Leveraging news content and user comments in fake news</article-title>
          ,
          <source>IET Information Security</source>
          <volume>15</volume>
          (
          <year>2021</year>
          )
          <fpage>169</fpage>
          -
          <lpage>177</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Ruchansky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Seo</surname>
          </string-name>
          , Y. Liu,
          <article-title>Csi: A hybrid deep model for fake news detection</article-title>
          ,
          <source>in: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>797</fpage>
          -
          <lpage>806</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Shu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          , H. Liu,
          <article-title>Exploiting tri-relationship for fake news detection</article-title>
          ,
          <source>arXiv preprint arXiv:1712.07709 8</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.-P.</given-names>
            <surname>Posadas-Durán</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gómez-Adorno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sidorov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J. M.</given-names>
            <surname>Escobar</surname>
          </string-name>
          ,
          <article-title>Detection of fake news in a new corpus for the spanish language</article-title>
          ,
          <source>Journal of Intelligent &amp; Fuzzy Systems</source>
          <volume>36</volume>
          (
          <year>2019</year>
          )
          <fpage>4869</fpage>
          -
          <lpage>4876</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Gómez-Adorno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Posadas-Durán</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Enguix</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. P.</given-names>
            <surname>Capetillo</surname>
          </string-name>
          , Overview of fakedes at iberlef 2021:
          <article-title>Fake news detection in spanish shared task</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>67</volume>
          (
          <year>2021</year>
          )
          <fpage>223</fpage>
          -
          <lpage>231</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Canete</surname>
          </string-name>
          , G. Chaperon,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fuentes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-H.</given-names>
            <surname>Ho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pérez</surname>
          </string-name>
          ,
          <article-title>Spanish pre-trained bert model and evaluation data</article-title>
          ,
          <source>Pml4dc at iclr</source>
          <year>2020</year>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cañete</surname>
          </string-name>
          , G. Chaperon,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fuentes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-H.</given-names>
            <surname>Ho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pérez</surname>
          </string-name>
          ,
          <article-title>Spanish pre-trained bert model and evaluation data</article-title>
          ,
          <source>in: PML4DC at ICLR</source>
          <year>2020</year>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>F.</given-names>
            <surname>Barbieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Espinosa-Anke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Camacho-Collados</surname>
          </string-name>
          ,
          <article-title>Xlm-t: Multilingual language models in twitter for sentiment analysis and beyond</article-title>
          ,
          <source>Proceedings of the LREC</source>
          , Marseille, France (
          <year>2022</year>
          )
          <fpage>20</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Altmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Toloşi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Sander</surname>
          </string-name>
          , T. Lengauer,
          <article-title>Permutation importance: a corrected feature importance measure</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>26</volume>
          (
          <year>2010</year>
          )
          <fpage>1340</fpage>
          -
          <lpage>1347</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>