<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Urdu Fake News Detection Using Ensemble of Machine Learning Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Asha Hegde</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hosahalli Lakshmaiah Shashirekha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Mangalore University</institution>
          ,
          <addr-line>Mangalore</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>False information or fake news has the potential to mislead the public, damage the social order, undermine government legitimacy, and pose a major threat to societal stability. Fake news is not new, but the spread of fake news is only accelerated to a wider audience through social media causing more damage to the society. As a result, early detection of fake news on social media platforms is gaining importance day by day. Concurrently, the dificulty of swiftly identifying the fake news in various languages on social media is becoming more significant as the global use of the internet grows with the influence of the availability of ambiguous information. The majority of the fake news detection models focus on resource-rich languages like English and Spanish. Due to lack of bench marked corpus, fake news detection in languages like Urdu and many Indian languages have garnered very little attention. However, few workshops and shared tasks are being organized to promote fake news detection in resource-poor languages. One among them is UrduFake 2021 - a shared task at FIRE (Forum for Information Retrieval Evaluation) 2021 that encourages researchers to develop working models for detecting fake news in Urdu - a resource-poor language. In this paper, we - team MUCS, describe the ensemble of four Machine Learning (ML) classifiers, namely: Random Forest (RF), Multilayer Perceptron (MLP), Gradient Boosting (GB) and Adaptive Boosting (AB), submitted to UrduFake 2021. Using word uni-grams, character n-grams and fastText Urdu word vectors as features to train the ensembled classifier, the proposed model obtained macro F1-score of 0.552, accuracy of 0.713, and 12th rank in the shared task.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In the era of internet, social media and blogs have become the key sources of news, be it real
or fake. With the widespread use of smartphones and the ease of access and freedom to share
the content on social media, people are no longer restricted only to be the consumers of news,
but are also playing a major role as news producers. As a result, several real/ cooked up/ fake
news could be instantly and initially reported on social media even without checking the facts
and figures, before they could appear on traditional media [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Miscreants/ culprits are taking
undue advantage of the technology and spreading fake news on social media platforms. The
anonymity of users on online platforms provide a significant chance for fake news spreaders to
manipulate peoples’ thoughts, social trust and generate wrong opinions [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Fake news also
has the potential to undermine society’s unity while also having a detrimental influence. For
example, Russia developed phoney accounts and social bots in order to promote misleading tales
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. When public opinion is required for any event, fake news thrives and becomes widespread
on social media. Following the 2016 presidential election in the United States, false news and its
consequences are widely addressed [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Manually detecting the ever-increasing amount of fake news is time-consuming and
errorprone. Further, social media text is often noisy and does not adhere to the syntax of any one
language. Hence, there is a huge need for the techniques to detect the fake news eficiently
and automatically [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Majority of the proposed fake news detection tasks have focused on
resource-rich languages such as English and Spanish. Due to lack of annotated data,
resourcepoor languages such as Urdu and many Indian native languages have got very less attention.
Of late, few shared tasks and workshops are being organized to promote fake news detection
in resource-poor languages like Urdu [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In order to boost up the text processing activities in
Urdu language, UrduFake 2021 - a shared task at FIRE 2021 aims at distinguishing the real news
from the fake news in Urdu [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. This shared task can be modeled as a binary Text Classification
(TC) task as there are only two labels, Fake and Real.
      </p>
      <p>Researchers have developed a slew of tools and algorithms for TC in general. However, an
algorithm that performs well for one dataset may not exhibit the similar performance on other
datasets. Therefore, claiming that an algorithm or a model performs well on all the datasets is
illogical. In this paper, we - team MUCS, describe the model submitted to UrduFake 2021 shared
task to detect Urdu fake news. The proposed model is an ensemble of four ML algorithms,
namely: RF, MLP, GB and AB, trained with word uni-grams, character n-grams and fastText
Urdu word vectors.</p>
      <p>Rest of the paper is organized as follows: Section 2 highlights the recent work in detecting
fake news and Section 3 details the proposed methodology. While Section 4 spreads light on
experimental results, the paper concludes with future work in Section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Detecting fake news is challenging, especially for languages with limited resources. Due to lack
of or limited availability of bench marked corpus, several researchers have constructed their
own dataset and developed various techniques to detect fake news. UrduFake 20201 is a shared
task at FIRE 2020 to detect fake news in Urdu language. Several models were submitted by the
researchers to this shared task and the description of some of the good performing models are
given below:</p>
      <p>
        Fazlourrahman et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] proposed an ensemble of ML classifiers to detect Urdu fake news
using an ensemble of three ML classifiers, namely: Multinomial Naïve Bayes (MNB), Logistic
Regression (LR), and MLP with hard voting. The classifiers were trained using the vectors
generated by CountVectorizer of word n-grams in the range (1, 2) and character n-grams in the
range (1, 5). Their model obtained a macro F1-score of 0.770 and 5th rank in the shared task. A
model to detect fake news proposed by Kumar et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] uses Dense Neural Network (DNN) based
classifier to predict the Fake or Real tag for the given Urdu article. Term Frequency–Inverse
Document Frequency (TF-IDF) vectors of character n-grams in the range (1, 3) are fed as features
into the dense layers of DNN which is having four dense layers containing 2,048, 512, 128, and
2 neurons in first, second, third, and fourth layers respectively. To reduce over-fitting, dropout
with a rate of 0.4 was used between each pair of dense layers. Further, the authors used Rectified
Linear Unit (ReLU) activation 2 in each of the dense layers and softmax activation3 at the output
layer. This DNN based model obtained a macro F1-score of 0.810 and 3rd rank in the shared task.
      </p>
      <p>
        Balaji et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] proposed MLP based classifier to detect fake news in Urdu language. To
construct their model, the authors used fastText pre-trained word embeddings to map words
to vectors and character n-grams in the range (1, 4) and achieved a macro F1-score of 0.7881
and placed 6th rank in the shared task. Lina et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposed a Urdu fake news detection
model based on the combination of word and character embedding. They extracted word based
features using pre-trained Urdu Robustly Optimized Bidirectional Encoder Representations
from Transformers (ROBERTa)4 and character based features by adopting character level
Convolutional Neural Network (CNN) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Further, sentence embeddings created separately using
word and character embeddings was concatenated and fed to the softmax layer which predicts
the Fake or Real tag. They have also used label smoothing technique to reduce over-fitting.
Their model obtained 1st rank with a macro F1-score of 0.907 and outperformed all other models
submitted to the shared task.
      </p>
      <p>
        Reddy et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] proposed a Bidirectional Gated Recurrent Unit (Bi-GRU) based model to
detect fake news in Urdu articles. They tokenized the dataset using Keras tokenizer5 and used
the skip-gram model to create Urdu word embeddings to represent text as vector values [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
which were fed to max and average pooling layers. These layers were concatenated to reflect
the spatial relationship of features generated by Bi-GRU. The final feature representation was
created by concatenating these two pooling representations and the feature vectors were fed
to a sigmoid layer to classify the input text into Fake and Real classes. This model obtained a
macro F1-score of 0.807 and obtained 4th rank in the shared task.
      </p>
      <p>Researchers have explored diferent features like character n-grams, character level
embedding, word based features and pre-trained word vectors and diferent classifiers like ML and
Deep Learning (DL) to detect fake news in Urdu news articles. But none of the methods promise
100% results which gives scope for further studies.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>Several experiments were conducted using various features and various learning models to
detect Urdu fake news. Based on the results obtained on the development (Dev.) set provided
by the organizers of the shared task to detect Urdu fake news, three best performing models
were submitted to the shared task. Out of the three models, an ensemble of RF, MLP, GB and
AB classifiers with soft voting obtained good results in the final evaluation of the models by the
2https://numpy-ml.readthedocs.io/en/latest/numpy-ml.neural-nets.activations.html
3https://docs.scipy.org/doc/scipy/reference/generated/scipy.special.softmax.html
4https://huggingface.co/urduhack/urdu-roberta
5https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/text/Tokenizer
organizers and the same is described here. The structure of the proposed ensemble model is
depicted in Figure 1. Pre-processing, feature extraction and model construction steps carried
out in the proposed methodology are discussed in the following sub-sections:</p>
      <sec id="sec-3-1">
        <title>3.1. Pre-processing and Feature Extraction</title>
        <p>Pre-processing is a crucial step in cleaning up the textual content and transforming it into a
format that ML algorithms can understand. As the given dataset is already pre-processed, only
punctuation, non-relevant characters and stopwords6 are removed as they do not contribute to
the TC task.</p>
        <p>Feature Extraction is the process of extracting features from the text which are used to
train and evaluate the learning models. A combination of TF-IDF of words, char n-grams and
pre-trained Urdu word embeddings from fastText are used as features to train and evaluate the
classifiers. Feature extraction approaches are given below:
• TF-IDF vectors provide a better normalized representation of the text document by
removing the impact of overly repeated words. It expresses the relative importance of
the word in a document versus the entire corpus. Word uni-grams (17,793) and character
n-grams (18,068) in the range (2, 3) are extracted using Tfidf Vectorizer 7 and the range of
char n-grams is fixed based on performing various experiments.
• Word vectors is a type of mapping that allows words with similar meanings to have
similar vector representations. The concept behind word vectors is that the surrounding
words are used to represent target words. fastText8 pre-trained Urdu embeddings available
in gensim library9 with a window size 5 and 300-dimensional space are used to build word
6https://www.kaggle.com/rtatman/urdu-stopwords-list
7https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html
8https://fasttext.cc/docs/en/crawl-vectors.html
9https://pypi.org/project/gensim/
vectors for the words present in the dataset. Each document in the dataset is represented
as a sum of the vector values of the words present in that document.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Model Construction</title>
        <p>
          The performance of the classifier heavily relies on the dataset and the features extracted from the
dataset. Further, the performance of the classifiers is not the same for all the datasets [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. This
has motivated to propose an ensemble of classifiers where the weakness of one classifier will be
compensated by the strength of another. Ensemble learning is also a method of generating a
new classifier from multiple base classifiers, which outperforms any constituent classifier in the
ensemble. It may be noted that any number of classifiers can be ensembled with compatible
parameters. An ensemble of four ML classifiers, namely: RF, MLP, GB and AB classifiers with
soft voting is used to identify Urdu fake news.
        </p>
        <p>
          RF classifier is made up of a large number of individual decision trees which work together
as an ensemble by itself. Each individual tree in the RF classifier produces a class prediction and
the majority vote of all the classes in the forest is used to predict the final tag. This classifier
which resolves the over-fitting issues is also well-suited to deal with noisy high-dimensional
data [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Unlike other classification algorithms such as RF or MNB, MLP classifier consists of
three layers namely: input layer, hidden layers and output layer [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] and performs classification
using an underlying Neural Network (NN). MLP classifier is a simple feed forward NN classifier
compared to other NN classifiers such as recurrent NN or CNN and it is widely used in TC
tasks.
        </p>
        <p>
          In recent years boosting algorithms have grown in popularity in building ML models. Boosting
algorithm is a type of ensembled ML algorithm that combines the predictions of many weak
learners. The advantage of employing boosting algorithm is that it generates a robust classifier
by converting a group of learners with poor classification performance. In GB, regularisation
methods penalize diferent parts of the algorithm and improve overall performance by reducing
over-fitting [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. AB classifier is a meta-estimator in which weights are assigned to each instance
and higher weights are assigned to incorrectly classified instances to improve poor learner. It
is specifically used in supervised learning to reduce bias and variance. This algorithm works
based on the principle of sequential growth of learners to make weak learners strong [19].
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experiments and Results</title>
      <p>Bend-The-Truth10 is a benchmark dataset for fake news detection in Urdu language and its
evaluation. This dataset consists of news articles from Sports, Social media, Education,
Technology, Business, and Entertainment domains. While main stream news websites such as BBC
Urdu News, CNN Urdu, Express-News, Jung News, Naway Waqat and other trustworthy news
websites were referred to gather real news in Urdu language, the fake news articles were created
by a group of professional writers and journalists [20]. This corpus is provided by the organizers
of UrduFake 2021 shared task to develop and evaluate the working models to detect fake news
in Urdu news articles [21]. Details of the corpus distributed by the topics are given in Table 1
10https://github.com/MaazAmjad/Urdu-Fake-news-detection-FIRE2021/blob/main/
and the statistics of Train, Development (Dev) and Test sets distributed by Fake and Real labels
are summarized in Table 2.</p>
      <p>Several experiments were carried out to identify fake news by combining diferent features
and classifiers. Three combinations of features and ensemble model with soft voting which
gave good results on the Development set were used to obtain the predictions on the Test set
and submitted to the organizers for final evaluation and ranking. The models submitted by the
participants were evaluated on the basis on macro F1-score. Among the submitted predictions
of the three models, an ensemble model trained using fastText Urdu word embeddings, word
uni-grams and character n-grams performed well and obtained macro F1-score of 0.552, accuracy
of 0.713 and 12th rank11 in the shared task. Performances of the proposed model on Development
set and Test set are given in Table 3. Using the evaluation snippet released by the organizers,
precision, recall, accuracy, and F1-score for Real and Fake class and an average F1-score are
calculated for the proposed model. Training set consists of only 43% fake articles and this
implies that the dataset is slightly imbalanced and the same is reflecting in the results.</p>
      <p>The performances of the individual classifiers of the ensembled model is shown in Figure 2.
From the figure, it is clear that the performance of the ensemble model is better than that of the
individual models as intended to be. The results also illustrates that MLP classifier which is
based on NN exhibits higher macro F1-score than the individual classifiers. Further, the higher
11https://www.urdufake2021.cicling.org/results-and-rankings
performance of all the models for Development set is due to the distribution of fake articles in
the corpus. The comparison of the accuracy and macro F1-score of the top performing models
with the proposed model is shown in Figure 3.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future work</title>
      <p>This paper presents the description of the model proposed and submitted by our team MUCS to
UrduFake 2021 shared task to identify fake news articles in Urdu language. An ensemble of four
ML models which was trained using a combination of fastText word vectors, word uni-grams
and character n-grams has achieved a macro F1-score of 0.552 and an accuracy of 0.713. Various
combinations of features and feature selection models, as well as diferent learning approaches
in identifying fake news articles will be explored further.
Analysis in Languages where NLP Resources are not Plentiful: A Case Study for Modern
Greek, Multidisciplinary Digital Publishing Institute, 2017, p. 34.
[19] R. E. Schapire, Explaining Adaboost, in: Empirical Inference, Springer, 2013, pp. 37–52.
[20] M. Amjad, G. Sidorov, A. Zhila, H. Gómez-Adorno, I. Voronkov, A. Gelbukh, “Bend the
Truth”: Benchmark Dataset for Fake News Detection in Urdu Language and Its Evaluation,
2020, pp. 2457–2469.
[21] M. Amjad, G. Sidorov, A. Zhila, H. Gómez-Adorno, I. Voronkov, A. Gelbukh, Overview of
the Shared Task on Fake News Detection in Urdu at FIRE 2021, in: FIRE (Working Notes),
2021.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. P.</given-names>
            <surname>Rana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. K.</given-names>
            <surname>Dwivedi</surname>
          </string-name>
          ,
          <article-title>Attention-based LSTM Network for Rumor Veracity Estimation</article-title>
          of Tweet, Springer,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Ghanem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rangel</surname>
          </string-name>
          ,
          <source>An Emotional Analysis of False Information in Social Media and News Articles</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Shu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sliva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          , H. Liu, Fake news Detection on Social Media:
          <string-name>
            <given-names>A Data</given-names>
            <surname>Mining Perspective</surname>
          </string-name>
          ,
          <year>2017</year>
          , pp.
          <fpage>22</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>H.</given-names>
            <surname>Allcott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gentzkow</surname>
          </string-name>
          ,
          <source>Social Media and Fake News in the 2016 Election</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>211</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          , H. Liu,
          <article-title>Mining Social Media with Social Theories: A Survey</article-title>
          , ACM New York, NY, USA,
          <year>2014</year>
          , pp.
          <fpage>20</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Amjad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sidorov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zhila</surname>
          </string-name>
          ,
          <article-title>Data Augmentation using Machine Translation for Fake News Detection in the Urdu Language</article-title>
          ,
          <source>in: Proceedings of The 12th Language Resources and Evaluation Conference</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>2537</fpage>
          -
          <lpage>2542</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Amjad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sidorov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zhila</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gómez-Adorno</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Voronkov</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Gelbukh, UrduFake@ FIRE2021:
          <article-title>Shared Track on Fake News Identification in Urdu</article-title>
          ,
          <source>in: Forum for Information Retrieval Evaluation</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Balouchzahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. L.</given-names>
            <surname>Shashirekha</surname>
          </string-name>
          ,
          <article-title>Learning Models for Urdu Fake News Detection</article-title>
          ,
          <source>in: FIRE (Working Notes)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>474</fpage>
          -
          <lpage>479</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Saumya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <article-title>NITP-AI-NLP@ UrduFake-FIRE2020: Multi-layer Dense Neural Network for Fake News Detection in Urdu News Articles</article-title>
          ,
          <source>in: FIRE (Working Notes)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>458</fpage>
          -
          <lpage>463</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N. N. A.</given-names>
            <surname>Balaji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bharathi</surname>
          </string-name>
          , SSNCSE_NLP@
          <article-title>Fake News Detection in The Urdu Language (UrduFake) 2020</article-title>
          , in: FIRE (Working Notes),
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>N.</given-names>
            <surname>Lina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Fua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jianga</surname>
          </string-name>
          ,
          <article-title>Fake News Detection in the Urdu Language using CharCNNRoBERTa</article-title>
          ,
          <source>in: FIRE (Working Notes)</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>LeCun, Character-level Convolutional Networks for Text Classification</article-title>
          ,
          <year>2015</year>
          , pp.
          <fpage>649</fpage>
          -
          <lpage>657</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Reddy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Suman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Saha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bhattacharyya</surname>
          </string-name>
          ,
          <article-title>A GRU-based Fake News Prediction System: Working Notes for UrduFake-FIRE 2020</article-title>
          , in: FIRE (Working Notes),
          <year>2020</year>
          , pp.
          <fpage>464</fpage>
          -
          <lpage>468</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Haider</surname>
          </string-name>
          , Urdu Word Embeddings,
          <source>in: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ),
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>M. D. Anusha</surname>
            ,
            <given-names>H. L.</given-names>
          </string-name>
          <string-name>
            <surname>Shashirekha</surname>
          </string-name>
          ,
          <article-title>An Ensemble Model for Hate Speech and Ofensive Content Identification in Indo-European Languages</article-title>
          ,
          <source>in: FIRE (Working Notes)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>253</fpage>
          -
          <lpage>259</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>M. Z. Islam</surname>
            ,
            <given-names>J.</given-names>
            Liu, J.
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>A Semantics</given-names>
          </string-name>
          <string-name>
            <surname>Aware</surname>
          </string-name>
          <article-title>Random Forest for Text Classification</article-title>
          ,
          <source>in: Proceedings of the 28th ACM International Conference on Information and Knowledge Management</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>1061</fpage>
          -
          <lpage>1070</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bounabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>El Moutaouakil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Satori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Probabilistic</given-names>
            <surname>Vector</surname>
          </string-name>
          <article-title>Representation and Neural Network for Text Classification</article-title>
          , in: International Conference on Big Data,
          <source>Cloud and Applications</source>
          , Springer,
          <year>2018</year>
          , pp.
          <fpage>343</fpage>
          -
          <lpage>355</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>V.</given-names>
            <surname>Athanasiou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Maragoudakis</surname>
          </string-name>
          ,
          <string-name>
            <surname>A Novel</surname>
          </string-name>
          ,
          <article-title>Gradient Boosting Framework for Sentiment</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>