<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>P. v. Däniken);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>ZHAW-CAI at CheckThat! 2023: Ensembling using Kernel Averaging</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pius von Däniken</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Deriu</string-name>
          <email>deri@zhaw.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark Cieliebak</string-name>
          <email>ciel@zhaw.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Zurich University of Applied Sciences, Centre for Artificial Intelligence</institution>
          ,
          <addr-line>Winterthur</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>We describe our approaches to sub-task 1A on multi-modal check-worthiness classification of the CheckThat! Lab 2023 in English. The goal was to determine whether a tweet is worth fact-checking based on its text and image content. Our submission was based on a kernel ensemble of diferent uni-modal and multi-modal classifiers. It achieved second place out of 7 teams with an F1 score of 0.708. multi-modal, claim check-worthiness, multiple kernel learning, CheckThat! The CheckThat! Lab 2023 [1] included five tasks targeting various aspects of misinformation. We describe our approach to Task 1 Check-Worthiness in Multimodal and Unimodal Contents, which contained two sub-tasks. Of the two sub-tasks, we participated specifically in sub-task 1A targeting multi-modal content. The goal was to classify a tweet consisting of both text and an image as check-worthy or not. The sub-task was ofered both in Arabic and English. We only developed methods for the English data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        The general problem of misinformation in social media has recieved a lot of interest from the
community in recent years. Apart from the CheckThat! Lab tasks there have been tasks focusing
on identifying the veracity of a claim or rumour, such as RumourEval [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ] and FEVEROUS [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        The first modern systems for check-worthiness detection include ClaimBuster [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and
ClaimRank [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Their main focus is on identifying check-worthy claims in political debates. The
various CheckThat! Lab check-worthiness tasks have targeted diferent text genres, including
social media and tweets in particular. While TF-IDF features are a staple of any text
classification task and have been included in systems such as ClaimBuster, many successful previous
participants [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ] used fine-tuned masked language models such as BERT [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and RoBERTa
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] in their solutions. We include both approaches in our solution. In terms of analysis of
multi-modal social media content, the Hateful Memes challenge [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] has sparked a lot of interest
in the community. For the challenge of multi-modality for disinformation in particular, we
refer the reader to a recent survey [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The MM-Claims dataset [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] is a recent multi-modal
claim detection dataset, on which this shared task is based. Our multi-modal sub-component
is most similar to systems such as [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] that use cross-attention between modalities. However,
we use a full transformer [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] encoder to fuse the modalities. Of course, an important recent
development involves the use of large language models such as the GPT family [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and LLaMa
[
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] that exhibit astonishing zero-shot classification capabilities. We include this approach in
our solutions as well. Finally, we use a multiple kernel learning [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] approach to combine these
disparate classifiers into a unified ensemble model.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Method</title>
      <p>
        3.1. Data
The multi-modal check-worthiness sub-task is a binary classification task where a tweet
consisting of a short text and an image has to be classified as check-worthy or not. During the
development phase of the shared task, the organizers released training data (  ), validation
data (  ) and a dev-test set to be used for evaluation during development ( − ). The test
data   was released shortly before the submission deadline and its labels were only released
after the submission deadline. For all our experiments, we combine the   and  − sets
into a single validation set   . The individual systems are trained on   and evaluated on
  . The sizes of these sets and their label distributions are shown in Table 1. We note that
each sample contained both text and image data. The training and development data came from
the MM-Claims dataset [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and for the full description of the task data, we refer the reader to
the task overview [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
3.2. Systems
We will now describe the diferent uni-modal and multi-modal systems we trained and our
method to combine them using a kernel-based ensemble.
      </p>
      <sec id="sec-3-1">
        <title>3.2.1. Text N-gram Classifier</title>
        <p>
          Our first uni-modal system is based on the tweet text only. We first pre-process the texts by
replacing URLs 1, user handles, and sequences of emoji 2 by placeholder tokens. The text was
then lower-cased and tokenized by splitting on white-space. Tokens shorter than 2 characters
were discarded. Based on this we computed TF-IDF [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] vectors for each text. This means
counting the uni-grams and bi-grams of tokens for each sample. We count only one occurrence
for each n-gram, meaning we ignore repetitions. We also ignore n-grams that appear in fewer
than 3 samples in   . Based on these counts one can compute the inverse document frequency
(IDF) for each token. The resulting feature vectors are normalized to have unit euclidean length.
We used the Tfidf Vectorizer implementation provided by scikit-learn [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. We call the resulting
feature vectors  − .
        </p>
        <p>
          We then use these feature vectors to train a linear Support Vector Machine (SVM) [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] with
regularization strength of 1. We again rely on the implementation provided by scikit-learn.
In particular we also employ their implementation of reweighing the classes based on their
frequency in the training data which was inspired by [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. We will call this model text-ngram.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2.2. MLM Classifier</title>
        <p>
          Next, we trained another text-only system. For this we fine-tuned an electra-base-discriminator
[
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] model on the training data. Electra models have the same architecture as BERT [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] but
follow a diferent pre-training setup. During masked language modelling (MLM) pre-training
there is both a generator network  and a discriminator network  . During pre-training a
certain number of input tokens are masked and  has to predict the original token. The masked
tokens are then replaced by those predicted by  and  has to determine whether a token was
the original or has been replaced.
        </p>
        <p>
          For our experiments we use the provided discriminator model checkpoint from Huggingface 3
[
          <xref ref-type="bibr" rid="ref30">30</xref>
          ]. We show the training hyper-parameters in Table 2. We will call the resulting model
electra-clf.
        </p>
        <p>In section 3.2.5 we will need access to a feature vector extracted from electra-clf. For this we
remove the final dense layer of electra-clf and use the model activations as feature vectors and</p>
        <sec id="sec-3-2-1">
          <title>1For this we use the urlextract package: https://github.com/lipoja/URLExtract.</title>
          <p>2For this we use the emoji package: https://github.com/carpedm20/emoji/.
3https://huggingface.co/google/electra-base-discriminator
scale them to unit length. We will refer to these feature vectors as  
.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.2.3. Multi-Modal Classifier</title>
        <p>
          Our multi-modal model relies on pre-trained encoder models for each modality. For text, we use
the twitter-roberta-base 4 checkpoint from Huggingface. This is a RoBERTa [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] model that has
been pre-trained on 58M tweets [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ]. The output of this text encoder has dimensions   ×  
where   is the number of tokens and   the dimension of the token embedding.
        </p>
        <p>For images, we use a Vision Transformer (ViT ) [33] that has been pre-trained on ImageNet21k
[34]. We again use a checkpoint provided by Huggingface5. The model takes images at a
224 × 224 pixel resolution as input and processes them as a sequence of 16 × 16 pixel patches.
This results in an output representation of size   ×   where   is the number of patches
and   the patch embedding dimension.</p>
        <p>
          We first project both representations into a shared space of dimension  ℎ using a dense
layer and relu activation for each representation. This results in representations of sizes
  ×  ℎ and   ×  ℎ . We then concatenate them to get a new representation of size
× ℎ where  =   +  . We then feed this representation through a transformer encoder
[
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] and a relu activation. The transformer encoder preserves the size of the representation
and we use mean pooling across the sequence length to get an embedding  − of size
 ℎ . Finally, we normalize  − to unit length and feed it through a final dense layer
for classification.
        </p>
        <p>We fine-tune this model on   but keep the weights of both the RoBERTa and the ViT
encoders frozen. We call the resulting model multi-modal-clf and show its hyper-parameters in
Table 3.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.2.4. LLM Classifier</title>
        <p>
          Recent Large Language Models (LLMs) such as the GPT family [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] have shown impressive
fewshot and even zero-shot classification capabilities. In particular, chain-of-thought prompting
[35], where the model is asked to generate a step-by-step explanation how it arrives at a certain
prediction, has shown much promise.
4https://huggingface.co/cardiffnlp/twitter-roberta-base
5https://huggingface.co/google/vit-base-patch16-224-in21k
 

 ℎ
Epochs
Batch Size
Optimizer
Learning Rate
Weight Decay
Transformer Encoder Layers
Attention Heads
Transformer Feedforward Dimension
        </p>
        <p>Value
We use the Language Model Query Language (LMQL) [36] to formulate the prompt and constrain
the answers. We show the prompt written in LMQL in Listing 1.</p>
        <sec id="sec-3-4-1">
          <title>Listing 1: LMQL Prompt</title>
        </sec>
        <sec id="sec-3-4-2">
          <title>C o n s i d e r t h e f o l l o w i n g Tweet : { c l a i m } Do you t h i n k t h i s Tweet c o n t a i n s a c l a i m t h a t i s worth argmax</title>
          <p>from
where</p>
          <p>f a c t − c h e c k i n g ?</p>
        </sec>
        <sec id="sec-3-4-3">
          <title>Answer : [ ANSWER]</title>
          <p>R e a s o n i n g : [ REASON ]
o p e n a i / t e x t − d a v i n c i − 0 0 3
STOPS_AT ( REASON , ” . ” )
and ANSWER i n [ ’ Yes ’ , ’ No ’ ]</p>
          <p>The placeholder claim is where we insert the tweet text. The placeholders ANSWER and
REASON are filled in by the model. In our case we use
OpenAI ’s text-davinci-003 6 model. The
answer is constrained to the words Yes and No which we can directly use as predictions, which
we will call gpt-answer. The reasoning is constrained to be one sentence, since it should stop
generating when it produces the first full stop. We apply a similar feature extraction procedure
as for text-ngram in Section 3.2.1 to these reasoning sentences. We forgo any special token
replacements and use n-grams up to length 3 but keep the other parameters the same. The
resulting feature vectors will be called  −
and call it gpt-ngram.
6https://platform.openai.com/docs/models/gpt-3-5
. We then train a linear SVM on  −</p>
        </sec>
      </sec>
      <sec id="sec-3-5">
        <title>3.2.5. Kernel Ensemble</title>
        <p>We have seen that all our base models have an associated feature vector:  − ,   ,
 − , and  − . For each of these we can define a linear kernel. The kernel value
for two samples  and  for a given system  is then defined as   (, ) =   ()  () , where   ()
is the feature vector of system  for sample  . Given such a kernel   , we can then train an
SVM. For  − and  − this is equivalent to their associated classifiers text-ngram
and gpt-ngram. On the other hand, for   and  − we will call the resulting SVM
classifiers electra-kernel and multi-modal-kernel respectively.</p>
        <p>We will include an additional ViT encoder based feature vector  − . It is based
on the same ViT encoder as multi-modal-clf, which also provides a pooled representation for
classification, which we will use as  − . We will call the resulting kernel-based SVM
classifier img-untrained-kernel.</p>
        <p>
          Next, we show how we combine these kernels into an ensemble. Given a set of systems  , we
can define their average kernel as:
 
(, ) = ∑ 1
∈ ||
  (, )
This is known as a fixed rule multiple kernel learning method [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. We can then use   to
train an SVM. Our main submission was based on this method and used an average kernel using
text-ngram, gpt-ngram, electra-kernel, and multi-modal-kernel as components. We will also
show results for all-kernels which additionally includes img-untrained-kernel in the average.7
All kernel-based SVMs were trained using a regularization strength of 1 and frequency based
class weights.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>In Table 4 we show our main results. Our submission achieved an F1 score of 0.708 on the test
set. We note that if we use the default classification threshold 8 electra-kernel and all-kernels
achieve that exact same score. This could indicate that our ensemble method is redundant. In
practice, F1 scores can be sensitive to the decision threshold. In Figure 1 we show the Precision
and Recall Curves for each system. They show the Precision and Recall of a system for all
potential thresholds. In the plot we include lines of constant F1 in light gray. We can see that
the default thresholds (black cross marks) tend to select sub-optimal operating points.</p>
      <p>We could therefore try to find a better classification threshold. For this we can use the
validation set   and use the threshold which maximizes the F1 score on   . The results
are shown as red cross marks in Figure 1 and in the column called Tuned Threshold in Table 4.
Since gpt-answer provides only binary outputs we can not change its threshold. The values for
electra-clf and multi-modal-clf are missing since we did not compute their output on   9.
We can see that for most systems this method selects an even worse threshold. We had already</p>
      <sec id="sec-4-1">
        <title>7The diference between submission and all-kernels was due to time constraints.</title>
        <p>8For SVM-based systems the default threshold is 0, for classifiers trained using cross-entropy to produce class
probabilities, the default threshold is 0.5.
9This was due to time constraints.
noticed this during development, where system performance varied greatly between   and
 − , and therefore we chose the default classification threshold.</p>
        <p>Finally, in Table 4 we also include the scores that could be achieved if we had access to the
ideal threshold. We computed it by selecting the threshold which maximizes the F1 score on
  . Of course, in reality one never has access to this knowledge, but we include it here to
show how much influence threshold selection can have on the system comparison.</p>
        <p>In Figure 1 we can also see that the curve for submission lies above the individual kernel
based systems over the most recall values. Meaning that for most fixed recalls it achieves higher
precision. This indicates that our ensembling method indeed yields an improved classifier. On
the other hand, we can also see that electra-clf and multi-modal-clf perform even better.</p>
        <p>In Figure 2 we show the Receiver Operating Characteristic (ROC) curves for all our systems.
We can again see that electra-clf and multi-modal-clf have the highest area under the curve
(AUC), meaning that for most fixed false positive rates they have a higher true positive rate
than other systems. We can also see that our ensembling method outperforms individual kernel
methods.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>We have laid out our solution to the CheckThat! Lab 2023 sub-task 1A on multi-modal
checkworthiness classification. Our solution includes diverse components that we combine using a
multiple kernel learning approach. Our submission achieved second place out of 7 teams with an
F1 score of 0.708. While analysing our results, we noted that the performance measure can vary
drastically based on the selected decision threshold. When considering threshold-free methods
such as ROC and PR curves, we find that our ensemble indeed seems to perform better than its
individual components. Nevertheless, we note that the directly fine-tuned models outperform
our submission under this lens. The performance gap between electra-clf and electra-kernel as
well as multi-modal-clf and multi-modal-kernel is an open question requiring further study.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work has been funded by the Hamison project supported by the EU ERA-Net CHIST-ERA;
the Swiss National Science Foundation [20CH21_209672].
tics, Online, 2020, pp. 1644–1650. URL: https://aclanthology.org/2020.findings-emnlp.148.
doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 0 . f i n d i n g s - e m n l p . 1 4 8 .
[33] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M.
Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, N. Houlsby, An image is worth
16x16 words: Transformers for image recognition at scale, CoRR abs/2010.11929 (2020).</p>
      <p>URL: https://arxiv.org/abs/2010.11929. a r X i v : 2 0 1 0 . 1 1 9 2 9 .
[34] T. Ridnik, E. Ben-Baruch, A. Noy, L. Zelnik-Manor, Imagenet-21k pretraining for the
masses, 2021. a r X i v : 2 1 0 4 . 1 0 9 7 2 .
[35] J. Wei, X. Wang, D. Schuurmans, M. Bosma, E. H. Chi, Q. Le, D. Zhou, Chain of thought
prompting elicits reasoning in large language models, CoRR abs/2201.11903 (2022). URL:
https://arxiv.org/abs/2201.11903. a r X i v : 2 2 0 1 . 1 1 9 0 3 .
[36] L. Beurer-Kellner, M. Fischer, M. Vechev, Prompting is programming: A query language
for large language models, PLDI ’23 (2022).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Caselli</surname>
          </string-name>
          , G. Da San Martino, T. Elsayed,
          <string-name>
            <given-names>A.</given-names>
            <surname>Galassi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Haouari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ruggeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Struß</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. N.</given-names>
            <surname>Nandi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. S.</given-names>
            <surname>Cheema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Azizov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          , The clef-2023 checkthat! lab: Checkworthiness, subjectivity, political bias, factuality, and authority, in: J.
          <string-name>
            <surname>Kamps</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Crestani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Maistro</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Joho</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Gurrin</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Kruschwitz</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Caputo (Eds.),
          <source>Advances in Information Retrieval</source>
          , Springer Nature Switzerland, Cham,
          <year>2023</year>
          , pp.
          <fpage>506</fpage>
          -
          <lpage>517</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          , G. Da San Martino,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Míguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Caselli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kutlu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zaghouani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mubarak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. S.</given-names>
            <surname>Kartal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Beltrán</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF-2022 CheckThat! lab task 1 on identifying relevant claims in tweets</article-title>
          , in: Working Notes of CLEF 2022-
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CLEF '
          <year>2022</year>
          , Bologna, Italy,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Shaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hasanain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hamdan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z. S.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Haouari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kutlu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. S.</given-names>
            <surname>Kartal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          , G. Da San Martino, A.
          <string-name>
            <surname>Barrón-Cedeño</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Míguez</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Beltrán</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Elsayed</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Nakov</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF-2021 CheckThat! lab task 1 on check-worthiness estimation in tweets and political debates</article-title>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Shaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Babulkov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Elsayed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hasanain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Suwaileh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Haouari</surname>
          </string-name>
          , G. Da San Martino, P. Nakov, Overview of CheckThat! 2020 English:
          <article-title>Automatic identification and verification of claims in social media</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Atanasova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          , G. Karadzhov,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mohtarami</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Da San Martino, Overview of the CLEF-2019 CheckThat! lab on automatic identification and verification of claims. Task 1: Check-worthiness</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Atanasova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Marquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Elsayed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Suwaileh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zaghouani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kyuchukov</surname>
          </string-name>
          , G. Da San Martino, P. Nakov,
          <article-title>Overview of the CLEF-2018 CheckThat! lab on automatic identification and verification of political claims</article-title>
          .
          <source>Task</source>
          <volume>1</volume>
          : Check-worthiness,
          <source>CEUR Workshop Proceedings</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Derczynski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bontcheva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Liakata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Procter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Wong Sak Hoi</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Zubiaga,</surname>
          </string-name>
          <article-title>SemEval2017 task 8: RumourEval: Determining rumour veracity and support for rumours</article-title>
          ,
          <source>in: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Vancouver, Canada,
          <year>2017</year>
          , pp.
          <fpage>69</fpage>
          -
          <lpage>76</lpage>
          . URL: https://aclanthology.org/S17-2006.
          <article-title>doi:1 0 . 1 8 6 5 3 / v 1 / S 1 7 - 2 0 0 6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gorrell</surname>
          </string-name>
          , E. Kochkina,
          <string-name>
            <given-names>M.</given-names>
            <surname>Liakata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Aker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zubiaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bontcheva</surname>
          </string-name>
          , L. Derczynski, SemEval
          <article-title>-2019 task 7: RumourEval, determining rumour veracity and support for rumours</article-title>
          ,
          <source>in: Proceedings of the 13th International Workshop on Semantic Evaluation</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Minneapolis, Minnesota, USA,
          <year>2019</year>
          , pp.
          <fpage>845</fpage>
          -
          <lpage>854</lpage>
          . URL: https://aclanthology.org/S19-2147. doi:
          <article-title>1 0 . 1 8 6 5 3 / v 1 / S 1 9 - 2 1 4 7</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Aly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Schlichtkrull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Thorne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vlachos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Christodoulopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Cocarascu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mittal</surname>
          </string-name>
          ,
          <article-title>The fact extraction and VERification over unstructured and structured information (FEVEROUS) shared task</article-title>
          ,
          <source>in: Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Dominican Republic,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          . URL: https://aclanthology.org/
          <year>2021</year>
          .fever
          <article-title>-1.1. doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 1</article-title>
          . f e v e r -
          <volume>1</volume>
          . 1 .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hassan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tremayne</surname>
          </string-name>
          ,
          <article-title>Detecting check-worthy factual claims in presidential debates</article-title>
          ,
          <source>in: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management</source>
          , CIKM '15,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2015</year>
          , p.
          <fpage>1835</fpage>
          -
          <lpage>1838</lpage>
          . URL: https://doi.org/10.1145/2806416.2806652.
          <source>doi:1 0 . 1 1</source>
          <volume>4 5 / 2 8 0 6 4 1 6 . 2 8 0 6 6 5 2 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>I.</given-names>
            <surname>Jaradat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gencheva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Màrquez</surname>
          </string-name>
          , P. Nakov,
          <article-title>ClaimRank: Detecting check-worthy claims in Arabic and English, in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, Association for Computational Linguistics</article-title>
          , New Orleans, Louisiana,
          <year>2018</year>
          , pp.
          <fpage>26</fpage>
          -
          <lpage>30</lpage>
          . URL: https://aclanthology.org/N18-5006.
          <source>doi:1 0 . 1 8</source>
          <volume>6 5 3</volume>
          / v 1 / N 1 8
          <article-title>- 5 0 0 6</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Savchev</surname>
          </string-name>
          , AI Rational at CheckThat!
          <year>2022</year>
          <article-title>: using transformer models for tweet classification</article-title>
          , in: N. Faggioli, Guglielmo andd Ferro,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.), Working Notes of CLEF 2022 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CLEF '
          <year>2022</year>
          , Bologna, Italy,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>R. M. Buliga Nicu</surname>
          </string-name>
          , Zorros at CheckThat! 2022:
          <article-title>ensemble model for identifying relevant claims in tweets</article-title>
          , in: N. Faggioli, Guglielmo andd Ferro,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          , M. Potthast (Eds.), Working Notes of CLEF 2022 -
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CLEF '
          <year>2022</year>
          , Bologna, Italy,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          ,
          <source>in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <source>Association for Computational Linguistics</source>
          , Minneapolis, Minnesota,
          <year>2019</year>
          , pp.
          <fpage>4171</fpage>
          -
          <lpage>4186</lpage>
          . URL: https://aclanthology.org/ N19-1423.
          <source>doi:1 0 . 1 8</source>
          <volume>6 5 3</volume>
          / v 1 / N 1 9
          <article-title>- 1 4 2 3</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Roberta: A robustly optimized bert pretraining approach</article-title>
          ,
          <year>2019</year>
          .
          <article-title>a r X i v : 1 9 0 7 . 1 1 6 9 2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>D.</given-names>
            <surname>Kiela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Firooz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mohan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Goswami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ringshia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Testuggine</surname>
          </string-name>
          ,
          <article-title>The hateful memes challenge: Detecting hate speech in multimodal memes</article-title>
          , in: H.
          <string-name>
            <surname>Larochelle</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Ranzato</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Hadsell</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Balcan</surname>
          </string-name>
          , H. Lin (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>33</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2020</year>
          , pp.
          <fpage>2611</fpage>
          -
          <lpage>2624</lpage>
          . URL: https://proceedings. neurips.cc/paper_files/paper/2020/file/1b84c4cee2b8b3d823b30e2d604b1878-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cresci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Chakraborty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Silvestri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dimitrov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. D. S.</given-names>
            <surname>Martino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Firooz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <article-title>A survey on multimodal disinformation detection</article-title>
          ,
          <source>in: Proceedings of the 29th International Conference on Computational Linguistics</source>
          ,
          <source>International Committee on Computational Linguistics</source>
          , Gyeongju, Republic of Korea,
          <year>2022</year>
          , pp.
          <fpage>6625</fpage>
          -
          <lpage>6643</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .coling-
          <volume>1</volume>
          .
          <fpage>576</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>G. S.</given-names>
            <surname>Cheema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hakimov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sittar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Müller-Budack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Otto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ewerth</surname>
          </string-name>
          ,
          <article-title>MM-claims: A dataset for multimodal claim detection in social media, in: Findings of the Association for Computational Linguistics: NAACL 2022, Association for Computational Linguistics</article-title>
          , Seattle, United States,
          <year>2022</year>
          , pp.
          <fpage>962</fpage>
          -
          <lpage>979</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .findings-naacl.
          <source>72. doi:1 0 . 1 8</source>
          <volume>6 5 3</volume>
          / v 1 /
          <article-title>2 0 2 2 . f i n d i n g s - n a a c l . 7 2 .</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>K. D. N.</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Patil</surname>
          </string-name>
          ,
          <article-title>Multimodal Emotion Recognition Using Cross-Modal Attention and 1D Convolutional Neural Networks</article-title>
          ,
          <source>in: Proc. Interspeech</source>
          <year>2020</year>
          ,
          <year>2020</year>
          , pp.
          <fpage>4243</fpage>
          -
          <lpage>4247</lpage>
          . doi:
          <article-title>1 0 . 2 1 4 3 7 / I n t e r s p e e c h . 2 0 2 0 - 1 1 9 0</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          , L. u. Kaiser,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          , in: I. Guyon,
          <string-name>
            <given-names>U. V.</given-names>
            <surname>Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>30</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2017</year>
          . URL: https://proceedings.neurips.cc/ paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>T. B. Brown</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Mann</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ryder</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Subbiah</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kaplan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Dhariwal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Neelakantan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Shyam</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Sastry</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Askell</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Herbert-Voss</surname>
            , G. Krueger,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Henighan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Child</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Ramesh</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          <string-name>
            <surname>Ziegler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Winter</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Hesse</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
            , E. Sigler,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Litwin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Chess</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Berner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>McCandlish</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Radford</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Amodei</surname>
          </string-name>
          ,
          <article-title>Language models are few-shot learners</article-title>
          , CoRR abs/
          <year>2005</year>
          .14165 (
          <year>2020</year>
          ). URL: https://arxiv. org/abs/
          <year>2005</year>
          .14165.
          <article-title>a r X i v : 2 0 0 5 . 1 4 1 6 5</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>H.</given-names>
            <surname>Touvron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lavril</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Izacard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Martinet</surname>
          </string-name>
          , M.
          <article-title>-</article-title>
          <string-name>
            <surname>A. Lachaux</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Lacroix</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Rozière</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Goyal</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Hambro</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Azhar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Rodriguez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Joulin</surname>
          </string-name>
          , E. Grave, G. Lample,
          <article-title>Llama: Open and eficient foundation language models</article-title>
          ,
          <year>2023</year>
          .
          <article-title>a r X i v : 2 3 0 2 . 1 3 9 7 1</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M.</given-names>
            <surname>Gönen</surname>
          </string-name>
          , E. Alpaydın,
          <article-title>Multiple kernel learning algorithms</article-title>
          ,
          <source>J. Mach. Learn. Res</source>
          .
          <volume>12</volume>
          (
          <year>2011</year>
          )
          <fpage>2211</fpage>
          -
          <lpage>2268</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barrón-Cedeño</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. S.</given-names>
            <surname>Cheema</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hakimov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hasanain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Míguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mubarak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Shahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zaghouani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Nakov</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF-2023 CheckThat! lab task 1 on check-worthiness in multimodal and multigenre content</article-title>
          , in: M.
          <string-name>
            <surname>Aliannejadi</surname>
            , G. Faggioli,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Ferro</surname>
          </string-name>
          , Vlachos, Michalis (Eds.), Working Notes of CLEF 2023-
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          ,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2023</year>
          , Thessaloniki, Greece,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Schütze</surname>
          </string-name>
          , Introduction to Information Retrieval, Cambridge University Press,
          <year>2008</year>
          . URL: https://nlp.stanford.edu/IR-book/.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>F.</given-names>
            <surname>Pedregosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Varoquaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gramfort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Michel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thirion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Grisel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Blondel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Prettenhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Weiss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Dubourg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Vanderplas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cournapeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Perrot</surname>
          </string-name>
          , E. Duchesnay,
          <article-title>Scikit-learn: Machine learning in Python</article-title>
          ,
          <source>Journal of Machine Learning Research</source>
          <volume>12</volume>
          (
          <year>2011</year>
          )
          <fpage>2825</fpage>
          -
          <lpage>2830</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>C.</given-names>
            <surname>Cortes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. N.</given-names>
            <surname>Vapnik</surname>
          </string-name>
          ,
          <article-title>Support-vector networks</article-title>
          ,
          <source>Machine Learning</source>
          <volume>20</volume>
          (
          <year>1995</year>
          )
          <fpage>273</fpage>
          -
          <lpage>297</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>G.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <article-title>Logistic regression in rare events data</article-title>
          ,
          <source>Political Analysis</source>
          <volume>9</volume>
          (
          <year>2001</year>
          )
          <fpage>137</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>K.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Luong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          ,
          <article-title>ELECTRA: pre-training text encoders as discriminators rather than generators</article-title>
          ,
          <source>in: 8th International Conference on Learning Representations, ICLR</source>
          <year>2020</year>
          ,
          <string-name>
            <given-names>Addis</given-names>
            <surname>Ababa</surname>
          </string-name>
          , Ethiopia,
          <source>April 26-30</source>
          ,
          <year>2020</year>
          , OpenReview.net,
          <year>2020</year>
          . URL: https://openreview.net/forum?id=r1xMH1BtvB.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wolf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Debut</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sanh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chaumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Delangue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cistac</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rault</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Louf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Funtowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brew</surname>
          </string-name>
          ,
          <article-title>Huggingface's transformers: State-of-the-art natural language processing</article-title>
          , CoRR abs/
          <year>1910</year>
          .03771 (
          <year>2019</year>
          ). URL: http://arxiv.org/abs/
          <year>1910</year>
          .03771.
          <article-title>a r X i v : 1 9 1 0 . 0 3 7 7 1</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>I.</given-names>
            <surname>Loshchilov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Hutter</surname>
          </string-name>
          ,
          <article-title>Decoupled weight decay regularization</article-title>
          ,
          <source>in: 7th International Conference on Learning Representations, ICLR</source>
          <year>2019</year>
          ,
          <article-title>New Orleans</article-title>
          , LA, USA, May 6-
          <issue>9</issue>
          ,
          <year>2019</year>
          , OpenReview.net,
          <year>2019</year>
          . URL: https://openreview.net/forum?id=
          <fpage>Bkg6RiCqY7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>F.</given-names>
            <surname>Barbieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Camacho-Collados</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. Espinosa</given-names>
            <surname>Anke</surname>
          </string-name>
          , L. Neves,
          <article-title>TweetEval: Unified benchmark and comparative evaluation for tweet classification</article-title>
          ,
          <source>in: Findings of the Association for Computational Linguistics: EMNLP</source>
          <year>2020</year>
          ,
          <article-title>Association for Computational Linguis-</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>