<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Muhammad Suleman</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Muhammad Asif</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tayyab Zamir</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ayaz Mehmood</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jebran Khan</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nasir Ahmad</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kashif Ahmad</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Abasyn University Islamabad Campus</institution>
          ,
          <country country="PK">Pakistan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>DCSE, University of Engineering and Technology</institution>
          ,
          <addr-line>Peshawar</addr-line>
          ,
          <country country="PK">Pakistan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of AI, AJOU University</institution>
          ,
          <country country="KR">South Korea</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Department of Computer Science, Munster Technological University</institution>
          ,
          <addr-line>Cork</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents our solutions for the MediaEval 2022 task on DisasterMM. The task is composed of two subtasks, namely (i) Relevance Classification of Twitter Posts (RCTP), and (ii) Location Extraction from Twitter Texts (LETT). The RCTP subtask aims at diferentiating flood-related and non-relevant social posts while LETT is a Named Entity Recognition (NER) task and aims at the extraction of location information from the text. For RCTP, we proposed four diferent solutions based on BERT, RoBERTa, Distil BERT, and ALBERT obtaining an F1-score of 0.7934, 0.7970, 0.7613, and 0.7924, respectively. For LETT, we used three models namely BERT, RoBERTa, and Distil BERTA obtaining an F1-score of 0.6256, 0.6744, and 0.6723, respectively.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Natural disasters represent hazardous events that are generally caused by geophysical,
hydrological, climatological, and meteorological elements. These hazardous events may have an
adverse impact on human lives and infrastructure. Floods are one such event and it frequently
occurs in diferent parts of the world. Similar to other natural disasters, floods may have a
significant impact on public health and infrastructure. For instance, it has been noticed on
numerous occasions that roads and communication infrastructure are badly damaged during
lfoods [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        A rapid and efective response to disasters may help in mitigating their adverse impact.
Access to relevant and timely information is critical for an efective response. The literature
demonstrates several situations where access to relevant information may be possible due to
several reasons, such as the unavailability of reporters in the area and damage to communication
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Recently social media and crowdsourcing have been explored as a source of communication,
information collection, and dissemination in emergency situations. To this aim, several
interesting solutions have been proposed to collect, analyze, and extract meaningful insights from
social media content [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, social media content also comes with several limitations. For
instance, social media content is generally noisy, thus, making access to relevant information
very challenging. Similarly, geolocation information, which is critical for the relevance of the
content, is not necessarily available for all the relevant posts.
      </p>
      <p>
        Considering the importance and applications of social media content in disaster analytics
lfoods detection in social media content has been also included in the MediaEval benchmark
competition as a shared task for several years. This paper presents a solution for the MMDisaster
task presented in MediaEval 2022 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The challenge aims to solve two key challenges to disaster
analytics in social media. The first subtask aims at reducing social media noise by automatically
ifltering social media content to obtain relevant content. The second subtask aims at extracting
location information from social media text, allowing automatic positioning of a potential
incident due to floods. For both subtasks, we proposed several interesting solutions as described
in Section 3.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        In recent years, the potential of social media has been widely explored in diferent application
domains [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]. Some of the key applications where social media content has been already
proven very efective include public health [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], education [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and public resource management
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Social media outlets have also been widely explored for a diversified set of applications
in disasters and emergency situations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. For instance, Hao et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] proposed a multi-modal
framework utilizing multi-social media imagery and textual information for damage assessment
in disaster-hit areas. The key factors analyzed in the work include hazard/disaster type, severity,
and damage type. Wu et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] also utilized social media data and the associated geo-location
information for generating early warnings and damage assessment analysis after disasters.
Ahmad et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], on the other hand, used social media imagery for the analysis of road
conditions after the floods. More specifically, the authors proposed an early and late-fusion
framework to identify passable roads in flooded regions. Alam et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] explored the potential
of social media content in another relevant task of assessing flood severity. To this aim, the
authors collected a large-scale benchmark dataset namely CrisisMD. The dataset provides a
large collection of Twitter posts including textual and visual content. Hassan et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] explored
a slightly diferent aspect of natural disasters by extracting sentiments and emotions from visual
content shared in social media outlets. The authors detailed how visual sentiment analysis of
disaster-related social media visual content can be utilized by diferent stakeholders, such as
news agencies, public authorities, and humanitarian organizations.
      </p>
      <p>Despite being proven very efective in diferent tasks of disaster analytics, social media content
has several limitations, such as noisy data and the unavailability of geolocation information. In
this paper, we propose a solution to overcome such challenges.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Approach</title>
      <sec id="sec-3-1">
        <title>3.1. Relevance Classification of Twitter Posts (RCTP)</title>
        <p>As a first step, we analyzed the available multimedia content. During the analysis, we observed
that most of the posts missing visual content. Moreover, most of the images were irrelevant.
Thus, we decided to use textual information only in our solution. Our framework for the
RCTP subtask is composed of two steps. In the first step, we performed some pre-processing
techniques to clean the data by removing unnecessary information, such as usernames, URLs,
emojis, and stop words.</p>
        <p>
          After pre-processing, several state-of-the-art NLP algorithms including BERT [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], Roberta
[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], Distil BERT [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], and ALBERT [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] are used for the classification of the text. Since its a
binary classification task, in all methods, our cost function is based on binary crossentropy.
Moreover, we used Adam optimizer with a mini batch size of 32 for 20 epochs.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Location Extraction from Twitter Texts (LETT)</title>
        <p>
          LETT subtask is treated as Named Entity Recognition (NER) task. NER involves locating and
classifying named entities in text into pre-defined categories [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. In this task, we are interested
in the identification of words representing the starting and subsequent words of a text sequence
referring to a location. In LETT, annotations are provided at the word level. Similar to the RCTP
task, in this task, we rely on multiple state-of-the-art algorithms including BERT, Roberta, Distil
BERT, and ALBERT. We note that in this task, since annotations are provided at the word level,
we did not use any pre-processing technique before training our models.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Dataset</title>
        <p>For both subtasks, separate datasets are released. The dataset for RCTP subtask contains data
from a total of 8,000 tweets. The tweets are collected between May 25, 2020, and June 12, 2020,
using flood-related keywords in the Italian Language, such as ”alluvione”, ”allagamento”, and
”esondazione”. The dataset is provided in two diferent sets namely the development set and the
test set. The development set is composed of 5,337 tweets while the test set contains a total of
1,315 tweets.</p>
        <p>The dataset for the LETT subtask is composed of around 6,000 tweets collected between
March 25, 2017, and August 1, 2018, using flood-related Italian keywords. The annotations for
this subtask are available per word in the tweets.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and Analysis</title>
      <sec id="sec-4-1">
        <title>4.1. Runs Description of RCTP Subtask</title>
        <p>Table 1 shows the experimental results of the proposed solutions on the development set.
We note that during the experiments on the development set, we used 70% samples of the
development set for training, 20% for testing, and 10% samples for validation. As can be seen in
the table, no significant diferences can be observed in the performance of the models on the
clean and un-clean datasets. As far as the performance of the individual models is concerned,
slightly better results are obtained with BERT compared to the other models. Table 2 provides
the oficial results of the proposed solutions on the test set. We note that for the experiments
on the test set the models are trained on the complete development set. In total, 4 diferent runs
are submitted for the task. Our first, second, and fourth runs are based on BERT, RoBERTa, and
Distil Bert models trained on the un-cleaned dataset, respectively. Our third run is based on
the BERT model trained on the cleaned dataset. The performance of the models trained on the</p>
        <p>Exact Results Partial Results
Precision Recall F1-Score Precision Recall F1-Score
0.596 0.522 0.556 0.628 0.622 0.625
0.540 0.676 0.600 0.577 0.810 0.674
0.563 0.604 0.583 0.610 0.760 0.677
un-cleaned dataset is higher than the models trained on the cleaned dataset. This indicates that
the pre-processing information resulted in the removal of some relevant features and thus has a
negative impact on the results.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Runs Description of LETT Subtask</title>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>In this paper, we presented our solutions for the DisasterMM challenge posted in MediaEval
2022. For both subtasks, multiple state-of-the-art NLP algorithms are employed. In the current
implementation, all the models are used individually, however, we believe these models can
complement each other if jointly utilized in a merit-based fusion method. In the future, we
aim to employ diferent merit-based fusion methods to jointly utilize the capabilities of the
individual models in both subtasks.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ofli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mitra</surname>
          </string-name>
          ,
          <article-title>Damage assessment from social media imagery data during disasters</article-title>
          ,
          <source>in: Proceedings of international conference on advances in social networks analysis and mining</source>
          <year>2017</year>
          ,
          <year>2017</year>
          , pp.
          <fpage>569</fpage>
          -
          <lpage>576</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>N.</given-names>
            <surname>Said</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Riegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pogorelov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hassan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Conci</surname>
          </string-name>
          ,
          <article-title>Natural disasters detection in social media and satellite imagery: a survey</article-title>
          ,
          <source>Multimedia Tools and Applications</source>
          <volume>78</volume>
          (
          <year>2019</year>
          )
          <fpage>31267</fpage>
          -
          <lpage>31302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Andreadis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bozas</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gialampoukidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moumtzidou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fiorin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Lombardo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mavropoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Norbiato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vrochidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ferri</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kompatsiaris</surname>
          </string-name>
          ,
          <source>DisasterMM: Multimedia Analysis of Disaster-Related Social Media Data Task at MediaEval</source>
          <year>2022</year>
          , in: Proceedings of the MediaEval 2022 Workshop, Bergen, Norway and Online,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pogorelov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Riegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Conci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Halvorsen</surname>
          </string-name>
          ,
          <article-title>Social media and satellites</article-title>
          ,
          <source>Multimedia Tools and Applications</source>
          <volume>78</volume>
          (
          <year>2019</year>
          )
          <fpage>2837</fpage>
          -
          <lpage>2875</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Alsmadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nazzal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Fuqaha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Khreishah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Algosaibi</surname>
          </string-name>
          ,
          <article-title>Adversarial nlp for social network applications: Attacks, defenses</article-title>
          , and research directions,
          <source>IEEE Transactions on Computational Social Systems</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Andreadis</surname>
          </string-name>
          , G. Antzoulatos,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mavropoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Giannakeris</surname>
          </string-name>
          , G. Tzionis,
          <string-name>
            <given-names>N.</given-names>
            <surname>Pantelidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ioannidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Karakostas</surname>
          </string-name>
          , I. Gialampoukidis,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vrochidis</surname>
          </string-name>
          , et al.,
          <article-title>A social media analytics platform visualising the spread of covid-19 in italy via exploitation of automatically geotagged tweets</article-title>
          ,
          <source>Online Social Networks and Media</source>
          <volume>23</volume>
          (
          <year>2021</year>
          )
          <fpage>100134</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>A.-R</surname>
          </string-name>
          . et al.,
          <article-title>The influence of information system success and technology acceptance model on social media factors in education</article-title>
          ,
          <source>Sustainability</source>
          <volume>13</volume>
          (
          <year>2021</year>
          )
          <fpage>7770</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ayub</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Fuqaha</surname>
          </string-name>
          ,
          <article-title>Social media as an instant source of feedback on water quality</article-title>
          ,
          <source>IEEE Transactions on Technology and Society</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Hao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Leveraging multimodal social media data for rapid disaster damage assessment</article-title>
          ,
          <source>International Journal of Disaster Risk Reduction</source>
          <volume>51</volume>
          (
          <year>2020</year>
          )
          <fpage>101760</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <article-title>Disaster early warning and damage assessment analysis using social media data and geo-location information, Decision support systems 111 (</article-title>
          <year>2018</year>
          )
          <fpage>48</fpage>
          -
          <lpage>59</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pogorelov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Riegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Ostroukhova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Halvorsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Conci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Dahyot</surname>
          </string-name>
          ,
          <article-title>Automatic detection of passable roads after floods in remote sensed and social media data</article-title>
          ,
          <source>Signal Processing: Image Communication</source>
          <volume>74</volume>
          (
          <year>2019</year>
          )
          <fpage>110</fpage>
          -
          <lpage>118</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ofli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          , Crisismmd:
          <article-title>Multimodal twitter datasets from natural disasters</article-title>
          ,
          <source>in: Twelfth international AAAI conference on web and social media</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S. Z.</given-names>
            <surname>Hassan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hicks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Halvorsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Fuqaha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Conci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Riegler</surname>
          </string-name>
          ,
          <article-title>Visual sentiment analysis from disaster images in social media</article-title>
          ,
          <source>Sensors</source>
          <volume>22</volume>
          (
          <year>2022</year>
          )
          <fpage>3628</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , Bert:
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          , arXiv preprint arXiv:
          <year>1810</year>
          .
          <volume>04805</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Roberta: A robustly optimized bert pretraining approach</article-title>
          , arXiv preprint arXiv:
          <year>1907</year>
          .
          <volume>11692</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>V.</given-names>
            <surname>Sanh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Debut</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chaumond</surname>
          </string-name>
          , T. Wolf,
          <article-title>Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter</article-title>
          , arXiv preprint arXiv:
          <year>1910</year>
          .
          <volume>01108</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Goodman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gimpel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sharma</surname>
          </string-name>
          , R. Soricut,
          <string-name>
            <surname>Albert:</surname>
          </string-name>
          <article-title>A lite bert for self-supervised learning of language representations</article-title>
          , arXiv preprint arXiv:
          <year>1909</year>
          .
          <volume>11942</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sun</surname>
          </string-name>
          , J. Han,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>A survey on deep learning for named entity recognition</article-title>
          ,
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>34</volume>
          (
          <year>2020</year>
          )
          <fpage>50</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>