
Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques

Muhammad Suleman

Muhammad Asif

Tayyab Zamir

Ayaz Mehmood

Jebran Khan

Nasir Ahmad

Kashif Ahmad

0 Abasyn University Islamabad Campus, Pakistan; 1 DCSE, University of Engineering and Technology, Peshawar, Pakistan; 2 Department of AI, AJOU University, South Korea; 3 Department of Computer Science, Munster Technological University, Cork, Ireland

This paper presents our solutions for the MediaEval 2022 task on DisasterMM. The task is composed of two subtasks, namely (i) Relevance Classification of Twitter Posts (RCTP), and (ii) Location Extraction from Twitter Texts (LETT). The RCTP subtask aims at differentiating flood-related from non-relevant social media posts, while LETT is a Named Entity Recognition (NER) task that aims at extracting location information from the text. For RCTP, we proposed four different solutions based on BERT, RoBERTa, DistilBERT, and ALBERT, obtaining F1-scores of 0.7934, 0.7970, 0.7613, and 0.7924, respectively. For LETT, we used three models, namely BERT, RoBERTa, and DistilBERT, obtaining F1-scores of 0.6256, 0.6744, and 0.6723, respectively.

1. Introduction

Natural disasters are hazardous events that are generally caused by geophysical, hydrological, climatological, and meteorological elements. These events may have an adverse impact on human lives and infrastructure. Floods are one such event, and they frequently occur in different parts of the world. Similar to other natural disasters, floods may have a significant impact on public health and infrastructure. For instance, it has been noticed on numerous occasions that roads and communication infrastructure are badly damaged during floods [1].

A rapid and effective response to disasters may help in mitigating their adverse impact. Access to relevant and timely information is critical for an effective response. The literature demonstrates several situations where access to relevant information may not be possible for several reasons, such as the unavailability of reporters in the area and damage to communication infrastructure [2]. Recently, social media and crowdsourcing have been explored as a source of communication, information collection, and dissemination in emergency situations. To this aim, several interesting solutions have been proposed to collect, analyze, and extract meaningful insights from social media content [2]. However, social media content also comes with several limitations. For instance, social media content is generally noisy, which makes access to relevant information very challenging. Similarly, geolocation information, which is critical for the relevance of the content, is not necessarily available for all the relevant posts.

Considering the importance and applications of social media content in disaster analytics, flood detection in social media content has been included in the MediaEval benchmark competition as a shared task for several years. This paper presents a solution for the DisasterMM task presented in MediaEval 2022 [3]. The task addresses two key challenges of disaster analytics in social media. The first subtask aims at reducing social media noise by automatically filtering social media content to obtain relevant content. The second subtask aims at extracting location information from social media text, allowing automatic positioning of a potential incident due to floods. For both subtasks, we proposed several solutions, as described in Section 3.

2. Related Work

In recent years, the potential of social media has been widely explored in different application domains [4, 5]. Some of the key applications where social media content has already been proven very effective include public health [6], education [7], and public resource management [8]. Social media outlets have also been widely explored for a diversified set of applications in disasters and emergency situations [2]. For instance, Hao et al. [9] proposed a multi-modal framework utilizing social media imagery and textual information for damage assessment in disaster-hit areas. The key factors analyzed in the work include hazard/disaster type, severity, and damage type. Wu et al. [10] also utilized social media data and the associated geo-location information for generating early warnings and damage assessment analysis after disasters. Ahmad et al. [11], on the other hand, used social media imagery for the analysis of road conditions after floods. More specifically, the authors proposed an early and late-fusion framework to identify passable roads in flooded regions. Alam et al. [12] explored the potential of social media content in another relevant task of assessing flood severity. To this aim, the authors collected a large-scale benchmark dataset, namely CrisisMMD. The dataset provides a large collection of Twitter posts including textual and visual content. Hassan et al. [13] explored a slightly different aspect of natural disasters by extracting sentiments and emotions from visual content shared in social media outlets. The authors detailed how visual sentiment analysis of disaster-related social media visual content can be utilized by different stakeholders, such as news agencies, public authorities, and humanitarian organizations.

Despite being proven very effective in different tasks of disaster analytics, social media content has several limitations, such as noisy data and the unavailability of geolocation information. In this paper, we propose a solution to overcome such challenges.

3. Approach

3.1. Relevance Classification of Twitter Posts (RCTP)

As a first step, we analyzed the available multimedia content. During the analysis, we observed that most of the posts were missing visual content. Moreover, most of the images were irrelevant. Thus, we decided to use textual information only in our solution. Our framework for the RCTP subtask is composed of two steps. In the first step, we applied pre-processing techniques to clean the data by removing unnecessary information, such as usernames, URLs, emojis, and stop words.
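The cleaning step described above can be sketched as follows. This is a minimal illustration rather than the exact pipeline used in our runs: the function name `clean_tweet`, the regular expressions, the emoji ranges, and the tiny Italian stop-word list are all assumptions made for the example.

```python
import re

# Hypothetical, deliberately tiny Italian stop-word list (illustration only).
STOP_WORDS = {"a", "di", "e", "il", "in", "la", "per", "un"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")
USER_RE = re.compile(r"@\w+")
# Rough emoji coverage (flags, pictographs, misc symbols); an assumption, not exhaustive.
EMOJI_RE = re.compile("[\U0001F1E6-\U0001F1FF\U0001F300-\U0001FAFF\u2600-\u27BF]")


def clean_tweet(text):
    """Remove URLs, usernames, emojis, and stop words from a tweet."""
    text = URL_RE.sub(" ", text)
    text = USER_RE.sub(" ", text)
    text = EMOJI_RE.sub(" ", text)
    # Whitespace tokenisation; stop words are matched case-insensitively.
    tokens = [tok for tok in text.split() if tok.lower() not in STOP_WORDS]
    return " ".join(tokens)


print(clean_tweet("@utente Allagamento in centro 🌊 https://example.com"))
# prints: Allagamento centro
```

A production pipeline would likely rely on a full stop-word list and a dedicated emoji library; the sketch only shows the order of the removals.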

After pre-processing, several state-of-the-art NLP models, including BERT [14], RoBERTa [15], DistilBERT [16], and ALBERT [17], are used for the classification of the text. Since it is a binary classification task, our cost function is based on binary cross-entropy in all methods. Moreover, we used the Adam optimizer with a mini-batch size of 32 for 20 epochs.
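To make the cost function and the batching concrete, the following sketch computes binary cross-entropy over predicted probabilities and iterates over mini-batches of 32. It is not our training code: the helper names are hypothetical, the clipping constant `eps` is an assumption, and the transformer forward pass and the Adam update are left to the deep learning framework.

```python
import math


def binary_cross_entropy(y_true, y_prob, eps=1e-7):
    """Mean binary cross-entropy between 0/1 labels and predicted probabilities."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        # Clip probabilities so that log(0) is never evaluated.
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


def iter_minibatches(items, batch_size=32):
    """Yield consecutive mini-batches; the last one may be smaller."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


# Toy example: three tweets, label 1 = flood-related.
labels = [1, 0, 1]
probs = [0.9, 0.2, 0.6]
print(round(binary_cross_entropy(labels, probs), 4))
```

With 20 epochs, the loop over `iter_minibatches` would simply be repeated 20 times over the (typically reshuffled) training data.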

3.2. Location Extraction from Twitter Texts (LETT)

The LETT subtask is treated as a Named Entity Recognition (NER) task. NER involves locating and classifying named entities in text into pre-defined categories [18]. In this task, we are interested in identifying the word that starts a text sequence referring to a location and the subsequent words of that sequence. In LETT, annotations are provided at the word level. Similar to the RCTP task, we rely on multiple state-of-the-art models including BERT, RoBERTa, DistilBERT, and ALBERT. We note that, since annotations are provided at the word level, we did not apply any pre-processing before training our models.
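The word-level tagging scheme described above (a tag for the starting word of a location and a tag for its subsequent words) can be illustrated with a small decoder that turns per-word tags into location strings. The tag names `B-LOC`, `I-LOC`, and `O`, as well as the function name, are assumptions for this sketch; the labels actually used in the released annotations may differ.

```python
def extract_locations(words, tags):
    """Group words tagged B-LOC / I-LOC into location mentions."""
    spans, current = [], []
    for word, tag in zip(words, tags):
        if tag == "B-LOC":
            # A new location starts; close any open one first.
            if current:
                spans.append(" ".join(current))
            current = [word]
        elif tag == "I-LOC" and current:
            # Continuation of the currently open location.
            current.append(word)
        else:
            # "O" (or a stray I-LOC with no open span) closes the current location.
            if current:
                spans.append(" ".join(current))
            current = []
    if current:
        spans.append(" ".join(current))
    return spans


words = ["Allagamento", "a", "Reggio", "Emilia", "e", "Parma"]
tags = ["O", "O", "B-LOC", "I-LOC", "O", "B-LOC"]
print(extract_locations(words, tags))  # ['Reggio Emilia', 'Parma']
```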

3.3. Dataset

Separate datasets are released for the two subtasks. The dataset for the RCTP subtask contains data from a total of 8,000 tweets. The tweets were collected between May 25, 2020, and June 12, 2020, using flood-related keywords in the Italian language, such as "alluvione", "allagamento", and "esondazione". The dataset is provided in two different sets, namely the development set and the test set. The development set is composed of 5,337 tweets, while the test set contains a total of 1,315 tweets.

The dataset for the LETT subtask is composed of around 6,000 tweets collected between March 25, 2017, and August 1, 2018, using flood-related Italian keywords. The annotations for this subtask are available per word in the tweets.
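For the experiments on the RCTP development set, Section 4.1 reports a 70%/20%/10% split into training, testing, and validation samples. A minimal sketch of such a split is given below; the random shuffling, the fixed seed, and the function name are assumptions, since the exact splitting procedure (for example, whether it was stratified by class) is not specified here.

```python
import random


def split_dev_set(items, train_pct=70, test_pct=20, seed=0):
    """Shuffle and split into train/test/validation; the remainder is validation."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    # Integer arithmetic avoids floating-point rounding surprises.
    n_train = n * train_pct // 100
    n_test = n * test_pct // 100
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    val = shuffled[n_train + n_test:]
    return train, test, val


# Tweet indices stand in for the 5,337 development-set tweets.
train, test, val = split_dev_set(range(5337))
print(len(train), len(test), len(val))  # 3735 1067 535
```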

4. Results and Analysis

4.1. Runs Description of RCTP Subtask

Table 1 shows the experimental results of the proposed solutions on the development set. We note that during the experiments on the development set, we used 70% of the samples for training, 20% for testing, and 10% for validation. As can be seen in the table, no significant differences can be observed in the performance of the models on the cleaned and un-cleaned datasets. As far as the performance of the individual models is concerned, slightly better results are obtained with BERT compared to the other models. Table 2 provides the official results of the proposed solutions on the test set. We note that for the experiments on the test set, the models are trained on the complete development set. In total, four different runs are submitted for the task. Our first, second, and fourth runs are based on BERT, RoBERTa, and DistilBERT models trained on the un-cleaned dataset, respectively. Our third run is based on the BERT model trained on the cleaned dataset. The performance of the models trained on the un-cleaned dataset is higher than that of the models trained on the cleaned dataset. This indicates that the pre-processing resulted in the removal of some relevant features and thus has a negative impact on the results.

4.2. Runs Description of LETT Subtask

        Exact Results                    Partial Results
Precision   Recall   F1-Score    Precision   Recall   F1-Score
0.596       0.522    0.556       0.628       0.622    0.625
0.540       0.676    0.600       0.577       0.810    0.674
0.563       0.604    0.583       0.610       0.760    0.677

5. Conclusions

In this paper, we presented our solutions for the DisasterMM challenge presented in MediaEval 2022. For both subtasks, multiple state-of-the-art NLP models are employed. In the current implementation, all the models are used individually; however, we believe these models can complement each other if jointly utilized in a merit-based fusion method. In the future, we aim to employ different merit-based fusion methods to jointly utilize the capabilities of the individual models in both subtasks.

[1] D. T. Nguyen, Ofli, Imran, Mitra, Damage assessment from social media imagery data during disasters, in: Proceedings of international conference on advances in social networks analysis and mining 2017, 2017, pp. 569-576.

[2] Said, Ahmad, Riegler, Pogorelov, Hassan, Ahmad, Conci, Natural disasters detection in social media and satellite imagery: a survey, Multimedia Tools and Applications 78 (2019) 31267-31302.

[3] Andreadis, Bozas, I. Gialampoukidis, Moumtzidou, Fiorin, Lombardo, Mavropoulos, Norbiato, Vrochidis, Ferri, I. Kompatsiaris, DisasterMM: Multimedia Analysis of Disaster-Related Social Media Data Task at MediaEval 2022, in: Proceedings of the MediaEval 2022 Workshop, Bergen, Norway and Online, 2023.

[4] Ahmad, Pogorelov, Riegler, Conci, Halvorsen, Social media and satellites, Multimedia Tools and Applications 78 (2019) 2837-2875.

[5] Alsmadi, Ahmad, Nazzal, Alam, Al-Fuqaha, Khreishah, Algosaibi, Adversarial nlp for social network applications: Attacks, defenses, and research directions, IEEE Transactions on Computational Social Systems (2022).

[6] Andreadis, G. Antzoulatos, Mavropoulos, Giannakeris, G. Tzionis, Pantelidis, Ioannidis, Karakostas, I. Gialampoukidis, Vrochidis, et al., A social media analytics platform visualising the spread of covid-19 in italy via exploitation of automatically geotagged tweets, Online Social Networks and Media 23 (2021) 100134.

[7] A.-R. et al., The influence of information system success and technology acceptance model on social media factors in education, Sustainability 13 (2021) 7770.

[8] Ahmad, Ayub, Khan, Ahmad, Al-Fuqaha, Social media as an instant source of feedback on water quality, IEEE Transactions on Technology and Society (2022).

[9] Hao, Wang, Leveraging multimodal social media data for rapid disaster damage assessment, International Journal of Disaster Risk Reduction 51 (2020) 101760.

[10] Wu, Cui, Disaster early warning and damage assessment analysis using social media data and geo-location information, Decision support systems 111 (2018) 48-59.

[11] Ahmad, Pogorelov, Riegler, Ostroukhova, Halvorsen, Conci, Dahyot, Automatic detection of passable roads after floods in remote sensed and social media data, Signal Processing: Image Communication 74 (2019) 110-118.

[12] Alam, Ofli, Imran, Crisismmd: Multimodal twitter datasets from natural disasters, in: Twelfth international AAAI conference on web and social media, 2018.

[13] S. Z. Hassan, Ahmad, Hicks, Halvorsen, Al-Fuqaha, Conci, Riegler, Visual sentiment analysis from disaster images in social media, Sensors 22 (2022) 3628.

[14] Devlin, M.- Chang, Lee, Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arXiv preprint arXiv:1810.04805 (2018).

[15] Liu, Ott, Goyal, Du, Joshi, Chen, Levy, Lewis, Zettlemoyer, Stoyanov, Roberta: A robustly optimized bert pretraining approach, arXiv preprint arXiv:1907.11692 (2019).

[16] Sanh, Debut, Chaumond, T. Wolf, Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter, arXiv preprint arXiv:1910.01108 (2019).

[17] Lan, Chen, Goodman, Gimpel, Sharma, R. Soricut, Albert: A lite bert for self-supervised learning of language representations, arXiv preprint arXiv:1909.11942 (2019).

[18] Li, Sun, J. Han, Li, A survey on deep learning for named entity recognition, IEEE Transactions on Knowledge and Data Engineering 34 (2020) 50-70.