=Paper=
{{Paper
|id=Vol-3181/paper4
|storemode=property
|title=WaterMM: Water Quality in Social Multimedia Task at MediaEval 2021
|pdfUrl=https://ceur-ws.org/Vol-3181/paper4.pdf
|volume=Vol-3181
|authors=Stelios Andreadis,Ilias Gialampoukidis,Aristeidis Bozas,Anastasia Moumtzidou,Roberto Fiorin,Francesca Lombardo,Anastasios Karakostas,Daniele Norbiato,Stefanos Vrochidis,Michele Ferri,Ioannis
Kompatsiaris
|dblpUrl=https://dblp.org/rec/conf/mediaeval/AndreadisGBMFLK21
}}
==WaterMM: Water Quality in Social Multimedia Task at MediaEval 2021==
WaterMM: Water Quality in Social Multimedia Task at MediaEval 2021 Stelios Andreadis1 , Ilias Gialampoukidis1 , Aristeidis Bozas1 , Anastasia Moumtzidou1 , Roberto Fiorin2 , Francesca Lombardo2 , Anastasios Karakostas1 Daniele Norbiato2 , Stefanos Vrochidis1 , Michele Ferri2 , Ioannis Kompatsiaris1 1 Information Technologies Institute - Centre of Research and Technology Hellas, Greece 2 Eastern Alps River Basin District, Italy {andreadisst,heliasgj,arbozas,moumtzid,akarakos,stefanos,ikom}@iti.gr {roberto.fiorin,francesca.lombardo,daniele.norbiato,michele.ferri}@distrettoalpiorientali.it ABSTRACT This paper describes the “WaterMM: Water Quality in Social Mul- timedia” Task at MediaEval 2021. The overall aim of the task is to analyse the textual content of social media data that express real-world issues. The focus is specifically on water quality, safety and security, which is a fundamental part of life sustainability. Par- ticipants of this task are required to classify the social media posts of a bilingual dataset as relevant or not relevant to water-related problems, while they can optionally combine textual features with visual. The automatic prediction of posts could enhance the quality of crowd-sourced information, consequently supporting situational awareness in the water sector. 1 INTRODUCTION With the rise of social media in the everyday life of people around the world, a very broad range of topics is now discussed online. The widespread availability of public social media posts has paved the way for developing Artificial Intelligence solutions that ex- ploit crowd-sourced information. The scientific community has particularly focused on emergency, disaster and crisis management [1, 7] where the use of social media data can be really beneficial to detecting threats, monitoring situations, and enhancing response. Despite the fact that research aims attention mostly at sudden crisis, i.e. natural or human-caused disasters that occur without warning, another highly interesting domain is the creeping crisis, i.e. a threat to life-sustaining systems that evolves over time and space and is foreshadowed by precursor events [6]. Such a type of crisis could threaten, for example, water quality, safety and security. Figure 1: A tweet that is considered relevant to water quality Among the various topics discussed on Twitter, it is anticipated that users will also post tweets that refer to water quality. The interested organisations, such as water utilities or water protection acquisition of posts containing citizen complaints on the condition agencies, receive from social media. Estimating the relevance of a of drinking water (as an addition to traditional means, e.g. phone tweet faces two further challenges. First, the textual information calls) or news coverage about water-related issues could support of a tweet (i.e. Twitter message) may have a different relevance situational awareness in a water distribution network. to the examined topic in comparison to its visual information (i.e. However, within the post stream it is expected that a number Twitter image). Secondly, the text of the tweets may be in multiple of posts containing water-quality-related keywords does not refer languages, which requires independent processing and training. to actual cases of polluted water. To minimize the incoming noise, The potential contribution of relevance prediction to situational automatic prediction of a post’s relevance is required. Filtering out awareness in the water sector has motivated the organisation of the irrelevant posts will improve the quality of the information that “WaterMM: Water Quality in Social Multimedia” Task1 at MediaEval Copyright 2021 for this paper by its authors. Use permitted under Creative Commons 2021. As a continuation of the Multimedia Satellite Task (2017-2019) License Attribution 4.0 International (CC BY 4.0). [3–5] and the Flood-related Multimedia Task [2], the WaterMM MediaEval’21, December 13-15 2021, Online 1 https://multimediaeval.github.io/editions/2021/tasks/watermm/ MediaEval’21, December 13-15 2021, Online S. Andreadis et al. Table 1: Most frequently matched keywords English Italian muddy water bottiglia acqua (water bottle) water not clear acqua rubinetto (tap water) water bad taste acqua potabile (drinking water) water pollution sabbia acqua (sand water) water bad smell colore acqua (water color) by searching for English and Italian keywords inside the tweet text about water quality (e.g. issues with drinking water, signs of water pollution, illnesses related to water, etc.). The keywords have been proposed by the Eastern Alps River Basin District, who are responsible for hydrogeological defense, which involves the protection of water resources and aquatic environments, in the Eastern Alps partition of North-East Italy. For reasons of brevity, we present here only the most frequently matched keywords for Figure 2: A tweet that is considered irrelevant to water qual- both languages in Table 1, while the complete list is provided to ity participants along with the dataset in the task’s repository2 . The bilingual dataset is separated into two sets: the development-set that contains 8,000 posts and the test-set with 2,000 posts. In order to be Task focuses exclusively on social media data and shifts the domain fully compliant with the Twitter Developer Policy, only the IDs of of application from floods to water quality, safety and security. The the tweets are distributed to the participants. Thus, it was ensured overall goal of the task is to tackle the aforementioned challenges at the time of releasing the dataset that all tweets were still online. and use textual information (as well as visual information and The ground truth of the dataset reflects the relevance of a tweet metadata) from a bilingual dataset of Twitter posts in order to (relevant / not relevant) and has been manually collected with hu- identify tweets that refer to concerns about water. man annotation. The annotation has been realized again by the Eastern Alps River Basin District. Apart from their valuable exper- 2 TASK DESCRIPTION tise on the domain, they were also able to annotate tweets in their The WaterMM Task deals with the analysis of social media posts native language, i.e. Italian. It should be noted that each tweet has from Twitter with regards to issues of water quality, safety and been annotated by a single person and not by multiple annotators. security. The participants of this task are provided with a set of Initially, solely the ground truth for the development-set is re- Twitter post IDs in order to download the text, the attached image leased, since the ground truth for the test-set is used in the eval- (if it exists) and the metadata of tweets that have been selected uation stage and will be available only after the completion of with keyword-based search that involved words/phrases about the MediaEval 2021. Participants are provided with key-value pairs of quality of drinking water (e.g. strange color, smell or taste, related Tweet ID and ground truth label for the relevancy (0=not relevant/ illnesses, etc.). Nevertheless, the occurrence of such phrases in a 1=relevant). In particular, 1,374 tweets (17.18%) of the development- tweet might not necessarily reflect a case of water contamination. set are relevant and 6,626 (82.82%) are not relevant to water quality, The objective of this task is to build a binary classification system showing that it is a quite imbalanced training dataset and partici- that will be able to distinguish whether a post is relevant or not to pants should consider this issue. water-quality issues. An example of a relevant tweet is shown in Fig. 1, while an irrelevant tweet in Fig. 2. Participants can tackle the 4 EVALUATION task using text features, image features, metadata, or a combination F1-Score is selected as the official metric for evaluating the binary of the above, and they are allowed to submit up to 5 runs: classification of tweets as relevant (1) and not relevant (0) on the • Required run 1: automated using textual information only test set, since this measure is the harmonic mean between precision • Optional run 2: automated using fused textual and visual and recall, taking both metrics into account. Participants are also information encouraged to carry out a failure analysis of their results in order • Optional run 3: automated using fused textual and visual to gain insight in the mistakes that their classifiers make. information as well as other metadata • General runs 4 & 5: everything automated allowed, includ- ACKNOWLEDGMENTS ing using data from external sources This work has been supported by the EU’s Horizon 2020 research and innovation programme under grant agreements H2020-832876 3 DATASET DESCRIPTION aqua3S, H2020-883484 PathoCERT, and H2020-101004157 WQeMS. The dataset of the task is a set of social media posts collected 2 https://github.com/multimediaeval/2021-WaterMM/blob/main/dataset/keywords. from Twitter during one year, i.e. from May 2020 to April 2021, json WaterMM MediaEval’21, December 13-15 2021, Online REFERENCES [1] David E Alexander. 2014. Social media in disaster risk reduction and crisis management. Science and engineering ethics 20, 3 (2014), 717–733. [2] Stelios Andreadis, Ilias Gialampoukidis, Anastasios Karakostas, Ste- fanos Vrochidis, Ioannis Kompatsiaris, Roberto Fiorin, Daniele Nor- biato, and Michele Ferri. 2020. The flood-related multimedia task at mediaeval 2020. In Proceedings of the MediaEval 2020 Workshop, Online. 14–15. [3] Bischke Benjamin, Helber Patrick, Zhao Zhengyu, Borth Damian, and others. 2018. The Multimedia Satellite Task at MediaEval 2018: Emergency response for flooding events. (2018). [4] Benjamin Bischke, Patrick Helber, Simon Brugman, Erkan Basar, Zhengyu Zhao, Martha Larson, and Konstantin Pogorelov. The Multi- media Satellite Task at MediaEval 2019: Estimation of Flood Severity. In Proc. of the MediaEval 2019 Workshop (Oct. 27-29, 2019). Sophia Antipolis, France. [5] Benjamin Bischke, Patrick Helber, Christian Schulze, Venkat Srini- vasan, Andreas Dengel, and Damian Borth. 2017. The Multimedia Satellite Task at MediaEval 2017.. In MediaEval. [6] Arjen Boin, Magnus Ekengren, and Mark Rhinard. 2020. Hiding in plain sight: Conceptualizing the creeping crisis. Risk, Hazards & Crisis in Public Policy 11, 2 (2020), 116–138. [7] Yan Jin, Brooke Fisher Liu, and Lucinda L Austin. 2014. Examining the role of social media in effective crisis management: The effects of crisis origin, information form, and source on publics’ crisis responses. Communication research 41, 1 (2014), 74–94.