<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>WISC at MediaEval 2017: Multimedia Satellite Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nataliya Tkachenko</string-name>
          <email>nataliya.tkachenko@warwick.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arkaitz Zubiaga</string-name>
          <email>a.zubiaga@warwick.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rob Procter</string-name>
          <email>rob.procter@warwick.ac.uk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Warwick</institution>
          ,
          <addr-line>Coventry CV4 7AL</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>The Alan Turing Institute</institution>
          ,
          <addr-line>The British Library, London NW1 2DB</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Warwick Institute for the Science of Cities, University of Warwick</institution>
          ,
          <addr-line>Coventry CV4 7AL</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>13</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>This working note describes the work of the WISC team on the Multimedia Satellite Task at MediaEval 2017. We describe the runs that our team submitted to both the DIRSM and FDSI subtasks, as well as our evaluations on the development set. Our results demonstrate high accuracy in the detection of flooded areas from user-generated content in social media. In the first subtask consisting of disaster image retrieval from social media, we found that tags defined by users to describe the images are very helpful for achieving high accuracy classification. In the second subtask consisting of detecting lfood in satellite images, we found that social media can increase the precision in analyses when combined with satellite images by taking advantage of spatial and temporal overlaps between data sources.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Accurate and timely designation of flooded areas is beneficial to
help build and maintain situational awareness and to estimate the
impact of natural hazards [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. When it comes to the estimation of
impact, there is no consistency across experts as to the diferent
methods used to measure impact [
        <xref ref-type="bibr" rid="ref11 ref3">3, 11</xref>
        ]. Assessing and comparing
disaster impact has traditionally been deemed a very challenging
task as systematic data or studies are hard to obtain. Moreover,
historical data gathered from diferent sources covering diferent
regions cannot be efectively used for new regions or at diferent
points in time. Examples of valuation techniques used for impact
assessments include market based techniques, such as property
destruction, reduction in income and sales, and non-market based
techniques, such as loss of life, various environmental consequences
and psychological efects sufered by the afected individuals [
        <xref ref-type="bibr" rid="ref11 ref3 ref6">3, 6,
11</xref>
        ].
      </p>
      <p>
        Owing to the importance of furthering impact estimation
techniques, interest in the development of computational approaches
has increased [
        <xref ref-type="bibr" rid="ref12 ref2">2, 12</xref>
        ]. Current methods for flood detection and flood
impact estimation make use of contemporary, open data sources
such as social media [
        <xref ref-type="bibr" rid="ref10 ref7 ref9">7, 9, 10</xref>
        ]. The objective of this shared task and
the contribution by the Warwick Institute for the Science of Cities
(WISC) team is to assess the extent to which these techniques
approximate the results obtained through traditional methods of flood
detection, such as local sensor networks and satellite images [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
This working note presents our eforts and achievements towards
this objective.
      </p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>Recent work has proposed to combine traditional data sources
such as sensor networks (e.g., river gauges and pluviometers) with
user-generated data from social media such as Twitter and Flickr
(Tkachenko et al., under review). To the best of our knowledge,
however, the combination of satellite images and georeferenced
UGC has not been tackled in scientific research, potentially because
it may be of limited use in areas with high percentage of cloud
coverage (e.g., Northern Europe) or for being very expensive due
to the need of suficiently high spatio-temporal resolution.</p>
      <p>With the growing availability of the free or inexpensive
multiand hyperspectral image tiles, it is becoming increasingly important
to understand how such data sources perform alongside new
methods and how their combined use can help overcome each other’s
limitations when used independently. With our participation in the
Multimedia Satellite Task, we aimed to analyse how social media
can be used to identify flooded areas, as well as to identify the best
classification approaches.
3.1.1 Experiment Setings. We performed 10-fold cross-validation
experiments. We used two diferent ways of evaluating our
approaches: (1) precision, recall and F1 score over the positive (flood)
class, and (2) Average Precision at X (AP@X) at various cutofs,
X={50, 100, 200, 300, 400, 500}. Since the oficial evaluation relies
on the latter, we ended up choosing our best submissions based on
AP@X, especially looking at X={50, 100, 200}, as the other values
were rather high for our smaller test sets.</p>
      <p>
        3.1.2 Features. We use combinations of these features:
• Visual features: having performed leave-one-out tests of
combinations of the visual features provided by the
organisers, we found the best combination to be that including
CEDD, CL and GABOR.
• Metadata: we combined three of the metadata provided
with the dataset, namely description, title and tags. The
features were all represented using a bag-of-words approach,
however, we built three separate vectors, one for each
metadata, which were then concatenated into a single vector.
With all three features, we lowercased the texts, and
tokenised them by the space character. We also tokenised
multi-word user tags.
• Word embeddings: we trained word embeddings from
a large collection of titles, descriptions and user tags. We
used the entire YFCC100m dataset [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] to get overall 215
million input texts combining all three types of features,
which were fed into a Gensim word embedding with 300
dimensions [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. These word embeddings were then used to
create vectors of 300 dimensions for each of title,
description and user tags of each image. To create word vectors
for each sentence, we averaged the word vectors of the
words composing the sentence, as in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
• Machine translation: we used the Bing machine
translation API to translate user tags into English, where a user
tag was not originally in English. We used the translation
package for Python1 to achieve this.
      </p>
      <p>3.1.3 Classifiers. We tested diferent classifiers, including a
Logistic Regression classifier, Random Forests, Multinomial Naive
Bayes and Multi-layer Perceptron. We opted to build our system
using a Logistic Regression classifier based on the performance
observed on the development set. We used confidence scores provided
by the classifier to rank the images.
3.2</p>
    </sec>
    <sec id="sec-3">
      <title>FDSI Subtask</title>
      <p>In this subtask, we performed the selection of the spectral images in
the first instance, which were possible to construct from the 4-band
spectral resolution data supplied. Selected indices in question were
LWI (Land Water Index), NDVI (Normalised Diference Vegetation
Index) and NDWI (Normalised Diference Water Index). For the
subsequent runs we used machine learning methods for supervised
classification and for unsupervised clustering machine learning.
This was applied to the NDWI as the best performing spectral index
in the first step of the FDSI task.</p>
      <p>We also developed a second run, where we used KMeans to
achieve binary image segmentations on the basis of the spatial
distribution of the spectrally concentrated and transitioned pixels.
4
4.1</p>
    </sec>
    <sec id="sec-4">
      <title>EXPERIMENTS AND RESULTS</title>
    </sec>
    <sec id="sec-5">
      <title>DIRSM Subtask</title>
      <p>Based on performance assessments, we chose these 5 submissions:
• Run 1, visual information: only visual features.
• Run 2, metadata: only metadata features.
• Run 3, visual information and metadata: both features
by concatenating the two vectors.
• Run 4, word vectors: we concatenate five vectors for
visual features, metadata, word vectors of titles, word vectors
of user tags and word vectors of descriptions.
• Run 5, machine translation and word vectors: we
concatenate five vectors for visual features, metadata, word
vectors of titles, word vectors of machine translated user
tags and word vectors of descriptions.</p>
      <p>Run no.
#1
#2
#3
#4
#5</p>
      <p>Tables 1 and 2 show our results on the development and test
sets, respectively. While results are similar over the development
set, we observe remarkable diefrences in the test set. The metadata
classifier (#2) performs better than that based on visual features (#1),
however, the combination of both leads to substantial improvements
(#3). There is still a considerable improvement when we used deep
learning to represent the features using word vectors (#4), and a
further slight improvement when we used machine translation to
have all tags consistently in English (#5).
4.2</p>
    </sec>
    <sec id="sec-6">
      <title>FDSI Subtask</title>
      <p>Run no. Jaccard (Dev. Set)
#1 0.83
#2 0.87
Table 3: FDSI results on the development set.</p>
      <p>Run no. Jaccard (Test Set 1) Jaccard (Test Set 2)
#1 0.80 0.83
#2 0.81 0.77</p>
      <p>Table 4: FDSI results on the test set.</p>
      <p>Tables 3 and 4 show our results on the development and test
sets, respectively. Our results show the benefit of leveraging social
media features (#2) over not using them (#1) when training and
testing data overlap spatially and temporally (Test Set 1). This is,
however, not the case for the Test Set 2 where the test data includes
new locations, which we aim to explore further in future work.
5</p>
    </sec>
    <sec id="sec-7">
      <title>CONCLUSION</title>
      <p>We have explored the use of classifiers to identify social media
images of flooded areas. In the DIRSM task we have found that
combining both visual features and social metadata can be beneficial,
and that the use of external resources to train word embeddings
and translate the metadata into English can lead to even further
improvements. In the FDSI task, our results showed higher
accuracy detection for the flooded areas with help of the social media
classifiers. Social media can boost precision in combined analyses,
where training and test data overlap spatially and temporally.</p>
    </sec>
    <sec id="sec-8">
      <title>ACKNOWLEDGMENTS</title>
      <p>We wish to thank the Alan Turing Institute for its support.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Benjamin</given-names>
            <surname>Bischke</surname>
          </string-name>
          , Patrick Helber, Christian Schulze, Srinivasan Venkat, Andreas Dengel, and
          <string-name>
            <given-names>Damian</given-names>
            <surname>Borth</surname>
          </string-name>
          .
          <source>The Multimedia Satellite Task at MediaEval</source>
          <year>2017</year>
          :
          <article-title>Emergence Response for Flooding Events</article-title>
          .
          <source>In Proc. of the MediaEval 2017 Workshop (Sept</source>
          .
          <fpage>13</fpage>
          -
          <lpage>15</lpage>
          ,
          <year>2017</year>
          ). Dublin, Ireland.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J</given-names>
            <surname>Fohringer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D</given-names>
            <surname>Dransch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H</given-names>
            <surname>Kreibich</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          K Schröter.
          <year>2015</year>
          .
          <article-title>Social media as an information source for rapid flood inundation mapping</article-title>
          .
          <source>Natural Hazards and Earth System Sciences</source>
          <volume>15</volume>
          ,
          <issue>12</issue>
          (
          <year>2015</year>
          ),
          <fpage>2725</fpage>
          -
          <lpage>2738</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Valentina</given-names>
            <surname>Gallina</surname>
          </string-name>
          , Silvia Torresan, Andrea Critto, Anna Sperotto, Thomas Glade, and Antonio Marcomini.
          <year>2016</year>
          .
          <article-title>A review of multirisk methodologies for natural hazards: Consequences and challenges for a climate change impact assessment</article-title>
          .
          <source>Journal of environmental management 168</source>
          (
          <year>2016</year>
          ),
          <fpage>123</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Benjamin</given-names>
            <surname>Herfort</surname>
          </string-name>
          , João Porto de Albuquerque,
          <string-name>
            <surname>Svend-Jonas Schelhorn</surname>
            , and
            <given-names>Alexander</given-names>
          </string-name>
          <string-name>
            <surname>Zipf</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Exploring the geographical relations between social media and flood phenomena to improve situational awareness. In Connecting a digital Europe through location and place</article-title>
          . Springer,
          <fpage>55</fpage>
          -
          <lpage>71</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado, and
          <string-name>
            <given-names>Jef</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          .
          <volume>3111</volume>
          -
          <fpage>3119</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>OECD.</surname>
          </string-name>
          <year>2013</year>
          . OECD:
          <article-title>New Data for Understanding the Human Condition</article-title>
          .
          <source>OECD Global Science Forum Report on Data and Research Infrastructure for the Social Sciences. (</source>
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Luke</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Qiuhua</given-names>
            <surname>Liang</surname>
          </string-name>
          , Phil James, and
          <string-name>
            <given-names>Wen</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Assessing the utility of social media as a data source for flood risk management using a real-time modelling framework</article-title>
          .
          <source>Journal of Flood Risk Management</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Bart</given-names>
            <surname>Thomee</surname>
          </string-name>
          , David A Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, and
          <string-name>
            <surname>Li-Jia Li</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>YFCC100M: The new data in multimedia research</article-title>
          .
          <source>Commun. ACM 59</source>
          ,
          <issue>2</issue>
          (
          <year>2016</year>
          ),
          <fpage>64</fpage>
          -
          <lpage>73</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Nataliya</given-names>
            <surname>Tkachenko</surname>
          </string-name>
          , Stephen Jarvis, and
          <string-name>
            <given-names>Rob</given-names>
            <surname>Procter</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Predicting lfoods with Flickr tags</article-title>
          .
          <source>PloS one 12</source>
          ,
          <issue>2</issue>
          (
          <year>2017</year>
          ),
          <year>e0172870</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Nataliya</surname>
            <given-names>Tkachenko</given-names>
          </string-name>
          , Rob Procter, and
          <string-name>
            <given-names>Stephen</given-names>
            <surname>Jarvis</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Predicting the impact of urban flooding using open data</article-title>
          .
          <source>Open Science</source>
          <volume>3</volume>
          ,
          <issue>5</issue>
          (
          <year>2016</year>
          ),
          <fpage>160013</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Gisela</surname>
            <given-names>Wachinger</given-names>
          </string-name>
          , Ortwin Renn, Chloe Begg, and
          <string-name>
            <given-names>Christian</given-names>
            <surname>Kuhlicke</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>The risk perception paradoxâĂŤimplications for governance and communication of natural hazards</article-title>
          .
          <source>Risk analysis 33</source>
          ,
          <issue>6</issue>
          (
          <year>2013</year>
          ),
          <fpage>1049</fpage>
          -
          <lpage>1065</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Huan</surname>
            <given-names>Wu</given-names>
          </string-name>
          , Robert F Adler,
          <string-name>
            <surname>Yudong Tian</surname>
          </string-name>
          , George J Hufman,
          <source>Hongyi Li, and JianJian Wang</source>
          .
          <year>2014</year>
          .
          <article-title>Real-time global flood estimation using satellite-based precipitation and a coupled land surface and routing model</article-title>
          .
          <source>Water Resources Research</source>
          <volume>50</volume>
          ,
          <issue>3</issue>
          (
          <year>2014</year>
          ),
          <fpage>2693</fpage>
          -
          <lpage>2717</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Arkaitz</surname>
            <given-names>Zubiaga</given-names>
          </string-name>
          , Elena Kochkina, Maria Liakata, Rob Procter, and
          <string-name>
            <given-names>Michal</given-names>
            <surname>Lukasik</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Stance classification in rumours as a sequential task exploiting the tree structure of social media conversations</article-title>
          .
          <source>In Proceedings of the 26th International Conference on Computational Linguistics.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>