<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Time-Critical Analysis of Evolving Social Media Streams During Sudden-Onset Events</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Muhammad Imran</string-name>
          <email>mimran@hbku.edu.qa</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Qatar Computing Research Institute (HBKU)</institution>
          ,
          <addr-line>Doha</addr-line>
          ,
          <country country="QA">Qatar</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Sudden-onset emergencies such as natural or man-made disasters bring uncertainties in which time-critical information needs emerge from formal response organizations, a ected communities and other concerned population [2]. The growing adaption of Information and Communication Technologies (ICT) and Social Networks such as Twitter, Facebook has created numerous opportunities to disseminate and consume critical information during an on-going situation. However, time-critical analysis of high-velocity social media streams containing high-volume data involves solving multiple challenges including realtime parsing of brief and informal messages, handling information overload issues, and classifying [3], summarizing [10, 9], and prioritizing di erent types of information. Time-Critical Analysis of Live Streams: Live data analysis (online processing) is usually performed over a stream of data relevant to the event, often provided in real-time or with a short delay. The trade-o between retrospective and real-time data analysis is a matter of accuracy versus latency. However, in time-critical situations, the urgency with which the output of an analysis is required remains high, thus online algorithms using stream processing architectures can play a vital role in supporting real-time decision-making [3, 8]. Crowdsourced Stream Processing (CSP): A signi cant drawback of traditional stream processing systems is that they rely entirely on automated algorithms of data processing, and as such they are limited by the processing capabilities of these algorithms. Indeed, it is often the case that data stream to be processed is imprecise, highly variable and previously unseen, yet the decision based on this data needs to be made in real-time. In this case, automated processing on its own may produce undesirable results, harming the decision-making process it needs to support. CSP systems that combine high-speed processing with human intelligence can produce better and accurate results as compared to traditional stream processing techniques [5]. Handling Evolving Social Media Streams: An emerging challenge in the online classi cation of social media data streams is to keep the categories used for classi cation up-to-date [1]. Identi cation of novel concepts in an online supervised classi cation setting demands iteration over past observed data items, while due to the in nite length property of data streams, it is impractical to store past data. Approaches employing unsupervised techniques and human-experts could potentially be useful [4]. Domain Adaptation and Transfer Learning: Scarcity of labeled data at the onset of a crisis delays machine learning processes. Past events labeled data (e.g., [6]) can be employed to train more robust machine learning models using</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        domain adaptation (for same/similar tasks) or transfer learning (for novel tasks)
techniques [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>Brief Biography. Muhammad Imran is a Research Scientist at the Qatar
Computing Research Institute (QCRI) where he leads the Crisis Computing group
from both science and engineering directions. His interdisciplinary research
focuses on Human-Computer Interaction (HCI), real-time Information Retrieval
(IR), and Stream Processing areas. He studies the user-generated content on
Social Networks (e.g. Twitter) during mass convergence events to develop novel
computational models, techniques, and technologies helpful for stakeholders to
gain situational awareness and actionable information.</p>
      <p>Dr. Imran has published over 40 research papers in top-tier international
conferences and journals. Two of his papers have received the Best Paper Award.
He is a Co-Chair of the Social Media Studies track of the ISCRAM international
conference (2014-2017) and has served as Program Committee (PC) of many
major conferences and workshops (ICWSM, ACM DH, ICWE, SWDM, etc.).
Dr. Imran has worked as a Post-Doctoral researcher at QCRI (2013-2015). He
received his PhD in Computer Science from the University of Trento (2013).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          .
          <article-title>Towards a data-driven approach to identify crisis-related topics in social media streams</article-title>
          .
          <source>In Proceedings of the 24th International Conference on World Wide Web</source>
          , pages
          <volume>1205</volume>
          {
          <fpage>1210</fpage>
          . ACM,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Diaz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Vieweg</surname>
          </string-name>
          .
          <article-title>Processing social media messages in mass emergency: A survey</article-title>
          .
          <source>ACM Computing Surveys (CSUR)</source>
          ,
          <volume>47</volume>
          (
          <issue>4</issue>
          ):
          <fpage>67</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lucas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Meier</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Vieweg</surname>
          </string-name>
          . AIDR:
          <article-title>Arti cial intelligence for disaster response</article-title>
          .
          <source>In Proceedings of the 23rd International Conference on World Wide Web</source>
          , pages
          <volume>159</volume>
          {
          <fpage>162</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chawla</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          .
          <article-title>A robust framework for classifying evolving document streams in an expert-machine-crowd setting</article-title>
          .
          <source>In Proceedings of the 18th International Conference on Data Mining (ICDM)</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          , I. Lykourentzou,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Naudet</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          .
          <article-title>Engineering crowdsourced stream processing systems</article-title>
          .
          <source>arXiv preprint arXiv:1310.5463</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mitra</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Castillo</surname>
          </string-name>
          .
          <article-title>Twitter as a lifeline: Human-annotated twitter corpora for nlp of crisis-related messages</article-title>
          .
          <source>In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ), may
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mitra</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          .
          <article-title>Cross-language domain adaptation for classifying crisis-related short messages</article-title>
          .
          <source>In Proceedings of ISCRAM</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>L.</given-names>
            <surname>Neumeyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Robbins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nair</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kesari</surname>
          </string-name>
          . S4:
          <article-title>Distributed stream computing platform</article-title>
          .
          <source>In Data Mining Workshops (ICDMW)</source>
          ,
          <year>2010</year>
          IEEE International Conference on, pages
          <volume>170</volume>
          {
          <fpage>177</fpage>
          . IEEE,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>K.</given-names>
            <surname>Rudra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Banerjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ganguly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Imran</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Mitra</surname>
          </string-name>
          .
          <article-title>Summarizing situational tweets in crisis scenario</article-title>
          .
          <source>In Proceedings of the 27th ACM Conference on Hypertext and Social Media</source>
          , pages
          <volume>137</volume>
          {
          <fpage>147</fpage>
          . ACM,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>K.</given-names>
            <surname>Rudra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ganguly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Goyal</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          .
          <article-title>Extracting situational information from microblogs during disaster events: a classi cation-summarization approach</article-title>
          .
          <source>In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management</source>
          , pages
          <volume>583</volume>
          {
          <fpage>592</fpage>
          . ACM,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>