<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A HYBRID APPROACH FOR MULTIMEDIA USE VERIFICATION</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Quoc-Tin Phan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Budroni</string-name>
          <email>alessandro.budroni@studenti.unitn.it2</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cecilia Pasquini</string-name>
          <email>cecilia.pasquinig@unitn.it1</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francesco G. B. De Natale</string-name>
          <email>denatale@ing.unitn.it3</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Engineering and Computer Science - University of Trento</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>20</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>Social networks enable multimedia sharing between worldwide users, however, there is no automatic mechanism implemented aiming to verifying multimedia use. This has been known as a highly challenging problem due to the variety of media types and huge amount of information they convey. As a participating team of MediaEval 2016, we propose a hybrid approach for detecting misused multimedia on Twitter which has been known as Verifying Multimedia Use task. Speci cally, we designed a ver cation system that can answer how likely an associated multimedia le is fake based on multiple forensic features and textual features, which were acquired by performing online text search and image reverse search. Next, e ective post-based features and user-based features are utilized to validate the credibility of tweet posts. Finally, based on the assumption that a tweet sharing fake images or videos is likely to be fake, credibility scores of tweet posts and associated multimedia are fused to detect misused multimedia.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>
        Online Social Network (OSN) services o er a medium for
users to connect and share daily information. With respect
to speci c events, part of information is usually not trustable
and its dissemination causes several negative consequences
on the community. Attempts have been proposed to address
the problem of image manipulation on online news [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], or the
impact of image manipulations to users' perceptions [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        In MediaEval Verifying Multimedia Use task [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ] , given
tweet content features, user features and some e ective
forensic features, innovative methods are welcomed to verify whether
multimedia (images and videos) are correctly used on
Twitter. Due to the variety of languages used and the fact that
many reposted tweets do not contain meaningful textual
information, linguistic approaches like [
        <xref ref-type="bibr" rid="ref10 ref8">8, 10</xref>
        ] are believed not
e ective enough in this task. Moreover, almost each tweet
post is accompanied by at least an image or video, and the
image or video itself re ects the credibility of tweet. To the
best of our knowledge, only [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] took into account multimedia
forensic features in Multimedia Use Veri cation task.
      </p>
      <p>Despite the fact that associated multimedia les play a
signi cant role in assessing credibility of tweets, forensic
algorithms are very sensitive to subsequent image modi cations
and multiple lossy compression. In this work we propose a
novel approach to assess the credibility of associated images
or videos by using not only forensic features but also textual
features which are acquired by performing online text search
and image reverse search. The acquired results on
development and test sets con rm the e ectiveness of our proposed
method.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>THE PROPOSED METHOD</title>
      <p>We propose a veri cation system composing two classi
cation tiers as depicted in Figure 1. The rst classi cation
tier takes as inputs the event and the associated image or
video, and answer How likely does this image or video re ect
the event?. We consider the occurence context of associated
images or videos on the Internet as a strong evidence for
assessing their trustworthiness. Having certain con dence
about the credibility of associated images or videos, we
proceed to design the second classi cation tier to validate the
credibility of tweets based on Twitter-based features.
Finally, scores returned from two classi ers are fused to give
nal decision.
2.1</p>
    </sec>
    <sec id="sec-3">
      <title>Multimedia assessment</title>
      <p>In the rst step, we conduct online text search using
relevant keywords associated with the event and select top
returned websites from which we extract most relevant terms
based on the statistical measurement TF-IDF (Term
Frequency - Inverse Document Frequency). On another side,
the associated image is searched over Google Images and we
select only top returned websites to check the frequency of
most relevant terms from event text search. To Youtube
videos, only users' comments are extracted, while leaving
out videos from other cites unprocessed. By this step, the
system is expected to correctly recognize images or videos
not belonging to current event. In the second step, we check
occurences of positive, negative and \fake" related words in
the whole text retrieved from image or video reverse search,
assuming that a fake multimedia should receive negative
assessment from readers.</p>
      <p>
        Forensic operations can be applied on multimedia les
to verify whether or not the multimedia le is tampered,
and even which regions are most likely to be modi ed. We
adopt non-aligned double JPEG compression [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], block
artifact grid [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and Error Level Analysis [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] as useful forensic
features. Finally, we integrate textual features and forensic
features in the rst classi cation tier.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Tweet credibility assessment</title>
      <p>After having the output from the rst classi cation tier</p>
      <sec id="sec-4-1">
        <title>Multimedia</title>
      </sec>
      <sec id="sec-4-2">
        <title>Event</title>
      </sec>
      <sec id="sec-4-3">
        <title>Post</title>
      </sec>
      <sec id="sec-4-4">
        <title>User</title>
      </sec>
      <sec id="sec-4-5">
        <title>Forensic feature extraction</title>
      </sec>
      <sec id="sec-4-6">
        <title>Search by image/video</title>
      </sec>
      <sec id="sec-4-7">
        <title>Search by keywords</title>
      </sec>
      <sec id="sec-4-8">
        <title>Textual feature extraction</title>
      </sec>
      <sec id="sec-4-9">
        <title>Concatenate</title>
      </sec>
      <sec id="sec-4-10">
        <title>Classifier 1</title>
      </sec>
      <sec id="sec-4-11">
        <title>Forensic features</title>
      </sec>
      <sec id="sec-4-12">
        <title>Textual features</title>
      </sec>
      <sec id="sec-4-13">
        <title>Post-based features</title>
      </sec>
      <sec id="sec-4-14">
        <title>Concatenate</title>
      </sec>
      <sec id="sec-4-15">
        <title>User-based features</title>
      </sec>
      <sec id="sec-4-16">
        <title>Score fusion</title>
      </sec>
      <sec id="sec-4-17">
        <title>Classifier 2</title>
      </sec>
      <sec id="sec-4-18">
        <title>Final decision</title>
        <p>re ecting the trustworthiness of associated multimedia, the
second classi cation tier is designed to assess how
multimedia are used on Twitter. Tweet credibility assessment is
feasible thanks to post-based features, i.e. whether the tweet
contains the question mark or exclamation mark characters,
number of negative sentiment words the tweet contains,
together with user-based features, i.e. the number of followers
the user has, whether the user is veri ed by Twitter.
2.3</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Score fusion</title>
      <p>We approach the problem by experimenting with LR
(Logistic Regression) and RF (Random Forest) classi ers. As
depicted in Table 1, LR performs less e cient than RF on
the development set. This can be explained as RF suits
well with non-linearly separable and uneven data, i.e. some
Twitter posts do not associate with any meaningful text,
forensic features of videos are not included (all are zeros).
For that reason, we select RF as our classi ers and proceed
to nal decision by conducting score level fusion. With the
assumption that a tweet sharing fake images or videos is
likely to be fake, higher weight is assigned to the output of
the rst tier, while lower weight to the second tier. In order
to validate our method, we conduct experiments counting
only scores from classi cation tier 2 (using post-based and
user-based features provided by the task), and experiments
using 0:8 : 0:2 weighting strategy. Statistics shown in Table
1 con rm the e ectiveness of our multimedia assessment tier
and score fusion strategy.</p>
    </sec>
    <sec id="sec-6">
      <title>RESULTS AND DISCUSSION</title>
      <p>In this section, we report accumulated results on the
subtask based on our multimedia assessment approach and the
main task based on two-tier classi cation. In the sub-task,
we submit two RUNs: i) RUN 1 (required): apply only
forensic features described in Section 2.1, ii) RUN 2: apply both
textual features and forensic features described in Section
2.1. Especially, on the second RUN, we train the
classier on entire multimedia available in development set of the
main task. Acquired results from Table 2 reveal the fact
that our method gains recall if we take into account textual
features acquired from online text search and image reverse
search. This means we can e ectively reduce false negative
rate and more fake samples are detected.</p>
      <p>Next, results of the main task are reported from three
RUNs: i) RUN 1 (required): apply only the second
classi cation tier, ii) RUN 2: apply two-tier classi cation and
0:8 : 0:2 fusion strategy, answer UNKNOWN to cases where
the output of classi cation tier 1 is not available due to
online searching errors, iii) RUN 3: apply two-tier classi
cation and 0:8 : 0:2 fusion strategy, consider only the output
of classi cation 2 to cases where the output of classi cation
tier 1 is not available due to online searching errors.</p>
      <p>Results from Table 3, especially RUN 2, again con rms
the e ectiveness of our proposed method on multimedia
assessment and fusion strategy. Our method, however, is
subject to online searching errors which happen to videos NOT
hosted by YouTube.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Error level analysis tutorial</article-title>
          . http://fotoforensics.com/tutorial-ela.php. Accessed:
          <volume>28</volume>
          /08/
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Bianchi</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Piva</surname>
          </string-name>
          .
          <article-title>Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts</article-title>
          .
          <source>IEEE Transactions on Information Forensics and Security</source>
          ,
          <volume>7</volume>
          (
          <issue>3</issue>
          ):
          <volume>1003</volume>
          {
          <fpage>1017</fpage>
          ,
          <year>June 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Boididou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Andreadou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Papadopoulos</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.-T.</surname>
            Dang-Nguyen,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Boato</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Riegler</surname>
            , and
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Kompatsiaris. Verifying Multimedia</surname>
          </string-name>
          <article-title>Use at MediaEval 2015</article-title>
          . In MediaEval 2015 Workshop, Wurzen, Germany,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Boididou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Papadopoulos</surname>
          </string-name>
          , D.-T
          <string-name>
            <surname>Dang-Nguyen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Boato</surname>
            , and
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Kompatsiaris. The CERTH-UNITN Participation</surname>
          </string-name>
          @
          <article-title>Verifying Multimedia Use 2015</article-title>
          .
          <source>In Proceedings of the MediaEval 2015 Workshop</source>
          , pages
          <fpage>6</fpage>
          <issue>{8</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Boididou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Papadopoulos</surname>
          </string-name>
          , D.-T
          <string-name>
            <surname>Dang-Nguyen</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Boato</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Riegler</surname>
            ,
            <given-names>S. E.</given-names>
          </string-name>
          <string-name>
            <surname>Middleton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Petlund</surname>
            , and
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Kompatsiaris. Verifying Multimedia</surname>
          </string-name>
          <article-title>Use at MediaEval 2016</article-title>
          .
          <source>In Proc. of the MediaEval 2016 Workshop</source>
          , Hilversum, Netherlands, Oct.
          <volume>20</volume>
          -21
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Conotter</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.-T.</surname>
            Dang-Nguyen,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Boato</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Menendez</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Larson</surname>
          </string-name>
          .
          <article-title>Assessing the impact of image manipulation on users' perceptions of deception</article-title>
          .
          <source>In Proceedings of SPIE - Human Vision and Electronic Imaging XIX</source>
          , volume
          <volume>9014</volume>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yuan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <article-title>Passive Detection of Doctored JPEG Image via Block Artifact Grid Extraction</article-title>
          . Signal Process.,
          <volume>89</volume>
          (
          <issue>9</issue>
          ):
          <year>1821</year>
          {1829, Sept.
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Middleton</surname>
          </string-name>
          .
          <article-title>Extracting Attributed Veri cation and Debunking Reports from Social Media : MediaEval-2015 Trust and Credibility Analysis of Image and Video</article-title>
          .
          <source>In Proceedings of the MediaEval 2015 Workshop</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Pasquini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Brunetta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Vinci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Conotter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Boato.</surname>
          </string-name>
          <article-title>Towards the veri cation of image integrity in online news</article-title>
          .
          <source>In Proceedings of Multimedia Expo Workshops (ICMEW)</source>
          , pages
          <fpage>1</fpage>
          <lpage>{</lpage>
          6,
          <string-name>
            <surname>June</surname>
          </string-name>
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Resnick</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Q.</given-names>
            <surname>Mei</surname>
          </string-name>
          . Enquiring Minds:
          <article-title>Early Detection of Rumors in Social Media from Enquiry Posts</article-title>
          .
          <source>In Proceedings of the 24th International Conference on World Wide Web</source>
          , pages
          <volume>1395</volume>
          {
          <fpage>1405</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>