<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Pixel Privacy: Increasing Image Appeal while Blocking Automatic Inference of Sensitive Scene Information</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Martha Larson</string-name>
          <email>m.larson@cs.ru.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhuoran Liu</string-name>
          <email>z.liu@cs.ru.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Brugman</string-name>
          <email>simon.brugman@cs.ru.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhengyu Zhao</string-name>
          <email>z.zhao@cs.ru.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Radboud University</institution>
          ,
          <country country="NL">Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>29</fpage>
      <lpage>31</lpage>
      <abstract>
        <p>We introduce a new privacy task focused on images that users share online. The task benchmarks image transformation algorithms that are capable of blocking the ability of automatic classifiers to infer sensitive information in images. At the same time, the image transformations should maintain the original value of the image to the user who is sharing it, either by leaving it not obviously changed, or by enhancing it to increase its visual appeal. This year, the focus is on a set of 60 scene categories, selected from the Places365-Standard dataset, that can be considered privacy-sensitive. The objective of the MediaEval Pixel Privacy task is to promote the innovation of protective technologies that make it safer to share social multimedia online. Participants are provided with a set of images and asked to develop protective transformations that prevent the automatic detection of privacy-sensitive information contained in these images. In 2018, we use a subset of the Places365Standard dataset associated with privacy-sensitive scene categories. The transformation algorithms may either leave the images not obviously changed (i.e., maintain the value of the image to the user who is sharing it) or else enhance their original appeal (i.e., increase the value of the image to the user who is sharing it). We especially encourage the development of transformation algorithms that enhance images, since we feel that users will be motivated to use them: it is easier to get excited about dressing up images than about taking precautionary measures against intangible risks. The Pixel Privacy task can be seen as an evolution of the MediaEval Visual Privacy task [1, 11] and DroneProtect task [2], which focused on protecting information in surveillance video from people watching the video. There are four key diferences:</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>• We are interested in social images that users share online.
• We focus on image transformations that protect
privacysensitive information against automatic inference.
• Consistent with this focus, we do not necessarily expect,
nor do we require, that sensitive information is hidden
from people who look at the images. (Our primary focus is
protection against computer vision.)
• Our goal is irreversible protection (although results may
also be relevant for reversible protection).</p>
      <p>
        The motivation for the Pixel Privacy task is the growing concern
about the information implicit in the user data that is shared online,
and in particular, in the data accumulated by large social networks.
Trust in social network platforms is necessary, but not enough.
Events of recent years have made us realize how easily social
network data can be misappropriated (e.g., [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]) or put to a use that is
acceptable from the perspective of the social network company, but
not from the perspective of users (e.g., [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]). Additionally, online
data can be mined in order to search for victims, i.e., a so-called
cybercasing attack [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. A cybercasing attack is carried out by a
malicious party who automatically searches through a large amount
of social media in order to identify victims. For example, a criminal
looking to identify houses to rob can make use of computer vision
technology to rank images according to the probability that the
user who shared them is currently traveling, and thus not at home.
It has become clear that, in addition to trusting social networks, we
need local technologies that provide users with more control over
the information that can be inferred about them on the basis of the
multimedia data that they share.
      </p>
      <p>
        Researchers interested in tackling the Pixel Privacy task should
apply their creativity to imagine which sorts of image
enhancements users will find appealing. However, they should also dig
deeply into the related work. In area of multimedia privacy,
relevant work has been carried out on social images (e.g., [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) and
security video (e.g., [
        <xref ref-type="bibr" rid="ref3 ref6">3, 6</xref>
        ]). Beyond work on privacy, work on
adversarial machine learning (e.g., [
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ]) is also relevant.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>TASK DEFINITION AND DATA</title>
      <p>The larger goal of the Pixel Privacy task is to address the challenge
of online multimedia privacy by creating user-controlled
technologies (i.e., transformations that can be locally applied before sharing).
Users must find the technologies easy and even fun to use. At the
same time, the technologies must lower the risk of users sufering
privacy violations due to the inference ability of computer vision
algorithms that have been trained on large quantities of data.
Potential violations include a range of diferent threats: being singled
out as a victim (e.g., rich, not at home) for a harmful attack (e.g.,
break-in, blackmail) and being assigned by a commercial algorithm
to a category (e.g., frequents slums, attends church) for the
purposes of targeting advertising. The specific formulation of the Pixel
Privacy task addresses a highly simplified version of the overall
problem of online multimedia privacy. The goal is to provide a
foundation upon which solutions addressing progressively more
realistic versions of the problem may be developed in the future.</p>
      <p>
        The focus of the MediaEval 2018 Pixel Privacy task is on
privacysensitive information that is related to scene categories. A scene
category can be understood to be the identity of the setting in
which a photo was taken. The task data is a subset of the
Places365Standard dataset [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The task provides a list of 60 privacy-sensitive
categories chosen from the original 365 scene categories. It defines a
validation set (MEPP18val) and a test set (MEPP18test) each
containing 3000 images (50 from each of the 60 classes). Task participants
use the validation set to develop their protection transformations.
Then, they receive the test set, and are asked to apply their
transformations to the these images, and submit the protected test set
images for evaluation.
      </p>
      <p>Although most protection transformations will be algorithms
that are applied automatically by a computer, we also encourage
participants (especially those specialized in art or photography) to
develop manual protection techniques that are applied by hand.
The purpose of manual techniques is to leverage creativity and
explore unexpected new ways in which the visual appeal of images
can be enhanced. Participants applying manual enhancement do
not need to enhance 50 images for 60 categories. Instead, we have
defined a special test set MEPP18test_manual, which is a subset of
MEPP18test containing one image per category. Also, in case the
user studies turn out to be too time-consuming, we will also focus
the user study evaluation on MEPP18test_manual.</p>
      <p>We inspect the scene categories and identify 60 scene categories
that could be considered privacy-sensitive. Each class is related to
each least one of ten privacy criteria: Places in the home, Places
far away from the home (typical vacation places), Places typical
for children, Places related to religion, Places related to people’s
health, Places related to alcohol consumption, Places in which
people do not typically wear street clothes, Places related to people’s
living conditions/income, Places related to security, Places related
to military. These privacy criteria are intended to represent aspects
of privacy in images that are interesting for future work. The list
provides a basis that can be refined or extended in the future.</p>
    </sec>
    <sec id="sec-3">
      <title>3 EVALUATION</title>
      <p>
        The submitted test set images will be evaluated with respect to
protection and appeal. Performance of transformations with respect
to protection is evaluated by measuring the degree to which the
protected test set images block inference of a computer vision
algorithm, referred to as the attack algorithm. The attack algorithm
that we use is a ResNet50 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] classifier trained on the training
set of the Places365-Standard dataset [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The Places365-Standard
dataset contains 1,803,460 training images and 365 scene categories.
The number of images per category varies from 3,068 to 5,000. The
attack algorithm is trained to detect all 365 categories.
      </p>
      <p>We provide here some notes on the diference between the
training data of the attack algorithm and the MEPP18val and MEPP18test
datasets. MEPP18val and MEPP18test are derived from the
validation set (and not the training set) of the Places365-Standard dataset,
meaning that the attack algorithm is trained on data mutually
exclusive from the Pixel Privacy validation and test sets. Note also that
the Places365-Standard dataset was created by collecting images
online, and for this reason, we do not expect any of the images in
a given category to be diferent views of the same scene. We also
assume the data does not include other forms of near duplicates.</p>
      <p>
        Performance of transformations with respect to appeal will be
carried out with an automatic aesthetics classification algorithm,
and also user study. The automatic algorithm is NIMA [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] trained
on the AVA [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] dataset. The user study will have human
evaluators inspect a selection of the images and give a rating to the
acceptability of the change and also a short explanation for their
rating.
      </p>
      <p>Top-1 acc.</p>
      <p>Top-5 acc.</p>
      <p>Original images
Protected images
Protection gain (abs.)</p>
    </sec>
    <sec id="sec-4">
      <title>4 SIMPLE BASELINE</title>
      <p>
        Here, we provide a simple baseline approach to enhancement by
using a style transfer technique called CartoonGAN [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], based on a
Generative Adversarial Network (GAN). We note that cartooning
approaches to privacy in surveillance video have been previously
used in the MediaEval Visual Privacy task [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. Table 1 shows
the top-1 and top-5 predication accuracy before and after the
enhancement. Figure 1 shows some image examples from three
different categories (i.e., army base, bedroom and temple asia) from
MEPP18val and their enhanced versions, along with the predicted
results before and after the enhancement. In these examples, the
image is correctly classified into a privacy-sensitive category before
enhancement, but once the image is enhanced, the classifier can no
longer predict the correct category. Inspection of these examples
provides an impression of the potential user appeal of this form of
privacy protection.
      </p>
    </sec>
    <sec id="sec-5">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work is part of the Open Mind research program, financed by
the Netherlands Organization for Scientific Research (NWO).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Atta</given-names>
            <surname>Badii amd Tomas Piatrik</surname>
          </string-name>
          , Mathieu Einig, and
          <string-name>
            <given-names>Chattun</given-names>
            <surname>Lallah</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Overview of MediaEval 2012 Visual Privacy Task</article-title>
          .
          <source>In Working Notes Proceedings of the MediaEval 2015 Workshop</source>
          . http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>927</volume>
          /mediaeval2012_submission_8.pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Atta</given-names>
            <surname>Badii</surname>
          </string-name>
          , Pavel Koshunov, Hamid Oudi, Touradj Ebrahimi, Tomas Piatrik, Volker Eiselein, Natacha Ruchaud, Christian Fedorczak, JeanLuc Dugelay, and Diego Fernandez Vazquez.
          <year>2015</year>
          .
          <article-title>Overview of the MediaEval 2015 Drone Protect Task</article-title>
          .
          <source>In Working Notes Proceedings of the MediaEval 2015 Workshop</source>
          . http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1436</volume>
          /Paper7.pdf
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Serdar</given-names>
            <surname>Çiftçi</surname>
          </string-name>
          , Ahmet Oğuz Akyüz, and
          <string-name>
            <given-names>Touradj</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>A Reliable and Reversible Image Privacy Protection Based on False Colors</article-title>
          .
          <source>IEEE Transactions on Multimedia 20, 1 (Jan</source>
          <year>2018</year>
          ),
          <fpage>68</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Yang</surname>
            <given-names>Chen</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu-Kun Lai</surname>
          </string-name>
          , and
          <string-name>
            <surname>Yong-Jin Liu</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>CartoonGAN: Generative Adversarial Networks for Photo Cartoonization</article-title>
          .
          <source>In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source>
          .
          <fpage>9465</fpage>
          -
          <lpage>9474</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Jaeyoung</given-names>
            <surname>Choi</surname>
          </string-name>
          , Martha Larson,
          <string-name>
            <given-names>Xinchao</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Kevin</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Gerald</given-names>
            <surname>Friedland</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Alan</given-names>
            <surname>Hanjalic</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>The Geo-Privacy Bonus of Popular Photo Enhancements</article-title>
          .
          <source>In Proceedings of the 2017 ACM International Conference on Multimedia Retrieval (ICMR'17)</source>
          .
          <fpage>84</fpage>
          -
          <lpage>92</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dufaux</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>A framework for the validation of privacy protection solutions in video surveillance</article-title>
          .
          <source>In 2010 IEEE International Conference on Multimedia and Expo</source>
          .
          <volume>66</volume>
          -
          <fpage>71</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Ádám</given-names>
            <surname>Erdélyi</surname>
          </string-name>
          , Thomas Winkler, and
          <string-name>
            <given-names>Bernhard</given-names>
            <surname>Rinner</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Serious Fun: Cartooning for Privacy Protection</article-title>
          .
          <source>Working Notes Proceedings of the MediaEval 2013 Workshop</source>
          . http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1043</volume>
          / mediaeval2013_submission_74.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Ádám</given-names>
            <surname>Erdélyi</surname>
          </string-name>
          , Thomas Winkler, and
          <string-name>
            <given-names>Bernhard</given-names>
            <surname>Rinner</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>MultiLevel Cartooning for Context-Aware Privacy Protection in Visual Sensor Networks</article-title>
          .
          <source>In Working Notes Proceedings of the MediaEval 2014 Workshop</source>
          . http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1263</volume>
          /mediaeval2014_submission_ 41.pdf
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Gerald</given-names>
            <surname>Friedland</surname>
          </string-name>
          and
          <string-name>
            <given-names>Robin</given-names>
            <surname>Sommer</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Cybercasing the Joint: On the Privacy Implications of Geo-tagging</article-title>
          .
          <source>In Proceedings of the 5th USENIX Conference on Hot Topics in Security (HotSec'10)</source>
          .
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Kaiming</surname>
            <given-names>He</given-names>
          </string-name>
          , Xiangyu Zhang, Shaoqing Ren, and
          <string-name>
            <given-names>Jian</given-names>
            <surname>Sun</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deep residual learning for image recognition</article-title>
          .
          <source>In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'16)</source>
          .
          <fpage>770</fpage>
          -
          <lpage>778</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Pavel</given-names>
            <surname>Korshunov</surname>
          </string-name>
          and
          <string-name>
            <given-names>Touradj</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>PEViD: privacy evaluation video dataset</article-title>
          .
          <source>Proc. SPIE 8856</source>
          ,
          <article-title>Applications of Digital Image Processing XXXVI</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Sam</given-names>
            <surname>Levin</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Facebook told advertisers it can identify teens feeling 'insecure' and 'worthless'</article-title>
          ,
          <source>The Guardian</source>
          ,
          <volume>1</volume>
          <fpage>May</fpage>
          . (
          <year>2017</year>
          ). https://www.theguardian.com/technology/2017/may/01/ facebook-advertising
          <article-title>-data-insecure-teens, Online; accessed 15-Sept2018</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>McNamee</surname>
          </string-name>
          , Roger and Parakilas, Sandy.
          <year>2018</year>
          .
          <article-title>The Facebook breach makes it clear: data must be regulated</article-title>
          ,
          <source>The Guardian, 19 March</source>
          . (
          <year>2018</year>
          ). https://www.theguardian.com/commentisfree/2018/mar/19/ facebook
          <article-title>-data-cambridge-analytica-privacy-</article-title>
          <string-name>
            <surname>breach</surname>
          </string-name>
          , Online; accessed 15-Sept-
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Seyed-Mohsen</surname>
            Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, and
            <given-names>Pascal</given-names>
          </string-name>
          <string-name>
            <surname>Frossard</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Universal adversarial perturbations</article-title>
          .
          <source>In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR'17)</source>
          .
          <fpage>1765</fpage>
          -
          <lpage>1773</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Seyed-Mohsen</surname>
            Moosavi-Dezfooli,
            <given-names>Alhussein</given-names>
          </string-name>
          <string-name>
            <surname>Fawzi</surname>
            , and
            <given-names>Pascal</given-names>
          </string-name>
          <string-name>
            <surname>Frossard</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Deepfool: a simple and accurate method to fool deep neural networks</article-title>
          .
          <source>In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'16)</source>
          .
          <fpage>2574</fpage>
          -
          <lpage>2582</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Naila</surname>
            <given-names>Murray</given-names>
          </string-name>
          , Luca Marchesotti, and
          <string-name>
            <given-names>Florent</given-names>
            <surname>Perronnin</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>AVA: A large-scale database for aesthetic visual analysis</article-title>
          .
          <source>In Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <source>2012 IEEE Conference on. IEEE</source>
          ,
          <fpage>2408</fpage>
          -
          <lpage>2415</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>Hossein</given-names>
            <surname>Talebi</surname>
          </string-name>
          and
          <string-name>
            <given-names>Peyman</given-names>
            <surname>Milanfar</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Nima: Neural image assessment</article-title>
          .
          <source>IEEE Transactions on Image Processing 27</source>
          ,
          <issue>8</issue>
          (
          <year>2018</year>
          ),
          <fpage>3998</fpage>
          -
          <lpage>4011</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Bolei</surname>
            <given-names>Zhou</given-names>
          </string-name>
          , Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba.
          <year>2017</year>
          .
          <article-title>Places: A 10 million Image Database for Scene Recognition</article-title>
          .
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>