<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>MediaEval 2014 Visual Privacy Task: Geometrical Privacy Protection Tool</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pavel Korshunov</string-name>
          <email>pavel.korshunov@ep</email>
          <email>pavel.korshunov@epfl.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Touradj Ebrahimi</string-name>
          <email>touradj.ebrahimi@ep</email>
          <email>touradj.ebrahimi@epfl.ch</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>MMSPG, EPFL</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>16</fpage>
      <lpage>17</lpage>
      <abstract>
        <p>This paper describes EPFL privacy protection tool for the MediaEval 2014 Visual Privacy task. The goal of the task is to obscure faces, body silhouettes, and personal items of people in the provided surveillance clips to preserve their personal privacy. The EPFL privacy protection tool mainly relies on two privacy protection filters: a warping-based reversible filter to obscure features with low visual details (body silhouettes) by distorting them with randomized warping and morphing-based reversible filter to obscure features with high visual details (faces and personal items) by 'replacing' them with a graphical representation. The aim of this tool is to achieve an acceptable balance between privacy (how well the privacy is protected) and intelligibility (how well the surveillance task can still be performed), as well as, privacy and pleasantness (how pleasant is the protection). The results of three types of subjective evaluations, via crowdsourcing, practitioners, and stakeholders, provided by the organizers of the task demonstrates that EPFL privacy protection tool achieves a great overall balance between privacy, intelligibility, and pleasantness, while being secure and reversible in the same time.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Recent adoption of digital video surveillance systems, especially
in public spaces and communities, has significantly increased the
concern for intrusion into individual privacy. New sensing
technologies, such as ultra high definition, high dynamic range, or video
capturing with mini-drones, threaten to eradicate boundaries of
private space even more. As a possible solution, many privacy
protection tools have been proposed for preserving privacy, ranging from
simple methods such as masking blurring, pixelization, or
masking to more advanced methods satisfying the following desirable
practical properties: reversibility, robustness, and security. The
advanced methods can be divided into several categories:
encryptionbased [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], scrambling-based [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and geometrical-based [
        <xref ref-type="bibr" rid="ref5 ref6">6, 5</xref>
        ]
methods.
      </p>
      <p>
        Despite wide availability of visual privacy protection tools, with
an exception of some work [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], little is known about which tools
are suitable for practical applications. To close this gap,
MediaEval 2014 Visual Privacy task was designed to facilitate submissions
of different protection tools and to benchmark them on practical
privacy video dataset [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] via several types of subjective
evaluations. Moreover, the focus of this task is twofold: one explores the
privacy-intelligibility tradeoff, which is between how well
surveillance can be performed while privacy is being preserved, and
another explores the privacy-pleasantness tradeoff, which is about
how socially acceptable is a given privacy protection tool for a
human observer. This year, the task is also separates visual privacy
features into two types: low detailed features, such as body
silhouettes, and features with high details, such as faces or personal
items [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        In the submission to MediaEval 2014 Privacy task, EPFL aimed
to address both tradeoffs and separately obscure two types of
visual features. Therefore, the proposed privacy protection tool
consists of two privacy protection filters: a warping-based filter [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] that
obscures features with low visual details by distorting them with
randomized warping and morphing-based filter [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to obscure
features with high visual details by ‘replacing’ them with a graphical
representation. The privacy protection tool is implemented using
Python, OpenCV1, and Matlab.
      </p>
      <p>
        Organizers of the task provided video dataset [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] with
annotations of privacy sensitive regions including faces, hair, skin,
accessories, and body regions, as well as classification of these regions
into low, medium, oh high detailed features. The tool, therefore,
assumed the privacy regions known (in a practical scenario, they
can be detected by video analytics) and focused on developing the
privacy protection tool that achieves an acceptable balance between
privacy (how well the privacy is protected) and intelligibility (how
well the surveillance task can still be performed), as well as,
privacy and pleasantness (how pleasant is the protection).
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>KEY DECISIONS AND CHALLENGES</title>
      <p>The best privacy preserving filter would be a blacked out camera
with no video feed, but, in such case, there would be no surveillance
possible and intelligibility would be zero. Therefore, a usable
privacy protection filter should have a balance between privacy and
intelligibility. Similarly, an encryption or scrambling based privacy
filters could lead to high privacy but can be annoying or even scary,
resulting in very low pleasantness. Another important practical
requirement is the secure reversibility of the privacy protection tool,
so that the protection can be undone in secure way (only if one has
a secret key) to restore the original video in case police or court
would require it.</p>
      <p>
        To achieve the balance between privacy, intelligibility, and
pleasantness, as well as to provide reversible protection, the proposed
privacy protection tool adopted a two-stage approach: (i) reversible
warping filter [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is applied on body silhouettes as a low detailed
visual feature to distort general personal appearance and (ii)
reversible morphing filter [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] on faces and personal items as high
detailed visual features to remove all the identifiable details. Figure 1
illustrates how the proposed tool protects privacy of people.
      </p>
      <p>Warping filter makes the details of the visible object
unrecognizable (i.e., privacy is increased), but, by controlling the strength of
the filter, the overall general shape of the object can be preserved,
so it would still be possible to understand what is going on in the
surveillance scene (i.e., intelligibility is not decreased). Morphing
filter replaces faces and personal items with the graphical
representation, e.g., ‘smiley face’ instead of the original face, which
effectively removes all personal details, i.e., privacy is increased, with
the aim to keep both intelligibility and pleasantness/appropriateness
high.</p>
    </sec>
    <sec id="sec-3">
      <title>EVALUATION RESULTS</title>
      <p>The organizers provided the results from three subjective
evaluations: crowdsourcing-based evaluation in Stream 1, evaluation
by stakeholders and surveillance experts in Stream 2, and
evaluation by practitioners and data protection experts in Stream 3. The
corresponding results of the EPFL privacy protection tool are
summarized in the Table 1 and compared against the average of the total
8 submissions to the Privacy Task of MediaEval 2014.</p>
      <p>From the table, it can be noted that across all evaluations, the
tool demonstrates higher than average level of privacy but
underperforms in terms of intelligibility and pleasantness. In
crowdsourcing evaluation, the performance of the tool is nearer to average
compared to other two evaluations. It means that the tool would be
more suitable for the scenarios where the observers are naïve
subjects as it is in the case of crowdsourcing. The low intelligibility
score can be compensated by the fact that the tool is reversible and
original video can be securely restored, which would allow the
detailed examination of the video data if necessary. Low pleasantness
value is probably due to the choice of graphical representations for
faces and personal items (see Figure 1), which subjects did not like.
A more appropriate and use case oriented choice of such
representation may improve the pleasantness of the visual protection.
4.</p>
    </sec>
    <sec id="sec-4">
      <title>CONCLUSION</title>
      <p>EPFL privacy protection tool combines warping and morphing
privacy protection filters and achieves an acceptable balance
between privacy, intelligibility, and pleasantness, providing, in the
same time, ability to securely restore the original content if
necessary. In a practical scenario, a better fitting graphical
representations of the faces and personal items can be selected.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work was conducted in the framework of the EC funded
Network of Excellence VideoSense.
5.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Badii</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Fedorczak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Korshunov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Piatrik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Eiselein</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A</given-names>
            .
            <surname>Al-Obaidi</surname>
          </string-name>
          .
          <article-title>Overview of the MediaEval 2014 visual privacy task</article-title>
          .
          <source>In MediaEval 2014 Workshop</source>
          , Barcelona, Spain, October
          <volume>16</volume>
          -17
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dufaux</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <article-title>Scrambling for privacy protection in video surveillance systems</article-title>
          .
          <source>IEEE Trans. on Circuits and Systems for Video Technology</source>
          ,
          <volume>18</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1168</fpage>
          -
          <lpage>1174</lpage>
          , Aug.
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Korshunov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Araimo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>De Simone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Velardo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dugelay</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <article-title>Evaluation of visual privacy filters impact on video surveillance intelligibility</article-title>
          .
          <source>In International Workshop on Quality of Multimedia Experience (QoMEX)</source>
          , pages
          <fpage>150</fpage>
          -
          <lpage>151</lpage>
          ,
          <year>July 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Korshunov</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <article-title>PEViD: privacy evaluation video dataset</article-title>
          .
          <source>In SPIE Applications of Digital Image Processing XXXVI</source>
          , volume
          <volume>8856</volume>
          , San Diego, California, USA, Aug.
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Korshunov</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <article-title>Using face morphing to protect privacy</article-title>
          .
          <source>In IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS)</source>
          , pages
          <fpage>208</fpage>
          -
          <lpage>213</lpage>
          , Krakow, Poland, Aug.
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Korshunov</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ebrahimi</surname>
          </string-name>
          .
          <article-title>Using warping for privacy protection in video surveillance</article-title>
          .
          <source>In 18th International Conference on Digital Signal Processing (DSP)</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          , Santorini, Greece,
          <year>July 2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Winkler</surname>
          </string-name>
          and
          <string-name>
            <surname>B. Rinner.</surname>
          </string-name>
          <article-title>TrustCAM: Security and privacy-protection for an embedded smart camera based on trusted computing</article-title>
          .
          <source>In IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)</source>
          , pages
          <fpage>593</fpage>
          -
          <lpage>600</lpage>
          , Sept.
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>