=Paper=
{{Paper
|id=Vol-1436/Paper7
|storemode=property
|title=Overview of the MediaEval 2015 Drone Protect Task
|pdfUrl=https://ceur-ws.org/Vol-1436/Paper7.pdf
|volume=Vol-1436
|dblpUrl=https://dblp.org/rec/conf/mediaeval/BadiiKOEPERFDV15
}}
==Overview of the MediaEval 2015 Drone Protect Task==
Overview of the MediaEval 2015 Drone Protect Task 1 2 3 4 5 6 Atta Badii , Pavel Koshunov , Hamid Oudi , Touradj Ebrahimi , Tomas Piatrik , Volker Eiselein , 7 8 9 10 Natacha Ruchaud , Christian Fedorczak , Jean-Luc Dugelay , Diego Fernandez Vazquez 1,3. {atta.badii, h.oudi}@reading.ac.uk, 2 , 4. {pavel.korshunov, touradj.ebrahimi}@epfl.ch,.5. t.piatrik@qmul.ac.uk, 6. eiselein@nue.tu-berlin.de, 7.Natacha.Ruchaud@eurecom.fr , 8. christian.fedorczak@thalesgroup.com, 9. jean-luc.dugelay@eurecom.fr , 10. dfvazquez@isdefe.es ABSTRACT car park. The contents of the videos were grouped into three This paper presents an overview of the Drone Protect Task (DPT) categories: Normal, Suspicious and Illicit behaviour. The videos of MediaEval 2015, its objectives, related dataset, and evaluation in the Normal category involved subjects performing common approach. Participants in this task were required to implement a social behaviours in the car park such as entering or leaving a car. privacy filter or a combination of filters to protect various The Suspicious category included loitering, taking a picture of personal information regions in the video sequences provided. parked cars and other questionable behaviours. On the other The challenge was to achieve an adequate balance between the hand, Illicit behaviour included Actors stealing a car, leaving a car degree of privacy protection, intelligibility (how much useful unattended, parking the car in forbidden areas or fighting. information is retained after privacy filtering), and pleasantness The actors in the videos, carry specific items and so could (how minimal were the adverse effects of filtering on the potentially reveal their identity and may therefore need to be appearance of the video frames). The evaluation methods for this privacy-filtered appropriately. For example, the actors are task include subjective evaluation by those working in the video featured carrying backpacks, umbrellas, wearing scarves, and surveillance sector and also by naïve viewers. performing various actions, such as fighting, stealing, loitering, or simply walking. Actors may be at a distance from the camera or 1. INTRODUCTION near the camera, making their faces appear with varying size and The number of drones deployed for civil applications and quality. Despite the use of advanced stabilisation techniques for other non-military uses such as journalism, recreation, public the Camera on board the drone, the drone maneuvering and the safety, and precision agriculture is increasing. In particular, the variable conditions outdoors still led to some jitter effects in some deployment of the highly mobile and versatile drones for aerial video segments. The ground truth data set has been created surveillance in urban policing and crowd management gives rise manually by the task organisers and consists of annotations of the to new challenges for civil liberties, privacy and safety. The bounding boxes containing the regions of High ﴾H﴿, Medium ﴾M﴿, ubiquity and enhanced capability of such surveillance can pose or, Low ﴾L﴿ Personally Identifiable Information (PII) including significant threats to citizens’ privacy and therefore new vehicles, persons’ faces and accessories, and, unusual events such mitigation technologies are needed to ensure an appropriate level as fighting, stealing and bag dropping. of privacy protection. The Drone Protect Task (VPT) of The data included such annotations that distinguished the MediaEval 2015 has thus provided an opportunity for relative privacy sensitivity of PIIs; namely for License Plates(H), experimentation to explore how video-analytic techniques may Skin (M), Face (H), Hair (L), Accessories (M), and for Person’s arrive at enhanced solutions to some visual privacy problems. body (L). The dataset has been provided in accordance with the This task focuses on privacy protection techniques that are European Data Protection and ethical compliance guidelines responsive to the context-specific needs of persons for privacy. including informed consent and access control as required. Figure The DroneProtect performance evaluation involves three distinct 1, shows an example of a video illustrating Illicit behaviour. user studies aimed at developing a deeper understanding of users’ perceptions of the effects and side-effects of privacy filtering for a Figure 1 Sample of the video in the dataset [3]. user-centered evaluation of the privacy solutions offered. 2. DPT 2015 DATASET The DPT dataset has provided 38 video clips of about 20 seconds each, in full HD resolution with sufficient number of examples of video images depicting different typical scenarios in a car park [3]. The bounding boxes for persons and cars are annotated. However, the detection of the face-head area as a region of interest and detecting a “person-entering-a-car” event are regarded as task to be solved as would be the cased in a real- life Car Park Security use-case scenario; this will provide an appropriate level of challenge in the present task, especially as the region-specific privacy filtering element has been previously benchmarked within the MediaEval 2014 Visual Privacy Task [1]. The video data included various scenarios featuring one or 3. AIM AND OBJECTIVES The objective of the DroneProtect: Mini-drone Video Privacy several human subjects walking and interacting with vehicles in a Task is to benchmark privacy filtering solutions for drone videos Copyright is held by the author/owner(s). MediaEval 2015 Workshop, Sept. 14-15, 2015, Wurzen, Germany. related to public safety. The performance of solutions is judged privacy filtering; a separate evaluation process has been planned by their ability to retain sufficient (frame-level) semantic for the results of this additional element of the task. information about activities and situations, while at the same time providing the required level of privacy for people appearing in the 4. SUBMISSIONS EVALUATIONS videos. Task participants are to implement a combination of The ground truth consisted of video frames with annotations privacy filters to protect various personal information regions in a of the bounding boxes containing description of entities in the set of drone videos as had been provided. Privacy filtering is to video images of persons and cars plus examples of alternative be optimised for the fulfillment of both the following criteria: filtering approaches and questionnaires used by the human Obscure personally identifying information effectively viewers who had evaluated them, and, the final rankings achieved. Preserving the information needed by a human viewer in Privacy Solutions Evaluation: Participants have submitted order to interpret the video at the level required to maintain privacy protected video clips using the testing subset. The evaluation of the submitted privacy solutions is based on the security in the area monitored by the drone. human-perceived level of privacy filtering i.e., the level by which Solutions attempted to preserve the overall visual the High/Low regions of personally identifiable information, as acceptability-attractiveness of the resulting privacy filtered video- previously annotated in the dataset, have been responsively frames, since these factors had potential impact on interpretability obscured by appropriate filtering techniques. Thus the evaluation and on the quality of the work experience for humans interpreting is essentially based on the overall human perception and the videos. As a secondary goal, the task aimed to investigate interpretation of the resulting privacy filtered image in terms of mixtures of reversible and irreversible privacy filters. the level of retained information i.e., intelligibility, and, For this task, the use-case scenario was Car Park Security appropriateness (acceptability-attractiveness) of the privacy and so the typical objectives of such a scenario would determine filtered image (also defined in the MediaEval 2012, MediaEval how much of which type of information must be retained in the 2013, and, MediaEval 2014 Privacy Task descriptions [1,2]). video to support the goal of maintaining security. The video input Participants will each receive the results of the evaluations of for the privacy filtering process consisted of drone video clips their submission as well as the overall results and rankings for all showing examples of: Persons walking, running, or fighting in the the submitted entries. The rankings will be based on the car park area, Persons attacking a driver, loitering, entering or application of different weightings to the results for each of the leaving a particular car in the car park, wrongly parked cars, and above three criteria (privacy protection level, intelligibility, collision with cyclists. appropriateness) as calculated from the evaluation results arising The output of the privacy filtering process was to preserve from evaluations by the surveillance security practitioners and sufficient semantics for recognition of specific security-relevant naïve evaluators. events unfolding in the car park scenes whilst reversibly masking The weightings will be agreed by the participants so as to the following aspects: reflect the relative importance of each of the above three Person’s face and silhouette evaluation criteria as perceived by each of the human evaluator Person’s gender and race (note this does not entail groups. 6 participants from the security practitioner’s category gender/race recognition but rendering un-classifiable) and 11 from the naïve category will be asked to complete a survey Personal accessories with 13 questions after viewing each of 3 randomly selected and Vehicle make and model distinct videos of results of privacy filtering as submitted by each Vehicle license plate (if zoomed-in on) team. The 13 question will evaluate all three criteria. The score given to each team will consist of the average score for each of The face and the car body have high personal identification the criteria mentioned above for each evaluation category. potential, whereas the human body outline, particularly one that has been rendered gender-unclassifiable, has a low personal 5. ACKNOWLEDGEMENTS identification potential. Note that gait analysis has been excluded The Drone Protect Task at MediaEval 2015 was supported by in the formulation of the task. Accordingly all image regions as the European Commission under contract FP7-261743 listed above needed to be masked respectively with corresponding VideoSense. filter strength, High (H), Low (L), Medium (M) so as to maintain the appropriate privacy protection, intelligibility and 6. REFERENCES attractiveness-acceptability of the resulting privacy filtered video [1] Badii, A., Al-Obaidi, A., and Einig, M., MediaEval 2013 frame. Thus this privacy filtering task required the detection of the Visual Privacy Task: Holistic Evaluation Framework for Privacy human face-and-head zone within each bounding box that has by Co-Design Impact Assessment. MediaEval 2013 Workshop. already delineated a person. CEUR-WS.org, 1043, Barcelona, Spain, October 2013. As a secondary goal the task invited solutions that deployed an appropriately managed mix of reversible and irreversible [2] A. Badii, T. Ebrahimi, C. Fedorczak, P. Korshunov, T. Piatrik, privacy filters. Such filters are typically optimised responsively to V. Eiselein, and A. Al-Obaidi. Overview of the MediaEval 2014 the context of the events and persons’ behaviours occurring in the visual privacy task, In MediaEval 2014 Workshop, Barcelona, video. Such filtering must also allow the car park staff to reverse Spain, October 2014. the filtering to investigate any activities as deemed possibly [3] Bonetto, M., Korshunov, P., Ramponi, G., and Ebrahimi, T., relevant to the investigation of any security incidents within a Privacy in Mini-drone Based Video Surveillance, Workshop on specific time frame as set by the regulations; e.g., within 7-30 De-identification for privacy protection in multimedia, May 2015. days of any video-recording after which all videos are usually [4] Badii, A., Einig, M., Tiemann, M., Thiemert, D. and Lallah, deleted. C., Visual context identification for privacy-respecting video As an additional challenge a set of 5 un-annotated videos analytics, in IEEE 14th International Workshop on Multimedia were provided for the participants, optionally to attempt blind Signal Processing (MMSP 2012), pp. 366-371, Banff, Canada, September 2012.