<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Validate Privacy Constraints in Surveillance Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>László Havasi</string-name>
          <email>havasi@sztaki.hu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zoltán Szlávik</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Levente Kovács</string-name>
          <email>levente.kovacs@sztaki.hu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ákos Utasi</string-name>
          <email>utasi@sztaki.hu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Csaba Benedek</string-name>
          <email>bcsaba@sztaki.hu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tamás Szirányi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>MTA-SZTAKI</institution>
          ,
          <addr-line>H-1111 Budapest Kende u 13-17</addr-line>
        </aff>
      </contrib-group>
      <fpage>81</fpage>
      <lpage>88</lpage>
      <abstract>
        <p>The paper introduces a video surveillance and event detection framework and application for semi-supervised surveillance use. The system development follows the guidelines of national strategy for surveillance systems. The aim is to generalize the interoperability, compatibility and legality in camera surveillance systems in Hungary. The system's intended use is in automatic mode on camera feeds that are not actively watched by surveillance personnel, and should raise alarms when unusual events occur. We present the current detector filters, and the extendable modular interface. Filters include local and global unusual motion detectors, left/stolen object detector, motion detector, tampering/failure detector, etc. It has been tested in real life situation for police street surveillance.</p>
      </abstract>
      <kwd-group>
        <kwd>surveillance</kwd>
        <kwd>image processing</kwd>
        <kwd>annotation</kwd>
        <kwd>content based retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Visual surveillance and activity analysis has attained great interest in the field of
computer vision research [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2,3,4</xref>
        ]. We provide a transparent and distributed
architecture for easy integration of third party modules into a common framework to
facilitate easier research collaboration and evaluation. The setup is hierarchical thus
helping the scalability of the whole framework. Various complex surveillance related
algorithms, such as analysis algorithm for static and moving cameras, automatic fight
detection, shadow segmentation, discovery of unusual motion patterns are integrated
into this framework. In the case of detecting unusual motion occurrences, we refer to
the term unusual in statistical sense. The system development is driven by the recent
published strategy for public surveillance systems [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The goal of the published
strategy guidelines is to standardize the technical and service capabilities of
surveillance systems. Furthermore the strategy focused on the harmonization of
judicial and privacy constraints to the necessary technical solutions.
      </p>
      <p>Post-proceedings of the 2nd International Conference
on ICT Solutions for Justice (ICT4Justice 2009)</p>
      <sec id="sec-1-1">
        <title>1.2 System Architecture</title>
        <p>The system consists of a user interface and a database backend. The interface runs on
a dual display workstation, which also contains the frame grabber cards that accept
the camera feeds. The feeds come from large camera matrix multiplexers. The
operators monitor and use the system through this interface, while the database
backend can be anywhere.</p>
        <p>A single interface instance can handle several live feeds simultaneously, and each
can be assigned a chain of freely combined filters (i.e. filter chains). The handlers and
the filters were written to be real time, with SMP and heavy multithreading in mind.
Increasing the number of processor cores can increase the possibility of using more
filters on more feeds. Modules/filters can be added easily, either by coding them by
using a provided class template as internal filters, or by a provided a library template,
as a plugin. Either way, the coder needs only focus on the core algorithm, interfacing
is seamless.</p>
      </sec>
      <sec id="sec-1-2">
        <title>2.1 Panorama Image</title>
        <p>In this section we describe some of the more important modules/filters currently
deployed in the framework. The system also contains classical surveillance functions.
The need of constructing and displaying a panorama image of the scene arose since
there have been a lot of panning cameras that cover a large field of view. This module
allows us to construct and display the full field of view of a camera for the operator,
and also to identify the actual camera position. The method continuously registers the
incoming frames and builds a mosaic. The properties of these cameras are totally
unknown and different on each camera source.</p>
      </sec>
      <sec id="sec-1-3">
        <title>2.2 Unusual motion detector</title>
        <p>
          For this filter, the goal was to detect unusual large motion patterns [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Intended use
cases are, e.g.:
• Someone goes against the traffic in a one way street: long term statistics show
one major motion direction, and then a different motion occurs.
• One lane jams in a two way street: long term statistics show two typical motion
directions, and then one of them disappears.
• Accident, traffic jam: statistics show intensive various motions, which
considerably slows, stops, or drops in variance.
        </p>
        <p>
          The second filter also detects unusual motion patterns [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], but in a local manner: it
operates locally, builds different statistics for parts of the frame, can signal unusual
patterns at different locations, and can give the mask of the moving object. The
method is based on statistical processing of raw data without object level
understanding, and uses spatio-temporal information for the analysis of motion.
        </p>
      </sec>
      <sec id="sec-1-4">
        <title>2.3 Fight detector</title>
        <p>The algorithm of this filter detects fighting people (a few people or small group), and
raises an alarm when such disorderly motion patterns are detected in the video stream.
The tuning of this algorithm is easy and mostly invariant to the characteristics of
video (spatial resolution, refresh rate, view parameters, etc.). This filter runs in real
time.</p>
      </sec>
      <sec id="sec-1-5">
        <title>2.3 Left and Removed Object Detector</title>
        <p>
          In conventional video surveillance applications, the aims of background modeling and
background subtraction modules are usually limited to moving object detection and
analysis. However, relevant information can be exploited by following the changes in
the background as well. We implemented a filter [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], which not only detects objects
moving in front of the camera, but it detects changes in the static background and
signals the appearance of new objects (i.e. objects that are brought into the field of
view, then left there, or objects that are taken from the field). The method can be used
to observe abandoned or stolen objects, which is an important surveillance task.
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>3 Indexing and Retrieval</title>
      <p>For indexing and retrieval of frame and video data we created a searchable video
database application, on which textual and content-based searches can be performed,
and also an annotation application which can be used to assign annotations to
archived video footage. After a video – with or without annotations – is imported into
the database, several content based feature extractors are automatically run (i.e.
average color, color structure, texture, motion, edge histogram, day/night, grass/sky,
etc.) that will be indexed (by a modified BK-tree based approach) and used when
searches based on a model will be performed. Textual data, which comes from the
annotations, is also stored into the database, thus textual and content based queries
can both be performed.</p>
      <p>All the modules of the unusual event detection framework produce alarm and
module-output data in XML markup which is imported into a relational database with
native markup language support. This provides us with the event data and details that
can be later looked through and searches can be performed upon. The main data that
gets stored into the database are the id of the alarm module, the time of the
occurrence, and description of the event, associated filter data for later processing, the
id of the camera feed on which the event occurred, optional annotations that can
manually be assigned to events, and a frame taken at the time of the event from the
live camera feed.</p>
      <sec id="sec-2-1">
        <title>3.1 Visualization</title>
        <p>Besides presenting the results in decreasing order of relevance to the user on the
web interface above, we created a tool which provides a flexible way for browsing
among database shots and images, easily selecting query shots/images, performing
textual queries, viewing/editing/assigning annotations and categories to shots/images,
2D and 3D visualization of results and images belonging to a specific category.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4 Privacy Aspects</title>
      <p>Most surveillance systems are subject to privacy control legal issues. Unusual events
generating alarms are very rare, but watching and recording usual human activities are
not accepted by legal authorities keeping the human rights for privacy. Implementing
automatic alarming plug-ins into a surveillance system can help to control scenes of
sensitive private areas as well without hurting the privacy. The above mentioned
automatic capabilities make the system privacy-friendly, avoiding the unnecessary
human watching.</p>
      <p>
        The development of the introduced system is synchronized to the Hungarian
videosurveillance strategy [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The main aspects of the forthcoming developments are:
• Coherently: the independent systems can connected in a hierarchical
order.
• Interoperability: whole system must be available for different application
domains with specific database support.
• Certificated video storage: the video accesses are controlled centrally. The
application server verifies the requested data.
• Access dependent quality control: sensorial data (e.g. video) is available
various quality depending on the access rights
• Autonomous functioning: built in plug-ins focus operator attention on
unusual events.
      </p>
    </sec>
    <sec id="sec-4">
      <title>5 Conclusions</title>
      <p>We presented an automatic surveillance system, which is intended to be an aid for
surveillance operators who handle hundreds of feeds, thus being physically unable to
watch all feeds at once. We presented some of our current filters that are the base of
the unusual event signaling and review framework. The actual version has been tested
at local police surveillance stations, and we are working towards creating a finalized
deployable version. The system is being developed to be as much modular and
extendable as possible, with easy integration with existing systems.</p>
      <p>The most promising result is the creation of national strategy for admissible
surveillance systems. The further developments will support ASP (Application
Service Provider) as a centralized event and quality monitoring and retrieval backend.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>National strategy for development of public video surveillance systems</article-title>
          in Hungary: http://ekmk.sztaki.hu/images/SPOTS/a%20k%
          <article-title>F6zter%FCleti%20t%E9rfigyel%E9s%20stra t%E9gi%E1ja_20090331</article-title>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Hampapur</surname>
          </string-name>
          , L. Brown,
          <string-name>
            <given-names>R.</given-names>
            <surname>Feris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Senior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chiao-Fe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhai</surname>
          </string-name>
          and Max Lu, “
          <article-title>Searching surveillance video”</article-title>
          ,
          <source>In Proc. of AVSS</source>
          <year>2007</year>
          , pp.
          <fpage>75</fpage>
          -
          <lpage>80</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Meessen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Coulanges</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Desurmont</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.F.</given-names>
            <surname>Delaigle</surname>
          </string-name>
          ,
          <article-title>"Content-Based Retrieval of Video Surveillance Scenes,"</article-title>
          <source>Multimedia Content Representation, Classification and Security</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>G.</given-names>
            <surname>Medioni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bremond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hongeng</surname>
          </string-name>
          , R. Nevatia, “
          <article-title>Event detection and analysis from video streams”</article-title>
          ,
          <source>IEEE Trans. on Pattern Analysis and Machine Intelligence</source>
          , vol.
          <volume>23</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>873</fpage>
          -
          <lpage>889</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Á. Utasi and
          <string-name>
            <given-names>L.</given-names>
            <surname>Czúni</surname>
          </string-name>
          ,
          <article-title>"Anomaly Detection with Low-level Processes in Videos"</article-title>
          ,
          <source>The 3rd International Conference on Computer Vision Theory and Applications</source>
          , pp.
          <fpage>678</fpage>
          -
          <lpage>681</lpage>
          ,
          <string-name>
            <surname>VISAPP</surname>
          </string-name>
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Z.</given-names>
            <surname>Szlávik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Kovács; L. Havasi</surname>
          </string-name>
          , Cs. Benedek,
          <string-name>
            <surname>I. Petrás</surname>
          </string-name>
          , Á. Utasi,
          <string-name>
            <given-names>A.</given-names>
            <surname>Licsár</surname>
          </string-name>
          , L. Czúni, , T. Szirányi, “
          <article-title>Behavior and event detection for annotation and surveillance</article-title>
          .”,
          <source>CBMI</source>
          <year>2008</year>
          , pp.
          <fpage>117</fpage>
          -
          <lpage>124</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>