<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Relations as Context to Improve Multi-Target Tracking and Activity Recognition</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cristina Manfredotti</string-name>
          <email>cristina@cs.toronto.edu</email>
          <email>manfredotti@disco.unimib.it</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enza Messina</string-name>
          <email>messina@disco.unimib.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Fleet</string-name>
          <email>fleet@cs.toronto.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Science Dept, University of Toronto</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>DISCo, University of Milano-Bicocca</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>The explicit recognition of the relationships between interacting objects can improve the understanding of their dynamic model. In this work, we investigate the use of Relational Dynamic Bayesian Networks to represent the dependencies between objects' behaviors in the context of multi-target tracking. We propose a new formulation of the transition model that accommodates First-Order Logic relations, and we extend the Particle Filter algorithm to track relations between targets directly. Many applications can benefit from this work, such as activity recognition, traffic monitoring, strategic analysis and sports coaching. We present some results on activity recognition in monitoring Canadian coastal borders.</p>
      </abstract>
      <kwd-group>
        <kwd>Multi Target tracking</kwd>
        <kwd>Probabilistic Relational Models</kwd>
        <kwd>Bayesian Filtering</kwd>
        <kwd>Particle Filtering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Context interpretation and context-based reasoning have been shown to be key factors
for Computer Vision in the development of algorithms for object recognition [3]. In this
domain the context is the scene where objects are located and the knowledge about it is
expressed by the beliefs over the scene [4].</p>
      <p>In this paper we deal with moving objects, and we use the term context to mean
“what is happening around the object we are tracking”. Knowing the scene can
improve the object recognition task, and knowledge about the identity of the objects
improves the belief over the scene. Likewise, knowing what is happening in the scene (which
“relations” are believed to be true) can improve the tracking, and knowledge
about the state of the objects can improve our knowledge about the relations between the
objects in the scene (i.e. the context).</p>
      <p>Consider, for example, a group of people walking in
a park. If we know they are walking together (i.e. if we have a certain belief over their
relation), we know they will behave or move similarly, which helps
us track them. Moreover, taking into account the relations between objects
also allows us to recognize complex activities such as “going
to a pub together”: a single person walking can be a simple fragment of a more
complex activity that includes some people meeting, going in the same direction, waiting
for each other at different points and entering the pub together. Dealing with relations
between moving objects allows us to distinguish a complex activity like this one from
a similar one such as “catching the subway during rush hour”: this
activity also includes a group of people walking in the same direction, but
those people will not wait for each other. In recent years Computer Vision has mostly
dealt with the recognition of activities composed of the repetition of simple movements
[12]; the examples above are instead more complex activities that involve relations
between objects and/or single actions over time.</p>
      <p>In our work we model the context as a set of First-Order Logic relations, which we use
in two principal ways:
<list list-type="bullet">
<list-item><p>We use relations to improve the efficiency of the tracking. The information
contained in the relationships can improve prediction, resulting in a better estimate
of object trajectories.</p></list-item>
<list-item><p>We monitor relations as a goal in itself. This is the case in many applications,
such as traffic prediction, consumer monitoring, anomaly detection or activity
recognition.</p></list-item>
</list></p>
      <p>In this work we consider Relational Dynamic Bayesian Networks (RDBNs)<sup>1</sup>, an
extension of Probabilistic Relational Models [5] to dynamic domains, as a formalism
to monitor relations between moving objects. In our RDBN-based model, relationships
are considered as random variables whose values may change over time. While tracking
the objects in the domain, we also track the evolution of their relationships. For this
purpose, in the next sections we formalize a new dynamic model able to
predict the future state of the objects taking into account their relations, and we introduce
a new version of the Particle Filter (which we call the Relational Particle Filter) that adapts to
these settings. After presenting some preliminary results obtained on the Intelligent
System Challenge 2008-2009 data set<sup>2</sup> and a brief review of the literature, we conclude
with some final remarks.</p>
    </sec>
    <sec id="sec-2">
      <title>Modeling and Inference</title>
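      <p>A relational domain is a set of objects with relations between them. We call the
state <inline-formula><tex-math>s</tex-math></inline-formula> of a relational domain the relational state, and we define it as the set of
instantiations of all the objects and their relations in the domain. We can therefore divide the
relational state into two parts, the state of the objects<sup>3</sup> <inline-formula><tex-math>s^o</tex-math></inline-formula> and the state of the relations <inline-formula><tex-math>s^r</tex-math></inline-formula>,
and write <inline-formula><tex-math>s = [s^o, s^r]</tex-math></inline-formula>.</p>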
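      <p>As an illustration of this split, the following is a minimal sketch of a container for a relational state. The class name, the field names and the example values (two boats with a position and a speed, and one grounded relation) are hypothetical and are not taken from the original model.</p>
      <preformat><![CDATA[
```python
from dataclasses import dataclass, field


@dataclass
class RelationalState:
    """Relational state s = [s_o, s_r] (illustrative, hypothetical names)."""

    # s_o: attributes of each object, keyed by object id (here: position, speed)
    objects: dict = field(default_factory=dict)
    # s_r: truth value of each grounded relation, keyed by (name, obj_a, obj_b)
    relations: dict = field(default_factory=dict)


state = RelationalState(
    objects={"yacht": (120.0, 4.5), "fisher": (180.0, 3.0)},
    relations={("rendezvous", "yacht", "fisher"): True},
)
```
]]></preformat>
      <p>Keeping the two parts in separate fields mirrors the way the transition model and the sensor model treat them differently.</p>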
      <p>A Relational Bayesian Network (RBN) is a directed acyclic graph whose nodes are
First-Order Logic attributes or relations between objects in the relational domain and
whose structure represents the causality between the nodes.</p>
      <p><sup>1</sup> The authors are aware of the work of Sanghai, Weld and Domingos on
RDBNs; however, the paper presenting their work has been retracted. Refer to:
http://www.aaai.org/Library/JAIR/Vol24/jair24-019.php</p>
      <p><sup>2</sup> http://www.intelligent-systems-challenge.ca/home/index.html</p>
      <p><sup>3</sup> We will use the terms state of objects and state of instantiations interchangeably.</p>
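      <p>In dynamic domains, relational states evolve over time and RBNs have to be
extended to RDBNs. A Relational Dynamic Bayesian Network is structured as a pair
of RBNs <inline-formula><tex-math>(B_0, B_{\rightarrow})</tex-math></inline-formula>, where <inline-formula><tex-math>B_0</tex-math></inline-formula> represents the probability distribution over the state of
the relational domain at time 0 and <inline-formula><tex-math>B_{\rightarrow}</tex-math></inline-formula> is an RBN whose nodes at time <inline-formula><tex-math>t</tex-math></inline-formula> have as parents
predicates at time <inline-formula><tex-math>t-1</tex-math></inline-formula> or at time <inline-formula><tex-math>t</tex-math></inline-formula>, and whose nodes at time <inline-formula><tex-math>t-1</tex-math></inline-formula> appear without their
parents.</p>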
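      <p>To perform inference in a multi-target setting, we need to extend the algorithms
traditionally used in tracking so that they represent relations. As in classic tracking, the aim is
to estimate the current posterior distribution of the state <inline-formula><tex-math>s_t</tex-math></inline-formula> conditioned on the
sequence of observations <inline-formula><tex-math>z_{1:t}</tex-math></inline-formula> up to time <inline-formula><tex-math>t</tex-math></inline-formula>, that is <inline-formula><tex-math>p(s_t \mid z_{1:t})</tex-math></inline-formula>. This distribution is often called
the target's belief, <inline-formula><tex-math>\mathrm{bel}(s_t)</tex-math></inline-formula>.</p>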
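      <p>
        The tracker predicts the probability distribution of the future state <inline-formula><tex-math>s_t</tex-math></inline-formula>, given the
knowledge about the current state <inline-formula><tex-math>s_{t-1}</tex-math></inline-formula>, by means of a state transition model <inline-formula><tex-math>p(s_t \mid s_{t-1})</tex-math></inline-formula>.
Once measurements <inline-formula><tex-math>z_t</tex-math></inline-formula> about the state at time <inline-formula><tex-math>t</tex-math></inline-formula> are acquired, the state is filtered using
the sensor model <inline-formula><tex-math>p(z_t \mid s_t)</tex-math></inline-formula>, which relates (potentially noisy) measurements to the state.
The relational transition model <inline-formula><tex-math>p(s_t \mid s_{t-1}) = p(s_t^o, s_t^r \mid s_{t-1}^o, s_{t-1}^r)</tex-math></inline-formula> is a joint
probability over the state of all instances and relations. We assume that the state of the relations
is not directly affected by the state of the objects at the previous time step (see
Figure 1). Therefore the transition model can be rewritten as:
<disp-formula><label>(1)</label><tex-math>p(s_t^o, s_t^r \mid s_{t-1}^o, s_{t-1}^r) = p(s_t^o \mid s_{t-1}^o, s_{t-1}^r) \, p(s_t^r \mid s_{t-1}^r, s_t^o).</tex-math></disp-formula>
The sensor model <inline-formula><tex-math>p(z_t \mid s_t)</tex-math></inline-formula> gives the probability of the measurements obtained at time <inline-formula><tex-math>t</tex-math></inline-formula> given
the state at the same time. We assume that the relations are not directly
measurable, so the observation <inline-formula><tex-math>z_t</tex-math></inline-formula> is independent of the relations between objects:
<disp-formula><label>(2)</label><tex-math>p(z_t \mid s_t) = p(z_t \mid s_t^o, s_t^r) = p(z_t \mid s_t^o).</tex-math></disp-formula>
      </p>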
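      <p>The step that leads to Equation (1) can be spelled out with the chain rule. The following is a sketch of the reasoning under the independence assumption stated above, not a derivation taken from the original text. The first equality is exact; the second drops the previous object state from the conditioning set of the relations, which is the stated assumption:
<disp-formula><tex-math>p(s_t^o, s_t^r \mid s_{t-1}^o, s_{t-1}^r) = p(s_t^o \mid s_{t-1}^o, s_{t-1}^r) \, p(s_t^r \mid s_t^o, s_{t-1}^o, s_{t-1}^r) = p(s_t^o \mid s_{t-1}^o, s_{t-1}^r) \, p(s_t^r \mid s_{t-1}^r, s_t^o).</tex-math></disp-formula>
In words, the objects are predicted first from the full previous state, and the relations are then predicted from the previous relations and the newly predicted objects.</p>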
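      <p>Under the Markov assumption and the conditional independence of the data given
the state, we can use a Bayesian filter algorithm to compute the belief of the relational
state:
<disp-formula><label>(3)</label><tex-math>\mathrm{bel}(s_t) = \alpha \, p(z_t \mid s_t^o) \, \widetilde{\mathrm{bel}}(s_t)</tex-math></disp-formula>
where <inline-formula><tex-math>\alpha</tex-math></inline-formula> is a normalization constant and <inline-formula><tex-math>\widetilde{\mathrm{bel}}(s_t)</tex-math></inline-formula> is the prediction made over the system,
<inline-formula><tex-math>p(s_t^o, s_t^r \mid z_{1:t-1})</tex-math></inline-formula>, which can be computed as:
<disp-formula><label>(4)</label><tex-math>\widetilde{\mathrm{bel}}(s_t) = \int p(s_t^o, s_t^r \mid s_{t-1}^o, s_{t-1}^r) \, \mathrm{bel}(s_{t-1}) \, ds_{t-1}.</tex-math></disp-formula>
According to the state transition model (Equation (1)), we can write Equation (4) as:
<disp-formula><label>(5)</label><tex-math>\widetilde{\mathrm{bel}}(s_t) = \int p(s_t^o \mid s_{t-1}^o, s_{t-1}^r) \, p(s_t^r \mid s_{t-1}^r, s_t^o) \, \mathrm{bel}(s_{t-1}) \, ds_{t-1}.</tex-math></disp-formula>
      </p>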
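      <p>To make the prediction and update steps concrete, the sketch below evaluates them on a small discrete state space, where the integral over the previous state becomes a sum. It assumes binary object and relation states, and all probability tables are hypothetical numbers chosen only for illustration; the function name is likewise an assumption.</p>
      <preformat><![CDATA[
```python
import numpy as np


def relational_bayes_step(bel_prev, T_o, T_r, lik):
    """Discrete analogue of the prediction and update of the relational belief.

    bel_prev[p, q] = bel(s_{t-1}) with object state p and relation state q
    T_o[p, q, o]   = p(s_t^o = o | s_{t-1}^o = p, s_{t-1}^r = q)
    T_r[q, o, r]   = p(s_t^r = r | s_{t-1}^r = q, s_t^o = o)
    lik[o]         = p(z_t | s_t^o = o)
    """
    # Prediction: sum over the previous state using the factorized transition model.
    pred = np.einsum("pqo,qor,pq->or", T_o, T_r, bel_prev)
    # Update: weight by the sensor model (objects only) and normalize.
    post = lik[:, None] * pred
    return pred, post / post.sum()


# Hypothetical tables; each conditional distribution sums to one over its last axis.
bel_prev = np.full((2, 2), 0.25)
T_o = np.array([[[0.9, 0.1], [0.6, 0.4]],
                [[0.2, 0.8], [0.5, 0.5]]])
T_r = np.array([[[0.95, 0.05], [0.7, 0.3]],
                [[0.4, 0.6], [0.1, 0.9]]])
lik = np.array([0.2, 0.7])

pred, bel = relational_bayes_step(bel_prev, T_o, T_r, lik)
```
]]></preformat>
      <p>Although the measurement depends only on the object state, the update changes the belief over the relation as well, because the two are coupled through the prediction.</p>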
      <p>
        In the most general case we can represent the two partial transition models of
Equation (1) by a First-Order Probabilistic Tree (FOPT)<sup>4</sup>. We will introduce an example of an FOPT
when describing the experiments.
      </p>
      <p><bold>Relational Particle Filter</bold></p>
      <p>The specific and complex probabilistic nature of the presented setting makes it impossible
to use filters that require a probability function in closed form, such as the Kalman
filter. To address this issue we developed an extension of the Particle Filter (PF) algorithm,
whose properties are appealing for our case.</p>
      <p>The PF algorithm [1] is a Monte Carlo method that approximates the required
posterior density function by a set of random samples with associated weights and computes
estimates based on these samples and weights. As the number of samples becomes very
large, the Monte Carlo approximation to the correct posterior improves and the PF
approaches the optimal Bayesian estimate.</p>
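      <p>In symbols, writing <inline-formula><tex-math>x_t^{(m)}</tex-math></inline-formula> for the samples and <inline-formula><tex-math>\tilde{\omega}^{(m)}</tex-math></inline-formula> for their normalized weights, with <inline-formula><tex-math>m = 1, \dots, M</tex-math></inline-formula>, the weighted-sample approximation described above is usually written in the standard form found in tutorials such as [1]:
<disp-formula><tex-math>p(s_t \mid z_{1:t}) \approx \sum_{m=1}^{M} \tilde{\omega}^{(m)} \, \delta\big(s_t - x_t^{(m)}\big), \qquad \sum_{m=1}^{M} \tilde{\omega}^{(m)} = 1,</tex-math></disp-formula>
so that the expectation of a function <inline-formula><tex-math>f</tex-math></inline-formula> of the state is estimated by <inline-formula><tex-math>\sum_{m} \tilde{\omega}^{(m)} f\big(x_t^{(m)}\big)</tex-math></inline-formula>.</p>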
      <p>
        We integrate the relational transition model introduced in Equation (1) in a new
Relational Particle Filter (RPF), shown in Algorithm 1.
      </p>
      <p>Algorithm 1: Pseudocode for the Relational Particle Filter algorithm</p>
      <preformat>bel(s_t) = RPF(bel(s_{t-1}), z_t)
for all m = 1:M do
  1. x_t^{o,(m)} ~ p(s_t^o | s_{t-1}^o, s_{t-1}^r);                        hypothesis for the state of instantiations
  2. x_t^{r,(m)} ~ p(s_t^r | s_{t-1}^r, s_t^o = x_t^{o,(m)});              hypothesis for the state of relations
  3. \omega^{(m)} = p(z_t | x_t^{o,(m)});                                  weights computation
for all m = 1:M do
  4. \tilde{\omega}^{(m)} = \omega^{(m)} / \sum_{m=1}^{M} \omega^{(m)};    weights normalization
5. Resample bel(s_t) from {[x_t^{o,(m)}, x_t^{r,(m)}]} according to the weights {\tilde{\omega}^{(m)}}, with repetition.</preformat>
      <p><sup>4</sup> A FOPT (also known as First-Order Decision Diagram [2]) is a probabilistic tree whose nodes
are First-Order Logic formulas.</p>
      <p>A particle <inline-formula><tex-math>x_t^{(m)}</tex-math></inline-formula> is a representation of the state. For this reason, in our setting,
it is divided into two parts: the part of the objects <inline-formula><tex-math>x_t^{o,(m)}</tex-math></inline-formula> and the part of the relations
<inline-formula><tex-math>x_t^{r,(m)}</tex-math></inline-formula> (see Figure 2(a)). The part of the particle relative to the instantiations is sampled
according to <inline-formula><tex-math>p(s_t^o \mid s_{t-1}^o, s_{t-1}^r)</tex-math></inline-formula> (Line 1); subsequently, the part of the particle relative to
the relations is sampled according to the second part of the relational transition model
(Line 2). When the measurement is acquired, particles are weighted according to the
sensor model (Line 3). The sensor model takes into account only the part of the
particle relative to the objects; however, since each particle is composed of two parts, the part
associated with the relations is weighted as well. After the weighting step, the weights are
normalized (Line 4) and the set of particles for the next iteration is drawn according to the
normalized weights in the resampling step (Line 5).</p>
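      <p>The five steps described above can be sketched in Python as follows. This is a minimal illustration rather than the authors' implementation: the function <monospace>rpf_step</monospace> follows the steps of Algorithm 1, while the motion model, the relation model, the Gaussian sensor model and all numeric values (two boats on a line, one binary relation) are hypothetical stand-ins.</p>
      <preformat><![CDATA[
```python
import numpy as np


def rpf_step(x_o, x_r, z, sample_objects, sample_relations, likelihood, rng):
    """One Relational Particle Filter iteration over M particles.

    x_o: (M, d) array, object part of each particle
    x_r: (M,) array of 0/1, relation part of each particle
    z:   (d,) measurement of the object state
    """
    new_o = sample_objects(x_o, x_r, rng)        # step 1: hypothesis for the objects
    new_r = sample_relations(x_r, new_o, rng)    # step 2: hypothesis for the relations
    w = likelihood(z, new_o) + 1e-300            # step 3: weights (tiny constant avoids 0/0)
    w = w / w.sum()                              # step 4: normalization
    idx = rng.choice(len(w), size=len(w), p=w)   # step 5: resampling with repetition
    return new_o[idx], new_r[idx]


# Hypothetical toy models for two boats on a line; not the models used in the paper.
def sample_objects(x_o, x_r, rng):
    mid = x_o.mean(axis=1, keepdims=True)
    pull = 0.2 * (mid - x_o) * x_r[:, None]      # related boats drift toward each other
    return x_o + pull + rng.normal(0.0, 1.0, size=x_o.shape)


def sample_relations(x_r, x_o, rng):
    gap = np.abs(x_o[:, 0] - x_o[:, 1])
    p_on = 0.8 * x_r + 0.2 * np.exp(-gap / 20.0)  # relation persists; closeness favours it
    return (rng.random(len(x_r)) < p_on).astype(int)


def likelihood(z, x_o, sigma=5.0):
    # Gaussian sensor model on the object part only; relations are not observed.
    return np.exp(-0.5 * np.sum((x_o - z) ** 2, axis=1) / sigma**2)


rng = np.random.default_rng(0)
M = 500
x_o = rng.normal([100.0, 140.0], 5.0, size=(M, 2))  # initial object hypotheses
x_r = rng.integers(0, 2, size=M)                    # initial relation hypotheses
for z in np.array([[102.0, 138.0], [105.0, 135.0], [108.0, 132.0]]):
    x_o, x_r = rpf_step(x_o, x_r, z, sample_objects, sample_relations, likelihood, rng)

position_estimate = x_o.mean(axis=0)   # mean object state over particles
relation_belief = x_r.mean()           # fraction of particles in which the relation holds
```
]]></preformat>
      <p>Because resampling selects whole particles, the relation part of each particle survives or dies together with its object part, which is how the unobserved relations come to be weighted by the measurement.</p>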
      <p>[Figure 2 panels: (a) Particle representation. (b) First step of hypothesis. (c) Second step of hypothesis. (d) Particle weighting.]</p>
      <p>We evaluated the proposed method on a synthetic data set based on the
Intelligent System Challenge 2008-2009 data set. We chose this data set because it
is easier than a real data set but still challenging. The data set contains descriptions
of events that happened at sea; each element of the data set reports the tracks of two
boats participating together in an event (i.e. Rendezvous, PickUp or Avoidance). In
each time interval at most one event takes place.</p>
      <p>We are particularly interested in the case where there is uncertainty about the
participants taking part in an event, in order to demonstrate the advantage of maintaining
beliefs over the set of possible relations. In order to test our relational particle filter
algorithm for activity recognition in a more challenging scenario with multiple targets,
we use the original data set to build a new synthetic data set of 120 situations (either
rendezvous or avoidance), obtained by pairing two encounters randomly sampled from
the original data set. In this way, four ships are present at the same time in the scene.</p>
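      <p>The construction of the four-ship situations can be sketched as follows. The encounter records below are hypothetical stand-ins for the original data, and the sketch assumes that the two encounters paired in a situation are distinct, which the text does not state explicitly.</p>
      <preformat><![CDATA[
```python
import random


def build_situations(encounters, n_situations, seed=0):
    """Form each situation by pairing two distinct, randomly sampled encounters."""
    rng = random.Random(seed)
    return [tuple(rng.sample(encounters, 2)) for _ in range(n_situations)]


# Hypothetical stand-in data: (event label, names of the two boats in the encounter).
encounters = [
    ("rendezvous" if i % 2 == 0 else "avoidance", (f"boat{2 * i}", f"boat{2 * i + 1}"))
    for i in range(10)
]
situations = build_situations(encounters, 120)
ships_per_situation = [len(a[1]) + len(b[1]) for a, b in situations]
```
]]></preformat>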
      <p>In the experiment, we consider the task of detecting a rendezvous between a yacht
and a fisher. After describing the setting of our experiments, in the next subsection
we report some results.</p>
      <p><bold>Settings and Results</bold></p>
      <p>[Figure: Rendezvous between a yacht and a fisher. Panels plot x position and x speed against time.]</p>
      <p>
        We used the data set to estimate the prior for the event Rendezvous between a Fisher
and a Yacht (33/80). Then, we examined the data relative to the encounters in order to
acquire information about the two different events (rendezvous or avoidance) that can
be used to predict the relation. In particular, we focused on the variation of the speed of
the two targets. Consider, for example, the rendezvous in Figure 3: the two ships come
closer and both progressively reduce their speed to a nearly zero value. The case of
ships that are avoiding each other (and thus are not in relation according to our
model) is different: one maintains its speed and the other decelerates (Figure 4).
      </p>
      <p>From the figures it is also possible to notice the three phases that characterize a
rendezvous: in the first phase the ships approach each other while reducing their speed,
in the second phase they travel in the same direction at nearly zero speed, and finally
they move apart and at least one of them changes its speed. Our relational transition model
takes these three phases into account, which allows us to detect when the event starts
and when it finishes, and also to determine whether two ships can be in relation (since
a ship can be in relation only with another ship).</p>
      <p>
        An example of the relational transition model used in our experiments is given in
Figure 5 and in Figure 6.
      </p>
      <p>
        We ran the experiments on each of the 120 sets of four tracks in the data set.
Table 1 shows the accuracy of our method for rendezvous detection compared
to the accuracy of a method that randomly chooses which boats are in relation. The
table also reports the average tracking error of the RPF algorithm compared to a
PF algorithm that does not take relations into account. The tracking error is computed
as the distance between the trajectories estimated by the particle filter (at each time step,
the position is averaged over the states of all particles) and the real trajectories.
      </p>
      <p>Our work is at the intersection of work on Probabilistic Relational Models, which to our
knowledge have never before been applied to tracking, and Computer Vision,
where heuristics are often used to improve tracking, but not with a systematic account
of relationships between targets.
      </p>
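      <p>The tracking error used in the experiments (the distance between the per-time-step average of the particle positions and the real trajectory) can be sketched as follows. The text does not specify the distance or how it is aggregated over time, so the Euclidean norm and the mean over time steps used here are assumptions, as are the function name and the array layout.</p>
      <preformat><![CDATA[
```python
import numpy as np


def tracking_error(particles, truth):
    """Mean Euclidean distance between the particle-mean track and the true track.

    particles: (T, M, d) positions of M particles at each of T time steps
    truth:     (T, d) real positions
    """
    estimate = particles.mean(axis=1)  # average over particles at each time step
    return float(np.linalg.norm(estimate - truth, axis=1).mean())


truth = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]])
# Two particles placed symmetrically around the truth average back onto it.
err_symmetric = tracking_error(np.stack([truth + 1.0, truth - 1.0], axis=1), truth)
# A single particle displaced by (3, 4) at every step.
err_offset = tracking_error(truth[:, None, :] + np.array([3.0, 4.0]), truth)
```
]]></preformat>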
      <p>Recently there has been increasing interest in models that extend probabilistic
reasoning to First-Order Logic to exploit redundancies observed in the world ([5], [6]).
In this setting, many relational inference algorithms proceed by first fully
instantiating the First-Order relations and then working at the propositional level. In [10] an
inference algorithm that instantiates relations only as needed is presented, but this
algorithm can deal only with static domains, as the relations are not supposed to change
over time. Our model also differs from the one presented in [9], where the
concept of class is used to develop an inference system able to deal with a large number
of heterogeneous objects. We use First-Order Logic to explicitly represent relationships
between objects in order to improve the inference task. Our method is potentially applicable to
situations with a large number of objects as well.</p>
      <p>Hybrid-state models have been used to deal with complex tracking tasks [7]. They
combine continuous-valued dynamics with a discrete state of the world (the context) that
encodes which dynamic model is active; switching between models is performed jointly with tracking. Our system uses
relations as representations of the context of each object instead of the context of the
entire world. The explicit recognition of the relations of each object allows us to deal
with much more complex tracking tasks. Moreover, the use of First-Order Logic (as
opposed to propositional logic) generalizes our models to different domains.</p>
      <p>In [8] the recognition of complex activities (temporally extended activities that can
be decomposed into simple ones) is based on a context-free grammar. The authors decouple the
recognition task into two levels: a lower level detects single simple activities, which are
the inputs to a stochastic context-free grammar used as a “bag of words” by a parsing
mechanism. Our approach, instead, does not decouple the recognition task but seeks to
take advantage of the tracking, which provides the detection of simple activities, to
recognize the temporally extended activity, and of the knowledge about the complex
activity to improve the tracking.</p>
      <p>In [11], the authors address the problem of activity recognition using First-Order
Logic rules and Markov Logic Networks to represent common-sense domain
knowledge. Unlike in the method we propose, the inference task is performed
off-line: they perform probabilistic inference for input queries about events of interest
that have already happened. We seek, instead, to perform on-line probabilistic inference of
both the state of the domain and the activities.</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>In this paper we presented a technique based on relational Bayesian reasoning
to address the problem of activity recognition and tracking. We presented an extension
of the particle filter, called the relational particle filter, that can be used to perform inference. From
our preliminary results we can conclude that our method can help to identify the type of
encounter in which the targets are engaged. Moreover, we have shown how using relations as
context can improve the tracking task. Compared to hybrid-state model techniques, we
are able to model the problem with a single dynamic model, and the state representation
is much more compact.</p>
      <p>There are a number of possible applications of this approach to problems in which
a situation needs to be monitored from sensed data (video surveillance,
homeland security, etc.), which we intend to consider in future work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>S.</given-names>
            <surname>Arulampalam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Maskell</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Gordon</surname>
          </string-name>
          .
          <article-title>A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking</article-title>
          .
          <source>IEEE Transactions on Signal Processing</source>
          ,
          <volume>50</volume>
          :
          <fpage>174</fpage>
          -
          <lpage>188</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Joshi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Khardon</surname>
          </string-name>
          .
          <article-title>First order decision diagrams for relational MDPs</article-title>
          .
          <source>JAIR</source>
          ,
          <volume>31</volume>
          :
          <fpage>431</fpage>
          -
          <lpage>472</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>D.</given-names>
            <surname>Hoiem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Efros</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Hebert</surname>
          </string-name>
          .
          <article-title>Putting objects in perspective</article-title>
          .
          <source>In Proc. IEEE Computer Vision and Pattern Recognition (CVPR)</source>
          , volume
          <volume>2</volume>
          , pages
          <fpage>2137</fpage>
          -
          <lpage>2144</lpage>
          ,
          <year>June 2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>G.</given-names>
            <surname>Elidan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Heitz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Koller</surname>
          </string-name>
          .
          <article-title>Learning object shape: From drawings to images</article-title>
          .
          <source>In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>N.</given-names>
            <surname>Friedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Getoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Koller</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A.</given-names>
            <surname>Pfeffer</surname>
          </string-name>
          .
          <article-title>Learning probabilistic relational models</article-title>
          .
          <source>In IJCAI</source>
          , pages
          <fpage>1300</fpage>
          -
          <lpage>1309</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>L.</given-names>
            <surname>Getoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Friedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Koller</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Taskar</surname>
          </string-name>
          .
          <article-title>Learning probabilistic models of link structure</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>3</volume>
          :
          <fpage>679</fpage>
          -
          <lpage>707</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>M.</given-names>
            <surname>Isard</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Blake</surname>
          </string-name>
          .
          <article-title>A mixed-state condensation tracker with automatic model-switching</article-title>
          .
          <source>In ICCV</source>
          , pages
          <fpage>107</fpage>
          -
          <lpage>112</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Y. A.</given-names>
            <surname>Ivanov</surname>
          </string-name>
          and
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Bobick</surname>
          </string-name>
          .
          <article-title>Recognition of visual activities and interactions by stochastic parsing</article-title>
          .
          <source>IEEE Trans. Pattern Anal. Mach. Intell.</source>
          ,
          <volume>22</volume>
          (
          <issue>8</issue>
          ):
          <fpage>852</fpage>
          -
          <lpage>872</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>B.</given-names>
            <surname>Milch</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Russell</surname>
          </string-name>
          .
          <article-title>General-purpose MCMC inference over relational structures</article-title>
          .
          <source>In UAI</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>H.</given-names>
            <surname>Poon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Domingos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Sumner</surname>
          </string-name>
          .
          <article-title>A general method for reducing the complexity of relational inference and its application to MCMC</article-title>
          .
          <source>In AAAI</source>
          , pages
          <fpage>1075</fpage>
          -
          <lpage>1080</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>S. D.</given-names>
            <surname>Tran</surname>
          </string-name>
          and
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Davis</surname>
          </string-name>
          .
          <article-title>Event modeling and recognition using Markov logic networks</article-title>
          .
          <source>In ECCV (2)</source>
          , pages
          <fpage>610</fpage>
          -
          <lpage>623</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sukthankar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Hebert</surname>
          </string-name>
          .
          <article-title>Event detection in crowded videos</article-title>
          .
          <source>In IEEE International Conference on Computer Vision</source>
          ,
          <year>October 2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>