=Paper=
{{Paper
|id=None
|storemode=property
|title=EURECOM @ MediaEval 2011 Social Event Detection Task
|pdfUrl=https://ceur-ws.org/Vol-807/Liu_SED_EURECOM_me11wn.pdf
|volume=Vol-807
|dblpUrl=https://dblp.org/rec/conf/mediaeval/LiuHT11
}}
==EURECOM @ MediaEval 2011 Social Event Detection Task==
<pdf width="1500px">https://ceur-ws.org/Vol-807/Liu_SED_EURECOM_me11wn.pdf</pdf>
<pre>
 EURECOM @ MediaEval 2011 Social Event Detection Task

                   Xueliang Liu                             Benoit Huet                      Raphaël Troncy
               EURECOM,France                         EURECOM,France                        EURECOM,France
             xueliang.liu@eurecom.fr                benoit.huet@eurecom.fr              raphael.troncy@eurecom.fr


ABSTRACT                                                           mation online about such scheduled events. For example,
In this paper, we present our approach and results of the Me-      FBLeague1 provides the official football games that regis-
diaEval 2011 social event detection (SED) task. We solve the       tered in FIFA2 and UEFA3 . From this web site, we obtained
event detection problem in three steps. First, we query all        461 football games that occurred in May 2009, among which
event instances that happened given some condition. Then,          6 took place in Roma and Barcelona. These 6 soccer events
an event identification model is proposed to measure the re-       are our prior knowledge for the challenge 1.
lationship between events and photos. Finally, visual prun-           For challenge 2, we extract concerts information from event
ing and owner refining heuristics are employed to improve          directories such as Last.fm4 , Eventful5 , and Upcoming 6 .
the results.                                                       After manual check, only Last.fm contains descriptions of
                                                                   events held on the given conditions. Last.fm is a popular
                                                                   music web site that records concert events held in more than
1.    INTRODUCTION                                                 190 countries. In addition, Last.fm provides an API for the
   The Social Event Detection task at MediaEval 2011 aims          developer to build their algorithm based on its data. Using
at detecting social events that occurred during May 2009           its public API, we found 68 events that took place in the
from a dataset composed of images shared on Flickr [2]. The        Paradiso and 3 events in Parc del Forum in May 2009.
strategy we investigate is to find the event instances that
occurred during this period of time and then try to match          2.2      Event Identification Model
these event instances with photos from the Flickr dataset.            With the prior knowledge of scheduled events description,
We also study how to employ the visual features and “owner”        the event detection task changes to a matching problem
metadata from the photos to improve the performance. We            where a model can be used to measure the relationship be-
first detail our approach (Section 2) before presenting and        tween events and photos. Here, we consider events as some-
discussing our results (Section 3). Finally, we conclude the       thing happening in some place during sometime. Therefore,
paper in Section 4.                                                the title, time and location are three key factors that iden-
                                                                   tify an event. The corresponding photo metadata are text
2.    APPROACH DESCRIPTION                                         description, taken time and place. Since these three factors
  The challenge of the social event detection task is to find      are independent, we can measure the probability of a given
the photo clusters that are relevant to events held on a given     photo P to be relevant to an event E by
location during a particular period of time. We tackle this            p(P |E) = p(P.text|E.title)p(P.time|E.time)p(P.geo|E.geo) (1)
problem in two steps: first, we attempt to retrieve all of
                                                                   where: The first item measures the similarity of a photo text
the events that occurred at a given place and time; second,
                                                                   description with an event title. Since both of them are short
we use the extracted information about these events and at-
                                                                   and sparse, the most straightforward way to measure them
tempt to match them to the photos metadata in the dataset.
                                                                   is:
All of the photos that are matched to the same event can
be grouped in one cluster. Besides these two main steps, we                                         |T ext1 ∩ T ext2|
                                                                               p(T ext1|T ext2) =                            (2)
also improve the detection results with visual feature and                                               |T ext2|
“owner” metadata.                                                  Where the function | · | is the total number of words in a
                                                                   text vector.
2.1    Prior knowledge acquisition                                   The second item in Equation 1 measures the difference
   We known that it is easier and more accurate for the com-       between photo taken time and event held time. Here, we
puter to identify specific pattern compared with abstract          measure the difference using the Dirac function.
concept. To find concert or soccer events that may be hid-
den in the dataset, we first look for all instances of these two                                    date(T ime2 − T ime1)
                                                                            p(T ime1|T ime2) = δ(                         )     (3)
kinds of events held in a given place and time.                                                              N
                                                                   1
   Soccer games and concerts are types of favorite activi-           http://www.fbleague.com
                                                                   2
ties in people’s daily life and one can find substantial infor-      http://www.www.fifa.com
                                                                   3
                                                                     http://www.www.uefa.com
                                                                   4
                                                                     http://www.last.fm
                                                                   5
Copyright is held by the author/owner(s).                            http://www.eventful.com
                                                                   6
MediaEval 2010 Workshop, September 1-2, 2011, Pisa, Italy            http://upcoming.yahoo.com
Where the function date(·) calculates the number of days                • run3 the parameter N in Equation 3 is set to 3 to
for a time span, δ is the Dirac delta function that takes the             reduce the impact from erroneous taken time, and the
value 1 when and only when the input parameter is zero,                   basic Event Identification Model is run.
and N is used for scaling (its value will be discussed in the
Section 3).                                                             • run4 Owner Refinement is performed on the results
   The third item in Equation 1 measures the distance be-                 of run3.
tween photo geo tags and event location. The best distance              • run5 Visual Pruning and Owner Refinement are per-
measure to use seems the L2 distance between the two loca-                formed on the results of run3.
tions. However, an important amount of photos do not have
geo tags and when provided, GPS data in the Flickr dataset         A summary of the results is detailed in the Table 1. As
can be inaccurate. Consequently, we just use the city/venue
name to measure the location feature and we use the textual
metric formalized in the Equation 2.                                          Table 1: Event Detection Results
   This method finds many photos with a clear description                          Results                   Evaluation
and association to events. However, text-based matching              Run       Events    Photos    P(%)     R(%)     F(%)      NMI
brings also noise and it can not deal with photos without           run 1.1          2       216    97,69    41,21   57,97   0,2420
any text description. We employ visual features to remove           run 1.2          2       222    97,75    42,38   59,13   0,2472
the noisy photos and “owner” metadata to find out relevant          run 2.1         18      1133    70,79    48,90   57,84   0,4516
photos without text description.                                    run 2.2         18      1172    71,13    50,49   59,06   0,4697
                                                                    run 2.3         24      1502    70,51    64,57   67,41   0,5987
2.3      Visual Pruning                                             run 2.4         24      1556    70,99    67,01   68,95   0,6171
   Visual pruning is employed to remove the noisy photos            run 2.5         24      1546    71,00    66,59   68,72   0,6139
from the results of the Event Identification Model [1]. We
assume that the photos that are corresponding to the same
                                                                   shown in the Table 1, 2 events are found for challenge 1 with
event should be similar visually. The method used here
                                                                   216 photos identified by the Event Identification Model. 6
is quite straightforward. Given a set of the photo feature
                                                                   additional photos are found by the “Owner Refinement” ap-
{fi , i ∈ [1, N ]}, the distance between each feature fi and its
                                                                   proach. For the challenge 2, there are mainly two groups
mean vector m is measure by the L1 distance.
                                                                   of runs. The first group (run1,run2) used the parameter
                     di = sum(|fi − m|)                     (4)    N=1, and 18 events are found from the 69 events set previ-
                                                                   ous obtained. In the second group (run3, run4, run5), 24
Photos are then sorted according to the distance di . The          events are found with the parameter N=3. In general, the
bigger the distance and the less similar the photo is with         results for the challenge 1 are just average since only 6 foot-
the photo cluster, so we prune the photos with such a large        ball games were found as prior knowledge and we suppose
distance. Experimentally, we remove the 5% photos that are         that several other games have been missed. For the chal-
far from the center in the visual feature space.                   lenge 2, the results are more promising and competitive.
2.4      Owner Refinement
                                                                   4.    CONCLUSION
  Owner refinement is another way to improve the detection
results [1]. We assume that a person can not attend more              In this paper, we propose a framework to detect social
than one event simultaneously. Therefore, all the photos           events within a media dataset. In our approach, the events
that have been taken by the same owner during the event            instances are retrieved first as prior knowledge, and then, an
duration should be assigned to the same cluster. Using this        Event Identification Model is used to measure the similarity
heuristic, it is possible to retrieve photos which do not have     of event and photos. In the solution, multi-modality feature
any textual description.                                           such as text, time, visual feature and “owner” metadata are
                                                                   used.

3.      EXPERIMENTS AND RESULTS                                    Acknowledgments
  Based on the proposed approach and the events instances
                                                                   This work is supported by the project AAL-2009-2-049 “Adapt-
obtained previously, we design our runs as follows:
                                                                   able Ambient Living Assistant” (ALIAS) co-funded by the
  Challenge 1 :
                                                                   European Commission and the French Research Agency (ANR)
      • run1 The parameter N in Equation 3 is set to 3, and        in the Ambient Assisted Living (AAL) programme.
        the basic Event Identification Model is run.
                                                                   5.    REFERENCES
      • run2 Owner Refinement is performed on the results          [1] X. Liu, R. Troncy, and B. Huet. Finding Media
        of run1.                                                       Illustrating Events. In 1st ACM International
                                                                       Conference on Multimedia Retrieval (ICMR’11),
     Challenge 2 :
                                                                       Trento, Italy, 2011.
      • run1 the parameter N in Equation 3 is set to 1, and        [2] S. Papadopoulos, R. Troncy, V. Mezaris, B. Huet, and
        the basic Event Identification Model is run.                   I. Kompatsiaris. Social Event Detection at MediaEval
                                                                       2011: Challenges, Dataset and Evaluation. In
      • run2 Owner Refinement is performed on the results              MediaEval 2011 Workshop, Pisa, Italy, September 1-2
        of run1.                                                       2011.

</pre>