<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">EURECOM @ MediaEval 2011 Social Event Detection Task</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Xueliang</forename><surname>Liu</surname></persName>
							<email>xueliang.liu@eurecom.fr</email>
							<affiliation key="aff0">
								<orgName type="institution">EURECOM</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Benoit</forename><surname>Huet</surname></persName>
							<email>benoit.huet@eurecom.fr</email>
							<affiliation key="aff1">
								<orgName type="institution">EURECOM</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Raphaël</forename><surname>Troncy</surname></persName>
							<email>raphael.troncy@eurecom.fr</email>
							<affiliation key="aff2">
								<orgName type="institution">EURECOM</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">EURECOM @ MediaEval 2011 Social Event Detection Task</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">9F56C114F78FAD85AB91EDAB32E197D9</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T01:28+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper, we present our approach and results of the Me-diaEval 2011 social event detection (SED) task. We solve the event detection problem in three steps. First, we query all event instances that happened given some condition. Then, an event identification model is proposed to measure the relationship between events and photos. Finally, visual pruning and owner refining heuristics are employed to improve the results.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>The Social Event Detection task at MediaEval 2011 aims at detecting social events that occurred during May 2009 from a dataset composed of images shared on Flickr <ref type="bibr" target="#b1">[2]</ref>. The strategy we investigate is to find the event instances that occurred during this period of time and then try to match these event instances with photos from the Flickr dataset. We also study how to employ the visual features and "owner" metadata from the photos to improve the performance. We first detail our approach (Section 2) before presenting and discussing our results (Section 3). Finally, we conclude the paper in Section 4.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">APPROACH DESCRIPTION</head><p>The challenge of the social event detection task is to find the photo clusters that are relevant to events held on a given location during a particular period of time. We tackle this problem in two steps: first, we attempt to retrieve all of the events that occurred at a given place and time; second, we use the extracted information about these events and attempt to match them to the photos metadata in the dataset. All of the photos that are matched to the same event can be grouped in one cluster. Besides these two main steps, we also improve the detection results with visual feature and "owner" metadata.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Prior knowledge acquisition</head><p>We known that it is easier and more accurate for the computer to identify specific pattern compared with abstract concept. To find concert or soccer events that may be hidden in the dataset, we first look for all instances of these two kinds of events held in a given place and time.</p><p>Soccer games and concerts are types of favorite activities in people's daily life and one can find substantial infor-</p><p>Copyright is held by the author/owner(s).</p><p>MediaEval 2010 Workshop, September 1-2, 2011, Pisa, Italy mation online about such scheduled events. For example, FBLeague<ref type="foot" target="#foot_0">1</ref> provides the official football games that registered in FIFA<ref type="foot" target="#foot_1">2</ref> and UEFA<ref type="foot" target="#foot_2">3</ref> . From this web site, we obtained 461 football games that occurred in May 2009, among which 6 took place in Roma and Barcelona. These 6 soccer events are our prior knowledge for the challenge 1.</p><p>For challenge 2, we extract concerts information from event directories such as Last.fm<ref type="foot" target="#foot_3">4</ref> , Eventful<ref type="foot" target="#foot_4">5</ref> , and Upcoming<ref type="foot" target="#foot_5">6</ref> . After manual check, only Last.fm contains descriptions of events held on the given conditions. Last.fm is a popular music web site that records concert events held in more than 190 countries. In addition, Last.fm provides an API for the developer to build their algorithm based on its data. Using its public API, we found 68 events that took place in the Paradiso and 3 events in Parc del Forum in May 2009.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Event Identification Model</head><p>With the prior knowledge of scheduled events description, the event detection task changes to a matching problem where a model can be used to measure the relationship between events and photos. Here, we consider events as something happening in some place during sometime. Therefore, the title, time and location are three key factors that identify an event. The corresponding photo metadata are text description, taken time and place. Since these three factors are independent, we can measure the probability of a given photo P to be relevant to an event E by p(P |E) = p(P.text|E.title)p(P.time|E.time)p(P.geo|E.geo) <ref type="bibr" target="#b0">(1)</ref> where: The first item measures the similarity of a photo text description with an event title. Since both of them are short and sparse, the most straightforward way to measure them is:</p><formula xml:id="formula_0">p(T ext1|T ext2) = |T ext1 ∩ T ext2| |T ext2|<label>(2)</label></formula><p>Where the function | • | is the total number of words in a text vector.</p><p>The second item in Equation 1 measures the difference between photo taken time and event held time. Here, we measure the difference using the Dirac function.</p><formula xml:id="formula_1">p(T ime1|T ime2) = δ( date(T ime2 − T ime1) N )<label>(3)</label></formula><p>Where the function date(•) calculates the number of days for a time span, δ is the Dirac delta function that takes the value 1 when and only when the input parameter is zero, and N is used for scaling (its value will be discussed in the Section 3). The third item in Equation 1 measures the distance between photo geo tags and event location. The best distance measure to use seems the L2 distance between the two locations. However, an important amount of photos do not have geo tags and when provided, GPS data in the Flickr dataset can be inaccurate. Consequently, we just use the city/venue name to measure the location feature and we use the textual metric formalized in the Equation <ref type="formula" target="#formula_0">2</ref>.</p><p>This method finds many photos with a clear description and association to events. However, text-based matching brings also noise and it can not deal with photos without any text description. We employ visual features to remove the noisy photos and "owner" metadata to find out relevant photos without text description.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Visual Pruning</head><p>Visual pruning is employed to remove the noisy photos from the results of the Event Identification Model <ref type="bibr" target="#b0">[1]</ref>. We assume that the photos that are corresponding to the same event should be similar visually. The method used here is quite straightforward. Given a set of the photo feature {fi, i ∈ [1, N ]}, the distance between each feature fi and its mean vector m is measure by the L1 distance.</p><formula xml:id="formula_2">di = sum(|fi − m|)<label>(4)</label></formula><p>Photos are then sorted according to the distance di. The bigger the distance and the less similar the photo is with the photo cluster, so we prune the photos with such a large distance. Experimentally, we remove the 5% photos that are far from the center in the visual feature space.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Owner Refinement</head><p>Owner refinement is another way to improve the detection results <ref type="bibr" target="#b0">[1]</ref>. We assume that a person can not attend more than one event simultaneously. Therefore, all the photos that have been taken by the same owner during the event duration should be assigned to the same cluster. Using this heuristic, it is possible to retrieve photos which do not have any textual description.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">EXPERIMENTS AND RESULTS</head><p>Based on the proposed approach and the events instances obtained previously, we design our runs as follows:</p><p>Challenge 1 :</p><p>• run1 The parameter N in Equation 3 is set to 3, and the basic Event Identification Model is run.</p><p>• run2 Owner Refinement is performed on the results of run1.</p><p>Challenge 2 :</p><p>• run1 the parameter N in Equation 3 is set to 1, and the basic Event Identification Model is run.</p><p>• run2 Owner Refinement is performed on the results of run1.</p><p>• run3 the parameter N in Equation 3 is set to 3 to reduce the impact from erroneous taken time, and the basic Event Identification Model is run.</p><p>• run4 Owner Refinement is performed on the results of run3.</p><p>• run5 Visual Pruning and Owner Refinement are performed on the results of run3.</p><p>A summary of the results is detailed in the Table <ref type="table" target="#tab_0">1</ref>. As For the challenge 2, there are mainly two groups of runs. The first group (run1,run2) used the parameter N=1, and 18 events are found from the 69 events set previous obtained. In the second group (run3, run4, run5), 24 events are found with the parameter N=3. In general, the results for the challenge 1 are just average since only 6 football games were found as prior knowledge and we suppose that several other games have been missed. For the challenge 2, the results are more promising and competitive.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">CONCLUSION</head><p>In this paper, we propose a framework to detect social events within a media dataset. In our approach, the events instances are retrieved first as prior knowledge, and then, an Event Identification Model is used to measure the similarity of event and photos. In the solution, multi-modality feature such as text, time, visual feature and "owner" metadata are used.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Event Detection Results</figDesc><table><row><cell></cell><cell cols="2">Results</cell><cell>Evaluation</cell></row><row><cell>Run</cell><cell cols="3">Events Photos P(%) R(%) F(%)</cell><cell>NMI</cell></row><row><cell>run 1.1</cell><cell>2</cell><cell></cell><cell>216 97,69 41,21 57,97 0,2420</cell></row><row><cell>run 1.2</cell><cell>2</cell><cell></cell><cell>222 97,75 42,38 59,13 0,2472</cell></row><row><cell>run 2.1</cell><cell>18</cell><cell cols="2">1133 70,79 48,90 57,84 0,4516</cell></row><row><cell>run 2.2</cell><cell>18</cell><cell cols="2">1172 71,13 50,49 59,06 0,4697</cell></row><row><cell>run 2.3</cell><cell>24</cell><cell cols="2">1502 70,51 64,57 67,41 0,5987</cell></row><row><cell>run 2.4</cell><cell>24</cell><cell cols="2">1556 70,99 67,01 68,95 0,6171</cell></row><row><cell>run 2.5</cell><cell>24</cell><cell cols="2">1546 71,00 66,59 68,72 0,6139</cell></row><row><cell cols="4">shown in the Table 1, 2 events are found for challenge 1 with</cell></row><row><cell cols="4">216 photos identified by the Event Identification Model. 6</cell></row><row><cell cols="4">additional photos are found by the "Owner Refinement" ap-</cell></row><row><cell>proach.</cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://www.fbleague.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://www.www.fifa.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.www.uefa.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">http://www.last.fm</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">http://www.eventful.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">http://upcoming.yahoo.com</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work is supported by the project AAL-2009-2-049 "Adaptable Ambient Living Assistant" (ALIAS) co-funded by the European Commission and the French Research Agency (ANR) in the Ambient Assisted Living (AAL) programme.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Finding Media Illustrating Events</title>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Troncy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Huet</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">1 st ACM International Conference on Multimedia Retrieval (ICMR&apos;11)</title>
				<meeting><address><addrLine>Trento, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Social Event Detection at MediaEval 2011: Challenges, Dataset and Evaluation</title>
		<author>
			<persName><forename type="first">S</forename><surname>Papadopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Troncy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Mezaris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Huet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Kompatsiaris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">MediaEval 2011 Workshop</title>
				<meeting><address><addrLine>Pisa, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2011-02">September 1-2 2011</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
