<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Sports Video Classification: Classification of Strokes in Table Tennis for MediaEval 2020</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution" key="instit1">Univ. Bordeaux</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">Bordeaux INP</orgName>
								<orgName type="institution" key="instit4">LaBRI</orgName>
								<address>
									<settlement>Talence</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution" key="instit1">Univ. Bordeaux</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">Bordeaux INP</orgName>
								<orgName type="institution" key="instit4">LaBRI</orgName>
								<address>
									<settlement>Talence</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Boris</forename><surname>Mansencal</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution" key="instit1">Univ. Bordeaux</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">Bordeaux INP</orgName>
								<orgName type="institution" key="instit4">LaBRI</orgName>
								<address>
									<settlement>Talence</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
							<affiliation key="aff1">
								<orgName type="laboratory">MIA</orgName>
								<orgName type="institution">La Rochelle University</orgName>
								<address>
									<settlement>La Rochelle</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Laurent</forename><surname>Mascarilla</surname></persName>
							<affiliation key="aff1">
								<orgName type="laboratory">MIA</orgName>
								<orgName type="institution">La Rochelle University</orgName>
								<address>
									<settlement>La Rochelle</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jordan</forename><surname>Calandre</surname></persName>
							<affiliation key="aff1">
								<orgName type="laboratory">MIA</orgName>
								<orgName type="institution">La Rochelle University</orgName>
								<address>
									<settlement>La Rochelle</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">IMS</orgName>
								<orgName type="institution">University of Bordeaux</orgName>
								<address>
									<settlement>Talence</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Sports Video Classification: Classification of Strokes in Table Tennis for MediaEval 2020</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">BCCDD486319A879B052AB3F2A6775794</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:12+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Fine grained action classification has raised new challenges compared to classical action classification problem. Sport video analysis is a very popular research topic, due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests, up to analysis of athletes' performances. Running since 2019 as a part of MediaEval, we offer a task which consists in classifying table tennis strokes from videos recorded in natural conditions at the University of Bordeaux. The aim is to build tools for teachers, coaches and players to analyse table tennis games. Such tools could lead to an automatic profiling of the player and the training session could then be adapted for improving sports skills more efficiently.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>Action detection and classification is one of the main challenges in visual content analysis and mining <ref type="bibr" target="#b25">[26]</ref>. Over the last few years, the number of datasets for action classification has drastically increased in terms of video content, resolution, localization and number of classes. However the latest research shows that classification performed using deep neural networks often focuses on the whole scene and the background and not on the action itself.</p><p>Sport video analysis has been a very popular research topic, due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests, up to analysis of athletes' performance <ref type="bibr" target="#b4">[5]</ref>. The Sport Video Classification project was initiated by the Faculty of Sports (STAPS) and the computer science laboratories (LaBRI) of the University of Bordeaux, and by the MIA laboratory of La Rochelle University 1 . The goal of this project is to develop artificial intelligence and multimedia indexing methods for the recognition of table tennis sport activities. The ultimate goal is to evaluate the performance of athletes, with a particular focus on students, in order to develop optimal training strategies. To that aim, a video corpus named TTStroke-21 was recorded with volunteer players. These data are of great scientific interest for the Multimedia community participating in the MediaEval campaign.</p><p>Several datasets such as UCF-101 <ref type="bibr" target="#b23">[24]</ref>, HMDB <ref type="bibr" target="#b9">[10]</ref> and AVA <ref type="bibr" target="#b6">[7]</ref> have been used for many years as benchmarks for action classification methods. In <ref type="bibr" target="#b14">[15]</ref>, spatio-temporal dependencies are learned from the video using only RGB images for classification. This method is promising but its scores are still below multi-modal methods such I3D <ref type="bibr" target="#b3">[4]</ref>. More recently, datasets have been enriched, like JH-MDB <ref type="bibr" target="#b7">[8]</ref> and Kinetics <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3,</ref><ref type="bibr" target="#b8">9]</ref> or fused like AVA_Kinetics <ref type="bibr" target="#b11">[12]</ref>. Some also focus on the intra-class dissimilarity such as the Something-Something dataset. Others, such as the Olympic Sports dataset <ref type="bibr" target="#b21">[22]</ref>, focus on sport actions only. However those datasets are not dedicated to a specific sport and its associated rules. Few datasets focus on fine-grained classification. We can cite FineGym <ref type="bibr" target="#b22">[23]</ref>, introduced recently, which focuses on gymnastic videos, and our TTStroke21 <ref type="bibr" target="#b20">[21]</ref> comprising table tennis strokes.</p><p>TTStroke-21 is manually annotated by professional players or teachers of table tennis, making the annotation process more time consuming, but more temporally and qualitatively accurate. Classification methods such as I3D model <ref type="bibr" target="#b3">[4]</ref> or LTC model <ref type="bibr" target="#b27">[28]</ref> performing well on UCF-101 dataset inspired the work done in <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b20">21]</ref> which introduces a TSTCNN -Twin Spatio Temporal Convolutional Neural Network. Here, the video stream and derived computed optical flow are passed through the branches of the TSTCNN. In <ref type="bibr" target="#b18">[19]</ref> the normalization of the flow is also investigated to improve the classification score while in <ref type="bibr" target="#b19">[20]</ref> an attention block is introduced to improve the performances and speed of convergence. The inter-similarity of actions -strokes -in TTStroke-21 makes the classification task challenging and the multi-modal method seemed to improve performances. To better understand learned features and classification process taking place in the TSTCNN, we also developed a new visualization technique <ref type="bibr" target="#b5">[6]</ref>.</p><p>Recent work focusing on table tennis <ref type="bibr" target="#b29">[30]</ref> tries to get the tactics of the players based on their performance during matches using a Markov chain model. In <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b26">27,</ref><ref type="bibr" target="#b31">32]</ref> stroke recognition is performed using sensors. In <ref type="bibr" target="#b28">[29]</ref> segmentation of the player, ball coordinates, event detection is explored while <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b30">31]</ref> focus solely on the trajectory of the ball.</p><p>In this task overview paper, in section 2, we introduce the specific conditions of usage of this data, then describe TTStroke-21 and the task respectively in sections 3 and 4. The evaluation method is explained in section 5. Supplementary notes are shared in section 6. More information can be found on the dedicated GitHub web page<ref type="foot" target="#foot_0">2</ref> .</p><p>P-e Martin et al.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">DATASET DESCRIPTION</head><p>In the MediaEval 2020 campaign, we release the same subset of the TTStroke-21 dataset than last year. The only difference is the blurring of the faces and the specification if the player is right-handed or left-handed. The dataset has been recorded in a sport faculty facility using a light-weight equipment, such as GoPro cameras.  <ref type="bibr" target="#b15">[16]</ref>, most of the faces are blurred. To do so, faces are detected with OpenCV deep learning face detector, based on the Single Shot Detector (SSD) framework with a ResNet base network, for each frame of the original video. The detected face is blurred and frames are re-encoded in a video.</p><p>The organisation of the delivered data is as follows:</p><p>• The provided dataset is split into two subsets: i) training set and ii) test set; • In each directory, there are several videos (in MPEG-4 format) and each video may contain several actions; • Each video file is provided with a XML file describing the actions present in the video and if the player is right-handed or left-handed; • Each action has 3 attributes: the starting frame, the ending frame, and the stroke class; • In the train set XML files, all the attributes are specified.</p><p>In the test set XML files, only the starting and ending frames are specified. The stroke class attribute is purposely set to value: "Unknown", and should be updated by the participants to one of the 20 valid classes.</p><p>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">TASK DESCRIPTION</head><p>The Sport Video Annotation task consists, for each action of each test video, in assigning a label using a given taxonomy of 20 classes of table tennis strokes. Participants may submit up to five runs. For each run, they must provide one XML file per video file containing, with the actions associated with the recognised stroke class. Runs may be submitted as an archive (zip or tar.gz file) with each run in a different directory. Participants should also indicate if any external data, such as other dataset or pretrained networks, was used to compute their runs. The task is considered fully automatic. Once the video are provided to the system, results should be produced without any human intervention.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">EVALUATION</head><p>For MediaEval 2020, we propose a light-weight classification task. It consists in classification of table tennis strokes which temporal borders are supplied in the XML files accompanying each video file. Hence for each test video the participants are invited to produce an XML file in which each stroke is labelled accordingly to the given taxonomy. This means that the default label "unknown" has to be replaced by the label of the stroke class that the participant's system has assigned. All submissions will be evaluated in terms of per-class accuracy (𝐴 𝑖 ) and of global accuracy (𝐺𝐴).</p><p>The organizers will also provide to the participants different confusion matrices: one considering all the classes, and others considering the type of the stroke such as: offensive, defensive and defensive and/or using forehand and backhand superclasses of the strokes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">DISCUSSION</head><p>The participants from last years have reached a maximum accuracy of 22.9% <ref type="bibr" target="#b16">[17]</ref>, 14.1% <ref type="bibr" target="#b0">[1]</ref> and 11.3% <ref type="bibr" target="#b24">[25]</ref> leaving room for improvement. Participants are welcome to share their difficulties and their results even if they seem not sufficiently good.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>a. Video acquisition b. Annotation platform</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: TTStroke-21 acquisition process</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://multimediaeval.github.io/2020-Sports-Video-Classification-Task/</note>
		</body>
		<back>

			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"> <ref type="bibr" target="#b0">1</ref> <p>This work was supported by the New Aquitania Region through CRISP project -ComputeR vIsion for Sport Performance and the MIRES federation.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Optical Flow Singularities for Sports Video Annotation: Detection of Strokes in Table Tennis</title>
		<author>
			<persName><forename type="first">Jordan</forename><surname>Calandre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laurent</forename><surname>Mascarilla</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note>11</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">A Short Note about Kinetics-600</title>
		<author>
			<persName><forename type="first">João</forename><surname>Carreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eric</forename><surname>Noland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andras</forename><surname>Banki-Horvath</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chloe</forename><surname>Hillier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrew</forename><surname>Zisserman</surname></persName>
		</author>
		<idno>CoRR abs/1808.01340</idno>
		<imprint>
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">A Short Note on the Kinetics-700 Human Action Dataset</title>
		<author>
			<persName><forename type="first">João</forename><surname>Carreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eric</forename><surname>Noland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chloe</forename><surname>Hillier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrew</forename><surname>Zisserman</surname></persName>
		</author>
		<idno>CoRR abs/1907.06987</idno>
		<imprint>
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset</title>
		<author>
			<persName><forename type="first">João</forename><surname>Carreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrew</forename><surname>Zisserman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
			<biblScope unit="page" from="4724" to="4733" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Activity-Conditioned Continuous Human Pose Estimation for Performance Analysis of Athletes Using the Example of Swimming</title>
		<author>
			<persName><forename type="first">Moritz</forename><surname>Einfalt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dan</forename><surname>Zecha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rainer</forename><surname>Lienhart</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE WACV 2018</title>
				<meeting><address><addrLine>Lake Tahoe, NV, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018-03-12">2018. March 12-15, 2018</date>
			<biblScope unit="page" from="446" to="455" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Feature Understanding in 3D CNNs for Actions Recognition in Video</title>
		<author>
			<persName><forename type="first">Kazi</forename><surname>Ahmed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Asif</forename><surname>Fuad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Romai</forename><surname>Giot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Romain</forename><surname>Bourqui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Akka</forename><surname>Zemmari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Tenth International Conference on Image Processing Theory, Tools and Applications, IPTA 2020</title>
				<meeting><address><addrLine>Paris, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020-11-09">2020. November 9-12, 2020</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions</title>
		<author>
			<persName><forename type="first">Chunhui</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chen</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><forename type="middle">A</forename><surname>Ross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carl</forename><surname>Vondrick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Caroline</forename><surname>Pantofaru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yeqing</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sudheendra</forename><surname>Vijayanarasimhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">George</forename><surname>Toderici</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Susanna</forename><surname>Ricco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rahul</forename><surname>Sukthankar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cordelia</forename><surname>Schmid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jitendra</forename><surname>Malik</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018. 2018</date>
			<biblScope unit="page" from="6047" to="6056" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Towards Understanding Action Recognition</title>
		<author>
			<persName><forename type="first">Hueihan</forename><surname>Jhuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Juergen</forename><surname>Gall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Silvia</forename><surname>Zuffi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cordelia</forename><surname>Schmid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><forename type="middle">J</forename><surname>Black</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE ICCV 2013</title>
				<meeting><address><addrLine>Sydney, Australia</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2013-12-01">2013. December 1-8, 2013</date>
			<biblScope unit="page" from="3192" to="3199" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">The Kinetics Human Action Video Dataset</title>
		<author>
			<persName><forename type="first">Will</forename><surname>Kay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">João</forename><surname>Carreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Karen</forename><surname>Simonyan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Brian</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chloe</forename><surname>Hillier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sudheendra</forename><surname>Vijayanarasimhan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fabio</forename><surname>Viola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tim</forename><surname>Green</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Trevor</forename><surname>Back</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paul</forename><surname>Natsev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mustafa</forename><surname>Suleyman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrew</forename><surname>Zisserman</surname></persName>
		</author>
		<idno>CoRR abs/1705.06950</idno>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">HMDB: A large video database for human motion recognition</title>
		<author>
			<persName><forename type="first">Hildegard</forename><surname>Kuehne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hueihan</forename><surname>Jhuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Estíbaliz</forename><surname>Garrote</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tomaso</forename><forename type="middle">A</forename><surname>Poggio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thomas</forename><surname>Serre</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE ICCV 2011</title>
				<editor>
			<persName><forename type="first">Dimitris</forename><forename type="middle">N</forename><surname>Metaxas</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Long</forename><surname>Quan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Alberto</forename><surname>Sanfeliu</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Luc</forename><surname>Van Gool</surname></persName>
		</editor>
		<meeting><address><addrLine>Barcelona, Spain</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2011-11-06">2011. November 6-13, 2011</date>
			<biblScope unit="page" from="2556" to="2563" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Mihai</title>
		<author>
			<persName><forename type="first">Martha</forename><forename type="middle">A</forename><surname>Larson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Steven</forename><forename type="middle">Alexander</forename><surname>Hicks</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Working Notes Proceedings of the MediaEval 2019 Workshop</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">Gabriel</forename><surname>Constantin</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Benjamin</forename><surname>Bischke</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Alastair</forename><surname>Porter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Peijian</forename><surname>Zhao</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Mathias</forename><surname>Lux</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Laura</forename><surname>Cabrera Quiros</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Jordan</forename><surname>Calandre</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Gareth</forename><surname>Jones</surname></persName>
		</editor>
		<meeting><address><addrLine>Sophia Antipolis, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-10-30">2020. 27-30 October 2019</date>
			<biblScope unit="volume">2670</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">The AVA-Kinetics Localized Human Actions Video Dataset</title>
		<author>
			<persName><forename type="first">Ang</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Meghana</forename><surname>Thotakuri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><forename type="middle">A</forename><surname>Ross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">João</forename><surname>Carreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alexander</forename><surname>Vostrikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrew</forename><surname>Zisserman</surname></persName>
		</author>
		<idno>CoRR abs/2005.00214</idno>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Ball Tracking and Trajectory Prediction for Table-Tennis Robots</title>
		<author>
			<persName><forename type="first">-I</forename><surname>Hsien</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhangguo</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yi-Chen</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><surname>Huang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Sensors</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page">2</biblScope>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Table Tennis Stroke Recognition Based on Body Sensor Network</title>
		<author>
			<persName><forename type="first">Ruichen</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhelong</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xin</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hongyu</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sen</forename><surname>Qiu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jie</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ning</forename><surname>Yang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IDCS 2019</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">Raffaele</forename><surname>Montella</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Angelo</forename><surname>Ciaramella</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Giancarlo</forename><surname>Fortino</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Antonio</forename><surname>Guerrieri</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Antonio</forename><surname>Liotta</surname></persName>
		</editor>
		<meeting><address><addrLine>Naples, Italy</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2019-10-10">2019. October 10-12, 2019</date>
			<biblScope unit="volume">11874</biblScope>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Spatiotemporal Relation Networks for Video Action Recognition</title>
		<author>
			<persName><forename type="first">Zheng</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Haifeng</forename><surname>Hu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="14969" to="14976" />
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Sports Video Annotation: Detection of Strokes in Table Tennis Task for MediaEval</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Boris</forename><surname>Mansencal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Laurent</forename><surname>Mascarilla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jordan</forename><surname>Calandre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Siamese Spatio-Temporal Convolutional Neural Network for Stroke Classification in Table Tennis Games</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Boris</forename><surname>Mansencal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CBMI 2018</title>
				<meeting><address><addrLine>La Rochelle, France</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2018-09-04">2018. September 4-6, 2018</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Optimal Choice of Motion Estimation Methods for Fine-Grained Action Classification with 3D Convolutional Networks</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE ICIP 2019</title>
				<meeting><address><addrLine>Taipei, Taiwan</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2019-09-22">2019. September 22-25, 2019</date>
			<biblScope unit="page" from="554" to="558" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">3D attention mechanisms in Twin Spatio-Temporal Convolutional Neural Networks</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Application to action classification in videos of table tennis games</title>
				<meeting><address><addrLine>, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020. 10-15 January 2021</date>
		</imprint>
	</monogr>
	<note>2ICPR2020 -MiCo Milano Congress Center</note>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Fine grained sport action recognition with Twin spatiotemporal convolutional neural networks</title>
		<author>
			<persName><forename type="first">Pierre-Etienne</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renaud</forename><surname>Péteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julien</forename><surname>Morlier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Multim. Tools Appl</title>
		<imprint>
			<biblScope unit="volume">79</biblScope>
			<biblScope unit="page" from="20429" to="20447" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification</title>
		<author>
			<persName><forename type="first">Juan</forename><surname>Carlos Niebles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chih-Wei</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fei-Fei</forename><surname>Li</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Computer Vision -ECCV 2010</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">Kostas</forename><surname>Daniilidis</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Petros</forename><surname>Maragos</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Nikos</forename><surname>Paragios</surname></persName>
		</editor>
		<meeting><address><addrLine>Heraklion, Crete, Greece</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010-09-05">2010. September 5-11, 2010</date>
			<biblScope unit="volume">6312</biblScope>
			<biblScope unit="page" from="392" to="405" />
		</imprint>
	</monogr>
	<note>Proceedings, Part II</note>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding</title>
		<author>
			<persName><forename type="first">Dian</forename><surname>Shao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yue</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bo</forename><surname>Dai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dahua</forename><surname>Lin</surname></persName>
		</author>
		<idno>CoRR abs/2004.06704</idno>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild</title>
		<author>
			<persName><forename type="first">Khurram</forename><surname>Soomro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mubarak</forename><surname>Amir Roshan Zamir</surname></persName>
		</author>
		<author>
			<persName><surname>Shah</surname></persName>
		</author>
		<idno>CoRR abs/1212.0402</idno>
		<imprint>
			<date type="published" when="2012">2012. 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<author>
			<persName><forename type="first">Siddharth</forename><surname>Sriraman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Srinath</forename><surname>Srinivasan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Vishnu</surname></persName>
		</author>
		<author>
			<persName><surname>Krishnan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bhuvana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">T</forename><surname>Mirnalinee</surname></persName>
		</author>
		<title level="m">MediaEval 2019: LRCNs for Stroke Detection in Table Tennis</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Fast Action Localization in Large-Scale Video Archives</title>
		<author>
			<persName><forename type="first">Andrei</forename><surname>Stoian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marin</forename><surname>Ferecatu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jenny</forename><surname>Benois-Pineau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michel</forename><surname>Crucianu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Circuits Syst. Video Techn</title>
		<imprint>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="1917" to="1930" />
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Comparative Study of Table Tennis Forehand Strokes Classification Using Deep Learning and SVM</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Tabrizi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pashazadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Javani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Sensors Journal</title>
		<imprint>
			<biblScope unit="page" from="1" to="1" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Long-Term Temporal Convolutions for Action Recognition</title>
		<author>
			<persName><forename type="first">Gül</forename><surname>Varol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ivan</forename><surname>Laptev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Cordelia</forename><surname>Schmid</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1510" to="1517" />
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<title level="m" type="main">TTNet: Real-time temporal and spatial video analysis of table tennis</title>
		<author>
			<persName><forename type="first">Roman</forename><surname>Voeikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nikolay</forename><surname>Falaleev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ruslan</forename><surname>Baikulov</surname></persName>
		</author>
		<idno>CoRR abs/2004.09927</idno>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Tac-Simur: Tactic-based Simulative Visual Analytics of Table Tennis</title>
		<author>
			<persName><forename type="first">Jiachen</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kejian</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dazhen</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Anqi</forename><surname>Cao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xiao</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zheng</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hui</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yingcai</forename><surname>Wu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Vis. Comput. Graph</title>
		<imprint>
			<biblScope unit="volume">26</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="407" to="417" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">FuturePong: Real-time Table Tennis Trajectory Forecasting using Pose Prediction Network</title>
		<author>
			<persName><forename type="first">Erwin</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hideki</forename><surname>Koike</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CHI 2020</title>
				<editor>
			<persName><forename type="first">Regina</forename><surname>Bernhaupt</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Florian</forename><forename type="middle">'</forename><surname>Floyd' Mueller</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">David</forename><surname>Verweij</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Josh</forename><surname>Andres</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Joanna</forename><surname>Mcgrenere</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Andy</forename><surname>Cockburn</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Ignacio</forename><surname>Avellino</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Alix</forename><surname>Goguey</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Pernille</forename><surname>Bjøn</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Shengdong</forename><surname>Zhao</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Briane</forename><surname>Paul Samson</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Rafal</forename><surname>Kocielnik</surname></persName>
		</editor>
		<meeting><address><addrLine>Honolulu, HI, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1" to="8" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Racquet Sports Recognition Using a Hybrid Clustering Model Learned from Integrated Wearable Sensor</title>
		<author>
			<persName><forename type="first">Kun</forename><surname>Xia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hanyu</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Menghan</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zheng</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sheng</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yusong</forename><surname>Tang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Sensors</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page">1638</biblScope>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
