<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Scene-Adaptive Optimization Scheme for Depth Sensor Networks</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Johannes</forename><surname>Wetzel</surname></persName>
							<email>johannes.wetzel@hs-karlsruhe.de</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Intelligent Systems Research Group (ISRG)</orgName>
								<orgName type="institution">Karlsruhe University of Applied Sciences</orgName>
								<address>
									<settlement>Karlsruhe</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Samuel</forename><surname>Zeitvogel</surname></persName>
							<email>samuel.zeitvogel@hs-karlsruhe.de</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Intelligent Systems Research Group (ISRG)</orgName>
								<orgName type="institution">Karlsruhe University of Applied Sciences</orgName>
								<address>
									<settlement>Karlsruhe</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Astrid</forename><surname>Laubenheimer</surname></persName>
							<email>astrid.laubenheimer@hs-karlsruhe.de</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Intelligent Systems Research Group (ISRG)</orgName>
								<orgName type="institution">Karlsruhe University of Applied Sciences</orgName>
								<address>
									<settlement>Karlsruhe</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Michael</forename><surname>Heizmann</surname></persName>
							<email>michael.heizmann@kit.edu</email>
							<affiliation key="aff1">
								<orgName type="department">Institute of Industrial Information Technology (IIIT)</orgName>
								<orgName type="institution">Karlsruhe Institute of Technology (KIT)</orgName>
								<address>
									<settlement>Karlsruhe</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Scene-Adaptive Optimization Scheme for Depth Sensor Networks</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">C2F90469ADF3146E9D77A8A6526418A0</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T02:41+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>depth sensor networks</term>
					<term>context aware</term>
					<term>knowledge based optimization</term>
					<term>scene-adaptive</term>
					<term>optimization</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this work a scheme for scene-adaptive depth sensor network optimization is presented. We propose to fuse the knowledge inferred by the sensor network into a common world model while at the same time exploiting this knowledge to improve the perception and post processing algorithms themselves. Moreover, we show how our optimization scheme can be applied to improve the use cases of disparity estimation as well as people detection with multiple depth sensors.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Low cost commodity depth sensors are an emerging technology and are applied to a broad field of applications such as people detection and tracking, 3D reconstruction or emergency detection in an ambient assisted living context. However, depth sensor networks as well as modern vision algorithms have many parameters and require fine-tuned, scene-specific configurations to achieve optimal performance. Due to strongly varying scenes and changing conditions at run time it is very challenging to fine-tune those parameters manually in real world applications. To overcome the problem of scene-specific manual (re)configuration of depth sensor networks, we propose a scene-adaptive scheme which exploits the scene knowledge to improve perception and post processing vision algorithms. Our objective is not only to tune the given parameters but also to improve the vision algorithms, such as stereo block matching, detection or tracking by explicit exploitation of the scene knowledge, e.g. by building scene-specific object models. Therefore, we fuse the knowledge inferred from the sensor network into a common world model, representing our current context knowledge. This knowledge is then fed back to optimize sensor parameters and algorithms to improve the performance of a sensor network at run time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related work</head><p>The configuration of video sensor networks in the context of video surveillance has been widely studied in the literature. In <ref type="bibr" target="#b12">[13]</ref> a general overview of the different aspects of sensor network reconfiguration is given. Rinner et al. <ref type="bibr" target="#b11">[12]</ref> focus on the aspect of configuration of smart camera networks in the context of video surveillance. They review the configuration for a specific analysis task and evaluate different configuration methods. In <ref type="bibr" target="#b7">[8]</ref> a flexible uncertainty model is presented to reconfigure the sensor network with the objective to optimize the detection performance. Fischer et al. <ref type="bibr" target="#b3">[4]</ref> give an overview of intelligent surveillance systems, analyzing the information flow between sensors, world model and inference algorithms. In <ref type="bibr" target="#b13">[14]</ref> an overview to visual sensor networks is given. However, prior work focuses on monocular camera networks and employs parameter reconfiguration. In contrast, our work deals with depth sensor networks and proposes a scheme for explicit exploitation of the given scene knowledge. This includes conventional parameter reconfiguration methods as well as methods that construct and use sophisticated world models to improve the integrated algorithms of sensor networks at run time.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Scene-adaptive sensor network optimization</head><p>In this section we present a scheme for scene-adaptive sensor network optimization. The general information flow in a depth sensor network is depicted in Fig. <ref type="figure" target="#fig_0">1</ref>   While a world model can be used to do e.g. scene and situation analysis, we use it to optimize the parameters of each individual sensor online and support the data analysis methods e.g. by building scene-specific object models gradually.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Knowledge representation</head><p>The employed knowledge representation within the world model has to be expressive to solve the high-level task of the sensor network and the optimization of the sensor network itself. The fusion layer might provide sensor data as well as locally derived high-level knowledge and the world model therefore might need to cover low-level data up to high level information. Taking these aspects into account, several existing approaches for knowledge representations are qualified to serve as world model. For most tasks and networks, a world model consisting of geometric and semantic scene descriptions will be suitable. Geometric scene knowledge thereby encompasses information about the objects contained in the scene and their properties. This includes the object class (e.g. humans, furnature, floor plan), the object location and orientation in a global world coordinate system, dynamic properties e.g. a motion model, shape, material. Examples for such a world model are object oriented world models <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b4">5]</ref>. In order to enhance the quality of the world model, a knowledge base consisting of preprocessed information or prior knowledge can be used. This includes morphable shape models <ref type="bibr" target="#b2">[3]</ref> for different object classes as well as common recognition, detection and segmentation models <ref type="bibr" target="#b17">[18]</ref> which are applied on image and 3D data, e.g. RGB-D data, point clouds, voxels or triangulated surfaces <ref type="bibr" target="#b0">[1]</ref>. In terms of semantic knowledge Fuzzy Metric Temporal Logic and Situation Graph Trees <ref type="bibr" target="#b10">[11]</ref> or ontologies <ref type="bibr" target="#b9">[10]</ref> can be incorporated. The semantic description might be data driven, e.g. Hartz and Neumann <ref type="bibr" target="#b5">[6]</ref> use a scene interpretation system <ref type="bibr" target="#b6">[7]</ref> and learn ontological concept descriptions from data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Optimization possibilities</head><p>Depth sensor networks involve multiple algorithms which leads to a large amount of parameters. In this section we give an overview of parameters and methods which are suitable for automatic scene-adaptive sensor optimization. We assume that a suitable knowledge base (see section 3.1) exists and focus on algorithm and parameter optimization. Following our layered scheme, we categorize the optimization targets into three major categories, see Fig. <ref type="figure" target="#fig_1">2</ref>. Sensing parameters have a direct impact on the measurement quality. Parts of this category have already been addressed. Auto exposure is state-of-the-art for decades in consumer cameras, but sophisticated scene models <ref type="bibr" target="#b16">[17]</ref> can improve the result </p><formula xml:id="formula_0">• exposure • sensitivity • resolution • frame rate • sensor pos. • emitter strength • synchronization • calibration • … • gate size • assignment distance • dynamics model • flow field • … • discriminativ classifier • generativ object model • region of interest • … • multifrequency face unwrapping • … • minimum disparity • maximum disparity • …</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>e.g. by taking only the pixel intensities near regions of interest into account.</head><p>Sensor data post processing methods vary highly between different depth sensing technologies. The depth estimation of a stereo sensor can be improved by setting the minimum and maximum observable disparity based on geometric scene knowledge. In section 4.1 a approach for the task of scene-adaptive disparity estimation is presented with an exemplary knowledge representation. Many scene-adapative local data analysis methods have already been published. Yang et al. <ref type="bibr" target="#b15">[16]</ref> learn global appearance and motion models to improve multiple target tracking. Masksai et al. <ref type="bibr" target="#b8">[9]</ref> propose a context-aware optimization strategy for multi object tracking. They learn the most likely trajectory patterns with respect to a given scene layout to reduce incorrect assignments between detections and tracks. In 4.2 we show how the task of people detection can be optimized in a scene-specific fashion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Application</head><p>In this section we show the applicability of our scheme on two exemplary use cases.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">3D model based disparity estimation</head><p>Our knowledge representation contains sensor knowledge in the form of a camera model and existing camera calibration parameters π, scene geometry using a ground plane assumption P (h) ⊂ R 3 and a 3D morphable human surface model parameterized by β. Scene semantics are represented as segmentations of a single human s h and the ground plane s g in the image. Let D π (u) be a depth image computed using the estimated disparity values u from the image pair (I 1 , I 2 ). Classical stereo algorithms estimate the disparity values u minimizing a cost function</p><formula xml:id="formula_1">E(u) = E photometric (u; I 1 , I 2 ) + E reg (u) ,<label>(1)</label></formula><p>where E photometric is the photometric error penalizing intensity deviation in the local neighborhood given u and E reg regularizes the problem penalizing unlikely disparity values based on simple scene assumptions. We propose to employ a scene-adaptive optimization scheme reformulating (1) with</p><formula xml:id="formula_2">E adaptive (u) = E photometric (u; I 1 , I 2 ) + E model (u[s h ], u[s g ]; β, h) ,<label>(2)</label></formula><p>where E model uses our provided scene representation to measure the deviation from the estimated depth at the segmented pixel locations u[s h ] and u[s g ] to the explicit geometric scene representation consisting of the ground plane at height h and the human shape model parameterized by β. Scene-adaptive disparity estimation is then performed by estimating û = arg min u E adaptive (u).</p><p>Eq.( <ref type="formula" target="#formula_2">2</ref>) can be extended in various ways, which proves the generality of the proposed approach by e.g. introducing a human motion model to enforce temporal consistency constraints.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">People detection with multiple depth sensors</head><p>The sensors have a top view on the scene and a significant overlap to each other. Additionally, we assume that the sensors are intrinsically and extrinsically calibrated in advance and that the common ground plane is known. We model the presence of a person on the ground floor as a discrete grid of Bernoulli random variables X = (x 1 , .., x n ), x i ∈ {0, 1} where each x i maps to one specific ground plane grid location g i ∈ R </p><p>For this application we assume that the likelihood p(O|X) is given (see <ref type="bibr" target="#b14">[15]</ref> for details on the construction of the likelihood) and only focus on the sceneadaptive choice of the prior p(X). We start with an uninformative prior to make the detection of people at every location equally likely. In many real world scenes this is a crude assumption due to obstacles or preferred walking tracks which can be present in the scene. Thus, we propose to accumulate the detections over time to get the relative frequencies H = (h 1 , . . . , h n ) of the presence of people for every ground plane grid location g i and fuse those information into the world model. This scene-specific knowledge can be used in the feedback step to continuously update the prior beliefs p(x i ) accordingly to H on regular time intervals.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion</head><p>In the present work we have proposed a scheme for scene-adaptive optimization of depth sensor networks. We have given an analysis of relevant knowledge representations and categorized identified optimization targets. Moreover, we have</p><p>Internet of Things, Networks and Security exemplarily applied our scheme on the use cases of disparity estimation as well as people detection with multiple depth sensors. Future work will include the investigation of more use cases as well as proof of concept implementations.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Information flow in a depth sensor network with scene-adapative optimization strategy.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig.2. Non-exhaustive taxonomy of building blocks within a depth sensor network which are suitable for scene-specific optimization.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>2 .</head><label>2</label><figDesc>Our goal is to infer the likelihood of a scene configuration X given current depth observations O = (O 1 , . . . , O C ) from C depth sensors. Applying Bayes' theorem and assuming that the prior factorizes as p(X) = n i=1 p(x i ) we get the posterior distribution p(X|O) ∝ p(O|X) n i=1 p(x i ).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>and separated into five different abstraction layers. The sensing layer</figDesc><table><row><cell>sensing</cell><cell>sensor data postprocessing</cell><cell>local 2D and 3D analysis</cell><cell></cell><cell></cell><cell></cell></row><row><cell>sensing</cell><cell>sensor data postprocessing</cell><cell>local 2D and 3D analysis</cell><cell>data and know-ledge fusion</cell><cell>global data analysis</cell><cell>world model</cell></row><row><cell>sensing</cell><cell>sensor data postprocessing</cell><cell>local 2D and 3D analysis</cell><cell></cell><cell></cell><cell>scene and situa-tion analysis</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><surname>Ahmed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Saint</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E R</forename><surname>Shabayek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Cherenkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Das</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Gusev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Aouada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Ottersten</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1808.01462</idno>
		<title level="m">Deep learning advances on different 3d data representations: A survey</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Object-oriented world model for surveillance systems</title>
		<author>
			<persName><forename type="first">A</forename><surname>Bauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Emter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Vagts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Beyerer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Future Security: 4th Security Research Conference</title>
				<imprint>
			<publisher>Fraunhofer Verlag</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="339" to="345" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A morphable model for the synthesis of 3d faces</title>
		<author>
			<persName><forename type="first">V</forename><surname>Blanz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Vetter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 26th annual conference on computer graphics and interactive techniques</title>
				<meeting>the 26th annual conference on computer graphics and interactive techniques</meeting>
		<imprint>
			<publisher>ACM Press/Addison-Wesley Publishing Co</publisher>
			<date type="published" when="1999">1999</date>
			<biblScope unit="page" from="187" to="194" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A top-down-view on intelligent surveillance systems</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Fischer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Beyerer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 7th International Conference on Systems (c)</title>
				<meeting>of the 7th International Conference on Systems (c)</meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="43" to="48" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">World modeling for autonomous systems</title>
		<author>
			<persName><forename type="first">I</forename><surname>Ghet ¸a</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Heizmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Belkin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Beyerer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Artificial Intelligence</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Dillmann</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Beyerer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">U</forename><forename type="middle">D</forename><surname>Hanebeck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Schultz</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010">2010. 2010</date>
			<biblScope unit="page" from="176" to="183" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Learning a knowledge base of ontological concepts for high-level scene interpretation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hartz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Neumann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICMLA</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="436" to="443" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Scene interpretation as a configuration task</title>
		<author>
			<persName><forename type="first">L</forename><surname>Hotz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Neumann</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Optimizing the detection performance of smart camera networks through a probabilistic image-based model</title>
		<author>
			<persName><forename type="first">C</forename><surname>Kyrkou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Christoforou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Timotheou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Theocharides</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Panayiotou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Polycarpou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Circuits and Systems for Video Technology</title>
		<imprint>
			<biblScope unit="volume">8215</biblScope>
			<biblScope unit="issue">c</biblScope>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Non-markovian globally consistent multi-object tracking</title>
		<author>
			<persName><forename type="first">A</forename><surname>Maksai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fleuret</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fua</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE ICCV</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="2544" to="2554" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">The more you know: Using knowledge graphs for image classification</title>
		<author>
			<persName><forename type="first">K</forename><surname>Marino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Salakhutdinov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gupta</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1612.04844</idno>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">High-level situation recognition using fuzzy metric temporal logic, case studies in surveillance and smart environments</title>
		<author>
			<persName><forename type="first">D</forename><surname>Münch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ijsselmuiden</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Arens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Stiefelhagen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICCV Workshops</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="882" to="889" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Resource-aware configuration in smart camera networks</title>
		<author>
			<persName><forename type="first">B</forename><surname>Rinner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Dieber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Esterle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">R</forename><surname>Lewis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Yao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE CVPR</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="58" to="65" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Selfreconfigurable smart camera networks</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Sanmiguel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Micheloni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Shoop</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">L</forename><surname>Foresti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Cavallaro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Computer</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="67" to="73" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A survey of visual sensor networks</title>
		<author>
			<persName><forename type="first">S</forename><surname>Soro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Heinzelman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Multimedia</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Towards global people detection and tracking using multiple depth sensors</title>
		<author>
			<persName><forename type="first">J</forename><surname>Wetzel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zeitvogel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Laubenheimer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Heizmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE ISETC</title>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">An online learned crf model for multi-target tracking</title>
		<author>
			<persName><forename type="first">B</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Nevatia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE CVPR</title>
		<imprint>
			<biblScope unit="page" from="2034" to="2041" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">Personalized attentionaware exposure control using reinforcement learning</title>
		<author>
			<persName><forename type="first">H</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Vesdapunt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Kang</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page" from="1" to="17" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">Object detection with deep learning: A review</title>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">Q</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">T</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wu</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1807.05511</idno>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
