<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards an Integrated Solution for IoT Data Management</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Anderson</forename><surname>Chaves</surname></persName>
							<email>achaves@lncc.br</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Supervised by Fabio Porto LNCC</orgName>
								<address>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Towards an Integrated Solution for IoT Data Management</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8F57EA1B9385E674B108F90BD7737106</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T04:53+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The emergence of Big Data and the Internet of Things (IoT) is increasingly affecting all areas of modern society, being characterized by a huge number of data streams that demand real-time processing and analysis. The development of systems to assist on the management of these data streams plays an important role for IoT applications. However, there are numerous challenges that must be taken into account when building an efficient data system for handling large scale, dynamic, semi-structured data such as IoT, and currently existing solutions only partially address the requirements of these scenarios. In this PhD research, we summarize some of the main challenges involved in building an efficient system for IoT data management and analysis, and how different data management approaches such as Actor oriented, Array and Active Databases fit together offering strong contributions to these requirements. We also examine the potential of performing Machine Learning inference and handling Concept Drift in IoT as an integrated database process. Through this work, we lay the structure for the development of a Database Management System to support large scale data stream based analysis capable of combining these different strategies.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>From smart homes control systems to transportation, healthcare and industrial automation, the Internet of Things has been enabling great benefits both for individual and businesses, being used for better decision making, planning and higher productivity <ref type="bibr" target="#b0">[1]</ref>. The main characteristics behind this IoT paradigm is the exploration of different technologies such as communication, embedded systems and data analytics in order to create smart devices for intelligent monitoring, locating, tracking and so forth <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b17">18]</ref>.</p><p>The efficient management of sensor data from IoT devices is essential to perform IoT data analysis. Through Complex Event Processing (CEP) methods, it is possible to detect anomalies and meaningful events from data streams and perform real-time decision making. However, processing and analyzing continuous data streams from heterogeneous networks still leads to a number of different challenges, and requires the development of new techniques and strategies.</p><p>A major challenge in an IoT environment is related to its large scale data flows. Data in IoT can have its sources in a very big range of endpoints that generate masses of data, and is frequently semi-structured or unstructured, conforming it to the Big Data paradigm <ref type="bibr" target="#b8">[9]</ref>. Traditional DBMSs, which need to store and index data before processing it, cannot fulfill the requirements of timeliness and scalability of IoT data streams <ref type="bibr" target="#b9">[10]</ref>. Besides, in order to perform analysis and visualization, existing solutions are often inefficient, because they incur in an incompatibility between the structure of the source data and the analysis tool <ref type="bibr" target="#b6">[7]</ref>. Finally, there are a number of privacy and security issues as well as resource constraints such as memory, bandwidth and energy that must be taken into account when building an IoT data management system.</p><p>Another challenge in IoT is the necessity for on-line processing of data streams as opposed to off-line analysis. Machine learning (ML) is one of the leading strategies to perform reliable, efficient real-time analysis of IoT data in tasks such as predictions or anomalies detection <ref type="bibr" target="#b0">[1]</ref>. However, the lack of integration between the ML application and the data system is often a restraint to performance improvements, since optimizations such as query planning or lazy evaluation are not possible when the two processes are treated as completely isolated tasks <ref type="bibr" target="#b7">[8]</ref>. Additionally, when dealing with dynamic stream data such as IoT, the nature of the data distribution tends to change over time, resulting in the phenomenon known as concept drift. It occurs when the statistical properties of the target variable, which the model is trying to predict, change over time in unforeseen ways <ref type="bibr" target="#b14">[15]</ref>. When that happens, the learned patterns of past data may not be relevant to the new data, leading to poor predictions and incorrect decisions. Machine Learning based analysis needs to be able not only to detect the drift, but also understand and react to it.</p><p>We argue that data management systems demand efficient mechanisms to deal with large-scale, heterogeneous IoT data. A recent work <ref type="bibr" target="#b24">[25]</ref> has demonstrated that the programming model aimed specifically at concurrency and inherent parallelism of actororiented databases such as Orleans <ref type="bibr" target="#b4">[5]</ref> and ReactDB <ref type="bibr" target="#b21">[22]</ref> is an adequate solution for systems focused on IoT data management. Reactive behavior and CEP techniques are also essential for evaluating complex patterns over high-throughput data streams such as IoT <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b20">21]</ref>. Since a large part of data made available by IoT devices is multidimensional spatio-temporal <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b18">19]</ref>, multidimensional array data models could provide great advantages to its management <ref type="bibr" target="#b3">[4]</ref>. However, managing several different platforms instead of one makes the resulting solution unnecessarily complex and potentially inefficient. To the best of our knowledge, no existing solution has been yet proposed to combine all these approaches for IoT Scenarios.</p><p>Therefore, to address the challenges involved in the development of an adequate IoT solution, we envision a Database Management System capable of offering scalable support for IoT data management as well as analysis through Machine Learning. In this work, we present the following contributions: The remainder of this paper is organized as follows. In Section 2 we present the base concepts for the highlighted problems and proposed solutions. In Section 3 we present our idea of leveraging array databases to a scalable, reactive and intelligent solution fit for IoT. We conclude and present our research directions in Section 4.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">RESEARCH CONTEXT</head><p>In this section, we introduce the base concepts of IoT data and challenges related to it. Afterward, we present the different database models that serve as foundation to the proposed solution. Finally, we describe the problem of Concept Drift in IoT context.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">IoT Big Data Challenges</head><p>According to <ref type="bibr" target="#b8">[9]</ref>, big data in IoT has three features that conform to the big data paradigm: (a) a very big range of endpoints that generate masses of data; (b) semi-structured or unstructured data; (c) it is only useful after being analyzed.</p><p>Data generated by IoT has usually a high number of parallel sources, being subject to inaccuracies and noise during acquisition and transmission. It can be streamed continuously or accumulated as a source of big data. When dealing with big data analytics, its possible to produce insights after several days of its generation, but in the case of streaming data IoT analytics, they must be delivered in at most a few seconds or less. This real-time constraint incur in the following challenges for IoT big data:</p><p>Data Management: Data management is a big challenge to be addressed in order to realize the full potential of IoT, and therefore has become a key research topic <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b19">20]</ref>. Many IoT systems are processor-intensive and require processing a massive amount of highly concurrently generated data. How to perform the management of these data interactions while ensuring low latency?</p><p>Visualization: Visualization is important in big data analytics, specially for IoT systems <ref type="bibr" target="#b17">[18]</ref>. How can we perform visualization in the case of heterogeneous and diversely structured data generated in IoT?</p><p>Data Mining: The realization of the potential of IoT depends on being able to gain the insights hidden in the vast and ever increasing available data. Current data mining approaches don't scale well to IoT volumes. What characteristics are the most essential for a system fit to such environments?</p><p>Resource Constraints: In the IoT data stream model, a high volume of data is produced at high speed. Therefore algorithms that process it must do so under very strict constraints of space and time. Addressing these constraints requires that a significant amount of data processing must happen on edge devices. How can we design algorithms that work efficiently in such environments?</p><p>Security: Being able to deal with dynamic scaling while guaranteeing protection of data from different entities is another significant challenge. What is the most effective way to ensure access control and protection of data from large volumes of devices and, at the same time, ensure the development of a dynamic and flexible application?</p><p>2.2 Data management solutions 2.2.1 Array Database Models. Most IoT environments are constituted by static or moving sensor devices placed in specific locations that produce data continuously. Each data item has space coordinates as well as a time-stamp associated, incurring in a high time and space correlation. Because of this multidimensional spatiotemporal nature of IoT data, multidimensional array database models, built using arrays as the primary data representation, offer advantages for an efficient data management.</p><p>Array databases were initially proposed to better represent sensor, image, simulation, and statistics data of tipically spatio-temporal dimensions <ref type="bibr" target="#b3">[4]</ref>. They have special query languages built upon arraybased algebraic formalizations that model different kinds of operations such as aggregations or subsetting. Cells in an array have an intrinsic ordering, making it easy to quickly lookup values by taking advantage of this ordering. Array indexes do not need to be stored and can be inferred by the position of a cell, saving storage space. Arrays can also be split into subarrays (called tiles or chunks) that can be used as processing and storage units to help answering queries efficiently.</p><p>Recently, some research effort is being applied in order to integrate ML tools and array DMBSs <ref type="bibr" target="#b23">[24]</ref>. The system Rasdaman <ref type="bibr" target="#b2">[3]</ref> allows the implementation of machine learning algorithms through User Defined Types and Functions that implement the underlying linear algebra operations directly over the arrays. In the case of SciDB <ref type="bibr" target="#b22">[23]</ref>, users are provided with linear algebra operators that can be used as building blocks to implement the ML algorithms.</p><p>In SAVIME <ref type="bibr" target="#b10">[11]</ref>, users can perform inference from machine learning models as part of the query expression, allowing the jointly optimization of the data preparation process and its input to the model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.2">Active Databases and Complex Event</head><p>Processing. An event can be defined as an occurrence of significance in a system <ref type="bibr" target="#b15">[16]</ref>. Historically, many different initiatives have studied event processing for different reasons. Active Databases intended to extend traditional DBMSs by enabling the specification of reactive behavior. The idea was to develop strategies to respond automatically to events and changes in the database state through mechanisms formalized as ECA rules <ref type="bibr" target="#b25">[26]</ref>: if an event is detected, and any of previously defined conditions become true, then a corresponding action is taken without any external intervention.</p><p>Complex Event Processing extend the logic behind ECA rules, being understood as a set of techniques combined in order to perform real-time stream processing for monitoring and detection of arbitrarily complex patterns in massive data streams <ref type="bibr" target="#b15">[16]</ref>. They are commonly used in IoT environments to enable real-time or near real-time decisions <ref type="bibr" target="#b12">[13]</ref>. In CEP, each data item is abstracted as an event produced by a data source. A CEP engine combines multiple simpler events to produce more complex ones, that match previously defined patterns. It typically must process multiple data streams from different sources in order to track simultaneously hundreds or even thousands of different patterns through evaluation mechanisms such as non-deterministic finite automaton or tree-based plans <ref type="bibr" target="#b11">[12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.3">Actor Oriented Databases.</head><p>The actor programming model is a well-known model for distributed and concurrent programming, in which the actor is the fundamental computing unit. Its main principle is that in a system, the control flow and the data flow must be inseparable. Actors do not share state and communicate via asynchronous messages. Because of its characteristics, actors are a scalable solution to support the management of any number of independent and heterogeneous streaming data sources.</p><p>In recent works, it has been demonstrated the effectiveness of the integration of data management features such as transactions and indexing into actor runtimes <ref type="bibr" target="#b5">[6]</ref>. The authors of <ref type="bibr" target="#b24">[25]</ref> demonstrate that this solution is in fact very suitable to perform IoT data management. A similar approach has sought to integrate actor primitives into relational databases <ref type="bibr" target="#b21">[22]</ref> by extending the programmability of stored procedures with actor objects, taking advantage of databases state management features.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">IoT Concept Drift</head><p>Concept drift can be formally defined as follows <ref type="bibr" target="#b14">[15]</ref>: given a time period [0, 𝑡], a set of samples, denoted as 𝑆 0,𝑡 = {𝑑 0 , ..., 𝑑 𝑡 }, where 𝑑 𝑖 = (𝑋 𝑖 , 𝑦 𝑖 ) is one observation or data instance, 𝑋 𝑖 is the feature vector, 𝑦 𝑖 is the label, and 𝑆 0,𝑡 follows a certain distribution 𝐹 0,𝑡 (𝑋, 𝑦). Concept drift occurs at timestamp 𝑡 + 1, if 𝐹 0,𝑡 (𝑋, 𝑦) ≠ 𝐹 𝑡 +1,∞ (𝑋, 𝑦).</p><p>Research on learning under concept drift presents three components beyond traditional Training/Prediction: Drift detection, drift understanding and drift adaptation. The first refers to whether or not a concept drift occurs in a stream set of data. Drift understanding is related to when, how and where it occurs. Finally, drift adaptation refers to reacting to the existence of a drift.</p><p>Recently, some works have been proposed to deal with concept drift specifically in IoT platforms. For example, the work of <ref type="bibr" target="#b13">[14]</ref> proposes an ensemble learning method based on offline classifiers to address concept drifts and imbalance data concurrently. In <ref type="bibr" target="#b1">[2]</ref>, its proposed an unsupervised model-independent methodology to detect drifts in data generated from IoT devices. In <ref type="bibr" target="#b26">[27]</ref>, it is proposed a concept drift adaptive method to anomaly detection in IoT services that considers the time influence to change the sample distribution. However, this is a not fully explored topic and many research opportunities still exist.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">LEVERAGE ARRAY DATABASES TO IOT COMPLEX EVENT PROCESSING</head><p>Historically, Database Management Systems have offered many benefits to data intensive applications, such as transactions, indexing, query planning and declarative query languages. An IoT data management solution must answer specific demands, such as encapsulation for isolating state and access control, asynchronous primitives and dynamic scalability, since in many scenarios, sensing devices can instantly enter and leave a system. It should be able to detect and react to predefined data patterns automatically, while providing quick data access and an efficient integration to ML analysis.  By taking our inspiration in the approaches of Orleans <ref type="bibr" target="#b4">[5]</ref>, that added data-management functionality in a virtual actor runtime and ReactDB <ref type="bibr" target="#b21">[22]</ref>, which integrates actor features into a relational database system, we investigate the potential of performing event detection and reactive behavior through actor-based primitives in an array database model. Figure <ref type="figure" target="#fig_1">1</ref> illustrates the proposed idea. At the things layer, data is collected from sensor devices and communicated to actor engines at the actor layer. Distributed actors manage these intermediate nodes that process and detect relevant (local) events based on attached sensors before sending them to the cloud based data center, along with relevant data in the form of array data structures. At the analysis layer, global queries and analysis that take into account alerts provided by actors can be made over the collected data. The intention is to provide a low latency environment, in which there is a reduced communication bottleneck.</p><p>The integration of ML-based analytics as part of the Data Management System may lead to powerful optimization opportunities since different parts of the ML process may be treated as operators of the query plan. To cope with the growing need for ML support in IoT data systems, we aim to provide both a local and a global event detector that supports ML inference from trained models as first class operators.</p><p>In IoT environments, communicated data from devices is usually collected and recorded by assuming a temporal relationship between records. As time goes on, concept drift is bound to occur, which may cause an accuracy drop to any methods that rely on long-term statistical data attributes. The proposed solution will count with a central drift detector that is able to determine if and when the drift occurred as well as the best reaction to it based on the local drift detectors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">CONCLUSION AND RESEARCH DIRECTION</head><p>In this paper, we discuss characteristics and challenges of IoT data management and summarize potential contributions from different strategies in addressing each of them. Our goal is to build an efficient, in-memory data management system that combines each of these different contributions into a single integrated solution, while offering a robust support for data analysis trough Machine Learning. As the next step in our study, we aim to focus on the design refinement and implementation of a prototype system as a foundation to our subsequent investigations. To evaluate the viability of our approach, we intend to submit it to a real use-case scenario that presents the IoT characteristics and challenges described. We also intend to perform comparative experiments with state-of-the-art big data frameworks in order to demonstrate the optimization opportunities that we envision.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>Proceedings of the VLDB 2021 PhD Workshop, August 16th, 2021. Copenhagen, Denmark. Copyright (C) 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: System Overview</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Potential contributions from different models for IoT data management • We propose the development of a new Database Management System that offers CEP primitives through actor-based programming in order to perform rule-based monitoring for real-time scalable IoT scenarios. • We propose to further extend our solution to include ML inference as first class operators for CEP, enabling further integration between the data system and the Machine Learning tasks.</figDesc><table><row><cell cols="2">System Features</cell><cell>Actor Oriented Databases</cell><cell>Array Databases</cell><cell>Active Databases</cell><cell>Proposed Solution</cell></row><row><cell>Actor-Based Programming</cell><cell>Dynamic Scalability Asynchronous primitives Encapsulation</cell><cell>+</cell><cell>-</cell><cell>-</cell><cell>+</cell></row><row><cell>Array Based Data Management</cell><cell>Array-Based Operations Flexible Storage Format</cell><cell>-</cell><cell>+</cell><cell>-</cell><cell>+</cell></row><row><cell>Complex Event Handling</cell><cell>Event Detection Reactive Behavior</cell><cell>-</cell><cell>-</cell><cell>+</cell><cell>+</cell></row><row><cell>Machine Learning Support</cell><cell>ML as first class operations Concept Drift Handling</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>+</cell></row></table><note>• We propose to investigate the challenges involved in concept drift handling specifically in an IoT environment, and how to address these challenges in a data management system.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>highlights the strong contributions offered by active, actor-oriented and array databases to each of these IoT demands.</figDesc><table><row><cell>Sensor Devices</cell><cell>Query (continuous) Processor</cell><cell>Data Staging</cell><cell>Storage</cell><cell>Array</cell></row><row><cell>Working Storage</cell><cell>/ Concept Drift Detector</cell><cell>Event</cell><cell cols="2">Data Structures</cell></row><row><cell></cell><cell></cell><cell>Processor</cell><cell></cell><cell></cell></row><row><cell></cell><cell>Event Detector</cell><cell>(Local)</cell><cell>Array Continuous Loader</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>Model</cell><cell>Array Data</cell></row><row><cell>Stream</cell><cell></cell><cell></cell><cell>Manager</cell><cell>Manager</cell></row><row><cell>Data</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>Event Processor</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell>(Global)</cell><cell></cell></row><row><cell>Things</cell><cell>Actors</cell><cell></cell><cell cols="2">Analysis</cell></row><row><cell>Layer</cell><cell>Layer</cell><cell></cell><cell></cell><cell>Layer</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">ACKNOWLEDGEMENT</head><p>We would like to thank CAPES for its scholarships, and Petrobras for financing this work through the Gypscie project.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Analysis of eight data mining algorithms for smarter Internet of Things (IoT)</title>
		<author>
			<persName><forename type="first">Furqan</forename><surname>Alam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rashid</forename><surname>Mehmood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Iyad</forename><surname>Katib</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aiiad</forename><surname>Albeshri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Procedia Computer Science</title>
		<imprint>
			<biblScope unit="volume">98</biblScope>
			<biblScope unit="page" from="437" to="442" />
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Aggregate density-based concept drift identification for dynamic sensor data models</title>
		<author>
			<persName><forename type="first">Mohsen</forename><surname>Asghari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Daniel</forename><surname>Sierra-Sosa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Telahun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Anup</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Adel</forename><forename type="middle">S</forename><surname>Elmaghraby</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Neural Computing and Applications</title>
		<imprint>
			<biblScope unit="page" from="1" to="13" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The multidimensional database system RasDaMan</title>
		<author>
			<persName><forename type="first">Peter</forename><surname>Baumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andreas</forename><surname>Dehmel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paula</forename><surname>Furtado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Roland</forename><surname>Ritsch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Norbert</forename><surname>Widmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 1998 ACM SIGMOD international conference on Management of data</title>
				<meeting>the 1998 ACM SIGMOD international conference on Management of data<address><addrLine>Washington, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="1998">1998</date>
			<biblScope unit="page" from="575" to="577" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Array databases: concepts, standards, implementations</title>
		<author>
			<persName><forename type="first">Peter</forename><surname>Baumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dimitar</forename><surname>Misev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Vlad</forename><surname>Merticariu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bang</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><surname>Huu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Big Data</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="61" />
			<date type="published" when="2021">2021. 2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Orleans: Distributed virtual actors for programmability and scalability</title>
		<author>
			<persName><forename type="first">Phil</forename><surname>Bernstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sergey</forename><surname>Bykov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alan</forename><surname>Geller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gabriel</forename><surname>Kliot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jorgen</forename><surname>Thelin</surname></persName>
		</author>
		<idno>MSR-TR- 2014-41</idno>
		<imprint>
			<date type="published" when="2014">2014. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Indexing in an Actor-Oriented Database</title>
		<author>
			<persName><forename type="first">Mohammad</forename><surname>Philip A Bernstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tim</forename><surname>Dashti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Kiefer</surname></persName>
		</author>
		<author>
			<persName><surname>Maier</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CIDR</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Parallel data analysis directly on scientific file formats</title>
		<author>
			<persName><forename type="first">Spyros</forename><surname>Blanas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kesheng</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Surendra</forename><surname>Byna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bin</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arie</forename><surname>Shoshani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2014 ACM SIGMOD international conference on Management of data</title>
				<meeting>the 2014 ACM SIGMOD international conference on Management of data<address><addrLine>Utah, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="385" to="396" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Model slicing for supporting complex analytics with elastic inference cost and resource constraints</title>
		<author>
			<persName><forename type="first">Shaofeng</forename><surname>Cai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gang</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Beng</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Chin</forename><surname>Ooi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jinyang</forename><surname>Gao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the VLDB Endowment</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="86" to="99" />
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Big data: related technologies, challenges and future prospects</title>
		<author>
			<persName><forename type="first">Min</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Shiwen</forename><surname>Mao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yin</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">M</forename><surname>Victor</surname></persName>
		</author>
		<author>
			<persName><surname>Leung</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
			<publisher>Springer</publisher>
			<biblScope unit="volume">96</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Processing flows of information: From data stream to complex event processing</title>
		<author>
			<persName><forename type="first">Gianpaolo</forename><surname>Cugola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alessandro</forename><surname>Margara</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys (CSUR)</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="1" to="62" />
			<date type="published" when="2012">2012. 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">SAVIME: An Array DBMS for Simulation Analysis and ML Models Prediction</title>
		<author>
			<persName><forename type="first">Anderson</forename><surname>Chaves Da Silva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hermano</forename><surname>Lourenço Souza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Daniel</forename><surname>Lustosa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fábio André Machado</forename><surname>Nascimento Ramos Da Silva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Patrick</forename><surname>Porto</surname></persName>
		</author>
		<author>
			<persName><surname>Valduriez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Information and Data Management</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">3</biblScope>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Complex event recognition in the big data era: a survey</title>
		<author>
			<persName><forename type="first">Nikos</forename><surname>Giatrakos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Elias</forename><surname>Alevizos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alexander</forename><surname>Artikis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Antonios</forename><surname>Deligiannakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Minos</forename><surname>Garofalakis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The VLDB Journal</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="313" to="352" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Real-time multi-pattern detection over event streams</title>
		<author>
			<persName><forename type="first">Ilya</forename><surname>Kolchinsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Assaf</forename><surname>Schuster</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 International Conference on Management of Data</title>
				<meeting>the 2019 International Conference on Management of Data</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="589" to="606" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Concept drift detection and adaption in big imbalance industrial IoT data using an ensemble learning method of offline classifiers</title>
		<author>
			<persName><forename type="first">Chun-Cheng</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Der-Jiunn</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chin-Hung</forename><surname>Kuo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Linnan</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="56198" to="56207" />
			<date type="published" when="2019">2019. 2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Learning under concept drift: A review</title>
		<author>
			<persName><forename type="first">Jie</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Anjin</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fan</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Feng</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Joao</forename><surname>Gama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guangquan</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Knowledge and Data Engineering</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page" from="2346" to="2363" />
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">The Power of Events: An Introduction to Complex Event Processing in Distributed Enterprise Systems</title>
		<author>
			<persName><forename type="first">C</forename><surname>David</surname></persName>
		</author>
		<author>
			<persName><surname>Luckham</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Addison-Wesley Longman Publishing Co., Inc</publisher>
			<pubPlace>USA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Data management for internet of things: Challenges, approaches and opportunities</title>
		<author>
			<persName><forename type="first">Meng</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ping</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chao-Hsien</forename><surname>Chu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE International conference on green computing and communications and IEEE Internet of Things and IEEE cyber, physical and social computing</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2013">2013. 2013</date>
			<biblScope unit="page" from="1144" to="1151" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Big IoT data analytics: architecture, opportunities, and open research challenges</title>
		<author>
			<persName><forename type="first">Mohsen</forename><surname>Marjani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Fariza</forename><surname>Nasaruddin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Abdullah</forename><surname>Gani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ahmad</forename><surname>Karim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ibrahim</forename><surname>Abaker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Targio</forename><surname>Hashem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aisha</forename><surname>Siddiqa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ibrar</forename><surname>Yaqoob</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="5247" to="5261" />
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Deep learning for IoT big data and streaming analytics: A survey</title>
		<author>
			<persName><forename type="first">Mehdi</forename><surname>Mohammadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ala</forename><surname>Al-Fuqaha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sameh</forename><surname>Sorour</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mohsen</forename><surname>Guizani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Communications Surveys &amp; Tutorials</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="2923" to="2960" />
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">VergeDB: A Database for IoT Analytics on Edge Devices</title>
		<author>
			<persName><forename type="first">John</forename><surname>Paparrizos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chunwei</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bruno</forename><surname>Barbarioli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Johnny</forename><surname>Hwang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ikraduya</forename><surname>Edian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aaron</forename><forename type="middle">J</forename><surname>Elmore</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><forename type="middle">J</forename><surname>Franklin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sanjay</forename><surname>Krishnan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CIDR</title>
				<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Integrating complex event processing and machine learning: An intelligent architecture for detecting IoT security attacks</title>
		<author>
			<persName><forename type="first">José</forename><surname>Roldán</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Juan</forename><surname>Boubeta-Puig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">José</forename><surname>Luis Martínez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guadalupe</forename><surname>Ortiz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Expert Systems with Applications</title>
		<imprint>
			<biblScope unit="volume">149</biblScope>
			<biblScope unit="page">113251</biblScope>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Reactors: A case for predictable, virtualized actor database systems</title>
		<author>
			<persName><forename type="first">Vivek</forename><surname>Shah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marcos</forename><surname>Antonio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Vaz</forename><surname>Salles</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 International Conference on Management of Data</title>
				<meeting>the 2018 International Conference on Management of Data</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="259" to="274" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">SciDB: A database management system for applications with complex analytics</title>
		<author>
			<persName><forename type="first">Michael</forename><surname>Stonebraker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paul</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Donghui</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jacek</forename><surname>Becla</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computing in Science &amp; Engineering</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="54" to="62" />
			<date type="published" when="2013">2013. 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">On the Integration of Machine Learning and Array Databases</title>
		<author>
			<persName><forename type="first">Sebastian</forename><surname>Villarroya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><surname>Baumann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE 36th International Conference on Data Engineering (ICDE). IEEE</title>
				<imprint>
			<date type="published" when="2020">2020. 2020</date>
			<biblScope unit="page" from="1786" to="1789" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Modeling and Building IoT Data Platforms with Actor-Oriented Databases</title>
		<author>
			<persName><forename type="first">Yiwen</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Julio</forename><surname>Cesar Dos</surname></persName>
		</author>
		<author>
			<persName><surname>Reis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marcos</forename><surname>Kasper Myrtue Borggren</surname></persName>
		</author>
		<author>
			<persName><surname>Antonio Vaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Claudia</forename><surname>Salles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yongluan</forename><surname>Bauzer Medeiros</surname></persName>
		</author>
		<author>
			<persName><surname>Zhou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EDBT</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="512" to="523" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<title level="m" type="main">Active database systems: Triggers and rules for advanced database processing</title>
		<author>
			<persName><forename type="first">Jennifer</forename><surname>Widom</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stefano</forename><surname>Ceri</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1996">1996</date>
			<publisher>Morgan Kaufmann</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Improved Long Short-Term Memory based anomaly detection with concept drift adaptive method for supporting IoT services</title>
		<author>
			<persName><forename type="first">Rongbin</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yongliang</forename><surname>Cheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhiqiang</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ying</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yun</forename><surname>Yang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Future Generation Computer Systems</title>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
