<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Big Data Management Architecture for Smart Cities based on Fog-to-Cloud Data Management Architecture</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Norwegian University of Science and Technology (NTNU)</institution>
          ,
          <addr-line>Trondheim</addr-line>
          ,
          <country country="NO">Norway</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1876</year>
      </pub-date>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>A massive volume of data has been created and saved in their related data repositories in either structured or unstructured format (Big Data). Therefore, there are several ongoing studies (including in different environments, scenarios, and sciences) nowadays to propose the Big Data management architectures. Data management (including data acquisition, data preservation, or data processing) becomes a complex task in these studies during their entire life cycles. In a particular example of the Big Data environment, the smart city is an advanced technological option to increase citizens' quality of life through different smart services. It is evident that data is the most critical element in a smart city. Without relevant data, no smart services can be launched. Recently Fog-toCloud (F2C) data management combines the advantages of both the centralized (Cloud) and distributed data management in a smart city. The advantage of this hierarchical distributed data management is diverse, including reducing communication latencies for real-time or critical services, decreasing network data traffic, applying different policies (for example, for data filtering, data aggregation, data securities, etc.) and so on. In this paper, we developed our proposed hierarchical distributed data management architecture for Zero Emission Neighborhoods (ZEN) center in Norway. In the beginning, the hierarchical distributed architecture, that has the potential to organize all data life cycle stages, has been described (from creation to consumption). Afterward, it has been illustrated that the architecture can manage different obtained data types (including real-time, last-recent, and historical data) in each cross-layer (from IoT devices to cloud technologies) of the architecture.</p>
      </abstract>
      <kwd-group>
        <kwd>Smart Cities</kwd>
        <kwd>IoT</kwd>
        <kwd>Data Management</kwd>
        <kwd>Fog-to-Cloud (F2C) Data Management</kwd>
        <kwd>Big Data Management</kwd>
        <kwd>Data LifeCycle (DLC)</kwd>
        <kwd>Vs Challenges</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        A massive amount of data is being produced and stored rapidly, regularly and
unlimitedly in their related distributed data repositories in the contemporary IT world. This
massive data can be in structured, semi-structured or unstructured formats (Big Data).
In addition, the above mentioned data have the possibility to be shared and openly
accessible (Open Data) and used by the potential clients in either private or public sector
Copyright held by the author(s). NOBIDS 2018
(Open Government Data). However, it is widely accepted that sharing huge amounts of
heterogeneous data imposes several challenges and difficulties for the data management
systems. Several specific Data LifeCycle (DLC) models have been designed for
particular scenarios and different fields [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1-4</xref>
        ], to name a few, but there is still an open
discussion about how to organize and manage these complexities of data and their related
challenges. This solution can contribute to efficient usage and exploitation of data
during all steps of the data life cycle.
      </p>
      <p>As an example of the Big Data scenario, smart cities are the available technological
solutions to organize all the challenges and complexities of the increasing population
and urban density. Data is one of the most valuable ingredients in the smart cities’
environments. They provide a fertile ground for a city to be smart and creative. In fact,
data create the required information for services to proceed according to contextual
parameters, or some higher value knowledge extracted from complicated data analysis.
Therefore, smart cities are an ideal scenario to create abundant data from many different
sources in the city (such as sensors, smartphones, camera surveillance etc.) to combine
with historical information. Consequently, there are many ongoing efforts and active
studies in academia as well as in industry nowadays to manage these large data
amounts.</p>
      <p>
        In this research paper, we developed further our Big Data management architecture
proposal for the ZEN research centers [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For more information, we recently proposed
a ZEN data management architecture based on the F2C data management architecture
in the smart cities [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Our main interest in this paper is to show that the ZEN data
management architecture is able to cover all data life stages (from generation to usage)
through cross-layers (from IoT devices to cloud technologies) with respect to the recent
proposal for the Comprehensive Scenario Agnostic DLC (COSA-DLC) model in the
smart cities [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Besides that, we depict that different data types in terms of the product
time (including historical, last-recent, and real-time data) can be supported and
managed through their data life cycle stages in our proposed Big Data management
architecture. On top of that, we used the F2C technologies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] to have both advantages of
the centralized and distributed data management architectures for our proposed Big
Data management architecture in the smart cities context.
      </p>
      <p>
        The rest of this paper is structured as follows. Section 2 introduces background about
the data management architectures (including cloud and F2C data management) in the
smart city environments. In Section 3, we explain some main insights related to the Big
Data contexts and concepts. Section 4 describes the essential concepts in a Big Data
management architecture based on F2C data management in smart cities. Section 5
pertains to the different smart cities scenario through ZEN center in Norway. In addition,
our focus is to extend our proposed ZEN data management architecture [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] with some
additional Big Data management insights (including different data types, covering all
data life stages and so on). Section 6 concludes this paper and introduces future research
works.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        There is recently much ongoing research to build the Big Data Management
architectures in different environments, scenarios and sciences [9-11], to name a few. Data
management and organization considered as a complicated task during their entire life
cycle (DLC) [12, 13]. The principal aim of data management is to give direction for the
easy and safe access to the data sources and repositories, in order to discover any kind
of new value from your Big Data sources. Because of the above mentioned reason,
advanced data management and organization systems are core topic for the efficient
value generation. Moreover, the available concepts of Relational Database
Management Systems (RDBMS) and the recent Extract-Transform-Load (ETL) process have
been suggested for modeling the typical data life cycles in data warehousing
environments [
        <xref ref-type="bibr" rid="ref4">4, 14</xref>
        ] but these are not sufficient for Big Data paradigm. Therefore, Big Data
concepts impose several new challenges to the existing data management and
organization systems [
        <xref ref-type="bibr" rid="ref4">4, 15</xref>
        ].
      </p>
      <p>
        In a specific example of the Big Data scenario, smart cities are composed of large
amount of data (including unstructured, semi-structured, and structured). Therefore,
there are two main references for the data management architecture in smart cities. On
one side, the majority of the architectures designed with explicit data management
schemes are centralized in one place. This highlights that even though data is obtained
from distinct sources spread across the city (including sensors, surveillance cameras,
third-party applications, external databases, etc.), data can be accessed from a
centralized platform, normally using cloud technologies [16, 17]. On the other side, minor
architectures go beyond a distributed schema for data management [
        <xref ref-type="bibr" rid="ref6 ref8">6, 8, 18</xref>
        ], using
some advanced technologies such as Fog Computing [19] or F2C Computing [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>To sum up, we can highlight that all the available data management architectures for
smart cities have the following concerns and limitations:
 There are a few studies to contribute the distributed architecture especially in terms
of the distributed data management architecture;
 There is no proposal for DLC model (including all data life stages from creation to
conception) within ZEN data management architecture in smart city context.
 There is no previous work (based on F2C data management architecture) to handle
all the data life stages through different smart city scenarios.</p>
      <p>
        Therefore; because of the all above mentioned points, hierarchical distributed data
management architecture [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for ZEN center [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to cover all concerns and limitations has
been developed.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Big Data Contexts and Concepts</title>
      <p>Coined a few years ago, the importance of the “Big Data” term has been discussed
by several scientific communities. Nowadays, there are several technical issues and
business impacts of Big Data but there is no global consensus on a uniform and highly
accepted definition [20]. That said, we might suggest that Big Data definitions can be
assumed into three main categories depending on the main character used to formally
establish the definition after a thorough reading process on the current related literature.
Those main characters are “data quantity”, “challenges”, and “Big Data management
complexities”. For each of the mentioned characteristics, Table 1 defines relevant
references in the literature.</p>
      <p>
        After revisiting all definitions about the Big Data contexts [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], it has recognized that
there is a difference between using the Vs model for Big Data definition and Big Data
challenges. In one side, the complete definition of Big Data can be suggested
considering Variety, Volume, and Velocity, since these are the features that explain Big Data –
Value might be also added considering Big Data has imagined fueled by the potential
value among such massive data. On the other side, with respect to the Big Data
challenges, it has not offered that Visualization, relating to a way of presenting data once
processed [26], is one main challenge for Big Data technology. This is rooted in the
fact that Visualization is an optional software programming aspect for end-users.
Indeed, in [
        <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
        ] offered that the 6Vs challenges model, including of Value, Volume,
Variety, Velocity, Variability and Veracity.
      </p>
      <p>Big Data is a massive amount of both structured and unstructured data that that is hard to be processed by
using traditional database and software algorithms [37].</p>
      <p>This section is organized into three main subsections with respect to the Big Data
definitions concepts as it is categorized in Table 1. First, we briefly mention the quantity
of Big Data environments. Second, we discuss Big Data challenges. These challenges
are mainly related to the Vs challenges generation (including 3Vs, 4Vs, 5Vs, 6Vs, etc.).
Third, we describe the complexities of the Big Data management.
3.1</p>
      <sec id="sec-3-1">
        <title>Big Data Quantity</title>
        <p>Data is being produced in large amounts unstructured and structured formats in every
moment. In addition, the produced data generated from different data sources, with
distinct formats and so on. This freshly obtained data is often combined together with the
archived historical data. This accumulated data builds initial ingredients for future
knowledge purposes in many different fields of sciences and Big Data scenarios. As an
example of the Big Data environment in the biological systems field [38], the data
growth is from a rate of 1 KB per day in 1996 to a rate of 10 GB per day in 2011.
Therefore, the size and number of experimental datasets are increasing exponentially
in fact.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Big Data Challenges</title>
        <p>
          The main challenges in Big Data environment have been widely discussed [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]
through the 3Vs challenges (including Volume, Variety and Velocity) as proposed by
Gartner [24]. This model has been suggested to the 5Vs challenges, which may be
considered as 4 +1, or 4Vs parameters (including Value, Variety, Velocity and volume).
The additional V challenge may consider either Variability as stated by [33, 39] or
Veracity as to be read in [30-32]. Currently few authors showed that the challenges can be
7Vs [34, 35] (including both Variability and Veracity, and adding Visualization). Then,
other authors added Volatility, Viscosity and Virality as additional challenges [40-54].
Indeed, after a thorough reading process on the current related literature, in [
          <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
          ]
suggested that the Big Data challenges can be defined as a 6Vs challenges (including
Volume, Variety, Velocity, Variability, Veracity, and Value).
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Big Data Management Complexities</title>
        <p>The chronological evolution of digital data generation can be related to some years ago.
Then, it is seems that the first digital data was produced by the first computer in the
world (Data Creation). The produced digital data might be eventually stored in distinct
types of media for the future purpose (Data Storing). Then, the stored data must be
converted into information and useful knowledge through some specific processing
(Data Processing). Then data analyzed to extract new values for the end users (Data
Analysis). This initial roadmap (from data creation to data analysis through data storing
and data processing) demonstrates a simple data life cycle.</p>
        <p>
          DLC models have been proposed to set a high-level framework standing for a global
data life view from the creation stage to the usage stage. The main purpose of a DLC
model is to organize data management in order to suggest the data products to the
endusers, which is fitted to their requirements [
          <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1-4</xref>
          ]. In addition, several DLC models exist
in the literature as part of particular sciences and/or environments and/or data cycles
management. In [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] the authors proposed the COSA-DLC model to manage and
organize data in any scenario, science, and Big Data environment. On top of that, the model
is able to fit easily to any Big Data scenario (including smart city, Scientific areas and
so on).
        </p>
        <p>
          The COSA-DLC model consist of three main blocks and is presented in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] as shown
in Fig. 1. Each block contains a set of sophisticated phases. These phases make the
model comprehension, agnosticism, and adaption true. For more detailed information,
the Data Acquisition block includes four phases. The Data Processing block offers in
three phases. Finally, the Data Preservation block consists of four phases. The
description of each phase (including all responsibilities and functionalities) together with the
relationship among phases, is named the Data LifeCycle Management (DLM) and is
presented in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <sec id="sec-3-3-1">
          <title>DaPtraocess</title>
          <p>a ty
t i
Daual
Q</p>
          <p>Dantaalysis
A
Data Processing Block
Data Acquisition Block
Data Preservation Block
DCaotlalection</p>
          <p>DFaitlatering</p>
          <p>Datuaality
Q</p>
          <p>DDeastcaription</p>
          <p>Daitfaication
Class</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>DQatuaality</title>
          <p>DAatrachive</p>
          <p>Dataination
Dissem
Smart cities include several technological challenges and need for a ubiquitous
deployment of computing resources throughout the city (from IoT devices to advanced data
centers). All data resources must be connected through several communication
networks by many different network technologies (including wireless sensor networks,
Bluetooth, 4G, Wi-Fi, etc.) and this scenario together should be organized by deploying
advanced architectural approaches (including IoT, IoE etc [55]) to build the smart city
idea. However, beyond all technologies, the most precious resource for a city to become
smart is data. In addition, there is a huge number of data sources (including sensors,
smart devices, 3rd party applications and so on) across smart cities in today’s world. As
a result, there is a huge concern to organize the big amount of produced data in the
smart cities in academia and industries nowadays. Such a huge concern faces the
following difficulties, to name a few:
 Data sources are distributed in a smart city.
 The data volume is growing exponentially in a smart city.
 Data generates by varieties sources, types, and formats (heterogeneous data).
 There are many redundant and dark (useless) data in related data storages.
 Smart services request for the use of both, real-time data (for fast access and critical
services) and historical data (for deeper computing services).</p>
          <p>
            In [
            <xref ref-type="bibr" rid="ref7">7, 56</xref>
            ] the authors defined a comprehensive DLC model in the context of a smart
city, which combines the advantages of the centralized and distributed data
management strategies. Therefore, if services or data stakeholders request particular (or
critical) data, it will be collected from the distributed data sources (means that the real-time
data comes from a close location). Although if in case more historical dataset is
required, probably least recent, it is collected from upper levels (thus with higher level of
the capabilities), more centralized nodes. The model is called the Smart City
Comprehensive Data Lifecycle (SCC-DLC) model based on the F2C data management
architecture (from distributed to centralized data management). . This model is able to
manage all data life cycle stages (from data collection to usage). Moreover, the model
covered all other important features, such as data quality and data security. The benefits of
this model is that it mix the benefits of both the cloud and fog computing technologies.
These benefits are using high performance capacities for computationally intensive
applications, diminish communication latencies for real-time or critical services, decrease
network data traffic and increase fault tolerance and security protection, to name a few.
5
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Use Case for Big Data Management in Smart Cities</title>
      <p>
        As an example of the smart cities use case (which can be assumed as a particular
scenario for the Big Data environments), the ZEN center is located in Norway [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The
ZEN center looks at groups of buildings, instead of a single building, which was the
Zero Emission Buildings (ZEB) center’s objective [57]. A neighborhood is described
as a group of interconnected buildings with related infrastructure [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Therefore, the
final target moves to “smart cities” through the idea of the zero-emission neighborhoods
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The ZEN center has eight different pilot projects in distinct cities (Bodø,
Trondheim, Steinkjer, Evenstad, Elverum, Oslo, Baerum and Bergen) in Norway as depicted
in Fig. 2.
      </p>
      <p>
        Our main effort in this paper is to develop our proposed data management
architecture for the ZEN center [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and their related pilots. Then, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] designed the hierarchy
architecture as shown in Fig.2. The base of our hierarchy architecture is established
throughout the two main axes, Time and Location, has been depicted. Those axes helps
us to depict our idea about Big Data Management in smart cities through the concept
of the “data types”, “data management architecture” and “DLC models”.
      </p>
      <p>There are several applications/services in the ZEN pilots, which are working with
multiple data sources (including IoT sources, web services and so on). Then, it seems
that there are several complexities to discover suitable data sources regarding
timealignment (real-time, historical, and last-recent data). Therefore, in our Big Data
management proposal, we categorized data according to its age, ranging from real-time to
historical data. First, real-time data needs to be consumed for critical applications as it
is generated. Such real-time data requires some implicit proximity constraints because
these data faces some obstacles to be critical in remote services. Second, data can be
considered historical (older data) as long it is accumulated and saved on the data
repositories for the future purpose. In this situation, historical data can reside in a place (for
instance cloud technologies) which is farther away of their own related data sources. It
can result in the higher level of latencies that can be assumed to data access from the
cloud. Finally, the last-recent data is in the middle layer of our time-alignment schedule
assumption through the architecture. Then, the data will be received from all lower data
sources in this layer. This layer has individual tasks (including processing and storage).
Afterward the data will be sent to the upper layers (cloud technologies).</p>
      <p>There are two types of the data management architecture in our proposed Big Data
architecture. One is centralized (Cloud) and the other one is distributed data
management architecture. In this proposed architecture, we used the advantage of using both
data management architectures to handle the Big Data complexities and challenges in
our scenario. On one hand, the distributed data management architecture positioned to
the closest layers of the data sources in the city. In addition, the distributed data
management architecture uses the potential of the fog technologies to overcome the Big
Data challenges in the distributed schema. On the other hand, the centralized data
management architecture is located to the top place in our architecture. Sometimes, the
physical place can be considered in different cities or continent that is far away from
the data sources. Traditionally, the centralized data management model uses the cloud
technologies to organize all historical (centralized) data.</p>
      <p>As we discussed in Section 4, our proposed Big Data management architecture fits
the SCC-DLC model to organize all data life stages (from creation to consumption) in
our scenario. In addition, we explained our vision of data lifecycle in terms of core steps
and data flow. We proposed three main blocks (which are namely Data Acquisition,
Data Processing, and Data Preservation). Data Acquisition block plans to collect all
available data from our city pilots. Therefore, the Data Acquisition block is mainly
responsible for collection, classification, quality check, description tag of data and
prepare them for the next purpose. Then, the data can be processed or preserved. The Data
Processing block is mainly converting data into valuable information through several
analysis or analytical processes. This processed data can be utilized by the users or
saved for the next purpose. Indeed, the Data Preservation block is mainly handling the
data storing tasks (received in either the Data Acquisition or Data Processing blocks),
and make them ready for publication, or for advanced processing purposes.</p>
      <p>As shown in Fig.2, a proposed ZEN data management architecture consists of the
following three layers architecture: (Fog-Layer-1, Fog-Layer-2, and Cloud layer) and
described below.
 Fog-Layer-1: is an adjacent layer to the end-users and IoT devices in the pilot. This
layer constitutes many different IoT-Sources (including sensors, smart-phones and
so on) in the Fog-Areas (including various types of the building and their
neighborhoods) and Fog-Device (is the most robust node for the processing and storage
among the IoT-Sources). This layer can handle several Big Data management tasks
as shown below.
o
o
o</p>
      <p>Data Type: Fog-Layer-1 is saving the real-time data.</p>
      <p>Data Management architecture: this layer is considered as a part of the distributed
data management architecture.</p>
      <p>DLC model: on one hand, this layer is mainly responsible for the duties of the
Data Acquisition block (darker color level) because the majority of the data
sources are positioned in this layer. On the other hand, this layer provides the
processing and storage capacities (lighter color level).
 Fog-Layer-2 is a middle layer between Cloud and Fog-Layer-1 layers. This layer
includes with IoT-Hub (is the strongest node for processing and storage to handle all
obtained data). Moreover, this layer is somewhere in the city of the pilot, but it is not
as close as to IoT devices like Fog-Layer-1. This layer can manage several Big Data
management tasks as shown below.</p>
      <p>o Data Type: Fog-Layer-2 is a place for storing the last-recent data.
o Data Management architecture: again, this layer assumes under consideration
of the distributed data management architecture.
o DLC model: IoT-Hub is responsible for high-level tasks under Data
Preservation and Data Processing blocks (medium color level). However, the cloud is
responsible for the advanced level of the processing and storage tasks. Besides
that, the Data Acquisition block has less responsibility in this layer (lighter color
level) than the lower layer because the data sources are fewer than the lower
layer.
 Cloud layer is in the prominent position. The cloud accumulates with the most potent
resources concerning processing and storage. Data Type: Fog-Layer-1 provides
facility to store all historical data. This layer can organize several Big Data
management tasks as shown below.</p>
      <p>o Data Type: cloud is responsible for keeping the historical data.
o Data Management architecture: this layer is the central place for centralized
data management.
o DLC model: the cloud technology has almost unlimited resources for all
enquires of data. Then, all related tasks will be done in the cloud environment
(darker color level).</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper, we developed our proposed data management architecture for ZEN center
based on a distributed hierarchical F2C data management. The main points of this
development are the following:
 We illustrated the upsides of using both data management architectures (including
distributed and centralized) in the context of the smart cities to handle Big Data
complexities and challenges;
 We explained that our proposed architecture has an interesting facility to organize
all different data types (including real-time, last-recent, and historical data) from IoT
devices to cloud technologies in each cross-layer of our architecture;
</p>
      <p>We described that F2C data management (from distributed to centralized) has a
great possibility to handle all data life stages (from creation to conception) with
respect to the DLC concepts;
 We contributed to different smart city scenarios to demonstrate our proposed Big
Data architecture for the smart cities.</p>
      <p>As a part of our future work, we will discover more options related to developing
our ZEN data management architecture, such as extending other data sources and
external third-party applications.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgment</title>
      <p>This paper has been written within the Research Centre on Zero Emission
Neighborhoods in smart cities (FME ZEN). The authors gratefully acknowledge the support from
the ZEN partners and the Research Council of Norway. In addition, the first author
would like to express my very great appreciation to the Advanced Network Architecture
Lab (https://craax.upc.edu/) in UPC university of Barcelona, Spain because of their
support for his Ph.D. thesis under the FI-DGR scholarship 2015FI_B100186
(https://upcommons.upc.edu/handle/2117/114435).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Sinaeepourfard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masip-Bruin</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Hierarchical distributed fog-to-cloud data management in smart cities</article-title>
          .
          <source>Departament d'Arquitectura de Computadors</source>
          , vol.
          <source>Doctoral thesis</source>
          . Universitat Politècnica de Catalunya (UPC), Barcelona, Spain (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Levitin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Redman</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>A model of the data (life) cycles with application to quality</article-title>
          .
          <source>Journal of Information and Software Technology on Elsevier</source>
          <volume>35</volume>
          ,
          <fpage>217</fpage>
          -
          <lpage>223</lpage>
          (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rüegg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gries</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bond-Lamberty</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bowen</surname>
            ,
            <given-names>G.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Felzer</surname>
            ,
            <given-names>B.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McIntyre</surname>
            ,
            <given-names>N.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soranno</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vanderbilt</surname>
            ,
            <given-names>K.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weathers</surname>
          </string-name>
          , K.C.
          <article-title>: Completing the Data Life Cycle: using information management in macrosystems ecology research</article-title>
          .
          <source>Journal of Frontiers in Ecology and the Environment</source>
          <volume>12</volume>
          ,
          <fpage>24</fpage>
          -
          <lpage>30</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chua</surname>
          </string-name>
          , T.-S.,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Toward scalable systems for big data analytics: A technology tutorial</article-title>
          .
          <source>Journals &amp; Magazines on IEEE Access 2</source>
          ,
          <fpage>652</fpage>
          -
          <lpage>687</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>5. https://fmezen.no/</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Sinaeepourfard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krogstie</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petersen</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gustavsen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A Zero Emission Neighbourhoods Data Management Architecture for Smart City Scenarios: Discussions toward 6Vs challenges</article-title>
          .
          <source>International Conference on Information and Communication Technology Convergence (ICTC)</source>
          .
          <source>IEEE</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Sinaeepourfard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masip-Bruin</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marín-Torder</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Towards a comprehensive data lifecycle model for big data environments</article-title>
          .
          <source>Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies</source>
          , pp.
          <fpage>100</fpage>
          -
          <lpage>106</lpage>
          . ACM (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Sinaeepourfard</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Masip-Bruin</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marin-Tordera</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Data Preservation through Fog-to-Cloud (F2C) Data Management in Smart Cities</article-title>
          .
          <source>2nd International Conference on Fog and Edge Computing (ICFEC)</source>
          ,
          <year>2018</year>
          IEEE, pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . IEEE (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>