<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Multi-Agent Framework for a Hadoop Based Air Quality Decision Support System</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Abdelaziz El Fazziki</string-name>
          <email>elfazziki@uca.ma</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Abderrahmane Sadiq</string-name>
          <email>Abderrahmane.sadiq@edu.uca.ma</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jamal Ouarzazi</string-name>
          <email>ouarzazi@uca.ma</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Sadgal</string-name>
          <email>sadgal@uca.ma</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Systems Engineering Laboratory, Cadi Ayyad University of Marrakech</institution>
          ,
          <country country="MA">Morocco</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Laboratoire Physico-Chimie des Matériaux et Environnement (URAC 20), Cadi Ayyad University of Marrakech</institution>
          ,
          <country country="MA">Morocco</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <fpage>45</fpage>
      <lpage>59</lpage>
      <abstract>
        <p>Tropospheric pollution is controlled by various factors such as the distribution of pollutant sources, the nature and amount of energy, as well as the land use and meteorological parameters. These factors must be taken into account in the management of the air quality. Thus, a development of an air quality decision support system able to manage these factors and to answer the questions of environmental managers in real-time is imperative. Such system requires an advanced modeling and information analyzing and processing techniques that should take into account some aspects, such as the integration of a large amount of data, the behavior of the system environment, the available data sources and the emerging paradigm related to the intelligent systems. To this end, we propose an approach based on the use of the agent technology and big data concept. For the air quality data collection and analysis, we use a Hadoop framework: HBase for data storage and a MapReduce based forecasting process; artificial neural network (ANN) based prediction and K-means as clustering algorithm. Finally, the approach is validated by a case study in which an air quality management support system for the Marrakech city is presented.</p>
      </abstract>
      <kwd-group>
        <kwd>Decision support systems</kwd>
        <kwd>Agent technology</kwd>
        <kwd>Air quality management</kwd>
        <kwd>Hadoop MapReduce</kwd>
        <kwd>Artificial neural network</kwd>
        <kwd>Big Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The continuous increases in productivity bring damage to the environment, due to the
various factory emissions, vehicle exhausts and other pollution sources. In response to
this concern, several studies on air quality management using forecasting and
prediction based solutions have been done [
        <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
        ]. Therefore, this problem can be controlled
by monitoring and alert forecasting, in the context of scientific researches which is the
aim of the proposed system. Also, the solution to growing volumes of data that
demand fast and effective retrieval of information is related to the integration of the
principles of data mining over a distributed environment. The main objective of this
paper is to suggest a solution applicable to large scale data and gives a great flexibility
and speed to perform prediction and forecasting over a distributed framework. For
this we propose a development approach, based on the use of multi-agent systems
(MAS), a Chemistry-transport model (CTM) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and an ANN [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] over the Hadoop
MapReduce framework [
        <xref ref-type="bibr" rid="ref4 ref5">4,5</xref>
        ]. The proposed approach will be validated with a case
study in which an air quality management system will be presented. This system is
used in order to perform predictions and forecasting for air quality and monitor the
level of pollution, to ensure conformity with the local legislation and to evaluate
control options.
      </p>
      <p>The proposed approach is illustrated over a few sections starting with a brief
literature review followed by the system overview in section 3. Section 4 is devoted to the
development process presentation. The resulting multi-agent system is described
Section 5. Section 6 and 7 are dedicated to the data modeling details and the MapReduce
based data analysis process description. In section 8 a case study and the experimental
results are presented, followed by a conclusion and perspectives in section 9.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Many research projects have worked on air quality management and assessment
systems such as [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] where the suggested system is designed to give a support in decision
making, connected with air quality forecasting and managing models. Such system
ensures an assessment of air pollution and allows predicting the air quality in diverse
urban situations. Several studies have also investigated the knowledge discovery
aspects of analyzing data collected from sensor networks. As example Li et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ],
investigate a method of analyzing and monitoring data produced by different sensors
distributed over Taiwan. The system allows investigating the use of a larger variety of
data analysis components.
      </p>
      <p>
        Other projects have also addressed the issue of air quality’s data integration; like
Appetise project [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] that aims to produce a database containing pollution data
combined with other related data such as weather records, and to develop tools for
analyzing and visualizing this data. The TimeMap project [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] has also developed data
analysis software that allows visualization of distributed spatiotemporal data sets, and
interactive maps.
      </p>
      <p>
        Information system enhancement using Hadoop as a data hub to optimize the
decision making infrastructure is a new emerging strategy. Many research works have
proposed a method to leverage the Hadoop framework by effectively integrating it to
the existing data warehouse such as [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] that proposed a study on big data integration
with data warehouse built using relational technology mainly for operational sources.
      </p>
      <p>
        Concerning the use of MAS, we are based on works done by the authors in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
which propose an agent-based decision support system development approach where
the software agents use data mining methods for knowledge discovery, which will be
used as a foundation for decision making and recommendation generation. This
system provides all the necessary steps for a standard decision making procedure using
intelligent agents [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>An Overview of the System</title>
      <p>
        In this system we propose the use of a chemistry-transport model [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] for the air
quality estimation and modeling. For the data analysis, we use an online analytical
processing (OLAP) tool. We also proposed the use of an ANN based predictions
algorithm [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Concerning the data gathering, cleaning and integration, we use a Hadoop
and MapReduce based process to make needed algorithms applicable to large scale
data and give a great flexibility and speed to execute a process over the distributed
framework. The aimed system is composed of the following set of components (see
Figure 1):
 Monitoring data integration component (MDIC)
 External data integration component (EDIC)
 CTM and Prediction component (CTMPC)
 OLAP component (OLAPC)
 User-interface component (UIC)
      </p>
      <p>External data integration</p>
      <p>Component
- Geographic data Emission Data:
- Boundary Type,distribution,
conditions nature
Meteorological Data:
Pressure, Humidity, Wind,
Temperature, Flux...</p>
      <p>Monitoring stations
Monitoring data integration</p>
      <p>Component</p>
      <p>DW</p>
      <p>Big Data
Tools
(Hadoop)</p>
      <p>Pre-processing</p>
      <p>Data [C]mod
Stored into
HBase Scores</p>
      <p>[C]obs</p>
      <sec id="sec-3-1">
        <title>Cube DM</title>
        <p>Data Marts/</p>
        <p>Data Cubes
OLAP Queries
OLAP Component</p>
        <p>CTM Component User-interface</p>
        <p>Component
Chemistry-Transport</p>
        <p>Modeling</p>
        <p>Traffic
Regularization
Prediction Models
e
c
a
Itfrr
e
n
e
s
U</p>
        <p>User 1
User 2</p>
        <p>...</p>
        <p>
          User n
In order to provide an adequate solution in terms of robustness and agility, we use a
multi-agent framework to represent the decision support system components. The
objective is to propose an architecture that consists of a set of autonomous agents able
to set their own goals and actions and interact and collaborate with each other through
a communication protocol [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The proposed system components are structured into
agents. These agents should be identified during the development process.
4.1
        </p>
        <sec id="sec-3-1-1">
          <title>The Development Process</title>
          <p>
            In the development process, we are based on the MDA [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] Paradigm and
Prometheus methodology [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] which has been developed to support the complete software
development lifecycle from problem description to implementation. It offers an
environment for analyzing, designing, and developing heterogeneous multi-agent systems.
          </p>
          <p>This methodology consists of three phases: System Specification, architectural design
and the detailed design.</p>
          <p>
            The MDA based development process is an iterative process, allowing incremental
development and provides the rollback possibility to previous phase [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ]. It consists
of describing the system as different models expressed with various levels of
abstraction. In our case the first level of abstraction is the Computation Independent Model
(CIM) which corresponds to the analysis and goal capturing stages. The second level
is the Platform Independent Model (PIM) in which we define the agent models based
on Prometheus concepts. The third level is the Platform Specific Model (PSM). In our
case this level is dedicated to the JACK agent files generation. The last development
stage is the automatic Java code generation from PSM. Figure 2, shows the proposed
development approach stages.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Use Cases</title>
        <p>Goal Hierarchy
Capturing Goals
Analysis Overview Diagram
Goal Overview Diagram
System Scepcification</p>
        <p>System Roles Diagram
Agent Role Grouping Diagram</p>
        <p>Protocol Diagram
System Overview Diagram
Agent overview diagram /
Process Diagram
Architectural Design</p>
        <p>Detailed Design</p>
      </sec>
      <sec id="sec-3-3">
        <title>JACK Framework</title>
      </sec>
      <sec id="sec-3-4">
        <title>Java Code</title>
        <p>
          In the following sub-sections we describe the different modeling stages in which we
use the Prometheus Development Tool (PDT) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] to apply the proposed development
approach and generate the different needed diagrams and models.
        </p>
        <p>The Domain Analysis. The first modeling step is the global domain analysis and the
identification of the different use cases and actors. These elements are then used in the
goals capturing stage, which consists in defining the user and system general goals
(external and internal goals). Figure 3, shows the resulting goal hierarchy diagram.
Get Air Quality
prediction results</p>
        <p>Get Forecasting and
estimation results
Apply Prediction
algorithm</p>
        <p>Apply the CTM
model</p>
        <p>OLAP analysis
Query request
Get analysis</p>
        <p>Query results
Perform the Map
and Reduce tasks
Prepares extrenal
data files
External data
gathering</p>
        <p>Handle monitoring
data requests and
tasking
monitoring
data gathering
Handle meteorological Handle Domain Handle Emissions
data request data request data request
After the goals capturing, we can establish the PIM using the PDT and based on the
Prometheus models.</p>
        <p>
          The System Specification. In this stage we use the Prometheus analysis overview
diagram. This diagram is designed to show the interactions between the system and
the environment. At this abstract level we have to identify the actors, scenarios,
percepts and actions through two steps: The actors and the scenarios identification and
then the actions and percepts between the actors and the system definition. Figure 4,
illustrates a part of the system analysis diagram. Each specified scenario in this
diagram must then be associated with a goal using a scenario diagram which represents
the scenario aims. The analysis stage is followed by the goal overview diagram. In
this diagram, from each high-level goal several sub goals can be defined.
The Architectural Design. The next step is to transform the structured goals into
roles which are the building blocks used to define agent’s classes. After roles are
created, tasks are associated with each role and gives details about how the goal is
accomplished. The agent classes are then identified from the component role.
Furthermore, a Prometheus social diagram can be used to represent each agent, the
beliefs they have about the environment, the set of goals and sub-goals, and the different
plans to achieve them [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>The Detailed Design. Once we have established the complete roles diagram, we can
use the Architectural Design phase in order to group roles into agents using the Agent
Role Grouping diagram and introduce and develop agent interactions using the
protocol diagram and the system overview diagram.
4.3</p>
        <sec id="sec-3-4-1">
          <title>Code Generation</title>
          <p>
            The Prometheus Development tool is extended with the ability to generate skeleton
code in the JACK agent-oriented programming language [
            <xref ref-type="bibr" rid="ref16">16</xref>
            ] using a PDT code
generator extension which maintains also synchronization between the generated code and
the design when either of them changes.
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>The Resulting Multi-agent System Structuring</title>
      <p>According to the modeling process we can assign each generated agent to the suitable
component. Table 1, Shows the different agents assigned to system components and
Figure 5, illustrates an overview of the different agents and their interactions.
These agents represent all monitoring stations distributed in the study area and
provide the required functionality during the data extraction, transformation and loading
process. The Station agents are used to retrieve data from internal data sources (e.g.
Relational databases, and XML/Text files provided by stations) (See Figure 6). A
station agent is also responsible for data validation, accuracy, the type conversion, etc.
Collaboration between station’s agents will allow a better understanding of the
spatiotemporal evolution of surface air quality.</p>
      <p>Monitoring
stations</p>
      <p>Gathered</p>
      <p>Data</p>
      <p>Cleaned</p>
      <p>Data Area
Data transformation/</p>
      <p>loading
Station
Agent</p>
      <p>Data extraction/</p>
      <p>data loading</p>
      <p>Data Staging Area
DB</p>
      <p>
        DB
Responsible for the integration of data gathered from external sources (e.g.
MOZART2, LMDZ, WRF, EMEP) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and prepares all input data required for the
good functioning of the model agent. It sets up a register of emissions for the region
in order to make regional modeling and prepare the needed tropospheric emissions
data, domain data and meteorological parameters. Figure 7, shows this agent
interaction with the other system agents.
      </p>
      <p>MOZART2</p>
      <p>External data integration subsystem
LMDZ WRF</p>
      <p>EMEP</p>
      <p>Meteorological Data
- Geographic data 3D: Pressure, Humidity,
-coBnoduintidoanrsy 2WDin:du,*TQem0,pLe,raFtluurxe....</p>
      <p>Emission Data
Type, distribution,
nature
Monitoring data
integeration
subsystem
Station Agent</p>
      <p>CTM Subsystem</p>
      <p>BigData processing tools</p>
      <p>Data provider</p>
      <p>Agent
Model Agent</p>
      <p>User Interface</p>
      <p>Subsystem
User Interface</p>
      <p>
        Agent
A Model agent performs the deterministic modeling. It has in charge the functionality
corresponding to a chemistry-transport model, which brings together a set of
equations representing the transport and chemistry of gaseous species, allowing the
quantification of the evolution of a set of pollutants according to time on different domains,
taking into account all parameters (e.g. meteorological, boundary conditions,
emissions, etc.). This agent uses the resources provided by the Hadoop MapReduce
framework [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] in order to explicitly calculate and provide an average concentration
over a surface represented by the meshes of his grid. The following figure illustrates
this agent functioning.
A Prediction agent is responsible for the generation of the air quality’s long-term
prediction using an ANN, which is capable of modeling highly nonlinear relationships
while taking into account the data distribution factors. The strong capability of ANN
in predicting fuzzy data and the efficiency of this approach in modeling dynamic
systems has promoted their implementation in this work to predict air quality based on
gathered data (see Figure 9). Given the big amount of data, training time for ANN is
very large. To address this, we use a MapReduce based on an ANN training process
since the MapReduce programming model has the ability to rapidly process large
quantity of data in parallel. Also, In order to reduce the time cost of data loading, we
store the large scale training data-sets in Hadoop HBase [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], and concurrently load
one of them into the memory of computing nodes across the cluster when needed
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
5.5
      </p>
      <sec id="sec-4-1">
        <title>OLAP Agent</title>
        <p>
          The purpose of the OLAP agent is to convert the amount of monitoring data into
valuable information by applying quick and effective analysis and create various views
and representations of this data. It provides all the basic functionality of an OLAP
system and also the missing intelligence in traditional OLAP systems. The aim is
performing OLAP analyses on behalf of an agent or a user and reporting its result
back to the requesting entity and all other entities that should be informed [
          <xref ref-type="bibr" rid="ref18 ref19">18,19</xref>
          ].
5.6
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>User-interface Agent</title>
        <p>The user-interface agent enhances the ability of the system user to use and entirely
benefit from the DSS. It is responsible for all communications between the air quality
management center and the other agents in order to transmit raw data of air pollutant
concentrations measured by each station, data gathered by the data provider agent as
well as the forecasting and prediction results.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Data Modeling</title>
      <p>
        In this work all data are extracted and stored into a Hadoop HBase. HBase is a
database with high reliability, high performance, column storage, scalable characteristics
based on the Hadoop distributed file system (HDFS). Its goal is the hosting of very
large tables with billions of rows and millions of columns atop clusters of commodity
hardware [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. An HBase table is organized as key-value and each table contains a
series of row records. Through the HBase feature of column-oriented store
and versioning, the time-series data sets are built based on the primary key Row-key
and timestamp. The following Figure illustrates a part of the conceptual model
concerning the pollutant data and Table 2, shows an HBase table example.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Data Analysis</title>
      <p>
        The chemistry-transport estimation process uses a multi-phase MapReduce process to
get emissions of various time resolutions [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The data are loaded from the
monitoring stations, meteorological, geographical and emission databases. First, we perform a
data cleaning process using a single MapReduce phase. In the second Map stage (see
Figure 10), we use the cleaned data set files to calculate the Atmospheric Pollution
Index (API) by applying the Murena [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] method for each pollutant and applying the
K-means clustering algorithm [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] for the data analysis. Simultaneously, we use
meteorological, geographical, emission and boundary conditions data to generate a
spatiotemporal distribution of the pollutants [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. In the reduce phase, we store
intermediate results into the output database.
      </p>
      <p>
        In the third Map phase (see Figure 11) the forecasting model (CTM) is applied in
order to calculate the emission estimation in a given period. The intermediate results
use the pair of geographic zone identifier and timestamp as the key map and the
amount of emission as the value. In the Reduce phase, the amounts of emissions that
have the same key are accumulated together [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
8
8.1
      </p>
    </sec>
    <sec id="sec-7">
      <title>Case Study</title>
      <sec id="sec-7-1">
        <title>Description</title>
        <p>In this case we are interested in Marrakech-City, which is not an industrial city, but it
suffers from the effects of pollutants produced by vehicle exhaust systems. This study
is based on three stations that provide information and measures of the air pollutants
concentration (Jamaa EL Fna station, Mhamid station and Daoudiat station). The
study focused on the following pollutants: Sulfur Dioxide (SO2), Nitrogen dioxides
(NO2), Carbon Monoxide (CO), Particulate Matter, and Ozone (O3).
8.2</p>
      </sec>
      <sec id="sec-7-2">
        <title>Application</title>
      </sec>
      <sec id="sec-7-3">
        <title>Atmospheric Pollution Index Generation. The system generates the analysis que</title>
        <p>ries based on selected options from an easy to use user interface to get the resulting
API. Figure 12 shows the user interface to select the Ozone API during March 2009
in the Mhamid station. The result is shown in Figure 13.
Air Quality Prediction. We use a three-layer perceptron ANN model and data
concerning the study area described above to predict pollutants level. We used six
neurons in the input layer including temperature, solar radiation, the NO2 concentration,
CO concentration and the wind velocity. The number of hidden layers and values of
neurons in each hidden layer are the parameters to choose in the model construction.
Therefore, one or two hidden layers and different value of neurons were chosen to
optimize the ANN performance. The last layer is the output, which consists of the
target of the prediction model. Here, O3 was used as the output variable and a
hyperbolic tangent sigmoid function was used as the transfer function. A year data set was
divided into two parts: 80% used for training the networks and the remaining 20%
employed in testing the networks. The mean square error was chosen as the statistical
criteria for measuring the network performance.</p>
        <p>The following graph shows the performance of the network above. It represents a
comparison between the observed and predicted Ozone concentration based on the
mean square error.
The main contribution of this work is the definition of a development process based
on big data and intelligent systems concepts. We have, through this paper presented
the implementation of an air quality management system over a distributed data
gathered from different monitoring stations and other external databases and managed
by using Hadoop to ensure a fast data loading, fast queries processing and an efficient
storage. The Hadoop highly efficient fault tolerant nature, flexibility, extensibility,
efficient load balancing and the platform-independent are also useful features for
development of any distributed process. We also have adopted an MDA based
approach and automatic rule transformations, in order to obtain an adaptive system. The
MDA principle is based on reusable model transformations to define specific platform
models. Thus, we used an agent oriented MDA approach based on a set of models that
are constantly evolving, reflecting current needs and which are associated to a set of
agents.</p>
        <p>The case study addresses the generation of pollutants API and performing
predictions using ANN. Our experimental results show that the algorithms deployed in the
large-scale data processing system is feasible and efficient. During the experiment, we
found that the data block size impacts the performance significantly. For big number
of small data blocks, the processing jobs increases the number of collaboration
during the Map and Reduce operation and decrease the performance, since Hadoop has
the advantage on handling large size of files.</p>
        <p>The perspectives of this work are the integration of multi-criteria decision support
tools for decision-making and the use of generated data to address the traffic
regulation issue.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Menut</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bessagnet</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al.:
          <source>CHIMERE</source>
          <year>2013</year>
          :
          <article-title>a model for regional atmospheric composition modeling</article-title>
          .
          <source>Journal of Geoscientific Model Development</source>
          .
          <volume>6</volume>
          ,
          <fpage>981</fpage>
          -
          <lpage>1028</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ivan</surname>
            ,
            <given-names>T.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Žarko</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vlastimir</surname>
            ,
            <given-names>D.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Predrag</surname>
            ,
            <given-names>M.Ž.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mladen</surname>
            <given-names>A.T.</given-names>
          </string-name>
          :
          <article-title>Air quality estimation by computational intelligence methodologies</article-title>
          .
          <source>THERMAL SCIENCE</source>
          .
          <volume>16</volume>
          ,
          <fpage>S493</fpage>
          -
          <lpage>S504</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raischel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lind</surname>
            ,
            <given-names>P.G.</given-names>
          </string-name>
          :
          <article-title>Air quality prediction using optimal neural networks with stochastic variables</article-title>
          .
          <source>Atmospheric Environment</source>
          .
          <volume>79</volume>
          ,
          <fpage>822</fpage>
          -
          <lpage>830</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>4. Apache Hadoop Documentation, http://hadoop.apache.org.</mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lammel</surname>
          </string-name>
          , R.:
          <source>Google's MapReduce Programming Model - Revisited. Science of Computer Programming</source>
          .
          <volume>70</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>30</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Fedra</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>AirWare: an urban and industrial air quality assessment and management information system</article-title>
          .
          <source>SATURN-EURASAP, Urban Air Quality Management Systems, Munich</source>
          , pp.
          <fpage>73</fpage>
          -
          <lpage>91</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>S.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chou</surname>
            ,
            <given-names>S.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>S.J.:</given-names>
          </string-name>
          <article-title>Multi-resolution spatiotemporal data mining for the study of air pollutant regionalization</article-title>
          .
          <source>In: 33rd Hawaiian International Conference on System Sciences, Island of Maui</source>
          , Hawaii (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Matejicek</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Spatial modelling of air pollution in urban areas with GIS: a case study on integrated database development</article-title>
          .
          <source>Advances in geosciences. 4</source>
          ,
          <fpage>63</fpage>
          -
          <lpage>68</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Johnson</surname>
            , I., Wilson,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The TimeMap project: developing time-based GIS display for cultural data</article-title>
          .
          <source>Journal of GIS in Archaeology</source>
          ,
          <volume>1</volume>
          ,
          <fpage>124</fpage>
          -
          <lpage>135</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohapatro</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A Study on Big Data Integration with Data Warehouse</article-title>
          .
          <source>International Journal of Computer Trends and Technology</source>
          .
          <volume>9</volume>
          (
          <issue>4</issue>
          ),
          <fpage>188</fpage>
          -
          <lpage>192</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Sokolova</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fernandez-Caballero</surname>
            <given-names>A</given-names>
          </string-name>
          .:
          <article-title>Modeling and implementing an agent-based environmental health impact decision support system</article-title>
          .
          <source>International Journal of Expert Systems with Applications</source>
          ,
          <volume>36</volume>
          (
          <issue>2</issue>
          ),
          <fpage>2603</fpage>
          -
          <lpage>2614</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Gloria</surname>
            ,
            <given-names>P.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <source>Recent Advances in Intelligent Decision Technologies. Lecture Notes in Computer Science</source>
          .
          <volume>4692</volume>
          ,
          <fpage>567</fpage>
          -
          <lpage>571</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Lavbic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rupnik</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <article-title>: Multi-Agent System for Decision Support in Enterprises</article-title>
          .
          <source>Journal of Information and Organizational Sciences</source>
          .
          <volume>33</volume>
          (
          <issue>2à</issue>
          ),
          <fpage>269</fpage>
          -
          <lpage>284</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Mellor</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scott</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uhl</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weise</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>MDA distilled: principles of Model-Driven Architecture</article-title>
          .
          <string-name>
            <surname>Addison-Wesley</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Padgham</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thangarajah</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winikoff</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>AUML protocols and code generation in the Prometheus design tool</article-title>
          .
          <source>In: 6th international joint conference on Autonomous agents and multiagent systems</source>
          , pp.
          <fpage>1374</fpage>
          -
          <lpage>1375</lpage>
          . Honolulu, Hawaii, (
          <year>2007</year>
          ) [doi&gt;10.1145/1329125.1329451]
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Busetta</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ronnquist</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hodgson</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lucas</surname>
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>JACK Intelligent Agents - Components for Intelligent Agents in Java</article-title>
          .
          <source>Technical report, Agent Oriented Software Pty. Ltd</source>
          , Melbourne, Australia (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Rong</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Furao</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yihua</surname>
          </string-name>
          , H.:
          <article-title>A Parallel Computing Platform for Training Large Scale Neural Networks</article-title>
          .
          <source>In: IEEE International Conference on Big Data</source>
          , pp.
          <fpage>376</fpage>
          -
          <lpage>384</lpage>
          . IEEE,
          <string-name>
            <surname>Silicon</surname>
            <given-names>Valley</given-names>
          </string-name>
          , CA (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Muhammad</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Development and implementation of air quality data mart for Ontario, canada: A case study of air quality in Ontario using OLAP tool</article-title>
          .
          <source>Master Thesis</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Foster</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGregor</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El-Masri</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A Survey of Agent-Based Intelligent Decision Support Systems to Support Clinical Management and Research</article-title>
          .
          <source>In: Proceedings of the Workshop on Multi-Agent Systems for Medicine, Computational Biology and Bioinformatics in association with the 4th International Joint conference on Autonomous Agents and Multi-Agent Systems</source>
          , pp.
          <fpage>16</fpage>
          -
          <lpage>34</lpage>
          . Utrecht,
          <string-name>
            <surname>Netherlands</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Junyan</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Junkui</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siqi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yue</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>A MapReduce Framework for On-road Mobile Fossil Fuel Combustion CO2 Emission Estimation</article-title>
          . In: International Conference on Geoinformatics, pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          . Shanghai (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Murena</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Measuring Air Quality over Large Urban Areas: Development and Application of an Air Pollution Index at the Urban Area of Naples</article-title>
          .
          <source>Atmospheric Environment</source>
          .
          <volume>38</volume>
          , pp.
          <fpage>6195</fpage>
          -
          <lpage>6202</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , Ma, H.,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          :
          <article-title>Parallel K-Means Clustering Based on MapReduce</article-title>
          .
          <source>In: Proc. 1st International Conference on Cloud Computing</source>
          , pp.
          <fpage>674</fpage>
          -
          <lpage>679</lpage>
          . Beijing, China (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheng</surname>
            ,
            <given-names>V.S.</given-names>
          </string-name>
          , XueZhi, W.,
          <string-name>
            <surname>Wubin</surname>
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Meteorological Data Analysis Using MapReduce</article-title>
          .
          <source>The Scientific World Journal</source>
          .
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>