<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Atmospheric Environment</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Airvlc: An application for real-time forecasting urban air pollution</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lidia Contreras Ochando</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universitat Polite`cnica de Vale`ncia. Spain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cristina I. Font Julia´n</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universitat Polite`cnica de Vale`ncia. Spain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Francisco Contreras Ochando</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universitat Polite`cnica de Vale`ncia. Spain</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ce`sar Ferri</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>LICONOC@UPV.ES</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Karppinen</institution>
          ,
          <addr-line>A, Kukkonen, J, Elola ̈hde, T, Konttinen, M</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Karppinen</institution>
          ,
          <addr-line>A, Kukkonen, J, Elola ̈hde, T, Konttinen, M</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Koskentalo</institution>
          ,
          <addr-line>T, and Rantakrans, E. A modelling sys-</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>ent pollutants (CO</institution>
          ,
          <addr-line>NO, PM2.5, NO2) in three</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2009</year>
      </pub-date>
      <volume>37</volume>
      <issue>15</issue>
      <fpage>2149</fpage>
      <lpage>2157</lpage>
      <abstract>
        <p>This paper presents Airvlc, an application for producing real-time urban air pollution forecasts for the city of Valencia in Spain. Although many cities provide air quality data, in many cases, this information is presented with significant delays (three hours for the city of Valencia) and it is limited to the area where the measurement stations are located. The application employs regression models able to predict the levels of four differdifferent locations of the city. These models are trained using features that represent traffic intensity, persistence of pollutants and meteorological parameters such as wind speed and temperature. We compare different learning techniques to get the better performance in the prediction of pollutants. According to our experiments, ensembles of decision trees (Random Forest) outperforms the rest of methods in almost all of our tests.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Airvlc incorporates the best regression models
and, by a distance-weighted combination of the
predictions, is able to generate a real-time
pollution map of the city of Valencia. The application
also includes a warning system for sending
notifications to users when a nearby risk pollution
concentration is detected.
authors. Copying permitted for private and academic purposes.</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        Air pollution can have important impact (short and
longterm) on the health of people. For instance, urban air
pollution increases the risk of suffering respiratory diseases
such as pneumonia, or chronic, such as lung cancer or
cardiovascular disease
        <xref ref-type="bibr" rid="ref10">(World Health Organisation, 2015)</xref>
        . A
recent work
        <xref ref-type="bibr" rid="ref8">(Wilker et al., 2015)</xref>
        relates long-term
exposure to ambient air pollution to structural changes in the
brain.The SOER 2015 report
        <xref ref-type="bibr" rid="ref6">(The European Environment
Agency , 2015)</xref>
        , with data about the European Union
countries’ air quality in 2011, concludes that although the
atmosphere in the continent has improved in the last decades,
there are significant traces of the most harmful
contaminants. In fact, in 2011, the report estimates that 430.000
Europeans died prematurely because of pollution.
Although some governments are introducing restriction
policies that limit the use of vehicles (main source of
pollution in most cases), only in Europe, important cities such
as Paris, Naples, Moscow, Milan or Barcelona still report
significant levels of urban pollution in 2015
        <xref ref-type="bibr" rid="ref6">(The European
Environment Agency , 2015)</xref>
        . In this context, it is
important for citizens of urban agglomerations to reduce the
exposition to urban air pollution as much as possible. This is
especially relevant for high risk population such as: kids,
elderly people, asthmatics or people suffering respiratory
diseases.
      </p>
      <p>In this work we present an application that predicts urban
air pollution in real time by employing historical data. The
application is based on the city of Valencia in Spain. This
city can be considered a medium size urban agglomeration
(around 1.000.000 inhabitants). The city provides an open
data site containing real-time information about the city in
different aspects such as traffic data, noise sensors, pollen
sensors... Although different sensors of urban pollution air
are included in the site, this information needs to be
carefully verified and it is published with a delay of three hours.
This delay can represent a problem since risky high levels
of pollutions are not detected in real-time. Additionally, the
network of sensors is limited (six in the city of Valencia).
Considering these limitations, we have developed an
application able to display in real-time foreseeable levels of
pollution in a wide number of points of the city. The
application is based on the predictions of regression models
that are trained using features that represent traffic
intensity, persistence of pollutants and meteorological
parameters.</p>
      <p>The paper is organised as follows. Section 2 details the
process of data recollection of pollution particles and the
factors that affect the generation, concentration or
dispersion of these pollutants. Experiments in learning
regression models for predicting the pollutant concentrations are
included in Section 3. The Airvlc application is detailed
in 4. Related works are discussed in Section 5. Finally,
Section 6 closes the paper with a discussion of the main
conclusions and some plans for future work.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Data collection</title>
      <p>
        Different particles are associated with urban air pollution.
In order to measure air contamination, pollutant parameters
found in the lower levels of the troposphere are controlled.
Air quality sensors measure concentrations of particles that
have an anthropogenic origin and produce effects during
or after the inhalation by humans. The historical
pollution data for this work has been obtained from the open
data web of the Generalitat Valenciana1. Following the
recommendations of
        <xref ref-type="bibr" rid="ref6">(The European Environment Agency
, 2015)</xref>
        , we concentrate on the following particles:
• PM 2.5 (Suspended particles below 2.5 microns):
This parameter has been chosen because of its
pollutant power. It is one of the most dangerous
particles, since its size makes it almost unstoppable by the
natural filters of the body. This fact means that the
PM 2.5 are usually able to reach the pulmonary
alveoli and in some cases, these particles are attached to
these alveoli with a consequent reduction of lung
capacity; in worst cases, the particles cross the alveolar
membranes and reach the blood stream. Considering
that PM 2.5 particles have its origin in anthropogenic
activities (especially in the use of fuels in motor
vehicles), it is not surprising that its atomic structure
contains heavy metals, extremely toxic to the human
1http://www.cma.gva.es/cidam/emedio/
atmosfera/jsp/historicos.jsp
body. Atmospheric conditions in the Mediterranean
coast of Spain can influence the particle levels, due
to lower rainfall and wind action with respect to other
northern Europe countries, and the North African
particles (Saharan dust), PM10 and PM2.5.
• NO (Nitrogen monoxide): Nitrogen monoxide is a
highly unstable compound; it causes nitrogen dioxide
by quickly reacting in the atmosphere. This instability
makes the nitrogen monoxide a radical, namely, a high
reactive power molecule, whose effects on the body
are abnormal DNA, lipids and proteins. This kind
of changes derives in the medium and long term as
a greater chance of developing cancer.Its origin stems
largely from vehicle engines.
• NO2 (Nitrogen dioxide): Nitrogen dioxide is not a
directly generated pollutant, since its presence in the
atmosphere is caused by the oxidation of nitrogen
monoxide. In the presence of moisture, this
compound results in nitric acid, and its inhalation, even
in low concentrations, can cause lung tissue
degradation, as well as can reduce the efficacy of the immune
system, especially in children.
• CO (Carbon monoxide): Carbon monoxide is a
primary pollutant. CO is toxic; it prevents oxygen
transport by poisoning the blood, since it replaces the
haemoglobin. People with cardiovascular and
cerebrovascular problems could suffer heart attacks or
strokes because of problems related to high
concentrations of CO.
      </p>
      <p>The distribution of air pollution is decisively influenced
by climatic conditions. We have collected Climatological
observations for the meteorological data of Valencia city
from Meteorological Agency of the Government of Spain
(AEMET)2. We consider the following parameters:
• Temperature: In an ordinary atmosphere situation,
temperature decreases with altitude, favouring
ascension of warmer (and less dense) air, and dragging
contaminants upwards. In a situation of thermal inversion,
a warmer layer of air is over the colder surface air and
prevents the rise of this last (denser), so the
contamination is confined and increases.
• Humidity: Humidity is a weather factor to be
considered; in its presence, nitrogen dioxide derives in nitric
acid, harmful to human health.
• Wind speed: Strong winds can disperse pollutants
and transport them away from their emission point.</p>
      <sec id="sec-3-1">
        <title>2http://www.aemet.es/</title>
        <p>• Precipitations Precipitations wash contaminants and
the levels of pollutants.</p>
        <p>can dissolve substances and gases.</p>
        <p>The two main sources of pollution in developed countries
are motor vehicles and industry.</p>
        <p>Vehicles release large
amounts of nitrogen oxides, carbon oxides, hydrocarbons
and particulates when burning gasoline and diesel.
Therefore, we need to measure the level of traffic in the city in
order to predict the air pollution. For this purpose, the City
of Valencia provides a network of sensors (electromagnetic
coils) that measure the intensity of traffic (Vehicles/hour)
in the city. This data can be found in the open data site of
the Valencia City Council3.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Experiments</title>
      <p>With all the selected parameters, we have built datasets
aimed to predict the concentration of pollutants from the
intensity of traffic and weather parameters. Concretely, we
have collected data for a period of two years (2013 and
2014). Data was collected every 60 minutes, 24 hours a
day during those two years. Although Valencia city has
six stations for the detection and measurement of air
pollution, three of them have not sufficient data for the analysed
period and were discarded. In this way we collected data
from these stations: Mol`ı, Avd Francia and Pista de Silla.
These three stations are located inside the urban
agglomeration, and thus most of the pollutants measured in the
sensors should be generated by urban activities (mainly
traffic). For each one of these stations, we create a dataset with
the level of the pollutants measured and parameters that can
affect these measurements, we concentrate on traffic level
(measured by electromagnetic coils), weather conditions.
In order to measure the traffic related to each air pollution
station, we average the traffic intensity of the closest six
traffic measurament sensors. This is a simplification since,
certainly, all the traffic of the city has effect on the
measured level of all the stations in the city.</p>
      <p>We can see a summary of the three datasets in Table 1. This
table includes averages and standard deviation for the three
stations of the pollutant particles measured and the
intensity of traffic associated with each station. If we analyse
traffic intensity, Avd Francia is the busiest station, while
the other two have similar values. With regard to pollution
levels Pista de Silla station presents the maximum levels
for three parameters. The only exception is PM2.5. This
behaviour can probably be associated with the specific
location of the stations: While Pista de Silla station is located
in a the central part of the city, and therefore more
vulnerable to the overall city pollution, the other two are in the
suburbs of the city where external air streams can reduce</p>
      <sec id="sec-4-1">
        <title>3http://www.valencia.es/ayuntamiento/</title>
        <p>DatosAbiertos.nsf/</p>
        <p>We first study the weekly evolution of pollutants in the
three stations. Figure 1 shows the evolution of the
average of the four parameters of pollution analysed and the
average traffic intensity for Mol`ı station depending on the
day of the week. Figure 2 presents the same plot for Avd
Francia station and Figure 3 corresponds to Pista de Silla
station. In order to make the values comparable in the plot
we normalise each parameter by the maximum value of
that parameter. The level of pollutants and traffic reach the
maximum levels during the working days of the week for
the three stations (Friday seems to be the worst day). We
can clearly see the dependency of the four parameters of
pollution on the traffic intensity level. During the
weekend days, the level of traffic drastically descends and
associated with this reduction the levels of pollutants
significantly drop. Again, the exception is PM2.5. This behaviour
can be caused because these particles can be generated by
all types of combustion activities (motor vehicles, power
plants, wood burning, etc.) and certain industrial processes
(US Environmental Protection Agency , 2015).
We have performed a similar analysis considering the
evolution of pollutants, traffic intensity and meteorological
variables during a day (humidity and wind). Figure 4 shows
the evolution of the daily average of these parameters for
Mol`ı station depending on the hour of the day . Figure 5
corresponds to Avd Francia station and Figure 6 to Pista
de Silla station. Again, we normalise each parameter by
the maximum value of that parameter. If we observe
traffic intensity, we can discover in all the three plots a similar
behaviour, there are three peaks in traffic intensity
corresponding to the hours where workers travel to their work
places (around 9 am), lunch time (around 2 pm) and an
evening period (around 8 pm). In the three stations the
maximum of pollution parameters is found at the same
period of the first peak in traffic intensity (around 9 am). In
the second peak of traffic intensity (around 2 pm) the
levels of pollutants does not follow the increase in traffic. In
fact, after the maximum period around 9 am, pollutants
decrease their levels until around 4 pm where they change the
behaviour and start an increasing of the values. The second
peak in pollutant values is found around 9 pm. Our
intuition with respect to this behaviour is that wind disperses
part of the pollutant in the most sunny hours. Valencia is
in the Mediterranean coast and in this city it is easy to find
(especially in summer) sea breezes. These kind of winds
are created over bodies of water (usually sea or big lakes)
near land due to differences in air pressure created by their
different heat capacity. This phenomenon can be detected
in the plots if we observe the increase in wind strength
during the midday hours. Finally, we observe a strange and
different behaviour of the CO particle in Mol´ı station. For
this pollutant there is a second peak in the midday period.
This behaviour probably corresponds to an extra source of
pollution that needs to be further studied.</p>
        <p>As stated previously, we are interested in predicting
pollution levels in real time. Since these levels are only made
public with a delay of three hours, we need to produce a
prediction model from real time features. We extract the
following set of features from the data collected from
different sources (detailed in the previous section):
• Climatological features: Temperature (Celsius
degrees),</p>
        <sec id="sec-4-1-1">
          <title>Relative humidity (Percentage), Pressure</title>
          <p>(hPa), Wind speed (km/h), Rain (mm/h)
• Calendar features: Year, Month, Day in the month,</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>Day in the week, Hour</title>
          <p>
            • Traffic intensity features: Traffic level in the
surrounding stations (vehicles/hour), traffic level 1, 2, 3
• Pollution features: Pollution level in the target
staand 24 hours before
tion 3 and 24 hours before
With this goal we compare several regression learning
techniques from R
            <xref ref-type="bibr" rid="ref6 ref8">(R Core Team, 2015)</xref>
            in order to identify the
technique that is able to better predict the levels of
pollution. To test the prediction ability of different models, we
learn the models using as training data the registers of 2013
and the first nine months of 2014. We test the models with
the last three months of 2014. We use Mean Squared Error
(MSE) as a performance measure. Concretely, we employ
the following techniques for learning regression models (all
of them with the default parameters, unless stated
otherwise): Linear Regression (lr) (Hornik et al., 2009),
            <xref ref-type="bibr" rid="ref1">quantile regression (qr) (Koenker, 2015</xref>
            ) with lasso method, K
nearest neighbours (IBKreg) with k = 10 (Hornik et al.,
2009) , a decision tree for regression (M5P) (Hornik et al.,
2009), Random Forest (RF) (Liaw &amp; Wiener, 2002),
Support Vector Machines (SVM) (Meyer et al., 2014) and
Neural Networks (Venables &amp; Ripley, 2002). In order to
compare the predictive performance of these models, we also
introduce three baseline models: A model that always
predicts the mean of the train data (TrainMean), a model that
always predicts the mean of the test data (TestMean), and a
basic model that predicts the same value of the target
pollutant 3 hours before (X3H).
          </p>
          <p>Table 2 contains the MSE of the regression models for the
prediction of the four target pollution levels of the Mol´ı
station. Results for Pista de Silla station and Avd Francia
station are shown in Table 4 and 3 respectively. If we analyse
these results, we can conclude that learned models are
improving the performance of the basic baseline models in
almost all cases. When we compare the learning techniques
in the three tables, the ensemble of decision trees technique
lse .07
v
(random forest) is the best model in almost all of cases.
These results are in concordance with (Singh et al., 2013)
where ensembles of trees outperformed other approaches
such as SVMs.</p>
          <p>Molí
Sunday Monday Tuesday Weds. Thursday Friday</p>
          <p>Sunday Monday Tuesday Weds. Thursday Friday
Saturday</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Airvlc</title>
      <p>In the previous section we have analysed how to obtain
real-time air pollution predictions from a given set of
features. In this section we summarise Airvlc, a mobile app for
Android and iOS and a web application4. This application
generates from the regression models a map of the city of
Valencia showing the predicted intensity of pollution
levels. The application also allows the user to configure a set
of automatic warnings every time a pollution threshold is
reached near the position of the mobile device.</p>
      <sec id="sec-5-1">
        <title>4http://airvlc.lidiacontreras.com/</title>
        <p>0
.
9
.
0
8
.
0
0
5
.
0
4
.
0
3
.
0</p>
        <p>Mol´ı
Francia
Silla
users. However, showing just a concentration value of each
parameter is not very useful for most users, since most of
them are not experts in pollutants and they could not
interpret correctly these numbers. In order to improve the
comprehensibility of the predictions we have established
three ranges of risk represented as speedometer: Low risk
(green) corresponds to a measurement that is safe; Medium
risk (yellow) when concentrations reach levels to cause
harmful effects in people sensitive to air pollution
exposure (kids, elderly people...); High risk (red) when
concentrations can cause acute and chronic effects to anyone,
especially those with sensitivity.</p>
        <p>The ranges of risk shown by the application from the
predicted values of the four pollutants are based on the
recommendations of the Directive 2008/50/EC (European
Comission, 2008). The variable as NOx (oxides of nitrogen)
refers to NO or NO2, since the normative establishes the
same limits for both levels.</p>
        <p>• Green level: [NOx] &lt; 14.0 μg/m3 ∧ [CO] &lt; 30.0
mg/m3 ∧ [PM 2.5] &lt; 7.5 μg/m3.
• Yellow level: We establish medium risk (yellow level)
if the levels do not satisfy the conditions of the green
level and the red level.
• Red level: [NOx] ≥190.0 μg/m3 ∨ [CO] ≥ 55.0
mg/m3 ∨ [PM 2.5] ≥ 25.0 μg/m3
4.3. Risk warnings
Airvlc mobile application can be configured to send
warnings to users if the device is near to a zone (200 meters
approximately) where a high risk level is predicted. These
warnings can be personalised by the user in different ways.
For example, the user can establish personal limits for
warnings or modify the range of distance for the detection
of high risk levels of pollutant concentration. Obviously,
the user needs to allow the application to know the actual
GPS location of the device
In the case of the web application, given that here it is more
complex to know the exact location of the user, we adopt a
different strategy. We are working in an automated warning
system where the user needs to fix a set of areas, and then
the system sends an electronic email whenever a dangerous
situation (high risk level by default) is detected.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Related work</title>
      <p>A wide number of works employs machine learning
techniques or statistical approaches for predicting pollution
levels. A classical work is (Yi &amp; Prybutok, 1996). In this
paper, the authors propose ozone prediction models.
Specifically, they develop a neural network model for forecasting
daily maximum ozone levels and compare it to previous
approaches by regression, and Box-Jenkins ARIMA. The
results show that the neural network model improves the
performance of the regression and Box-Jenkins ARIMA
models tested. Neural networks models have been widely</p>
      <p>6
lse .0
v
e
L
employed in this field, a review of these approaches can be
found in (Khare &amp; Nagendra, 2006).</p>
      <p>A more related work is (Karppinen et al., 2000a). Here
the authors propose a modelling system for predicting the
traffic volumes, emissions from stationary and vehicular
sources, and atmospheric dispersion of pollution in an
urban area.</p>
      <sec id="sec-6-1">
        <title>They employ four monitoring stations in the</title>
        <p>Helsinki metropolitan area in 1993. The paper compares
the predicted NOx and NO2 concentrations with the results
of an urban air quality monitoring network. The agreement
of model predictions was better for the two suburban
monitoring stations, compared with two urban stations. Some
applications of these models are introduced in (Karppinen
et al., 2000b). A similar work for the city of Izmir in Turkey
is (Elbir, 2003).</p>
        <p>Here, the authors compare The
CALMET meteorological model and its puff dispersion model
CALPUFF for predicting dispersion of the sulphur dioxide
emissions from industrial and domestic sources.
Another related work, and in this case very recent, is
(Donnelly et al., 2015). This paper presents a model for real
time air quality forecasts. The predictions are concentrated
in nitrogen dioxide (NO2) and they are used to estimate
air quality 48 hours in advance. The model is based on
a multiple linear regression which uses linearised factors
describing variations in concentrations together with
meteorological parameters and persistence as predictors.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>6. Conclusions</title>
      <p>Air pollution can decrease life expectancy since
contamination rises the risk of suffering respiratory diseases.
Although policies motivating the reduction of emissions of
pollutant particles have been introduced in the last years,
many cities frequently still present risky levels of air
pollution. In these situations, the reduction of the exposure to
ambient air pollution is highly recommended. In this work,
we have presented Airvlc, an application that predicts in
real-time the levels of four dangerous pollutants in a wide
set of points in the city of Valencia. The system is able to
predict these pollution levels by applying regression
models trained from data containing information traffic
intensity, persistence of pollutants and meteorological
parameters. Airvlc can be a useful tool for avoiding risky locations
in terms of air pollution.</p>
      <p>As future work we propose the integration of the
application in middleware platforms such as Fi-Ware5, this could
help to extend the applicability of the system to other cities
or regions. We also are interested in the incorporation of
additional features in order to improve the prediction
models: wind direction, sand storms, forest wildfires and
agricultural burnings... Finally, the use of the tool for the
recommendation of routes that minimise the exposure to air
pollution.</p>
      <p>Our comparison of regression techniques obtains similar</p>
      <sec id="sec-7-1">
        <title>5http://www.fiware.org/</title>
        <p>Donnelly, Aoife, Misstear, Bruce, and Broderick, Brian.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>We thank the anonymous reviewers for their comments,
which have helped to improve this paper significantly.
We are also grateful to Ajuntament de Vale`ncia, InnDEA
Vale`ncia and specially to Ramo´n Ferri, Ruth Lo´pez and
Paula Llobet for their help in providing traffic data. This
work was supported by the REFRAME project, granted by
the European Coordinated Research on Long-term
Challenges in Information and Communication Sciences &amp;
Technologies ERA-Net (CHIST-ERA), and funded by the
Ministerio de Econom´ıa y Competitividad in Spain
(PCIN2013-037). It also has been partially supported by the EU
(FEDER) and the Spanish MINECO project ref.
TIN201345732-C4-01 (DAMAS), and by Generalitat Valenciana
ref. PROMETEOII/2015/013 (SmartLogic).</p>
      <p>Real time air quality forecasting using integrated
parametric and non-parametric regression techniques.
Atmo</p>
      <p>Elbir, Tolga. Comparison of model predictions with the
data of an urban air quality monitoring network in izmir,
2003.</p>
      <p>2008.</p>
      <p>European Comission. Directive 2008/50/ec of the european
parliament on ambient air quality and cleaner air for
europe. http://ec.europa.eu/environment/
Hornik, Kurt, Buchta, Christian, and Zeileis, Achim.
Opensource machine learning: R meets Weka.</p>
      <p>Computas00180-008-0119-7.
Koskentalo, T. A modelling system for predicting urban
air pollution:: comparison of model predictions with the
data of an urban measurement network in helsinki.
Attem for predicting urban air pollution: model description
and applications in the helsinki metropolitan area.
AtmoKhare, Mukesh and Nagendra, SM Shiva. Artificial neural
networks in vehicular pollution modelling, volume 41.</p>
      <sec id="sec-8-1">
        <title>R-project.org/.</title>
        <p>Identifying pollution sources and predicting urban air
quality using ensemble learning methods. Atmospheric</p>
        <p>The European Environment Agency . Soer 2015 — the
european environment — state and outlook 2015. http:</p>
        <sec id="sec-8-1-1">
          <title>US Environmental Protection Agency .</title>
        </sec>
        <sec id="sec-8-1-2">
          <title>Particulate</title>
          <p>matter (pm) regulations.</p>
        </sec>
        <sec id="sec-8-1-3">
          <title>Venables, W. N. and Ripley, B. D.</title>
          <p>Modern Applied
Statistics with S.</p>
        </sec>
        <sec id="sec-8-1-4">
          <title>Springer, New York, fourth edi</title>
          <p>tion, 2002. URL http://www.stats.ox.ac.uk/
pub/MASS4. ISBN 0-387-95457-0.
Yi, Junsub and Prybutok, Victor R. A neural network</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Quantile</given-names>
            <surname>Regression</surname>
          </string-name>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          URL http://CRAN.R-project.org/ package=quantreg.
          <source>R package version 5</source>
          .11.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>R</given-names>
            <surname>News</surname>
          </string-name>
          ,
          <volume>2</volume>
          (
          <issue>3</issue>
          ):
          <fpage>18</fpage>
          -
          <lpage>22</lpage>
          , URL http://CRAN.R-project.org/ Meyer, David, Dimitriadou, Evgenia, Hornik, Kurt, Weingessel, Andreas, and
          <string-name>
            <surname>Leisch</surname>
          </string-name>
          , Friedrich.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>e1071: Misc Functions of the Department of Statistics (e1071)</source>
          ,
          <source>TU Wien</source>
          ,
          <year>2014</year>
          . URL http://CRAN.R-project.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>org/package=e1071. R package version 1</source>
          .6-
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>R Core</given-names>
            <surname>Team. R:</surname>
          </string-name>
          <article-title>A Language and Environment for Statistical Computing</article-title>
          . R Foundation for Statistical Computing, Vienna, Austria,
          <year>2015</year>
          . URL http://www.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          //www.eea.europa.eu/soer,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Wilker</surname>
            ,
            <given-names>Elissa H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Preis</surname>
          </string-name>
          , Sarah R., Beiser, Alexa S.,
          <string-name>
            <surname>Wolf</surname>
            ,
            <given-names>Philip A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Au</surname>
          </string-name>
          , Rhoda, Kloog, Itai, Li, Wenyuan, Schwartz, Joel, Koutrakis, Petros, DeCarli, Charles, Seshadri, Sudha, and
          <article-title>Mittleman, Murray A. Long-Term Exposure to Fine Particulate Matter, Residential Proximity to Major Roads and Measures of Brain Structure</article-title>
          . Stroke,
          <year>April 2015</year>
          . doi:
          <volume>10</volume>
          .1161/strokeaha.114.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>URL http://dx.doi.org/10.1161/ strokeaha.114.008348.</mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          databases/health_impacts/en/,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>