<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Cooperative Kernel-Based Forecasting in Decentralized Multi-Agent Systems for Urban Traffic Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jelena Fiosina</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maksims Fiosins</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institure of Informatics, Clausthal University of Technology</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>3</fpage>
      <lpage>7</lpage>
      <abstract>
        <p>The distributed and often decentralised nature of complex stochastic traffic systems having a large amount of distributed data can be represented well by multi-agent architecture. Traditional centralized data mining methods are often very expensive or not feasible because of transmission limitations that lead to the need of the development of distributed or even decentralized data mining methods including distributed parameter estimation and forecasting. We consider a system, where drivers are modelled as autonomous agents. We assume that agents possess an intellectual module for data processing and decision making. All such devices use a local kernelbased regression model for travel time estimation and prediction. In this framework, the vehicles in a traffic network collaborate to optimally fit a prediction function to each of their local measurements. Rather than transmitting all data to one another or a central node in a centralized approach, the vehicles carry out a part of the computations locally by transmitting only limited amount of data. This limited transmission helps the agent that experiences difficulties with its current predictions. We demonstrate the efficiency of our approach with case studies with the analysis of real data from the urban traffic domain.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Multi-agent systems (MAS) often deal with complex applications,
such as sensors, traffic, or logistics networks, and they offer a
suitable architecture for distributed problem solving. In such
applications, the individual and collective behaviours of the agents depend
on the observed data from distributed sources. In a typical distributed
environment, analysing distributed data is a non-trivial problem
because of several constraints such as limited bandwidth (in wireless
networks), privacy-sensitive data, distributed computing nodes, etc.
The distributed data processing and mining (DDPM) field deals with
these challenges by analysing distributed data and offers many
algorithmic solutions for data analysis, processing, and mining using
different tools in a fundamentally distributed manner that pays
careful attention to the resource constraints [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Traditional centralized data processing and mining typically
requires central aggregation of distributed data, which may not always
be feasible because of the limited network bandwidth, security
concerns, scalability problems, and other practical issues. DDPM
carries out communication and computation by analyzing data in a
distributed fashion [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The DDPM technology offers more efficient
solutions in such applications.
      </p>
      <p>In this study we focus on the urban traffic domain, where many
traffic characteristics such as travel time, travel speed, congestion
probability, etc. can be evaluated by autonomous agents-vehicles in
a distributed manner. In this paper, we present a general distributed
regression forecasting algorithm and illustrate its efficiency in
forecasting travel time.</p>
      <p>
        Numerous data processing and mining techniques were suggested
for forecasting travel time in a centralized and distributed manner.
Statistical methods, such as regression and time series, and artificial
intelligence methods, such as neural networks, are successfully
implemented for similar problems. However, travel time is affected by a
range of different factors. Thus, accurate prediction of travel time is
difficult and needs considerable amount of traffic data.
Understanding the factors affecting travel time is essential for improving
prediction accuracy [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>We focus on non-parametric, computationally intensive
estimation, i.e. Kernel-based estimation, which is a promising technique
for solving many statistical problems, including parameter
estimation. In our paper, we suggest a general distributed kernel-based
algorithm and use it for forecasting travel time using real-world data from
southern Hanover. We assume that each agent autonomously
estimates its kernel-based regression function, whose additive nature fits
very well with streaming real-time data. When an agent is not able to
estimate the travel time because of lack of data, i.e., when it has no
data near the point of interest (because the kernel-based estimation
uses approximations), it cooperates with other agents. An algorithm
for multi-agent cooperative learning based on based on transmission
of the required data as a reply to the request of the agent
experiencing difficulties was suggested. After obtaining the necessary data
from other agents, the agent can forecast travel-time autonomously.</p>
      <p>
        The travel-time, estimated in the DDPM stage, can serve as an
input for the next stage of distributed decision making of the intelligent
agents [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>This study contributes in the following ways: It suggests (a) a
decentralized kernel-based regression forecasting approach, (b) a
regression model with a structure that facilitates travel-time
forecasting; and it improves the application efficiency of the proposed
approach for the current real-world urban traffic data.</p>
      <p>The remainder of this paper is organized as follows. Section 2
describes the related previous work in the DDPM field for MAS, kernel
density estimation, and travel-time prediction. Section 3 describes
the current problem more formally. Section 4 presents the
multivariate kernel-based regression model adopted for streaming data.
Section 5 describes the suggested cooperative learning algorithm for
optimal prediction in a distributed MAS architecture. Section 6 presents
a case study and the final section contains the conclusions.
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED PREVIOUS WORK 2.2 Travel Time Prediction Models Distributed Data Mining in Multi-agent</title>
      <p>
        Systems
A strong motivation for implementing DDPM for MAS is given by
Da Silva et al. in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], where authors argue that DDPM in MAS deals
with pro-active and autonomous agents that perceive their
environment, dynamically reason out actions on the basis of the
environment, and interact with each other. In many complex domains, the
knowledge of agents is a result of the outcome of empirical data
analysis in addition to the pre-existing domain knowledge. DDPM
of agents often involves detecting hidden patterns, constructing
predictive and clustering models, identifying outliers, etc. In MAS,
this knowledge is usually collective. This collective ’intelligence’
of MAS must be developed by distributed domain knowledge and
analysis of the distributed data observed by different agents. Such
distributed data analysis may be a non-trivial problem when the
underlying task is not completely decomposable and computational
resources are constrained by several factors such as limited power
supply, poor bandwidth connection, privacy-sensitive multi-party data.
      </p>
      <p>
        Klusch at al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] concludes that autonomous data mining agents,
as a special type of information agents, may perform various kinds
of mining operations on behalf of their user(s) or in collaboration
with other agents. Systems of cooperative information agents for data
mining in distributed, heterogeneous, or homogeneous, and massive
data environments appear to be quite a natural progression for the
current systems to be realized in the near future.
      </p>
      <p>
        A common feature of all approaches is that they aim at
integrating the knowledge that is extracted from data at different
geographically distributed network sites with minimum network
communication and maximum local computations. Local computation is carried
out on each site, and either a central site communicates with each
distributed site to compute the global models or a peer-to-peer
architecture is used. In the case of the peer-to-peer architecture, individual
nodes might communicate with a resource-rich centralized node, but
they perform most tasks by communicating with neighbouring nodes
through message passing over an asynchronous network [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        A distributed system should have the following features for the
efficient implementation of DDPM: the system consists of multiple
independent data sources, which communicate only through message
passing; communication between peers is expensive; peers have
resource constraints (e. g. battery power) and privacy concerns [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Typically, communication involves bottlenecks. Since
communication is assumed to be carried out exclusively by message passing,
the primary goal of several DDPM methods, as mentioned in the
literature, is to minimize the number of messages sent. Building a
monolithic database in order to perform non-distributed data processing
and mining may be infeasible or simply impossible in many
applications. The costs of transferring large blocks of data may be very
expensive and result in very inefficient implementations [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>Moreover, sensors must process continuous (possibly fast) streams
of data. The resource-constrained distributed environments of sensor
networks and the need for a collaborative approach to solve many
problems in this domain make MAS architecture an ideal candidate
for application development.</p>
      <p>
        In our study we deal with homogeneous data. However, a
promising approach to agent-based parameter estimation for partially
heterogeneous data in sensor networks was suggested in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Another
decentralized approach for homogeneous data was suggested in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]
to estimate the parameters of a wireless network by using a
parametric linear model and stochastic approximations.
      </p>
      <p>
        Continuous traffic jams indicate that the maximum capacity of a road
network is met or even exceeded. In such a situation, the modelling
and forecasting of traffic flow is one of the important techniques that
need to be developed [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Nowadays, knowledge about travel time
plays an important role in transportation and logistics, and it can be
applied in various fields and purposes. From travellers’ viewpoints,
the knowledge about travel time helps to reduce the travel time and
improves reliability through better selection of travel routes. In
logistics, accurate travel time estimation could help reduce transport
delivery costs and increase the service quality of commercial
delivery by delivering goods within the required time window by avoiding
congested sections. For traffic managers, travel time is an important
index for traffic system operation efficiency [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        There are several studies in which a centralized approach is used to
predict the travel time. The approach was used in various intelligent
transport systems, such as in-vehicle route guidance and advanced
traffic management systems. A good overview is given in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. To
make the approach effective, agents should cooperate with each other
to achieve their common goal via the so called gossiping scenarios.
The estimation of the actual travelling time using vehicle-to-vehicle
communication without MAS architecture was described in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>
        On the other hand, a multi-agent architecture is better suited for
distributed traffic networks, which are complex stochastic systems.
Further, by using centralized approaches the system cannot adapt
quickly to situations in real time, and it is very difficult to
transmit a large amount of information over the network. In centralized
approaches, it is difficult or simply physically impossible to store
and process large data sets in one location. In addition, it is known
from practice that the most drivers rely mostly on their own
experience; they use their historical data to forecast the travel time [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
Thus, decentralized multi-agent systems are are fundamentally
important for the representation of these networks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. We model our
system with autonomous agents to allow vehicles to make decisions
autonomously using not only the centrally processed available
information, but also their historical data.
      </p>
      <p>
        Traffic information generally goes through the following three
stages: data collection and cleansing, data fusion and integration, and
data distribution [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. The system presented in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] consists of three
components, namely a Smart Traffic Agent, the Real-time Traffic
Information Exchange Protocol and a centralized Traffic Information
Centre that acts as the backend. A similar architecture is used in
this study, but the prediction module, incorporated into Start
Traffic Agent (vehicle agent), is different. In our study we do not focus
on the transmission protocol describing only the information, which
should be sent from one node to another, without the descriptions of
protocol packets. The centralized Traffic Information Centre in our
model is used only for storing system information.
      </p>
      <p>
        The decentralized MAS approach for urban traffic network was
considered in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] also, where the authors forecast the traversal time
for each link of the network separately. Two types of agents were
used for vehicles and links, and a neural network was used as the
prediction model.
      </p>
      <p>
        For travel time forecasting different regression models can be
applied. Linear multivariate regression model for decentralized urban
traffic network was proposed in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This regression model is
wellstudied, is parametric and allows estimation of each variables
contribution. However is not sufficiently effective in the case of non-linear
systems. Alternatively non-parametric kernel-based regression
models can be applied. These models can be effectively used for any types
of systems, however are relatively new and not well-studied yet. The
efficiency of non-parametric kernel-based regression approach for
traffic flow forecasting in comparison to parametric approach was
made in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. In this study we apply non-parametric kernel
regression for the similar traffic system as in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] in order to increase the
prediction quality.
2.3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Kernel Density Estimation</title>
      <p>Kernel density estimation is a non-parametric approach for
estimating the probability density function of a random variable. Kernel
density estimation is a fundamental data-smoothing technique where
inferences about the population are made on the basis of a finite data
sample. A kernel is a weighting function used in non-parametric
estimation techniques.</p>
      <p>Let X1, X2, . . . , Xn be an iid sample drawn from some
distribution with an unknown density f . We attempt to estimate the shape of
f , whose kernel density estimator is</p>
      <p>
        n
fˆh(x) = 1 X K
nh
i=1
x − Xi ,
h
(1)
where kernel K(• ) is a non-negative real-valued integrable function
satisfying the following two requirements: R ∞ K(u)du = 1 and
−∞
K(−u) = K(u) for all values of u; h &gt; 0 is a smoothing
parameter called bandwidth. The first requirement to K(• ) ensures that the
kernel density estimator is a probability density function. The second
requirement to K(• ) ensures that the average of the corresponding
distribution is equal to that of the sample used [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Different kernel
functions are available: Uniform, Epanechnikov, Gausian, etc. They
differ in the manner in which they take into account the vicinity
observations to estimate the function from the given variables.
      </p>
      <p>
        A very suitable property of the kernel function is its additive
nature. This property makes the kernel function easy to use for
streaming and distributed data [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], the distributed
kernelbased clustering algorithm was suggested on the basis of the same
property. In this study, kernel density is used for kernel regression to
estimate the conditional expectation of a random variable.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>PROBLEM FORMULATION</title>
      <p>
        We consider a traffic network with several vehicles, represented as
autonomous agents, which predict their travel time on the basis of
their current observations and history. Each agent estimates locally
the parameters of the same traffic network. In order to make a
forecast, each agent constructs a regression model, which explains the
manner in which different explanatory variables (factors) influence
the travel time. A detailed overview of such factors is provided
in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The following information is important for predicting the
travel time [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]: average speed before the current segment, number
of stops, number of left turns, number of traffic light, average travel
time estimated by Traffic Management Centres (TMC). We should
also take into account the possibility of an accident, network
overload (rush hour) and the weather conditions.
      </p>
      <p>Let us consider a vehicle, whose goal is to drive through the
definite road segment under specific environment conditions (day, time,
city district, weather, etc.). Let us suppose that it has no or little
experience of driving in such conditions. For accurate travel time
estimation, it contacts other traffic participants, which share their
experience in the requested point.</p>
      <p>In this step, the agent that experiences difficulties with a forecast
sends its requested data point to other traffic participants in the
transmission radius. The other agents try to make a forecast themselves.
In the case of a successful forecast, the agents share their experience
by sending their observations that are nearest to the requested point.
After receiving the data from the other agents, the agent combines
the obtained results, increases its experience, and makes a forecast
autonomously.</p>
      <p>In short, decentralized travel-time prediction consists of three
steps: 1) local prediction; 2) in the case of unsuccessful prediction:
selection of agents for experience sharing and sending them the
requested data point; 3) aggregation of the answers and prediction.
4</p>
      <p>
        LOCAL PARAMETER ESTIMATION
The non-parametric approach to estimating a regression curve has
four main purposes. First, it provides a versatile method for exploring
a general relationship between two variables. Second, it can predict
observations yet to be made without reference to a fixed parametric
model. Third, it provides a tool for finding spurious observations by
studying the influence of isolated points. Fourth, it constitutes the
flexible method of substitution or interpolating between adjacent
Xvalues for missing values [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        Let us consider a non-parametric regression model [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] with a
dependent variable Y and a vector of d regressors X
      </p>
      <p>Y = m(x) + ǫ,
where ǫ is the disturbance term such that E(ǫ| X = x) = 0 and
V ar(ǫ| X = x) = σ2(x), and m(x) = E(Y | X = x). Further,
let (Xi, Yi)in=1 be the observations sampled from the distribution of
(X, Y ). Then the Nadaraya-Watson kernel estimator is
mn(x) =</p>
      <p>Pn</p>
      <p>i=1 K
Pn
i=1 K
x−Xi Yi
h
x−Xi
h
= pn(x) ,
qn(x)
where K(• ) is the kernel function of Rd and h is the bandwidth.
Kernel functions satisfy the restrictions from (1). In our case we have
a multi-dimensional kernel function K(u) = K(u1, u2, . . . , ud),
that can be easily presented with univariate kernel functions as:
K(u) = K(u1) · K(u2) · . . . · K(ud). We used the Gaussian kernel
in our experiments.</p>
      <p>
        Network packets are streaming data. Standard statistical and data
mining methods deal with a fixed dataset. There is a fixed size n for
dataset and algorithms are chosen as a function of n. In streaming
data there is no n: data are continually captured and must be
processed as they arrive. It is important to develop algorithms that work
with non-stationary datasets, handle the streaming data directly, and
update their models on the fly [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>This requires recursive windowing. The kernel density estimator
has a simple recursive windowing method that allows the recursive
estimation using the kernel density estimator:
mn(x) =
pn(x) =
qn(x)
pn−1(x) + K
qn−1(x) + K
x−Xn Yn
h
x−Xn
h
.
5</p>
      <p>DECENTRALISED MULTI-AGENT</p>
      <p>COOPERATIVE LEARNING ALGORITHM
In this section, we describe the cooperation for sharing the
prediction experience among the agents in a network. While working with
(2)
(3)
(4)
streaming data, one should take into account two main facts. The
nodes should coordinate their prediction experience over some
previous sampling period and adapt quickly to the changes in the
streaming data, without waiting for the next coordination action.</p>
      <p>Let us first discuss the cooperation technique. We introduce the
following definitions.</p>
      <p>Let L = { Lj | 1 ≤ j ≤ p} be a group of p agents. Each agent
Lj ∈ L has a local dataset Dj = { (Xjc, Ycj )| c = 1. . . . , N j } , where
Xjc is a d-dimensional vector. In order to underline the dependence
of the prediction function (3) from the local dataset of agent Lj , we
denote the prediction function by m[Dj ](x).</p>
      <p>Consider a case when some agent Li is not able to forecast for
i
some d-dimensional future data point Xnew because it does not have
i
sufficient data in the neighbourhood of Xnew. In this case, it sends
a request to other traffic participants in its transmission radius by
sending the data point Xnew to them. Each agent Lj that has received
i
the request tries to predict m[Dj ](Xnew). If it is successful, it replies
i
to agent Li by sending its best data representatives Dˆ(j,i) from the
neighbourhood of the requested point Xinew. Let us define Gi ⊂ L,
a group of agents, which are able to reply to agent Li by sending the
requested data.</p>
      <p>To select the best data representatives, each agent Lj makes a
ranking among its dataset Dj . It can be seen from (3) that each Ycj
is taken with the weight wcj with respect to Xnew, where
i
wcj =</p>
      <p>K
Pn
l=1 K</p>
      <p>j
Xinew−Xc
h</p>
      <p>j .</p>
      <p>Xinew−Xl
h
The observations with maximum weights wcj are the best candidates
for sharing the experience.</p>
      <p>All the data Dˆ(j,i), Lj ∈ Gi received by agent Li should be
verified, and duplicated data should be removed. We denote the new
dataset of agent Li as Dniew = SLj ∈Gi Dˆ(j,i). Then, the kernel
function of agent Li is updated taking into account the additive
nature of this function:
m[Dniew](x) =
m[ Dˆ(j,i)](x) + m[Di](x).</p>
      <p>(5)
X</p>
      <p>Lj ∈Gi</p>
      <p>Finally, agent Li can autonomously make its forecast as
m[Dniew](Xinew) for Xnew.</p>
      <p>i
6</p>
    </sec>
    <sec id="sec-5">
      <title>CASE STUDIES</title>
      <p>We simulated a traffic network in the southern part of Hanover
(Germany). The network contains three parallel and five perpendicular
streets, creating fifteen intersections with a flow of approximately
5000 vehicles per hour.</p>
      <p>
        The vehicles solve a travel time prediction problem. They receive
information about the centrally estimated system variables (such as
average speed, number of stops, congestion level, etc.) for this city
district from TMC, combine it with their historical information, and
make adjustments according to the information of other participants
using the presented cooperative-learning algorithm. In this case, the
regression analysis is an essential part of the local time prediction
process. We consider the local kernel based regression model (3) and
implement the cooperative learning algorithm (5). The factors are
listed in Table 1. The selection and significance of these variables
was considered in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>We simulated 20 agents having their initial experience represented
by a dataset of size 20 till each agent made 100 predictions, thus
s 4
n
o
it
a
c
inu 3
m
m
o
c
f
ro 2
e
b
m
u
N 1
0</p>
      <p>Time
making their common experience equal to 2400. We assumed the
maximal number of transmitted observations from one agent equals
2.</p>
      <p>During the simulation, to predict more accurately, the agents used
the presented cooperative learning algorithm that supported the
communication between agents with the objective of improving the
prediction quality. The necessary number of communications depends
on the value of the smoothing parameter h. The average number
of necessary communications is given in Figure 1. We can see the
manner in which the number of communications decreased with the
learning time. We varied h and obtained the relation between the
communication numbers and h as a curve. The prediction ability of
one of the agents is presented at Figure 2. Here, we can also see the
relative prediction error, which decreases with time. The predictions
that used communication between agents are denoted by solid
triangles, and the number of such predictions also decreases with the
time.</p>
      <p>
        The goodness of fit of the system was estimated using a
crossvalidation technique. We assume that each agent has its own training
set, but it uses all observations of other agents as a test set, so we
use 20-fold cross-validation. To estimate the goodness of fit, we used
analysis of variance and generalized coefficient of determination R2
that provides an accurate measure of the effectiveness of the
prediction of future outcomes by using the non-parametric model [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. The
calculated R2 values and the corresponding number of the
observations that were not predicted (because cooperation during testing was
not allowed) depending on h are listed in Table 2. From Figure 3 we
can also see how R2 is distributed among the system agents. The
results suggest that we should find some trade-off between system
Self prediction
      </p>
      <p>Prediction after communication
5
.</p>
      <p>2
rrr
o
itsneg .02
a
frceo .15
e
iltvaeR .01
5
.
0
0
.</p>
      <p>0
y
c
n
e
u
q
e
r
F
0
8
0
6
0
4
0
2
0</p>
      <p>Relative prediction error and communication frequency of a</p>
      <p>single agent over time
0.75
0.80
0.90</p>
      <p>0.95
0.85</p>
      <p>R2
accuracy (presented by R2) and the number of necessary
communications (presented by the percentage of not predicted observations),
which depend on h. The point of trade-off should depend on the
communication and accuracy costs.</p>
      <p>R2 goodness of fit measure using cross-validation for the whole
system for different h
Characteristic of System</p>
      <p>System average R2
Average % of not predicted observations
h=2</p>
      <p>
        A linear regression model [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] applied to the same data gives lower
average goodness of fit R2=0.77, however predictions can be
calculated for all data points.
7
      </p>
    </sec>
    <sec id="sec-6">
      <title>CONCLUSIONS</title>
      <p>In this study, the problem of travel-time prediction was considered.
Multi-agent architecture with autonomous agents was implemented
for this purpose. Distributed parameter estimation and cooperative
learning algorithms were presented, using the non-parametric
kernelbased regression model. We demonstrated the efficiency of the
suggested approach through simulation with real data from southern
Hanover. The experimental results show the high efficiency of the
proposed approach. In future we are going to develop a combined
approach that allows agent to choose between parametric and
nonparametric estimator for more accurate prediction.</p>
    </sec>
    <sec id="sec-7">
      <title>ACKNOWLEDGEMENTS</title>
      <p>The research leading to these results has received funding from the
European Union Seventh Framework Programme (FP7/2007-2013)
under grant agreement No. PIEF-GA-2010-274881. We thank Prof.
J. P. M u¨ller for useful discussions during this paper preparation.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.L</given-names>
            <surname>Bazzan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wahle</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Kluegl</surname>
          </string-name>
          , '
          <article-title>Agents in traffic modelling - from reactive to social behaviour'</article-title>
          ,
          <source>Advances in Artificial Intelligence (LNAI)</source>
          ,
          <volume>1701</volume>
          ,
          <fpage>509</fpage>
          -
          <lpage>523</lpage>
          , (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Claes</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Holvoet</surname>
          </string-name>
          , '
          <article-title>Ad hoc link traversal time prediction'</article-title>
          ,
          <source>in Proc. of the 14th Int. IEEE Conf. on Intelligent Transportation Systems</source>
          , (
          <year>October 2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.C.</given-names>
            da
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Giannella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bhargava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kargupta</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Klusch</surname>
          </string-name>
          , '
          <article-title>Distributed data mining and agents', Eng</article-title>
          . Appl. Artif. Intell.,
          <volume>18</volume>
          (
          <issue>7</issue>
          ),
          <fpage>791</fpage>
          -
          <lpage>801</lpage>
          , (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Fiosina</surname>
          </string-name>
          , '
          <article-title>Decentralised regression model for intelligent forecasting in multi-agent traffic networks'</article-title>
          ,
          <source>Distributed Computing and Artificial Intelligence (Advances in Intelligent and Soft Computing)</source>
          ,
          <volume>151</volume>
          ,
          <fpage>255</fpage>
          -
          <lpage>263</lpage>
          , (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fiosins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fiosina</surname>
          </string-name>
          , J. Mu¨ ller, and J. Go¨ rmer, '
          <article-title>Agent-based integrated decision making for autonomous vehicles in urban traffic'</article-title>
          ,
          <source>Advances on Practical Applications of Agents and Multiagent Systems (Advances in Intelligent and Soft Computing)</source>
          ,
          <volume>88</volume>
          ,
          <fpage>173</fpage>
          -
          <lpage>178</lpage>
          , (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] Handbook of Computational Statistics: Concepts and Methods, eds.,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Gentle</surname>
          </string-name>
          , W. Ha¨rdle, and Y. Mori, Springer, Berlin/Heidelberg,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bodik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Thibaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Paskin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Madden</surname>
          </string-name>
          , '
          <article-title>Distributed regression: an efficient framework for modeling sensor network data'</article-title>
          ,
          <source>in Proc. of the 3rd Int. Sym. on Information Processing in Sensor Networks</source>
          , Berkeley, (
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>W.</given-names>
            <surname>Ha</surname>
          </string-name>
          ¨rdle, Applied Nonparametric Regression, Cambridge University Press, Cambridge,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>W.</given-names>
            <surname>Ha</surname>
          </string-name>
          <article-title>¨rdle, M. Mu¨ ller, S. Sperlich, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Werwatz</surname>
          </string-name>
          , Nonparametric and Semiparametric Models, Springer, Berlin/Heidelberg,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <article-title>Advances in Distributed and Parallel Knowledge Discovery, eds</article-title>
          .,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kargupta</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Chan</surname>
          </string-name>
          , AAAI Press The MIT PressAcademic Press, Melno Park and Cambridge and London,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Klusch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lodi</surname>
          </string-name>
          , and G. Moro, '
          <article-title>Agent-based distributed data mining: The kdec scheme'</article-title>
          ,
          <source>in Proc. of Int. Conf. on Intelligent Information Agents - The AgentLink Perspective</source>
          , volume
          <volume>2586</volume>
          <source>of LNCS</source>
          , pp.
          <fpage>104</fpage>
          -
          <lpage>122</lpage>
          . Springer, (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>W.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tseng</surname>
          </string-name>
          , and W. Shieh, '
          <article-title>Collaborative real-time traffic information generation and sharing framework for the intelligent transportation system'</article-title>
          ,
          <source>Information Scienses</source>
          ,
          <volume>180</volume>
          ,
          <fpage>62</fpage>
          -
          <lpage>70</lpage>
          , (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zito</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.A.P.</given-names>
            <surname>Taylor</surname>
          </string-name>
          , '
          <article-title>A review of travel-time prediction in transport and logistics'</article-title>
          ,
          <source>in Proc. of the Eastern Asia Society for Transportation Studies</source>
          , volume
          <volume>5</volume>
          , pp.
          <fpage>1433</fpage>
          -
          <lpage>1448</lpage>
          , Hamburg, (
          <year>2005</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>G.</given-names>
            <surname>Malnati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Barberis</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.M.</given-names>
            <surname>Cuva</surname>
          </string-name>
          , 'Gossip:
          <article-title>Estimating actual travelling time using vehicle to vehicle communication'</article-title>
          ,
          <source>in Proc. of the 4-th Int. Workshop on Intelligent Transportation, Hamburg</source>
          , (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>C.E. McKnight</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          <string-name>
            <surname>Levinson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Kamga</surname>
            , and
            <given-names>R.E.</given-names>
          </string-name>
          <string-name>
            <surname>Paaswell</surname>
          </string-name>
          , '
          <article-title>Impact of traffic congestion on bus travel time in northern new jersey'</article-title>
          ,
          <source>Transportation Research Record Journal</source>
          ,
          <year>1884</year>
          ,
          <fpage>27</fpage>
          -
          <lpage>35</lpage>
          , (
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>J.S.</given-names>
            <surname>Racine</surname>
          </string-name>
          , '
          <article-title>Consistent significance testing for nonparametric regression'</article-title>
          ,
          <source>Journal of Business and Economic Statistics</source>
          ,
          <volume>15</volume>
          ,
          <fpage>369379</fpage>
          , (
          <year>1997</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>B.L.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.M.</given-names>
            <surname>Williams</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.K.</given-names>
            <surname>Oswald</surname>
          </string-name>
          , '
          <article-title>Comparison of parametric and nonparametric models for traffic flow forecasting'</article-title>
          , Transportation Research Part C,
          <volume>10</volume>
          ,
          <fpage>303</fpage>
          -
          <lpage>321</lpage>
          , (
          <year>2002</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.S.</given-names>
            <surname>Stankovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.S.</given-names>
            <surname>Stankovic</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.M.</given-names>
            <surname>Stipanovic</surname>
          </string-name>
          , '
          <article-title>Decentralized parameter estimation by consensus based stochastic approximation'</article-title>
          ,
          <source>IEEE Trans. Automatic Controll</source>
          ,
          <volume>56</volume>
          , (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>