<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Genetic Algorithm Neural Network model vs Backpropagation Neural Network model for GDP Forecasting</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dezdemona Gjylapi</string-name>
          <email>dezdemona.gjylapi@univlora.edu.al</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eljona Proko</string-name>
          <email>eljona.proko@univlora.edu.al</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alketa Hyso</string-name>
          <email>alketa.hyso@univlora.edu.al</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computer Science Dept., University “Ismail Qemali”</institution>
          ,
          <addr-line>Vlorë</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Computer Science Dept., University “Ismail Qemali”</institution>
          ,
          <addr-line>Vlorë</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Computer Science Dept., University “Ismail Qemali”</institution>
          ,
          <addr-line>Vlorë</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper evaluates the usefulness of neural networks in GDP forecasting. It is focused on comparing a neural network model trained with genetic algorithm (GANN) to a backpropagation neural network model, both used to forecast the GDP of Albania. Its forecasting is of particular importance in decision-making issues in the field of economy. The conclusion is that the GANN model achieves higher accuracy on GDP forecasting. AG improves ANN model performance compared with standard backpropagation ANN model.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The quarterly GDP data are important for economic
analysis, because it gives insight on the general
economic activity, on the fluctuations of business cycle
and on the economic turning points. Forecast of
macroeconomic variables such as GDP play an
important role in monetary policy decisions and in
assessing the future economic situation. Economic
policymakers and analysts can adapt their theoretical
analysis of economic conditions according to the
forecasts of macroeconomic variables, or even perhaps
use them as a support and an explanation of their
theoretical analysis. Forecasts with better performance
on macroeconomic variables will lead to better
decisions.</p>
      <p>To have an assessment of the economic situation of
the country, policy institutions need to have information
on the indicator of GDP, in order to analyze the
macroeconomic policies implemented in the past and to
take political decisions about the future.</p>
      <p>Tkacz and Hu (1999) [TKA99] studied forecasting
GDP of Canada. The result of this study is that the best
neural network models outperform the best linear
models by between 15 and 19 per cent at this horizon,
implying that neural network models can be exploited
for noticeable gains in forecast accuracy.</p>
      <p>Giovanis (2009) [GIO09] in his paper is using the
ARIMA and ANN to predict the rate of economic
growth in the USA. This study examines the estimation
and forecasting performance of ARIMA models in
comparison with some of the most popular and common
models of neural networks. The results of this study
indicate that neural networks models outperform the
ARIMA forecasting.</p>
      <p>(Çeliku, Kristo and Boka) (2009) in the discussion
paper treats several models to forecast quarterly GDP in
Albania. They consist on ARIMA models with seasonal
components and indicator models, similar to bridge
models. This paper presents a first attempt to model the
GDP using a multi equations system which accounts for
the sectional interactions [ÇEL09].</p>
      <p>Models that predict GDP in the short term use
surveys’ indicators because these indicators have
several advantages such as:
1. Provide preliminary signals about short-term
developments of the activities of economic
agents.
2. These indicators are published formerly than the
main macroeconomic aggregates
3. These indicators are rarely subject to
adjustments.</p>
      <p>Due to the difficulties encountered in modeling the
quarterly GDP and the fact that ANN has proven to be
an efficient tool for non-parametric model data in the
form of non-linear function; we have developed a model
for the Albania’s Gross Domestic Product forecasting
with artificial neural network approach. In this paper, we
forecast the GDP using a neural network model trained
by genetic algorithm (GANN) and compare it with a
backpropagation neural network model, also used to
predict the Albania’s GDP.</p>
      <p>The rest of the paper is organized as follows. Section
2 presents a brief description of the ANN for GDP
forecasting, Section 3 explains the details of the GANN
model, Section 4 explains the results of the GANN and
BPNN models, and finally, Section 5 presents our
conclusions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Artificial Neural Networks for GDP</title>
    </sec>
    <sec id="sec-3">
      <title>Forecasting</title>
      <p>The advantage of artificial neural network approach
is that this model can capture the relationship of
nonlinear data, especially if the economy is very
volatile, and it is superior when forecasting chaotic data
[IMA11].</p>
      <p>An ANN consists of a number of simple and
interconnected processors, also called neurons,
analogous to biological neurons in the brain. Each
neuron receives a number of input signals through its
connections, however, it gives no more than one output
signal. The output signal is transmitted through the
neuron’s exit line [NEG05].</p>
      <p>Developing an ANN comprises the definition of:
1 - the network architecture, which is defined by the
basic processing elements (i.e. neurons) and by the
way in which they are interconnected (i.e. layers);
2 - the NN Learning, which implies that a processing
unit is capable of changing its input or output
behavior as a result of changes in the environment,
i.e. to adjust the weights based on input vector
values;
3 - the data used for training, testing and validating the
neural network.</p>
      <p>In this paper two neural networks have been
implemented: the multilayer perceptron (MLP, Figure 1)
neural networks trained by back-propagation (BP), and
the MLP neural network trained by a genetic algorithm
(GANN), both using bipolar sigmoid activation
function, which form is as follows:
 ( ) =</p>
      <sec id="sec-3-1">
        <title>2.1 The data used for the neural network</title>
        <p>The accuracy of GDP forecasting with ANN
depends on the selection of the variables to include as
input to the network.</p>
        <p>In this selection process we were based on the paper of
Celiku et al.[ÇEL09] to use the variables judging their
potential economic correlations with the quarterly GDP.</p>
        <p>In this paper we use all these economic and financial
variables combined with surveys’ variables.
a. Economic variables
o Government expenditures, data provided by the</p>
        <p>Ministry of Finance.
o Construction permissions for residential and
business purposes, in value data provided by
INSTAT.
o Total imports, data provided by the INSTAT.
b. Financial variables
o Interest rate on loans denominated in EURO, data
provided by the Bank of Albania;
c.</p>
        <sec id="sec-3-1-1">
          <title>Variables from surveys Variables from surveys are based on indicators constructed from qualitative surveys developed by the Bank of Albania with businesses and consumers.</title>
          <p>o
o
o
o</p>
          <p>ESI (Alb. TNE), the Economic Sentiment Indicator
aggregates in a single indicator the opinions of the
main market agents. They are collected from the
confidence surveys for the industry, construction
and services sector and for the consumers.</p>
          <p>EI (Alb.TE) the survey indicator for the economy
II (Alb.TI) the survey indicator for industry sector
CI (Alb.TN) the survey indicator for construction
sector
o
o</p>
          <p>SI (Alb.TSH) the survey indicator for services
sector
MPI (Alb. TBM) the survey indicator for major
purchases
and the output layer has only one. The main parameters
of the BP used in BPNN model were set as:
Momentum = 0.5
Learning rate = 0.3</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>The results are presented in Figure 3.</title>
          <p>The Figure 2 presents the sample of a neuron of the
hidden layer for the GDP forecasting, used in both
models, GANN and BPNN.</p>
          <p>The data used to forecast GDP are quarterly data
from 2002Q2 till 2016Q1. These data was stored in a
.csv file and served as input to the neural network
models. In general, ANN models require large amounts
of data. For the case of Albania’s GDP forecasting, the
biggest inhibitor is the lack of sufficient data. This
encompasses availability of data, its consistency and the
data time span.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>2.2 BPNN for Albania’s GDP Forecasting</title>
        <p>In a GDP forecast by using the neural network
trained with the Backpropagation method, it was used a
three-layer architecture, i.e. the input, output, and the
hidden layer. The input layer, similarly as in the
“neurogenetic” model, it has 10 neurons, the hidden layer 20,</p>
        <sec id="sec-3-2-1">
          <title>1 MFE-Mean Forecasted Error</title>
        </sec>
        <sec id="sec-3-2-2">
          <title>2 MAD-Mean Absolute Deviation</title>
          <p>In Table 1 are presented the results of an evaluation
performed for calculating the accuracy of the forecast.</p>
          <p>In Table 1 are presented the results of an evaluation
performed for calculating the accuracy of the forecast.</p>
        </sec>
        <sec id="sec-3-2-3">
          <title>3 MSE - Mean Squares Error</title>
        </sec>
        <sec id="sec-3-2-4">
          <title>4 TS - Tracing Signal</title>
          <p>Table 1 shows that this model works, as TS = 0.0052,
and since the MFE&gt; 0 the model tends to under-forecast,
with a mean square error of 0.0056 units.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. GANN Model for Albania’s GDP</title>
    </sec>
    <sec id="sec-5">
      <title>Forecasting</title>
      <sec id="sec-5-1">
        <title>3.1 Network architecture</title>
        <p>In this model is used the artificial neural network
Feedforward with multiple layers. Several tests are
conducted with networks with 3, 4, and 5 layers. In the
three cases the first layer, which is the input layer, it has
10 neurons, the same number of the factors which are
determined to be influencers in the forecast of the GDP.
It is the same with the last layer, the output one, which
consists of only one neuron.</p>
        <p>The hidden layers have a different number of
neurons: the 3-layer model has a hidden layer consisting
of 20 neurons (two times the number of neurons of input
layer); the 4-layer model has two hidden layers with
respectively 20 and 10 neurons; while the 5-layer model
has three hidden layers with respectively 20, 20, and 10
neurons.</p>
      </sec>
      <sec id="sec-5-2">
        <title>3.2 The learning Algorithm of the GANN model</title>
        <p>There are two types of learning algorithms: the
gradient descent and the global search method. The
methods such as Backpropagation, Newton,
QuasiNewton, and Levenberg - Marquant can be classified as
gradient descent methods. While the genetic algorithm
is a global search method.</p>
        <p>ANN model is very useful, especially in the cases
where the process of data generation is nonlinear and
complex, or when the functional form is not clear.
However, the learning process or the adjustment of the
parameters in an ANN model can take time and can fall
also in a trap, such as the local minima, especially when
there are too many parameters. To avoid such potential
problems, we can use the GA in the learning process.</p>
        <p>The advantage of the ANN model with genetic
algorithms is that this model can capture the relationship
of the nonlinear data, especially if the economy is very
unstable; and it is a superior model when it comes to
predict chaotic data.</p>
        <p>There are four main differences to be distinguished
in what respects genetic algorithms (GAs) differ from
traditional search and optimization procedures that
make them such a robust method [GOL92]:
1) GAs use an encoding of the parameters, not the
parameters themselves;
2) GAs search from a population of search points,
not a single point;
3) GAs only use the objective function to judge
solution quality, not derivatives or other
auxiliary knowledge;
4) GAs use probabilistic transition rules, not
deterministic rules.</p>
        <p>The basic idea in GA is finding the suitable
individual in the current population. In this context, the
GA searches for the vector of coefficients which is the
global optimal solution. The basic process of the GA is
represented as follows. It starts with an initial population
selected in a random manner. If the population
converges, the process stops and the solution is
presented. Otherwise, new individuals are created from
the old ones and after some operations the new
generation is created. This process is repeated until the
objective is reached, explained in more details as
follows:</p>
        <p>The objective of GA is to minimize the sum of
squares error (SSE), subject of the parameters generated
by the algorithm, i.e.</p>
        <p>Min yi  yi 
2</p>
        <p>ku y  f (x | )
Where y is the output generated from the model, and θ
are the coefficients parameters, which minimize the
error function. GA has 8 main steps, which are described
as follows:
1) The creation of an initial population of vectors with
coefficients [θ1, θ2, ..., θp ], where p is an even
number, and θi is a vector of Kx1 elements. The
initial population can be created randomly from the
normal standard distribution, or using constrains on
the sign or interval of the parameters’ values.
2) Two couples are selected randomly from the initial
population. The fitness of these four vectors is
estimated in regards to the objective function and
two vectors with the best fitness (smallest SSE) are
selected as winners. These two vectors are also
referred to as parents.
3) Through the application of the crossover operator
from the parent vectors are created two new vectors,
called children. The simplest form of the crossover
operator is the one-point crossover, through which
the two parents intersect at point I and the parts on
the right of point I switch the positions. I is selected
randomly from the set (1, K-1). For K=6 and I=3,
the operator would turn out as below:
 11 
 
 12 
 13 
 
 
 
 14 
 15 
 
 16  P1
 21 
 
 22 
 23 

 
 
 24 
 25 
 
 26  P2
 11 
 
 12 
 13 
 
  
 
 24 
 25 
 
 26  F1
 21 
 
 22 
 23 

 
 
 14 
 15 
 
 16  F 2
4) The mutation operator is applied to each element of
the children vectors. Under the influence of this
operator, each of these elements is subject of a hit
with a low probability, µ &gt; 0. The probability is
typically given by the formula: µ = 0.15 + 0.33 /
G, where G is the size of the generation, while in
our application it is defined by the user. The
mutation operator looks like the following [MIC96]
:
  s[1  r2(1t / T )b ]

  
  s[1  r2(1t / T )b ]

if
if
While in our application we use the following
mutation operator:</p>
        <p>  r2
  
 * r2
if
if
Where r1, r2 are two real numbers from the range [0,
1], selected randomly and s is a random number
from a normal standard distribution, t – the number
of the next generation, T – the maximal number of
the generations, and b is a parameter which defines
the degree in which the mutation operator is
nonuniform.
5) A “fight” runs between the four individuals, the
parents and the children (P1, P2, F1 and F2). The
two best vectors, the ones with the lowest sum of
squares error will survive, and they will move to the
next generation, whereas the two others will be
eliminated.
6) The process repeats, returning the parents again in
the pool of the population, so that they have the
possibility of re-selection, until the next generation
is populated with P vectors of coefficients.
7) The members of the current generation are
evaluated together with the ones of the new
generation, in regards to the fitness criteria, and the
best one for the next generation is selected. Hence,
the concept of “Elitism” is applied.
8) Create new generations of populations with P
individuals and assess the convergence through the
behavior of the best member in each generation,
based on the fitness criteria. If the change in the
assessment of fitness of the best member in each
generation, which passes through 50 generations, is
small, it can be claimed that genetic research has
converged on an optimum.</p>
        <p>The GANN learning flowchart is presented in Figure 4.</p>
        <p>The main parameters of the GA used in GANN
model were set as:
 Genetic population Size = 100
 Crossover probability in genetic population = 0.6
 Mutation probability in genetic population= 0.4
 Probability to add newly generated chromosome to
population = 0.25
 The bipolar sigmoid coefficient α = 1.5</p>
        <p>We tested three types of architectures. The results
are presented in Table 2</p>
        <p>As shown in Table 2, it results that the model with
4layers architecture is the best model. The results of this
model are presented in Figure 5.
MAD
MSE
TS</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusions</title>
      <p>In the model developed in this paper it is treated
exactly the evolution of neural network weights through
genetic algorithm.</p>
      <p>Due to the simplicity and generalization of evolution
and the fact that training algorithms based on gradient
often need to be executed several times in order to avoid
being trapped in a local minima, the evolution technique
is highly competitive.</p>
      <p>The results show that the model that uses neural
networks to forecast GDP, regardless of the method used
for weight training, it works and has a very satisfactory
performance.</p>
      <p>As long as the tracking signal (TS) is between –4 and
4, (in our model TS is equal to GANN=3.61 and
BPNN=0.0052) we can say that the model is working
correctly.</p>
      <p>The GANN forecasted GDP in this study resulted
with a MSE equal to 0.0004 while the BPNN forecasted
GDP resulted with a MSE equal to 0.0056.</p>
      <p>The GANN model tends to slightly under-forecast,
with an average absolute error of 0.0081 units, while the
BPNN model with an average absolute error of 0.0184
units.</p>
      <p>GANN outperforms better then BPNN in Albania’s
GDP forecasting.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [ÇEL09]Çeliku, Evelina,
          <string-name>
            <given-names>Ermelinda</given-names>
            <surname>Kristo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Merita</given-names>
            <surname>Boka</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>"Modelimi i PBB-së tremujore. Roli i treguesve ekonomikë dhe atyre të vrojtimeve." Tirane: Banka e Shqipërisë</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [GIO09] Giovanis, Eleftherios.
          <year>2009</year>
          .
          <article-title>"ARIMA and Neural Networks</article-title>
          .
          <article-title>An application to the real GNP growth rate and the unemployment rate of U.S.A." SSRN Eletronic Journal</article-title>
          . doi:
          <volume>10</volume>
          .
          <fpage>2139</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [GOL92]
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>David E.</given-names>
          </string-name>
          <year>1992</year>
          .
          <article-title>Genetic Algorithms in Search, Optimization, and Machine Learning</article-title>
          .
          <source>Addison-Wesley Publishing.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [IMA11]Imansyah,
          <string-name>
            <given-names>Muhammad</given-names>
            <surname>Handry</surname>
          </string-name>
          , Suryani, Nurhidayat, and
          <string-name>
            <surname>Muzdalifah</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>"GDP Estimation and Slow Down Signal Model for</article-title>
          <source>Indonesia:An Artificial Neural Network Approach." Finance and Banking Journal</source>
          <volume>13</volume>
          (
          <issue>1</issue>
          ):
          <fpage>77</fpage>
          -
          <lpage>94</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [MIC96]Michalewicz, Zbigniew.
          <year>1996</year>
          .
          <article-title>Genetic Algorithms + Data Structures = Evolution Programs</article-title>
          . Springer-Verlag Berlin Heidelberg.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [NEG05]
          <string-name>
            <surname>Negnevitsky</surname>
            ,
            <given-names>Michael. 2005. Artificial</given-names>
          </string-name>
          <string-name>
            <surname>Intelligence</surname>
          </string-name>
          :
          <article-title>A Guide to Intelligent Systems</article-title>
          . Pearson Education.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [NUR14]Nurcahyo, Septian,
          <string-name>
            <given-names>Fhira</given-names>
            <surname>Nhita</surname>
          </string-name>
          , and
          <string-name>
            <surname>Adiwijaya</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>"Rainfall Prediction in Kemayoran Jakarta Using Hybrid Genetic Algorithm (GA) and Partially Connected Feedforward Neural Network (PCFNN)</article-title>
          .
          <source>" 2nd International Conference on Information and Communication Technology (ICoICT).</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [TKA99]
          <string-name>
            <surname>Tkacz</surname>
            , G. and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Hu</surname>
          </string-name>
          .
          <year>1999</year>
          .
          <article-title>"Forecasting GDP growth using artificial neural networks." Bank of Canada WP</article-title>
          , No
          <volume>99</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>