<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Gross Product Simulation with pooling of Linear and Nonlinear Regression Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ahmad Flaih</string-name>
          <email>anflaih@ualr.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Abbas Abdalmuhsen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ebtisam Abdulah</string-name>
          <email>ekabdulah@ualr.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Srini Ramaswamy</string-name>
          <email>sxramaswamy@ualr.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Arkansas at Little Rock Little Rock</institution>
          ,
          <addr-line>AR 72204</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <fpage>69</fpage>
      <lpage>76</lpage>
      <abstract>
        <p>This paper discusses the problem of decision support systems in the organization. The procedure (linear combination) developed with the aim to combine some predicted results obtained with simulation of linear and nonlinear regression models (experts), multiple regression model, nonparametric regression model, and semi parametric regression model. This adjustment procedure enforce some statistical characteristics like the expected value of the gross production rate based on Cobb-Douglas production function is unbiased for the actual value, and the total weights(importance) of all models(experts) is equal to one. We used modeling and simulation techniques to generate our data and to apply the procedure.</p>
      </abstract>
      <kwd-group>
        <kwd>Regression Models</kwd>
        <kwd>Linear combination</kwd>
        <kwd>Cobb-Douglas production function</kwd>
        <kwd>Predict of the gross product</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        In economics, the relationship between the production (outputs) and the
production elements (inputs) known as production function. Usually decision makers
in the organization and companies used some statistical and economics methods to
support the production policy. Cobb-Douglas (CD) production function is one of the
most useful tools to support the production policy, [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] used the CD function
and found that public investment has a large contribution to production, [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] argued
that public-sector capital has no effect on production after controlling for location
characteristic, [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] pursued the insignificance of public capital. All of the
literature above assumes a Cobb-Douglas (CD) production function to estimate
productivity of inputs. One property of the CD functional form is that elasticity of an
input is the estimated slope coefficient when the output and inputs are measured in
logs, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] used nonparametric perspective, which allows elasticities to vary across
location and time, [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] use the CD functions proposed by [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] considers
the gross product(GP) as responding variable, public capital (PUC), private capital
(PRC), the employment rate (EMP), and the unemployment rate (UEPR) as
explanatory variables, to predict of the gross product by using linear and nonlinear
regression models. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] they have propsed linear pooling method to combine the
probability forecasts and proved the propsed method gave them more accurate
results.
      </p>
      <sec id="sec-1-1">
        <title>In this paper we used the same variables and methodology to predict gross production, but we use linear combination method to combine the results of the three experts (multiple linear regression, non-linear regression, and semi-parametric) as</title>
        <p>input variables, we choose these three types of models because we think there is linear
relationship, non-linear relationship and mixture relationship between gross
production and the predictor variables. It is based on our hypothesis that the linear
combination method will give us more accurate predictive gross production since we
believe this is a better way to take into account the results of multiple models
accompanied by an appropriate weighting scheme. Furthermore, we prove that this
method gives an unbiased estimator to the gross production. Our contribution is that
we use linear combination method to pool three regression models for predicting the
gross production.</p>
        <p>Linear combination is linear pooling of some sets of ordered pairs (importance, gross
production), we have explored the prediction associated with different regression
model can be combined into one final model via weighted linear combination will
give more accurate results. Mathematically linear combination of the sequence
y 1 , y 2 ,..., y n each with mean  is:
Y</p>
        <p>n
  a y</p>
        <p>i i
i 1
, where ai is the weight of y i ,and</p>
        <p>n
 (Y )   ai  ( y i )  a1  a2   ...  an 
i 1
(1)
n
The (Y )   if  ai  1</p>
        <p>i 1</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2 The Models</title>
      <sec id="sec-2-1">
        <title>Regression analysis is a technique used in data analysis; we use regression</title>
        <p>
          technique to predict the value of the response (dependent) variable given any value of
the predictor (independent) variable. A general regression model is [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]:
y i  E ( y i x i )  ei (2)
Where i=1, 2... n denoting an observation of a subject. yi is the response variable and
xi is a k 1 vector of independent variables. E (yi│xi) is the expectation of yi
conditional on xi, and ei is the error term. In this paper we will use the following types
of regression model:
        </p>
        <sec id="sec-2-1-1">
          <title>2.1 Parametric Regression Model:</title>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>In this model, it is assumed that yi is linearly related with xi, so we can say it is</title>
        <p>linear regression model:
E ( y i x i )    x i  </p>
      </sec>
      <sec id="sec-2-3">
        <title>Thus a linear regression model is written as:</title>
        <p>y i    x i   ei
(3)
Where α is the intercept and  is k 1 vector of parameters. Under Gauss-Markov
assumptions, the estimators of α and 
are Best Linear Unbiased Estimators
(BLUE), and can be estimate by using Ordinary Least Squares method (OLS).</p>
        <sec id="sec-2-3-1">
          <title>2.2 Nonparametric Regression Model</title>
          <p>
            If we do not know the data generating process, it is very unlikely that a linear
regression (parametric) model is exactly the appropriate model specification, so the
estimators α and  are not BLUE (best linear unbiased estimator). Instead of
making assumptions for the functional form of E (yi│xi), nonparametric regression
methods do not require any presumptions about the underlying data generating
process [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ]. Let
          </p>
          <p>y i  m (x i )  ei (4)</p>
          <p>Where m (.) is some function of unspecified functional form. Some basic
assumptions about m (.) are commonly made. Many methods have been devised to
estimate the regression function m (.), but we will just consider a simple but effective
estimate known as the Kernel regression estimate. Suppose we have a random sample
(x1, y1), (x2, y2) …, (xn,yn). The Kernel regression estimate of m(x) =E (y│X=x) is
given by:
n
 k (
x i  x j
d</p>
          <p>)y i
mˆ (x) = i 1
n
 k (</p>
          <p>x i  x j )
i 1 d</p>
          <p>Here, K is a nonnegative symmetric function that is not increasing as its
argument gets away from zero, and d is a parameter called the smoothing parameter
that is selected by the user to control the amount of smoothing. The estimator in
equation (4) is called the Local- Constant Least- Squares, which can be interpreted as
a weighted average of yi, where:
k (
x i  x j )</p>
          <p>d
n
 k (
x i  x j )
d</p>
          <p>is the weight attached to yi
It should be noted that the product Kernel function  (
x i  x j
d
of the Kernel function of all components of x. That is:
 ( x i  x j )  k k (</p>
          <p>x is  x js )
d s 1 d S</p>
          <p>Where xjs is the sth component of xj and ds the sth component of d. The Kernel
function k (.) can take several forms. In this paper we used the Gaussian Kernel
function is defined as:
k ( x is  x js ) 
1
exp[
1 x is  x js )2 ]</p>
          <p>(
d S 2 2 d S</p>
          <p>The smoothing parameter d is generally the most important factor when
performing nonparametric regression. The bandwidth is chosen to obtain a desirable
) is the product
(6)
(5)
(7)
trade-off between the bias and the variance of estimation, so we need a method that
could balance the bias and variance of the resulting estimate.</p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>We have used Leave-One-Out Cross-Validation to select the optimal bandwidth.</title>
        <p>
          This method is depending on the principle of selecting bandwidth that minimizes the
sum squared error of the resulting estimates. We will try to find the minimum MSE,
mathematically; we are trying to minimize the following function [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]:
1 n
CV (d) = [y i  mˆ j (x j )]2 (8)
        </p>
        <p>n i 1</p>
        <p>Where mˆ j (x j ) is the Leave-One-Out estimator of m (.) evaluated at xj. SAS
software uses this method to find the optimal bandwidth values.</p>
        <sec id="sec-2-4-1">
          <title>2.3 Semi parametric regression model:</title>
          <p>
            Nonparametric techniques are very flexible because they do not need any
assumptions about functional form. However, there are cases in which the
relationship between the dependent variable and some independent variables is known
to be linear and the relation between the dependent variable and other independent
variables remain undetermined. In this situation semi parametric models are
developed to solve this problem. Semi parametric models have both parametric and
nonparametric components [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ].
          </p>
        </sec>
        <sec id="sec-2-4-2">
          <title>2.3.1 Semi parametric Partially Linear Model</title>
        </sec>
      </sec>
      <sec id="sec-2-5">
        <title>Consider the semi parametric partially linear model:</title>
        <p>y i  x i   m (z i )  ei</p>
        <p>
          Where  is a k 1 vector of parameters, zi is a q  1 vector of independent
variables and ei is an additive error, is assumed to be uncorrelated with xi and zi.  is
the parametric part of the model and the unknown function m (.) is the nonparametric
part. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] proposed an estimate of  and m (zi) as follows. Using ordinary least
squares, the estimator of  is:
        </p>
        <p>n n
ˆ  ( xi xi  )1  xi  y i
i 1</p>
      </sec>
      <sec id="sec-2-6">
        <title>Where:</title>
        <p>xi  xi- E (xi│ zi) and</p>
        <p>y i =yi-E (yi│ zi)</p>
        <p>Once we obtain ˆ , the nonparametric part m (zi) is easy to estimate. From
equation (8), we get: m (zi) = y i  xi   ei</p>
        <p>Then we can get the estimator of m (zi) as follow:</p>
        <p>n ( y i  xi  )k ( z i d z j )
mˆ (z i )  i 1
n k ( z i  z j )
i 1 d
(9)</p>
        <p>(9)
(10)</p>
        <p>(11)</p>
      </sec>
      <sec id="sec-2-7">
        <title>Where d can be estimate similar to the nonparametric model? SAS software uses Cross-Validation to find the optimal bandwidth.</title>
        <sec id="sec-2-7-1">
          <title>3. Simulation Study</title>
        </sec>
      </sec>
      <sec id="sec-2-8">
        <title>We will generate a random sample of size 30 observations for each explanatory</title>
        <p>variables: public capital (PUC), private capital (PRC), the employment rate (EMP),
and the unemployment rate (UERP). The Cobb-Douglas (CD) production function is,
y i  f (PUC i , PRC i , EMPi ,UERPi )
(12)
Where y i =gross product, PUC i =public capital,
PRC i =private capital, EMPi =employment rate (labor)
UERPi =unemployment rate.</p>
      </sec>
      <sec id="sec-2-9">
        <title>We can rewrite the CD function as,</title>
        <p>( y i )   0 PUC iB1 PRC iB2 EM Pi B3UER PIB4</p>
      </sec>
      <sec id="sec-2-10">
        <title>The log-linear CD production function is</title>
        <p>i
y i   0  1PUC i   2PRC i   3EMPi   4UERPi   i
(13)</p>
      </sec>
      <sec id="sec-2-11">
        <title>Further,</title>
        <p>1- If 1   2   3   4 =1, the product function has constant returns to scale.
2- If 1   2   3   4 &lt;1, returns to scale are decreasing.
3- If 1   2   3   4 &gt;1, returns to scale are increasing.</p>
        <sec id="sec-2-11-1">
          <title>3.1.1 Parametric model Results</title>
        </sec>
      </sec>
      <sec id="sec-2-12">
        <title>We apply multi-linear regression model, covariance model, and variance component model to our generated data to find the estimated values by using SAS procedure proc reg [12] and the output are listed in Tables 1, follow: Table 1: Estimates of output Elasticity: parametric approaches</title>
      </sec>
      <sec id="sec-2-13">
        <title>We used this model to find the estimates values of Elasticity by using SAS procedure (proc gam) and we considered the independent variable (Public Capital) is linearly related to the gross production. The output from proc gam is listed as follows in Table 3.</title>
        <p>-2.010
(1.50)
 4
-0.642
(0.28)
1
-0.211
(0.37)
1
-0.459
(0.470)
 2</p>
        <sec id="sec-2-13-1">
          <title>3.1.2 Non Parametric Model</title>
        </sec>
      </sec>
      <sec id="sec-2-14">
        <title>We used this model to find the estimated values of the intercept and elasticity values by using generated data and SAS procedure (proc gam). The output from proc gam is listed in following table, Table 2: Table 2: Estimates of output Elasticity: Nonparametric approach</title>
        <sec id="sec-2-14-1">
          <title>3.1.3 Semi- Parametric Model</title>
          <p>3.2</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Linear Combination Method</title>
      <p>We will use the estimated models (experts), multiple parametric regression,
nonparametric regression, and Semiparametric regression to predict o f the log-gross
production, for sample of size equal to 10 observations, and by using the values of the
columns(Parametric y 1 ,Nonparametric y 2 ,Semiparametric y 3 ) in the following
table, we can estimates the weight corresponding to each expert(model) with SPSS
software by using neural networks procedure and estimate the log-gross production Y
as follow:</p>
      <p>Y  0.194y 1  0.595y 2  0.255y 3
Correlation is a measure of the association between two variables; it is a very
important part of statistics. One of the most fundamental concepts in many
applications is the concept of correlation. If two variables are correlated, this means
that you can use information about one variable to predict the values of the other
variable. The correlation matrixes between the predicted values in the table 4 as
follows:
i</p>
      <p>From Table 5 we can find that the correlation coefficient between the predicted
values of the linear combination(Y) and the actual variable of gross production is
(0.954) that means the linear combination method is more associated with the actual
values than the expert1, expert2, and expert3.this strongest association supports the
suggestion that the predicted value of the gross production which associated with the
different experts that combined is better than taking the predictive values of each
expert individually. The average difference is (0.261, 0.959, 0.146, 0.103) of the three
regression models (parametric, non-parametric, and semi-parametric) respectively and
linear combination procedure. The proposed method (linear combination) is more
accurate than regression models because the difference (actual and Y) in table 4 is
more less than the other models, so this means that the proposed method give us
unbiased estimator.
4. Conclusion</p>
      <sec id="sec-3-1">
        <title>In this paper, table 5(correlation matrix) shown that the correlation coefficients</title>
        <p>between the linear combination method, parametric model, nonparametric model and
semi-parametric and the actual values of the gross production are (0.954, 0.457,
0.935, and -0.015), because we believe the public capital (PUC) is linearly related to
gross production, so we choose it as parametric variable and that is why the
correlation between y3 and actual value is -0.015. Linear combination method is more
associated with the actual value than the others experts (regression model), i.e. linear
combination method reflects the experts views to find the most fitted predicted value
to the actual value. Finally as forecasters often wish to provide an accurate predicted
value, so we will use the linear combination method to predict of the gross
production.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Aschauer</given-names>
            <surname>David</surname>
          </string-name>
          <article-title>A. : public investment and productivity growth in the Group of Seven</article-title>
          .
          <volume>89</volume>
          -
          <fpage>13</fpage>
          , Federal Reserve Bank of Chicago (
          <year>1989</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Baltagi</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Pinnoi</surname>
          </string-name>
          ,N. :
          <article-title>Public capital stock and state productivity growth:Further evidence from and error component model</article-title>
          .
          <source>Empirical Economics</source>
          <volume>20</volume>
          ,
          <fpage>351</fpage>
          -
          <lpage>359</lpage>
          (
          <year>1995</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Clemen</surname>
          </string-name>
          , R. T. and
          <string-name>
            <surname>Winkler</surname>
            ,
            <given-names>R. L.</given-names>
          </string-name>
          :
          <article-title>"Combining Probability Distributions from Experts in Risk Analysis"</article-title>
          ,
          <source>Risk Analysis</source>
          <volume>19</volume>
          ,
          <fpage>187</fpage>
          -
          <lpage>203</lpage>
          (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Dong</given-names>
            <surname>Xiang</surname>
          </string-name>
          <article-title>"Fitting Generalized Additive Models with the GAM Procedure"</article-title>
          , SAS Institute Inc., Cary, NC
          <volume>27513</volume>
          (
          <year>1990</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Garcia-Mila</surname>
          </string-name>
          ,Teresa,Therese.J.
          <string-name>
            <surname>McGuire</surname>
          </string-name>
          ,and
          <string-name>
            <surname>Robert</surname>
            <given-names>H.</given-names>
          </string-name>
          ,
          <article-title>Porter: The effect of public capital in state-level production function reconsidered</article-title>
          .
          <source>Review Economics and Statistics forthcoming</source>
          (
          <year>1996</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Henderson</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kumbhakar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Public and private capital productivity puzzle:A nonparametric approach</article-title>
          ,
          <source>southern Economic Journal</source>
          <volume>73</volume>
          (
          <issue>1</issue>
          ),
          <fpage>219</fpage>
          -
          <lpage>232</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Holtz-Eakin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Public-sector capital and the productivity puzzle</article-title>
          ,
          <source>Review of Economics and statistics 76</source>
          ,
          <fpage>12</fpage>
          -
          <lpage>21</lpage>
          (
          <year>1994</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Racine</surname>
            ,
            <given-names>J.S. :</given-names>
          </string-name>
          <article-title>Nonparametric econometrics: theory and practice</article-title>
          , Princeton University Press (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Munnell</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>How does public infrastructure affect regional economic performance? New England Economic Review September</article-title>
          ,
          <fpage>11</fpage>
          -
          <lpage>32</lpage>
          (
          <year>1990</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. Par Osterholm :
          <article-title>"Incorporating Judgment in Far Chart"</article-title>
          ,
          <source>Finance and Economics Discussion Series. Screen Reader Version</source>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Robinson</surname>
            ,
            <given-names>P.M.</given-names>
          </string-name>
          :
          <article-title>Root-n-consistent semiparametric regression</article-title>
          ,
          <source>Econometrica</source>
          <volume>56</volume>
          ,
          <fpage>931</fpage>
          -
          <lpage>954</lpage>
          (
          <year>1988</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Ronald</surname>
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Cody and Jeffrey K. Smith :</surname>
          </string-name>
          <article-title>"Applied Statistics and the SAS Programming Language"</article-title>
          5th.ed (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Roopesh</surname>
            <given-names>R.</given-names>
          </string-name>
          , and Tilmann G. :
          <article-title>Combining Probability Factors"</article-title>
          ,
          <source>Technical Report no.543</source>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Schick</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>On asymptotically efficient estimation in semiparametric models</article-title>
          ,
          <source>Annals of Statistics</source>
          <volume>14</volume>
          ,
          <fpage>1139</fpage>
          -
          <lpage>1151</lpage>
          (
          <year>1986</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Xianghang</surname>
            <given-names>S.:</given-names>
          </string-name>
          <article-title>"Application of Nonparametric and Semi-parametric methods in Economics and Finance"</article-title>
          ,
          <source>PhD dissertation</source>
          , University of Southern California (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>