<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Classification of Financial Conditions of the Enterprises in Different Industries of Ukrainian Economy Using Bayesian Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oleksandr Chernyak</string-name>
          <email>chernyak@univ.kiev.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yevgen Chernyak</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Economic Cybernetics, Taras Shevchenko National University of Kyiv</institution>
          ,
          <addr-line>Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of International Economics, Taras Shevchenko National University of Kyiv</institution>
          ,
          <addr-line>Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>519</fpage>
      <lpage>530</lpage>
      <abstract>
        <p>In this work the analysis of branches of Ukrainian economy was done, particularly average financial parameters were found. For each parameter the boundaries were determined which divide enterprises into 5 parts and allow making more detailed ratings. The ratings were made by each parameter and then the aggregate rating was found. The analysis of indices interrelation was made using Bayesian network (BN). The coefficient of partial correlation in BN was used to analyze the interrelation of indices. This subject-matter was developed for Ministry of Industrial Policy of Ukraine. We recommend to use cascade naive Bayes model in financial planning.</p>
      </abstract>
      <kwd-group>
        <kwd>financial indices</kwd>
        <kwd>bankruptcy</kwd>
        <kwd>Bayesian networks</kwd>
        <kwd>naïve Bayes</kwd>
        <kwd>partial correlation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Each industry of economy is characterized by numerous features which distinguish
one particular industry from a variety of others, for instance such features are length
of operating cycle, requirement in available funds, tax policy of the state etc. The
peculiarity of every industry causes the difference in major financial indices. That is
why defining indices standards, their average values within the industry is an
important issue, which helps to describe the place of each enterprise in the industry
and also to compare industries with each other.</p>
      <p>Setting the problem of standardizing of the financial indices estimation in frames
of industries at once raises a question about the necessity of calculating the
bankruptcy probability for each industry separately. To be mentioned, defining
bankruptcy probability following problems are faced: a) the fact of bankruptcy is
influenced not only by quantitative but also by qualitative indices like the possibility
of getting preferential crediting, support of the state, uninteresting of creditors to
confess a debtor to be a bankrupt; b) inadequate statistics of bankruptcies (procedure
Copyright ©by the paper’s authors. Copying permitted only for private and academic purposes.
of bankruptcy stretches on a few years and fact of confession a bankrupt becomes
separate from the beginning of problems what could have been foreseen before by
the changes of financial indices); c) absence of adequate, representative base of
bankruptcies, which would allow estimating probability of bankruptcy within
industries.</p>
      <sec id="sec-1-1">
        <title>An estimation of arising of overdue payments probability from the side of enterprise would be more precisely, as problems which are described above level with the estimation of non-fulfillment of creditor liabilities. 1</title>
        <sec id="sec-1-1-1">
          <title>Financial</title>
          <p>indices</p>
        </sec>
        <sec id="sec-1-1-2">
          <title>Problems with liabilities fulfillment 2</title>
        </sec>
      </sec>
      <sec id="sec-1-2">
        <title>Bankruptcy</title>
      </sec>
      <sec id="sec-1-3">
        <title>Qualitative indices influence the stage 2 in a greater measure than stage 1 (Fig. 1).</title>
      </sec>
      <sec id="sec-1-4">
        <title>A fact and a term of payment delay is accurately fixed by credit organizations.</title>
      </sec>
      <sec id="sec-1-5">
        <title>Statistics of overdue debt is collected by credit organizations, delays in payments happen more frequently, so that estimation of probability for every industry is more exact.</title>
      </sec>
      <sec id="sec-1-6">
        <title>Therefore we stress on adequacy and possibility of estimation on the stage 1 and</title>
        <p>mention that during the transition from the first stage to the second one accuracy is
being lost, and that is why estimation of the link “financial indices – bankruptcy” is
considered to be purposeless.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2 Criterion of Choosing the Standards</title>
      <p>The only assumption we will use to make the analysis is that we know the direction
of index influence, in other words, increasing of the index influences positively the
enterprise state or contrariwise. The standard values of index can be based on
following considerations:
a) Through the influence of index on a resulting index (investigation of different
variants both negative and positive: fact of overdue debt, bankruptcy debt, increase
of net income). Recommended value of index would be the one which guarantees
fulfillment of obligations with certain probability.</p>
      <sec id="sec-2-1">
        <title>b) Finding average value within the industry, medians or division into several groups</title>
        <p>of sorted index values (more than 2) and finding average value for every group. This
approach is similar to rating; some part receives the highest rate and the other lowest.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Moreover, it is convenient to follow the indices moving from one group to other and</title>
        <p>afterwards to check stability of a model.</p>
        <p>The disadvantage of the first variant is difficulty to work with the correlated
indices because we have to define which of them exactly influences the result. The
exclusion of the strongly correlated indices from a model will not deprive us of
possibility to estimate standards for them. For example, we will have to use one of
the indices of liquidity only. The disadvantage of the second variant is a risk that
industry is in the phase of recession/growth and we will not get the standard values,
but correspondingly decreased (increased). The best way would be to compare the
results which were found by two methods and exactly to estimate in what parts the
whole set is divided by probability found by first method and what probabilities we
will get for the indices were found by dividing the set into equal parts.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Breaking on the Branch</title>
      <sec id="sec-3-1">
        <title>Companies were divided into the industries according to The Classifier of Kinds of</title>
      </sec>
      <sec id="sec-3-2">
        <title>Economic Activities (CKEA). But the way of fragmentation of CKEA was different</title>
        <p>from the standard approach. We tried to pick out specific industries. For example,
insurance was picked out of financial sector, pharmacy – out of chemical industry.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Such method turned out to be appropriate, that was proved by the difference between indicators.</title>
      </sec>
      <sec id="sec-3-4">
        <title>We tried to provide the fragmentation as accurately as possible to be sure that the</title>
        <p>company’s activity is the same that is in the industry. For example, how production
of metal should be divided from production of metal products, wholesale trade and
subsidiary services? Trade and subsidiary services may differ much one from
another. But at the same time it is inappropriate to combine them in one industry.</p>
      </sec>
      <sec id="sec-3-5">
        <title>Therefore, companies were divided into the next classes: extraction, production,</title>
        <p>engineering industry, wholesale trade, retail trade, rent and services.</p>
      </sec>
      <sec id="sec-3-6">
        <title>Finally, we have got the following distribution of all the enterprises (376151) into</title>
        <p>the industries: Auto – 9 384, Building – 41 831, Building materials – 12 271, Power
engineering – 4 427, Cafe and hotels – 10 400, Municipal service – 6 208, Culture
and education – 10 602, Wooding – 11 656, Medicine – 5 446, Metallurgy -5 469,
Real estate – 30 671, Fuel – 8 134, Polygraphy – 6 537, Cattle breeding – 6 473,
Textile – 6 396, Telecommunications – 14 568, Transport – 12 684, Tourism – 4 978,
Pharmacy – 5 481, Media – 3 529, Food industry – 28 058, Chemical – 7 061,</p>
      </sec>
      <sec id="sec-3-7">
        <title>Wholesale trade – 50 019, Retail – 27 121, Machinery construction – 9 685,</title>
      </sec>
      <sec id="sec-3-8">
        <title>Financial services – 15 685, Insurance – 726, Non-financial services – 16 892, Law – 3 759.</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4 Dividing into Groups with the Further Purpose to Make Ratings</title>
      <sec id="sec-4-1">
        <title>Now we will determine the average indices (see Table 1).</title>
        <p>1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.</p>
      </sec>
      <sec id="sec-4-2">
        <title>Name</title>
      </sec>
      <sec id="sec-4-3">
        <title>Moment liquidity ratio:</title>
      </sec>
      <sec id="sec-4-4">
        <title>Current ratio:</title>
      </sec>
      <sec id="sec-4-5">
        <title>General liquidity ratio:</title>
      </sec>
      <sec id="sec-4-6">
        <title>Current assets to equity ratio</title>
      </sec>
      <sec id="sec-4-7">
        <title>Independence coefficient:</title>
      </sec>
      <sec id="sec-4-8">
        <title>Return on assets:</title>
      </sec>
      <sec id="sec-4-9">
        <title>Return on sales:</title>
      </sec>
      <sec id="sec-4-10">
        <title>Inventory turn(days):</title>
      </sec>
      <sec id="sec-4-11">
        <title>Debtors accounts turn(days):</title>
      </sec>
      <sec id="sec-4-12">
        <title>Creditors accounts turn (days):</title>
      </sec>
      <sec id="sec-4-13">
        <title>Capital assets depreciation:</title>
      </sec>
      <sec id="sec-4-14">
        <title>The proportion of capital assets and goods in process in total assets:</title>
        <p>ML
CR
GL</p>
      </sec>
      <sec id="sec-4-15">
        <title>Definition</title>
        <p>AHL / Lc
Al / Lc</p>
        <p>Aw / Lc
CA (Eq</p>
        <p>Anon_ current) / Eq
IC OF / Eq
R(a) (NP 12) /(AA N)
R(s) (NI 12) /(NP N)
IT
DT</p>
        <p>N 30 ICavg/ NP</p>
        <p>N 30 ARavg/ NP
CT N 30 APavg / NP
D(ca) D / OC
CAinA (CA G) / А</p>
        <p>AHL – high-liquidity assets, which consist of cash, their equivalents and current
financia investmens; Lc – current liabilitis which consist of short-term credits and
accounts with creditors; Al – liquid assets which consist of high-liquidity assets,
accounts receivable and billss of exchange received; Aw – working assets; Eq
equity; Anon_current – non-current assets; OF- obtained funds; Eq- equity; NP – net
profit; N – number of monthes in period; NI – net income; AA – average value of
assets is calculated as (assets at the beginning of period + assets at the end of
period)/2; ICavg – average value of inventoryis calculated as (inventory a the
beginning of a period+inventory at the end of a period)/2; ARavg – the average sum
of the accounts receivable is calculated as (accounts receivable at the beginning of a
period + accounts receivable at the end of a period)/2; APavg – the average sum of
accounts payable is calculated as (accounts payable at the beginning of a
period+accounts payable at the end of a period)/2; OC – original cost of capital
assets; D – depreciation; CA-capital assets; G-goods-in-process; А-assets ( see
definitions in Van Horne and Wachowicz, (2008) or Stickney et al., 2010).</p>
      </sec>
      <sec id="sec-4-16">
        <title>The period for NI, NP, AA , IT, DT , CT is quarter.</title>
      </sec>
      <sec id="sec-4-17">
        <title>The differences between the branch indices showed the necessity of the work</title>
        <p>which was done. The short-term indices them selves don’t allow to estimate the
enterprises adequately, their place in the whole field. The values of each index were
divided by quantity into 5 equal groups (see Table 2).</p>
        <p>IT
5 10 15 10 50 1 1 500
0,115 0,996 2,058 1,000 2,545 0,045 0,025 203,774
0,022 0,508 1,154 0,967 0,557 0,002 0,005 108,812
0,003 0,221 0,883 0,397 0,074 0,000 0,000 52,493
0,000 0,045 0,426 -0,009 -1,155 -0,025 -0,024 18,246
0,000 0,000 0,000 -10,000 -50,000 -1,000 -1,000 1,000
ML CR GL CA IC R(a) R(s) IT</p>
        <p>5 10 15 10 50 1 1
2,627 5,479 7,569 0,981 0,281 0,188 0,300
1,224 2,451 3,429 0,689 0,058 0,041 0,091
0,408 1,333 1,554 0,184 0,008 0,004 0,031
0,049 0,515 0,757 0,000 0,001 0,000 0,000
0,000 0,000 0,000 -10,000 -50,000 -1,000 -1,000
500
28,630
8,203
4,418
2,323
1,000</p>
        <p>It gives the possibility to determine the position of an enterprise by each of the
parameters more precisely. In this table we can see that 20% (after filtered of
information) enterprises of food industry have high value of ML in range [0,115; 5],
also 20% enterprises of insurance industry have high value of ML in range [2,627; 5].
40% enterprises of food industry have low value of ML in range [0; 0,003], also 20%
of insurance industry have low value of ML in range [0; 0,049]. We recommend use
this information in comparative analysis and determination position in industry.</p>
        <p>After making the division for each enterprise by all the parameters the ratings
were made (0 means error, 2-6 according to the value of parameter: the less
parameter is the bigger the rating is, 1 was used for errors testing and isn’t applied as
a rating estimation). In this work there were considered both those coefficients which
increase is positive for an enterprise (return on assets, absolute liquidity) and those,
which increase is negative (depreciation, stock turn). For making the general rating
it’s necessary to make transformation so that the increase of the rating estimation by
all the parameters will cause increase of the general rating. Let’s convert the rating
estimation of the parameters, which increase is positive by the following
formula: R 8 R . This transformation leads to 2 6 , 3
6 2 .</p>
      </sec>
      <sec id="sec-4-18">
        <title>Below is given the rating of three branches enterprises (Fig. 2):</title>
        <p>5 , 4
4 , 5</p>
        <p>3 ,</p>
        <p>Rate distributin of enterprises rating
0,14
0,12
0,1
y
cn0,08
e
qu0,06
e
r
F0,04
0,02
0
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30</p>
        <p>Value of rating
Food industry</p>
        <p>Building</p>
        <p>Financial services</p>
        <p>As a result we have a distribution close to even (it was expected because the
coefficients with the least correlation values were chosen for this rating). The
similarity for different branches is the evidence of the proposed method adequacy
and gives the possibility to compare enterprises from different fields by means of this
rating. For making rating 5 parameters were used: GL, IC, R(a), CT, D(ca). While
forming the rating the following indices were transformed: GL, IC, R(a); so the
higher the value of R is, the more risk there is for solvency in the future. Visual
similarity of distributions causes a question about the similar connection between the
values notwithstanding the branch. The more detailed research of the parameters
influence using Bayesian networks will be given further.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Construction of Bayesian Network</title>
      <sec id="sec-5-1">
        <title>Bayesian networks are used for modelling subject domains which are</title>
        <p>
          characterized by uncertainty. BNs are often used for the classification problem
          <xref ref-type="bibr" rid="ref1">(Friedman et al., 1997)</xref>
          . There are the direction of using Bayesian networks in
economics: bankruptcy prediction
          <xref ref-type="bibr" rid="ref5">(Sun and Shenoy, 2007)</xref>
          , early warning of bank
failures
          <xref ref-type="bibr" rid="ref3">(Sarkar and Sriram, 2001)</xref>
          , credit risk modeling
          <xref ref-type="bibr" rid="ref2">( Pavlenko, Chernyak, 2010)</xref>
          ,
portfolio risk analysis and others.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Now we calculate the coefficients of correlation among the variables. In the</title>
        <p>Table 3 represented values of the coefficients of correlation among the variables.</p>
      </sec>
      <sec id="sec-5-3">
        <title>Colored cells represent coefficients of correlation which 0,1 .</title>
        <p>According to the Table 3 results the connection graph was built ( Fig. 3). On this
graph R-rating is the value of the 0-level. ML, CR, GL, IC, R(a), R(s), CT, IT, D(ca)
are the first-level values (on the graph ML, GL are imaged not on the same level with
the other values of the first-level for the better visual perception and for showing the
influence of this value on the other, their interdependency).The second-level indices
(DT, CAinA, CA) have the biggest influence on the turnover indices (CT, IT) and
liquidity (ML, CR, GL). We chose 0,1 to be the level of link value.</p>
      </sec>
      <sec id="sec-5-4">
        <title>In case if the influence of some index (eliminating the other indicators influence)</title>
        <p>on rating is inessential (absolute value of partial correlation is less then 0,1) then this
index will be moved from the first- level to the second and then its influence on the
first-level indices will be estimated. If some index of the second-level will influence
all the first-level linked indices inessential then it will be moved into the third-level.
While moving into the lower level we “break” only the links with the indices of the
upper level (while moving the index into the second-level only the link with the
rating is broken).The following are the values of partial correlations for indices,
which are linked on the graph (Table 4):
0,1 ).</p>
      </sec>
      <sec id="sec-5-5">
        <title>According to the given calculations we come to the conclusion that the influence</title>
      </sec>
      <sec id="sec-5-6">
        <title>CT on R is inessential so this index should be moved into the second-level. Colored cells show insignificant correlations (absolute value of partial correlation is less then 0,1).</title>
      </sec>
      <sec id="sec-5-7">
        <title>Now we only have to calculate the partial correlations between the first and second-level taking into account translation of CT into the second-level. Before moving CT we have following result (Table 5).</title>
      </sec>
      <sec id="sec-5-8">
        <title>Here we may conclude that the influence of CAinA on CT is inessential. After moving CT we have following result (Table 6).</title>
      </sec>
      <sec id="sec-5-9">
        <title>We come to the conclusion that the link between IT and DT, ML and DT is absent. As a result we get the following links (Fig. 4 – cascaded naïve Bayes model):</title>
      </sec>
      <sec id="sec-5-10">
        <title>In the article (Sun and Shenoy, 2007) it was proposed to set the value level 0,1</title>
        <p>analogically. Finding bigger threshold value of , the influence of the second-level
indices on first-level indices was confirmed, it didn’t lead to any changes in the graph
structure.</p>
      </sec>
      <sec id="sec-5-11">
        <title>We recommend using cascade naive Bayes model while making financial</title>
        <p>planning. For example, an enterprise seeks to minimize the risk of insolvency – it
should seek to decrease/increase the correspondent index (depending on the
correlation sign), taking into consideration that the first-level indices are influenced
by the second-level indices. Measure and character of the influence have to be
compared using the following tables of conditional probabilities (Tables 7, 8):
P(R(a) high/ ML high) 0,032 0,1918 0,49 0,5024 0,478 0,1997 0,3476 . (4)</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6 Conclusions</title>
      <sec id="sec-6-1">
        <title>The main idea of this research is to demonstrate the differences between the</title>
        <p>financial indices for different industries. The analysis of indices interrelation was
made using Bayesian network. The coefficient of partial correlation in BN was used
to analyze the interrelation of indices. While making ratings there was made an
assumption about the independence of the distribution form in which the rating
frequency is described for all enterprises from branch.</p>
      </sec>
      <sec id="sec-6-2">
        <title>The explanation of the inadequacy of the bankruptcy probability estimation is</title>
        <p>given (especially in terms of Ukrainian economy). The bigger accuracy of the
solvency estimation is pointed out. The assumption is made about keeping the
coefficients proportions in discriminatory models of solvency estimation
notwithstanding the branch.</p>
      </sec>
      <sec id="sec-6-3">
        <title>This subject-matter is being developed for Ministry of Industrial Policy with the purpose of temporary revelation of the enterprises subordinate to these Ministry financial problems.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Friedman</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Geiger</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldszmidt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>1997</year>
          )
          <article-title>Bayesian network classifiers</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>29</volume>
          , p.
          <fpage>131</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Pavlenko</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chernyak</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>Credit risk modeling using bayesian networks</article-title>
          .
          <source>International Journal of Intelligent Systems</source>
          ,
          <volume>25</volume>
          ,
          <issue>N4</issue>
          , p.
          <fpage>326</fpage>
          -
          <lpage>344</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Sarkar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sriram</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          (
          <year>2001</year>
          )
          <article-title>Bayesian models for early warning of bank failures</article-title>
          .
          <source>Management science, 47, N 11</source>
          , p.
          <fpage>1457</fpage>
          -
          <lpage>1475</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Stickney</surname>
            ,
            <given-names>C.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weil</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Francis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2010</year>
          )
          <article-title>Financial Accounting: An Introduction to Concepts</article-title>
          ,
          <source>Methods and Uses. 13th Edition</source>
          .
          <article-title>South-Western: Cengage Learning</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shenoy</surname>
            ,
            <given-names>P.P.</given-names>
          </string-name>
          (
          <year>2007</year>
          )
          <article-title>Using Bayesian networks for bankruptcy prediction: Some methodological issues</article-title>
          .
          <source>European Journal of Operation Research</source>
          ,
          <volume>180</volume>
          , p.
          <fpage>738</fpage>
          -
          <lpage>753</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Van</given-names>
            <surname>Horne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.C.</given-names>
            ,
            <surname>Wachowicz</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.M.</surname>
          </string-name>
          (
          <year>2008</year>
          )
          <article-title>Fundamentals of Financial Management</article-title>
          .
          <source>12th Edition</source>
          . Lebanon, Indiana, USA: FT Prentice Hall.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>