<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Risk Assessment Technology of Crediting with the Use of Logistic Regression Model</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Ivan Franko National University of Lviv</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Lviv Polytechnic National University</institution>
        </aff>
      </contrib-group>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>The mathematical model of loan borrower estimation taking into account expert assessment has been created. The application of the model based on logistic regression aimed at estimating the solvency of the client has allowed to establish the correlation between risk factors and probable size of loan risk.</p>
      </abstract>
      <kwd-group>
        <kwd>machine learning</kwd>
        <kwd>credit risk</kwd>
        <kwd>scoring model</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Loan scoring which provides flexible tools of estimation of loan risks as well as a
possibility to automatize the adoption of decisions connected with loans is one of the main
approaches towards the quantitative evaluation of the solvency of borrowers which has
been actively developing worldwide. Information systems of credit scoring based on
the comparative data analysis of credit history of the existing borrowers and of the
applicants for a loan analogically, allow to define integrated score assessment of solvency
(reliability) of potential borrowers. The purpose of loan scoring is optimization of a
decision making concerned with granting the bank loans which in its turn causes current
relevance of our research.</p>
      <p>Practical functioning of risk management systems at Ukrainian banks is
characterized by a number of imperfections which added many problems to banking institutions
during crisis and partially caused its emergence. Thus, the methods of the analysis and
assessment of risk of borrowers on behalf of ownerships were imperfect. As a result
many consumer loans were issued to persons who could not cope to return the them.</p>
      <p>Currently, the potential of tools for scoring technologies of risk assessment related
to crediting has not been properly used. Appropriate use of scoring technologies
throughout all life cycle of the loan allows to make more adequate and justified
decisions which can be efficiently automated.</p>
      <p>Nowadays the application of special information technologies which would enable
the chance at a stage of credit applications consideration directed at the eliminating of
unreliable borrowers is relevant for the bank sphere. The Ukrainian banks use mainly a
single application scoring. This situation is explained by both underdeveloped scoring
systems in Ukraine and high prices for services of scoring model developers.</p>
      <p>The world practice uses different scoring types at different stages of crediting in
addition to the stage of consideration of the loan application, continuing to estimate the
properness of the loan application during all lifecycle of a loan, and even at a stage of
collecting debts and transfers to collectors. In particular, there is an analytical module
for the formation of models of solvency assessment of borrowers and their
segmentation as well as the possibility of algorithm development and fraud identification models
in a complex control system of loan scratches of SAS Credit Scoring for Banking. The
following indicates provided possibilities of realization of all types of scoring for
different types of tasks and for different types of data.</p>
      <p>Loan scoring includes the models of decision making and the main methods which
provide support to creditors during the process of decision making about a consumer
loan. The use of these methods allows to define the potential loan owner, the size and
expeditious strategy that will aid to increase profitability of borrowers as well as to
estimate a possible risk. Credit scoring is based on real data that allows to refer it to
reliable estimates of personal solvency.</p>
      <p>The scoring occurs in several types depending on the tasks which have to be solved:
Application scoring (scoring of the applicant) is the assessment of client solvency to
receive the loan (scoring according to biographical data is in priority); Behavioral
scoring (scoring of behaviour) is used for the assessment of return reliability of the issued
loans (the behavioural analysis); Collection scoring (scoring aimed to analyze arrears)
is used to assess a possibility of the complete or partial return of the loan in case of
violation of debt repayment periods (calculation of risks behind portfolio contents);
Fraud scoring (scoring aimed to fight swindlers) is used to assess the probability of a
new client to be a swindler; Response scoring (scoring of a response) is the assessment
of consumer’s reaction (response) to the directed offer; Attrition scoring (scoring of
losses) is the assessment of further product implicational probability or a product
supplier change.</p>
      <p>Traditionally, two approaches are used in case of realization of a scoring system in
Ukraine. The first one is classical (retrospective) scoring on the basis of the analysis of
historical data with application of the modern mathematical methods if such analysis
enables the choice of the significant fields for the questionnaire designed for the
borrower and other indexes. The second approach is an expert scoring when, for instance,
the expert sets solvency estimation rules and the software automatizes this algorithm
without application of any statistical methods for the analysis of historical data.
Nowadays the second alternative is the most common in both medium-sized and small banks
and in many larger ones. Nevertheless, in recent years the first option has been
characterized by the market demand due to the appearance of small but considerably more
important number of credit stories in some markets.</p>
      <p>Currently the integration of several scoring methods is the most efficient, namely
statistical and expert. The fact is that a system includes and realizes the tools which
give the chance to integrate the statistical approach and expert scoring, to consider
regional specifics of the market and loan products as well as to talk about their effective
use.</p>
      <p>The modern set of methods of loan scoring is developed on the basis of the tools for
the predictive analysis which belongs to a wide range of so-called methods of the
profound data analysis (data mining).</p>
      <p>The tools for the predictive analysis include:
─ the statistical methods designed on the basis of discriminant analysis (the linear
regression, logistic regression);
─ classification tree;
─ neural networks;
─ genetic algorithm;
─ method of the closest neighbors.
─ different options of the linear programming;
2</p>
    </sec>
    <sec id="sec-2">
      <title>Main Material Presenting</title>
      <p>The problem of model operation, assessment and management of credit risk was
investigated by the following Ukrainian and foreign scientists: G.I. Beregova, R. Gallati,
V.M. Gorbachuk, A.B. Kaminsky, B.Yu. Kishakevich, A.A. Lobanov, O. Habyyuk,
A.V. Chugunov and other.</p>
      <p>Model operation of loan risk is a significant and currently important problem, since
the application of effective loan risk assessment model allows the financial organization
to save time and money and to protect themselves from undesirable losses or even a
default. In addition, it helps to adopt management decisions concerning the avoidance
or minimization of the negative influence caused by the tendency to risk. Therefore, the
problem related to the choice of an optimal assessment model for loan as well as the
application of new loan assessment and modelling methods remains unresolved
problem risk in the modern changeable environment.</p>
      <p>The article aims at developing and realizing of mathematical evaluation model of
the loan borrower taking into account expert assessment.</p>
      <p>The evaluation of client solvency by the model of logistic regression in the presented
paper required the statement of correlation between risk factors and probability size of
loan risk y which has values ranging from 0 to 1. The selection of the most informative
quantitative variables of financial risk included the possible usage of the discriminant
analysis tools. The Table 1 presents a matrix fragment of values of nine indexes which
were selected during the analysis in which a binary variable y describes the following
situations: 0 – overdue or problematic loan; 1 – in due time repaid loan.</p>
      <p>The tab. 1 introduces designations: d – average monthly income, s – the sum of the
loan, t – the term of the loan, r – an interest rate, В – the age, О – expert’s assessment
of a professional, economic and social status of the client.</p>
      <p>
        The calculation of solvency coefficient of the ownership is performed by a formula [8]:
We use the prepared data which are partially presented in tab. 2 for the estimation of
unknown parameters of econometric model [1]:
 (  = 1|  ) =  ( 0 +  1  1 +  2  2 +  3  3) +   , i = 1,2,…,n
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
in which Р(уi = l | xi ) is the probability that the i-th value of a binary variable is 1 under
the condition of хi;  ( ) = 1+1− – logistic function; εі – random component; x1 –
individual creditworthiness ratio; x2 – age; x3 – expert assessment of the profession and the
socio-economic status of the client.
      </p>
      <p>
        Logistic function has that property that its values are ranging from zero to one at any
values of an argument.
Suppose that there is a training selection received in test data: (xi1, xi2, xi3, yi). It is
required to estimate the parameters in equation (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) using the obtained sample. For the
estimation of unknown parameters use of the maximum likelihood principle. According
to this principle, the values that give a maximum for the likelihood function are taken
as parameter estimates. The likelihood function in our case is as follows:
 ( ,  ) = ∏

  =1  (   )  [ −  (   )]1− 
Here the following designations are for short accepted:
      </p>
      <p>b = (b0, b1,b3)T,
xi = (1, Xi1, Xi2, Xi3),
xib = b0 + b1xi1 + b2xi2 + b3xi3</p>
      <p>
        Usually, instead of function (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ), its logarithm is used, which does not change the
essence of the problem, but allows one to get rid of the multiplication:

ln = ∑
      </p>
      <p>=   ln (   ) + ( −   )ln( −  (   )),
where n is the number of tests.</p>
      <p>Obviously, function (7) has a maximum. For estimation of values of the parameters
at which the maximum of the function (7) is reached, the partial derivatives are
calculated from these parameters and equated to zero:
  ( , )</p>
      <p>= 0</p>
      <p>Thus, the task has been reduced to solving the system of equations (8) with respect
to the unknown parameters b. The solution to this system presents certain difficulties,
since it is not linear.</p>
      <p>For the solution, you can use the Newton – Raphson method. The method involves
the selection of some initial approximation of the solution and its consistent
improvement in the course of performing a series of calculations:
  +1 =   −
The initial values can be defined as a vector of linear regression parameters:


 =
 =
 =
… 
where

 =


 =
(  =
(9)
(10)
(11)
(12)
(13)</p>
      <p>)
(14)
(15)
is confirmed by a calculated value a chi-square (22.14) and almost zero reliability not
to reject a null hypothesis. Analytical expression of the constructed model will have
such expression:</p>
      <p>(  = 1|  ) = (1 +  −1,31−0,55  1+1,31  2−15,95  3)−1</p>
      <p>The adequacy of the designed model can be defined by the index of the McFadden
likelihood ratio, using a formula
ln (  )
ln ( ) = 0,97
ln ( ) is a maximum value of a logarithmic function of credibility. It is reached in a
point with coordinates equal to estimation parameters model  = ( 0,  1,  2,  3).
ln (  ) is a value of a logarithmic function of credibility calculated by the assumption
that bl = b2 = ... = bт = 0. The calculated value of the index of the McFadden likelihood
ratio demonstrates adequacy of the designed model.</p>
      <p>Using the constructed model, the possibility of granting loans to customers was
calculated depending on the individual’s solvency ratio for fixed values of the other
variables (Fig. 2) (y1: x2 = 45, x3 = 3,4; y2: x2 = 55, x3 = 4; y3: x2 = 65, x3 = 5)
The calculations of the possibility of issuing loans to customers, depending on the age
of an individual with fixed values of the other variables (Fig. 3) (y1: x1 = 6, x3 = 4; y2:
x1 = 15, x3 = 3; y3: x1 = 3, x3 = 5)</p>
      <p>Using the constructed model, the possibility of granting loans to customers was
calculated depending on expert assessment of the profession and the socio-economic status
of the client for fixed values of the other variables (Fig. 2) (y1: x1 = 2, x2 = 25; y2: x1
= 2, x2 = 40; y3: x1 = 2, x2 = 55)
Fig. 3. Possibility of issuing loans to customers depending on the age of an individual
Fig. 4. Possibility of granting loans to customers depending on expert assessment of the
profession and the socio-economic status of the client
The estimation of the possibility of insuring loans to new clients by means of the
designed model, is provided in table 3.
The carried-out calculations indicate a possibility of granting the loan only to the
second and third clients.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusions</title>
      <p>Loan risks carry the greatest danger to commercial banks in the context of providing
and maintaining their financial stability. Therefore, introduction of new methods of
assessment, management and consequently, as well as prevention of loan risks have to be
the priority direction of development of a Ukrainian banking system.</p>
      <p>The designed model of loan scoring allows the bank loan analyst to have an
opportunity of justified and grounded self-decisions on loan service for customers and
management of the loan portfolio under the conditions of intense competition in the market
of retail loan issuing.</p>
      <p>With the passage of time, any statistical model becomes inaccurate. It occurs for
many reasons: owing to business cycles, changes of the customer data base of bank,
structural shifts in economy, inflation etc.</p>
      <p>Using the jargon of probability model it means that the influence of borrower’s
characteristics on the probability of returning or not returning the loan does not remain
constant, and tends to change over time. Thus, in order to assure a continuous functioning
of the scoring model, it requires periodical adjustment.
6. Kaminsky, А. B.: Modeling of financial risks. Кyiv: Publishing and Printing Center "Kyiv</p>
      <p>University", 304 p. (2006)
7. Melnyk, О. H.: Methodical provisions on the rapid diagnosis of the threat of bankruptcy of
the enterprise. In: Finance of Ukraine, 16, pp. 108-116 (2010)
8. Prokopenko, І.F., Ganin, V.І., Solyar, V.V., Maslov, S.І.: Fundamentals of Banking:
Manual. Кyiv: TsNL, 410 p. (2005)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Greene</surname>
            ,
            <given-names>W. H.</given-names>
          </string-name>
          :
          <article-title>Ekonometric analysis</article-title>
          .
          <source>Кyiv: Osnovy</source>
          , 1196 p. (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Altman</surname>
            ,
            <given-names>E. I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saunders</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Credit risk measurement: Developments over the last 20 years</article-title>
          . In
          <source>: Journal of Banking and Finance</source>
          ,
          <volume>21</volume>
          (
          <fpage>11</fpage>
          -
          <lpage>12</lpage>
          ), pp.
          <fpage>1721</fpage>
          -
          <lpage>1742</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bashko</surname>
            ,
            <given-names>V. I.</given-names>
          </string-name>
          :
          <article-title>Estimation of foreign exchange risks and cost of external state borrowing in Ukraine</article-title>
          .
          <source>In: Finance of Ukraine, 9</source>
          , pp.
          <fpage>94</fpage>
          -
          <lpage>104</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Giesecke</surname>
          </string-name>
          , К.,
          <string-name>
            <surname>Shilin</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Transform analysis for point processes and applications in credit risk</article-title>
          .
          <source>In: Mathematical Finance</source>
          , Vol.
          <volume>00</volume>
          ,
          <issue>0</issue>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Dolinsky</surname>
            ,
            <given-names>L. B.</given-names>
          </string-name>
          :
          <article-title>Probabilistic models of default borrowers</article-title>
          .
          <source>In: Finance of Ukraine</source>
          ,
          <volume>10</volume>
          , pp.
          <fpage>73</fpage>
          -
          <lpage>81</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>