<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>International Conference on Advanced Aspects of Software Engineering
ICAASE, December</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Comparison of ensemble cost sensitive algorithms: application to credit scoring prediction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Meryem Saidi</string-name>
          <email>miryem.saidi@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nesma Settouti</string-name>
          <email>nesma.settouti@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mostafa El Habib Daho</string-name>
          <email>mostafa.elhabibdaho@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohammed El Amine Bechar</string-name>
          <email>am.bechar@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Biomedical Engineering Laboratory, Tlemcen University</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>High School of Management, GBM Laboratory, Tlemcen University</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <volume>0</volume>
      <fpage>1</fpage>
      <lpage>02</lpage>
      <abstract>
        <p>Page 56</p>
      </abstract>
      <kwd-group>
        <kwd>Cost sensitive learning</kwd>
        <kwd>credit scoring</kwd>
        <kwd>ensemble algorithms</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In recent years, the increase in the demand for
credit leads the nancial institutions to
consider arti cial intelligence and machine
learning techniques as a solution to make decisions
in a reduced time. These decision support
systems reach good results in classifying loan
applications into good loans and bad loans.
Albeit they su er of some limitations, mainly,
they consider that the misclassi cation errors
have the same nancial impact.</p>
      <p>In this work, we study the performance of
ensemble cost sensitive algorithms in reducing
the most expensive errors. We apply these
techniques on German credit data. By
comparing the di erent algorithms, we
demonstrate the e ectiveness of cost sensitive
ensemble algorithms in determining the potential
loan defaulters to reduce the nancial cost.</p>
      <p>Copyright c by the paper's authors. Copying permitted for
private and academic purposes.</p>
      <p>In: Proceedings of the 3rd Edition of the International
Conference on Advanced Aspects of Software Engineering
(ICAASE18), Constantine, Algeria, 1,2-December-2018,
published at http://ceur-ws.org
1</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>Credit scoring is the process of analyzing credit les,
to decide the creditworthiness of an individual.
Distinguishing a good applicant for a loan from a bad
one is important to cut nancial institution's losses
[AEW13]. The use of machine learning tools allows
auditors to analyze large amounts of information for
evaluating the credit risk in a reasonable time [Yu17].</p>
      <p>These algorithms tend to decrease the classi cation
error and assume that all misclassi cation's have the
same cost. However, the cost for labeling a positive
example as negative is di erent from the cost for
labeling a negative example as positive. Indeed, approving
a bad loan is much more costly than rejecting a
potentially good loan [KBC16]. Indeed, if a loan can not
full ll its loan obligations this may result in negative
impacts on bank pro ts and big nancial losses.
However, if a good loan is rejected, it causes lower pro ts
losses.</p>
      <p>These algorithms tend to decrease the classi cation
error and assume that all misclassi cation's have the
same cost. However, the cost for labeling a positive
example as negative is di erent from the cost for
labeling a negative example as positive. Indeed, approving
a bad loan is much more costly than rejecting a
potentially good loan [KBC16]. Indeed, if a loan can not
full ll its loan obligations this may result in negative
impacts on bank pro ts and big nancial losses.
However, if a good loan is rejected, it causes lower pro ts
losses.</p>
      <p>On the other hand, credit datasets are highly
imbalanced which worsens the situation. Traditional
machine learning algorithms tend to maximize accuracy
by considering most of the cases as good loans
(majority class), thus causing signi cant default loss.</p>
      <p>Motivated by the non-uniform cost classi cation
problem, the data mining researchers propose new
cost-sensitive learning approaches for taking into
account the misclassi cation costs or other types
of costs such as acquisition cost or computer cost
[Tur00, Dom99, Elk01, Mar02]. Some studies have
been conducted on the use of cost-sensitive (CS)
learning in credit scoring as a CS-boosted tree [XLL17],
CS-Neuronal network [AGM+13], CS-decision tree
[BAO15] and CS-logistic regression [BAO14].</p>
      <p>The objective of this study is to compare the e
ectiveness of di erent techniques to assist the loan o cer
in screening out potential loan defaulters in the credit
environment. The rest of paper is framed as follows:
Section 2 describes the used algorithms.
Experimental results and discussion are presented in Section 3,
Finally, we conclude with a summary of results and
directions for future works.
2</p>
    </sec>
    <sec id="sec-3">
      <title>Research methodology</title>
      <p>In this section, we present the cost sensitive learning
principle and the selected algorithms for the
evaluation.
2.1</p>
      <sec id="sec-3-1">
        <title>Cost sensitive learning</title>
        <p>There are several methods to deal with unequal
misclassi cation costs. The rst one is to use a
learning algorithm that takes into account the costs when
building the classi er. The second strategy is to use
sampling (oversampling and under-sampling) to alter
the class distribution of the training data. In
costsensitive classi cation, the misclassi cation cost plays
an important role in the learning process. A cost
matrix is used to encode the penalty of misclassifying an
example from one class as another [Dom99]. Table 1
represents a misclassi cation cost matrix, used to
obtain the cost of a false positive (FP), false negative
(FN), true positive (TP), and true negative (TN).</p>
        <sec id="sec-3-1-1">
          <title>Actual</title>
          <p>Positive
Negative</p>
          <p>The positive class is the most expensive class and
C(i; j) denote the cost of predicting an instance from
class i as class j. Usually, C(i; i) have a null or a
negative cost and the FN cost is more expensive than a FP
cost (C(0; 1) &gt; C(1; 0)). The best evaluation metrics
in cost sensitive learning is total cost (see equation 1).</p>
          <p>T otal Cost = (F N</p>
          <p>CF N ) + (F P</p>
          <p>CF P )
(1)
The cost-sensitive learning methods can be categorized
into two categories: direct and indirect.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>Direct methods</title>
          <p>In the direct method, the learning algorithm is
itself cost-sensitive (CS). The CS learning algorithms
use the misclassi cation cost during the learning
process. There are several works on cost-sensitive
learning algorithms such as ICET [Tur95], an
evolutionary algorithm using a misclassi cation cost in the
tness function. Many cost sensitive decision tree
approaches were proposed [MZ12, Tur95, DHR+06,
FCPB07, ZL16]. In [KK98], the authors perform a
comparative study of di erent cost-sensitive neural
networks. Other researches propose cost sensitive
ensemble methods [KW14, KWS14, SKWW07, MSV11,
Mar99].</p>
        </sec>
        <sec id="sec-3-1-3">
          <title>Indirect methods</title>
          <p>On the other hand, the indirect methods, called
Cost-sensitive meta-learning, convert existing
costinsensitive learning algorithms into cost-sensitive ones
without modifying them. The cost-sensitive
metalearning technique, propose two major mechanisms:
a pre-process instance sampling or weighting of the
training dataset and a threshold adjusting of the
output of a cost-insensitive algorithm [Zha08]. In this
category, we can cite MetaCost [Dom99] which
manipulate the training set labels, Costing [ZLA03],
Weighting [Tin02]or Empirical Thresholding [SL06].
2.2</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Used algorithms</title>
        <p>Classi cation and regression trees (CART)
Proposed by Breiman et al. [BFOS84], CART is a
binary decision tree. This algorithm processes
continuous and categorical attributes and target. CART
uses the Gini splitting rule to search the best
possible variable to split the node into two child nodes and
grow the trees to their maximum size until no splits
are possible.</p>
        <sec id="sec-3-2-1">
          <title>Bagging</title>
          <p>Bootstrap Aggregation (Bagging), is one of the
earliest and simplest ensemble algorithms [Bre96]. The
learners are tted to bootstrap replicates of the
training set by sampling randomly from original set with
replacement i.e.: an observation xi may appear
multiple times in the sample. After the base learners have
been t, the aggregated response is the majority vote.
Page 57
Hence, Bagging has no memory, it is easily parallelize
(as can be seen in Algorithm 1).</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>Algorithm 1 CS-bagging Algorithm</title>
          <p>Input: S = ((x1; y1); :::; (xm; ym))), P: the number
of classi er to train.
for p:=1 to P do</p>
          <p>Sp = Bootstrap(S), i.i.d. sampling with
replacement from S.
hp = TrainClassi er(St).</p>
          <p>Add hp = to the ensemble.</p>
          <p>end for</p>
        </sec>
        <sec id="sec-3-2-3">
          <title>Boosting</title>
          <p>Proposed by Schapire [SFBL97, SF12], boosting is
a technique for sequentially combining multiple base
classi ers whose combined performance is signi cantly
better than that of any of the base classi ers. Each
base classi er is trained on data that is weighted based
on the performance of the previous classi er and each
classi er votes to obtain a nal decision.</p>
        </sec>
        <sec id="sec-3-2-4">
          <title>CS-CART</title>
          <p>To generate a cost-sensitive CART algorithm Breiman
et al. [BFOS84] modify the class probabilities, P (i)
used in the information gain measure. Instead of
estimating P (i) by Ni=N , it is weighted by the relative
cost.</p>
          <p>P (i) = Cij (Ni=N )= X cost(j)(Nj =N )
j
The cost of misclassifying an example of class j as class
i is : cost(j) = P Cij</p>
        </sec>
        <sec id="sec-3-2-5">
          <title>CS-bagging</title>
          <p>It learns the di erent individual classi ers then it uses
the available classi ers for a better estimation of the
posterior probabilities according to a voting scheme.
This approach is applicable regardless of the
underlying learning method [Shi15].</p>
        </sec>
        <sec id="sec-3-2-6">
          <title>MetaCost</title>
          <p>This algorithm was proposed by [Dom99]. MetaCost
estimates the class probabilities then relabel the
training instances to minimize the expected cost. Finally,
a new classi er is built on the relabeled dataset.</p>
          <p>We test di erent combinations of the former
algorithms:</p>
          <p>The insensitive cost classi ers: CART,
BAGGING of CART, BOOSTING of CART.</p>
        </sec>
        <sec id="sec-3-2-7">
          <title>Algorithm 2 CS-bagging Algorithm</title>
          <p>Input: S = ((x1; y1); :::; (xm; ym))), P: the number
of classi er to train.
for p:=1 to P do</p>
          <p>Sp = Bootstrap(S), i.i.d. sampling with
replacement from S.
hp = TrainClassi er(St).</p>
          <p>Add hp = to the ensemble.
end for
for p:=1 to P do</p>
          <p>Y^p(w) =Set the prediction with hp .
end for
According to the proportions observed on the
P 0s prediction, we have an estimate of P (Y =
yk=X(w)).</p>
          <p>Make the prediction which minimizes the cost.</p>
        </sec>
        <sec id="sec-3-2-8">
          <title>Algorithm 3 MetaCost Algorithm</title>
          <p>Input: S = ((x1; y1); :::; (xm; ym))), L: cost matrix,
H : classi er.</p>
          <p>Estimate the class probabilities P (yijxi).</p>
          <p>Relabel yi = argmin P j = 1kP (jjxi)L( ; j)8i .
T = H(x; y).</p>
          <p>Output: T .</p>
          <p>One phase Cost classi ers : CS-CART,
BAGGING CS-CART, CS-BAGGING CART, MULTI
COST CART, BOOSTING CS-CART.</p>
          <p>Two phases Cost classi er: CS-BAGGING
CSCART
3
3.1</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimentation</title>
      <sec id="sec-4-1">
        <title>Dataset</title>
        <p>The empirical evaluation was made on the German
credit scoring dataset from the UCI Machine Learning
Repository. This dataset consists of 20 features and
1000 instances including 700 instances of credit-worthy
applicants and 300 instances of insolvent customers
who should not have been granted credit. This dataset
is provided with a cost matrix,</p>
        <sec id="sec-4-1-1">
          <title>Actual</title>
          <p>Insolvent
Creditworthy
Page 58</p>
          <p>Table 3 presents the general results of the nine
algorithms. Following the recommendation of [TSTL12],
we employ the non-parametric Freidman test to
compare the classi ers. The Friedman test ranks the
algorithms; to the best performing is the rank of 1, the
second best is the rank 2, etc. The last column depicts
the statistical test.</p>
          <p>A number of conclusions emerge from this table.
First, it emphasizes the superiority of ensemble
methods compared to the individual classi er. When we
consider the classi cation error, the best performances
are reached by classical bagging and boosting.
However, these algorithms focus on improving the classi
cation accuracy at the expense of the minority class.
So, they obtain a low sensibility which increases the
cost.</p>
          <p>On the other hand, a CS-bagging of CS-CART
obtains the lowest misclassi cation cost followed by the
individual classi er CS-CART. In this case, the
statistical improvement is not signi cant (just 1.19). We
can consider that this little improvement not worth
the the computational cost. However, in some cases a
small gain in performances represents a great gain in
economical bene ts. Albeit this technique obtains the
highest classi cation error.</p>
          <p>Figure 1 compares the average results of the
different methods. In gure 2 and 3, we can see the
results error vs cost and speci city vs sensitivity for each
classi er. Considering those values, we can suppose
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In recent years, the number of insolvent loans has
increased due to the nancial crisis. It becomes
necessary for banks to nd new methods for the evaluation
credit application. Machine learning techniques have
been used to perform nancial decision making.
However, these methods intended to minimize the
misclassi cation error and assume that the di erent errors
are equals. The cost sensitive techniques are used to
handle the misclassi cation cost in many real world
problems.</p>
      <p>In this paper, we compare the performance of di
erent cost-sensitive and cost-insensitive ensemble
algorithms in determining the creditworthiness of an
individual. The experiments drew the following
conclusions (1) the ensemble approaches obtain better
results than individual classi er; (2) the insensitive
approaches reached the best classi cation accuracy but
since the class distribution is highly imbalanced the
minority class (insolvent loan) is less well recognized;
(3) the cost sensitive approaches intended to reduce
the cost at the expense of the accuracy.</p>
      <p>Finally, we found that the cost sensitive bagging
algorithm o ers the best trade-o between accuracy and
misclassi cation cost. For future research, we aim to
use techniques to handle imbalanced datasets and
experiment with other cost sensitive algorithms.
Page 59
Page 60
[AEW13]</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>Anatomy of the credit score</article-title>
          .
          <source>Journal of Economic Behavior &amp; Organization</source>
          ,
          <volume>95</volume>
          :
          <fpage>175</fpage>
          {
          <fpage>185</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>R.</given-names>
            <surname>Alejo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Garc</surname>
          </string-name>
          <string-name>
            <surname>a</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. I.</given-names>
            <surname>Marques</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. A</surname>
          </string-name>
          . Antonio-Velazquez.
          <article-title>Making accurate credit risk predictions with cost-sensitive mlp neural networks</article-title>
          .
          <source>In Jorge Casillas</source>
          , Francisco J. Mart nezLopez,
          <string-name>
            <surname>Rosa Vicari</surname>
          </string-name>
          , and Fernando De la Prieta, editors,
          <source>Management Intelligent Systems</source>
          , pages
          <fpage>1</fpage>
          <lpage>{</lpage>
          8, Heidelberg,
          <year>2013</year>
          . Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Alejandro</given-names>
            <surname>Correa</surname>
          </string-name>
          <string-name>
            <surname>Bahnsen</surname>
          </string-name>
          , Djamila Aouada, and
          <string-name>
            <given-names>Bjrn</given-names>
            <surname>Ottersten</surname>
          </string-name>
          .
          <article-title>Example-dependent cost-sensitive logistic regression for credit scoring</article-title>
          .
          <source>In 13th International Conference on Machine Learning and Applications</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Alejandro</given-names>
            <surname>Correa</surname>
          </string-name>
          <string-name>
            <surname>Bahnsen</surname>
          </string-name>
          , Djamila Aouada, and
          <string-name>
            <given-names>Bjrn</given-names>
            <surname>Ottersten</surname>
          </string-name>
          .
          <article-title>Example-dependent cost-sensitive decision trees</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>42</volume>
          (
          <issue>19</issue>
          ):
          <volume>6609</volume>
          {
          <fpage>6619</fpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Stone</surname>
          </string-name>
          .
          <article-title>Classi cation And Regression Trees</article-title>
          .
          <source>Chapman and Hall</source>
          , New York,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>L.</given-names>
            <surname>Breiman</surname>
          </string-name>
          .
          <article-title>Bagging predictors</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>24</volume>
          :
          <fpage>123</fpage>
          {
          <fpage>140</fpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>J.V.</given-names>
            <surname>Davis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.J.</given-names>
            <surname>Rossbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.E.</given-names>
            <surname>Ramadan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Witchel</surname>
          </string-name>
          .
          <article-title>Cost-sensitive decision tree learning for forensic classi cation</article-title>
          .
          <source>In Proceedings of the 17th European Conference on Machine Learning,, page 622629</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Pedro</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Domingos</surname>
          </string-name>
          .
          <article-title>Metacost: a general method for making classi ers cost-sensitive</article-title>
          .
          <source>In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          , pages
          <volume>155</volume>
          {
          <fpage>164</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Charles</given-names>
            <surname>Elkan</surname>
          </string-name>
          .
          <article-title>The foundations of cost-sensitive learning</article-title>
          .
          <source>In Proceedings of the 17th International Joint Conference on Arti cial Intelligence - Volume 2, IJCAI'01</source>
          , pages
          <fpage>973</fpage>
          {
          <fpage>978</fpage>
          , San Francisco, CA, USA,
          <year>2001</year>
          . Morgan Kaufmann Publishers Inc.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Data</given-names>
            <surname>Warehousing and Knowledge Discovery</surname>
          </string-name>
          , page
          <volume>303312</volume>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Yeonkook J. Kim</surname>
            , Bok Baik, and
            <given-names>Sungzoon</given-names>
          </string-name>
          <string-name>
            <surname>Cho</surname>
          </string-name>
          .
          <article-title>Detecting nancial misstatements with fraud intention using multi-class cost-sensitive learning</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>62</volume>
          :
          <fpage>32</fpage>
          {
          <fpage>43</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Kukar</surname>
          </string-name>
          and I. Kononenko.
          <article-title>Cost-sensitive learning with neural networks</article-title>
          .
          <source>In Proceedings of the Thirteenth European Conference on Arti cial Intelligence</source>
          , Chichester, NY.,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Bartosz</given-names>
            <surname>Krawczyk</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michal</given-names>
            <surname>Wozniak</surname>
          </string-name>
          .
          <article-title>Evolutionary cost-sensitive ensemble for malware detection</article-title>
          . In International Joint Conference SOCO'
          <fpage>14</fpage>
          - CISIS'
          <fpage>14</fpage>
          -ICEUTE'
          <volume>14</volume>
          , pages
          <fpage>433</fpage>
          {
          <fpage>442</fpage>
          ,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Bartosz</given-names>
            <surname>Krawczyk</surname>
          </string-name>
          , Micha Woniak, and
          <string-name>
            <given-names>Gerald</given-names>
            <surname>Schaefer</surname>
          </string-name>
          .
          <article-title>Cost-sensitive decision tree ensembles for e ective imbalanced classi cation</article-title>
          .
          <source>Applied Soft Computing</source>
          ,
          <volume>14</volume>
          :
          <fpage>554</fpage>
          {
          <fpage>562</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>D.D.</given-names>
            <surname>Margineantu</surname>
          </string-name>
          .
          <article-title>Building ensembles of classi ers for loss minimization</article-title>
          .
          <source>In Proceedings of the 31st Symposium on the Interface, Models, Predictions and Computing</source>
          , pages
          <volume>190</volume>
          {
          <fpage>194</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <source>[Mar02] [MSV11] [MZ12] [SF12] [SFBL97] [Shi15] [SKWW07] [SL06] [Tin02] [TSTL12] [Tur95] [Tur00] [XLL17] [Yu17] [Zha08] [ZL16] [ZLA03] Dragos Dorin Margineantu</source>
          .
          <article-title>Methods for Costsensitive Learning</article-title>
          .
          <source>PhD thesis</source>
          , Oregon State University, Corvallis,
          <string-name>
            <surname>OR</surname>
          </string-name>
          , USA,
          <year>2002</year>
          . AAI3029569.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Masnadi-Shirazi</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Vasconcelos</surname>
          </string-name>
          .
          <article-title>Costsensitive boosting</article-title>
          .
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          ,
          <volume>33</volume>
          (
          <issue>2</issue>
          ):
          <fpage>294309</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>F.</given-names>
            <surname>Min</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhu</surname>
          </string-name>
          .
          <article-title>A competition strategy to costsensitive decision trees</article-title>
          .
          <source>Rough Sets and Knowledge Technology</source>
          , pages
          <volume>359</volume>
          {
          <fpage>368</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>R.E.</given-names>
            <surname>Schapire</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Freund</surname>
          </string-name>
          . Boosting:
          <article-title>Foundations and Algorithms</article-title>
          . The MIT Press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>Boosting the margin: a new explanation for the e ectiveness of voting methods</article-title>
          .
          <source>In Machine Learning: Proceedings of the Fourteenth International Conference</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>S.A.</given-names>
            <surname>Shilbayeh</surname>
          </string-name>
          .
          <article-title>Cost sensitive meta learning</article-title>
          .
          <source>PhD thesis</source>
          , School of computing, science and engineering university of salford manchester, UK,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>and Yang</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Cost-sensitive boosting for classi cation of imbalanced data</article-title>
          .
          <source>Pattern Recognition</source>
          ,
          <volume>40</volume>
          (
          <issue>12</issue>
          ):
          <volume>3358</volume>
          {
          <fpage>3378</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>V. S.</given-names>
            <surname>Sheng</surname>
          </string-name>
          and
          <string-name>
            <given-names>C. X.</given-names>
            <surname>Ling</surname>
          </string-name>
          .
          <article-title>Thresholding for making classi ers cost-sensitive</article-title>
          .
          <source>In Proceedings of the st national conference on arti cial intelligence</source>
          , Boston, Massachusetts,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>K. M. Ting</surname>
          </string-name>
          .
          <article-title>An instance-weighting method to induce cost-sensitive trees</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>14</volume>
          (
          <issue>3</issue>
          ):
          <volume>659</volume>
          {
          <fpage>665</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Bogdan</given-names>
            <surname>Trawinski</surname>
          </string-name>
          , Magdalena Smetek, Zbigniew Telec, and
          <string-name>
            <given-names>Tadeusz</given-names>
            <surname>Lasota</surname>
          </string-name>
          .
          <article-title>Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms</article-title>
          .
          <source>Applied Mathematics and Computer Science</source>
          ,
          <volume>22</volume>
          (
          <issue>4</issue>
          ):
          <volume>867</volume>
          {
          <fpage>881</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>P.</given-names>
            <surname>Turney</surname>
          </string-name>
          .
          <article-title>Cost-sensitive classi cation: Empirical evaluation of a hybrid genetic decision tree induction algorithm</article-title>
          .
          <source>Journal of Arti cial Intelligence Research (JAIR)</source>
          ,
          <volume>2</volume>
          :
          <fpage>369</fpage>
          {
          <fpage>409</fpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Peter D. Turney</surname>
          </string-name>
          .
          <article-title>Types of cost in inductive concept learning</article-title>
          .
          <source>In Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning</source>
          , volume cs.
          <source>LG/0212034</source>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <given-names>Yufei</given-names>
            <surname>Xia</surname>
          </string-name>
          , Chuanzhe Liu, and Nana Liu.
          <article-title>Costsensitive boosted tree for loan evaluation in peer-topeer lending</article-title>
          .
          <source>Electronic Commerce Research and Applications</source>
          ,
          <volume>24</volume>
          :
          <fpage>30</fpage>
          {
          <fpage>49</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <given-names>Xiaojiao</given-names>
            <surname>Yu</surname>
          </string-name>
          .
          <article-title>Machine learning application in online lending risk prediction</article-title>
          .
          <source>ArXiv</source>
          e-prints,
          <year>July 2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <given-names>Huimin</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Instance weighting versus threshold adjusting for cost-sensitive classi cation</article-title>
          .
          <source>Knowledge and Information Systems</source>
          ,
          <volume>15</volume>
          (
          <issue>3</issue>
          ):
          <volume>321</volume>
          {
          <fpage>334</fpage>
          ,
          <string-name>
            <surname>Jun</surname>
          </string-name>
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhao</surname>
          </string-name>
          and
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <article-title>A cost sensitive decision tree algorithm based on weighted class distribution with batch deleting attribute mechanism</article-title>
          .
          <source>Information sciences</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <article-title>Cost-sensitive learning by cost-proportionate example weighting</article-title>
          .
          <source>In Proceedings of the Third IEEE International Conference on Data Mining, ICDM '03</source>
          , pages
          <fpage>435</fpage>
          {, Washington, DC, USA,
          <year>2003</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>