<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1662-7482</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.4028/www.scientific.net/AMM</article-id>
      <title-group>
        <article-title>Measuring the Risk of Public Contracts Using Bayesian Classifiers</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Leonardo J. Sales</string-name>
          <email>leonardo.sales@cgu.gov.br</email>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rommel N. Carvalho</string-name>
          <email>rommel.carvalho@cgu.gov.br</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Campus Darcy Ribeiro Bras ́ılia</institution>
          ,
          <addr-line>DF</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Campus Darcy Ribeiro Bras ́ılia</institution>
          ,
          <addr-line>DF</addr-line>
          ,
          <country country="BR">Brazil</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Computer Science at the University of Bras ́ılia</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Department of Economics at the University of Bras ́ılia</institution>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Department of Research and Strategic Information at the Brazilian Office of the Comptroller General</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>333</fpage>
      <lpage>335</lpage>
      <abstract>
        <p>Bayesian Classifiers are widely used in machine learning supervised models where there is a reasonable reliability in the dependent variable. This work aims to create a risk measurement model of companies that negotiate with the government using indicators grouped into four risk dimensions: operational capacity, history of penalties and findings, bidding profile, and political ties. It is expected that this model contributes to the selection of contracts to be audited under the central unit of internal control of the Brazilian government, responsible for auditing more than 30,000 public contracts per year.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        INTRODUCTION
Public contracts can be understood as adjustments made
between public administration and private sector for the
attainment of public interest objectives
        <xref ref-type="bibr" rid="ref9">(Di Pietro, 1999)</xref>
        .
The contract terms are set by the governmental unit, this
being understood as any body or public authority of
federal, municipal, or state level.
      </p>
      <p>Government spending coming from public contracts and
direct purchases of goods and services account for
approximately 19% of the Brazilian GDP in recent years.
Data from the Brazilian Institute of Geography and
Statistics (IBGE), published in National Accounts
Report in the 2015 fourth quarter, quantifies in R$ 1.07
trillion the amount of government consumption expenditure
in that year (IBGE, 2016). The bidding and procurement
are the institutional means by which consumption
materializes, having important role in the search for efficiency
and effectiveness of public spending.</p>
      <p>Given the huge number of contracts and purchasing
processes to audit, this context raises the challenge of
acting effectively in the pursuit of management problems,
fraud, and corruption. This is the responsibility of the
governmental control units, which specially in Brazil has
limited resources.</p>
      <p>
        Take the example of the Office of the Comptroller
General (CGU), the central unit of internal control of the
Brazilian federal government, which is responsible for
auditing any transaction that represents federal spending.
The CGU should audit both spending conducted directly
(by the central units of the ministries) as the ones
conducted indirectly (by almost 20,000 decentralized units),
including all payments made by any state or
municipality that receives federal funds through voluntary transfers
        <xref ref-type="bibr" rid="ref5">(Brazil, 2003)</xref>
        . Nevertheless, the CGU has only 1,200
auditors working directly in the oversight of these
expenditures.
      </p>
      <p>In this context, a big issue arises involving the need to
rationalize the use of auditing capabilities. There is a clear
need to optimize the choice of what will be effectively
audited, since the complete census is impossible and
uneconomical. Acting in a preventive way to avoid future
problems is also important since most of the errors found
generate irrecoverable damage, such as paralysis of a
engineering project or the need to redo it.</p>
      <p>
        Both the rationalization of choices (in a subsequent
operation) and the understanding and treatment of
vulnerability (in preventive action) can be analyzed within the
more general concept of risk assessment. After all, what
is sought in both cases is to identify factors or
characteristics of purchases or contracts which increase the
chance of future problems such as mismanagement or
even fraud.
Supervised learning models have been used in similar
problems in private sector. Financial institutions assess
the risk of potential borrowers, among many suitors with
different characteristics and history using such models,
in this case called credit scoring (Lessmann et al., 2015).
Insurance companies also use such statistical models to
assign the value of insurance for a certain good. The
techniques learn from the transaction history and
quantify the weight of certain characteristics in determining
the risk of a client or specific process. Thus, the auto
insurance company knows that unmarried young men offer
more risk than married women with children.
In practice, these models are applications of statistical
and computational techniques of regression and
classification using databases that have information of
transaction history and labeled cases of “success” and
“failure”
        <xref ref-type="bibr" rid="ref10">(Friedman et al., 2001)</xref>
        . A good condition in the
construction of this type of risk analysis model is the
existence of information on transaction history, with
variables representing different characteristics of each
transaction. Thus, one can distinguish and identify
correlations between groups.
      </p>
      <p>This paper proposes to create a predictive model of risk
in contracts based on Bayesian classifiers. It will
result in the quantification of the propensity that a
supplier has problems in government contracts, according
to the company’s characteristics. Learning models using
Bayesian networks are especially useful when you need
to organize or discover the knowledge of a particular area
through the construction of cause and effect relationships
captured from a set of data (Spiegelhalter et al., 1993).
Besides this, Bayesian Classifiers have been
incorporated into risk measurement studies, especially when it
is important to capture and explain the relationships of
cause and effect between the different prediction
parameters, avoiding the “black box” issue, common in other
techniques.</p>
      <p>The model will be used to select high-risk contracts to
be audited by the CGU and will be based on the
estimation of the relations of cause and effect between various
indicators that are related to the propensity of
contractual risk. The dependent variable is the occurrence of
more severe punishment that can be given to a supplier
in Brazil: the impediment to bidding. The indicators
that will be used as predictors represent characteristics
grouped into four risk dimensions: operational capacity,
history of penalties and findings, bidding profile, and
political ties.</p>
      <p>This work is divided into 5 sections. Besides this
introduction, Section 2 presents the theoretical framework
that supports the central idea of the work and the
methodological approach adopted. Section 3 contains the
details of the methodology used in the study, including
the understanding of data modeling, the creation of the
networks, and the validation of the models. Section 4
presents and discusses the results. Finally, Section 5
provides conclusions and considerations on gaps and
opportunities for future work.
2</p>
      <p>THEORETICAL REFERENCES
In this section we describe the public bidding process
in Brazil, the Bayesian classifiers used for learning the
predictive models, and some related works.
2.1</p>
      <p>
        PUBLIC BIDDING IN BRAZIL
The whole process of buying products or hiring services
in the Brazilian federal government takes place
according to the rules of Law 8666/1993
        <xref ref-type="bibr" rid="ref3">(Brazil, 1993)</xref>
        , called
Procurement Law. Other regulatory acts complement
this law, such as Law 10520/2005
        <xref ref-type="bibr" rid="ref4">(Brazil, 2002)</xref>
        ,
establishing the types of Auction and Complementary Law
123/2006
        <xref ref-type="bibr" rid="ref6">(Brazil, 2006)</xref>
        establishing privileges for
micro and small businesses in bidding. Law 8666/1993
        <xref ref-type="bibr" rid="ref3">(Brazil, 1993)</xref>
        details the stages of the bidding process
itself, the bidding types allowed, types of contracts,
aspects of qualification of companies, and also defines
administrative and criminal penalties to be applied to
suppliers in case of noncompliance.
      </p>
      <p>The Procurement Law, together with other mentioned
legislation, defines the following administrative penalties
to suppliers, due to total or partial non-performance of
contracts:
• warning;
• pecuniary penalty;
• temporary suspension of bid;
• declaration of non-trustworthiness; and
• impediment to bid and hire.</p>
      <p>The whole process of procurement and contracting in the
federal government is done using the government’s
General Services Administration System (SIASG). Each
purchase or contract is recorded in this system, since the
opening of the process to the issue of commitment.
Existing since 1994, the SIASG started to be used by the
government gradually and it already has more than 5
million purchases. All federal administration is required to
use this system. Annually it records over 700,000 bids.
Some of these bids representing continued provision of
services or delivery of goods turns into contracts,
generating nearly 30,000 new contracts per year.</p>
    </sec>
    <sec id="sec-2">
      <title>BAYESIAN CLASSIFIERS MODELS</title>
      <p>
        Since Bayesian networks (BNs) have been successfully
used in classification problems – e.g., see
        <xref ref-type="bibr" rid="ref11 ref12 ref13 ref17 ref7 ref8">(Sahami et al.,
1998; Friedman et al., 1997; Goldszmidt et al., 2010;
Friedman and Goldszmidt, 1996; Cheng and Greiner,
1999; Ceccon et al., 2014; Ye et al., 2014; Shi et al.,
2013)</xref>
        –, we decided to experiment with different BN
learning algorithms in order to classify the companies
that sell service and goods to the government with high
likelihood of noncompliance.
      </p>
      <p>Score-based learning is a popular method for inducing
BNs. The main idea is to assign a score to a model based
on how well it represents the data set used for learning.
Thus, the purpose of the algorithm is to maximize the
goodness-of-fit score.</p>
      <p>
        In this work we use standard and well-known Bayesian
network classifiers, which are aimed at classification.
More specifically, we use two algorithms available in the
bnlearn R package1
        <xref ref-type="bibr" rid="ref15">(Scutari, 2009)</xref>
        :
• Na¨ıve Bayes (naive.bayes): a simple algorithm
that assumes that all explanatory variables are
independent of each other. In other words, the target
variable is the only parent of all other variables.
• Tree-Augmented Na¨ıve Bayes (tree.bayes): an
algorithm that relaxes the simple Na¨ıve Bayes
assumption of independence, by allowing the
explanatory variables to have one other variable as parent
besides the target one.
      </p>
      <p>
        Besides that, we also tried two different score-based
learning algorithms, which are also available in the
bnlearn R package used in this work
        <xref ref-type="bibr" rid="ref15">(Scutari, 2009)</xref>
        :
• Hill-Climbing (hc): a hill climbing greedy search
on the space of the directed graphs.
• Tabu Search (tabu): a modified hill-climbing able
to escape local optima.
      </p>
      <p>
        The bnlearn package implements random restart with
configurable perturbing operations for both algorithms.
A number of different scores were used to fine tune the
models learned from the score-based algorithms and to
improve their performance, which are also available in
the bnlearn package
        <xref ref-type="bibr" rid="ref15">(Scutari, 2009)</xref>
        :
• the Akaike Information Criterion score (aic);
• the Bayesian Information Criterion score (bic);
1The package is available at http://www.bnlearn.com/.
• the logarithm of the Bayesian Dirichlet equivalent
score (bde); and
• the logarithm of the modified Bayesian Dirichlet
equivalent score (mbde).
2.3
      </p>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORKS</title>
      <p>
        Many studies use supervised learning models in order to
predict risk in business transactions. The area where it
is more common this type of approach is the bank credit
        <xref ref-type="bibr" rid="ref12">(Lessmann et al., 2015; Hand and Henley, 1997)</xref>
        .
These learning models attempt to quantify how the
characteristics of potential borrowers influence the
probability of default. Classically, the techniques most used for
this purpose are Logistic Regression and Discriminant
Analysis (Ghodselahi, 2011). Other studies have been
testing and comparing some modern techniques
        <xref ref-type="bibr" rid="ref1">(Baesens
et al., 2002)</xref>
        . In other areas, such as insurance, such
models are also widely used.
      </p>
      <p>
        Bayesian Classifiers have been incorporated into these
studies, especially when you want to capture and explain
the relationships of cause and effect between the
different prediction parameters, avoiding the “black box”
issue, common in other techniques
        <xref ref-type="bibr" rid="ref1 ref18">(Jiang and Wu, 2009;
Zonneveldt et al., 2010; Baesens et al., 2002)</xref>
        . Bayesian
algorithms provide more clear insights when modeling
causal relationships.
      </p>
      <p>
        A new approach to credit scoring by synthesizing
Simple Na¨ıve Bayesian Classifier (SNBC) and the Rough Set
Theory is presented by (Jiang and Wu, 2009). A
comparison between Na¨ıve Bayes (NB) models, different
augmented NB models, and a handcrafted causal network is
made by
        <xref ref-type="bibr" rid="ref18">(Zonneveldt et al., 2010)</xref>
        .
      </p>
      <p>
        In the context of public procurement, some initiatives
already exist in order to implement similar models in
predicting irregularities or contractual problems. For
example, Na¨ıve Bayes algorithms are used by
        <xref ref-type="bibr" rid="ref2">(Balaniuk et al.,
2012)</xref>
        in an unsupervised approach to quantify the
combined risk of private companies and government units in
the execution of contracts.
        <xref ref-type="bibr" rid="ref14">(Sales, 2014)</xref>
        built a model with the same objective of
this work (to measure the risk of public contracts) and
with similar data. In that case the accuracy using Logistic
Regression and Decision Tree were compared, resulting
in the best accuracy of 64%.
3
      </p>
      <p>METHODS AND PROCEDURES
The first step in building the Bayesian classification
model was the definition of the criteria for
characterization of the companies with the highest risk (the “Bad”).
In this sense, we chose to characterize the “Bad” group
all companies that suffered the following punishments in
the years 2015 and 2016: temporary suspension of bid,
declaration of non-trustworthiness, and impediment to
bid and hire. The group of low-risk companies
(hereinafter “Good”) are companies with existing contracts in
the same period but without such punishment.
The database used contained 1,448 companies, of which
724 were previously classified as “Bad” and other 724
previously classified as “Good”2.</p>
      <p>From this initial setting, the second step was the creation
of risk indicators, which cover the past of relations
between companies and government, considering the
period since 2011, as well as other information that are
independent of the period, such as those from the
registry of companies. The idea is to answer the following
question: What happened in the recent past of the
companies that contributed to its contractual default in 2015
and 2016?
These indicators were obtained from the four dimensions
of risk: operational capacity, history of penalties and
findings, bidding profile, and political ties. The
meaning of each of the risk dimensions and some indicators
used are described below:
• Operational capacity: irregularities related to the
existence or insufficient physical and operational
structure of the contracted company.</p>
      <p>– Quantity of indicators: 11.
– Examples of indicators: number of
employees, number of partners, the total amount
received from the government, amount received
from the government per employee, value
received from the government for partner,
average salary of employees, average salary of
the partners, company size, number of
activities carried out by the company, age from the
company.
• History of penalties and findings: pre-existence of
punishment or audit findings related to the
company.</p>
      <p>– Quantity of indicators: 04.
– Examples of indicators: quantity of received
punishments, number of alerts generated in
CGU monitoring.</p>
      <p>2The 724 companies in the “Bad” group are all companies
that meet the criteria described for this class. The 724
companies in “Good” group was obtained by sampling in the set of
41,000 companies that meet the requirements described.
Sampling the second group was made in order to solve the dominant
class issue, in a process called undersampling (see (Japkowicz
et al., 2000) for more details of this process).
• Bidding profile: company profile when
participating in bids, as the average quantity of offers, and the
degree of success of business (percentage of wins).
– Quantity of indicators: 12.
– Examples of indicators: quantity of purchases,
purchase quantity of items, average amount of
offers, number of units of the federation,
number of wins, percentage of victory, value of
contracts, the difference in days between the
opening of the company and the first
participation in a public procurement.
• Political ties: company relationship with
politicians, via donations in campaigns.</p>
      <p>– Quantity of indicators: 01.
– Examples of indicators: amount donated in
political campaigns.</p>
      <p>The next step was the transformation of all variables in
factors (categories), using a simple process of
discretization, where values of each variable were divided into
three intervals of equal size. Once complete, the database
has been divided in training set (70%) and test (30%).
The discretization was carried out due to the limitation
of some algorithms used. In future experiments, we will
learn models using algorithms that allows continous
variables.</p>
      <p>At first, we used standard Bayesian classifiers available
in the bnlearn R package, Na¨ıve Bayes (NB) and
TreeAugmented Na¨ıve Bayes (TAN).</p>
      <p>As the database does not have a very large number of
observations, we used a process of estimation with
crossvalidation in the training subset for both algorithms. The
Cross-Validation procedure applied was the random
division of training based on 10 sample partitions of equal
size, for use in cycles of modeling where 9 partitions are
used for training and one for testing. Error measures are
then combined to have a single measurement error.
The estimation with cross-validation was performed
using a Score-based learning algorithm, which ranks the
network structures created with emphasis on model fit.
In these algorithms, various parameters can be adjusted
in search of the best results forecast.</p>
      <p>The loss function used to measure the model results was
the misclassification, where the dependent variable value
is the result of local distributions (from its parents) and
the error function is measured by coincidence or not with
the actual values (hit rate).</p>
      <p>Since an important aspect of machine learning is the
parameter tuning and both NB and TAN in bnlearn do
not have any parameters to be tuned, we decided to also
try another set of algorithms. In bnlearn, a set of
algorithms that allow many different configurations is the
score-based learning algorithms, namely: Hill-Climbing
(HC) and Tabu Search (Tabu), both using incremental
search. Tabu introduces changes in HC in order to avoid
local optima.</p>
      <p>In score-based algorithms, it is critical to set the network
score calculation method, which measures the quality of
the network created using the quantification of
posterior probability. Two variables were used in the score
parameterization: type of score and penalty parameter.
The tested scores types were AIC (Akaike Information
Criterion Score), BIC (Bayesian Information Criterion
Score), BDE (Bayesian Dirichlet Equivalent Score), and
MBDE (Modified Bayesian Dirichlet Equivalent Score),
suitable for categorical variables. Besides that, we also
tried many different penalty parameters.</p>
      <p>The central idea was to try different values of each
parameter in order to find the setting that present the best
predictive ability. For better understanding, Table 1
shows some of these tested settings and its accuracy
measure, aiming to compare the Na¨ıve Bayes (NB) algorithm
setting with different configurations3 of Score-Based
algorithms.</p>
      <p>
        RESULTS
Since the best models did not present a statistically
significant difference in performance and usually the
simpler the model the better the generalization, we chose
the Na¨ıve Bayes algorithm to run the final model with
all the data from the training set in order to check the
3The parameters used to set the algorithm were the
scorebased algorithm, Hill-Climbing (HC) or Tabu Search (Tabu),
the score types (AIC, BIC, BDE or MBDE) and the penalty
parameter (ISS or K).
performance with the test set. The 95% confidence
interval of the accuracy was (0.69, 0.77), which shows that
the model generalizes well. The sensitivity of the model
(prediction ability of “BAD” companies) was 76%.
Table 2 shows the results of prediction on the test set.
We consider this a good result in the context of
government contracts, especially when compared with other
similar works. Taking as reference the results obtained
by
        <xref ref-type="bibr" rid="ref14">(Sales, 2014)</xref>
        , you can see a reasonable gain in
predictive ability. The sensitivity of the model is particularly
important since what really matters is the identification
of high-risk cases, even assuming the cost of auditing
some low risk contracts, which were misclassified.
5
      </p>
      <p>CONCLUSION AND FUTURE WORK
This work is consistent with a great effort that has been
developed by government control institutions to
rationalize the use of their human and material resources in order
to provide more effective results at lower operating and
financial costs.</p>
      <p>Considering the current Brazilian context, where a
severe economic crisis has been treated through large cuts
in public budgets (reducing the sending of resources to
control bodies), the efficient use of resources should be a
permanent goal.</p>
      <p>
        The attempt to use statistical models based on Bayesian
networks is in addition to other initiatives presented in
Section 2. The main purpose of these studies is to extract
knowledge from various databases that government
control institutions have access in order to facilitate the
selection of audit objects more likely to present problems.
The classification results are slightly better than other
supervised models applied in government databases with
the same goal (see
        <xref ref-type="bibr" rid="ref14">(Sales, 2014)</xref>
        , described in section
2.3). However, we believe that there is room for
improvement in two possible ways: the inclusion of new
indicators that capture aspects ignored by this model and
the use of optimization algorithms in the
parameterization of score-based networks.
Each step in direction of improving these models is a
permanent gain for the public auditing activity, and
consequently to society.
http://link.springer.com/article/
10.1023/A%3A1007465528199.
      </p>
      <p>Ahmad Ghodselahi. A hybrid support vector machine
ensemble model for credit scoring. International
Journal of Computer Applications, 17(5):1–5, 2011.
Moises Goldszmidt, James J. Cochran, Louis A.</p>
      <p>Cox, Pinar Keskinocak, Jeffrey P. Kharoufeh,
and J. Cole Smith. Bayesian network
classifiers. In Wiley Encyclopedia of Operations
Research and Management Science. John Wiley &amp;
Sons, Inc., 2010. ISBN 9780470400531. URL
http://onlinelibrary.wiley.com/doi/
10.1002/9780470400531.eorms0099/
abstract.</p>
      <p>David J Hand and William E Henley. Statistical
classification methods in consumer credit scoring: a
review. Journal of the Royal Statistical Society: Series
A (Statistics in Society), 160(3):523–541, 1997.
IBGE. Indicadores do Instituto Brasileiro de Geografia</p>
      <p>Estatstica, Contas Nacionais Trimestrais, 2016.
Nathalie Japkowicz et al. Learning from imbalanced
data sets: a comparison of various strategies. In AAAI
workshop on learning from imbalanced data sets,
volume 68, pages 10–15. Menlo Park, CA, 2000.
Yi Jiang and Li Hua Wu. Credit scoring model based
on simple naive bayesian classifier and a rough set.
In 2009 International Conference on Computational
Intelligence and Software Engineering, 2009.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Bart</given-names>
            <surname>Baesens</surname>
          </string-name>
          , Michael Egmont-Petersen,
          <string-name>
            <given-names>Robert</given-names>
            <surname>Castelo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Jan</given-names>
            <surname>Vanthienen</surname>
          </string-name>
          .
          <article-title>Learning bayesian network classifiers for credit scoring using markov chain monte carlo search</article-title>
          .
          <source>In Pattern Recognition</source>
          ,
          <year>2002</year>
          . Proceedings. 16th International Conference on, volume
          <volume>3</volume>
          , pages
          <fpage>49</fpage>
          -
          <lpage>52</lpage>
          . IEEE,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Remis</given-names>
            <surname>Balaniuk</surname>
          </string-name>
          , Pierre Bessiere, Emmanuel Mazer, and
          <string-name>
            <given-names>Paulo</given-names>
            <surname>Cobbe</surname>
          </string-name>
          .
          <article-title>Risk based Government Audit Planning using Nave Bayes Classifiers</article-title>
          .
          <source>In Advances in Knowledge-Based and Intelligent Information and Engineering Systems</source>
          ,
          <year>2012</year>
          . URL https://hal. archives-ouvertes.fr/hal-00746198/.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Brazil</surname>
          </string-name>
          . Lei n 8666, de
          <year>1993</year>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Brazil</surname>
          </string-name>
          . Lei n 10520, de
          <year>2002</year>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Brazil</surname>
          </string-name>
          . Lei n 10683, de
          <year>2003</year>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Brazil</surname>
          </string-name>
          . Lei Complementar n 123, de
          <year>2006</year>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Ceccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.F.</given-names>
            <surname>Garway-Heath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.P.</given-names>
            <surname>Crabb</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Tucker</surname>
          </string-name>
          .
          <article-title>Exploring early glaucoma and the visual field test: Classification and clustering using bayesian networks</article-title>
          .
          <source>IEEE Journal of Biomedical and Health Informatics</source>
          ,
          <volume>18</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1008</fpage>
          -
          <lpage>1014</lpage>
          , May
          <year>2014</year>
          . ISSN 2168-
          <fpage>2194</fpage>
          . doi:
          <volume>10</volume>
          .1109/JBHI.
          <year>2013</year>
          .
          <volume>2289367</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Jie</given-names>
            <surname>Cheng</surname>
          </string-name>
          and Russell Greiner.
          <article-title>Comparing bayesian network classifiers</article-title>
          .
          <source>In Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence</source>
          , UAI'
          <volume>99</volume>
          ,
          <string-name>
            <surname>page</surname>
            <given-names>101108</given-names>
          </string-name>
          , San Francisco, CA, USA,
          <year>1999</year>
          . Morgan Kaufmann Publishers Inc.
          <source>ISBN 1-55860-614-9</source>
          . URL http://dl.acm. org/citation.cfm?id=
          <volume>2073796</volume>
          .
          <fpage>2073808</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Maria</given-names>
            <surname>Sylvia Zanella Di Pietro</surname>
          </string-name>
          .
          <source>Direito administrativo</source>
          , volume
          <volume>22</volume>
          .
          <source>Atlas Sa˜o Paulo</source>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Jerome</given-names>
            <surname>Friedman</surname>
          </string-name>
          , Trevor Hastie, and
          <string-name>
            <given-names>Robert</given-names>
            <surname>Tibshirani</surname>
          </string-name>
          .
          <article-title>The elements of statistical learning</article-title>
          , volume
          <volume>1</volume>
          . Springer series in statistics Springer, Berlin,
          <year>2001</year>
          . URL http://statweb.stanford.edu/ ˜tibs/book/preface.ps.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Nir</given-names>
            <surname>Friedman</surname>
          </string-name>
          and
          <string-name>
            <given-names>Moises</given-names>
            <surname>Goldszmidt</surname>
          </string-name>
          .
          <article-title>Building classifiers using bayesian networks</article-title>
          .
          <source>In Proceedings of the national conference on artificial intelligence, page 12771284</source>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Nir</given-names>
            <surname>Friedman</surname>
          </string-name>
          , Dan Geiger, and
          <string-name>
            <given-names>Moises</given-names>
            <surname>Goldszmidt</surname>
          </string-name>
          .
          <article-title>Bayesian network classifiers</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>29</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>131</fpage>
          -
          <lpage>163</lpage>
          ,
          <year>November 1997</year>
          .
          <source>ISSN 0885-6125</source>
          ,
          <fpage>1573</fpage>
          -
          <lpage>0565</lpage>
          . doi:
          <volume>10</volume>
          .1023/A:1007465528199. URL Stefan Lessmann, Bart Baesens,
          <string-name>
            <surname>Hsin-Vonn Seow</surname>
          </string-name>
          , and Lyn C. Thomas.
          <article-title>Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research</article-title>
          .
          <source>European Journal of Operational Research</source>
          ,
          <volume>247</volume>
          (
          <issue>1</issue>
          ):
          <fpage>124</fpage>
          -
          <lpage>136</lpage>
          ,
          <year>November 2015</year>
          . ISSN 03772217. doi:
          <volume>10</volume>
          .1016/j.ejor.
          <year>2015</year>
          .
          <volume>05</volume>
          . 030. URL http://linkinghub.elsevier. com/retrieve/pii/S0377221715004208.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Mehran</given-names>
            <surname>Sahami</surname>
          </string-name>
          , Susan Dumais, David Heckerman,
          <string-name>
            <given-names>and Eric</given-names>
            <surname>Horvitz</surname>
          </string-name>
          .
          <article-title>A bayesian approach to filtering junk e-mail</article-title>
          .
          <source>In Learning for Text Categorization: Papers from the 1998 workshop</source>
          , volume
          <volume>62</volume>
          , page 98105,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Leonardo</given-names>
            <surname>Jorge Sales</surname>
          </string-name>
          .
          <article-title>Risk prevention brazilian government contracts using credit scoring</article-title>
          .
          <source>In Interdisciplinary Insights on Fraud, chapter 11</source>
          , pages
          <fpage>264</fpage>
          -
          <lpage>286</lpage>
          . Cambridge Scholars Publishing,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Marco</given-names>
            <surname>Scutari</surname>
          </string-name>
          .
          <article-title>Learning bayesian networks with the bnlearn r package</article-title>
          .
          <source>arXiv preprint arXiv:0908.3817</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Wei</surname>
            <given-names>Shi</given-names>
          </string-name>
          , Yao Wu Pei, Liang Sun,
          <article-title>Jian Guo Wang, and Shao Qing Ren. The defect identification of LED chips based on bayesian classifier</article-title>
          . Applied Mechanics David J.
          <string-name>
            <surname>Spiegelhalter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Philip</surname>
            <given-names>Dawid</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steffen L. Lauritzen</surname>
          </string-name>
          , and Robert G. Cowell.
          <source>Bayesian Analysis in Expert Systems. Statistical Science</source>
          ,
          <volume>8</volume>
          (
          <issue>3</issue>
          ):
          <fpage>219</fpage>
          -
          <lpage>247</lpage>
          ,
          <year>1993</year>
          . URL http://www.jstor.org/stable/ 2245959.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Ye</surname>
            <given-names>Ye</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuchiang (Rich) Tsui</surname>
            , Michael Wagner, Jeremy U. Espino, and
            <given-names>Qi</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
          </string-name>
          .
          <article-title>Influenza detection from emergency department reports using natural language processing and bayesian network classifiers</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          , pages
          <fpage>amiajnl2013</fpage>
          -
          <lpage>001934</lpage>
          ,
          <year>January 2014</year>
          . ISSN ,
          <fpage>1527</fpage>
          -
          <lpage>974X</lpage>
          . doi:
          <volume>10</volume>
          .1136/amiajnl-2013
          <article-title>-001934</article-title>
          . URL http://jamia.bmj.com/content/early/ 2014/01/09/amiajnl-2013-
          <year>001934</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>S</given-names>
            <surname>Zonneveldt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K</given-names>
            <surname>Korb</surname>
          </string-name>
          , and
          <string-name>
            <given-names>A</given-names>
            <surname>Nicholson</surname>
          </string-name>
          .
          <article-title>Bayesian network classifiers for the german credit data</article-title>
          .
          <source>Technical report, Technical report</source>
          ,
          <year>2010</year>
          /1,
          <string-name>
            <given-names>Bayesian</given-names>
            <surname>Intelligence</surname>
          </string-name>
          . http://www. Bayesian-intelligence. com/publications. php,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>