<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>” in Journal of Physics: Conference Series</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1186/s40854-021-00295-5</article-id>
      <title-group>
        <article-title>Predicting Company Credit Rating Using Artificial Intelligence Techniques from Publicly Available Financial Data*</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Juozas Širmenis</string-name>
          <email>juozas.sirmenis@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mindaugas Kavaliauskas</string-name>
          <email>m.kavaliauskas@ktu.lt</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ingrida Lagzdinytė-Budnikė</string-name>
          <email>ingrida.lagzdinyte@ktu.lt</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Faculty of Informatics, Kaunas University of Technology</institution>
          ,
          <addr-line>Kaunas</addr-line>
          ,
          <country country="LT">Lithuania</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty of Mathematics and Natural Sciences, Kaunas University of Technology</institution>
          ,
          <addr-line>Kaunas</addr-line>
          ,
          <country country="LT">Lithuania</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>IVUS2024: Information Society and University Studies 2024</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>RiskPlanner, UAB Idėjų valda</institution>
          ,
          <addr-line>Kaunas</addr-line>
          ,
          <country country="LT">Lithuania</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>7</volume>
      <issue>1</issue>
      <fpage>826</fpage>
      <lpage>831</lpage>
      <abstract>
        <p>The study focuses on predicting credit rating using statistical methods (Linear and Huber Regressions) and machine learning techniques (Artificial Neural Network and Random Forest) while using publicly available financial data with additionally calculated features. The results show that machine learning techniques outperformed statistical methods significantly. The best results were obtained using the ANN model: MSE reached 0.063, MAE - 0.1858, R² - 0.9065, and RMSE - 0.251. The notable performance improvement across all models was noticed when incorporating additionally derived financial ratios, notwithstanding their derivation from metrics already included in the analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Credit Rating</kwd>
        <kwd>Linear Regression</kwd>
        <kwd>Huber Regression</kwd>
        <kwd>Artificial Neural Network</kwd>
        <kwd>Random Forest</kwd>
        <kwd>Financial Statements</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>In today's world, where countries' borders are less of a barrier to international collaboration and
diseases and military conflicts pose a threat to people and the environment, predicting a partner's
financial behavior is critical. Some researchers work on projecting business defaults [1-3], while
others focus on credit rating and scoring [4-6]. Credit ratings reflect how likely someone is to meet
their financial responsibilities, however they are based on opinion rather than fact[7].</p>
      <p>Based on research, data analysis techniques for evaluating, comparing, or predicting credit risk can
be categorized into two groups: statistical methods and machine learning (ML) methodologies.
According to some papers, combining methods and algorithms could lead to better results [5, 8-10].</p>
      <p>Two of the most popular statistical methods are Logistic Regression (LG) and Discriminant
Analyses (DA). In some papers, they are used to predict bankruptcy [1] or defaults [11] of the
corporates. The others, forecasts the defaults of small and medium enterprise (SME) [2, 3, 12] or uses
to create credit rating and scoring models [5, 10, 13-15]. Also, LG and DA was used to predict bond
ratings [16] or evaluate credit risk in general [8].</p>
      <p>The most popular ML methods for evaluating credit risk, predicting defaults or bankruptcy,
forecasting ratings and scores are: Artificial Neural Network (ANN)[17, 18], Support Vector Machine
(SVM) [15, 16], Decision Trees (DT) [14, 16], Genetic Algorithm [19], Random Forest (RF) [11, 13, 15],
Bayesian techniques [18], Gradient Boosting techniques [19], Multilayer Perceptron [20], new
approach of ANN - Convolutional Neural Networks (CNNs) [20, 21].</p>
      <p>It is also worth noting the custom techniques and architectures that have been developed for
solving similar credit risk problems: hybrid best–worst method (BWM) [22], combination of the deep
neural network and decision tree classifier [23], model made from particle swarm optimization,
random tree and Naïve Bayes techniques [24], the combination of decision trees and logistic
regression - penalized logistic tree regression (PLTR) [25], the variables selection, regressor, and
ordered probit model [26].</p>
      <p>Two statistical methods were chosen for this study: Linear Regression and Huber regression. The
first due to its simplicity and popularity for similar problems, such as modelling the dependence of
bank ratings [27] or predicting companies' credit risk ratings [28]. And the second one, because of its
improvement in terms of finding outliers [29]. Based on the literature analysis, machine learning
methods were chosen: ANN and Random Forest, because they are among the most popular and
promising models in this field.</p>
      <p>For all models, data is crucial. The new regulation of European data aims to open more company
data to the public [30] and it hopes that widely available free data will act as key element to developing
the AI models [31]. We are going to use free and publicly available financial information of Lithuanian
companies, although it is not complete financial statements, only the essentials are provided.</p>
      <p>This research project aims to perform a comprehensive analysis of the performance of different
algorithms and techniques in credit rating prediction but using only publicly available and
free-ofcharge financial data on Lithuanian companies. This task is complicated by the fact that the amount of
this type of data is highly limited and may be restricted to a few financial ratios per company. To
determine the relative performance of traditional statistical methodologies and state-of-the-art
machine learning algorithms in this type of dataset scenario, a comparative analysis is proposed. In
addition, a new approach has been proposed: a combination of the classification and prediction model.</p>
    </sec>
    <sec id="sec-2">
      <title>Materials and Methods</title>
      <p>Financial data for this paper were obtained from Registrų Centras, an official Lithuanian publicly
available data source (https://www.registrucentras.lt/p/1094). The credit rating of the corporations was
determined using the credit risk management tool called RiskPlanner (https://www.riskplanner.io/).
The two statistical methods and two machine learning algorithms were selected to be examined:
Linear Regression (LR), Huber Regression (HR), Artificial Neural Network (ANN), and Random Forest
(RF).</p>
    </sec>
    <sec id="sec-3">
      <title>Dataset</title>
      <p>The dataset consists 7 features from Financial Statement (FS) data: Sales (ISLT00001), Net Profit
(Loss) (ISLT00019), Profit/Loss Before Tax (ISLT00017), Short Term Assets (BSLT00021), Long Term
Assets (BSLT00001), Amounts Payable and Liabilities (BSLT00055), and Net Worth (BSLT00040). The
data is from the 2017-2022, with total of 8395 records (see Table 1), respectively, where the year mostly
refers to the period of this FS year from January 1st to December 31st.</p>
      <p>Each record includes the credit rating calculated by RiskPlanner from company financials, along
with register number and statement year. The rating value ranges from 1 to 5, with classes ranging
from A to E, where A represents the best rating and E the worst. The difference in values between
classes A/B, as well as D/E, is 0.5, while for the other classes, it is 1.</p>
    </sec>
    <sec id="sec-4">
      <title>Data Preprocessing</title>
      <p>In addition to publicly available financial features, the extra ratios were calculated and added to the
dataset: Altman Z score [32], Current Ratio and Net Profit Margin[33], Return on capital employed
(ROCE) [34], and Accounts Payable Turnovers in days [35]. In instances where the value required for
the original formula was unavailable, it was identified as non-existent or the other value, which in
financial logic may be similar, was used instead. Furthermore, a calculation was performed to figure
out the value of total assets (BSLT00039) by adding long-term and short-term assets.</p>
      <p>The Altman Z score is calculated by multiplying X coefficients by weights (1):
X 1=</p>
      <p>BSLT 00021−BSLT 00055</p>
      <p>BSLT 00039
; X 2=</p>
      <p>BSLT 00040
BSLT 00039
; X 4=</p>
      <p>BSLT 00040
B SLT 00055
; X 5=</p>
      <p>ISLT 00001</p>
      <p>;
BSLT 00039</p>
      <p>AltmanZ =1.2 ∙ X 1+1.4 ∙ X 2+0.6 ∙ X 4+1 ∙ X 5 , (1)
where the codes refer to the financial characteristics, the same applies to other formulas. The current
ratio is calculated by dividing short term assets and amounts payable and liabilities (2):
The net profit margin was obtained by dividing the profit/loss before taxes by the sales amount (3):
CurrentRatio=</p>
      <p>BSLT 00021</p>
      <p>,</p>
      <p>BSLT 00055
NetProfitMargin=</p>
      <p>ISLT 00017</p>
      <p>,</p>
      <p>ISLT 00001
ROCE=</p>
      <p>ISLT 00017
BSLT 00040+ BSLT 00055
,
(2)
(3)
(4)
The ROCE was computed by dividing the profit/loss before taxes by the net worth and amounts
payable and liabilities totals (4):
Accounts payable turnover in days was calculated by dividing payables and liabilities by sales and
multiplying the total by 365 (5):</p>
      <p>BSLT 00055 (5)
AccountsPayableTurnovers= ∙ 365 ,</p>
      <p>ISLT 00001</p>
      <p>In addition to computing extra ratios and scores, data cleaning procedures were executed. Records
containing null or infinity values were eliminated, removing 1146 records. Upon dataset analysis,
significant noise was detected across all features. To fix this, 1180 records were deleted using Z-Score
outlier detection, which involves subtracting the mean from the value, dividing the result by the
standard deviation, and filtering the value[36]. Additionally, 224 records were removed after expert
evaluation, leaving 5845 for further examination. The four different combinations of this final dataset
used in the experiments are explained in the section "Experimental Setup".</p>
      <p>Models</p>
      <p>In this section the used models were presented. It contains statistical: Linear Regression (LR),
Huber Regression (HR), and ML techniques: Artificial Neural Network (ANN), and Random Forest
(RF).</p>
    </sec>
    <sec id="sec-5">
      <title>Linear Regression</title>
      <p>Linear Regression (LR) is a statistical method used to model the relationship between a dependent
variable and one or more independent variables. The model assumes that the relationship between the
dependent and independent variables is linear [37]. Based on the linear relationship, the formula can
be constructed to perform the prediction task. It is a simple and useful use of linear regression [38].</p>
      <p>Due to the use of various data scenarios in this research, two different multiple linear regression
formulas were created (see more in the Experimental Setup section). The feature significance analysis
was done to determine which independent variables were best for each formulation [37].</p>
    </sec>
    <sec id="sec-6">
      <title>Huber Regression</title>
      <p>Huber Regression (HR) strikes a balance between squaring errors, like Linear Regression, and
computing absolute errors, like Mean Absolute Error Regression, to handle outliers effectively. The
primary goal of HR is to reduce the difference between the values predicted by the model and the
actual observed values. When the errors are small, meaning the predictions are close to the actual
values, HR behaves similarly to Linear Regression and squares these errors. On the other hand, when
the errors are large, indicating a significant difference between the predictions and actual values, HR
acts like Mean Absolute Error Regression and computes their absolute values. The shift from squaring
to absolute at which the model switches from squaring errors to taking their absolute values [39].</p>
    </sec>
    <sec id="sec-7">
      <title>Artificial Neural Network</title>
      <p>ANNs, inspired by the human brain, are made up of interconnected neurons that process inputs
and generate outputs. The network adjusts input weights using backpropagation and optimization
algorithms, allowing it to learn complex data patterns and improve performance over time [40].</p>
      <p>The main principle of ANN is that it learns by adjusting the connection between neurons. A
training set consists of input patterns and associated labels encoding the characteristics the network
should learn. The ANN adjusts connection strengths, learning to classify data accurately. Once
trained, networks can generalize the results by extending their learning to other datasets. The new
data must not significantly differ from the training set for this generalization to be made. In essence,
the network’s ability to classify new data accurately is dependent on the similarity between the
training and new data [41].</p>
      <p>In this research, a simple model architecture was selected, comprising 64, 32, and 1 (4 for
classification) neurons in the input, hidden, and output layers. Additionally, the ANN was trained
with varying: Batch Size, Epochs, Optimizers with different learning rates, and Activation functions.</p>
    </sec>
    <sec id="sec-8">
      <title>Random Forest</title>
    </sec>
    <sec id="sec-9">
      <title>Proposed method</title>
      <p>A Random Forest (RF) is a tree-based model that systematically splits an input dataset into two
subsets based on a specific rule, repeating this process until a certain condition is met. The end points
of these trees, known as leaf nodes or leaves, represent the final divisions made by the model. In the
context of a predicting credit ratings, RF uses the average prediction of all the trees to generate a
result. This method is particularly effective as it reduces variance and prevents overfitting[42].</p>
      <p>To optimize the model's hyperparameters, tuning will involve modifying N-Estimators, Max
Depth, Min Samples (Split), and Min Samples (Leaf).</p>
      <p>To validate a hypothesis that emerged during the review of related studies, we suggest a two-tiered
approach: initially, we employ a classification algorithm to predict the rating class. Subsequently,
depending on the predicted class, we selected a different structure of machine learning algorithm
which was trained based on data from that class and attempted to predict the rating. For both
classification and prediction tasks, an ANN model was chosen.</p>
    </sec>
    <sec id="sec-10">
      <title>Performance Metrics</title>
      <p>The performance of rating value prediction models will be evaluated on the following four main
characteristics:
1. Mean Squared Error (MSE):</p>
      <p>MSE=
1 n</p>
      <p>∑ ( yi− ^yi )2 ,
n i=1
(6)
2. Mean Absolute Error (MAE):
3. R-squared (R ²) score:</p>
      <p>MAE=
1 n</p>
      <p>∑ ∣ yi− ^yi∣ ,
n i=1</p>
      <p>n
∑ ( yi− ^yi )2
(7)
(8)
R ²=1− i=n1 ,
∑ ( yi− ´y )2
i=1</p>
      <p>Where in all formulas: yi is the actual value of the i-th value, ^yi is the predicted value of
the i-th value, n – the total number of records and ´y is the mean of the actual values in the test set.
4. Root Mean Square Error (RMSE).</p>
      <p>RMSE=√ MSE , (9)</p>
      <p>To obtain preliminary results for the evaluation of the proposed method, four main metrics have
been chosen for the classification problem: Accuracy, Precision, Recall, and F1-Score (F1S).</p>
    </sec>
    <sec id="sec-11">
      <title>Results</title>
      <p>In this section, the datasets and experiments with methods were presented. Also, described each
experiment scenario, hyper-parameter tuning, and the results obtained.</p>
    </sec>
    <sec id="sec-12">
      <title>Experimental Setup</title>
      <p>
        Datasets vary based on whether all financial features or just the initial ones are u
        <xref ref-type="bibr" rid="ref19 ref7">sed, and whether
the value for 2022</xref>
        is predicted or done randomly. The data is randomly
        <xref ref-type="bibr" rid="ref19 ref7">split to match the number of
2022</xref>
        records as the test data:
 FinDataRandom- only financial features from database (for Linear Regression - without
      </p>
      <p>
        BSLT00021 and ISLT00017) and random split (89% training, 11% te
        <xref ref-type="bibr" rid="ref19 ref7">st data)
 FinData2022</xref>
        - only financial features from database (for Linear Regression - without
      </p>
      <p>
        BSLT00021 and ISLT00017) and manual
        <xref ref-type="bibr" rid="ref19 ref7">split (2022</xref>
        years data as test)
 AllRandom - all financial features with calculated ratios/scores and random split (89%
training, 11% te
        <xref ref-type="bibr" rid="ref19 ref7">st data)
 All2022</xref>
        - all financial features with calculated ratio
        <xref ref-type="bibr" rid="ref19 ref7">s/scores and manual split(2022</xref>
        years
data as test)
All models were implemented using the SKLearn library, except for only one ANN - from Keras.
      </p>
      <p>The first model in the experimental part isLinear Regression (LR). The LR functions used in the
experiments were constructed in two ways: from the SKLearn library (SK-LR) and manually created
using coefficients received from the statsmodels module (StatsM-LR). Using the stasmodels with all
publicly available financial features and calculated ratios – with every feature there was high enough
to be included in the linear regression formula, although, whenever we used a dataset containing only
publicly available features, the BSLT00021 and ISLT00017 were removed due to high p value [37].</p>
      <p>The other statistical method in the experiments is Huber Regression (HR). The model includes a
parameter named epsilon. A smaller epsilon – less sensitive model to extreme data points. The
GridSearchCV method from the SKLearn library was used to find the optimal epsilon. In further
experiments, the Huber Regression model will be marked as HR-XX, while XX means the optimal
epsilon parameter for that dataset (testing range 1-100, step - 1).</p>
      <p>One of the machine learning models included in this research was Artificial Neural Network
(ANN). Using grid search, various parameters were systematically tweaked to optimize the model’s
performance. These parameters included batch sizes (b), the number of epochs (e), various optimizers
with differing learning rates, and a range of activation functions (see Table 2).</p>
      <sec id="sec-12-1">
        <title>Optimizer (lr) SGD (0.1, 0.05, 0.01, 0.001), Adam (0.1, 0.05, 0.01, 0.001)</title>
      </sec>
      <sec id="sec-12-2">
        <title>Activation (f)</title>
        <p>relu, tanh,
sigmoid
The other machine learning approach which was used in this research is Random Forest (RF)
algorithm. Similarly to an ANN, the four different hyper-parameters were experimented with. It
included n-estimators, max depth, mininum samples for the split and of the leaf (see Table 4).</p>
        <p>For the proposed method the ANN were choosed to perform both: classification and prediction
tasks. Due to small amount of data for E rating, the two classes were combined: D and E. The
experiments were conducted with the same parameters as shown in Table 2. For the classification
problem, the scenarios of the ANN’s parameters that achieved the best results for each dataset are
shown in Table 10 and for the rating prediction, the best result for each four rating classes (D and E
combined) and for each dataset is shown in Table 11. A total of 33,000 different experiments were
conducted for the classification and prediction task to achieve preliminary results of the proposed
method.</p>
      </sec>
    </sec>
    <sec id="sec-13">
      <title>Experimental Results</title>
      <p>In this section the results of each dataset and each method with different scenarios are presented.</p>
      <sec id="sec-13-1">
        <title>Min Samples</title>
        <p>(Leaf)
1
1
1
1</p>
      </sec>
    </sec>
    <sec id="sec-14">
      <title>Testing Results for each Dataset</title>
      <p>
        In the table below, we can
        <xref ref-type="bibr" rid="ref19 ref7">see the dataset called All2022</xref>
        results. The machine learning models
outperformed the statistical ones. These two models did not differ much from each other and the same
may be said about ANN and RF models.
      </p>
      <p>The table below shows the outcomes of the FinDataRandom dataset. The first time when RF
outperformed the ANN, admittedly, very little.</p>
      <p>The results of the dataset known as AllRandom is displayed in the table below. It performed slighty
worse than preivous dataset.</p>
      <p>
        The table below pre
        <xref ref-type="bibr" rid="ref19 ref7">sents results from the FinData2022</xref>
        dataset. It is interesting that the results of
all statistical methods significantly decreased compared with the other two datasets where data
included all the financial features (initial and calculated ones). In this dataset, the difference between
the results of ANN and RF models was highest.
      </p>
      <p>The best results were achieved with the ANN1 scenario, with a batch size of 100, number of epochs
of 120, optimizer Adam with a learning rate of 0.01, and an activation function being sigmoid. The
model obtained significant quite good results: MSE being 0.0630, MAE - 0.1858, R² - 0.9065, and RMSE
0.2510. The model achieved the best results using all data from features and calculated ratios while
splitting the dataset to train as previous years and test as the newest one.</p>
      <p>The results of the proposed model’s classification task are presented in Table 10. All the indicators
are around 0.8, which is quite a high score. It can be concluded that there is no significant difference
between the results of each dataset.</p>
      <p>The preliminary results of the prediction task using the proposed method for each dataset and
class, with parameter configuration, are shown below. These results appear promising and
demonstrate a consistent pattern: incorporating all financial features has a slight positive effect on
outcomes.</p>
    </sec>
    <sec id="sec-15">
      <title>Conclusions</title>
      <p>This research found a big improvement in the performance of statistical methods and a noticeable
increase in machine learning results when using all financial features rather than just the initial ones
for prediction (datasets: All vs FinData). In terms of classification, there was not much difference in
accuracy between the datasets.</p>
      <p>
        The most favorable outcomes for prediction ta
        <xref ref-type="bibr" rid="ref19 ref7">sk were achieved using the All2022</xref>
        dataset. This
suggests that utilizing all available features, even if derived from each other, and training on past data
while testing on the latest data is preferable when dealing with annual financial information.
      </p>
      <p>Machine learning algorithms outperformed statistical methods significantly. Linear Regression
slightly outperformed Huber Regression, and in most cases, Artificial Neural Network performed
better than the Random Forest model. The prediction task improved with rating class data training but
note the significant decrease in training data size due to class splitting, which may impact the results.</p>
      <p>Future work may include additional publicly available company data as well as other calculated
ratios. Furthermore, additional datasets from other countries would be highly advantageous. In
addition, it would be useful to experiment with the architecture of ANN itself. Preliminary results of
the proposed method are promising, but further experiments are needed in the proposed prediction
process.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shi</surname>
          </string-name>
          and
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          , “
          <article-title>An overview of bankruptcy prediction models for corporate firms: A systematic literature review,” Intangible Capital</article-title>
          , vol.
          <volume>15</volume>
          , no.
          <issue>2</issue>
          , pp.
          <fpage>114</fpage>
          -
          <lpage>127</lpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>F.</given-names>
            <surname>Ciampi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Giannozzi</surname>
          </string-name>
          , G. Marzi, and
          <string-name>
            <surname>E. I. Altman</surname>
          </string-name>
          , “
          <article-title>Rethinking SME default prediction: a systematic literature review and future perspectives</article-title>
          ,
          <source>” Scientometrics</source>
          , vol.
          <volume>126</volume>
          , no.
          <issue>3</issue>
          , pp.
          <fpage>2141</fpage>
          -
          <lpage>2188</lpage>
          ,
          <year>2021</year>
          , doi: 10.1007/s11192-020- 03856-0.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>H.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Cho</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Ryu</surname>
          </string-name>
          , “
          <source>Corporate Default Predictions Using Machine Learning: Literature Review,” Sustainability</source>
          , vol.
          <volume>12</volume>
          , no.
          <issue>16</issue>
          ,
          <year>2020</year>
          , doi: 10.3390/su12166325.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>P.</given-names>
            <surname>Golbayani</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Florescu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Chatterjee</surname>
          </string-name>
          , “
          <article-title>A comparative study of forecasting corporate credit ratings using neural networks, support vector machines, and decision trees</article-title>
          <source>,” The North American Journal of Economics and Finance</source>
          , vol.
          <volume>54</volume>
          , p.
          <fpage>101251</fpage>
          ,
          <year>2020</year>
          , doi: https://doi.org/10.1016/j.najef.
          <year>2020</year>
          .
          <volume>101251</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>P.</given-names>
            <surname>Ubarhande</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Chandani</surname>
          </string-name>
          , “
          <article-title>Elements of Credit Rating: A Hybrid Review</article-title>
          and Future Research Agenda,”
          <source>Cogent Business &amp; Management</source>
          , vol.
          <volume>8</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>1878977</fpage>
          ,
          <year>2021</year>
          , doi: 10.1080/23311975.
          <year>2021</year>
          .
          <volume>1878977</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>J.</given-names>
            <surname>Kaur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Vij</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Chauhan</surname>
          </string-name>
          , “
          <article-title>Signals influencing corporate credit ratings-a systematic literature review,” DECISION, vol</article-title>
          .
          <volume>50</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>91</fpage>
          -
          <lpage>114</lpage>
          ,
          <year>2023</year>
          , doi: 10.1007/s40622-023-00341-4.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. D</given-names>
            <surname>'Addona</surname>
          </string-name>
          , and G. Pau, “
          <article-title>Machine learning-driven credit risk: a systemic review</article-title>
          ,
          <source>” Neural Comput Appl</source>
          , vol.
          <volume>34</volume>
          , no.
          <issue>17</issue>
          , pp.
          <fpage>14327</fpage>
          -
          <lpage>14339</lpage>
          ,
          <year>2022</year>
          , doi: 10.1007/s00521-022-07472-2.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhattacharya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kr</surname>
          </string-name>
          . Biswas,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Mandal</surname>
          </string-name>
          , “
          <article-title>Credit risk evaluation: a comprehensive study</article-title>
          ,
          <source>” Multimed Tools Appl</source>
          , vol.
          <volume>82</volume>
          , no.
          <issue>12</issue>
          , pp.
          <fpage>18217</fpage>
          -
          <lpage>18267</lpage>
          ,
          <year>2023</year>
          , doi: 10.1007/s11042-022-13952-3.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Machado</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Karray</surname>
          </string-name>
          , “
          <article-title>Assessing credit risk of commercial customers using hybrid machine learning algorithms</article-title>
          ,
          <source>” Expert Syst Appl</source>
          , vol.
          <volume>200</volume>
          , p.
          <fpage>116889</fpage>
          ,
          <year>2022</year>
          , doi: https://doi.org/10.1016/j.eswa.
          <year>2022</year>
          .
          <volume>116889</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Moscatelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Parlapiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narizzano</surname>
          </string-name>
          , and G. Viggiano, “
          <article-title>Corporate default forecasting with machine learning</article-title>
          ,
          <source>” Expert Syst Appl</source>
          , vol.
          <volume>161</volume>
          , p.
          <fpage>113567</fpage>
          ,
          <year>2020</year>
          , doi: https://doi.org/10.1016/j.eswa.
          <year>2020</year>
          .
          <volume>113567</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Z.</given-names>
            <surname>Khemais</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nesrine</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Mohamed</surname>
          </string-name>
          , “
          <article-title>Credit scoring and default risk prediction: A comparative study between discriminant analysis &amp; logistic regression,”</article-title>
          <source>Int J Econ Finance</source>
          , vol.
          <volume>8</volume>
          , no.
          <issue>4</issue>
          , p.
          <fpage>39</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>L.</given-names>
            <surname>Munkhdalai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Munkhdalai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.-E.</given-names>
            <surname>Namsrai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Lee</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Ryu</surname>
          </string-name>
          , “
          <article-title>An empirical comparison of machinelearning methods on bank client credit assessments</article-title>
          ,
          <source>” Sustainability</source>
          , vol.
          <volume>11</volume>
          , no.
          <issue>3</issue>
          , p.
          <fpage>699</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>N.</given-names>
            <surname>Muñoz-Izquierdo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Segovia-Vargas</surname>
            , M.-M. Camacho-Miñano
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Pérez-Pérez</surname>
          </string-name>
          , “
          <article-title>Machine learning in corporate credit rating assessment using the expanded audit report</article-title>
          ,” Mach Learn, vol.
          <volume>111</volume>
          , no.
          <issue>11</issue>
          , pp.
          <fpage>4183</fpage>
          -
          <lpage>4215</lpage>
          ,
          <year>2022</year>
          , doi: 10.1007/s10994-022-06226-4.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Wallis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Gepp</surname>
          </string-name>
          , “
          <article-title>Credit rating forecasting using machine learning techniques,” in Managerial perspectives on intelligent big data analytics</article-title>
          ,
          <source>IGI Global</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>180</fpage>
          -
          <lpage>198</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Ben Jabeur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sadaaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sghaier</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Aloui</surname>
          </string-name>
          , “
          <article-title>Machine learning models and cost-sensitive decision trees for bond rating prediction</article-title>
          ,
          <source>” Journal of the Operational Research Society</source>
          , vol.
          <volume>71</volume>
          , no.
          <issue>8</issue>
          , pp.
          <fpage>1161</fpage>
          -
          <lpage>1179</lpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>C.</given-names>
            <surname>Daniel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hančlová</surname>
          </string-name>
          , and H. el Woujoud Bousselmi, “
          <article-title>Corporate rating forecasting using Artificial Intelligence statistical techniques</article-title>
          ,
          <source>” Investment Management &amp; Financial Innovations</source>
          , vol.
          <volume>16</volume>
          , no.
          <issue>2</issue>
          , p.
          <fpage>295</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Teles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A. L.</given-names>
            <surname>Rabê</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Kozlov</surname>
          </string-name>
          , “
          <article-title>Artificial neural network and Bayesian network models for credit risk prediction</article-title>
          ,
          <source>” Journal of Artificial Intelligence and Systems</source>
          , vol.
          <volume>2</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>118</fpage>
          -
          <lpage>132</lpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Provenzano</surname>
          </string-name>
          et al., “
          <article-title>Machine learning approach for credit scoring</article-title>
          ,” arXiv preprint arXiv:
          <year>2008</year>
          .01687,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Pol</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Ambekar</surname>
          </string-name>
          , “
          <article-title>Predicting Credit Ratings using Deep Learning Models-An Analysis of the Indian IT Industry</article-title>
          ,” Australasian Accounting,
          <source>Business and Finance Journal</source>
          , vol.
          <volume>16</volume>
          , no.
          <issue>5</issue>
          , pp.
          <fpage>38</fpage>
          -
          <lpage>51</lpage>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>