<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Using a business simulator with elements of machine learning to develop personal finance management skills</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dmytro S. Antoniuk</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tetiana A. Vakaliuk</string-name>
          <email>tetianavakaliuk@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vladyskav V. Didkivskyi</string-name>
          <email>v.didkivskyi@sana-commerce.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleksandr Yu. Vizghalov</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oksana V. Oliinyk</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valentyn M. Yanchuk</string-name>
          <email>v.yanchuk@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Digitalisation of Education of the National Academy of Educational Sciences of Ukraine</institution>
          ,
          <addr-line>9 M. Berlynskoho Str., Kyiv, 04060</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Kryvyi Rih State Pedagogical University</institution>
          ,
          <addr-line>54 Gagarin Ave., Kryvyi Rih, 50086</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Zhytomyr Polytechnic State University</institution>
          ,
          <addr-line>103 Chudnivsyka Str., Zhytomyr, 10005</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>59</fpage>
      <lpage>70</lpage>
      <abstract>
        <p>Any person, not even a financier, should be knowledgeable in the field of personal finance management. As a result of the survey, it was concluded that schoolchildren and students should be interested in the study of personal finance, even within other disciplines in the form of separate sections. Successful personal experience and knowledge of the teacher in the field of personal finance is an important component of the efectiveness of the formation of competence of pupils, students, and adults in the ifeld of personal finance. Business simulators come in handy here. That is why the authors developed a business simulator for the development of personal finance management skills, which was developed in the form of a web service. The main features of this simulator, which are presented in diferent sections of the simulator, are considered. It is shown what skills of personal finance management are developed using this business simulator. Since this simulator was designed to develop personal finance management skills, an attempt was made to apply machine-learning elements to make this business simulator work even better. The proposed simulator can be used in the future to teach the elements of personal finance management to people who are not suficiently knowledgeable in the field of such finance. Moreover, the web application can be useful even for school-age children, so the simulator can complement the educational process within the economic courses not only in higher education institutions but also in secondary education institutions of Ukraine.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;business simulator</kwd>
        <kwd>machine learning</kwd>
        <kwd>personal finance</kwd>
        <kwd>personal finance management skills</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Almost every day, each person spends money on housekeeping, utility bills, payment for services
and purchases. It also includes debts on loans, or just borrowed money from friends. Personal
ifnance includes all the funds available to us and the ability to manage them, so the topic of
personal finance is relevant and worthy of attention for everyone who cares about their financial
well-being. That is why any person, not even a financier, should be knowledgeable in the field
of personal finance management. As a result, a survey was conducted to find out certain aspects
of the financial literacy of first-year students as individuals who have their financial resources.
In 2019, 167 respondents took part in the survey, in 2020 – 214 respondents (first-year students
of the Faculty of Information and Computer Technologies of the Zhytomyr Polytechnic State
University).</p>
      <p>As a result, it was found that in 2019, 70.7% of respondents consider themselves financially
literate, ready to consciously and efectively manage their own and family financial resources;
in 2020 – 74.3%. At the same time, 29.3% of respondents in 2019 do not consider themselves
suficiently aware in this regard; in 2020 – 25.7% (see figure 1, a).</p>
      <p>It was also found that in 2019 45.5% of respondents studied the basics of financial literacy in
at least one form, in 2020 – this figure dropped to 19.6%, in 2019 54.5% – did not study in any
form, in 2020 – this figure is 80.4% (see figure 1, b).</p>
      <p>In addition, by the results of the survey, it was found that in 2019 14.4% of respondents at the
highest level assess their level of readiness for conscious and efective management of own and
family financial resources, in 2020 – 10.7%, 38.3% of respondents in 2019 – indicated the level of
readiness “good” and “satisfactory”, while in 2020 46.7% indicated the level of readiness “good”,
35% - “satisfactory”. All others either rated the level at the lowest level or an unsatisfactory
level.</p>
      <p>
        As a result of the survey, it was concluded that schoolchildren and students should be
interested in the study of personal finance, even within other disciplines in the form of separate
sections. Successful personal experience and knowledge of the teacher in the field of personal
ifnance is an important component of the efectiveness of the formation of competence of pupils,
students, and adults in the field of personal finance. Business simulators come in handy here.
1.1. Theoretical background
The need for personal finance competency development was analyzed in the research
internationally. The challenge is relevant for both developed and developing countries. Influence of
the financial literacy on the well-being of people worldwide and the need to recognize it as the
fundamental right and universal need was discussed by Lusardi [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Another research by Lusardi
et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] shows that in the USA 30–40% of retirement wealth inequality is associated with the
level of financial literacy alone. As for the initial personal finance management, knowledge, and
skills formation, the study concludes significant influence of the high school financial literacy
courses on the lower default rates and better credit scores [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Using business simulations in education is a shared practice now. Simulations are being used
in practical training, such as flight or combat simulations as well as in economic, managerial,
and financial areas. Researchers study the pedagogical significance of this type of
technologyenhanced educational method. The group of researchers from the consortium of Spanish
universities concluded the positive impact of business simulations on the generic competencies
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Farashahi and Tajeddin [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] conclude the efectiveness of business simulations in their
comparative study. The study of business simulations in economic or finance evolves with
the evolution of technologies available to develop immersive, attractive, and highly functional
experiences. The range of use-cases difers from elementary schools to life-long learning
establishments. Korgin [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] provides evidence of the efectiveness of the simulation games for
kids with basic arithmetical operations literacy. Palan [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] studied the criterion and approaches to
select business simulation software for asset market experiments to use in the higher education
establishments.
      </p>
      <p>The analysis provides evidence of the efectiveness of business simulation usage in education
in general and the need to further development of business simulation for the economy and
ifnance literacy development, assessment, and study of its behavioral aspects.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Results</title>
      <p>
        The analysis of the analogues revealed that most part of the simulation software is appropriate
for a specific area of the finance in the countries with developed financial markets. Based on
the conclusions above this work represents the experiment in developing more generalized
simulator for the countries with less developed market of financial instruments [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        That is why the authors developed a business simulator for the development of personal
ifnance management skills, which was developed in the form of a web service [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Consider the
main features of this simulator, which are presented in diferent sections of the simulator (see
ifgure 2):
1. Current account management options.
2. Savings management options.
3. Deposit management options.
4. Credit management options.
5. Non-financial investment management options.
6. Information on changes in current accounts that have occurred in the last week.
7. Analytical information on the dynamics of changes in current account funds, savings,
and investments.
8. Information on current exchange rates.
9. List of recent transactions that have been made on current accounts.
      </p>
      <p>Consider in more detail the section with current account management options (see figure 2, A).
The “Current account” section contains a list of current accounts in the form: code and available
amount.</p>
      <p>The user has the opportunity to open a new account. To do this, click on the “Open account”
button, then a window will appear where you need to select the currency in which you want to
open a new account (see figure 3). Each user action in the simulator (for example, opening a
new account) is accompanied by a message that appears in the lower right corner of the page
(see figure 4).</p>
      <p>After opening a new account, the “Exchange” option will be available in the “Current account”
section, which allows you to exchange currencies at the rate that can be viewed in another
section “Exchange rates” (see figure 2, B).</p>
      <p>To exchange currencies, the user can enter the amount he wants to sell and the amount in
the currency to be purchased will be filled in automatically. The user can also enter the amount
to purchase and the amount required for the purchase will be filled in automatically. The user
can also change currencies and the amounts will be automatically transferred.</p>
      <p>In the process of performing all actions by the user, a real-time section is available with
information about changes in current accounts that have occurred in the last week (see figure 2, D).</p>
      <p>The user can choose the currency of the account for which he wants to see the relevant data
(see figure 5).</p>
      <p>In addition, in the simulator, there is a section with analytical data on the dynamics of changes
in current account funds, savings, and investments since the beginning of the simulation (see
ifgure 6). The user can select the currency of the account for which he wants to see the
information. By default, to visualize the dynamics of funds in all accounts, the option “Accounts
total amount in USD” is selected, which displays the total amount of funds in all accounts
converted into USD.</p>
      <p>The user also can view transaction information on all current accounts (see figure 2, C). The
corresponding section contains a list of the last 5 transactions. In this case, the user can view
all transactions or filter them by specific weeks in the details window (see figure 7).</p>
      <p>At the beginning of the 4th week of the simulation, a window with a test question appears, in
which the participant is asked what the amount is enough to save for savings from a monthly
salary per month. This question is part of the assessment, and the user is obliged to answer it.</p>
      <p>To sometimes distract the user from financial instruments and bring more of the reality into
the simulation process, random events were added to the system, including various diseases,
repairs, the birth of a child, and so on. Most random events last for some time and additional
costs for the entire period of validity.</p>
      <p>Some events are constant, so costs are made every week, such as the birth of a child. You
can’t cancel any random event, you can just accept it and close the window.</p>
      <p>Upon reaching the 60th week of the simulation, the user will be shown a page where the
participant can view analytical data on the dynamics of changes in funds on all current accounts,
savings, and investments since the beginning of the simulation (see figure 8). Funds on all
accounts, savings, and investments are converted into USD and shown in one chart. On this
page, the user can compare their results with the results of other participants in their country,
as well as from other countries, in addition, they can share their thoughts or ideas about the
simulator by filling in the field at the bottom of the page.</p>
      <p>The use of the Simulator facilitates forming of the following knowledge, skills, and
competencies as well as the personal attitude towards:
• Using the diversified list of currencies.
• Forming widely-acknowledged standard of emergency savings and assessing readiness
for longer-term investment.
• Using basic saving and investment instruments available in a wide range of countries
such as deposits (deposit certificates), real estate, and business investments. Experiencing
diferent types of instruments and understanding their specific.
• Debt-management and forming personal behavioral attitude to using debt as constructive
investment leverage.</p>
      <p>Since this simulator was designed to develop personal finance management skills, an attempt
was made to apply machine-learning elements to make this business simulator work even
better. This requires the creation of a system that could determine for the user an efective
strategy of action in the simulator because the amount of real data of simulation participants is
extremely small for analysis and identification of hidden dependencies using cluster analysis.
This task is suitable for those that are solved by the latest approach in machine learning, namely
reinforcement training.</p>
      <p>
        Reinforcement learning (RL) is a type of machine learning in which the agent directly examines
environmental data, receives rewards, and sets policies for optimal action. The goal of RL is to
ifnd the optimal policy that maximizes the expected amount of future rewards [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Reinforcement learning focuses primarily on how to obtain the optimal strategy [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Ideally,
the system should determine what will be financially optimal, as well as understand what is
inherent in man, given its characteristics. For example, if a person has never used investment
instruments, it is not worth building a strategy for him with too aggressive investment behavior,
because it is unlikely that a person will follow it.
      </p>
      <p>Reinforcement learning includes an agent, a set of states , and a set of actions by states
. By acting  ∈  , the agent moves from state to state. Acting in a certain state provides
the agent with a reward. The agent’s goal is to maximize his overall (future) reward. He does
this by adding the maximum reward that can be achieved from future states to the reward for
reaching his current state, thus influencing the current action with a potential future reward.
This potential reward is the sum of the rewards of all subsequent steps, starting from the current
state.</p>
      <p>
        Formally, reinforcement learning is modeled as a Markov decision-making process (MDP).
MDP provides a mathematical basis for modeling decision-making in situations where the
results are partly random and partly controlled by the decision-maker [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>One of the algorithms of reinforcement training is Q-training. The purpose of Q-learning is
to learn a strategy that gives the agent information about which step is best to perform in a
particular state .</p>
      <p>The algorithm contains the following elements:
•  the reward received in the transition from state  to state +1;
•  – the pace of learning, which determines the extent to which new information will
override old information;
•  – depreciation ratio, which gives the efect of valuing rewards received earlier, higher
than those received later.</p>
      <p>
        Before training, Q is initialized with an arbitrary fixed value (selected by the programmer),
for example, 0. Then, at each point in time t, the agent selects an action , receives a reward ,
moves to a new state +1 (which may depend on both the previous state  and the selected
action), and table Q is specified. The core of the algorithm is the Bellman equation as a simple
update of values by iteration, using the weighted average old value and new information [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        As a result, the open-source library “SharpRL” was chosen [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. This library provides basic
functionality for developing a learning environment with reinforcement based on the Q-learning
algorithm. The program for defining personal financial strategies is a console application.
Running this program involves setting certain parameters, including:
      </p>
      <p>• number of simulation passes in training mode;
• the number of simulations in the mode of compliance with the model policy after training;
• duration of the simulation in weeks;
• monthly income and expenses of the user in a certain currency, which will be determined
as the main currency of this user.</p>
      <p>At the end of the training process, the screen will display the maximum reward that was
achieved during the program; data on the user in the last week, which include: funds in current
accounts, amounts invested in deposits and savings of the user; the best way that was found
during the operation of the system. Each step of the path is an action that is recommended for
the user to perform in a particular week. For more information, user data is displayed next to
each step after performing the specified operation.</p>
      <p>To have more opportunities to analyze the results, the program saves them in a CSV file,
which can be used to compare the data with data from other launches of the program.</p>
      <p>The generated file has only two columns: the first – indicates the week of the simulation, the
second – the reward received by the user this week.</p>
      <p>As a result, the operation of the system with diferent initial parameters was analyzed and it
was found that a small number of iterations is the cause of an untrained model, which results in
unstable results. Experiments were conducted and it was hypothesized that the optimal ratio
of initial parameters, namely: the number of simulation passes in the training mode and the
number of simulation passes in the mode of adherence to the model policy after training, is
close to 1000. It was found that best results. After attempts to increase these figures, the results
remained within the statistical error. For example, figure 10 shows the result of increasing the
number of training epochs to 2000, which, compared to the results of 1000 epochs, remained
almost unchanged. As a result of the study, it was concluded that with the current set of possible
actions in the system (for example, to open a deposit of 10% of profits, etc.), increasing the
number of training epochs does not improve the result.</p>
      <p>To test the trained model, a method of comparing the results of the system with the average
values of the results of the passage of real users in the developed simulator was chosen. Thus,
it was decided to take a certain period, for example, 54 weeks, and collect user data using the
available functionality of the simulator. Based on the collected user data and the results of the
trained model, several comparative graphs with diferent indicators of income and expenses
were constructed.</p>
      <p>It was found that for 54 weeks, the system shows consistently better results than the average
results of users who underwent the simulation. From the graph, you can see that the performance
of the model is growing from week to week, which means that the system adapts to the
environment.</p>
      <p>After increasing the parameters of profit and expenses, the diference in the graphs becomes
more significant in favor of the trained model. This is shown in figure 11, 12, 13.</p>
      <p>Jumps can be seen in the graph showing the results of the reinforcement learning model.
They can be seen more often on charts where higher incomes and a large diference between
income and expenses. This is the efect of specific rules that are described to train the model
so that the virtual agent system follows the best path. For example, if an agent has opened a
deposit for 3 months, after this period the deposit will be closed automatically and this closing
of the deposit will significantly afect the user’s reward for this week. However, the system
learns and “understands” where to invest or save. As a result, this leads to sharp increases in
performance, as seen in the graphs. In the graphs responsible for the results of real users, such
behavior is not observed, which indicates the ignorance of users with efective ways to manage
personal finances within the developed simulator.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Conclusions</title>
      <p>Currently, economic processes are a new topic for research into the possibilities of applying
the full potential of machine learning. Scientists from around the world are just making the
ifrst attempts to reproduce such processes programmatically to use artificial intelligence to find
solutions and answer various economic questions of humanity.</p>
      <p>The proposed developed software package consists of two parts: a personal finance
management simulator and a system for determining efective financial strategies, which uses
reinforcement learning opportunities.</p>
      <p>The proposed simulator can be used in the future to teach the elements of personal finance
management to people who are not suficiently knowledgeable in the field of such finance.
Moreover, the web application can be useful even for school-age children, so the simulator can
complement the educational process within the economic courses not only in higher education
institutions but also in secondary education institutions of Ukraine.</p>
      <p>The prospects for further research include testing the efectiveness of the proposed simulator
of personal finance as a learning tool.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lusardi</surname>
          </string-name>
          ,
          <article-title>Financial literacy and the need for financial education: evidence and implications</article-title>
          ,
          <source>Swiss Journal of Economics and Statistics</source>
          <volume>155</volume>
          (
          <year>2019</year>
          )
          <article-title>1</article-title>
          . doi:
          <volume>10</volume>
          .1186/ s41937-019-0027-5.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lusardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.-C.</given-names>
            <surname>Michaud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. S.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          ,
          <article-title>Optimal financial knowledge and wealth inequality</article-title>
          ,
          <source>Journal of Political Economy</source>
          <volume>125</volume>
          (
          <year>2017</year>
          )
          <fpage>431</fpage>
          -
          <lpage>477</lpage>
          . doi:
          <volume>10</volume>
          .1086/690950.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Urban</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schmeiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Collins</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Brown,</surname>
          </string-name>
          <article-title>The efects of high school personal ifnancial education policies on financial behavior</article-title>
          ,
          <source>Economics of Education Review</source>
          <volume>78</volume>
          (
          <year>2020</year>
          )
          <article-title>101786</article-title>
          . doi:j.econedurev.
          <year>2018</year>
          .
          <volume>03</volume>
          .006.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A. B.</given-names>
            <surname>Hernández-Lara</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.</surname>
          </string-name>
          <article-title>Serradell-López, À</article-title>
          . Fitó-Bertran,
          <article-title>Students' perception of the impact of competences on learning: An analysis with business simulations</article-title>
          ,
          <source>Computers in Human Behavior</source>
          <volume>101</volume>
          (
          <year>2019</year>
          )
          <fpage>311</fpage>
          -
          <lpage>319</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.chb.
          <year>2019</year>
          .
          <volume>07</volume>
          .023.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Farashahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tajeddin</surname>
          </string-name>
          ,
          <article-title>Efectiveness of teaching methods in business education: A comparison study on the learning outcomes of lectures, case studies and simulations</article-title>
          ,
          <source>The International Journal of Management Education</source>
          <volume>16</volume>
          (
          <year>2018</year>
          )
          <fpage>131</fpage>
          -
          <lpage>142</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.ijme.
          <year>2018</year>
          .
          <volume>01</volume>
          .003.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Korgin</surname>
          </string-name>
          ,
          <article-title>Introduction to theory of control in organizations for kids via interactive games</article-title>
          ,
          <source>IFAC-PapersOnLine</source>
          <volume>48</volume>
          (
          <year>2015</year>
          )
          <fpage>289</fpage>
          -
          <lpage>294</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.ifacol.
          <year>2015</year>
          .
          <volume>11</volume>
          .250.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Palan</surname>
          </string-name>
          ,
          <article-title>GIMS-Software for asset market experiments</article-title>
          ,
          <source>Journal of Behavioral and Experimental Finance</source>
          <volume>5</volume>
          (
          <year>2015</year>
          )
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jbef.
          <year>2015</year>
          .
          <volume>02</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Antoniuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Vakaliuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. V.</given-names>
            <surname>Didkivskyi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. Y.</given-names>
            <surname>Vizghalov</surname>
          </string-name>
          ,
          <article-title>Development of a simulator to determine personal financial strategies using machine learning</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          <volume>3077</volume>
          (
          <year>2022</year>
          )
          <fpage>12</fpage>
          -
          <lpage>26</lpage>
          . URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3077</volume>
          /paper02.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Personal</surname>
            <given-names>finance</given-names>
          </string-name>
          ,
          <year>2020</year>
          . URL: https://financesimulator.azurewebsites.net/.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Sutton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Barto</surname>
          </string-name>
          ,
          <article-title>Reinforcement learning: an introduction</article-title>
          , 2nd ed., The MIT Press, Cambridge,
          <year>2015</year>
          . URL: https://web.stanford.edu/class/psych209/Readings/ SuttonBartoIPRLBook2ndEd.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>M. L. Puterman</surname>
          </string-name>
          ,
          <article-title>Markov decision processes: Discrete stochastic dynamic programming</article-title>
          , John Wiley &amp; Sons, Inc.,
          <year>2008</year>
          . doi:
          <volume>10</volume>
          .1002/9780470316887.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>C. J. C. H. Watkins</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Dayan</surname>
          </string-name>
          , Technical Note: Q-Learning,
          <source>Machine Learning</source>
          <volume>8</volume>
          (
          <year>1992</year>
          )
          <fpage>279</fpage>
          -
          <lpage>292</lpage>
          . doi:
          <volume>10</volume>
          .1023/A:
          <fpage>1022676722315</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Sharprl</surname>
          </string-name>
          ,
          <year>2019</year>
          . URL: https://github.com/svenslaggare/SharpRL.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>