<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Machine Learning Techniques</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nandini Tripurana</string-name>
          <email>nandinitripurana@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Binodini Kar</string-name>
          <email>binodinikar28@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sujata Chakravarty</string-name>
          <email>chakravartys69@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bijay K. Paikaray</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Suneeta Satpathy</string-name>
          <email>suneeta.s@srisriuniversity.edu.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dept. of CSE, Centurion University of Technology and Management</institution>
          ,
          <addr-line>Odisha</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Faculty. of Management Studies, Sri Sri University</institution>
          ,
          <addr-line>Odisha</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Information &amp; Communication Technology, Medhavi Skills University</institution>
          ,
          <addr-line>Sikkim</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <fpage>274</fpage>
      <lpage>281</lpage>
      <abstract>
        <p>Significantly, gold is among the precious metals that are utilized to finance trading purchases. In countries like India, China, Indonesia, and many more, gold is considered the ideal jewelry, in addition, gold is also served as a present/remembrance and even gold accessories are presented as marriage settlements. Moreover, the countries with large gold reserves are considered a booming nations. At present time, precious metal like gold, is not just considered ornaments or jewelry but are considered as an investment and are kept with all nations' central bank as an assurance for repayment of non-native loans, and also to manage inflation. Due to the increasing demand and dispense of this asset in the market the state of the major economies throughout the globe has a considerable impact on gold prices. Due to the change in gold prices, more investors are now considering gold investments. But irregularity in the gold price in the market makes it riskier for the investor. Thus, the goal of “Gold price prediction” is to forecast gold's price using a variety of Machine learning techniques, considering the relationship between several economic factors that influence gold rates. The machine learning algorithms that have been used are Random Forest regression, Decision tree (DT), Support vector regression (SVR), linear regression, and Artificial neural network (ANN). Artificial neural network (ANN), Random Forest (RF), Decision tree (DT) ACI'22: Workshop on Advances in Computation Intelligence, its Concepts &amp; Applications at ISIC 2022, May 17-19, Savannah, United States</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        For generations, gold has been a significant precious metal. It is one of the significant financial assets
for nations and an important part of the world’s monetary reserves for trade and currency hedging [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The
inertness, undersupply, and difficulty in extraction of this metal gave itself a great value in the market. But
shortage and stability are not the only reason for the increasing demand and supply of this valuable metal.
Another quality that is making it more precious is the attention of investors towards it. It is one of the major
assets being considered as an investment approach by many investors. Moreover, gold is reviewed to be
“the final instance’s assets” i.e., is the asset investors rely on [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>2020 Copyright for this paper by its authors.</p>
      <p>Investors rely on gold prices to check oscillation in other markets. But the unstable rise and fall in gold
rates makes it riskier and develops a fear for the investors to invest in the gold market. Therefore, it is
important to predict gold rates carefully. Thus, this paper aims on examining the interconnection between
certain economic market variables and the price of gold, they are the S&amp;P500 index, silver price, crude oil
price, and US dollar exchange value (USD). Additionally, this research employs five Machine learning
techniques, such as Support vector regression (SVR). Decision Tree (DT), Random Forest, linear
regression, and Artificial neural network (ANN), and compares them to get the best fit model.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature survey</title>
      <p>
        The journal [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] “Gold price prediction” studied about machine learning system that can predict gold
prices based on several other economic variables. By using this they compared the correlation between
economic variables and gold price. To predict gold prices, they used several machine learning algorithms.
They used linear regression, SVM, and Decision Tree to train the model for predicting gold rates.
      </p>
      <p>
        According to the second journal [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] “Future gold prices can be predicted using machine learning
techniques” one of the most significant metals in the world is gold. Many countries maintain their gold
reserves to be recognized as healthy and progressive countries, so, based on this they predicted the gold
prices, so investors can invest in this commodity by analyzing the proposed model and can get huge
benefits. In her article [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] "Modeling and Forecasting of Gold Prices on Financial Markets," V.K.F.B.
Rebecca Davis makes use of the Autoregressive Moving Average (ARMA) model, a statistical tool that is
often used to analyze time series data. The monthly prices of gold during a ten-year period are the data
under consideration. The accuracy of the model was 66.67%. In their study titled [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] "Predicting Future
Gold Rates using Machine Learning Approach," Iftikhar ul Sami and Khurum Nazir Junejo employ
Artificial Neural Networks (ANN) to forecast gold prices. Over the course of eleven years, the data for this
study were gathered from diverse sources. This information includes factors like the price of crude oil, the
S&amp;P 500 index, USD exchange rates, and other economic factors. In their article [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] "Prediction of the gold
price with ARIMA and SVM," D Makala and Z Li use information gathered from the World Gold Council
that includes daily gold prices from January 1979 to December 2019. To predict the price of gold, this study
uses the Autoregressive Integrated Moving Average (ARIMA) approach with SVM. The ARIMA model's
accuracy is lower than the SVM model's accuracy.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Data</title>
      <p>The dataset, which includes ten years’ worth of data, is taken from the Kaggle website, dated from
January 2008 to May 2018. It has six characteristics, such as gold rates, given as date, silver price, stock
profit exchange, gold price, US dollar rate, and United States oil ETF. This dataset consists of 2290 records.</p>
    </sec>
    <sec id="sec-4">
      <title>3.1 Data pre-processing</title>
      <p>The dataset used should be cleaned at the very first step, in order to remove the noise, the null values,
void values, etc.… to avoid the errors at the end. In order to do this firstly, the data should be normalized,
next Feature extraction is done, where the attributes are identified as dependent and independent, and the
unnecessary attributes should be removed, if any. In this paper ‘Date’ variable is dropped, as it is not
important for the calculations.</p>
    </sec>
    <sec id="sec-5">
      <title>3.2 Correlation analysis</title>
      <p>Correlation is used to check the relationship among all the different variables that we have. For this paper,
the correlation analysis has been shown in figure 1.</p>
    </sec>
    <sec id="sec-6">
      <title>4. Methodology</title>
      <p>Machine learning algorithms used: For developing this model, we have used the algorithms listed as,
Random Forest, DT, SVR, linear regression, and ANN.</p>
    </sec>
    <sec id="sec-7">
      <title>4.1 Random Forest Regressor</title>
      <p>
        Random forest regression is based on the concept of ensemble learning, let’s go into more detail about
ensemble learning. This technique uses the outputs of several machine learning algorithms or similar
algorithms repeated many times. Ensemble learning techniques can be utilized for that reason because an
exclusive lone model might not produce results that are as exact [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Similarly, it will use several decision
trees trained together to get a final combined prediction. This supervised learning technique known as
Random Forest uses ensemble learning to carry out both the regression and classification tasks. It creates
several decision trees during the training phase and integrates the outputs from each to determine the final
output, which might take the form of classification or regression [7].
      </p>
    </sec>
    <sec id="sec-8">
      <title>4.2 Decision Tree</title>
      <p>The decision tree organized a series of rules in the tree structure. A set of rules were arranged in a tree
structure via a decision tree. It is one of the most useful techniques for supervised non-parametric learning.
The decision tree has a few key terminologies. The Root node, which is at the top of the tree, depicts the
overview of the entire community as it splits into sub-nodes. Decision nodes are referred to when sub-nodes
can be split into further nodes. The node of a tree with no additional split nodes is called a leaf node. Pruning
is the act of removing a decision node’s sub-node [8].</p>
    </sec>
    <sec id="sec-9">
      <title>4.3 Support Vector Regressor</title>
      <p>Support vector regression is a part of a support vector machine, and this concept is applied for both
regression as well as classification of data. Support vector machine is one of the fast and non-linear
supervised machine learning models, given a set of labeled training data. SVM will help to find the optimum
hyperplane which categorized new examples in one-dimensional space. The hyperplane is a point
in two-dimensional space.</p>
      <p>The hyperplane is a land in three-dimensional space, it is a surface that divides into two parts,
where each class lies on either side. In two-dimensional linear separable cases, the data are
separated by a line. In this case, the data is linearly separable [9].</p>
    </sec>
    <sec id="sec-10">
      <title>4.4 Linear Regression</title>
      <p>Linear regression is the linear estimation of a normal connection between two or more variables.
Regression models are extremely important since they are among the most popular methods for reasoning
and prediction and the activity goes like this [10].
 =  (  ,   ⋯  )</p>
      <p>………… (1)
Independent variables labeled as X1, and X2, are the predictors. Y is the function of the X variable. The
regression model is a linear estimation of this function. T.</p>
    </sec>
    <sec id="sec-11">
      <title>4.5 Artificial Neural Network</title>
      <p>A neural network means, the connection of neurons. It constructs the base of deep learning. The working
of Human neural networks and Artificial neural networks are almost the same, although there is a difference
in their mechanism.</p>
      <p>An artificial neural network works in three layers, the Input layer, and the output layer, and in between
there exists the Hidden layer. The input layer receives the data, the output layer provides the final result,
and the hidden layer is in charge of handling all of the weight distribution, calculating bias, and calculations
[11][12]. The formula is mentioned below:
 ⅈ= 
∗  ⅈ + 
  =ⅈⅈ</p>
    </sec>
    <sec id="sec-12">
      <title>4.6 Work-flow diagram</title>
      <p>( ∗  +  ∗  +⋯  ∗ )
…………… (2)
…………… (3)
Presentation metrics of model: The four standard assessment metrics, the Root mean squared
error (RMSE), Mean squared error (MSE), coefficient of determination (R-squared error), and
mean absolute percentage error (MAPE) are used to determine the anticipated outputs of all five
machine learning models.
-RMSE can be calculated by the given formula:

ⅈ=
⋅
(̂ⅈ− ⅈ)</p>
      <p>RMSE = √∑</p>
      <p>………. (4)
-MSE can be calculated by the given formula:</p>
      <p>MSE= 1 ∑ =1</p>
      <p>(  −  ̂ )2
-MAPE can be calculated by the given formula
|  −  |</p>
    </sec>
    <sec id="sec-13">
      <title>5. Results and discussions</title>
      <p>A comparison of the model’s performances based on the evaluation metrics has been listed below: After
applying all the five machine learning algorithms, the least RMSE is calculated as 0.377 for ANN, and the
least MAPE is calculated as 0.151, it is also for ANN.
The best fit model on the basis of errors calculated, is an artificial neural network, with the least RMSE,
MAPE, and R2 error.</p>
    </sec>
    <sec id="sec-14">
      <title>6. Conclusion</title>
      <p>From this paper, it can be concluded that Machine learning algorithms can be used to forecast gold prices
accurately. And this model can be very beneficial not only the government agencies and investors but to
common people also. Moreover, for future work related to this, we can analyze and include some more
variables such as diamond price, some other precious metal prices, or some other stock exchange indices
that strongly influence the gold rates, to make our predicted values more accurate.</p>
    </sec>
    <sec id="sec-15">
      <title>7. References</title>
      <p>[7] Manjula K A, Karthikeyan P,” Gold Price Prediction using Ensemble based Machine Learning
Techniques”, IEEE 2019 3rd International Conference on Trends in Electronics and Informatics
(ICOEI), Issue-23-25 (April 2014).
[8] Z. Ismail, A. Yahya, A. Shabri,” Forecasting Gold Prices Using Multiple Linear regression Method”,</p>
      <p>American Journal of Applied Sciences, Issue-2009.
[9] L.A Sjaastad, F. Scacciavillani,” The price of gold and exchange rate”, Journal of International Money
and Finance, Issue-December, 1996
[10] loannis E. Livieris, EmmanuelPintelas, Panagiotis Pintelas,” A CNN-LSTM model for gold price
time-series forecasting”, Neural computing and Applications, Issue- 21(November 2019).
[11] Dr. Abhay Kumar Agarwal, Swati Kumari, "Gold Price Prediction using Machine Learning",</p>
      <p>International Journal of Trend in Scientific Research and Development (ijtsrd), 2020.
[12] Sami Ben Jabeur, Salma Mefteh-Wali, Jean-Laurent Viviani,” Forecasting gold price with the XGBoost
algorithm and SHAP interaction values, Annals of Operations Research, Issue-24(June 2021).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Sravani</surname>
          </string-name>
          , Ch. Abhilash, T. Divya, Ch. Vasthav,
          <string-name>
            <given-names>D.</given-names>
            <surname>Priyanka</surname>
          </string-name>
          , “Gold price prediction”,
          <source>International journal of creative research thoughts (IJCRT)</source>
          , Issue-
          <volume>6</volume>
          (
          <year>June 2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Iftikhar</surname>
            <given-names>ul Sami</given-names>
          </string-name>
          , Khurum Nazir Junejo, “
          <article-title>Predicting Future Gold Rates using Machine Learning Approach”</article-title>
          ,
          <source>International Journal of Advanced Computer Science and Applications</source>
          , Issue-
          <volume>12</volume>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>V.</given-names>
            <surname>K. F. B. Rebecca Davis</surname>
          </string-name>
          ,
          <article-title>"Modeling and Forecasting of Gold Prices on Financia Markets,"</article-title>
          <source>American International Journal of Contemporary Research</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Iftikharul</given-names>
            <surname>Sami</surname>
          </string-name>
          and
          <article-title>Khurum Nazir Junejo, "Predicting Future Gold Rates using Machine Learning Approach"</article-title>
          ,
          <source>International Journal of Advanced Computer Science and Applications</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D</given-names>
            <surname>Makala</surname>
          </string-name>
          and
          <string-name>
            <given-names>Z</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>“Prediction of gold price with ARIMA and SVM”</article-title>
          ,
          <source>Journal of Physics: Conference Series</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chakravarty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. K.</given-names>
            <surname>Paikaray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mishra</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Dash</surname>
          </string-name>
          ,
          <article-title>"Hyperspectral Image Classification using Spectral Angle Mapper," 2021 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE</article-title>
          ),
          <year>2021</year>
          , pp.
          <fpage>87</fpage>
          -
          <lpage>90</lpage>
          , doi: 10.1109/WIECONECE54711.
          <year>2021</year>
          .
          <volume>9829585</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>