<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">VolTS Augmented: An Improvement of a Volatility-based Trading System to Forecast Stock Markets Trends</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ivan</forename><surname>Letteri</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">University of L&apos;Aquila</orgName>
								<address>
									<addrLine>Coppito (AQ)</addrLine>
									<postCode>67100</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">VolTS Augmented: An Improvement of a Volatility-based Trading System to Forecast Stock Markets Trends</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">50B8C9923E1AFBDD7399A592AB441C6E</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Dynamic Time Warping</term>
					<term>K-Nearest Neighbour</term>
					<term>Technical Analysis</term>
					<term>Stock Market Prediction</term>
					<term>Algorithmic Trading</term>
					<term>Backtesting</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this study, we proposed an improvement to a previous strategy of the work called VolTS. It is a module of the AITA framework that integrates statistical analysis with machine learning techniques for forecasting stock market trends. The main goal is to verify the cointegration in volatility-based trading strategies within financial markets, and then to devise a methodology that leverages market dynamics to yield profits. This methodology improvement incorporates the Dynamic Time Warping to assess the similarity of trend sequences, even when they exhibit temporal misalignment coupling it. With the K-Nearest Neighbors algorithm, which identifies the most akin price patterns, we construct a sophisticated model spotlighting stocks among the nine largest ones listed on both the NYSE and Nasdaq exchanges that exhibit analogous price movements to our portfolio stocks. From volatility assets pointofview, it is applied the Granger Causality Test to the dataset containing the same mid-range volatility clusters of the chosen stocks, thus identifying them with robust predictive relationships. These "predictor" stocks were pivotal in shaping our trading strategy, serving as trend indicators to inform decisions on target stock trades. The empirical findings demonstrated the effectiveness of our method in identifying, small but not rare, profitable day-trading opportunities. This success was attributed to the predictive insights from volatility clusters, the Granger causality relationships, and Co-integration trends identified among the stocks. In conclusion, our research has significantly contributed to the realm of volatility-based trading strategies by introducing a methodology that mixes statistical techniques with machine learning.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>This article explores the emerging field of volatilitybased trading strategies, which is encountering promising growth in the financial sector. Artificial intelligence (AI) has emerged as a pivotal player, offering robust tools for analysing market volatility and leveraging it for profitable outcomes. AI models, trained to estimate mean volatility, offer valuable insights into the inherent uncertainty and risk linked to individual securities or the overall market <ref type="bibr" target="#b0">[1]</ref>.</p><p>Our research was primarily driven by the following key questions: 𝑅𝑄1: Can k-means clustering effectively determine the mean volatility of prominent stocks? 𝑅𝑄2: defined the stocks with the same mid-term volatility, can the Granger Causality Test be employed to identify predictive influences between stocks? 𝑅𝑄3: Established the subset of influ-enced stocks, can Dynamic Time Warping (DTW) <ref type="bibr" target="#b1">[2]</ref> be paired with K-Nearest Neighbours (KNN) to spotlight stocks that "fluctuate" to the same tune exploiting the delay among them?</p><p>This study aimed to formulate trading strategies based on predictive connections affecting influential stocks as trend indicators. The proposed time series study, related to the prices and volatilitydriven trading strategy was then rigorously evaluated through backtesting and performance analysis to validate its reliability. Through empirical evidence, this research has given an improvement to the previous volatility-based trading strategies <ref type="bibr" target="#b2">[3]</ref>.</p><p>The augmented AI trading strategy utilised kmeans clustering of average volatility data <ref type="bibr" target="#b3">[4]</ref>. This data encompassed nine major stock markets. Our initial objective was to identify distinct volatility patterns within the market and subsequently group assets accordingly. Following this, the Granger Causality Test (GCT) <ref type="bibr" target="#b4">[5]</ref> was leveraged to pinpoint stocks that significantly predicted others within our analysis. Then, we pairwise compare two time series to find the closest match between them via DTW. Then, the future trend is predicted as the average of the trends of the 𝐾 neighbours. These predictive relationships were then utilised to establish buy, sell, or hold trading decisions.</p><p>Our previous research focused on technical trading strategies that emphasised technical indicators <ref type="bibr" target="#b0">[1]</ref>, <ref type="bibr" target="#b5">[6]</ref>. This current exploration delves, first into Historical Volatility estimators as a dataset for identifying medium volatility selecting stocks for the Granger Causality Test asset cointegration approach <ref type="bibr" target="#b6">[7]</ref>. Second, within the context of investment timing, we predict it via DTW paired with KNN.</p><p>This paper is organised as follows: Section 2 summarize fundamental concepts within our AITA framework <ref type="bibr" target="#b7">[8]</ref>, highlighting the Volatility Trading System (VolTS) module <ref type="bibr" target="#b2">[3]</ref>. Section 3 outlines the improvement with the VolTS Augmented (in short VolTS-Aug) module, which analyses securities' volatility averages and price trends establishing predictive relationships. It then delves into the implementation of the trading strategy and includes a thorough empirical analysis of its performance and robustness. Section 4 presents practical findings achieved through backtesting followed by a discussion. Finally, Section 5 concludes the study by summarising the effectiveness and applicability of the proposed method.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">BACKGROUND</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Price Action</head><p>The price action (PA) influences Historical Volatility (HV), and in turn, HV can provide insights into future PA. When the PA exhibits strong price movements, such as wide trading ranges, breakouts, or rapid directional changes, it tends to increase.</p><p>VolTS-Aug, as an improvement of the VolTS module within the AITA framework, adheres to these principles. Low HV signifies a period of consolidation or low price volatility, indicating a potential upcoming spike in volatility or a shift in the PA. On the other side, high HV suggests a higher probability of sharp market movements or trend changes. Also into VolTS-Aug, the PA is encoded as OHLC, i.e., the open, high, low, and close prices of the assets.</p><p>For each timeframe 𝑡, the OHLC of an asset is represented as a 4-dimensional vector 𝑋𝑡 = (𝑥</p><formula xml:id="formula_0">(𝑜) 𝑡 , 𝑥 (ℎ) 𝑡 , 𝑥 (𝑙) 𝑡 , 𝑥 (𝑐) 𝑡 ) 𝑇 , where 𝑥 (𝑙) 𝑡 &gt; 0, 𝑥 (𝑙) 𝑡 &lt; 𝑥 (ℎ) 𝑡 and 𝑥 (𝑜) 𝑡 , 𝑥 (𝑐) 𝑡 ∈ [𝑥 (𝑙) 𝑡 , 𝑥 (ℎ) 𝑡 ].</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Historical Volatility Time Series</head><p>The construction of the dataset is composed by time series from the following HV estimators:</p><p>The Parkinson (PK) estimator incorporates the stock's daily high and low prices as follow:</p><formula xml:id="formula_1">𝑃 𝐾 = ⎯ ⎸ ⎸ ⎷ 1 4𝑁 𝑙𝑛(2) 𝑁 ∑︁ 𝑖=1 (︂ 𝑙𝑛 𝑥 (ℎ) 𝑡 𝑥 (𝑙) 𝑡 )︂ 2 .</formula><p>It is derived from the assumption that the true volatility of the asset is proportional to the logarithm (𝑙𝑛) of the ratio of the high 𝑥 prices of 𝑁 observations. The Garman-Klass (GK) estimator assumes that price movements are lognormally distributed calculated as follows:</p><formula xml:id="formula_2">√︃ 1 𝑁 𝑁 ∑︀ 𝑖=1 1 2 (︂ 𝑙𝑛 𝑥 (ℎ) 𝑡 𝑥 (𝑙) 𝑡 )︂ 2 − 𝑁 ∑︀ 𝑖=1 (2𝑙𝑛(2) − 1) (︂ 𝑙𝑛 𝑥 (𝑐) 𝑡 𝑥 (𝑜) 𝑡 )︂ 2</formula><p>The Rogers-Satchell (RS) estimator uses the range of prices within a given time interval as a proxy for the volatility of the asset as follows:</p><formula xml:id="formula_3">𝑅𝑆 = √︃ 1 𝑁 𝑁 ∑︀ 𝑡=1 𝑙𝑛 (︂ 𝑥 (ℎ) 𝑡 𝑥 (𝑐) 𝑡 )︂ 𝑙𝑛 (︂ 𝑥 (ℎ) 𝑡 𝑥 (𝑜) 𝑡 )︂ + 𝑙𝑛 (︂ 𝑥 (𝑙) 𝑡 𝑥 (𝑐) 𝑡 )︂ 𝑙𝑛 (︂ 𝑥 (𝑙) 𝑡 𝑥 (𝑜) 𝑡 )︂</formula><p>RS assumes that the range of prices within the interval is a good proxy for the volatility of the asset, additionally, the estimator may be sensitive to outliers and extreme price movements.</p><p>The Yang-Zhang (YZ) estimator <ref type="bibr" target="#b8">[9]</ref> incorporates OHLC prices as follows:</p><formula xml:id="formula_4">𝑌 𝑍 = √︀ 𝜎 2 𝑂𝑣𝑒𝑟𝑛𝑖𝑔ℎ𝑡𝑉 𝑜𝑙 + 𝑘𝜎 2 𝑂𝑝𝑒𝑛𝑇 𝑜𝐶𝑙𝑜𝑠𝑒𝑉 𝑜𝑙 + (1 − 𝑘)𝜎 2 𝑅𝑆 , where 𝑘 = 0.34/1.34 + 𝑁 +1 𝑁 −1 , 𝜎 2 𝑂𝑝𝑒𝑛𝑇 𝑜𝐶𝑙𝑜𝑠𝑒𝑉 𝑜𝑙 = 1 𝑁 −1 𝑁 ∑︀ 𝑖=1 (︂ 𝑙𝑛 𝑥 (𝑐) 𝑡 𝑥 (𝑜) 𝑡 − 𝑙𝑛 𝑥 (𝑐) 𝑡 𝑥 (𝑜) 𝑡 )︂ 2 , and 𝜎 2 𝑂𝑣𝑒𝑟𝑛𝑖𝑔ℎ𝑡𝑉 𝑜𝑙 = 1 𝑁 −1 𝑁 ∑︀ 𝑖=1 (︂ 𝑙𝑛 𝑥 (𝑜) 𝑡 𝑥 (𝑐) 𝑡−1 − 𝑙𝑛 𝑥 (𝑜) 𝑡 𝑥 (𝑐) 𝑡−1 )︂ 2</formula><p>. YZ exhibits notable performance across a broad spectrum of scenarios, including those characterised by jumps and nonnormality in the data. However, this estimator has limitations, and its effectiveness may be constrained in certain contexts.</p><p>In this research, our attention is centred on midvolatility. This focus allows us to either close open positions or refrain from entering a position when the anticipated volatility coefficient is high, thereby mitigating the risk of losses. On the other hand, if the expected volatility is too low, it does not offer any potential for gains.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Trading Strategy</head><p>In this experiment, we used the Trend Following (TF) strategy. It is one way to engage in trend trading, where a trader initiates an order in the direction of the breakout after the price surpasses the resistance line as follows: let 𝑃𝑡 the price at time 𝑡, and let 𝑀 𝐴 denote the Moving Average of the asset price over a certain period. If 𝑃𝑡 ≥ 𝑀 𝐴𝑡 indicates an upward trend to take a long position otherwise it is a downward trend to take a short position.</p><p>Then, our strategy is compared only with the Buy and Hold (B&amp;H) strategy considering it as a benchmark.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Backtesting Metrics</head><p>AITA framework considers the following profit and risk metrics to evaluate the potential profitability of investments and manage risk exposure.</p><p>Drawdown (DD). It is a measure of the peak-totrough decline in the value of a trading account before a new peak is attained. DD is defined as follows: 𝐷𝐷 = 𝑃 −𝑇 𝑇 , where 𝑃 is the highest value or peak of the portfolio. 𝑇 is the lowest value or trough after the peak. Maximum Drawdown (MDD) is the most significant loss from peak to trough during a specific period calculated as follows:</p><formula xml:id="formula_5">𝑀 𝐷𝐷 = 𝑚𝑎𝑥𝑖 (︁ 𝑃 𝑖 −𝑚𝑖𝑛 𝑗:𝑖≤𝑗≤𝑁 𝑇 𝑗 𝑃 𝑖 )︁</formula><p>, where 𝑃𝑖 is the highest value or peak of the portfolio time 𝑖. 𝑇𝑗 is the lowest value or trough after the peak up to time 𝑗. 𝑁 is the total number of data points. Sortino ratio (SoR). It is a risk-adjusted profit measure, which refers to the return per unit of deviation as follows:</p><formula xml:id="formula_6">𝑆𝑜𝑅 = 𝑅𝑝−𝑅 𝑓 𝜎 𝑑</formula><p>, where 𝑅𝑝 is the expected portfolio return, 𝑅 𝑓 the risk-free rate of return, and 𝜎 𝑑 denotes the downside deviation of the portfolio returns.</p><p>Sharpe ratio (SR). It is a variant of the riskadjusted profit measure, which applies 𝜎𝑝 as a risk measure:</p><formula xml:id="formula_7">𝑆𝑅 = 𝑅𝑝−𝑅 𝑓 𝜎𝑝</formula><p>where 𝜎𝑝 is the standard deviation of the portfolio return.</p><p>Calmar ratio (CR) is another variant of the riskadjusted profit measure, which applies MDD as risk measure: 𝐶𝑅 = 𝑅𝑝−𝑅 𝑓 𝑀 𝐷𝐷 .</p><p>To check the goodness of trades, we mainly focused on the Total Returns 𝑇 𝑅 𝑘 (𝑡) for each stock (𝑘 = 1, ..., 𝑝) in the time interval (𝑡 = 1, ..., 𝑛) with the price 𝑃 , defined as follows:</p><formula xml:id="formula_8">𝑇 𝑅 𝑘 (𝑡) = 𝑃 𝑘 (𝑡+Δ𝑡)−𝑃 𝑘 (𝑡) 𝑃 𝑘 (𝑡)</formula><p>. Furthermore, we analyzed the standardized returns 𝑟 𝑘 = (𝑇 𝑅 𝑘 −𝜇 𝑘 )/𝜎 𝑘 , with (𝑘 = 1, ..., 𝑝), where 𝜎 𝑘 is the standard deviation of 𝑇 𝑅 𝑘 , e 𝜇 𝑘 denote the average overtime for the studied period.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">THE EXPERIMENT</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Asset Collections</head><p>AITA automatically downloads the OHLC prices via an internal Python library connected to an API, using the MetaTrader5 (MT5) <ref type="foot" target="#foot_0">1</ref> . The data collected for this study includes the OHLC prices of the stocks listed in Table <ref type="table">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1</head><p>List of the main 9 stocks selected for the experimentation. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Ticker</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Historical Volatility Time Series</head><p>The History Volatility Clustering process of our approach determines the stocks with intermediate volatility. First calculate the average of historical volatility time series among the aforementioned estimators (see sect. 2.2). Next, the resulting volatility series are clusterized using the KMeans++ algorithm. In particular, we split into three clusters (𝐾 = 3) high, middle, and low volatility.</p><p>Figure <ref type="figure" target="#fig_1">1</ref> shows the results displayed through a plot of the time series belonging to the middle cluster where we are focused on our strategy. It is worth noting that, the main region is in the time window from 1st January 2021 to 1st March 2024.</p><p>So, we use this interval as the dataset, and then from the intermediate cluster (confined between the two red dashed lines in fig <ref type="figure" target="#fig_1">1</ref>). The candidate assets selected are NVDA, META, AMZN, MU and QCOM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Causality Analysis</head><p>Co-integration refers to the long-term stable linear combination between two or more time series, although individual series may be non-stationary.</p><p>In the context of volatility-based trading, the VolTS-Aug module performs the GCT to examine this relationship between the lagged volatility of one asset and the future volatility of another asset by applying the following steps:</p><p>Step 1. Significant Granger causality: Let 𝑋 and 𝑌 be the pair stocks time series volatility to check, where 𝑋 represents the potential causal variable and 𝑌 represents the potential effect variable. The null hypothesis (H0) states that 𝑋 does not Granger cause 𝑌 , while the alternative hypothesis (H1) states that 𝑋 does Granger cause 𝑌 . The F-test is defined as follows:</p><formula xml:id="formula_9">𝐹 − 𝑡𝑒𝑠𝑡 = [(𝑅𝑆𝑆 𝑌 (𝑡) − 𝑅𝑆𝑆𝑌 𝑋 (𝑡) )/𝑝] [(𝑅𝑆𝑆𝑌 𝑋 (𝑡) )/(𝑛 − 𝑝 − 𝑘)] ,</formula><p>where Step 3. Delta Time Trends: VolTS-Aug performs the DTW paired with KNN to examine the interval time necessary for profitable trades: (i) The DTW distance between two time series is the sum of differences between their corresponding points, optimally aligned. (ii) The KNN classifier, aimed at finding the most similar neighbours for each observation based on DTW distance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">The Algorithm</head><p>Three are the main steps followed by VolTS-Aug: Regression step: For each pair of time series (𝑋𝑖, 𝑌𝑗), where 𝑖 ̸ = 𝑗, we construct a linear regression model: 𝑋𝑖 = 𝛽0,𝑖𝑗 + 𝛽1,𝑖𝑗𝑌𝑗 + 𝜖𝑖𝑗, where 𝛽0,𝑖𝑗 is the intercept, 𝛽1,𝑖𝑗 is the regression coefficient, and 𝜖𝑖𝑗 is the error term. We calculate the F-statistic to evaluate the overall adequacy of the model. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5.">Metrics observed</head><p>Profit and risk metrics are pivotal considerations in trading AITA framework evaluates the following, for the potential profitability of the investments and to manage the risk exposure.</p><p>(i) The Maximum drawdown (MDD) measures the largest decline from the peak in the whole trading period, to show the worst case, as follows: 𝑀 𝐷𝐷 = 𝑚𝑎𝑥 𝜏 ∈(0,𝑡) [𝑚𝑎𝑥 𝑡∈(0,𝜏 )</p><formula xml:id="formula_10">𝑛 𝑡 −𝑛𝜏 𝑛 𝑡</formula><p>]. (ii) The Sharpe ratio (SR) is a risk-adjusted profit measure, which refers to the return per unit of deviation as follows: 𝑆𝑅 = E [𝑟]  [𝑟] . (iii) The Sortino ratio (SoR) is a variant of the risk-adjusted profit measure, which applies DD as risk measure: 𝑆𝑜𝑅 = E [𝑟]  𝐷𝐷 . (iv) The Calmar ratio (CR) is another variant of the risk-adjusted profit measure, which applies MDD as risk measure: 𝐶𝑅 = E [𝑟]  𝑀 𝐷𝐷 . To check the goodness of trades, we mainly focused on the (v) Total Returns 𝑅 𝑘 (𝑡) for each stock (𝑘 = 1, ..., 𝑝) in the time interval (𝑡 = 1, ..., 𝑛), where 𝑇 𝑅 = 𝑅 𝑘 (𝑡) = 𝑍 𝑘 (𝑡+Δ𝑡)−𝑍 𝑘 (𝑡)</p><formula xml:id="formula_11">𝑍 𝑘 (𝑡)</formula><p>, and furthermore analysing the standardized returns 𝑟 𝑘 = (𝑅 𝑘 − 𝜇 𝑘 )/𝜎 𝑘 , with (𝑘 = 1, ..., 𝑝), where 𝜎 𝑘 is the standard deviation of 𝑅 𝑘 , e 𝜇 𝑘 denote the average overtime for the studied period. The VolTS algorithm iterates the daily lags in a range from 2 to 30 days to determine the best testing range time. The best result is achieved with lags=21, where 'best' is considered when there is direction coherency among the stocks with the lowest p-value, with the maximum cardinality of the set of stocks previously filtered, and not create cyclic graph in the connections. On other words, the GCT suggests buying AMZN, MU and NVDA when META has a positive trend and vice versa (see fig. <ref type="figure" target="#fig_3">2</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">RESULTS AND DISCUSSIONS</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">The Experiment</head><p>The experiment results indicate that the strategy resulted in a total gain of 241.57$ in 21 days of market opening, starting with an initial budget of 1000$ per stock.</p><p>Tab. 2 contains further details about the performance metrics of the strategy and shows how the total amount in the portfolio is increased to 3241.57$ (8.05%), which is a positive sign of profitable trading, also considering the fixed commission of 9$ per trade. Notice that, the managing of the budget is set in compounded mode, so the full amount is reused for each trade.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Backtesting</head><p>The analysis of individual stocks' performance is presented in figure <ref type="figure" target="#fig_4">3</ref> about META co-integration. The trades of AMZN bought following the META trend given a profit of 59.98$, with a winrate of 75%, a MDD of 0.035%, and a return of 5.98%, which outperforms the B&amp;H strategy with return of 0.998%. The trades of MU bought following the META trend given a profit of 146.68$, with a winrate of 100%, a MDD of 0%, and a return of 34.91%, which outperforms the B&amp;H strategy with return of 3%. The trades of NVDA bought following the META trend given a profit of 39.11$, with a winrate of 75%, a MDD of 0.11%, and a return of 3.98%, which is similar to the B&amp;H strategy with return of 4.02%. Figure <ref type="figure" target="#fig_4">3</ref> shows all the 7 positions applied at the same time on NVDA, MU and AMZN with buying/selling trades following the co-integration with META trend determined with VolTS-Aug algorithm. The trades on MU stock outperforms all the others stocks increasing notably the portfolio gain (see Table <ref type="table" target="#tab_2">2</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>In this paper, we propose VolTS-Aug an improvement of the AITA framework module calle VolTS. VolTS-Aug handle volatility in trading strategy combining causality by the Historical Volatility Granger Causality Test and DTW &amp; KNN to determine a profitable stock pairings improving on previous work <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b0">1]</ref>. The novelty of the approach implemented in VolTS-Aug lies in a better trades timing. To validate this claim, we applied our methodology to nine assets reduce to four after filtering by our methodology. The results shows a promising potential of this approach with a gain of 241.57$ (8.05%) in 21 days. In future works, we will further test its reliability with more refined assets selection (e.g., <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>) and balancing (buy, sell and hold trades) strategies (e.g., <ref type="bibr" target="#b12">[13,</ref><ref type="bibr" target="#b13">14]</ref>).</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Stocks selected in the range, from 1st January 2021 to 1st March 2024, are MU, NVDA, AMZN, QCOM, META from the Historical Volatility estimators dataset.</figDesc><graphic coords="4,84.65,84.19,425.97,137.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>GCT step: For each pair of time series (𝑋𝑖, 𝑌𝑗), we perform the Granger causality test. The model for the Granger test can be expressed as 𝑋𝑖(𝑡) = 𝛼𝑖𝑗 + ∑︀ 𝑛 𝑘=1 𝛽 𝑘,𝑖𝑗 𝑋𝑖(𝑡 − 𝑘) + ∑︀ 𝑛 𝑘=1 𝛾 𝑘,𝑖𝑗 𝑌𝑗(𝑡 − 𝑘) + 𝜖𝑖𝑗(𝑡), where 𝑋𝑖(𝑡) is the current value of 𝑋𝑖, 𝑋𝑖(𝑡 − 𝑘) and 𝑌𝑗(𝑡 − 𝑘) are the lagged values of 𝑋𝑖 and 𝑌𝑗, respectively, and 𝜖𝑖𝑗(𝑡) is the error term. If the coefficients 𝛾 𝑘,𝑖𝑗 are statistically different from zero, we reject the null hypothesis and conclude that 𝑌𝑗 Granger causes 𝑋𝑖. DTW-KNN step: (i) For each pair of time series (𝑋𝑖, 𝑌𝑗), where 𝑋𝑖 has length 𝑛 and 𝑌𝑗 has length 𝑚, the DTW distance 𝑑(𝑋𝑖, 𝑌𝑖) is given by 𝑑(𝑋𝑖, 𝑌𝑖) = 𝑚𝑖𝑛 𝑎𝑙𝑖𝑔𝑛𝑚𝑒𝑛𝑡 ∑︀ 𝑛 𝑖=1 ∑︀ 𝑚 𝑗=1 𝑐(𝑖, 𝑗), where 𝑐(𝑖, 𝑗) is the distance between points 𝑋𝑖[𝑖] and 𝑌𝑗[𝑗], and the optimization is performed overall possible alignments. (ii) The process of finding the best parameters delta time 𝛿𝑡 involves the KNN whit 𝑘 optimization through grid search.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Best Granger Causality Test with 21 lags.</figDesc><graphic coords="5,91.00,351.30,199.95,115.15" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Trades during the 21 testing days (1st March -5th April).</figDesc><graphic coords="5,299.98,374.37,208.63,90.72" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2</head><label>2</label><figDesc>Results of the backtesting in the experiment.</figDesc><table><row><cell>Stock</cell><cell>#Trades</cell><cell>𝜇 WinRate (%)</cell><cell>𝜇 TR ($)</cell><cell>𝜇 SR</cell><cell>𝜇 SoR</cell><cell>CR</cell></row><row><cell>META -&gt;AMZN, NVDA, MU</cell><cell>7</cell><cell>83.33</cell><cell>241.57</cell><cell>3.03</cell><cell>8.75</cell><cell>15.941</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://www.metatrader5.com/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">statistical timeseries analysis and deep neural networks</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Gasperis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2210.11532</idno>
	</analytic>
	<monogr>
		<title level="m">Dnn-forwardtesting: A new trading strategy validation using</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">On clustering multimedia time series data using k-means and dynamic time warping</title>
		<author>
			<persName><forename type="first">V</forename><surname>Niennattrakul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">A</forename><surname>Ratanamahatana</surname></persName>
		</author>
		<idno type="DOI">10.1109/MUE.2007.165</idno>
	</analytic>
	<monogr>
		<title level="m">2007 International Conference on Multimedia and Ubiquitous Engineering (MUE&apos;07)</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="733" to="738" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2307.13422</idno>
		<title level="m">Volts: A volatility-based trading system to forecast stock markets trend using statistics and machine learning</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">K-means++: The advantages of careful seeding</title>
		<author>
			<persName><forename type="first">D</forename><surname>Arthur</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Vassilvitskii</surname></persName>
		</author>
		<idno type="DOI">10.1145/1283383.1283494</idno>
		<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="1027" to="1035" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Kirchgässner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wolters</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-540-73291-4_3</idno>
		<title level="m">Granger Causality</title>
				<meeting><address><addrLine>Berlin Heidelberg; Berlin, Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="93" to="123" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Trading strategy validation using forwardtesting with deep neural networks</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Gasperis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</author>
		<idno type="DOI">10.5220/0011715300003494</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th International Conference on Finance, Economics, Management and IT Business, FEMIB 2023</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Arami</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Baudier</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">V</forename><surname>Chang</surname></persName>
		</editor>
		<meeting>the 5th International Conference on Finance, Economics, Management and IT Business, FEMIB 2023<address><addrLine>Prague, Czech Republic</addrLine></address></meeting>
		<imprint>
			<publisher>SCITEPRESS</publisher>
			<date type="published" when="2023">April 23-24, 2023. 2023</date>
			<biblScope unit="page" from="15" to="25" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Co-integration and error correction: Representation, estimation, and testing</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">F</forename><surname>Engle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">W J</forename><surname>Granger</surname></persName>
		</author>
		<ptr target="http://www.jstor.org/stable/1913236" />
	</analytic>
	<monogr>
		<title level="j">Econometrica</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="page" from="251" to="276" />
			<date type="published" when="1987">1987</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">AITA: A new framework for trading forward testing with an artificial intelligence engine</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<ptr target="org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Italia Intelligenza Artificiale -Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence (Ital IA 2023)</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">F</forename><surname>Falchi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Giannotti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Monreale</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Boldrini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Rinzivillo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Colantonio</surname></persName>
		</editor>
		<meeting>the Italia Intelligenza Artificiale -Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence (Ital IA 2023)<address><addrLine>Pisa, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">May 29-30, 2023. 2023</date>
			<biblScope unit="volume">3486</biblScope>
			<biblScope unit="page" from="506" to="511" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Drift-independent volatility estimation based on high, low, open, and close prices</title>
		<author>
			<persName><forename type="first">D</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zhang</surname></persName>
		</author>
		<idno type="DOI">10.1086/209650</idno>
		<ptr target="http://www.jstor.org/stable/10.1086/209650" />
	</analytic>
	<monogr>
		<title level="j">The Journal of Business</title>
		<imprint>
			<biblScope unit="volume">73</biblScope>
			<biblScope unit="page" from="477" to="492" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">A stock trading system for a medium volatile asset using multi layer perceptron</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Gasperis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2201.12286</idno>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Mta-kdd&apos;19: A dataset for malware traffic detection</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">D</forename><surname>Vita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Grifa</surname></persName>
		</author>
		<ptr target="http://ceur-ws.org/Vol-2597/paper-14.pdf" />
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="153" to="165" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Feature selection strategies for HTTP botnet traffic detection</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Caianiello</surname></persName>
		</author>
		<idno type="DOI">10.1109/EuroSPW.2019.00029</idno>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="202" to="210" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Dataset optimization strategies for malware traffic detection</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">D</forename><surname>Cecco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<idno>CoRR abs/2009.11347</idno>
		<ptr target="https://arxiv.org/abs/2009.11347.arXiv:2009.11347" />
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Imbalanced dataset optimization with new resampling techniques</title>
		<author>
			<persName><forename type="first">I</forename><surname>Letteri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">D</forename><surname>Cecco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dyoub</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">D</forename><surname>Penna</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-030-82196-8_15</idno>
	</analytic>
	<monogr>
		<title level="m">Intelligent Systems and Applications -Proceedings of the 2021 Intelligent Systems Conference, IntelliSys 2021</title>
		<title level="s">Lecture Notes in Networks and Systems</title>
		<editor>
			<persName><forename type="first">K</forename><surname>Arai</surname></persName>
		</editor>
		<meeting><address><addrLine>Amsterdam, The Netherlands</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021-09-03">2-3 September, 2021. 2021</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="199" to="215" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
