<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Study of the applicability of an itemset-based portfolio planner in a multi-market context</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Luca Cagliero</string-name>
          <email>luca.cagliero@polito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Garza</string-name>
          <email>paolo.garza@polito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dipartimento di Automatica e Informatica, Politecnico di Torino</institution>
          ,
          <addr-line>Turin</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>50</fpage>
      <lpage>55</lpage>
      <abstract>
        <p>Planning stock portfolios for long-term investments is a wellknown financial problem. Many data mining and machine learning strategies have been proposed to automatically predict the set of uncorrelated stocks maximizing long-term portfolio returns. Among others, the use of scalable itemset-based strategies has recently been studied. Potentially, they can analyze large sets of historical prices corresponding to thousands of stocks in the worldwide market indexes. However, the current studies are still limited to single markets. This paper investigates the applicability of itemset-based strategies for planning stock portfolios in a multi-market context. Scaling the analyses towards multi-market scenarios poses a number of research questions, among which the choice of the diversification strategy, the influence of inter-market correlations among stock prices, and the profitability of multi-market strategies compared to single-market ones. This paper aims at answering to the aforesaid questions by considering a state-of-the-art itemsetbased approach. The experimental results show that itemset-based strategies focus the generated portfolios on the outperforming markets. Furthermore, the performance of multi-market strategies with sector-based diversification is on average superior or comparable to single-market ones.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        INTRODUCTION
Forecasting the stock markets is a well-known financial problem.
It entails predicting the future prices of a set of stocks to drive
investments in the short-, medium-, or long-term. Predictions
are commonly driven by fundamental or technical analyses [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
The former studies analyze the overall state of a company or a
business (e.g., earnings, production, manufacturing), whereas the
latter analyze the historical stock prices, which are assumed to
reflect all the external influences. Technical analyses often
consider both statistics-based indicators, computed on the sampled
stock prices, and graphical patterns, recognized from the price
time series, that are likely to be related to specific trends [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>
        In this work, we focus on the analysis of the historical stock
prices to make long-term predictions. The aim is to generate a
portfolio consisting of a subset of market stocks whose prices
are likely to increase. To spread bets across multiple assets, thus
minimizing the losses in case forecasts turn out to be wrong,
portfolios are asked to be diversified , i.e., they should comprise
stocks from diferent sectors, markets, or geographical areas [
        <xref ref-type="bibr" rid="ref14 ref4">4,
14</xref>
        ].
      </p>
      <p>
        In recent years, the difusion of machine learning and data
mining techniques has prompted the financial sector and the
© 2018 Copyright held by the owner/author(s). Published in the Workshop
Proceedings of the EDBT/ICDT 2018 Joint Conference (March 26, 2018, Vienna,
Austria) on CEUR-WS.org (ISSN 1613-0073). Distribution of this paper is permitted
under the terms of the Creative Commons license CC-by-nc-nd 4.0.
research community to investigate their application to solve the
portfolio generation problem. For example, classification and
regression algorithms such as Neural Networks [
        <xref ref-type="bibr" rid="ref15 ref24">15, 24</xref>
        ], Decision
trees [
        <xref ref-type="bibr" rid="ref18 ref2">2, 18</xref>
        ], and Support Vector Machines [
        <xref ref-type="bibr" rid="ref12 ref5">5, 12</xref>
        ] have been
exploited to predict the future stock directions and prices,
respectively, based on the values of multiple dependent variables.
Alternative strategies entail the use of
(i) Time series analyses, to pinpoint significant temporal trends
in continuous stock signals [
        <xref ref-type="bibr" rid="ref11 ref13 ref25 ref9">9, 11, 13, 25</xref>
        ],
(ii) Clustering algorithms, to group stocks characterized by
similar behaviors [
        <xref ref-type="bibr" rid="ref16 ref21">16, 21</xref>
        ],
(iii) Pattern recognition techniques, to recognize graphical
patterns coming from technical analyses [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and
(iv) Particle swarm optimization and evolutionary algorithms, to
identify the stocks that maximize a given objective function [
        <xref ref-type="bibr" rid="ref1 ref7">1, 7</xref>
        ].
      </p>
      <p>
        Itemset mining is an exploratory data mining technique that
focuses on discovering recurrent co-occurrences among items
in large transactional dataset [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. For example, let us consider
a transactional dataset collecting the baskets of the customers
of a market, where each transaction (basket) consists of a set of
distinct items. Frequent itemset mining algorithms have been
exploited to discover combinations of items that are frequently
purchased together. Since items may have diferent importance
within the analyzed datasets (e.g., diferent prices and purchased
amounts) their occurrences in each transaction can be weighted [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>
        Recently, in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] a first attempt to apply itemset mining
techniques to generate diversified stock portfolios has been made.
Stocks are represented as distinct items in the dataset. A
transactional dataset collects the historical stock prices within a time
range. Each transaction corresponds to a distinct timestamp
within the given time range and contains all the quoted stocks
weighted by their price at the corresponding timestamp.
According to the data model described above, itemsets represent
candidate stock portfolios consisting of sets of stocks of arbitrary
size. In [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] the most interesting itemsets are generated and ranked
according to the average return of the contained stocks as well
to their level of diversification in the portfolio.
      </p>
      <p>
        The main advantages of itemset-based approaches are (i) the
interpretability of the generated model and (ii) the scalability of
the extraction algorithms, which can be applied to very large
datasets [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. On the one hand, the interpretability of the mined
itemsets allows domain experts to manually explore the top
ranked itemsets to make appropriate decisions. On the other
hand, the scalability of the itemset mining process makes the
portfolio generation process portable to multi-market domains. The
algorithm can analyze large stock datasets acquired from
multiple markets and automatically recommend diversified worldwide
investments with limited human efort. However, to the best of
our knowledge, the application of itemset-based strategies in
multi-market contexts has not been investigated yet.
      </p>
      <p>
        CONTRIBUTION
This paper investigates the applicability of itemset-based
strategies for planning diversified stock portfolios in a multi-market
context. Extending the scope of the stock data analysis from
single markets to multiple ones poses the following research
questions:
Choice of diversification strategy. Stocks can be categorized
based on diferent strategies, such as the industrial sector of the
underlying company, the market index of the stock, the
nationality of the company, or the country/continent associated with the
market index. These categorizations can be exploited to diversify
investments across uncorrelated assets. In multi-market contexts,
the choice of the diversification strategy is not trivial, as it could
relevantly afect the performance of the stock portfolio planner.
The research questions we would like to address in this study can
be formulated as follows: Which type of diversification
strategy better preserves the portfolio profits? Which type of
diversification strategy allows optimally spreading
investments across multiple assets?
Influence of inter-market stock correlations. Studying the
correlation between the prices of multiple stocks is crucial for
professional traders and private investors to take appropriate
decisions. However, analyzing the influence between the stocks
belonging to multiple markets is potentially challenging, because
the number of stocks indexed in worldwide markets is very large.
Itemset-based approaches [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] allow domain experts to set the
desired levels of average return and diversification according to
the chosen stock categorization. Under these constraints, a set
of candidate portfolios is generated. The top ranked portfolios
include the stocks with maximal average return and with a
diversification level at least equal to the set least diversification
level. Therefore, the outperforming markets are likely to be
overweighted, while the stocks indexed in under-performing markets
are likely to be under-weighted. The research question we would
like to address in this study can be formulated as follows: Does
the majority of the portfolio stocks belong to
outperforming markets? Are the stocks in the portfolio correlated in
terms of membership index?
Comparison between diferent scenarios. Extending the scope
of the analysis towards multiple markets gives professional traders
and private investors new opportunities of investment on
foreign markets. Considering a larger number of considered stocks
not only simplifies the process of diversification of the
investments, but also allows traders to move investments towards most
profitable markets. However, a quantitative evaluation of the
benefits for itemset-based approaches of considering multiple
markets at the same time compared to single-market analyses is
still missing. The research question we would like to address in
this study can be formulated as follows: Are the portfolios
generated from multiple markets more profitable than those
generated from single markets?
      </p>
      <p>In this study we investigated the use of diferent
diversification strategies in multi-market scenarios to gain insights into
the efectiveness of itemset-based strategies on large stock data.
Furthermore, we analyzed the generated portfolios to understand
to what extent inter-market stock correlations are considered in
the recommended portfolios. Finally, we empirically compared
the performance of single- and multi-market recommendations
in diferent scenarios.</p>
      <p>
        The rest of the paper is organized as follows. Section 3
summarizes the main steps of the diversified stock portfolio planner [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
Section 4 describes the experimental design. Sections 5, 6, 7
discuss the choice of the diversification strategy, the influence of
inter-market stock correlations, and the comparison between
multiple and single market strategies, respectively. Finally, Section 8
draws conclusions and discusses the future research perspectives
of this work.
3
      </p>
      <p>
        THE DIVERSIFIED STOCK PORTFOLIO
PLANNER
DISPLAN (Diversified stock portfolio planner) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is an
itemsetbased strategy for generating diversified stock portfolios based
on the analysis of historical stock prices. It relies on the following
steps:
Stock data collection and preprocessing. This step focuses
on crawling historical stock prices and collecting them into a
unique dataset. It takes as input a list of stocks and a time range.
It acquires the daily closing prices of all the considered stocks
and stores them into a weighted transactional dataset. Each row
in the dataset (called transaction) corresponds to a diferent
timestamp and contains the prices of all the considered stocks at the
corresponding timestamp. Each pair ⟨stock, price⟩ occurring in
the dataset is denoted as weighted item.
      </p>
      <p>
        Weighted itemset mining. This step analyzes the correlations
between stock prices based on weighted itemset mining
techniques [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. It takes as input the weighted transactional dataset
prepared at the previous step and a taxonomy aggregating stocks
into higher-level categories. For example, to each stock the
corresponding industrial sector can be assigned.
      </p>
      <p>It extracts interesting patterns, called frequent weighted
itemsets, from the weighted transactional dataset. A weighted itemset
is a set of stocks of arbitrary length, which represents a
candidate stock portfolio. The extracted weighted itemsets satisfy the
following properties:
(i) The average daily return of all the stocks in the itemset is
above a given minimum return threshold minret.
(ii) The percentage of stocks belonging to diferent categories
(according to the input taxonomy) is above a given diversification
threshold mindiv.</p>
      <p>Portfolio generation. This step analyzes the extracted
itemsets to identify the best candidate stock portfolios satisfying all
the user requirements. To make the extracted patterns promptly
usable by investors for stock portfolio planning the mined
itemsets are first ranked in order of (i) decreasing length (i.e.,
number of contained stocks) and (ii) average daily return. The top
ranked itemsets are deemed as the most appropriate hints for
buy-and-hold (long-term) investors. More specifically, the
itemsets containing the maximal number of stocks are selected as best
candidate portfolios, because they satisfy all user requirements
(by construction) and contain the maximal number of stocks thus
allowing investors to spread their bets over the largest number
of diferent assets. In case of ties, the itemset with maximal least
average return is considered as the best candidate stock portfolio
because it achieved maximal profit on historical data. On equal
terms (i.e., same length and average daily return) the analyst is
asked to decide which itemset is deemed to be the most
appropriate stock portfolio to consider based on his personal judgment
and experience.</p>
      <p>
        EXPERIMENTAL DESIGN
We analyzed stock data acquired by means of the Yahoo! Finance
APIs [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. To crawl data, we performed several API requests to
retrieve the closing stock prices of several stocks from diferent
market indexes. Each request produces a diferent stock dataset,
which consists of the closing prices of the requested stocks within
the considered time period sampled at the desired frequency. In
our analyses, we considered the daily closing prices of the stocks
in two representative years, i.e., 2008 and 2013. Year 2008 is
representative of an unfavorable condition for worldwide financial
markets, i.e., the rise of the global financial crisis, whereas year
2013 represents a favorable market condition, i.e., the boom of
U.S. markets. Analyzing opposite market scenarios allows us to
perform a fair assessment of the portfolio generator with diferent
settings.
      </p>
      <p>To study the performance of the itemset-based portfolio
planner in a multi-market context, we classified market indexes based
on the corresponding opening timezone as follows: (i) Europe,
with approximate opening times from 9am to 5.30pm CET. (ii)
Asia and Oceania, with approximate opening times from 2am to
9.30am CET. (iii) North and South America: with approximate
opening times from 3pm to 10pm CET. As European indexes we
considered the following ones: the Bruxelles Stock exchange (BEL
20) (20 stocks), the Paris market (CAC 40) (40 stocks), the London
Financial Times Stock Exchange (FTSE 100) (98 stocks), the
Italian stock exchange (FTSE MIB 40) (40 stocks), the General Athens
Composite Index (GD) (59 stocks), the HDAX Deutscher Aktien
index (GDAXI) (109 stocks), the OMX Stockholm 30 (OMX) (25
stocks), and Oslo Bors All Share Index (OSEAX) (127 stocks). As
Asian and Oceania Indexes we considered the following ones: the
BSE Sensex (BSESN) Based on the Bombay Stock Exchange
(India), the FTSE Straits Times Index (STI) based on the Singapore
Exchange (29 stocks), the Hang Seng Index (HSI) based on the Hong
Kong Stock Exchange (50 stocks), the NIFTY 50 (NSEI) consisting
of companies listed on the Bombay Stock Exchange (BSE-India)
(50 stocks), the NZX 50 (NZ50) based on the New Zealand Stock
Exchange (NZSX) (39 stocks), the S&amp;P/ASX 200 (AXJO) based
on the Australian Securities Exchange from Standard &amp; Poor’s
(199 stocks), and the Taiwan Capitalization Weighted Stock Index
TAIEX Index (TWII) (898 stocks). As North and South America
Indexes we considered the following ones: the Brazil Broad-Based
Index (IBRA) (116 stocks), the US Dow Jones Industrial Average
(DJI) based on the New York Stock Exchange (30 stocks), the
Brasilian Indice Bovespa (BVSP) (63 stocks), the IPC Index (MXX)
based on the Mexican Stock Exchange (34 stocks), the IVBX 2
Brasilian Index (IVBX) (50 stocks), the MERVAL Index (MERV)
based on the Stock Exchange of Buenos Aires (Argentina) (12
stocks), the US Nasdaq Stock Market index NASDAQ-100 (NDX),
the US S&amp;P 500 (GSPC) (502 stocks), and the Canadian S&amp;P/TSX
Venture Composite Index (SPCDNX) (338 stocks). The Europe
timezone comprises 518 stocks, the Asia and Oceania timezone
1118 stocks, while the North and South America 1147.
Hereafter we will denote as market-based categorization the stock
categorization based on the considered indexes. We considered
also a sector-based stock categorization according to the Industry
Classification Benchmark (http://www.icbenchmark.com/).</p>
      <p>To simulate long-term stock investments, we applied the
following procedure: (i) We trained the itemset-based model and
generated the diversified stock portfolio on the first 7-month time
period. (ii) We tested the model by virtually buying the stocks
at the beginning of August and selling the whole portfolios in
the following year. We varied the selling date in the date range
between August, 1st 2013 and August, 1st 2014.</p>
      <p>
        We simulated both long and short selling investing positions.
Long selling entail buying the stocks because its price is likely
to increase thus yielding a profit in case the price has increased
when the stock is sold. Conversely, short selling is the practice
of selling stocks that are not currently owned, and subsequently
repurchasing them at the end of the investment. If the price
decreases, the short seller profits, since the cost of (re)purchase
is less than the proceeds received upon the initial (short) sale.
Conversely, the short selling position closes out at a loss if the
stock price rises prior to repurchase [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>In the following section we will analyze also the impact of
the algorithm parameters on the quality of the training models.
Notice that the portfolio profits are lower bound estimates of
the actual profits as stocks may be sold one by one rather than
altogether and the investments can be reconsidered during the
whole period (not only at the end). Furthermore, payofs produced
by intermediate sells can be reinvested during the same time
period. Finally, transactions costs and local taxes and fees were
not considered.
5</p>
      <p>CHOICE OF THE DIVERSIFICATION</p>
      <p>STRATEGY
We compared the performance of the itemset-based stock
portfolio planner (hereafter denoted as DISPLAN for the sake of brevity)
by using two alternative diversification strategies:
(i) a market-based strategy, where stocks are picked from diferent
indexes to limit intra-market stock correlations (independently
of the industrial sector of the considered stocks), and
(ii) a sector-based strategy, where stocks picked from diferent
sectors (independently of the underlying market index).</p>
      <p>As representative examples, in Figures 1, 2, and 3 we plotted
the relative returns achieved in year 2013 by the portfolio
generated by DISPLAN on the markets of the Asia and Oceania, Europe,
and North and South America timezones with both market- and
sector-based diversification. For all the considered timezones we
reported the results achieved by using long selling positions (see
Section 4) and by setting the minimum diversification threshold
to 70% (i.e., at least seven stocks out of 10 must belong to diferent
categories). The minimum return threshold values enforced to
discard non-profitable sets of stocks are 11%, 9%, and 12%,
respectively. To compare DISPLAN performance with that of the
considered indexes for each configuration, we plotted also the
percentage variation of the benchmark indexes as well as those
of an aggregate index consisting of all the underlying indexes
the same timezone.</p>
      <p>The achieved results show that applying a sector-based
diversification yields significantly higher profits compared to
applying a market-based strategy (e.g., in the Asia and Oceania
timezone on July, 1 2014 the average variation of the
sectorbased strategy is 120% vs. the 18% of the market-based one). The
motivations behind the achieved results are the inherent
sparsity of the market-based categorization compared to the
sectorbased one and the stronger influence of sector-driven stock price
movements. Specifically, by applying sector-based diversification
within each category the algorithm can choose among a quite
large number of stocks. Among the per-sector candidate stocks,
some of them are likely to outperform the benchmarks. Therefore,
the stocks that under-perform the corresponding sector can be
discarded. Conversely, to satisfy the least diversification level, the
market-based strategy could be forced to pick stocks that
underperform the corresponding markets index as well. Furthermore,
the number of candidate indexes per timezone is still limited
(8 for Europe and Asia and Oceania, 10 for South and North
America). Thus, in large portfolios to achieve high diversification
levels the stocks belonging to low-performing indexes cannot
be neglected. On the other hand, the intra-sector correlations
among stock prices appear to be stronger than intra-market ones.
For example, a drop of the oil price negatively influences all the
correlated stocks independently of the market index. In summary,
based on the achieved results it turns out that considering a stock
categorization based on sectors prevents the DISPLAN algorithm
from making inappropriate decisions.
6</p>
      <p>ANALYSIS OF INTER-MARKET STOCK
CORRELATIONS
The stock portfolios generated by the DISPLAN algorithm may
include stocks belonging to multiple markets. Hence, it is
interesting to investigate how the inter-market correlations among stock
prices could afect the performance of the DISPLAN algorithm.</p>
      <p>Figure 4 shows, as representative case study, the relative
returns achieved in year 2013 by the portfolio generated by
DISPLAN (with long selling position) in both multi- and
singlemarket scenarios. Specifically, as representative study, we
reported the performance of the portfolio generated by DISPLAN
from the analysis of the stock data related to all the markets in
the Asia and Oceania timezone as well as the performance of the
portfolios generated by DISPLAN from each index in the same
timezone. Notice that since the AXJO, HSI, and STI indexes did
not produce any single-market portfolios satisfying the minimum
return and diversification constraints, the corresponding curves
were omitted.</p>
      <p>By enforcing a minimum diversification level among sectors
of 70% (mindiv=70%), the multi-market portfolio consists of the
same stocks selected by the best performing single-market
portfolio, i.e., the one generated for the TWII index of Taiwan (see
Figure 4(a)). Conversely, while setting the highest possible value
of sector-based diversification level ( mindiv=100%), all the stocks
in the portfolios must belong to a diferent sector. The maximally
diversified portfolio difers from the former one because a stock
from the AX index (JBH) was selected. The sector of stock JBH
under-performed the benchmark index in year 2013. However,
stock JBH performed better than the benchmark sector
(approximately +15%). For this reason, despite the higher diversification
of the new portfolio its relative returns are still relatively high.
On the contrary, by setting the diversification threshold to its
maximal value the portfolio generated from the single-market
TWII index significantly decreases its relative returns, because
the added stocks under-performed the benchmark sector index
(see Figure 4(b)).
7</p>
      <p>COMPARISON BETWEEN MULTI- AND
SINGLE-MARKET STRATEGIES
We compared multi- and single-market strategies to assess the
applicability of the DISPLAN system in a multi-market scenario.
The results, which were summarized in Figure 4 for a
representative case study (Asia and Oceania, long selling position, year
2013), show that the the multi-market approach performed better
than most single-market strategies while it performs as good
as the best performing single-market one. Therefore, applying
the itemset-based portfolio generation strategy is particularly
appealing, as it allows us to diversify investments across
multiple market indexes without significantly degrading the portfolio
returns.</p>
      <p>Figure 5 shows the performance of the DISPLAN algorithms
by setting a relatively high minimum return threshold (16%).
The results show that the multi-market strategy performed as
good as the best single-market strategy (the one corresponding
with the best performing index). Conversely, many single-market
strategies appear to be less efective because few candidate stocks
are selected. The motivation behind is that, given a large number
of candidate stocks from multiple markets, the likelihood that a
set of highly profitable stocks diversified over sectors is found is
higher. The more stocks the algorithm can analyze, the most likely
a profitable and diversified stock portfolio can be discovered. To
avoid data overfitting the number of stock data samples should
be at least on the order of the number of analyzed stocks.
8</p>
      <p>CONCLUSIONS AND FUTURE WORK
In this paper, the application of itemset-based approaches to
generating diversified stock portfolios in a multi-market scenario
has been studied. Given a stock categorization and a dataset
collecting the historical prices of a potentially large set of stocks,
itemsets representing profitable yet diversified stock portfolios
can be automatically extracted and recommended to investors,
professional and not. The scalability of itemset-based techniques
prompted their application in a multi-market scenario, where the
following issues have been addressed.</p>
      <p>(i) The choice of the diversification strategy is not immediate,
because in multi-market contexts investors could spread bets
across either markets or sectors. Based on the achieved results,
sector-based diversification yielded significantly better results
due to the good balancing between the stocks across sectors.</p>
      <p>(ii) The inter-market correlations among stocks are properly
handled by the itemset-based strategy, as the most profitable
stocks are selected independently of the underlying market index.</p>
      <p>(iii) Multi-market strategies performed better than or as good
as single-market ones in most of the performed experiments.</p>
      <p>Future works will entail applying itemset-based strategies on
datasets collecting historical stock prices at finer time
granularities. The aim is to apply itemset-based models to drive
mediumand short-term investments (e.g., intra-day trading). Furthermore,
we will try to apply more advanced itemset mining techniques,
such as utility and probabilistic itemset mining, in order to (i)
shape investments according to the amounts of stocks already
in the portfolio, and (ii) take stock volatility and risk levels into
account.
A
-10
.
.tr )
.
w (% 25
n 3
ito 1
.rvvaag --ug210A 1250
i
45
40
35
30
10
5
0
-5
45
40
35
30
5
0
-5
-10
.
.tr )
.w (% 25
on 13 20
it 0
rvaa -ug2 15
i
.vg -1A 10</p>
    </sec>
    <sec id="sec-2">
      <title>AXJOIndex</title>
    </sec>
    <sec id="sec-3">
      <title>NZ50Index</title>
    </sec>
    <sec id="sec-4">
      <title>TWIIIndex</title>
    </sec>
    <sec id="sec-5">
      <title>DISPLAN</title>
    </sec>
    <sec id="sec-6">
      <title>Aggr. Index</title>
      <p>BEL20Index
FCHIIndex
FTSEMIBIndex
FTSEIndex
GDIndex</p>
      <p>OMXIndex
OSEAXIndex
DISPLAN
Aggr. Index</p>
      <sec id="sec-6-1">
        <title>GSPCIndex</title>
        <p>MXXIndex
SPCDNXIndex</p>
        <p>DISPLAN
Aggr. Index
3
1
g
u
A
1
0
3
1
p
e
S
1
0
3
1
tc
O
1
0
3
1
v
o
N
1
0
3
1
c
e
D
1
0
4
1
ra
M
1
0
4
1
rp
A
1
0
4
1
y
a
M
1
0
4
1
n
u
J
1
0
4
1
l
u
J
1
0
4
1
g
u
A
1
0
4
1
n
a
J
1
0
4
1
b
e
F
1
0</p>
        <p>Date
(a) Market-based strategy
3
1
g
u
A
1
0
3
1
p
e
S
1
0
3
1
tc
O
1
0
3
1
v
o
N
1
0
3
1
c
e
D
1
0
4
1
n
a
J
1
0
4
1
b
e
F
1
0</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Date</title>
      <p>4
1
-r
a
M
1
0
4
1
rp
A
1
0
4
1
y
a
M
1
0
4
1
n
u
J
1
0
4
1
l
u
J
1
0
4
1
g
u
A
1
0
(b) Sector-based strategy
kets, long strategy, mindiv =70%, minr et =11%
AXJOIndex
NZ50Index
TWIIIndex
DISPLAN
Aggr. Index
BEL20Index</p>
      <p>FCHIIndex
FTSEMIBIndex</p>
      <p>FTSEIndex
GDIndex</p>
      <p>OMXIndex
OSEAXIndex</p>
      <p>DISPLAN
Aggr. Index
GSPCIndex</p>
      <p>MXXIndex
SPCDNXIndex</p>
      <p>DISPLAN
Aggr. Index
-10
70
60
50
10
0
-10
selling position. mindiv =70%, minr et =9%
A
A
3
1
g
u
A
1
0
3
1
p
e
S
1
0
3
1
tc
O
1
0
3
1
v
o
N
1
0
3
1
c
e
D
1
0
4
1
ra
M
1
0
4
1
rp
A
1
0
4
1
y
a
M
1
0
4
1
n
u
J
1
0
4
1
l
u
J
1
0
4
1
g
u
A
1
0
4
1
n
a
J
1
0
4
1
b
e
F
1
0</p>
      <p>Date
(a) Market-based strategy
3
1
g
u
A
1
0
3
1
p
e
S
1
0
3
1
tc
O
1
0
3
1
v
o
N
1
0
3
1
c
e
D
1
0
4
1
ra
M
1
0
4
1
rp
A
1
0
4
1
y
a
M
1
0
4
1
n
u
J
1
0
4
1
l
u
J
1
0
4
1
g
u
A
1
0
4
1
n
a
J
1
0
4
1
b
e
F
1
0</p>
      <sec id="sec-7-1">
        <title>Date</title>
        <p>(b) Sector-based strategy
.vg -1A 40
A
(a) mindiv =70%</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Date</title>
      <p>(b) mindiv =100%
.vg -1A 40
A</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.E.</given-names>
            <surname>Abdual-Salam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.M.</given-names>
            <surname>Abdul-Kader</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.F.</given-names>
            <surname>Abdel-Wahed</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Comparative study between Diferential Evolution and Particle Swarm Optimization algorithms in training of feed-forward neural network for stock price prediction</article-title>
          .
          <source>In Informatics and Systems (INFOS)</source>
          ,
          <source>2010 The 7th International Conference on. 1-8.</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.S.</given-names>
            <surname>Abdullah</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.S.</given-names>
            <surname>Rahaman</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Stock market prediction model using TPWS and association rules mining</article-title>
          .
          <source>In Computer and Information Technology (ICCIT)</source>
          ,
          <year>2012</year>
          15th International Conference on.
          <fpage>390</fpage>
          -
          <lpage>395</lpage>
          . DOI:http://dx.doi.org/ 10.1109/ICCITechn.
          <year>2012</year>
          .6509756
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Imieliński</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Swami</surname>
          </string-name>
          .
          <year>1993</year>
          .
          <article-title>Mining Association Rules between Sets of Items in Large Databases</article-title>
          .
          <source>In ACM SIGMOD Record</source>
          , Vol.
          <volume>22</volume>
          . ACM, New York,
          <fpage>207</fpage>
          -
          <lpage>216</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Elena</given-names>
            <surname>Baralis</surname>
          </string-name>
          , Luca Cagliero, and
          <string-name>
            <given-names>Tania</given-names>
            <surname>Cerquitelli</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Supporting stock trading in multiple foreign markets: a multilingual news summarization approach</article-title>
          .
          <source>In Proceedings of the Second International Workshop on Data Science for Macro-Modeling, DSMM@SIGMOD</source>
          <year>2016</year>
          , San Francisco, CA, USA, June 26 - July 1,
          <year>2016</year>
          .
          <volume>3</volume>
          :
          <fpage>1</fpage>
          -
          <issue>3</issue>
          :
          <fpage>6</fpage>
          . DOI:http://dx.doi.org/10.1145/2951894.2951897
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Elena</given-names>
            <surname>Baralis</surname>
          </string-name>
          , Luca Cagliero, Tania Cerquitelli, Paolo Garza, and
          <string-name>
            <given-names>Fabio</given-names>
            <surname>Pulvirenti</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Discovering profitable stocks for intraday trading</article-title>
          .
          <source>Information Sciences</source>
          <volume>405</volume>
          (
          <year>2017</year>
          ),
          <fpage>91</fpage>
          -
          <lpage>106</lpage>
          . DOI:http://dx.doi.org/https://doi.org/10.1016/j. ins.
          <year>2017</year>
          .
          <volume>04</volume>
          .013
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Elena</given-names>
            <surname>Baralis</surname>
          </string-name>
          , Luca Cagliero, and
          <string-name>
            <given-names>Paolo</given-names>
            <surname>Garza</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Planning stock portfolios by means of weighted frequent itemsets</article-title>
          .
          <source>Expert Syst. Appl</source>
          .
          <volume>86</volume>
          (
          <year>2017</year>
          ),
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . DOI:http://dx.doi.org/10.1016/j.eswa.
          <year>2017</year>
          .
          <volume>05</volume>
          .051
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Diego</surname>
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Bodas-Sagi</surname>
            ,
            <given-names>Pablo</given-names>
          </string-name>
          <string-name>
            <surname>Fernández</surname>
            ,
            <given-names>J. Ignacio</given-names>
          </string-name>
          <string-name>
            <surname>Hidalgo</surname>
          </string-name>
          , Francisco J.
          <string-name>
            <surname>Soltero</surname>
          </string-name>
          , and José L.
          <string-name>
            <surname>Risco-Martín</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Multiobjective Optimization of Technical Market Indicators</article-title>
          .
          <source>In Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers (GECCO '09)</source>
          . ACM, New York, NY, USA,
          <fpage>1999</fpage>
          -
          <lpage>2004</lpage>
          . DOI:http://dx.doi.org/10. 1145/1570256.1570266
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Gen-Huey</surname>
            <given-names>Chen</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ming-Yang</surname>
            <given-names>Kao</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuh-Dauh Lyuu</surname>
          </string-name>
          , and
          <string-name>
            <surname>Hsing-Kuo Wong</surname>
          </string-name>
          .
          <year>1999</year>
          .
          <article-title>Optimal Buy-and-hold Strategies for Financial Markets with Bounded Daily Returns</article-title>
          .
          <source>In Proceedings of the Thirty-first Annual ACM Symposium on Theory of Computing (STOC '99)</source>
          . ACM, New York, NY, USA,
          <fpage>119</fpage>
          -
          <lpage>128</lpage>
          . DOI: http://dx.doi.org/10.1145/301250.301284
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Chonghui</given-names>
            <surname>Guo</surname>
          </string-name>
          , Hongfeng Jia,
          <string-name>
            <given-names>and Na</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Time Series Clustering Based on ICA for Stock Data Analysis</article-title>
          .
          <source>In Wireless Communications, Networking and Mobile Computing</source>
          ,
          <year>2008</year>
          . WiCOM '
          <volume>08</volume>
          . 4th International Conference on. 1-
          <fpage>4</fpage>
          . DOI:http://dx.doi.org/10.1109/WiCom.
          <year>2008</year>
          .2534
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Larry</given-names>
            <surname>Harris</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Trading and Exchanges: Market Microstructure for Practitioners</article-title>
          . Oxford University Press. https://EconPapers.repec.org/RePEc:oxp: obooks:
          <fpage>9780195144703</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <article-title>Fu lai Chung, Tak chung Fu, Robert Luk</article-title>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Ng</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Evolutionary time series segmentation for stock data mining</article-title>
          .
          <source>In Data Mining</source>
          ,
          <year>2002</year>
          .
          <article-title>ICDM 2003</article-title>
          . Proceedings. 2002 IEEE International Conference on.
          <fpage>83</fpage>
          -
          <lpage>90</lpage>
          . DOI:http: //dx.doi.org/10.1109/ICDM.
          <year>2002</year>
          .1183889
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Yuling</surname>
            <given-names>Lin</given-names>
          </string-name>
          , Haixiang
          <string-name>
            <surname>Guo</surname>
            , and
            <given-names>Jinglu</given-names>
          </string-name>
          <string-name>
            <surname>Hu</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>An SVM-based approach for stock market trend prediction</article-title>
          .
          <source>In Neural Networks (IJCNN)</source>
          ,
          <source>The 2013 International Joint Conference on. 1-7</source>
          . DOI:http://dx.doi.org/10.1109/IJCNN.
          <year>2013</year>
          .6706743
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Chao</surname>
            <given-names>Luo</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Yanchang</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Longbing</given-names>
            <surname>Cao</surname>
          </string-name>
          , Yuming Ou,
          <string-name>
            <given-names>and Chengqi</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Exception Mining on Multiple Time Series in Stock Market</article-title>
          .
          <source>In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03 (WI-IAT '08)</source>
          . IEEE Computer Society, Washington, DC, USA,
          <fpage>690</fpage>
          -
          <lpage>693</lpage>
          . DOI:http://dx.doi.org/10.1109/WIIAT.
          <year>2008</year>
          .302
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Harry</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Markowitz</surname>
          </string-name>
          .
          <year>1991</year>
          .
          <article-title>Portfolio Selection: Eficient Diversification of Investments (2 ed</article-title>
          .). Wiley.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Amin</given-names>
            <surname>Hedayati</surname>
          </string-name>
          <string-name>
            <surname>Moghaddam</surname>
          </string-name>
          , Moein Hedayati Moghaddam, and
          <string-name>
            <given-names>Morteza</given-names>
            <surname>Esfandyari</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Stock market index prediction using artificial neural network</article-title>
          .
          <source>Journal of Economics, Finance and Administrative Science</source>
          <volume>21</volume>
          ,
          <issue>41</issue>
          (
          <year>2016</year>
          ),
          <fpage>89</fpage>
          -
          <lpage>93</lpage>
          . DOI:http://dx.doi.org/https://doi.org/10.1016/j.jefas.
          <year>2016</year>
          .
          <volume>07</volume>
          .002
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rostoker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>H.</given-names>
            <surname>Hoos</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>A Parallel Workflow for Real-time Correlation and Clustering of High-Frequency Stock Market Data</article-title>
          .
          <source>In Parallel and Distributed Processing Symposium</source>
          ,
          <year>2007</year>
          .
          <article-title>IPDPS 2007</article-title>
          . IEEE International.
          <volume>1</volume>
          -
          <fpage>10</fpage>
          . DOI:http://dx.doi.org/10.1109/IPDPS.
          <year>2007</year>
          .370216
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Chih-Fong Tsai</surname>
          </string-name>
          and
          <string-name>
            <surname>Zen-Yu Quan</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Stock Prediction by Searching for Similarities in Candlestick Charts</article-title>
          .
          <source>ACM Trans. Manage. Inf. Syst. 5</source>
          ,
          <issue>2</issue>
          ,
          <string-name>
            <surname>Article 9</surname>
          </string-name>
          (
          <year>July 2014</year>
          ),
          <volume>21</volume>
          pages. DOI:http://dx.doi.org/10.1145/2591672
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Huacheng</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yanxia Jiang</surname>
            , and
            <given-names>Hui</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Stock return prediction based on Bagging-decision tree</article-title>
          .
          <source>In Grey Systems and Intelligent Services</source>
          ,
          <year>2009</year>
          .
          <article-title>GSIS 2009</article-title>
          . IEEE International Conference on.
          <fpage>1575</fpage>
          -
          <lpage>1580</lpage>
          . DOI:http://dx.doi.org/ 10.1109/GSIS.
          <year>2009</year>
          .5408165
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Wei</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiong Yang</surname>
          </string-name>
          , and
          <string-name>
            <surname>Philip</surname>
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          .
          <year>2000</year>
          .
          <article-title>Eficient mining of weighted association rules (WAR)</article-title>
          .
          <source>In Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          ,
          <source>KDD'00</source>
          .
          <fpage>270</fpage>
          -
          <lpage>274</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>T.</given-names>
            <surname>Williams</surname>
          </string-name>
          and
          <string-name>
            <given-names>V.</given-names>
            <surname>Turton</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Trading Economics: A Guide to Economic Statistics for Practitioners and Students</article-title>
          . Wiley.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Zeng</surname>
            <given-names>Xiu</given-names>
          </string-name>
          , Peng Hong, and
          <string-name>
            <given-names>Zeng</given-names>
            <surname>Zhen</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Clustering in stock market based on fractal theory</article-title>
          .
          <source>In Machine Learning and Cybernetics</source>
          , 2009 International Conference on, Vol.
          <volume>1</volume>
          .
          <fpage>161</fpage>
          -
          <lpage>164</lpage>
          . DOI:http://dx.doi.org/10.1109/ICMLC.
          <year>2009</year>
          . 5212496
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Qin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>FiDoop-DP: Data Partitioning in Frequent Itemset Mining on Hadoop Clusters</article-title>
          .
          <source>IEEE Transactions on Parallel and Distributed Systems 28, 1 (Jan</source>
          <year>2017</year>
          ),
          <fpage>101</fpage>
          -
          <lpage>114</lpage>
          . DOI:http://dx.doi.org/10. 1109/TPDS.
          <year>2016</year>
          .2560176
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>YahooFinance</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Yahoo Finance Website</article-title>
          .
          <source>Last access September</source>
          <year>2016</year>
          . (
          <year>2016</year>
          ). https://it.finance.yahoo.com/
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Defu</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Qingshan Jiang, and
          <string-name>
            <given-names>Xin</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>Application of Neural Networks in Financial Data Mining.</article-title>
          .
          <source>In International Conference on Computational Intelligence</source>
          <volume>(</volume>
          <fpage>2005</fpage>
          -02-01), Ali Okatan (Ed.).
          <source>International Computational Intelligence Society</source>
          ,
          <fpage>392</fpage>
          -
          <lpage>395</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Zhe</surname>
            <given-names>Zhang</given-names>
          </string-name>
          , Jian Jiang, Xiaoyan Liu, Ricky Lau,
          <string-name>
            <given-names>Huaiqing</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Rui</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>A Real Time Hybrid Pattern Matching Scheme for Stock Time Series</article-title>
          .
          <source>In Proceedings of the Twenty-First Australasian Conference on Database Technologies - Volume 104 (ADC '10)</source>
          . Australian Computer Society, Inc.,
          <string-name>
            <surname>Darlinghurst</surname>
          </string-name>
          , Australia, Australia,
          <fpage>161</fpage>
          -
          <lpage>170</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>