-

Automatic construction of neural networks for stock market trend prediction

Kouame Kouassi

Deshendran Moodley

deshen@cs.uct.ac.za 1 0 University of Cape Town - Centre for Arti cial Intelligence Research , South Africa 1 University of Cape Town - Centre for Arti cial Research , South Africa

Although signi cant work has been done for hyperparameter optimisation (HO) for deep learning structures in general, there is limited research exploring their use for time series applications, such as stock market prediction. In this research we evaluate the potential for automating structure learning and hyperparameter selection for neural networks for stock market trend prediction with a limited computational budget. We evaluate Bayesian Optimisation and Hyperband (BOHB), for automatic construction of LSTM and CNN models, and compare these to manually tuned models. The results show that the BOHB technique can select CNN and LSTM models that can compete against manually tuned models.

Stock Market Trend Prediction rameter Optimisation Neural Networks

Stock trend prediction problem, and hyperparameter optimisation Given a stock market index such as S&P 500, we aim to classify the next day trend direction (UP or DOWN) using the last l days price sequence. Wen et al. [ 6 ] recently tackled it using a convolutional neural networks (CNN) based approach whereas Guo and Li [ 3 ] proposed a long-short term memory (LSTM) recurrent neural networks approach.

The hyperparameter optimisation problem is about nding the optimal hyperparameter con guration that minimises the loss function of a learning algorithm over its hyperparameter space when trained on a training dataset and evaluated on the corresponding evaluation set. To evaluate the e ectiveness of BOHB [ 2 ], we compared the performance of the hyperparameters found with BOHB against manually tuned baselines. The baselines consists of a manually tuned CNN and a manually tuned LSTM using S&P 500 dataset, and Wen et al. [ 6 ], and Guo and Li's [ 3 ] results, where applicable.

K. Kouassi and D. Moodley

We used the the daily stock closing price of the S&P 500, JSE Limited (JSE), NASDAQ and Dow Jones 30 (DOW 30), with lag l = 10, and stride s = 1. We used 70% of the dataset for training, 15%, for validation and 15% for testing. Similar to Wen et al. [ 6 ], and Guo and Li [ 3 ], we used accuracy, and f1-score (f1) as generalisation metrics. We used HyperbandSter3 [ 2 ] with the number of epochs as budget where the minimum budget was 3 epochs and the maximum budget 30 epochs. We set to 3 and used 30 iterations. This is equivalent 80 full evaluations. Each experiment was run 10 times. 3

Results and Discussions

The performance on the 8 individual datasets are shown in table 1.

On the individual datasets, A-CNN improved on 7 metrics over M-CNN, under-performed on 9; whereas, A-LSTM out-performed M-LSTM on 4 metrics, and under-performed on 11. For NASDAQ, A-LSTM matched the accuracy of the manual LSTM. Compared to the results found in the literature by Wen. et al [ 6 ] and Guo and Li [ 3 ], our methods under-performed on accuracy but outperformed on f1-score. On average, A-CNN improved on M-CNN's and f1-score by 5.75% (54:96 23:06 to 58:12 19:28); but, decreased on accuracy by 0.81% (56:83 11:40 to 56:37 13:49 ). A-LSTM on the other hand, could not match M-LSTM on f1-score with a performance drop of 7.07% (68:56 11:95 versus 63:71 14:89). However, it beat M-LSTM on accuracy by 4.74% (53:40 16:24 versus 55:93 13:58). 4

Conclusions

We evaluated the potential for automatic construction of CNN and LSTM for stock market trend prediction in a resource constrained setting. Our results showed that BOHB can compete favourably with manual HO. 1 https:// nance.yahoo.com/ 2 https://www.investing.com/ 3 https://github.com/automl/HpBandSter

AutoNNs for stock trend prediction

1. Bergstra , J. , Bengio , Y. : Random search for hyper-parameter optimization . J. Mach. Learn. Res . 13 , 281 {305 (Feb 2012 ), http://dl.acm.org/citation.cfm?id= 2188385 . 2188395

2. Falkner , S. , Klein , A. , Hutter , F. : BOHB: Robust and E cient Hyperparameter Optimization at Scale ( 2018 ), http://arxiv.org/abs/ 1807 .01774

3. Guo , J. , Li , X.: Prediction of index trend based on lstm model for extracting image similarity feature . In: Proceedings of the 2019 International Conference on Arti cial Intelligence and Computer Science . pp. 335 { 340 . AICS 2019 , ACM, New York, NY, USA ( 2019 ). https://doi.org/10.1145/3349341.3349427, http://doi.acm. org/10 .1145/3349341.3349427

4. Li , L. , Jamieson , K. , DeSalvo , G., Rostamizadeh , A. , Talwalkar , A. , Berkeley, U.: HYPERBAND: BANDIT-BASED CONFIGURATION EVAL-UATION FOR HYPERPARAMETER OPTIMIZATION . In: International Conference on Learning Representations ( 2017 ), https://github.com/automl/RoBO.

5. Shahriari , B. , Swersky , K. , Wang , Z. , Adams , R.P. , de Freitas, N.: Taking the Human Out of the Loop: A Review of Bayesian Optimization . Proceedings of the IEEE 104 ( 1 ), 148 {175 (jan 2016 ). https://doi.org/10.1109/JPROC. 2015 .2494218

6. Wen , M. , Li , P. , Zhang , L. , Chen , Y. : Stock Market Trend Prediction Using High-Order Information of Time Series . IEEE Access 7 , 28299 { 28308 ( 2019 ). https://doi.org/10.1109/ACCESS. 2019 . 2901842 , https://ieeexplore.ieee.org/document/8653278/

7. Young , S.R. , Rose , D.C. , Karnowski , T.P. , Lim , S.H. , Patton , R.M.: Optimizing deep learning hyper-parameters through an evolutionary algorithm . In: Proceedings of the Workshop on Machine Learning in HighPerformance Computing Environments . pp. 4 : 1 { 4 : 5 . MLHPC '15, ACM , New York, NY, USA ( 2015 ). https://doi.org/10.1145/2834892.2834896, http://doi.acm. org/10 .1145/2834892.2834896