Failure in Stock Price Prediction: A Comparison bettwen the Curve-Shape-Feature and Non-Curve-Shape-Feature Modes of Existing Machine Learning Algorithms

Authors

  • Ping Zhang
  • Jia-Yao Yang
  • Hao Zhu
  • Yue-Jie Hou
  • Yi Liu
  • Chi-Chun Zhou

DOI:

https://doi.org/10.15837/ijccc.2021.6.4549

Keywords:

stock price prediction, financial time series, machine learning method, deep learning method

Abstract

In the era of artificial intelligence, machine learning methods are successfully used in various fields. Machine learning has attracted extensive attention from investors in the financial market, especially in stock price prediction. However, one argument for the machine learning methods used in stock price prediction is that they are black-box models which are difficult to interpret. In this paper, we focus on the future stock price prediction with the historical stock price by machine learning and deep learning methods, such as support vector machine (SVM), random forest (RF), Bayesian classifier (BC), decision tree (DT), multilayer perceptron (MLP), convolutional neural network (CNN), bi-directional long-short term memory (BiLSTM), the embedded CNN, and the embedded BiLSTM. Firstly, we manually design several financial time series where the future price correlates with the historical stock prices in pre-designed modes, namely the curve-shape-feature (CSF) and the non-curve-shape-feature (NCSF) modes. In the CSF mode, the future prices can be extracted from the curve shapes of the historical stock prices. Conversely, in the NCSF mode, they can’t. Secondly, we apply various algorithms to those pre-designed and real financial time series. We find that the existing machine learning and deep learning algorithms fail in stock price prediction because in the real financial time series, less information of future prices is contained in the CSF mode, and perhaps more information is contained in the NCSF. Various machine learning and deep learning algorithms are good at handling the CSF in historical data, which are successfully applied in image recognition and natural language processing. However, they are inappropriate for stock price prediction on account of the NCSF. Therefore, accurate stock price prediction is the key to successful investment, and new machine learning algorithms handling the NCSF series are needed.

Author Biographies

Jia-Yao Yang

 

Hao Zhu

 

Yue-Jie Hou

 

Chi-Chun Zhou

 

References

[1] R. S. Tsay, Analysis of financial time series, vol. 543. John wiley & sons, 2005. https://doi.org/10.1002/0471746193

[2] T. C. Mills and R. N. Markellos, The econometric modelling of financial time series. Cambridge University Press, 2008. https://doi.org/10.1017/CBO9780511817380

[3] T. G. Andersen, R. A. Davis, J.-P. KreiíŸ, and T. V. Mikosch, Handbook of financial time series. Springer Science & Business Media, 2009.

[4] F. E. Tay and L. Cao, Application of support vector machines in financial time series forecasting, omega 29 (2001), no. 4 309-317. https://doi.org/10.1016/S0305-0483(01)00026-3

[5] B. Krollner, B. J. Vanstone, and G. R. Finnie, Financial time series forecasting with machine learning techniques: a survey., in ESANN, 2010.

[6] Q. A. Al-Radaideh, A. A. Assaf, and E. Alnagi, Predicting stock prices using data mining techniques, in The International Arab Conference on Information Technology (ACIT'2013), 2013.

[7] R. Adhikari and R. Agrawal, A combination of artificial neural network and random walk models for financial time series forecasting, Neural Computing and Applications 24 (2014), no. 6 1441- 1449. https://doi.org/10.1007/s00521-013-1386-y

[8] O. B. Sezer, M. U. Gudelek, and A. M. Ozbayoglu, Financial time series forecasting with deep learning: A systematic literature review: 2005-2019, Applied Soft Computing 90 (2020) 106181. https://doi.org/10.1016/j.asoc.2020.106181

[9] S. Basu, Investment performance of common stocks in relation to their price-earnings ratios: A test of the efficient market hypothesis, The journal of Finance 32 (1977), no. 3 663-682. https://doi.org/10.1111/j.1540-6261.1977.tb01979.x

[10] B. G. Malkiel, Efficient market hypothesis, in Finance, pp. 127-134. Springer, 1989. https://doi.org/10.1007/978-1-349-20213-3_13

[11] B. G. Malkiel, The efficient market hypothesis and its critics, Journal of economic perspectives 17 (2003), no. 1 59-82. https://doi.org/10.1257/089533003321164958

[12] A. Degutis and L. Novickyt˙e, The efficient market hypothesis: A critical review of literature and methodology, Ekonomika 93 (2014) 7-23. https://doi.org/10.15388/Ekon.2014.2.3549

[13] K. Hamid, M. T. Suleman, S. Z. Ali Shah, and R. S. Imdad Akash, Testing the weak form of efficient market hypothesis: Empirical evidence from asia-pacific markets, Available at SSRN 2912908 (2017). https://doi.org/10.2139/ssrn.2912908

[14] L. Blume, D. Easley, and M. O'hara, Market statistics and technical analysis: The role of volume, The journal of finance 49 (1994), no. 1 153-181. https://doi.org/10.1111/j.1540-6261.1994.tb04424.x

[15] J. J. Murphy, Technical analysis of the financial markets: A comprehensive guide to trading methods and applications. Penguin, 1999.

[16] W.-K. Wong, M. Manzur, and B.-K. Chew, How rewarding is technical analysis? evidence from singapore stock market, Applied Financial Economics 13 (2003), no. 7 543-551. https://doi.org/10.1080/0960310022000020906

[17] C.-H. Park and S. H. Irwin, What do we know about the profitability of technical analysis?, Journal of Economic surveys 21 (2007), no. 4 786-826. https://doi.org/10.1111/j.1467-6419.2007.00519.x

[18] C. D. Kirkpatrick II and J. A. Dahlquist, Technical analysis: the complete resource for financial market technicians. FT press, 2010.

[19] R. Forsythe, T. R. Palfrey, and C. R. Plott, Asset valuation in an experimental market, Econometrica: Journal of the Econometric Society (1982) 537-567. https://doi.org/10.2307/1912600

[20] H. PH and A. Rishad, An empirical examination of investor sentiment and stock market volatility: evidence from india, Financial Innovation 6 (2020), no. 1 1-15. https://doi.org/10.1186/s40854-020-00198-x

[21] A. F. Darrat, B. Li, and R. Chung, Seasonal anomalies: A closer look at the johannesburg stock exchange, Contemporary Management Research 9 (2013), no. 2. https://doi.org/10.7903/cmr.10629

[22] R. Wuthisatian, An examination of calendar anomalies: evidence from the thai stock market, Journal of Economic Studies (2021). https://doi.org/10.1108/JES-06-2020-0298

[23] M. Chiah and A. Zhong, Tuesday blues and the day-of-the-week effect in stock returns, Journal of Banking & Finance 133 (2021) 106243. https://doi.org/10.1016/j.jbankfin.2021.106243

[24] J. Conrad, S. Wahal, and J. Xiang, High-frequency quoting, trading, and the efficiency of prices, Journal of Financial Economics 116 (2015), no. 2 271-291. https://doi.org/10.1016/j.jfineco.2015.02.008

[25] P. W. Santosa and P. W. Santoso, Does exchange rate volatility cause overreaction in the capital market? evidence from indonesia, International Journal of Finance and Accounting 8 (2019), no. 3 80-87.

[26] R. Amelia and A. Wijayanto, Winner loser anomaly in indonesia, Management Analysis Journal 7 (2018), no. 2 139-151.

[27] R. Ball, J. Gerakos, T. Linnainmaa, and V. Nikolaev, Book-to-market, retained earnings, and earnings in the cross-section of stock returns, Journal of Finance Economics 1 (2019), no. 1 1-24.

[28] M. Dash, S. Kantheti, G. K. Teja, et al., The book-to-market anomaly for banking stocks in the indian stock market: A panel regression analysis, Journal of Applied Management and Investments 7 (2018), no. 1 15-23.

[29] X. Zhao, G. Ran, B. Shen, and X. Li, Do etfs improve the pricing efficiency of the a-share market-examining etf holdings of individual stocks, Applied Economics (2021) 1-14. https://doi.org/10.1080/00036846.2021.1897075

[30] Y. Cui, B. Gebka, and V. Kallinterakis, Do closed-end fund investors herd?, Journal of Banking & Finance 105 (2019) 194-206. https://doi.org/10.1016/j.jbankfin.2019.05.015

[31] N. Rezaei and Z. Elmi, Behavioral finance models and behavioral biases in stock price forecasting, Advances in mathematical finance and applications 3 (2018), no. 4 67-82.

[32] J. A. Scheinkman and B. LeBaron, Nonlinear dynamics and stock returns, Journal of business (1989) 311-337. https://doi.org/10.1086/296465

[33] E. E. Peters, Fractal market analysis: applying chaos theory to investment and economics, vol. 24. John Wiley & Sons, 1994.

[34] E. E. Peters, Chaos and order in the capital markets: a new view of cycles, prices, and market volatility. John Wiley & Sons, 1996.

[35] N. Laskin, Fractional market dynamics, Physica A: Statistical Mechanics and its Applications 287 (2000), no. 3-4 482-492. https://doi.org/10.1016/S0378-4371(00)00387-3

[36] P. Rostan and A. Rostan, The versatility of spectrum analysis for forecasting financial time series, Journal of Forecasting 37 (2018), no. 3 327-339. https://doi.org/10.1002/for.2504

[37] K. Kawagoe and T. Ueda, A similarity search method of time series data with combination of fourier and wavelet transforms, in Proceedings Ninth International Symposium on Temporal Representation and Reasoning, pp. 86-92, IEEE, 2002.

[38] U. Cherubini, G. Della Lunga, S. Mulinacci, and P. Rossi, Fourier transform methods in finance, vol. 524. John Wiley & Sons, 2010.

[39] H. Zhang, T. B. Ho, Y. Zhang, and M.-S. Lin, Unsupervised feature extraction for time series clustering using orthogonal wavelet transform, Informatica 30 (2006), no. 3.

[40] P. S. Addison, The illustrated wavelet transform handbook: introductory theory and applications in science, engineering, medicine and finance. CRC press, 2017.

[41] L. Bauwens, S. Laurent, and J. V. Rombouts, Multivariate garch models: a survey, Journal of applied econometrics 21 (2006), no. 1 79-109. https://doi.org/10.1002/jae.842

[42] S. Ling and M. McAleer, Asymptotic theory for a vector arma-garch model, Econometric theory

(2003) 280-310.

[43] C. Francq, J.-M. Zakoian, et al., Maximum likelihood estimation of pure garch and arma-garch processes, Bernoulli 10 (2004), no. 4 605-637. https://doi.org/10.3150/bj/1093265632

[44] H. Liu, E. Erdem, and J. Shi, Comprehensive evaluation of arma-garch (-m) approaches for modeling the mean and volatility of wind speed, Applied Energy 88 (2011), no. 3 724-732. https://doi.org/10.1016/j.apenergy.2010.09.028

[45] D. J. Hand, Principles of data mining, Drug safety 30 (2007), no. 7 621-622. https://doi.org/10.2165/00002018-200730070-00010

[46] K.-j. Kim, Financial time series forecasting using support vector machines, Neurocomputing 55 https://doi.org/10.1016/S0925-2312(03)00372-2

(2003), no. 1-2 307-319.

[47] R. Madan and P. S. Mangipudi, Predicting computer network traffic: a time series forecasting approach using dwt, arima and rnn, in 2018 Eleventh International Conference on Contemporary Computing (IC3), pp. 1-5, IEEE, 2018. https://doi.org/10.1109/IC3.2018.8530608

[48] Y. Wu and J. Gao, Adaboost-based long short-term memory ensemble learning approach for financial time series forecasting., Current Science (00113891) 115 (2018), no. 1. https://doi.org/10.18520/cs/v115/i1/159-165

[49] J. Cao, Z. Li, and J. Li, Financial time series forecasting model based on ceemdan and lstm, Physica A: Statistical Mechanics and its Applications 519 (2019) 127-139. https://doi.org/10.1016/j.physa.2018.11.061

[50] I. E. Livieris, E. Pintelas, and P. Pintelas, A cnn-lstm model for gold price time-series forecasting, Neural computing and applications 32 (2020), no. 23 17351-17360. https://doi.org/10.1007/s00521-020-04867-x

[51] S. Mehtab, J. Sen, and S. Dasgupta, Analysis and forecasting of financial time series using cnn and lstm-based deep learning models, arXiv preprint arXiv:2011.08011 (2020). https://doi.org/10.1109/DASA51403.2020.9317207

[52] L. Guang, W. Xiaojie, and L. Ruifan, Multi-scale rcnn model for financial time-series classification, arXiv (2019).

[53] C. N. Babu and B. E. Reddy, A moving-average filter based hybrid arima-ann model for forecasting time series data, Applied Soft Computing 23 (2014), no. Complete 27-38. https://doi.org/10.1016/j.asoc.2014.05.028

[54] Z. Chengzhao, A Deep Learning Model for Financial Market Forecasting: FEPA Model. PhD thesis, Chengdu: School of Management and Economics of UESTC, 2016.

[55] G. Benrhmach, K. Namir, A. Namir, and J. Bouyaghroumni, Nonlinear autoregressive neural network and extended kalman filters for prediction of financial time series, Journal of Applied Mathematics 2020 (2020). https://doi.org/10.1155/2020/5057801

[56] E. Fons, P. Dawson, X. J. Zeng, J. Keane, and A. Iosifidis, Evaluating data augmentation for financial time series classification, Papers (2020).

[57] A. Dingli and K. S. Fournier, Financial time series forecasting-a deep learning approach, International Journal of Machine Learning and Computing 7 (2017), no. 5 118-122. https://doi.org/10.18178/ijmlc.2017.7.5.632

[58] J. Singh and P. Tripathi, Time series forecasting using back propagation neural network with ade algorithm, International Journal of Engineering and Technical Research 7 (2017), no. 5.

[59] D. Pradeepkumar and V. Ravi, Forecasting financial time series volatility using particle swarm optimization trained quantile regression neural network, Applied Soft Computing 58 (2017) 35-52. https://doi.org/10.1016/j.asoc.2017.04.014

[60] M. Nikou, G. Mansourfar, and J. Bagherzadeh, Stock price prediction using deep learning algorithm and its comparison with machine learning algorithms, Intelligent Systems in Accounting, Finance and Management 26 (2019), no. 4 164-174. https://doi.org/10.1002/isaf.1459

[61] B. T. Kelly, S. Pruitt, and Y. Su, Characteristics are covariances: A unified model of risk and return, Journal of Financial Economics 134 (2019), no. 3 501-524. https://doi.org/10.1016/j.jfineco.2019.05.001

[62] M. F. Dixon and I. Halperin, The four horsemen of machine learning in finance, Available at SSRN 3453564 (2019). https://doi.org/10.2139/ssrn.3453564

[63] G. Feng, S. Giglio, and D. Xiu, Taming the factor zoo: A test of new factors, The Journal of Finance 75 (2020), no. 3 1327-1370. https://doi.org/10.1111/jofi.12883

[64] S. Gu, B. Kelly, and D. Xiu, Empirical asset pricing via machine learning, The Review of Financial Studies 33 (2020), no. 5 2223-2273. https://doi.org/10.1093/rfs/hhaa009

[65] M. Q. Islam and M. L. Tiku, Multiple linear regression model under nonnormality, Communications in Statistics-Theory and Methods 33 (2005), no. 10 2443-2467. https://doi.org/10.1081/STA-200031519

[66] N. Udomsak, How do the naive bayes classifier and the support vector machine compare in their ability to forecast the stock exchange of thailand?, arXiv preprint arXiv:1511.08987 (2015).

[67] S. Lahmiri, A comparison of pnn and svm for stock market trend prediction using economic and technical information, International Journal of Computer Applications 29 no. 3 24-30.

[68] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, et al., Scikit-learn: Machine learning in python, the Journal of machine Learning research 12 (2011) 2825-2830.

[69] L. Khaidem, S. Saha, and S. R. Dey, Predicting the direction of stock market prices using random forest, arXiv preprint arXiv:1605.00003 (2016).

[70] S. B. Kotsiantis, Decision trees: a recent overview, Artificial Intelligence Review 39 (2013), no. 4 261-283. https://doi.org/10.1007/s10462-011-9272-4

[71] A. Pinkus, Approximation theory of the mlp model in neural networks, Acta numerica 8 (1999) 143-195. https://doi.org/10.1017/S0962492900002919

[72] K. M. Leung, Naive bayesian classifier, Polytechnic University Department of Computer Science/ Finance and Risk Engineering 2007 (2007) 123-156.

[73] T. Kim and H. Kim, Forecasting stock prices with a feature fusion lstm-cnn model using different representations of the same data, PLoS ONE 14 (2019), no. 2 e0212320. https://doi.org/10.1371/journal.pone.0212320

[74] M. Cho, J. Ha, C. Park, and S. Park, Combinatorial feature embedding based on cnn and lstm for biomedical named entity recognition, Journal of Biomedical Informatics 103 (2020) 103381. https://doi.org/10.1016/j.jbi.2020.103381

[75] D. Wang, P. Cui, and W. Zhu, Structural deep network embedding, in Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1225-1234, 2016. https://doi.org/10.1145/2939672.2939753

[76] M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, et al., Tensorflow: A system for large-scale machine learning, in 12th {USENIX} symposium on operating systems design and implementation ({OSDI} 16), pp. 265-283, 2016.

[77] H. Wang, J. Wang, L. Cao, Y. Li, Q. Sun, and J. Wang, A stock closing price prediction model based on cnn-bislstm, Complexity 2021 (2021). https://doi.org/10.1155/2021/5360828

Additional Files

Published

2021-12-03

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.