An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
| dc.contributor.advisor | Moodley, Deshendran | |
| dc.contributor.author | Boakes, Jamie | |
| dc.date.accessioned | 2024-12-02T10:46:34Z | |
| dc.date.available | 2024-12-02T10:46:34Z | |
| dc.date.issued | 2024 | |
| dc.date.updated | 2024-11-28T10:53:10Z | |
| dc.description.abstract | The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. | |
| dc.identifier.apacitation | Boakes, J. (2024). <i>An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE</i>. (). ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/40761 | en_ZA |
| dc.identifier.chicagocitation | Boakes, Jamie. <i>"An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE."</i> ., ,Faculty of Science ,Department of Computer Science, 2024. http://hdl.handle.net/11427/40761 | en_ZA |
| dc.identifier.citation | Boakes, J. 2024. An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE. . ,Faculty of Science ,Department of Computer Science. http://hdl.handle.net/11427/40761 | en_ZA |
| dc.identifier.ris | TY - Thesis / Dissertation AU - Boakes, Jamie AB - The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. CY - University of Cape Town DA - 2024 DB - OpenUCT DP - University of Cape Town KW - Information Technology LK - https://open.uct.ac.za PP - University of Cape Town PY - 2024 T1 - An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE TI - An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE UR - http://hdl.handle.net/11427/40761 ER - | en_ZA |
| dc.identifier.uri | http://hdl.handle.net/11427/40761 | |
| dc.identifier.vancouvercitation | Boakes J. An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE. []. ,Faculty of Science ,Department of Computer Science, 2024 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/40761 | en_ZA |
| dc.language.rfc3066 | eng | |
| dc.publisher.department | Department of Computer Science | |
| dc.publisher.faculty | Faculty of Science | |
| dc.publisher.location | University of Cape Town | |
| dc.subject | Information Technology | |
| dc.title | An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE | |
| dc.type | Thesis / Dissertation | |
| dc.type.qualificationlevel | Masters | |
| dc.type.qualificationlevel | MSc |