An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE

dc.contributor.advisorMoodley, Deshendran
dc.contributor.authorBoakes, Jamie
dc.date.accessioned2024-12-02T10:46:34Z
dc.date.available2024-12-02T10:46:34Z
dc.date.issued2024
dc.date.updated2024-11-28T10:53:10Z
dc.description.abstractThe prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE.
dc.identifier.apacitationBoakes, J. (2024). <i>An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE</i>. (). ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/40761en_ZA
dc.identifier.chicagocitationBoakes, Jamie. <i>"An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE."</i> ., ,Faculty of Science ,Department of Computer Science, 2024. http://hdl.handle.net/11427/40761en_ZA
dc.identifier.citationBoakes, J. 2024. An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE. . ,Faculty of Science ,Department of Computer Science. http://hdl.handle.net/11427/40761en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Boakes, Jamie AB - The prediction of long-term share returns is an essential yet complex task in financial analysis and formulating investment strategy. Machine learning is a promising approach for improving the accuracy of these predictions. However, the outputs of machine learning models are not transparent or interpretable, which limits their usability for real-world decision making. There is a lack of research on the use of machine learning algorithms to predict long-term share returns on the Johannesburg Stock Exchange (JSE), with no studies that specifically examine the interpretability of machine learning algorithms. This study investigates the use of machine learning algorithms to predict long-term share returns on the JSE based on fundamental data and analyses the interpretability of the top performing algorithms. Based on a review of the literature, eight machine learning classification algorithms were selected and compared to predict tercile class 12-month share returns using fundamental data, spanning a period of two decades. The XGBoost, Random Forest, and GradBoost algorithms were found to outperform the Support Vector Classifier, Logistic Regression, Decision Tree, Artificial Neural Network, and AdaBoost algorithms. XGBoost and Random Forest were further investigated using SHAP (SHapley Additive exPlanations) global summary plots to identify the most influential input features and to analyse the interpretability of these algorithms. The study found that ensemble-based classification algorithms, i.e. XGBoost, Random Forest and GradBoost, outperformed the other algorithms. Further analysis of the results varied, with some sectors outperforming the overall market. An analysis of the input features identified the most important valuation and profitability ratios that contributed to prediction performance, and thus improves the transparency and interpretability of the models. This research is the first to comprehensively compare and analyse the interpretability of machine learning algorithms to predict long-term share returns on the JSE. CY - University of Cape Town DA - 2024 DB - OpenUCT DP - University of Cape Town KW - Information Technology LK - https://open.uct.ac.za PP - University of Cape Town PY - 2024 T1 - An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE TI - An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE UR - http://hdl.handle.net/11427/40761 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/40761
dc.identifier.vancouvercitationBoakes J. An analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE. []. ,Faculty of Science ,Department of Computer Science, 2024 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/40761en_ZA
dc.language.rfc3066eng
dc.publisher.departmentDepartment of Computer Science
dc.publisher.facultyFaculty of Science
dc.publisher.locationUniversity of Cape Town
dc.subjectInformation Technology
dc.titleAn analysis of the performance and interpretability of machine learning classification algorithms to predict long-term share returns on the JSE
dc.typeThesis / Dissertation
dc.type.qualificationlevelMasters
dc.type.qualificationlevelMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2024_boakes jamie.pdf
Size:
5.01 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.72 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections