Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans

dc.contributor.advisorRajaratnam, Kanshukan
dc.contributor.authorNaicker, Keeland
dc.date.accessioned2022-03-10T10:08:54Z
dc.date.available2022-03-10T10:08:54Z
dc.date.issued2021
dc.date.updated2022-03-08T09:40:26Z
dc.description.abstractThis dissertation highlights the performance comparison between two popular contemporary consumer loan credit scoring techniques, namely logistic regression and classification trees. Literature has shown logistic regression to perform better than classification trees in terms of predictiveness and robustness when forecasting consumer loan default events over standard twelve-month outcome periods. One of the major shortcomings with classification trees is its tendency to overfit data eroding its robustness, making it vulnerable to underlying population characteristic shifts. Classification trees remains a popular technique due to its ease of application (algorithm machine learning basis) and model interpretation. Past research has found classification trees to perform marginally better than logistic regression with respect to predictiveness and robustness when modelling short term consumer credit default outcomes related to previously unseen new customer credit loan applications. This dissertation independently tested this finding on reloan consumer loan data, repeat customers who renewed loan facilities at a significant South African micro lender. This dissertation tests the finding if the classification tree technique would outperform logistic regression when modelling this new type of loan data. Credit scoring models were built and tested for each respective technique across identical data sets with the intent to eliminate bias. Robustness tests were constructed via careful iterative data splits. Performance tests measuring predictiveness and robustness were conducted via the weighted sums of squared error evaluation approach. Results reveal logistic regression to outperform classification trees on predictiveness and robustness across the designed uniform iterative data splits, which suggests that logistic regression remains the superior technique when modelling short term credit default outcomes on reloan consumer loan data.
dc.identifier.apacitationNaicker, K. (2021). <i>Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans</i>. (). ,Faculty of Commerce ,Department of Finance and Tax. Retrieved from http://hdl.handle.net/11427/36027en_ZA
dc.identifier.chicagocitationNaicker, Keeland. <i>"Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans."</i> ., ,Faculty of Commerce ,Department of Finance and Tax, 2021. http://hdl.handle.net/11427/36027en_ZA
dc.identifier.citationNaicker, K. 2021. Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans. . ,Faculty of Commerce ,Department of Finance and Tax. http://hdl.handle.net/11427/36027en_ZA
dc.identifier.ris TY - Master Thesis AU - Naicker, Keeland AB - This dissertation highlights the performance comparison between two popular contemporary consumer loan credit scoring techniques, namely logistic regression and classification trees. Literature has shown logistic regression to perform better than classification trees in terms of predictiveness and robustness when forecasting consumer loan default events over standard twelve-month outcome periods. One of the major shortcomings with classification trees is its tendency to overfit data eroding its robustness, making it vulnerable to underlying population characteristic shifts. Classification trees remains a popular technique due to its ease of application (algorithm machine learning basis) and model interpretation. Past research has found classification trees to perform marginally better than logistic regression with respect to predictiveness and robustness when modelling short term consumer credit default outcomes related to previously unseen new customer credit loan applications. This dissertation independently tested this finding on reloan consumer loan data, repeat customers who renewed loan facilities at a significant South African micro lender. This dissertation tests the finding if the classification tree technique would outperform logistic regression when modelling this new type of loan data. Credit scoring models were built and tested for each respective technique across identical data sets with the intent to eliminate bias. Robustness tests were constructed via careful iterative data splits. Performance tests measuring predictiveness and robustness were conducted via the weighted sums of squared error evaluation approach. Results reveal logistic regression to outperform classification trees on predictiveness and robustness across the designed uniform iterative data splits, which suggests that logistic regression remains the superior technique when modelling short term credit default outcomes on reloan consumer loan data. DA - 2021_ DB - OpenUCT DP - University of Cape Town KW - Corporate Finance and Valuations LK - https://open.uct.ac.za PY - 2021 T1 - Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans TI - Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans UR - http://hdl.handle.net/11427/36027 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/36027
dc.identifier.vancouvercitationNaicker K. Comparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans. []. ,Faculty of Commerce ,Department of Finance and Tax, 2021 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/36027en_ZA
dc.language.rfc3066eng
dc.publisher.departmentDepartment of Finance and Tax
dc.publisher.facultyFaculty of Commerce
dc.subjectCorporate Finance and Valuations
dc.titleComparison of Logistic Regression and Classification Trees to Forecast Short Term Defaults on Repeat Consumer Loans
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationlevelMCom
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_com_2021_naicker keeland.pdf
Size:
2.53 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections