An exploration of alternative features in micro-finance loan default prediction models

dc.contributor.advisorBritz, Stefan
dc.contributor.authorStone, Devon
dc.date.accessioned2020-11-11T11:13:06Z
dc.date.available2020-11-11T11:13:06Z
dc.date.issued2020
dc.date.updated2020-11-11T11:12:36Z
dc.description.abstractDespite recent developments financial inclusion remains a large issue for the World's unbanked population. Financial institutions - both larger corporations and micro-finance companies - have begun to provide solutions for financial inclusion. The solutions are delivered using a combination of machine learning and alternative data. This minor dissertation focuses on investigating whether alternative features generated from Short Messaging Service (SMS) data and Android application data contained on borrowers' devices can be used to improve the performance of loan default prediction models. The improvement gained by using alternative features is measured by comparing loan default prediction models trained using only traditional credit scoring data to models developed using a combination of traditional and alternative features. Furthermore, the paper investigates which of 4 machine learning techniques is best suited for loan default prediction. The 4 techniques investigated are logistic regression, random forests, extreme gradient boosting, and neural networks. Finally the paper identifies whether or not accurate loan default prediction models can be trained using only the alternative features developed throughout this minor dissertation. The results of the research show that alternative features improve the performance of loan default prediction across 5 performance indicators, namely overall prediction accuracy, repaid prediction accuracy, default prediction accuracy, F1 score, and AUC. Furthermore, extreme gradient boosting is identified as the most appropriate technique for loan default prediction. Finally, the research identifies that models trained using the alternative features developed throughout this project can accurately predict loan that have been repaid, the models do not accurately predict loans that have not been repaid.
dc.identifier.apacitationStone, D. (2020). <i>An exploration of alternative features in micro-finance loan default prediction models</i>. (). ,Faculty of Science ,Department of Statistical Sciences. Retrieved from http://hdl.handle.net/11427/32377en_ZA
dc.identifier.chicagocitationStone, Devon. <i>"An exploration of alternative features in micro-finance loan default prediction models."</i> ., ,Faculty of Science ,Department of Statistical Sciences, 2020. http://hdl.handle.net/11427/32377en_ZA
dc.identifier.citationStone, D. 2020. An exploration of alternative features in micro-finance loan default prediction models. . ,Faculty of Science ,Department of Statistical Sciences. http://hdl.handle.net/11427/32377en_ZA
dc.identifier.ris TY - Master Thesis AU - Stone, Devon AB - Despite recent developments financial inclusion remains a large issue for the World's unbanked population. Financial institutions - both larger corporations and micro-finance companies - have begun to provide solutions for financial inclusion. The solutions are delivered using a combination of machine learning and alternative data. This minor dissertation focuses on investigating whether alternative features generated from Short Messaging Service (SMS) data and Android application data contained on borrowers' devices can be used to improve the performance of loan default prediction models. The improvement gained by using alternative features is measured by comparing loan default prediction models trained using only traditional credit scoring data to models developed using a combination of traditional and alternative features. Furthermore, the paper investigates which of 4 machine learning techniques is best suited for loan default prediction. The 4 techniques investigated are logistic regression, random forests, extreme gradient boosting, and neural networks. Finally the paper identifies whether or not accurate loan default prediction models can be trained using only the alternative features developed throughout this minor dissertation. The results of the research show that alternative features improve the performance of loan default prediction across 5 performance indicators, namely overall prediction accuracy, repaid prediction accuracy, default prediction accuracy, F1 score, and AUC. Furthermore, extreme gradient boosting is identified as the most appropriate technique for loan default prediction. Finally, the research identifies that models trained using the alternative features developed throughout this project can accurately predict loan that have been repaid, the models do not accurately predict loans that have not been repaid. DA - 2020 DB - OpenUCT DP - University of Cape Town KW - statistics LK - https://open.uct.ac.za PY - 2020 T1 - An exploration of alternative features in micro-finance loan default prediction models TI - An exploration of alternative features in micro-finance loan default prediction models UR - http://hdl.handle.net/11427/32377 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/32377
dc.identifier.vancouvercitationStone D. An exploration of alternative features in micro-finance loan default prediction models. []. ,Faculty of Science ,Department of Statistical Sciences, 2020 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/32377en_ZA
dc.language.rfc3066eng
dc.publisher.departmentDepartment of Statistical Sciences
dc.publisher.facultyFaculty of Science
dc.subjectstatistics
dc.titleAn exploration of alternative features in micro-finance loan default prediction models
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationlevelMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2020_stone devon.pdf
Size:
3.53 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections