Selecting the best model for predicting a term deposit product take-up in banking

dc.contributor.advisorRajaratnam, Kanshukan
dc.contributor.advisorHuang, Chun-Kai
dc.contributor.authorHlongwane, Rivalani Willie
dc.date.accessioned2019-02-22T12:07:13Z
dc.date.available2019-02-22T12:07:13Z
dc.date.issued2018
dc.date.updated2019-02-19T06:40:45Z
dc.description.abstractIn this study, we use data mining techniques to build predictive models on data collected by a Portuguese bank through a term savings product campaign conducted between May 2008 and November 2010. This data is imbalanced, given an observed take-up rate of 11.27%. Ling et al. (1998) indicated that predictive models built on imbalanced data tend to yield low sensitivity and high specificity, an indication of low true positive and high true negative rates. Our study confirms this finding. We, therefore, use three sampling techniques, namely, under-sampling, oversampling and Synthetic Minority Over-sampling Technique, to balance the data, this results in three additional datasets to use for modelling. We build the following predictive models: random forest, multivariate adaptive regression splines, neural network and support vector machine on the datasets and we compare the models against each other for their ability to identify customers that are likely to take-up a term savings product. As part of the model building process, we investigate parameter permutations related to each modelling technique to tune the models, we find that this assists in building robust models. We assess our models for predictive performance through the use of the receiver operating characteristic curve, confusion matrix, GINI, kappa, sensitivity, specificity, and lift and gains charts. A multivariate adaptive regression splines model built on over-sampled data is found to be the best model for predicting term savings product takeup.
dc.identifier.apacitationHlongwane, R. W. (2018). <i>Selecting the best model for predicting a term deposit product take-up in banking</i>. (). University of Cape Town ,Faculty of Science ,Department of Statistical Sciences. Retrieved from http://hdl.handle.net/11427/29789en_ZA
dc.identifier.chicagocitationHlongwane, Rivalani Willie. <i>"Selecting the best model for predicting a term deposit product take-up in banking."</i> ., University of Cape Town ,Faculty of Science ,Department of Statistical Sciences, 2018. http://hdl.handle.net/11427/29789en_ZA
dc.identifier.citationHlongwane, R. 2018. Selecting the best model for predicting a term deposit product take-up in banking. University of Cape Town.en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Hlongwane, Rivalani Willie AB - In this study, we use data mining techniques to build predictive models on data collected by a Portuguese bank through a term savings product campaign conducted between May 2008 and November 2010. This data is imbalanced, given an observed take-up rate of 11.27%. Ling et al. (1998) indicated that predictive models built on imbalanced data tend to yield low sensitivity and high specificity, an indication of low true positive and high true negative rates. Our study confirms this finding. We, therefore, use three sampling techniques, namely, under-sampling, oversampling and Synthetic Minority Over-sampling Technique, to balance the data, this results in three additional datasets to use for modelling. We build the following predictive models: random forest, multivariate adaptive regression splines, neural network and support vector machine on the datasets and we compare the models against each other for their ability to identify customers that are likely to take-up a term savings product. As part of the model building process, we investigate parameter permutations related to each modelling technique to tune the models, we find that this assists in building robust models. We assess our models for predictive performance through the use of the receiver operating characteristic curve, confusion matrix, GINI, kappa, sensitivity, specificity, and lift and gains charts. A multivariate adaptive regression splines model built on over-sampled data is found to be the best model for predicting term savings product takeup. DA - 2018 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2018 T1 - Selecting the best model for predicting a term deposit product take-up in banking TI - Selecting the best model for predicting a term deposit product take-up in banking UR - http://hdl.handle.net/11427/29789 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/29789
dc.identifier.vancouvercitationHlongwane RW. Selecting the best model for predicting a term deposit product take-up in banking. []. University of Cape Town ,Faculty of Science ,Department of Statistical Sciences, 2018 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/29789en_ZA
dc.language.isoeng
dc.publisher.departmentDepartment of Statistical Sciences
dc.publisher.facultyFaculty of Science
dc.publisher.institutionUniversity of Cape Town
dc.subject.otherStatistical Science
dc.subject.otherdata mining
dc.subject.otherfinancial predictive models
dc.titleSelecting the best model for predicting a term deposit product take-up in banking
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationnameMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2018_hlongwane_rivalani_willie.pdf
Size:
1.85 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections