Generating new data points using singular value decomposition
| dc.contributor.advisor | Nyirenda, Juwa Chiza | |
| dc.contributor.author | Biyana, Tlhologello | |
| dc.date.accessioned | 2025-06-23T13:19:52Z | |
| dc.date.available | 2025-06-23T13:19:52Z | |
| dc.date.issued | 2025 | |
| dc.date.updated | 2025-06-23T13:17:18Z | |
| dc.description.abstract | This study presents an innovative solution to the challenge of generating new data points for small data sets. It introduces a Singular Value Decomposition (SVD)-based model that draws inspiration from the ability of SVD to estimate a lower-rank matrix. This approach seeks to overcome the limitations imposed by sample size constraints by expanding the available data. Motivated by challenges faced during algorithm development due to small data sets, the study proposes the SVD-based model, evaluates its efficacy in replicating the attributes of the original data and compares model performance on new and original data. The method uses SVD to generate new data, mimicking a predictive modelling formula by combining systematic and error components. The generated data set retains the distribution of the original data but introduces distinct error values, facilitating efficient data generation. The effectiveness of the method is demonstrated through graphical and quantitative assessments, including histograms, box plots, correlation analysis and reconstruction error evaluations. The study compares SVD-generated data sets with original data across three data sets: Abalone, Life Expectancy and NBA. Findings indicate that the SVD-generated data sets closely approximate the originals in distribution, correlation and model performance. Similarity improves as the number of observations increases, which enhances the comparability and model performance of SVD-generated data. While minor deviations are noted in specific scenarios, the study underscores the potential of SVD for generating new data points from original data sets, making it a valuable tool for data augmentation and analysis across diverse data sets. | |
| dc.identifier.apacitation | Biyana, T. (2025). <i>Generating new data points using singular value decomposition</i>. (). University of Cape Town, Faculty of Science, Department of Statistical Sciences. Retrieved from http://hdl.handle.net/11427/41476 | en_ZA |
| dc.identifier.chicagocitation | Biyana, Tlhologello. <i>"Generating new data points using singular value decomposition."</i> ., University of Cape Town, Faculty of Science, Department of Statistical Sciences, 2025. http://hdl.handle.net/11427/41476 | en_ZA |
| dc.identifier.citation | Biyana, T. 2025. Generating new data points using singular value decomposition. . University of Cape Town, Faculty of Science, Department of Statistical Sciences. http://hdl.handle.net/11427/41476 | en_ZA |
| dc.identifier.ris | TY - Thesis / Dissertation AU - Biyana, Tlhologello DA - 2025 DB - OpenUCT DP - University of Cape Town KW - Statistical Sciences LK - https://open.uct.ac.za PB - University of Cape Town PY - 2025 T1 - Generating new data points using singular value decomposition TI - Generating new data points using singular value decomposition UR - http://hdl.handle.net/11427/41476 ER - | en_ZA |
| dc.identifier.uri | http://hdl.handle.net/11427/41476 | |
| dc.identifier.vancouvercitation | Biyana T. Generating new data points using singular value decomposition. []. University of Cape Town, Faculty of Science, Department of Statistical Sciences, 2025 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/41476 | en_ZA |
| dc.language.rfc3066 | Eng | |
| dc.publisher.department | Department of Statistical Sciences | |
| dc.publisher.faculty | Faculty of Science | |
| dc.publisher.institution | University of Cape Town | |
| dc.subject | Statistical Sciences | |
| dc.title | Generating new data points using singular value decomposition | |
| dc.type | Thesis / Dissertation | |
| dc.type.qualificationlevel | Masters | |
| dc.type.qualificationlevel | MSc | |
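
The abstract describes generating new data by combining a systematic component with an error component obtained through SVD, but this record does not give the exact procedure. The sketch below is one possible reading, not the thesis's method: it assumes the systematic part is a rank-`rank` SVD reconstruction of the centred data and that new error values come from shuffling the residual rows. The function name `svd_generate` and its parameters are hypothetical.

```python
import numpy as np


def svd_generate(X, rank, rng):
    """Illustrative sketch only (assumed procedure, not taken from the thesis).

    New data = column means + low-rank SVD reconstruction + shuffled residuals.
    """
    mean = X.mean(axis=0)
    Xc = X - mean                                   # centre each column
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    # Systematic component: best rank-`rank` approximation of the centred data.
    systematic = (U[:, :rank] * s[:rank]) @ Vt[:rank]
    # Error component: what the low-rank approximation leaves out.
    residual = Xc - systematic
    # Assumption: reassign residual rows at random so each generated point
    # pairs its systematic part with a different error vector.
    shuffled = residual[rng.permutation(X.shape[0])]
    return mean + systematic + shuffled


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 4))                    # stand-in for a small data set
    X_new = svd_generate(X, rank=2, rng=rng)
    # Compare correlation structure of original and generated data.
    corr_gap = np.abs(np.corrcoef(X, rowvar=False) - np.corrcoef(X_new, rowvar=False))
    print("largest absolute correlation difference:", corr_gap.max())
```

Under these assumptions the generated columns keep the original column means exactly, because both the low-rank reconstruction and the residuals of centred data have zero column means. How closely the correlations match depends on the chosen rank and the data.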