Generating new data points using singular value decomposition

dc.contributor.advisorNyirenda, Juwa Chiza
dc.contributor.authorBiyana, Tlhologello
dc.date.accessioned2025-06-23T13:19:52Z
dc.date.available2025-06-23T13:19:52Z
dc.date.issued2025
dc.date.updated2025-06-23T13:17:18Z
dc.description.abstractThis study presents an innovative solution to the challenge of generating new data points for small data sets. It introduces a Single Value Decomposition (SVD)-based model that draws inspiration from the ability of SVD to estimate a lower rank matrix. This approach seeks to overcome the limitations imposed by sample size constraints by expanding available data. Motivated by challenges faced during algorithm development due to small data sets, the study proposes the SVD-based model, evaluates its efficacy in replicating original data attributes and compares model performance with new and original data. The method involves utilising SVD to generate new data, mimicking a predictive modelling formula by combining systematic and error components. The generated data set retains the distribution of the original data but introduces distinct error values, facilitating efficient data generation. Through graphical and quantitative assessments, including histograms, box plots, correlation analysis and reconstruction error evaluations, the effectiveness of the method is demonstrated. The study focuses on comparing SVD-generated data sets with original data across three data sets: Abalone, Life Expectancy and NBA. Findings indicate close approximation of distribution, correlation and model performance attributes between SVD-generated and original data sets. Improved similarity with increasing observation count enhances comparability and model performance of SVD-generated data. While minor deviations are noted in specific scenarios, the study underscores potential of SVD in generating new data points from the original data sets, making it a valuable tool for data augmentation and analysis across diverse data sets.
dc.identifier.apacitationBiyana, T. (2025). <i>Generating new data points using singular value decomposition</i>. (). University of Cape town ,Faculty of Science ,Department of Statistical Sciences. Retrieved from http://hdl.handle.net/11427/41476en_ZA
dc.identifier.chicagocitationBiyana, Tlhologello. <i>"Generating new data points using singular value decomposition."</i> ., University of Cape town ,Faculty of Science ,Department of Statistical Sciences, 2025. http://hdl.handle.net/11427/41476en_ZA
dc.identifier.citationBiyana, T. 2025. Generating new data points using singular value decomposition. . University of Cape town ,Faculty of Science ,Department of Statistical Sciences. http://hdl.handle.net/11427/41476en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Biyana, Tlhologello AB - This study presents an innovative solution to the challenge of generating new data points for small data sets. It introduces a Single Value Decomposition (SVD)-based model that draws inspiration from the ability of SVD to estimate a lower rank matrix. This approach seeks to overcome the limitations imposed by sample size constraints by expanding available data. Motivated by challenges faced during algorithm development due to small data sets, the study proposes the SVD-based model, evaluates its efficacy in replicating original data attributes and compares model performance with new and original data. The method involves utilising SVD to generate new data, mimicking a predictive modelling formula by combining systematic and error components. The generated data set retains the distribution of the original data but introduces distinct error values, facilitating efficient data generation. Through graphical and quantitative assessments, including histograms, box plots, correlation analysis and reconstruction error evaluations, the effectiveness of the method is demonstrated. The study focuses on comparing SVD-generated data sets with original data across three data sets: Abalone, Life Expectancy and NBA. Findings indicate close approximation of distribution, correlation and model performance attributes between SVD-generated and original data sets. Improved similarity with increasing observation count enhances comparability and model performance of SVD-generated data. While minor deviations are noted in specific scenarios, the study underscores potential of SVD in generating new data points from the original data sets, making it a valuable tool for data augmentation and analysis across diverse data sets. DA - 2025 DB - OpenUCT DP - University of Cape Town KW - Statistical Sciences LK - https://open.uct.ac.za PB - University of Cape town PY - 2025 T1 - Generating new data points using singular value decomposition TI - Generating new data points using singular value decomposition UR - http://hdl.handle.net/11427/41476 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/41476
dc.identifier.vancouvercitationBiyana T. Generating new data points using singular value decomposition. []. University of Cape town ,Faculty of Science ,Department of Statistical Sciences, 2025 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/41476en_ZA
dc.language.rfc3066Eng
dc.publisher.departmentDepartment of Statistical Sciences
dc.publisher.facultyFaculty of Science
dc.publisher.institutionUniversity of Cape town
dc.subjectStatistical Sciences
dc.titleGenerating new data points using singular value decomposition
dc.typeThesis / Dissertation
dc.type.qualificationlevelMasters
dc.type.qualificationlevelMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2025_biyana tlhologello.pdf
Size:
12.1 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.72 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections