Exploring the application of Word2Vec to basket transaction data in the grocery retail industry

dc.contributor.advisorNyirenda, Juwa Chiza
dc.contributor.authorDe Swardt, Gideon Jacobus
dc.date.accessioned2022-05-30T10:33:38Z
dc.date.available2022-05-30T10:33:38Z
dc.date.issued2022
dc.date.updated2022-05-30T10:29:42Z
dc.description.abstractIn this thesis, we explore the application of Word2vec to basket transaction data provided by a large grocery retailer in South Africa. Word2vec is an algorithm based on representation learning. The objective of the exploration is to establish whether the application of Word2vec to basket transaction data would generate product embeddings that represent a useful relationship between products. Furthermore, we compareWord2vec's outputs and performance to traditional methods for studying product relationships which include Association Rules Mining (ARM) and Recommendation Systems. The results from the experiments showed that indeed product embeddings created by Word2vec on transaction data are meaningful and useful. It was clear that the idea of using transactions in the place of sentences to the neural network, provides analogous results to that of a natural language task. Word2vec clearly demonstrated its ability to cluster products that are homogeneous or fulfill similar needs. Furthermore this sort of product relationship was not provided by any other traditional methods, which was clear when comparing the outputs to that of ARM and Recommendation Systems. We also show that usingWord2vec could potentially provide insight on truly complementary products that ARM perhaps fails to do. Word2vec also proved to be incredibly scalable, taking input data of 20 times the size of what traditional methods could handle on a local computer. We end with a description of a potential application of the ideas learnt during the course of this study, with a real business problem, that we believe could lead to an enhanced customer shopping experience and in turn increase revenue and profits for the retailer.
dc.identifier.apacitationDe Swardt, G. J. (2022). <i>Exploring the application of Word2Vec to basket transaction data in the grocery retail industry</i>. (). ,Faculty of Science ,Department of Statistical Sciences. Retrieved from http://hdl.handle.net/11427/36434en_ZA
dc.identifier.chicagocitationDe Swardt, Gideon Jacobus. <i>"Exploring the application of Word2Vec to basket transaction data in the grocery retail industry."</i> ., ,Faculty of Science ,Department of Statistical Sciences, 2022. http://hdl.handle.net/11427/36434en_ZA
dc.identifier.citationDe Swardt, G.J. 2022. Exploring the application of Word2Vec to basket transaction data in the grocery retail industry. . ,Faculty of Science ,Department of Statistical Sciences. http://hdl.handle.net/11427/36434en_ZA
dc.identifier.ris TY - Master Thesis AU - De Swardt, Gideon Jacobus AB - In this thesis, we explore the application of Word2vec to basket transaction data provided by a large grocery retailer in South Africa. Word2vec is an algorithm based on representation learning. The objective of the exploration is to establish whether the application of Word2vec to basket transaction data would generate product embeddings that represent a useful relationship between products. Furthermore, we compareWord2vec's outputs and performance to traditional methods for studying product relationships which include Association Rules Mining (ARM) and Recommendation Systems. The results from the experiments showed that indeed product embeddings created by Word2vec on transaction data are meaningful and useful. It was clear that the idea of using transactions in the place of sentences to the neural network, provides analogous results to that of a natural language task. Word2vec clearly demonstrated its ability to cluster products that are homogeneous or fulfill similar needs. Furthermore this sort of product relationship was not provided by any other traditional methods, which was clear when comparing the outputs to that of ARM and Recommendation Systems. We also show that usingWord2vec could potentially provide insight on truly complementary products that ARM perhaps fails to do. Word2vec also proved to be incredibly scalable, taking input data of 20 times the size of what traditional methods could handle on a local computer. We end with a description of a potential application of the ideas learnt during the course of this study, with a real business problem, that we believe could lead to an enhanced customer shopping experience and in turn increase revenue and profits for the retailer. DA - 2022 DB - OpenUCT DP - University of Cape Town KW - statistical sciences LK - https://open.uct.ac.za PY - 2022 T1 - Exploring the application of Word2Vec to basket transaction data in the grocery retail industry TI - Exploring the application of Word2Vec to basket transaction data in the grocery retail industry UR - http://hdl.handle.net/11427/36434 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/36434
dc.identifier.vancouvercitationDe Swardt GJ. Exploring the application of Word2Vec to basket transaction data in the grocery retail industry. []. ,Faculty of Science ,Department of Statistical Sciences, 2022 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/36434en_ZA
dc.language.rfc3066eng
dc.publisher.departmentDepartment of Statistical Sciences
dc.publisher.facultyFaculty of Science
dc.subjectstatistical sciences
dc.titleExploring the application of Word2Vec to basket transaction data in the grocery retail industry
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationlevelMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2022_de swardt gideon jacobus.pdf
Size:
3.5 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections