APIC: A method for automated pattern identification and classification

Goss, Ryan Gavin

APIC: A method for automated pattern identification and classification

dc.contributor.advisor	Nitschke, Geoff Stuart	en_ZA
dc.contributor.author	Goss, Ryan Gavin	en_ZA
dc.date.accessioned	2018-01-25T14:10:52Z
dc.date.available	2018-01-25T14:10:52Z
dc.date.issued	2017	en_ZA
dc.description.abstract	Machine Learning (ML) is a transformative technology at the forefront of many modern research endeavours. The technology is generating a tremendous amount of attention from researchers and practitioners, providing new approaches to solving complex classification and regression tasks. While concepts such as Deep Learning have existed for many years, the computational power for realising the utility of these algorithms in real-world applications has only recently become available. This dissertation investigated the efficacy of a novel, general method for deploying ML in a variety of complex tasks, where best feature selection, data-set labelling, model definition and training processes were determined automatically. Models were developed in an iterative fashion, evaluated using both training and validation data sets. The proposed method was evaluated using three distinct case studies, describing complex classification tasks often requiring significant input from human experts. The results achieved demonstrate that the proposed method compares with, and often outperforms, less general, comparable methods designed specifically for each task. Feature selection, data-set annotation, model design and training processes were optimised by the method, where less complex, comparatively accurate classifiers with lower dependency on computational power and human expert intervention were produced. In chapter 4, the proposed method demonstrated improved efficacy over comparable systems, automatically identifying and classifying complex application protocols traversing IP networks. In chapter 5, the proposed method was able to discriminate between normal and anomalous traffic, maintaining accuracy in excess of 99%, while reducing false alarms to a mere 0.08%. Finally, in chapter 6, the proposed method discovered more optimal classifiers than those implemented by comparable methods, with classification scores rivalling those achieved by state-of-the-art systems. The findings of this research concluded that developing a fully automated, general method, exhibiting efficacy in a wide variety of complex classification tasks with minimal expert intervention, was possible. The method and various artefacts produced in each case study of this dissertation are thus significant contributions to the field of ML.	en_ZA
dc.identifier.apacitation	Goss, R. G. (2017). <i>APIC: A method for automated pattern identification and classification</i>. (Thesis). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/27025	en_ZA
dc.identifier.chicagocitation	Goss, Ryan Gavin. <i>"APIC: A method for automated pattern identification and classification."</i> Thesis., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2017. http://hdl.handle.net/11427/27025	en_ZA
dc.identifier.citation	Goss, R. 2017. APIC: A method for automated pattern identification and classification. University of Cape Town.	en_ZA
dc.identifier.ris	TY - Thesis / Dissertation AU - Goss, Ryan Gavin AB - Machine Learning (ML) is a transformative technology at the forefront of many modern research endeavours. The technology is generating a tremendous amount of attention from researchers and practitioners, providing new approaches to solving complex classification and regression tasks. While concepts such as Deep Learning have existed for many years, the computational power for realising the utility of these algorithms in real-world applications has only recently become available. This dissertation investigated the efficacy of a novel, general method for deploying ML in a variety of complex tasks, where best feature selection, data-set labelling, model definition and training processes were determined automatically. Models were developed in an iterative fashion, evaluated using both training and validation data sets. The proposed method was evaluated using three distinct case studies, describing complex classification tasks often requiring significant input from human experts. The results achieved demonstrate that the proposed method compares with, and often outperforms, less general, comparable methods designed specifically for each task. Feature selection, data-set annotation, model design and training processes were optimised by the method, where less complex, comparatively accurate classifiers with lower dependency on computational power and human expert intervention were produced. In chapter 4, the proposed method demonstrated improved efficacy over comparable systems, automatically identifying and classifying complex application protocols traversing IP networks. In chapter 5, the proposed method was able to discriminate between normal and anomalous traffic, maintaining accuracy in excess of 99%, while reducing false alarms to a mere 0.08%. Finally, in chapter 6, the proposed method discovered more optimal classifiers than those implemented by comparable methods, with classification scores rivalling those achieved by state-of-the-art systems. The findings of this research concluded that developing a fully automated, general method, exhibiting efficacy in a wide variety of complex classification tasks with minimal expert intervention, was possible. The method and various artefacts produced in each case study of this dissertation are thus significant contributions to the field of ML. DA - 2017 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2017 T1 - APIC: A method for automated pattern identification and classification TI - APIC: A method for automated pattern identification and classification UR - http://hdl.handle.net/11427/27025 ER -	en_ZA
dc.identifier.uri	http://hdl.handle.net/11427/27025
dc.identifier.vancouvercitation	Goss RG. APIC: A method for automated pattern identification and classification. [Thesis]. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2017 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/27025	en_ZA
dc.language.iso	eng	en_ZA
dc.publisher.department	Department of Computer Science	en_ZA
dc.publisher.faculty	Faculty of Science	en_ZA
dc.publisher.institution	University of Cape Town
dc.subject.other	Pattern Recognition	en_ZA
dc.subject.other	Machine Learning	en_ZA
dc.title	APIC: A method for automated pattern identification and classification	en_ZA
dc.type	Doctoral Thesis
dc.type.qualificationlevel	Doctoral
dc.type.qualificationname	PhD	en_ZA
uct.type.filetype	Text
uct.type.filetype	Image
uct.type.publication	Research	en_ZA
uct.type.resource	Thesis	en_ZA

Files

Original bundle

Now showing 1 - 1 of 1

Name:: thesis_sci_2017_goss_ryan_gavin.pdf
Size:: 13.54 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

PhD / Doctoral