APIC: A method for automated pattern identification and classification
dc.contributor.advisor | Nitschke, Geoff Stuart | en_ZA |
dc.contributor.author | Goss, Ryan Gavin | en_ZA |
dc.date.accessioned | 2018-01-25T14:10:52Z | |
dc.date.available | 2018-01-25T14:10:52Z | |
dc.date.issued | 2017 | en_ZA |
dc.description.abstract | Machine Learning (ML) is a transformative technology at the forefront of many modern research endeavours. The technology is generating a tremendous amount of attention from researchers and practitioners, providing new approaches to solving complex classification and regression tasks. While concepts such as Deep Learning have existed for many years, the computational power for realising the utility of these algorithms in real-world applications has only recently become available. This dissertation investigated the efficacy of a novel, general method for deploying ML in a variety of complex tasks, where best feature selection, data-set labelling, model definition and training processes were determined automatically. Models were developed in an iterative fashion, evaluated using both training and validation data sets. The proposed method was evaluated using three distinct case studies, describing complex classification tasks often requiring significant input from human experts. The results achieved demonstrate that the proposed method compares with, and often outperforms, less general, comparable methods designed specifically for each task. Feature selection, data-set annotation, model design and training processes were optimised by the method, where less complex, comparatively accurate classifiers with lower dependency on computational power and human expert intervention were produced. In chapter 4, the proposed method demonstrated improved efficacy over comparable systems, automatically identifying and classifying complex application protocols traversing IP networks. In chapter 5, the proposed method was able to discriminate between normal and anomalous traffic, maintaining accuracy in excess of 99%, while reducing false alarms to a mere 0.08%. Finally, in chapter 6, the proposed method discovered more optimal classifiers than those implemented by comparable methods, with classification scores rivalling those achieved by state-of-the-art systems. The findings of this research concluded that developing a fully automated, general method, exhibiting efficacy in a wide variety of complex classification tasks with minimal expert intervention, was possible. The method and various artefacts produced in each case study of this dissertation are thus significant contributions to the field of ML. | en_ZA |
dc.identifier.apacitation | Goss, R. G. (2017). <i>APIC: A method for automated pattern identification and classification</i>. (Thesis). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/27025 | en_ZA |
dc.identifier.chicagocitation | Goss, Ryan Gavin. <i>"APIC: A method for automated pattern identification and classification."</i> Thesis., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2017. http://hdl.handle.net/11427/27025 | en_ZA |
dc.identifier.citation | Goss, R. 2017. APIC: A method for automated pattern identification and classification. University of Cape Town. | en_ZA |
dc.identifier.ris | TY - Thesis / Dissertation AU - Goss, Ryan Gavin AB - Machine Learning (ML) is a transformative technology at the forefront of many modern research endeavours. The technology is generating a tremendous amount of attention from researchers and practitioners, providing new approaches to solving complex classification and regression tasks. While concepts such as Deep Learning have existed for many years, the computational power for realising the utility of these algorithms in real-world applications has only recently become available. This dissertation investigated the efficacy of a novel, general method for deploying ML in a variety of complex tasks, where best feature selection, data-set labelling, model definition and training processes were determined automatically. Models were developed in an iterative fashion, evaluated using both training and validation data sets. The proposed method was evaluated using three distinct case studies, describing complex classification tasks often requiring significant input from human experts. The results achieved demonstrate that the proposed method compares with, and often outperforms, less general, comparable methods designed specifically for each task. Feature selection, data-set annotation, model design and training processes were optimised by the method, where less complex, comparatively accurate classifiers with lower dependency on computational power and human expert intervention were produced. In chapter 4, the proposed method demonstrated improved efficacy over comparable systems, automatically identifying and classifying complex application protocols traversing IP networks. In chapter 5, the proposed method was able to discriminate between normal and anomalous traffic, maintaining accuracy in excess of 99%, while reducing false alarms to a mere 0.08%. Finally, in chapter 6, the proposed method discovered more optimal classifiers than those implemented by comparable methods, with classification scores rivalling those achieved by state-of-the-art systems. The findings of this research concluded that developing a fully automated, general method, exhibiting efficacy in a wide variety of complex classification tasks with minimal expert intervention, was possible. The method and various artefacts produced in each case study of this dissertation are thus significant contributions to the field of ML. DA - 2017 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2017 T1 - APIC: A method for automated pattern identification and classification TI - APIC: A method for automated pattern identification and classification UR - http://hdl.handle.net/11427/27025 ER - | en_ZA |
dc.identifier.uri | http://hdl.handle.net/11427/27025 | |
dc.identifier.vancouvercitation | Goss RG. APIC: A method for automated pattern identification and classification. [Thesis]. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2017 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/27025 | en_ZA |
dc.language.iso | eng | en_ZA |
dc.publisher.department | Department of Computer Science | en_ZA |
dc.publisher.faculty | Faculty of Science | en_ZA |
dc.publisher.institution | University of Cape Town | |
dc.subject.other | Pattern Recognition | en_ZA |
dc.subject.other | Machine Learning | en_ZA |
dc.title | APIC: A method for automated pattern identification and classification | en_ZA |
dc.type | Doctoral Thesis | |
dc.type.qualificationlevel | Doctoral | |
dc.type.qualificationname | PhD | en_ZA |
uct.type.filetype | Text | |
uct.type.filetype | Image | |
uct.type.publication | Research | en_ZA |
uct.type.resource | Thesis | en_ZA |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- thesis_sci_2017_goss_ryan_gavin.pdf
- Size:
- 13.54 MB
- Format:
- Adobe Portable Document Format
- Description: