Optimisation of galaxy identification methods on large interferometric surveys

dc.contributor.advisorKraan-Korteweg, Renee Christine
dc.contributor.advisorJarrett, Thomas
dc.contributor.authorGqaza, Themba
dc.date.accessioned2019-05-15T07:05:01Z
dc.date.available2019-05-15T07:05:01Z
dc.date.issued2018
dc.date.updated2019-05-14T11:41:35Z
dc.description.abstractThe astronomical size of spectral data cubes that will result from the SKA pathfinders planned large HI surveys such as LADUMA; Fornax HI survey; DINGO; WALLABY; etc. necessitate fully automated three-dimensional (3D) source finding and parametrization tools. A fraction of the percentage difference in the performance of these automated tools corresponds to a significant number of galaxies being detected or undetected. Failure or success to resolve satellites around big spirals will affect both the low and the high mass end of the HI mass function. As a result, the performance and efficiency of these automated tools are of great importance, especially in the epoch of big data. Here I present the comprehensive comparison of performance between the fully automated source identification and parametrization software: SOFIA, the visual galaxy identification method and the semi-automated galaxy identification method. Each galaxy identification method has been applied to the same ∼ 35 gigabytes 3D HI data cube. The data cube results from the blind HI imaging survey conducted using the Westerbork Synthesis Radio Telescope (WSRT). The survey mapped the overdensity corresponding to the Perseus-Pisces Supercluster filament crossing the Zone-of-Avoidance (ZoA), at (`, b) ≈ (160◦ , 0.5◦ ). A total of 211 galaxies detected using the semi-automated method by Ramatsoku et al. [2016]. In this work, I detected 194 galaxies (using the visual identification method) of which 89.7% (174) have cross-matches/counterparts on the galaxy catalogue produced through semi-automated identification method. A total of 130 detections were made using SOFIA of which 89 were also identified by the two other methods. I used the sample of 174 visual detections with semi-automated counterparts as a Testbed to calculate the reliability and completeness achieved by SOFIA. The achieved reliability is ∼ 0.68 whereas completeness is ∼ 0.51. Further parameter fine-tuning is necessary to have a better handle on all SOFIA parameters and achieve higher reliability and completeness values.
dc.identifier.apacitationGqaza, T. (2018). <i>Optimisation of galaxy identification methods on large interferometric surveys</i>. (). ,Faculty of Science ,Department of Astronomy. Retrieved from http://hdl.handle.net/11427/30072en_ZA
dc.identifier.chicagocitationGqaza, Themba. <i>"Optimisation of galaxy identification methods on large interferometric surveys."</i> ., ,Faculty of Science ,Department of Astronomy, 2018. http://hdl.handle.net/11427/30072en_ZA
dc.identifier.citationGqaza, T. 2018. Optimisation of galaxy identification methods on large interferometric surveys. . ,Faculty of Science ,Department of Astronomy. http://hdl.handle.net/11427/30072en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Gqaza, Themba AB - The astronomical size of spectral data cubes that will result from the SKA pathfinders planned large HI surveys such as LADUMA; Fornax HI survey; DINGO; WALLABY; etc. necessitate fully automated three-dimensional (3D) source finding and parametrization tools. A fraction of the percentage difference in the performance of these automated tools corresponds to a significant number of galaxies being detected or undetected. Failure or success to resolve satellites around big spirals will affect both the low and the high mass end of the HI mass function. As a result, the performance and efficiency of these automated tools are of great importance, especially in the epoch of big data. Here I present the comprehensive comparison of performance between the fully automated source identification and parametrization software: SOFIA, the visual galaxy identification method and the semi-automated galaxy identification method. Each galaxy identification method has been applied to the same ∼ 35 gigabytes 3D HI data cube. The data cube results from the blind HI imaging survey conducted using the Westerbork Synthesis Radio Telescope (WSRT). The survey mapped the overdensity corresponding to the Perseus-Pisces Supercluster filament crossing the Zone-of-Avoidance (ZoA), at (`, b) ≈ (160◦ , 0.5◦ ). A total of 211 galaxies detected using the semi-automated method by Ramatsoku et al. [2016]. In this work, I detected 194 galaxies (using the visual identification method) of which 89.7% (174) have cross-matches/counterparts on the galaxy catalogue produced through semi-automated identification method. A total of 130 detections were made using SOFIA of which 89 were also identified by the two other methods. I used the sample of 174 visual detections with semi-automated counterparts as a Testbed to calculate the reliability and completeness achieved by SOFIA. The achieved reliability is ∼ 0.68 whereas completeness is ∼ 0.51. Further parameter fine-tuning is necessary to have a better handle on all SOFIA parameters and achieve higher reliability and completeness values. DA - 2018 DB - OpenUCT DP - University of Cape Town KW - techniques: interferometric-methods: data analysis-radio lines: galaxies LK - https://open.uct.ac.za PY - 2018 T1 - Optimisation of galaxy identification methods on large interferometric surveys TI - Optimisation of galaxy identification methods on large interferometric surveys UR - http://hdl.handle.net/11427/30072 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/30072
dc.identifier.vancouvercitationGqaza T. Optimisation of galaxy identification methods on large interferometric surveys. []. ,Faculty of Science ,Department of Astronomy, 2018 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/30072en_ZA
dc.language.rfc3066eng
dc.publisher.departmentDepartment of Astronomy
dc.publisher.facultyFaculty of Science
dc.subjecttechniques: interferometric-methods: data analysis-radio lines: galaxies
dc.titleOptimisation of galaxy identification methods on large interferometric surveys
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationnameMSc
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2018_gqaza_themba.pdf
Size:
10.09 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections