A binaural sound sources localisation application for smart phones

dc.contributor.advisorWinberg, Simonen_ZA
dc.contributor.authorMugagga, Pius Kavuma Basajjabakaen_ZA
dc.date.accessioned2017-05-16T07:36:53Z
dc.date.available2017-05-16T07:36:53Z
dc.date.issued2015en_ZA
dc.description.abstractThe ability to estimate positions of sound sources is one that gives animals a 360° awareness of their acoustic environment. This helps compliment the visual scene which is restricted to 180° in humans. Unfortunately, deaf people are left out on this ability. Smart phones are rapidly becoming a common tool amongst mobile users in developed and emerging markets. Their processing ability has more than doubled since their introduction to mass consumer markets by Apple in 2007. Top-end smart phones such as the Samsung Galaxy Series; 3, 4, and 5 models, have two microphones with which one can acquire stereo recordings. The purpose of this research project was to establish a feasible Sound source localization algorithm for current top-end smart phones, and to recommend hardware improvements for future smart phones, to pave way for the use of smart phones as advanced auditory sensory devices capable of acting as avatars for intelligent remote systems to learn about different acoustic scenes with help of human users. The GCC-PHAT algorithm was chosen as the underlying core DOA algorithm due to its suitability for pair-wise localization as highlighted in literature. A stochastic power accumulation algorithm was designed and implemented to improve estimation outcomes by GCC-PHAT. This algorithm was based on inspiration from W-disjoint orthogonality assumption in literature and was extended to perform sound source counting and time domain source separation. The system yielded satisfactory azimuth estimates of sound source directions in real time with pin-point DOA estimation accuracy rates of 64%, and 90.67% accuracy rate when a tolerance of ± 1 correlation sample is considered. An effort to resolve front back ambiguity using phone orientation data from the MEMs sensors yielded un-satisfactory results prompting a recommendation that an extra microphone would be needed to achieve 360° localization in a more user friendly way. The dissertation concludes with plans for further work on the topic and provision of a further refined API and optimised libraries to facilitate development of customised solutions using this system.en_ZA
dc.identifier.apacitationMugagga, P. K. B. (2015). <i>A binaural sound sources localisation application for smart phones</i>. (Thesis). University of Cape Town ,Faculty of Engineering & the Built Environment ,Department of Electrical Engineering. Retrieved from http://hdl.handle.net/11427/24295en_ZA
dc.identifier.chicagocitationMugagga, Pius Kavuma Basajjabaka. <i>"A binaural sound sources localisation application for smart phones."</i> Thesis., University of Cape Town ,Faculty of Engineering & the Built Environment ,Department of Electrical Engineering, 2015. http://hdl.handle.net/11427/24295en_ZA
dc.identifier.citationMugagga, P. 2015. A binaural sound sources localisation application for smart phones. University of Cape Town.en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Mugagga, Pius Kavuma Basajjabaka AB - The ability to estimate positions of sound sources is one that gives animals a 360° awareness of their acoustic environment. This helps compliment the visual scene which is restricted to 180° in humans. Unfortunately, deaf people are left out on this ability. Smart phones are rapidly becoming a common tool amongst mobile users in developed and emerging markets. Their processing ability has more than doubled since their introduction to mass consumer markets by Apple in 2007. Top-end smart phones such as the Samsung Galaxy Series; 3, 4, and 5 models, have two microphones with which one can acquire stereo recordings. The purpose of this research project was to establish a feasible Sound source localization algorithm for current top-end smart phones, and to recommend hardware improvements for future smart phones, to pave way for the use of smart phones as advanced auditory sensory devices capable of acting as avatars for intelligent remote systems to learn about different acoustic scenes with help of human users. The GCC-PHAT algorithm was chosen as the underlying core DOA algorithm due to its suitability for pair-wise localization as highlighted in literature. A stochastic power accumulation algorithm was designed and implemented to improve estimation outcomes by GCC-PHAT. This algorithm was based on inspiration from W-disjoint orthogonality assumption in literature and was extended to perform sound source counting and time domain source separation. The system yielded satisfactory azimuth estimates of sound source directions in real time with pin-point DOA estimation accuracy rates of 64%, and 90.67% accuracy rate when a tolerance of ± 1 correlation sample is considered. An effort to resolve front back ambiguity using phone orientation data from the MEMs sensors yielded un-satisfactory results prompting a recommendation that an extra microphone would be needed to achieve 360° localization in a more user friendly way. The dissertation concludes with plans for further work on the topic and provision of a further refined API and optimised libraries to facilitate development of customised solutions using this system. DA - 2015 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2015 T1 - A binaural sound sources localisation application for smart phones TI - A binaural sound sources localisation application for smart phones UR - http://hdl.handle.net/11427/24295 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/24295
dc.identifier.vancouvercitationMugagga PKB. A binaural sound sources localisation application for smart phones. [Thesis]. University of Cape Town ,Faculty of Engineering & the Built Environment ,Department of Electrical Engineering, 2015 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/24295en_ZA
dc.language.isoengen_ZA
dc.publisher.departmentDepartment of Electrical Engineeringen_ZA
dc.publisher.facultyFaculty of Engineering and the Built Environment
dc.publisher.institutionUniversity of Cape Town
dc.subject.otherElectrical Engineeringen_ZA
dc.titleA binaural sound sources localisation application for smart phonesen_ZA
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationnameMSc (Eng)en_ZA
uct.type.filetypeText
uct.type.filetypeImage
uct.type.publicationResearchen_ZA
uct.type.resourceThesisen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_ebe_2015_mugagga_pius_kavuma_basajjabaka.pdf
Size:
8.19 MB
Format:
Adobe Portable Document Format
Description:
Collections