Integration and visualisation of data in bioinformatics

dc.contributor.advisorMulder Nicolaen_ZA
dc.contributor.authorSalazar, Gustavo Aen_ZA
dc.date.accessioned2016-02-08T07:18:55Z
dc.date.available2016-02-08T07:18:55Z
dc.date.issued2015en_ZA
dc.descriptionIncludes bibliographical referencesen_ZA
dc.description.abstractThe most recent advances in laboratory techniques aimed at observing and measuring biological processes are characterised by their ability to generate large amounts of data. The more data we gather, the greater the chance of finding clues to understand the systems of life. This, however, is only true if the methods that analyse the generated data are efficient, effective, and robust enough to overcome the challenges intrinsic to the management of big data. The computational tools designed to overcome these challenges should also take into account the requirements of current research. Science demands specialised knowledge for understanding the particularities of each study; in addition, it is seldom possible to describe a single observation without considering its relationship with other processes, entities or systems. This thesis explores two closely related fields: the integration and visualisation of biological data. We believe that these two branches of study are fundamental in the creation of scientific software tools that respond to the ever increasing needs of researchers. The distributed annotation system (DAS) is a community project that supports the integration of data from federated sources and its visualisation on web and stand-alone clients. We have extended the DAS protocol to improve its search capabilities and also to support feature annotation by the community. We have also collaborated on the implementation of MyDAS, a server to facilitate the publication of biological data following the DAS protocol, and contributed in the design of the protein DAS client called DASty. Furthermore, we have developed a tool called probeSearcher, which uses the DAS technology to facilitate the identification of microarray chips that include probes for regions on proteins of interest. Another community project in which we participated is BioJS, an open source library of visualisation components for biological data. This thesis includes a description of the project, our contributions to it and some developed components that are part of it. Finally, and most importantly, we combined several BioJS components over a modular architecture to create PINV, a web based visualiser of protein-protein interaction (PPI) networks, that takes advantage of the features of modern web technologies in order to explore PPI datasets on an almost ubiquitous platform (the web) and facilitates collaboration between scientific peers. This thesis includes a description of the design and development processes of PINV, as well as current use cases that have benefited from the tool and whose feedback has been the source of several improvements to PINV. Collectively, this thesis describes novel software tools that, by using modern web technologies, facilitates the integration, exploration and visualisation of biological data, which has the potential to contribute to our understanding of the systems of life.en_ZA
dc.identifier.apacitationSalazar, G. A. (2015). <i>Integration and visualisation of data in bioinformatics</i>. (Thesis). University of Cape Town ,Faculty of Health Sciences ,Institute of Infectious Disease and Molecular Medicine. Retrieved from http://hdl.handle.net/11427/16861en_ZA
dc.identifier.chicagocitationSalazar, Gustavo A. <i>"Integration and visualisation of data in bioinformatics."</i> Thesis., University of Cape Town ,Faculty of Health Sciences ,Institute of Infectious Disease and Molecular Medicine, 2015. http://hdl.handle.net/11427/16861en_ZA
dc.identifier.citationSalazar, G. 2015. Integration and visualisation of data in bioinformatics. University of Cape Town.en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Salazar, Gustavo A AB - The most recent advances in laboratory techniques aimed at observing and measuring biological processes are characterised by their ability to generate large amounts of data. The more data we gather, the greater the chance of finding clues to understand the systems of life. This, however, is only true if the methods that analyse the generated data are efficient, effective, and robust enough to overcome the challenges intrinsic to the management of big data. The computational tools designed to overcome these challenges should also take into account the requirements of current research. Science demands specialised knowledge for understanding the particularities of each study; in addition, it is seldom possible to describe a single observation without considering its relationship with other processes, entities or systems. This thesis explores two closely related fields: the integration and visualisation of biological data. We believe that these two branches of study are fundamental in the creation of scientific software tools that respond to the ever increasing needs of researchers. The distributed annotation system (DAS) is a community project that supports the integration of data from federated sources and its visualisation on web and stand-alone clients. We have extended the DAS protocol to improve its search capabilities and also to support feature annotation by the community. We have also collaborated on the implementation of MyDAS, a server to facilitate the publication of biological data following the DAS protocol, and contributed in the design of the protein DAS client called DASty. Furthermore, we have developed a tool called probeSearcher, which uses the DAS technology to facilitate the identification of microarray chips that include probes for regions on proteins of interest. Another community project in which we participated is BioJS, an open source library of visualisation components for biological data. This thesis includes a description of the project, our contributions to it and some developed components that are part of it. Finally, and most importantly, we combined several BioJS components over a modular architecture to create PINV, a web based visualiser of protein-protein interaction (PPI) networks, that takes advantage of the features of modern web technologies in order to explore PPI datasets on an almost ubiquitous platform (the web) and facilitates collaboration between scientific peers. This thesis includes a description of the design and development processes of PINV, as well as current use cases that have benefited from the tool and whose feedback has been the source of several improvements to PINV. Collectively, this thesis describes novel software tools that, by using modern web technologies, facilitates the integration, exploration and visualisation of biological data, which has the potential to contribute to our understanding of the systems of life. DA - 2015 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2015 T1 - Integration and visualisation of data in bioinformatics TI - Integration and visualisation of data in bioinformatics UR - http://hdl.handle.net/11427/16861 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/16861
dc.identifier.vancouvercitationSalazar GA. Integration and visualisation of data in bioinformatics. [Thesis]. University of Cape Town ,Faculty of Health Sciences ,Institute of Infectious Disease and Molecular Medicine, 2015 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/16861en_ZA
dc.language.isoengen_ZA
dc.publisher.departmentInstitute of Infectious Disease and Molecular Medicineen_ZA
dc.publisher.facultyFaculty of Health Sciencesen_ZA
dc.publisher.institutionUniversity of Cape Town
dc.subject.otherBioinformaticsen_ZA
dc.titleIntegration and visualisation of data in bioinformaticsen_ZA
dc.typeDoctoral Thesis
dc.type.qualificationlevelDoctoral
dc.type.qualificationnamePhDen_ZA
uct.type.filetypeText
uct.type.filetypeImage
uct.type.publicationResearchen_ZA
uct.type.resourceThesisen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_hsf_2015_salazar_gustavo_a.pdf
Size:
3.22 MB
Format:
Adobe Portable Document Format
Description:
Collections