GenGraph: a python module for the simple generation and manipulation of genome graphs

dc.contributor.authorAmbler, Jon M
dc.contributor.authorMulaudzi, Shandukani
dc.contributor.authorMulder, Nicola
dc.date.accessioned2019-12-10T09:20:21Z
dc.date.available2019-12-10T09:20:21Z
dc.date.issued2019-10-25
dc.date.updated2019-10-27T06:24:22Z
dc.description.abstractAbstract Background As sequencing technology improves, the concept of a single reference genome is becoming increasingly restricting. In the case of Mycobacterium tuberculosis, one must often choose between using a genome that is closely related to the isolate, or one that is annotated in detail. One promising solution to this problem is through the graph based representation of collections of genomes as a single genome graph. Though there are currently a handful of tools that can create genome graphs and have demonstrated the advantages of this new paradigm, there still exists a need for flexible tools that can be used by researchers to overcome challenges in genomics studies. Results We present GenGraph, a Python toolkit and accompanying modules that use existing multiple sequence alignment tools to create genome graphs. Python is one of the most popular coding languages for the biological sciences, and by providing these tools, GenGraph makes it easier to experiment and develop new tools that utilise genome graphs. The conceptual model used is highly intuitive, and as much as possible the graph structure represents the biological relationship between the genomes. This design means that users will quickly be able to start creating genome graphs and using them in their own projects. We outline the methods used in the generation of the graphs, and give some examples of how the created graphs may be used. GenGraph utilises existing file formats and methods in the generation of these graphs, allowing graphs to be visualised and imported with widely used applications, including Cytoscape, R, and Java Script. Conclusions GenGraph provides a set of tools for generating graph based representations of sets of sequences with a simple conceptual model, written in the widely used coding language Python, and publicly available on Github.
dc.identifier.apacitationAmbler, J. M., Mulaudzi, S., & Mulder, N. (2019). GenGraph: a python module for the simple generation and manipulation of genome graphs. http://hdl.handle.net/11427/30696en_ZA
dc.identifier.chicagocitationAmbler, Jon M, Shandukani Mulaudzi, and Nicola Mulder "GenGraph: a python module for the simple generation and manipulation of genome graphs." (2019) http://hdl.handle.net/11427/30696en_ZA
dc.identifier.citationBMC Bioinformatics. 2019 Oct 25;20(1):519
dc.identifier.ris TY - Journal Article AU - Ambler, Jon M AU - Mulaudzi, Shandukani AU - Mulder, Nicola AB - Abstract Background As sequencing technology improves, the concept of a single reference genome is becoming increasingly restricting. In the case of Mycobacterium tuberculosis, one must often choose between using a genome that is closely related to the isolate, or one that is annotated in detail. One promising solution to this problem is through the graph based representation of collections of genomes as a single genome graph. Though there are currently a handful of tools that can create genome graphs and have demonstrated the advantages of this new paradigm, there still exists a need for flexible tools that can be used by researchers to overcome challenges in genomics studies. Results We present GenGraph, a Python toolkit and accompanying modules that use existing multiple sequence alignment tools to create genome graphs. Python is one of the most popular coding languages for the biological sciences, and by providing these tools, GenGraph makes it easier to experiment and develop new tools that utilise genome graphs. The conceptual model used is highly intuitive, and as much as possible the graph structure represents the biological relationship between the genomes. This design means that users will quickly be able to start creating genome graphs and using them in their own projects. We outline the methods used in the generation of the graphs, and give some examples of how the created graphs may be used. GenGraph utilises existing file formats and methods in the generation of these graphs, allowing graphs to be visualised and imported with widely used applications, including Cytoscape, R, and Java Script. Conclusions GenGraph provides a set of tools for generating graph based representations of sets of sequences with a simple conceptual model, written in the widely used coding language Python, and publicly available on Github. DA - 2019-10-25 DB - OpenUCT DP - University of Cape Town KW - Genome KW - Graph KW - Toolkit KW - Python KW - Module LK - https://open.uct.ac.za PY - 2019 T1 - GenGraph: a python module for the simple generation and manipulation of genome graphs TI - GenGraph: a python module for the simple generation and manipulation of genome graphs UR - http://hdl.handle.net/11427/30696 ER - en_ZA
dc.identifier.urihttps://doi.org/10.1186/s12859-019-3115-8
dc.identifier.urihttp://hdl.handle.net/11427/30696
dc.identifier.vancouvercitationAmbler JM, Mulaudzi S, Mulder N. GenGraph: a python module for the simple generation and manipulation of genome graphs. 2019; http://hdl.handle.net/11427/30696.en_ZA
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dc.subjectGenome
dc.subjectGraph
dc.subjectToolkit
dc.subjectPython
dc.subjectModule
dc.titleGenGraph: a python module for the simple generation and manipulation of genome graphs
dc.typeJournal Article
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12859_2019_Article_3115.pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description:
Collections