SDT: a virus classification tool based on pairwise sequence alignment and identity calculation

dc.contributor.authorMuhire, Brejnev Muhizi
dc.contributor.authorVarsani, Arvind
dc.contributor.authorMartin, Darren Patrick
dc.date.accessioned2016-08-15T12:24:31Z
dc.date.available2016-08-15T12:24:31Z
dc.date.issued2014
dc.date.updated2016-08-12T13:44:43Z
dc.description.abstractThe perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).en_ZA
dc.identifierhttp://dx.doi.org/ 10.1371/journal.pone.0108277
dc.identifier.apacitationMuhire, B. M., Varsani, A., & Martin, D. P. (2014). SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. <i>PLoS ONE</i>, http://hdl.handle.net/11427/21242en_ZA
dc.identifier.chicagocitationMuhire, Brejnev Muhizi, Arvind Varsani, and Darren Patrick Martin "SDT: a virus classification tool based on pairwise sequence alignment and identity calculation." <i>PLoS ONE</i> (2014) http://hdl.handle.net/11427/21242en_ZA
dc.identifier.citationMuhire, B. M., Varsani, A., & Martin, D. P. (2014). SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PloS one, 9(9), e108277.en_ZA
dc.identifier.issn1932-6203en_ZA
dc.identifier.ris TY - Journal Article AU - Muhire, Brejnev Muhizi AU - Varsani, Arvind AU - Martin, Darren Patrick AB - The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms). DA - 2014 DB - OpenUCT DP - University of Cape Town J1 - PLoS ONE LK - https://open.uct.ac.za PB - University of Cape Town PY - 2014 SM - 1932-6203 T1 - SDT: a virus classification tool based on pairwise sequence alignment and identity calculation TI - SDT: a virus classification tool based on pairwise sequence alignment and identity calculation UR - http://hdl.handle.net/11427/21242 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/21242
dc.identifier.vancouvercitationMuhire BM, Varsani A, Martin DP. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS ONE. 2014; http://hdl.handle.net/11427/21242.en_ZA
dc.languageengen_ZA
dc.publisherPublic Library of Scienceen_ZA
dc.publisher.departmentDepartment of Clinical Laboratory Sciencesen_ZA
dc.publisher.facultyFaculty of Health Sciencesen_ZA
dc.publisher.institutionUniversity of Cape Town
dc.rightsCreative Commons Attribution 4.0 International (CC BY 4.0)*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_ZA
dc.sourcePLoS ONEen_ZA
dc.source.urihttp://journals.plos.org/plosone/
dc.titleSDT: a virus classification tool based on pairwise sequence alignment and identity calculationen_ZA
dc.typeJournal Articleen_ZA
uct.type.filetypeText
uct.type.filetypeImage
uct.type.publicationResearchen_ZA
uct.type.resourceArticleen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Muhire_SDT_a_virus_classification_2014.pdf
Size:
937.55 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.72 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections