• English
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Latviešu
  • Magyar
  • Nederlands
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Log In
  • Communities & Collections
  • Browse OpenUCT
  • English
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Latviešu
  • Magyar
  • Nederlands
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Log In
  1. Home
  2. Browse by Author

Browsing by Author "Martin, Darren"

Now showing 1 - 20 of 21
Results Per Page
Sort Options
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Adaptive evolution by recombination is not associated with increased mutation rates in Maize streak virus
    (BioMed Central Ltd, 2012) Monjane, Aderito; Pande, Daniel; Lakay, Francisco; Shepherd, Dionne; van der Walt, Eric; Lefeuvre, Pierre; Lett, Jean-Michel; Varsani, Arvind; Rybicki, Edward; Martin, Darren
    BACKGROUND: Single-stranded (ss) DNA viruses in the family Geminiviridae are proving to be very useful in real-time evolution studies. The high mutation rate of geminiviruses and other ssDNA viruses is somewhat mysterious in that their DNA genomes are replicated in host nuclei by high fidelity host polymerases. Although strand specific mutation biases observed in virus species from the geminivirus genus Mastrevirus indicate that the high mutation rates in viruses in this genus may be due to mutational processes that operate specifically on ssDNA, it is currently unknown whether viruses from other genera display similar strand specific mutation biases. Also, geminivirus genomes frequently recombine with one another and an alternative cause of their high mutation rates could be that the recombination process is either directly mutagenic or produces a selective environment in which the survival of mutants is favoured. To investigate whether there is an association between recombination and increased basal mutation rates or increased degrees of selection favoring the survival of mutations, we compared the mutation dynamics of the MSV-MatA and MSV-VW field isolates of Maize streak virus (MSV; Mastrevirus), with both a laboratory constructed MSV recombinant, and MSV recombinants closely resembling MSV-MatA. To determine whether strand specific mutation biases are a general characteristic of geminivirus evolution we compared mutation spectra arising during these MSV experiments with those arising during similar experiments involving the geminivirus Tomato yellow leaf curl virus (Begomovirus genus). RESULTS: Although both the genomic distribution of mutations and the occurrence of various convergent mutations at specific genomic sites indicated that either mutation hotspots or selection for adaptive mutations might elevate observed mutation rates in MSV, we found no association between recombination and mutation rates. Importantly, when comparing the mutation spectra of MSV and TYLCV we observed similar strand specific mutation biases arising predominantly from imbalances in the complementary mutations G->T: C->A. CONCLUSIONS: While our results suggest that recombination does not strongly influence mutation rates in MSV, they indicate that high geminivirus mutation rates are at least partially attributable to increased susceptibility of all geminivirus genomes to oxidative damage while in a single stranded state.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Characterization of HIV-1 gag and nef in Cameroon: further evidence of extreme diversity at the origin of the HIV-1 group M epidemic
    (BioMed Central Ltd, 2013) Tongo, Marcel; Martin, Darren; Zembe, Lycias; Mpoudi-Ngole, Eitel; Williamson, Carolyn; Burgers, Wendy
    BACKGROUND: Cameroon, in west central Africa, has an extraordinary degree of HIV diversity, presenting a major challenge for the development of an effective HIV vaccine. Given the continuing need to closely monitor the emergence of new HIV variants in the country, we analyzed HIV-1 genetic diversity in 59 plasma samples from HIV-infected Cameroonian blood donors. Full length HIV gag and nef sequences were generated and phylogenetic analyses were performed. FINDINGS: All gag and nef sequences clustered within HIV-1M. Circulating recombinant form CRF02_AG predominated, accounting for 50% of the studied infections, followed by clade G (11%), clade D and CRF37_cpx (4% each), and clades A, F, CRF01_AE and CRF36_cpx (2% each). In addition, 22% of the studied viruses apparently had nef and gag genes from viruses belonging to different clades, with the majority (8/10) having either a nef or gag gene derived from CRF02_AG. Interestingly, five gag sequences (10%) and three (5%) nef sequences were neither obviously recombinant nor easily classifiable into any of the known HIV-1M clades. CONCLUSION: This suggests the widespread existence of highly divergent HIV lineages in Cameroon. While the genetic complexity of the Cameroonian HIV-1 epidemic has potentially serious implications for the design of biomedical interventions, detailed analyses of divergent Cameroonian HIV-1M lineages could be crucial for dissecting the earliest evolutionary steps in the emergence of HIV-1M.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Conserved positive selection signals in gp41 across multiple subtypes and difference in selection signals detectable in gp41 sequences sampled during acute and chronic HIV-1 subtype C infection
    (BioMed Central Ltd, 2008) Bandawe, Gama; Martin, Darren; Treurnicht, Florette; Mlisana, Koleka; Karim, Salim; Williamson, Carolyn
    BACKGROUND:The high diversity of HIV variants driving the global AIDS epidemic has caused many to doubt whether an effective vaccine against the virus is possible. However, by identifying the selective forces that are driving the ongoing diversification of HIV and characterising their genetic consequences, it may be possible to design vaccines that pre-empt some of the virus' more common evasion tactics. One component of such vaccines might be the envelope protein, gp41. Besides being targeted by both the humoral and cellular arms of the immune system this protein mediates fusion between viral and target cell membranes and is likely to be a primary determinant of HIV transmissibility. RESULTS: Using recombination aware analysis tools we compared site specific signals of selection in gp41 sequences from different HIV-1 M subtypes and circulating recombinant forms and identified twelve sites evolving under positive selection across multiple major HIV-1 lineages. To identify evidence of selection operating during transmission our analysis included two matched datasets sampled from patients with acute or chronic subtype C infections. We identified six gp41 sites apparently evolving under different selection pressures during acute and chronic HIV-1 infections. These sites mostly fell within functional gp41 domains, with one site located within the epitope recognised by the broadly neutralizing antibody, 4E10. CONCLUSION: Whereas these six sites are potentially determinants of fitness and are therefore good candidate targets for subtype-C specific vaccines, the twelve sites evolving under diversifying selection across multiple subtypes might make good candidate targets for broadly protective vaccines.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    East African cassava mosaic-like viruses from Africa to Indian ocean islands: molecular diversity, evolutionary history and geographical dissemination of a bipartite begomovirus
    (BioMed Central Ltd, 2012) De Bruyn, Alexandre; Villemot, Julie; Lefeuvre, Pierre; Villar, Emilie; Hoareau, Murielle; Harimalala, Mireille; Abdoul-Karime, Anli; Abdou-Chakour, Chadhouliati; Reynaud, Bernard; Harkins, Gordon; Varsani, Arvind; Martin, Darren; Lett, Jean-Michel
    BACKGROUND: Cassava (Manihot esculenta) is a major food source for over 200 million sub-Saharan Africans. Unfortunately, its cultivation is severely hampered by cassava mosaic disease (CMD). Caused by a complex of bipartite cassava mosaic geminiviruses (CMG) species (Family: Geminivirideae; Genus: Begomovirus) CMD has been widely described throughout Africa and it is apparent that CMG's are expanding their geographical distribution. Determining where and when CMG movements have occurred could help curtail its spread and reveal the ecological and anthropic factors associated with similar viral invasions. We applied Bayesian phylogeographic inference and recombination analyses to available and newly described CMG sequences to reconstruct a plausible history of CMG diversification and migration between Africa and South West Indian Ocean (SWIO) islands. RESULTS: The isolation and analysis of 114 DNA-A and 41 DNA-B sequences demonstrated the presence of three CMG species circulating in the Comoros and Seychelles archipelagos (East African cassava mosaic virus, EACMV; East African cassava mosaic Kenya virus, EACMKV; and East African cassava mosaic Cameroon virus, EACMCV). Phylogeographic analyses suggest that CMG's presence on these SWIO islands is probably the result of at least four independent introduction events from mainland Africa occurring between 1988 and 2009. Amongst the islands of the Comoros archipelago, two major migration pathways were inferred: One from Grande Comore to Moheli and the second from Mayotte to Anjouan. While only two recombination events characteristic of SWIO islands isolates were identified, numerous re-assortments events were detected between EACMV and EACMKV, which seem to almost freely interchange their genome components. CONCLUSIONS: Rapid and extensive virus spread within the SWIO islands was demonstrated for three CMG complex species. Strong evolutionary or ecological interaction between CMG species may explain both their propensity to exchange components and the absence of recombination with non-CMG begomoviruses. Our results suggest an important role of anthropic factors in CMGs spread as the principal axes of viral migration correspond with major routes of human movement and commercial trade. Finer-scale temporal analyses of CMGs to precisely scale the relative contributions of human and insect transmission to their movement dynamics will require further extensive sampling in the SWIO region.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Experimental evidence indicating that mastreviruses probably did not co-diverge with their hosts
    (BioMed Central Ltd, 2009) Harkins, Gordon; Delport, Wayne; Duffy, Siobain; Wood, Natasha; Monjane, Aderito; Owor, Betty; Donaldson, Lara; Saumtally, Salem; Triton, Guy; Briddon, Rob; Shepherd, Dionne; Rybicki, Edward; Martin, Darren; Varsani, Arvind
    BACKGROUND:Despite the demonstration that geminiviruses, like many other single stranded DNA viruses, are evolving at rates similar to those of RNA viruses, a recent study has suggested that grass-infecting species in the genus Mastrevirus may have co-diverged with their hosts over millions of years. This "co-divergence hypothesis" requires that long-term mastrevirus substitution rates be at least 100,000-fold lower than their basal mutation rates and 10,000-fold lower than their observable short-term substitution rates. The credibility of this hypothesis, therefore, hinges on the testable claim that negative selection during mastrevirus evolution is so potent that it effectively purges 99.999% of all mutations that occur. RESULTS: We have conducted long-term evolution experiments lasting between 6 and 32 years, where we have determined substitution rates of between 2 and 3 x 10-4 substitutions/site/year for the mastreviruses Maize streak virus (MSV) and Sugarcane streak Reunion virus (SSRV). We further show that mutation biases are similar for different geminivirus genera, suggesting that mutational processes that drive high basal mutation rates are conserved across the family. Rather than displaying signs of extremely severe negative selection as implied by the co-divergence hypothesis, our evolution experiments indicate that MSV and SSRV are predominantly evolving under neutral genetic drift. CONCLUSION: The absence of strong negative selection signals within our evolution experiments and the uniformly high geminivirus substitution rates that we and others have reported suggest that mastreviruses cannot have co-diverged with their hosts.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Experimental observations of rapid Maize streak virus evolution reveal a strand-specific nucleotide substitution bias
    (BioMed Central Ltd, 2008) van der Walt, Eric; Martin, Darren; Varsani, Arvind; Polston, Jane; Rybicki, Edward
    BACKGROUND: Recent reports have indicated that single-stranded DNA (ssDNA) viruses in the taxonomic families Geminiviridae, Parvoviridae and Anellovirus may be evolving at rates of ~10-4 substitutions per site per year (subs/site/year). These evolution rates are similar to those of RNA viruses and are surprisingly high given that ssDNA virus replication involves host DNA polymerases with fidelities approximately 10 000 times greater than those of error-prone viral RNA polymerases. Although high ssDNA virus evolution rates were first suggested in evolution experiments involving the geminivirus maize streak virus (MSV), the evolution rate of this virus has never been accurately measured. Also, questions regarding both the mechanistic basis and adaptive value of high geminivirus mutation rates remain unanswered. RESULTS: We determined the short-term evolution rate of MSV using full genome analysis of virus populations initiated from cloned genomes. Three wild type viruses and three defective artificial chimaeric viruses were maintained in planta for up to five years and displayed evolution rates of between 7.4 x 10-4 and 7.9 x 10-4 subs/site/year. CONCLUSION: These MSV evolution rates are within the ranges observed for other ssDNA viruses and RNA viruses. Although no obvious evidence of positive selection was detected, the uneven distribution of mutations within the defective virus genomes suggests that some of the changes may have been adaptive. We also observed inter-strand nucleotide substitution imbalances that are consistent with a recent proposal that high mutation rates in geminiviruses (and possibly ssDNA viruses in general) may be due to mutagenic processes acting specifically on ssDNA molecules.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Exploring the phylodynamics, genetic reassortment and RNA secondary structure formation patterns of orthomyxoviruses by comparative sequence analysis
    (2019) Nindo, Fredrick Nzabanyi; Martin, Darren
    RNA viruses are among the most virulent microorganisms that threaten the health of humans and livestock. Among the most socio-economically important of the known RNA viruses are those found in the family Orthomyxovirus. In this era of rapid low-cost genome sequencing and advancements in computational biology techniques, many previously difficult research questions relating to the molecular epidemiology and evolutionary dynamics of these viruses can now be answered with ease. Using sequence data together with associated meta-data, in chapter two of this dissertation I tested the hypothesis that the Influenza A/H1N1 2009 pandemic virus was introduced multiple times into Africa, and subsequently dispersed heterogeneously across the continent. I further tested to what degree factors such as road distances and air travel distances impacted the observed pattern of spread of this virus in Africa using a generalised linear modelbased approach. The results suggested that their were multiple simultaneous introductions of 2009 pandemic A/H1N1 into Africa, and geographical distance and human mobility through air travel played an important role towards dissemination. In chapter three, I set out to test two hypotheses: (1) that there is no difference in the frequency of reassortments among the segments that constitute influenza virus genomes; and (2) that there is epochal temporal reassortment among influenza viruses and that all geographical regions are equally likely sources of epidemiologically important influenza virus reassortant lineages. The findings suggested that surface segments are more frequently exchanges than internal genes and that North America/Asia, Oceania, and Asia could be the most likely source locations for reassortant Influenza A, B and C virus lineages respectively. In chapter four of this thesis, I explored the formation of RNA secondary structures within the genomes of orthomyxoviruses belonging to five genera: Influenza A, B and C, Infectious Salmon Anaemia Virus and Thogotovirus using in silico RNA folding predictions and additional molecular evolution and phylogenetic tests to show that structured regions may be biologically functional. The presence of some conserved structures across the five genera is likely a reflection of the biological importance of these structures, warranting further investigation regarding their role in the evolution and possible development of antiviral resistance. The studies herein demonstrate that pathogen genomics-based analytical approaches are useful both for understanding the mechanisms that drive the evolution and spread of rapidly evolving viral pathogens such as orthomyxoviruses, and for illuminating how these approaches could be leveraged to improve the management of these pathogens.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Extensive purifying selection acting on synonymous sites in HIV-1 Group M sequences
    (BioMed Central Ltd, 2008) Ngandu, Nobubelo; Scheffler, Konrad; Moore, Penny; Woodman, Zenda; Martin, Darren; Seoighe, Cathal
    BACKGROUND: Positive selection pressure acting on protein-coding sequences is usually inferred when the rate of nonsynonymous substitution is greater than the synonymous rate. However, purifying selection acting directly on the nucleotide sequence can lower the synonymous substitution rate. This could result in false inference of positive selection because when synonymous changes at some sites are under purifying selection, the average synonymous rate is an underestimate of the neutral rate of evolution. Even though HIV-1 coding sequences contain a number of regions that function at the nucleotide level, and are thus likely to be affected by purifying selection, studies of positive selection assume that synonymous substitutions can be used to estimate the neutral rate of evolution. RESULTS: We modelled site-to-site variation in the synonymous substitution rate across coding regions of the HIV-1 genome. Synonymous substitution rates were found to vary significantly within and between genes. Surprisingly, regions of the genome that encode proteins in more than one frame had significantly higher synonymous substitution rates than regions coding in a single frame. We found evidence of strong purifying selection pressure affecting synonymous mutations in fourteen regions with known functions. These included an exonic splicing enhancer, the rev-responsive element, the poly-purine tract and a transcription factor binding site. A further five highly conserved regions were located within known functional domains. We also found four conserved regions located in env and vpu which have not been characterized previously. CONCLUSION: We provide the coordinates of genomic regions with markedly lower synonymous substitution rates, which are putatively under the influence of strong purifying selection pressure at the nucleotide level as well as regions encoding proteins in more than one frame. These regions should be excluded from studies of positive selection acting on HIV-1 coding regions.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Genetic variation and recombination of RdRp and HSP 70h genes of Citrus tristeza virus isolates from orange trees showing symptoms of citrus sudden death disease
    (BioMed Central Ltd, 2008) Gomes, Clarissa; Nagata, Tatsuya; de Jesus, Waldir; Neto, Carlos; Pappas, Georgios; Martin, Darren
    BACKGROUND: Citrus sudden death (CSD), a disease that rapidly kills orange trees, is an emerging threat to the Brazilian citrus industry. Although the causal agent of CSD has not been definitively determined, based on the disease's distribution and symptomatology it is suspected that the agent may be a new strain of Citrus tristeza virus (CTV). CTV genetic variation was therefore assessed in two Brazilian orange trees displaying CSD symptoms and a third with more conventional CTV symptoms. RESULTS: A total of 286 RNA-dependent-RNA polymerase (RdRp) and 284 heat shock protein 70 homolog (HSP70h) gene fragments were determined for CTV variants infecting the three trees. It was discovered that, despite differences in symptomatology, the trees were all apparently coinfected with similar populations of divergent CTV variants. While mixed CTV infections are common, the genetic distance between the most divergent population members observed (24.1% for RdRp and 11.0% for HSP70h) was far greater than that in previously described mixed infections. Recombinants of five distinct RdRp lineages and three distinct HSP70h lineages were easily detectable but respectively accounted for only 5.9 and 11.9% of the RdRp and HSP70h gene fragments analysed and there was no evidence of an association between particular recombinant mosaics and CSD. Also, comparisons of CTV population structures indicated that the two most similar CTV populations were those of one of the trees with CSD and the tree without CSD. CONCLUSION: We suggest that if CTV is the causal agent of CSD, it is most likely a subtle feature of population structures within mixed infections and not merely the presence (or absence) of a single CTV variant within these populations that triggers the disease.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    A highly divergent South African geminivirus species illuminates the ancient evolutionary history of this family
    (BioMed Central Ltd, 2009) Varsani, Arvind; Shepherd, Dionne; Dent, Kyle; Monjane, Aderito; Rybicki, Edward; Martin, Darren
    BACKGROUND:We have characterised a new highly divergent geminivirus species, Eragrostis curvula streak virus (ECSV), found infecting a hardy perennial South African wild grass. ECSV represents a new genus-level geminivirus lineage, and has a mixture of features normally associated with other specific geminivirus genera. RESULTS: Whereas the ECSV genome is predicted to express a replication associated protein (Rep) from an unspliced complementary strand transcript that is most similar to those of begomoviruses, curtoviruses and topocuviruses, its Rep also contains what is apparently a canonical retinoblastoma related protein interaction motif such as that found in mastreviruses. Similarly, while ECSV has the same unusual TAAGATTCC virion strand replication origin nonanucleotide found in another recently described divergent geminivirus, Beet curly top Iran virus (BCTIV), the rest of the transcription and replication origin is structurally more similar to those found in begomoviruses and curtoviruses than it is to those found in BCTIV and mastreviruses. ECSV also has what might be a homologue of the begomovirus transcription activator protein gene found in begomoviruses, a mastrevirus-like coat protein gene and two intergenic regions. CONCLUSION: Although it superficially resembles a chimaera of geminiviruses from different genera, the ECSV genome is not obviously recombinant, implying that the features it shares with other geminiviruses are those that were probably present within the last common ancestor of these viruses. In addition to inferring how the ancestral geminivirus genome may have looked, we use the discovery of ECSV to refine various hypotheses regarding the recombinant origins of the major geminivirus lineages.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Human Immunodeficiency Virus? One of nature's greatest evolutionary machines
    (2004) Martin, Darren; Williamson, Carolyn
    Transmission of an HIV-like virus from chimpanzees to humans approximately 80 years ago triggered the worldwide AIDS pandemic. Possessing very high mutation and recombination rates, the descendants of this ancestral virus have evolved greatly. Most of this evolution has been in response to selective pressures imposed by human immune responses and has not provided HIV with any significant new biological characteristics. The continuing diversification of HIV variants is a principal obstruction to controlling the virus with drugs and vaccines.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Identification of evolutionarily conserved nucleic acid secondary structures within viral genomes
    (2013) Golden, Michael Thomas; Martin, Darren
    Large single-stranded nucleotide sequences such as certain DNA and RNA virus genomes form complex secondary structures through Watson-Crick base-pairing interactions between bases on the same DNA or RNA strand. Both computational and experimental methods are available for determining the secondary structure conformations of nucleotide sequences and have been used to identify many thousands of structural elements within these sequences. However, to date only a small subset of these elements have been shown to have any biological functionality. In this study, a software tool enabling the graphical visualisation of structural elements overlaid with additional biological information was developed in order to assist researchers in identifying elements with likely biological functionality within larger secondary structures. Additionally, a tool was developed for identifying conserved structural elements amongst a set of related sequences. Using these tools a number of conserved elements were identified within computationally predicted secondary structures of HIV and five flavivirus species. Further evidence of selection having acted to preserve the identified elements, in support of their likely biological importance, was provided by evolutionary analysis of related sequences. In addition to the identification of individual structural elements, a large-scale analysis of HIV-1 and seven flavivirus species revealed statistically significant associations between base-paired nucleotide positions and various patterns of mutation. While the results of the various analyses carried out were consistent with the presence of between one and five highly-conserved functionally important secondary structures within the various genomes analysed, they were also consistent with the occurrence of many less-conserved ( or only transiently conserved) structures that potentially have no specific biological functions. Although many of these less-conserved structures may, as has previously been suggested, defend these genomes against intra-cellular anti-viral siRNAs, a strong association was found between sites that are base-paired and codon sites that are apparently evolving under purifying selection: a finding which suggests that an additional or alternative role of these structures may simply be to reduce basal mutation frequencies, and hence amino acid substitution rates at codon sites that encode functionally important amino acids. An investigation of HIV-1 natural inter- and intra-subtype recombinant sequences revealed that recombination breakpoints had a significant tendency to occur more frequently at base-paired nucleotide positions within the computationally predicted secondary structures of these sequences. Furthermore, it was found that these natural HIV-1 recombinants were less disruptive of protein tertiary structure and RNA secondary structure conformations than expected by chance: suggesting that natural selection favouring the maintenance of both protein and nucleic acid folding has likely had a major influence on the patterns of recombination that are detectable within HIV genomes. Finally, a general method for predicting the secondary structures of a set of related sequences sharing similar secondary structure conformations was developed. By taking alignment uncertainty into account through the analysis of large numbers of similarly plausible multiple sequence alignments that are sampled using a statistical alignment procedure, the method was found to have better predictive accuracy than existing related RNA secondary structure prediction methods.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Identifying recombinants in human and primate immunodeficiency virus sequence alignments using quartet scanning
    (BioMed Central Ltd, 2009) Lemey, Philippe; Lott, Martin; Martin, Darren; Moulton, Vincent
    BACKGROUND:Recombination has a profound impact on the evolution of viruses, but characterizing recombination patterns in molecular sequences remains a challenging endeavor. Despite its importance in molecular evolutionary studies, identifying the sequences that exhibit such patterns has received comparatively less attention in the recombination detection framework. Here, we extend a quartet-mapping based recombination detection method to enable identification of recombinant sequences without prior specifications of either query and reference sequences. Through simulations we evaluate different recombinant identification statistics and significance tests. We compare the quartet approach with triplet-based methods that employ additional heuristic tests to identify parental and recombinant sequences. RESULTS: Analysis of phylogenetic simulations reveal that identifying the descendents of relatively old recombination events is a challenging task for all methods available, and that quartet scanning performs relatively well compared to the triplet based methods. The use of quartet scanning is further demonstrated by analyzing both well-established and putative HIV-1 recombinant strains. In agreement with recent findings, we provide evidence that the presumed circulating recombinant CRF02_AG is a 'pure' lineage, whereas the presumed parental lineage subtype G has a recombinant origin. We also demonstrate HIV-1 intrasubtype recombination, confirm the hybrid origin of SIV in chimpanzees and further disentangle the recombinant history of SIV lineages in a primate immunodeficiency virus data set. CONCLUSION: Quartet scanning makes a valuable addition to triplet-based methods for identifying recombinant sequences without prior specifications of either query and reference sequences. The new method is available in the VisRD v.3.0 package http://www.cmp.uea.ac.uk/~vlm/visrd.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Immunology and virology of HIV-1 infection in Cameroon
    (2014) Tongo Passo, Aime Marcel Simon; Burgers, Wendy; Martin, Darren
    This study confirms the widespread existence of highly divergent HIV lineages in Cameroon. While the genetic complexity of the Cameroonian HIV-1 epidemic has potentially serious implications for the design of biomedical interventions, detailed analyses of divergent Cameroonian HIV-1 group M lineages could be crucial for dissecting the earliest evolutionary steps in the emergence of HIV-1 group M. In addition, the central nature of HIV-1 consensus M sequences resulted in their broad recognition, but failed to identify highly immunodominant peptides between homogeneous and diverse HIV epidemics. Further refinement of these immunogens may contribute to the development of a globally relevant vaccine. Finally, the use of PTE peptides did not increase the breadth of T cell recognition in Abstract Page xvi this divergent population when compared to consensus M peptides. This underlies the need to include more mosaic peptides representing the variety of viruses that circulate in the region.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    In Silico Characterisation of Putative Prophages in Lactobacillaceae Used in Probiotics for Vaginal Health
    (2022-01-20) Happel, Anna-Ursula; Kullin, Brian R; Gamieldien, Hoyam; Jaspan, Heather B; Varsani, Arvind; Martin, Darren; Passmore, Jo-Ann S; Froissart, Rémy
    While live biotherapeutics offer a promising approach to optimizing vaginal microbiota, the presence of functional prophages within introduced Lactobacillaceae strains could impact their safety and efficacy. We evaluated the presence of prophages in 895 publicly available Lactobacillaceae genomes using Phaster, Phigaro, Phispy, Prophet and Virsorter. Prophages were identified according to stringent (detected by ≥4 methods) or lenient criteria (detected by ≥2 methods), both with >80% reciprocal sequence overlap. The stringent approach identified 448 prophages within 359 genomes, with 40.1% genomes harbouring at least one prophage, while the lenient approach identified 1671 prophages within 83.7% of the genomes. To confirm our in silico estimates in vitro, we tested for inducible prophages in 57 vaginally-derived and commercial Lactobacillaceae isolates and found inducible prophages in 61.4% of the isolates. We characterised the in silico predicted prophages based on weighted gene repertoire relatedness and found that most belonged to the Siphoviridae or Myoviridae families. ResFam and eggNOG identified four potential antimicrobial resistance genes within the predicted prophages. Our results suggest that while Lactobacillaceae prophages seldomly carry clinically concerning genes and thus unlikely a pose a direct risk to human vaginal microbiomes, their high prevalence warrants the characterisation of Lactobacillaceae prophages in live biotherapeutics.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    The influence of secondary structure, selection and recombination on rubella virus nucleotide substitution rate estimates
    (BioMed Central Ltd, 2014) Cloete, Leendert; Tanov, Emil; Muhire, Brejnev; Martin, Darren; Harkins, Gordon
    BACKGROUND: Annually, rubella virus (RV) still causes severe congenital defects in around 100 000 children globally. An attempt to eradicate RV is currently underway and analytical tools to monitor the global decline of the last remaining RV lineages will be useful for assessing the effectiveness of this endeavour. RV evolves rapidly enough that much of this information might be inferable from RV genomic sequence data. METHODS: Using BEASTv1.8.0, we analysed publically available RV sequence data to estimate genome-wide and gene-specific nucleotide substitution rates to test whether current estimates of RV substitution rates are representative of the entire RV genome. We specifically accounted for possible confounders of nucleotide substitution rate estimates, such as temporally biased sampling, sporadic recombination, and natural selection favouring either increased or decreased genetic diversity (estimated by the PARRIS and FUBAR methods), at nucleotide sites within the genomic secondary structures (predicted by the NASP method). RESULTS: We determine that RV nucleotide substitution rates range from 1.19 x 10-3 substitutions/site/year in the E1 region to 7.52 x 10-4 substitutions/site/year in the P150 region. We find that differences between substitution rate estimates in different RV genome regions are largely attributable to temporal sampling biases such that datasets containing higher proportions of recently sampled sequences, will tend to have inflated estimates of mean substitution rates. Although there exists little evidence of positive selection or natural genetic recombination in RV, we show that RV genomes possess pervasive biologically functional nucleic acid secondary structure and that purifying selection acting to maintain this structure contributes substantially to variations in estimated nucleotide substitution rates across RV genomes. CONCLUSION: Both temporal sampling biases and purifying selection favouring the conservation of RV nucleic acid secondary structures have an appreciable impact on substitution rate estimates but do not preclude the use of RV sequence data to date ancestral sequences. The combination of uniformly high substitution rates across the RV genome and strong temporal structure within the available sequence data, suggests that such data should be suitable for tracking the demographic, epidemiological and movement dynamics of this virus during eradication attempts.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Leveraging big data resources and data integration in biology: applying computational systems analyses and machine learning to gain insights into the biology of cancers
    (2020) Sinkala, Musalula; Martin, Darren; Mulder, Nicola; Barth, Stefan
    Recently, many "molecular profiling" projects have yielded vast amounts of genetic, epigenetic, transcription, protein expression, metabolic and drug response data for cancerous tumours, healthy tissues, and cell lines. We aim to facilitate a multi-scale understanding of these high-dimensional biological data and the complexity of the relationships between the different data types taken from human tumours. Further, we intend to identify molecular disease subtypes of various cancers, uncover the subtype-specific drug targets and identify sets of therapeutic molecules that could potentially be used to inhibit these targets. We collected data from over 20 publicly available resources. We then leverage integrative computational systems analyses, network analyses and machine learning, to gain insights into the pathophysiology of pancreatic cancer and 32 other human cancer types. Here, we uncover aberrations in multiple cell signalling and metabolic pathways that implicate regulatory kinases and the Warburg effect as the likely drivers of the distinct molecular signatures of three established pancreatic cancer subtypes. Then, we apply an integrative clustering method to four different types of molecular data to reveal that pancreatic tumours can be segregated into two distinct subtypes. We define sets of proteins, mRNAs, miRNAs and DNA methylation patterns that could serve as biomarkers to accurately differentiate between the two pancreatic cancer subtypes. Then we confirm the biological relevance of the identified biomarkers by showing that these can be used together with pattern-recognition algorithms to infer the drug sensitivity of pancreatic cancer cell lines accurately. Further, we evaluate the alterations of metabolic pathway genes across 32 human cancers. We find that while alterations of metabolic genes are pervasive across all human cancers, the extent of these gene alterations varies between them. Based on these gene alterations, we define two distinct cancer supertypes that tend to be associated with different clinical outcomes and show that these supertypes are likely to respond differently to anticancer drugs. Overall, we show that the time has already arrived where we can leverage available data resources to potentially elicit more precise and personalised cancer therapies that would yield better clinical outcomes at a much lower cost than is currently being achieved.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    The population genomics of begomoviruses: global scale population structure and gene flow
    (BioMed Central Ltd, 2010) Prasanna, HC; Sinha, D; Verma, Ajay; Singh, Major; Singh, Bijendra; Rai, Mathura; Martin, Darren
    BACKGROUND: The rapidly growing availability of diverse full genome sequences from across the world is increasing the feasibility of studying the large-scale population processes that underly observable pattern of virus diversity. In particular, characterizing the genetic structure of virus populations could potentially reveal much about how factors such as geographical distributions, host ranges and gene flow between populations combine to produce the discontinuous patterns of genetic diversity that we perceive as distinct virus species. Among the richest and most diverse full genome datasets that are available is that for the dicotyledonous plant infecting genus, Begomovirus, in the Family Geminiviridae. The begomoviruses all share the same whitefly vector, are highly recombinogenic and are distributed throughout tropical and subtropical regions where they seriously threaten the food security of the world's poorest people. RESULTS: We focus here on using a model-based population genetic approach to identify the genetically distinct sub-populations within the global begomovirus meta-population. We demonstrate the existence of at least seven major sub-populations that can further be sub-divided into as many as thirty four significantly differentiated and genetically cohesive minor sub-populations. Using the population structure framework revealed in the present study, we further explored the extent of gene flow and recombination between genetic populations. CONCLUSIONS: Although geographical barriers are apparently the most significant underlying cause of the seven major population sub-divisions, within the framework of these sub-divisions, we explore patterns of gene flow to reveal that both host range differences and genetic barriers to recombination have probably been major contributors to the minor population sub-divisions that we have identified. We believe that the global Begomovirus population structure revealed here could facilitate population genetics studies into how central parameters of population genetics namely selection, recombination, mutation, gene flow, and genetic drift shape the global begomovirus diversity.
  • Loading...
    Thumbnail Image
    Item
    Open Access
    Recombination hotspots and host susceptibility modulate the adaptive value of recombination during maize streak virus evolution
    (BioMed Central Ltd, 2011) Monjane, Aderito; van der Walt, Eric; Varsani, Arvind; Rybicki, Edward; Martin, Darren
    BACKGROUND:Maize streak virus -strain A (MSV-A; Genus Mastrevirus, Family Geminiviridae), the maize-adapted strain of MSV that causes maize streak disease throughout sub-Saharan Africa, probably arose between 100 and 200 years ago via homologous recombination between two MSV strains adapted to wild grasses. MSV recombination experiments and analyses of natural MSV recombination patterns have revealed that this recombination event entailed the exchange of the movement protein - coat protein gene cassette, bounded by the two genomic regions most prone to recombination in mastrevirus genomes; the first surrounding the virion-strand origin of replication, and the second around the interface between the coat protein gene and the short intergenic region. Therefore, aside from the likely adaptive advantages presented by a modular exchange of this cassette, these specific breakpoints may have been largely predetermined by the underlying mechanisms of mastrevirus recombination. To investigate this hypothesis, we constructed artificial, low-fitness, reciprocal chimaeric MSV genomes using alternating genomic segments from two MSV strains; a grass-adapted MSV-B, and a maize-adapted MSV-A. Between them, each pair of reciprocal chimaeric genomes represented all of the genetic material required to reconstruct - via recombination - the highly maize-adapted MSV-A genotype, MSV-MatA. We then co-infected a selection of differentially MSV-resistant maize genotypes with pairs of reciprocal chimaeras to determine the efficiency with which recombination would give rise to high-fitness progeny genomes resembling MSV-MatA. RESULTS: Recombinants resembling MSV-MatA invariably arose in all of our experiments. However, the accuracy and efficiency with which the MSV-MatA genotype was recovered across all replicates of each experiment depended on the MSV susceptibility of the maize genotypes used and the precise positions - in relation to known recombination hotspots - of the breakpoints required to re-create MSV-MatA. Although the MSV-sensitive maize genotype gave rise to the greatest variety of recombinants, the measured fitness of each of these recombinants correlated with their similarity to MSV-MatA. CONCLUSIONS: The mechanistic predispositions of different MSV genomic regions to recombination can strongly influence the accessibility of high-fitness MSV recombinants. The frequency with which the fittest recombinant MSV genomes arise also correlates directly with the escalating selection pressures imposed by increasingly MSV-resistant maize hosts.
  • No Thumbnail Available
    Item
    Open Access
    Using machine learning to understand the link between gene essentiality, gene expression and the chemosensitivity of cancer cells
    (2024) Mcinga, Kuhle; Sinkala, Musalula; Martin, Darren
    The emergence of pharmacogenomics databases has presented unique opportunities to leverage machine learning in precision medicine, particularly in drug response prediction. In this thesis, an in-depth investigation is conducted on carefully curated and integrated breast cancer focused datasets from the GDSC (Genomics of Drug Sensitivity in Cancer) and Achilles (CRISPR derived) project databases. Specifically, machine learning techniques are employed to accurately predict the drug responses of cancer cells, laying the groundwork for personalised treatment strategies. Through rigorous training of machine learning models, drug-response classifiers were devised that demonstrated remarkable predictive capabilities, with the best performing classifier achieving an F1-score of 0.86 and an AUC of 0.85, indicating its effectiveness in drug response prediction. Training these models on GDSC and Achilles datasets encompassing various drug IC50 values, ensured generalization of the models across different drugs and cell
  • «
  • 1 (current)
  • 2
  • »
UCT Libraries logo

Contact us

Jill Claassen

Manager: Scholarly Communication & Publishing

Email: openuct@uct.ac.za

+27 (0)21 650 1263

  • Open Access @ UCT

    • OpenUCT LibGuide
    • Open Access Policy
    • Open Scholarship at UCT
    • OpenUCT FAQs
  • UCT Publishing Platforms

    • UCT Open Access Journals
    • UCT Open Access Monographs
    • UCT Press Open Access Books
    • Zivahub - Open Data UCT
  • Site Usage

    • Cookie settings
    • Privacy policy
    • End User Agreement
    • Send Feedback

DSpace software copyright © 2002-2026 LYRASIS