Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when such methods are used to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads.
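The effect of discarding nonpolymorphic positions can be illustrated with a toy calculation. The sketch below is not REALPHY's implementation; the three sequences, the taxon names, and the helper functions `polymorphic_columns` and `p_distance` are hypothetical and serve only to show how pairwise distances are inflated when only SNP columns are kept.

```python
def polymorphic_columns(alignment):
    """Return indices of alignment columns in which at least two bases occur."""
    length = len(next(iter(alignment.values())))
    return [i for i in range(length)
            if len({seq[i] for seq in alignment.values()}) > 1]


def p_distance(a, b):
    """Proportion of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


# Toy alignment (made-up data): 10 sites, only 2 of which vary.
alignment = {
    "taxonA": "ACGTACGTAC",
    "taxonB": "ACGTACGTAT",
    "taxonC": "ACGAACGTAT",
}

snp_cols = polymorphic_columns(alignment)
snp_only = {name: "".join(seq[i] for i in snp_cols)
            for name, seq in alignment.items()}

full_ab = p_distance(alignment["taxonA"], alignment["taxonB"])
snp_ab = p_distance(snp_only["taxonA"], snp_only["taxonB"])
print(f"A-B distance, all sites: {full_ab}; SNP sites only: {snp_ab}")
```

In this toy case the same single difference between taxonA and taxonB is one site in ten when all positions are kept, but one site in two when only SNP columns remain, which is the kind of distortion a substitution model would then have to interpret.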
The origins of crop diseases are linked to domestication of plants. Most crops were domesticated centuries – even millennia – ago, thus limiting opportunity to understand the concomitant emergence of disease. Kiwifruit (Actinidia spp.) is an exception: domestication began in the 1930s with outbreaks of canker disease caused by P. syringae pv. actinidiae (Psa) first recorded in the 1980s. Based on SNP analyses of two circularized and 34 draft genomes, we show that Psa comprises distinct clades exhibiting negligible within-clade diversity, consistent with disease arising by independent samplings from a source population. Three clades correspond to their geographical source of isolation; a fourth, encompassing the Psa-V lineage responsible for the 2008 outbreak, is now globally distributed. Psa has an overall clonal population structure; however, genomes carry a marked signature of within-pathovar recombination. SNP analysis of Psa-V reveals hundreds of polymorphisms; however, most reside within PPHGI-1-like conjugative elements whose evolution is unlinked to the core genome. Removal of SNPs due to recombination yields an uninformative (star-like) phylogeny consistent with diversification of Psa-V from a single clone within the last ten years. Growth assays provide evidence of cultivar specificity, with rapid systemic movement of Psa-V in Actinidia chinensis. Genomic comparisons show a dynamic genome with evidence of positive selection on type III effectors and other candidate virulence genes. Each clade has highly varied complements of accessory genes encoding effectors and toxins with evidence of gain and loss via multiple genetic routes. Genes with orthologs in vascular pathogens were found exclusively within Psa-V. Our analyses capture a pathogen in the early stages of emergence from a predicted source population associated with wild Actinidia species. In addition to candidate genes as targets for resistance breeding programs, our findings highlight the importance of the source population as a reservoir of new disease.
Repetitive sequences are a conserved feature of many bacterial genomes. While first reported almost thirty years ago, and frequently exploited for genotyping purposes, little is known about their origin, maintenance, or processes affecting the dynamics of within-genome evolution. Here, beginning with analysis of the diversity and abundance of short oligonucleotide sequences in the genome of Pseudomonas fluorescens SBW25, we show that over-represented short sequences define three distinct groups (GI, GII, and GIII) of repetitive extragenic palindromic (REP) sequences. Patterns of REP distribution suggest that closely linked REP sequences form a functional replicative unit: REP doublets are over-represented, randomly distributed in extragenic space, and more highly conserved than singlets. In addition, doublets are organized as inverted repeats, which together with intervening spacer sequences are predicted to form hairpin structures in ssDNA or mRNA. We refer to these newly defined entities as REPINs (REP doublets forming hairpins) and identify short reads from population sequencing that reveal putative transposition intermediates. The proximal relationship between GI, GII, and GIII REPINs and specific REP-associated tyrosine transposases (RAYTs), combined with features of the putative transposition intermediate, suggests a mechanism for within-genome dissemination. Analysis of the distribution of REPs in a range of RAYT-containing bacterial genomes, including Escherichia coli K-12 and Nostoc punctiforme, shows that REPINs are a widely distributed, but hitherto unrecognized, family of miniature non-autonomous mobile DNA.
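The starting point described above, counting short oligonucleotides and asking whether two nearby copies are arranged as inverted repeats, can be sketched in a few lines. This is a minimal illustration rather than the authors' pipeline: the function names, the toy sequence, and the fixed count threshold `min_count` are assumptions, and a real analysis would compare observed counts with an expectation under a null model.

```python
from collections import Counter

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def count_kmers(seq, k):
    """Count every overlapping k-mer in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def frequent_kmers(seq, k, min_count):
    """Return k-mers seen at least min_count times (arbitrary threshold)."""
    return {kmer: n for kmer, n in count_kmers(seq, k).items()
            if n >= min_count}


def is_inverted_pair(first, second):
    """True if second is the reverse complement of first, i.e. the two
    copies could base-pair with each other to form a hairpin stem."""
    return second == reverse_complement(first)


toy = "ACGACGAC"  # made-up sequence, not from any genome
print(frequent_kmers(toy, 3, 2))
print(is_inverted_pair("AAGC", "GCTT"))
```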
Phenotype switching is commonly observed in nature. This prevalence has allowed the elucidation of a number of underlying molecular mechanisms. However, little is known about how phenotypic switches arise and function in their early evolutionary stages. The first opportunity to provide empirical insight was delivered by an experiment in which populations of the bacterium Pseudomonas fluorescens SBW25 evolved, de novo, the ability to switch between two colony phenotypes. Here we unravel the molecular mechanism behind colony switching, revealing how a single nucleotide change in a gene enmeshed in central metabolism (carB) generates such a striking phenotype. We show that colony switching is underpinned by ON/OFF expression of capsules consisting of a colanic acid-like polymer. We use molecular genetics, biochemical analyses, and experimental evolution to establish that capsule switching results from perturbation of the pyrimidine biosynthetic pathway. Of central importance is a bifurcation point at which uracil triphosphate is partitioned towards either nucleotide metabolism or polymer production. This bifurcation marks a cell-fate decision point whereby cells with relatively high pyrimidine levels favour nucleotide metabolism (capsule OFF), while cells with lower pyrimidine levels divert resources towards polymer biosynthesis (capsule ON). This decision point is present and functional in the wild-type strain. Finally, we present a simple mathematical model demonstrating that the molecular components of the decision point are capable of producing switching. Despite its simple mutational cause, the connection between genotype and phenotype is complex and multidimensional, offering a rare glimpse of how noise in regulatory networks can provide opportunity for evolution.
This new century's biology promises more of everything: more genes, more organisms, more species and, in short, more data. The flood of data challenges us to find better and quicker ways to summarize and analyse. Here, we present preliminary results and proofs of concept from three of our research projects that are motivated by our search for solutions to the perils of plenty. First, we discuss how models of evolution can accommodate change to better reflect the dynamics of sequence diversity, particularly when it is becoming a lot easier to obtain sequences at different times and across intervals where the probability of new mutations contributing to this diversity is high. Second, we describe our work on the use of a single locus for species delimitation; this research targets the new DNA-barcoding approach that aims to catalogue the entirety of life. We have developed a single-locus test based on the coalescent that tests the null hypothesis of panmixis. Finally, we discuss new sequencing technologies, the types of data available and the efficacy of alignment-free methods to estimate pairwise distances for phylogenetic analyses.
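As a rough illustration of what an alignment-free pairwise distance can look like, the sketch below compares the sets of k-mers found in two sequences using the Jaccard distance. The abstract does not say which alignment-free methods were evaluated, so this particular measure, the function names, and the default `k=3` are assumptions chosen for brevity.

```python
def kmer_set(seq, k):
    """Return the set of overlapping k-mers present in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a, b, k=3):
    """1 - |shared k-mers| / |all k-mers|; 0.0 means identical k-mer content."""
    ka, kb = kmer_set(a, k), kmer_set(b, k)
    union = ka | kb
    if not union:  # both sequences shorter than k
        return 0.0
    return 1 - len(ka & kb) / len(union)


# Made-up sequences; no alignment step is needed before comparing them.
print(jaccard_distance("ACGTA", "ACGTT"))
```

Distances of this kind can be computed directly from unaligned sequences and then passed to a distance-based tree-building method.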
Selfish genetic elements, such as insertion sequences and transposons, are found in most genomes. Transposons are usually identifiable by their high copy number within genomes. In contrast, REP-associated tyrosine transposases (RAYTs), a recently described class of bacterial transposase, are typically present at just one copy per genome. This suggests that RAYTs no longer copy themselves and thus they no longer function as a typical transposase. Motivated by this possibility we interrogated thousands of fully sequenced bacterial genomes in order to determine patterns of RAYT diversity, their distribution across chromosomes and accessory elements, and rate of duplication. RAYTs encompass exceptional diversity and are divisible into at least five distinct groups. They possess features more similar to housekeeping genes than insertion sequences, are predominantly vertically transmitted and have persisted through evolutionary time to the point where they are now found in 24% of all species for which at least one fully sequenced genome is available. Overall, the genomic distribution of RAYTs suggests that they have been coopted by host genomes to perform a function that benefits the host cell.
Pseudomonas fluorescens is a model for the study of adaptive radiation. When propagated in a spatially structured environment, the bacterium rapidly diversifies into a range of niche specialist genotypes. Here we present a genetic dissection and phenotypic characterization of the fuzzy spreader (FS) morphotype, a type that arises repeatedly during the course of the P. fluorescens radiation and appears to colonize the bottom of static broth microcosms. The causal mutation is located within gene fuzY (pflu0478), the fourth gene of the five-gene fuzVWXYZ operon. fuzY encodes a β-glycosyltransferase that is predicted to modify lipopolysaccharide (LPS) O antigens. The effect of the mutation is to cause cell flocculation. Analysis of 92 independent FS genotypes showed each to have arisen as the result of a loss-of-function mutation in fuzY, although different mutations have subtly different phenotypic and fitness effects. Mutations within fuzY were previously shown to suppress the phenotype of mat-forming wrinkly spreader (WS) types. This prompted a reinvestigation of FS niche preference. Time-lapse photography showed that FS colonizes the meniscus of broth microcosms, forming cellular rafts that, being too flimsy to form a mat, collapse to the vial bottom and then repeatably reform only to collapse. This led to a reassessment of the ecology of the P. fluorescens radiation. Finally, we show that ecological interactions between the three dominant emergent types (smooth, WS, and FS), combined with the interdependence of FS and WS on fuzY, can, at least in part, underpin an evolutionary arms race with bacteriophage SBW25Φ2, to which mutation in fuzY confers resistance.
Adaptive radiation, the rapid emergence of phenotypic and ecological diversity within an expanding lineage, is among the most striking of evolutionary phenomena (Darwin 1859; Lack 1947; Dobzhansky 1951; Simpson 1953; Schluter 2000; Kassen 2009; Losos 2010). Fueled by competition and facilitated by ecological opportunity, successive radiations have shaped much of life's diversity. Of central importance are the phenotypic innovations that fashion the fit between organism and environment (Schluter 2000). Understanding the nature of these innovations and the pathways by which they emerge and rise to prominence, often in parallel across estranged populations experiencing similar environments, is a central issue (Colosimo et al. 2005; Bantinaki et al. 2007; Conte et al. 2012). How mutational processes generate the variation presented to selection (McDonald et al. 2009; Braendle et al. 2010), how genetic architecture underpinning extant phenotypes determines the capacity of lineages to generate new and adaptive phenotypes (Poole et al. 2003; Wagner and Zhang 2011), and how ecological factors drive phenotypic divergence (Schluter 2009) are questions of seminal interest. The relative simplicity of microbial systems, their capacity for rapid evolutionary change, and advances in technology that enable detailed genotypic traceability offer a unique opportunity to log moment-by-...