The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
We report the largest eukaryotic genome to date in the monocot Paris japonica (Melanthiaceae, 1C = 152.23 pg), measured using flow cytometry. This value is 15% larger than any previous estimate and extends the range of genome sizes to c. 2400-fold across angiosperms and c. 66 000-fold across eukaryotes.
Genome size is a biodiversity trait that shows staggering diversity across eukaryotes, varying over 64,000-fold. Of all major taxonomic groups, land plants stand out due to their staggering genome size diversity, ranging ca. 2400-fold. As our understanding of the implications and significance of this remarkable genome size diversity in land plants grows, it is becoming increasingly evident that this trait plays not only an important role in shaping the evolution of plant genomes, but also in influencing plant community assemblages at the ecosystem level. Recent advances and improvements in novel sequencing technologies, as well as analytical tools, make it possible to gain critical insights into the genomic and epigenetic mechanisms underpinning genome size changes. In this review we provide an overview of our current understanding of genome size diversity across the different land plant groups, its implications on the biology of the genome and what future directions need to be addressed to fill key knowledge gaps.
Summary The genome evolution of ferns has been considered to be relatively static compared with angiosperms. In this study, we analyse genome size data and chromosome numbers in a phylogenetic framework to explore three hypotheses: the correlation of genome size and chromosome number, the origin of modern ferns from ancestors with high chromosome numbers, and the occurrence of several whole‐genome duplications during the evolution of ferns. To achieve this, we generated new genome size data, increasing the percentage of fern species with genome sizes estimated to 2.8% of extant diversity, and ensuring a comprehensive phylogenetic coverage including at least three species from each fern order. Genome size was correlated with chromosome number across all ferns despite some substantial variation in both traits. We observed a trend towards conservation of the amount of DNA per chromosome, although Osmundaceae and Psilotaceae have substantially larger chromosomes. Reconstruction of the ancestral genome traits suggested that the earliest ferns were already characterized by possessing high chromosome numbers and that the earliest divergences in ferns were correlated with substantial karyological changes. Evidence for repeated whole‐genome duplications was found across the phylogeny. Fern genomes tend to evolve slowly, albeit genome rearrangements occur in some clades.
Summary Plants exhibit an extraordinary range of genome sizes, varying by > 2000‐fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size.Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria.We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low‐abundance repeat‐derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal.Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high‐abundance repeat families.
New sequencing technologies allow development of genome-wide markers for any genus of ecological interest, including plant genera such as Betula (birch) that have previously proved difficult to study due to widespread polyploidy and hybridization. We present a de novo reference genome sequence assembly, from 66× short read coverage, of Betula nana (dwarf birch) - a diploid that is the keystone woody species of subarctic scrub communities but of conservation concern in Britain. We also present 100 bp PstI RAD markers for B. nana and closely related Betula tree species. Assembly of RAD markers in 15 individuals by alignment to the reference B. nana genome yielded 44-86k RAD loci per individual, whereas de novo RAD assembly yielded 64-121k loci per individual. Of the loci assembled by the de novo method, 3k homologous loci were found in all 15 individuals studied, and 35k in 10 or more individuals. Matching of RAD loci to RAD locus catalogues from the B. nana individual used for the reference genome showed similar numbers of matches from both methods of RAD locus assembly but indicated that the de novo RAD assembly method may overassemble some paralogous loci. In 12 individuals hetero-specific to B. nana 37-47k RAD loci matched a catalogue of RAD loci from the B. nana individual used for the reference genome, whereas 44-60k RAD loci aligned to the B. nana reference genome itself. We present a preliminary study of allele sharing among species, demonstrating the utility of the data for introgression studies and for the identification of species-specific alleles.
SignificanceWe performed de novo, full-genome sequence analysis of two Populus species, North American quaking and Eurasian trembling aspen, that contain striking levels of genetic variation. Our results showed that positive and negative selection broadly affects patterns of genomic variation, but to varying degrees across coding and noncoding regions. The strength of selection and rates of sequence divergence were strongly related to differences in gene expression and coexpression network connectivity. These results highlight the importance of both positive and negative selection in shaping genome-wide levels of genetic variation in an obligately outcrossing, perennial plant. The resources we present establish aspens as a powerful study system enabling future studies for understanding the genomic determinants of adaptive evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.