Structural variation (SV) constitutes an important type of genetic mutations providing the raw material for evolution. Here, we uncover the genome-wide spectrum of intra-and interspecific SV segregating in natural populations of seven songbird species in the genus Corvus. Combining short-read (N = 127) and long-read re-sequencing (N = 31), as well as optical mapping (N = 16), we apply both assembly-and read mapping approaches to detect SV and characterize a total of 220,452 insertions, deletions and inversions. We exploit sampling across wide phylogenetic timescales to validate SV genotypes and assess the contribution of SV to evolutionary processes in an avian model of incipient speciation. We reveal an evolutionary young (~530,000 years) cis-acting 2.25-kb LTR retrotransposon insertion reducing expression of the NDP gene with consequences for premating isolation. Our results attest to the wealth and evolutionary significance of SV segregating in natural populations and highlight the need for reliable SV genotyping.
The tuatara (Sphenodon punctatus)-the only living member of the reptilian order Rhynchocephalia (Sphenodontia), once widespread across Gondwana 1,2-is an iconic species that is endemic to New Zealand 2,3. A key link to the now-extinct stem reptiles (from which dinosaurs, modern reptiles, birds and mammals evolved), the tuatara provides key insights into the ancestral amniotes 2,4. Here we analyse the genome of the tuatara, which-at approximately 5 Gb-is among the largest of the vertebrate genomes yet assembled. Our analyses of this genome, along with comparisons with other vertebrate genomes, reinforce the uniqueness of the tuatara. Phylogenetic analyses indicate that the tuatara lineage diverged from that of snakes and lizards around 250 million years ago. This lineage also shows moderate rates of molecular evolution, with instances of punctuated evolution. Our genome sequence analysis identifies expansions of proteins, non-protein-coding RNA families and repeat elements, the latter of which show an amalgam of reptilian and mammalian features. The sequencing of the tuatara genome provides a valuable resource for deep comparative analyses of tetrapods, as well as for tuatara biology and conservation. Our study also provides important insights into both the technical challenges and the cultural obligations that are associated with genome sequencing.
Genome assemblies are currently being produced at an impressive rate by consortia and individual laboratories. The low costs and increasing efficiency of sequencing technologies now enable assembling genomes at unprecedented quality and contiguity. However, the difficulty in assembling repeat‐rich and GC‐rich regions (genomic “dark matter”) limits insights into the evolution of genome structure and regulatory networks. Here, we compare the efficiency of currently available sequencing technologies (short/linked/long reads and proximity ligation maps) and combinations thereof in assembling genomic dark matter. By adopting different de novo assembly strategies, we compare individual draft assemblies to a curated multiplatform reference assembly and identify the genomic features that cause gaps within each assembly. We show that a multiplatform assembly implementing long‐read, linked‐read and proximity sequencing technologies performs best at recovering transposable elements, multicopy MHC genes, GC‐rich microchromosomes and the repeat‐rich W chromosome. Telomere‐to‐telomere assemblies are not a reality yet for most organisms, but by leveraging technology choice it is now possible to minimize genome assembly gaps for downstream analysis. We provide a roadmap to tailor sequencing projects for optimized completeness of both the coding and noncoding parts of nonmodel genomes.
It is a broadly observed pattern that the non-recombining regions of sex-limited chromosomes (Y and W) accumulate more repeats than the rest of the genome, even in species like birds with a low genome-wide repeat content. Here, we show that in birds with highly heteromorphic sex chromosomes, the W chromosome has a transposable element (TE) density of greater than 55% compared to the genome-wide density of less than 10%, and contains over half of all full-length (thus potentially active) endogenous retroviruses (ERVs) of the entire genome. Using RNA-seq and protein mass spectrometry data, we were able to detect signatures of female-specific ERV expression. We hypothesize that the avian W chromosome acts as a refugium for active ERVs, probably leading to female-biased mutational load that may influence female physiology similar to the ‘toxic-Y’ effect in Drosophila males. Furthermore, Haldane's rule predicts that the heterogametic sex has reduced fertility in hybrids. We propose that the excess of W-linked active ERVs over the rest of the genome may be an additional explanatory variable for Haldane's rule, with consequences for genetic incompatibilities between species through TE/repressor mismatches in hybrids. Together, our results suggest that the sequence content of female-specific W chromosomes can have effects far beyond sex determination and gene dosage. This article is part of the theme issue ‘Challenging the paradigm in sex chromosome evolution: empirical and theoretical insights with a focus on vertebrates (Part II)’.
The genomes of eukaryotes are full of parasitic sequences known as transposable elements (TEs). Here, we report the discovery of a putative giant tyrosine-recombinase-mobilized DNA transposon, Enterprise, from the model fungus Podospora anserina. Previously, we described a large genomic feature called the Spok block which is notable due to the presence of meiotic drive genes of the Spok gene family. The Spok block ranges from 110 kb to 247 kb and can be present in at least four different genomic locations within P. anserina, despite what is an otherwise highly conserved genome structure. We propose that the reason for its varying positions is that the Spok block is not only capable of meiotic drive but is also capable of transposition. More precisely, the Spok block represents a unique case where the Enterprise has captured the Spoks, thereby parasitizing a resident genomic parasite to become a genomic hyperparasite. Furthermore, we demonstrate that Enterprise (without the Spoks) is found in other fungal lineages, where it can be as large as 70 kb. Lastly, we provide experimental evidence that the Spok block is deleterious, with detrimental effects on spore production in strains which carry it. This union of meiotic drivers and a transposon has created a selfish element of impressive size in Podospora, challenging our perception of how TEs influence genome evolution and broadening the horizons in terms of what the upper limit of transposition may be.
29Songbirds have a species number almost equivalent to that of mammals, and are classic 30 models for studying mechanisms of speciation and sexual selection. Sex chromosomes are 31 hotspots of both processes, yet their evolutionary history in songbirds remains unclear. To 32 elucidate that, we characterize female genomes of 11 songbird species having ZW sex 33 chromosomes, with 5 genomes of bird-of-paradise species newly produced in this work. We 34 conclude that songbird sex chromosomes have undergone at least four steps of recombination 35 suppression before their species radiation, producing a gradient pattern of pairwise sequence 36 divergence termed 'evolutionary strata'. Interestingly, the latest stratum probably emerged due 37 to a songbird-specific burst of retrotransposon CR1-E1 elements at its boundary, or 38 chromosome inversion on the W chromosome. The formation of evolutionary strata has 39 reshaped the genomic architecture of both sex chromosomes. We find stepwise variations of Z-40 linked inversions, repeat and GC contents, as well as W-linked gene loss rate that are 41 associated with the age of strata. Over 30 W-linked genes have been preserved for their 42 essential functions, indicated by their higher and broader expression of orthologs in lizard than 43 those of other sex-linked genes. We also find a different degree of accelerated evolution of Z-44 linked genes vs. autosomal genes among different species, potentially reflecting their diversified 45 intensity of sexual selection. Our results uncover the dynamic evolutionary history of songbird 46 sex chromosomes, and provide novel insights into the mechanisms of recombination 47
University 28 29 Correspondence 30 31 V.P. (valentina.peona@ebc.uu.se), A.S. (alexander.suh@ebc.uu.se) 32 33 Abstract 38 39 Genome assemblies are currently being produced at an impressive rate by consortia and individual 40 laboratories. The low costs and increasing efficiency of sequencing technologies have opened up a 41 whole new world of genomic biodiversity. Although these technologies generate high-quality genome 42assemblies, there are still genomic regions difficult to assemble, like repetitive elements and GC-rich 43 regions (genomic "dark matter"). In this study, we compare the efficiency of currently used 44 a non-model organism (paradise crow) for which only suboptimal samples are available. Our 51 approach was able to reconstruct complex chromosomes like the repeat-rich W sex chromosome and 52 several GC-rich microchromosomes. Telomere-to-telomere assemblies are not a reality yet for most 53 organisms, but by leveraging technology choice it is possible to minimize genome assembly gaps for 54 downstream analysis. We provide a roadmap to tailor sequencing projects around the completeness 55 of both the coding and non-coding parts of the genomes. 56 57 2005). It is key now for the genomics field to overcome these limitations and investigate this dark 70 matter. 71 72Repetitive elements represent an important and prevalent part of the genomic dark matter of many 73 genomes, given that their abundance and repetitive nature makes it difficult to fully and confidently 74 assemble their sequences. This is particularly problematic when the read length is significantly shorter 75 than the repetitive element, in which case it is impossible to anchor the reads to unique genomic 76 regions. To what extent repeats can hamper genome assemblies depends on whether they are 77 interspersed or arranged in tandem. Highly similar interspersed repeats, like for example transposable 78
Background Repetitive DNA sequences, including transposable elements (TEs) and tandemly repeated satellite DNA (satDNAs), collectively called the “repeatome”, are found in high proportion in organisms across the Tree of Life. Grasshoppers have large genomes, averaging 9 Gb, that contain a high proportion of repetitive DNA, which has hampered progress in assembling reference genomes. Here we combined linked-read genomics with transcriptomics to assemble, characterize, and compare the structure of repetitive DNA sequences in four chromosomal races of the morabine grasshopper Vandiemenella viatica species complex and determine their contribution to genome evolution. Results We obtained linked-read genome assemblies of 2.73–3.27 Gb from estimated genome sizes of 4.26–5.07 Gb DNA per haploid genome of the four chromosomal races of V. viatica. These constitute the third largest insect genomes assembled so far. Combining complementary annotation tools and manual curation, we found a large diversity of TEs and satDNAs, constituting 66 to 75% per genome assembly. A comparison of sequence divergence within the TE classes revealed massive accumulation of recent TEs in all four races (314–463 Mb per assembly), indicating that their large genome sizes are likely due to similar rates of TE accumulation. Transcriptome sequencing showed more biased TE expression in reproductive tissues than somatic tissues, implying permissive transcription in gametogenesis. Out of 129 satDNA families, 102 satDNA families were shared among the four chromosomal races, which likely represent a diversity of satDNA families in the ancestor of the V. viatica chromosomal races. Notably, 50 of these shared satDNA families underwent differential proliferation since the recent diversification of the V. viatica species complex. Conclusion This in-depth annotation of the repeatome in morabine grasshoppers provided new insights into the genome evolution of Orthoptera. Our TEs analysis revealed a massive recent accumulation of TEs equivalent to the size of entire Drosophila genomes, which likely explains the large genome sizes in grasshoppers. Despite an overall high similarity of the TE and satDNA diversity between races, the patterns of TE expression and satDNA proliferation suggest rapid evolution of grasshopper genomes on recent timescales.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.