While short-read sequencing technology has resulted in a sharp increase in the number of species with genome assemblies, these assemblies are typically highly fragmented. Repeats pose the largest challenge for reference genome assembly, and pericentromeric regions and the repeat-rich Y chromosome are typically ignored from sequencing projects. Here, we assemble the genome of Drosophila miranda using long reads for contig formation, chromatin interaction maps for scaffolding and short reads, and optical mapping and bacterial artificial chromosome (BAC) clone sequencing for consensus validation. Our assembly recovers entire chromosomes and contains large fractions of repetitive DNA, including about 41.5 Mb of pericentromeric and telomeric regions, and >100 Mb of the recently formed highly repetitive neo-Y chromosome. While Y chromosome evolution is typically characterized by global sequence loss and shrinkage, the neo-Y increased in size by almost 3-fold because of the accumulation of repetitive sequences. Our high-quality assembly allows us to reconstruct the chromosomal events that have led to the unusual sex chromosome karyotype in D. miranda, including the independent de novo formation of a pair of sex chromosomes at two distinct time points, or the reversion of a former Y chromosome to an autosome.
Tandemly repeating satellite DNA elements in heterochromatin occupy a substantial portion of many eukaryotic genomes. Although often characterized as genomic parasites deleterious to the host, they also can be crucial for essential processes such as chromosome segregation. Adding to their interest, satellite DNA elements evolve at high rates; among Drosophila, closely related species often differ drastically in both the types and abundances of satellite repeats. However, due to technical challenges, the evolutionary mechanisms driving this rapid turnover remain unclear. Here we characterize natural variation in simple-sequence repeats of 2-10 bp from inbred Drosophila melanogaster lines derived from multiple populations, using a method we developed called k-Seek that analyzes unassembled Illumina sequence reads. In addition to quantifying all previously described satellite repeats, we identified many novel repeats of low to medium abundance. Many of the repeats show population differentiation, including two that are present in only some populations. Interestingly, the population structure inferred from overall satellite quantities does not recapitulate the expected population relationships based on the demographic history of D. melanogaster. We also find that some satellites of similar sequence composition are correlated across lines, revealing concerted evolution. Moreover, correlated satellites tend to be interspersed with each other, further suggesting that concerted change is partially driven by higher order structure. Surprisingly, we identified negative correlations among some satellites, suggesting antagonistic interactions. Our study demonstrates that current genome assemblies vastly underestimate the complexity, abundance, and variation of highly repetitive satellite DNA and presents approaches to understand their rapid evolutionary divergence.satellite DNA | population differentiation | rapid evolution H eterochromatin occupies a substantial portion of most eukaryotic genomes and contains vast quantities of tandemly repeating, noncoding DNA elements known as satellite DNA. These sequences, along with transposable elements, are often described as selfish elements or genomic parasites, as they can increase their copy numbers irrespective of host fitness (1, 2). Indeed, they can be highly deleterious for the host genome; for example, ectopic recombination between homologous satellite repeats can lead to devastating chromosomal rearrangements (3, 4). Consequently, these elements are mostly sequestered in repressive chromatin environments around the centromeres and telomeres where there is minimal recombination and transcriptional activity. However, paradoxically, repetitive sequences are also crucial components of euchromatic genomes, as they recruit the centromeric histone H3 variant to form centromeres in many species (5, 6), thereby affecting the fidelity of chromosome segregation (7,8).Adding to the perplexity, satellite DNA turns over at remarkably high rates between species (9, 10). In Drosophila mel...
Hybrid incompatibilities (HIs) cause reproductive isolation between species and thus contribute to speciation. Several HI genes encode adaptively evolving proteins that localize to or interact with heterochromatin, suggesting that HIs may result from co-evolution with rapidly evolving heterochromatic DNA. Little is known, however, about the intraspecific function of these HI genes, the specific sequences they interact with, or the evolutionary forces that drive their divergence. The genes Hmr and Lhr genetically interact to cause hybrid lethality between Drosophila melanogaster and D. simulans, yet mutations in both genes are viable. Here, we report that Hmr and Lhr encode proteins that form a heterochromatic complex with Heterochromatin Protein 1 (HP1a). Using RNA-Seq analyses we discovered that Hmr and Lhr are required to repress transcripts from satellite DNAs and many families of transposable elements (TEs). By comparing Hmr and Lhr function between D. melanogaster and D. simulans we identify several satellite DNAs and TEs that are differentially regulated between the species. Hmr and Lhr mutations also cause massive overexpression of telomeric TEs and significant telomere lengthening. Hmr and Lhr therefore regulate three types of heterochromatic sequences that are responsible for the significant differences in genome size and structure between D. melanogaster and D. simulans and have high potential to cause genetic conflicts with host fitness. We further find that many TEs are overexpressed in hybrids but that those specifically mis-expressed in lethal hybrids do not closely correlate with Hmr function. Our results therefore argue that adaptive divergence of heterochromatin proteins in response to repetitive DNAs is an important underlying force driving the evolution of hybrid incompatibility genes, but that hybrid lethality likely results from novel epistatic genetic interactions that are distinct to the hybrid background.
Simple satellites are tandemly repeating short DNA motifs that can span megabases in eukaryotic genomes. Because they can cause genomic instability through nonallelic homologous exchange, they are primarily found in the repressive heterochromatin near centromeres and telomeres where recombination is minimal, and on the Y chromosome, where they accumulate as the chromosome degenerates. Interestingly, the types and abundances of simple satellites often vary dramatically between closely related species, suggesting that they turn over rapidly. However, limited sampling has prevented detailed understanding of their evolutionary dynamics. Here, we characterize simple satellites from whole-genome sequences generated from males and females of nine Drosophila species, spanning 40 Ma of evolution. We show that PCR-free library preparation and postsequencing GC-correction better capture satellite quantities than conventional methods. We find that over half of the 207 simple satellites identified are species-specific, consistent with previous descriptions of their rapid evolution. Based on a maximum parsimony framework, we determined that most interspecific differences are due to lineage-specific gains. Simple satellites gained within a species are typically a single mutation away from abundant existing satellites, suggesting that they likely emerge from existing satellites, especially in the genomes of satellite-rich species. Interestingly, unlike most of the other lineages which experience various degrees of gains, the lineage leading up to the satellite-poor D. pseudoobscura and D. persimilis appears to be recalcitrant to gains, providing a counterpoint to the notion that simple satellites are universally rapidly evolving.
Meiotic drivers are selfish genetic elements that bias their transmission into gametes, often to the detriment of the rest of the genome. The resulting intragenomic conflicts triggered by meiotic drive create evolutionary arms races and shape genome evolution. The phenomenon of meiotic drive is widespread across taxa but is particularly prominent in the Drosophila genus. Recent studies in Drosophila have provided insights into the genetic origins of drivers and their molecular mechanisms. Here, we review the current literature on mechanisms of drive with an emphasis on sperm killers in Drosophila species. In these systems, meiotic drivers often evolve from gene duplications and targets are generally linked to heterochromatin. While dense in repetitive elements and difficult to study using traditional genetic and genomic approaches, recent work in Drosophila has made progress on the heterochromatic compartment of the genome. Although we still understand little about precise drive mechanisms, studies of male drive systems are converging on common themes such as heterochromatin regulation, small RNA pathways, and nuclear transport pathways. Meiotic drive systems are therefore promising models for discovering fundamental features of gametogenesis.
Male Drosophila typically have achiasmatic meiosis, and fusions between autosomes and the Y chromosome have repeatedly created non-recombining neo-Y chromosomes that degenerate. Intriguingly, Drosophila nasuta males recombine, but their close relative D. albomicans reverted back to achiasmy after evolving neo-sex chromosomes. Here we use genome-wide polymorphism data to reconstruct the complex evolutionary history of neo-sex chromosomes in D. albomicans and examine the effect of recombination and its cessation on the initiation of neo-Y decay. Population and phylogenomic analyses reveal three distinct neo-Y types that are geographically restricted. Due to ancestral recombination with the neo-X, overall nucleotide diversity on the neo-Y is similar to the neo-X but severely reduced within neo-Y types. Consistently, the neo-Y chromosomes fail to form a monophyletic clade in sliding window trees outside of the region proximal to the fusion. Based on tree topology changes, we inferred the recombination breakpoints that produced haplotypes specific to each neo-Y type. We show that recombination became suppressed at different time points for the different neo-Y haplotypes. Haplotype age correlates with onset of neo-Y decay, and older neo-Y haplotypes show more fixed gene disruption via frameshift indels and down-regulation of neo-Y alleles. Genes are downregulated independently on the different neo-Ys, but are depleted of testes-expressed genes across all haplotypes. This indicates that genes important for male function are initially shielded from degeneration. Our results offer a time course of the early progression of Y chromosome evolution, showing how the suppression of recombination, through the reversal to achiasmy in D. albomicans males, initiates the process of degeneration.
Large portions of eukaryotic genomes consist of transposable elements (TEs), and the establishment of transcription-repressing heterochromatin during early development safeguards genome integrity in Drosophila. Repeat-rich Y chromosomes can act as reservoirs for TEs (‘toxic’ Y effect), and incomplete epigenomic defenses during early development can lead to deleterious TE mobilization. Here, we contrast the dynamics of early TE activation in two Drosophila species with vastly different Y chromosomes of different ages. Zygotic TE expression is elevated in male embryos relative to females in both species, mostly due to expression of Y-linked TEs. Interestingly, male-biased TE expression diminishes across development in D. pseudoobscura, but remains elevated in D. miranda, the species with the younger and larger Y chromosome. The repeat-rich Y of D. miranda still contains many actively transcribed genes, which compromise the formation of silencing heterochromatin. Elevated TE expression results in more de novo insertions of repeats in males compared to females. This lends support to the idea that the ‘toxic’ Y chromosome can create a mutational burden in males when genome-wide defense mechanisms are compromised, and suggests a previously unappreciated epigenetic conflict on evolving Y chromosomes between transcription of essential genes and silencing of selfish DNA.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.