Speciation is a continuous process during which genetic changes gradually accumulate in the genomes of diverging species. Recent studies have documented highly heterogeneous differentiation landscapes, with distinct regions of elevated differentiation ("differentiation islands") widespread across genomes. However, it remains unclear which processes drive the evolution of differentiation islands; how the differentiation landscape evolves as speciation advances; and ultimately, how differentiation islands are related to speciation. Here, we addressed these questions based on population genetic analyses of 200 resequenced genomes from 10 populations of four Ficedula flycatcher sister species. We show that a heterogeneous differentiation landscape starts emerging among populations within species, and differentiation islands evolve recurrently in the very same genomic regions among independent lineages. Contrary to expectations from models that interpret differentiation islands as genomic regions involved in reproductive isolation that are shielded from gene flow, patterns of sequence divergence (d xy and relative node depth) do not support a major role of gene flow in the evolution of the differentiation landscape in these species. Instead, as predicted by models of linked selection, genome-wide variation in diversity and differentiation can be explained by variation in recombination rate and the density of targets for selection. We thus conclude that the heterogeneous landscape of differentiation in Ficedula flycatchers evolves mainly as the result of background selection and selective sweeps in genomic regions of low recombination. Our results emphasize the necessity of incorporating linked selection as a null model to identify genome regions involved in adaptation and speciation.[Supplemental material is available for this article.]Uncovering the genetic architecture of reproductive isolation and its evolutionary history are central tasks in evolutionary biology. The identification of genome regions that are highly differentiated between closely related species, and thereby constitute candidate regions involved in reproductive isolation, has recently been a major focus of speciation genetic research. Studies from a broad taxonomic range, involving organisms as diverse as plants (Renaut et al.
Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50 K SNP array to develop a high-density linkage map with 37 262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best-order map contained 4215 markers, with a total distance of 3132 cm and a mean genetic distance between markers of 0.12 cm. Facilitated by the array being designed to include markers from most scaffolds, we obtained a second-generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super-scaffold size 20.2 Mb and with 1.042 Gb (of 1.116 Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5–2.0 event (6.6–7.5 Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1 cm/Mb and correlated closely with chromosome size, from 2 cm/Mb for chromosomes >100 Mb to >10 cm/Mb for chromosomes <10 Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50 cm was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes.
The ratio of divergence at nonsynonymous and synonymous sites, dN/dS, is a widely used measure in evolutionary genetic studies to investigate the extent to which selection modulates gene sequence evolution. Originally tailored to codon sequences of distantly related lineages, dN/dS represents the ratio of fixed nonsynonymous to synonymous differences. The impact of ancestral and lineage-specific polymorphisms on dN/dS, which we here show to be substantial for closely related lineages, is generally neglected in estimation techniques of dN/dS. To address this issue, we formulate a codon model that is firmly anchored in population genetic theory, derive analytical expressions for the dN/dS measure by Poisson random field approximation in a Markovian framework and validate the derivations by simulations. In good agreement, simulations and analytical derivations demonstrate that dN/dS is biased by polymorphisms at short time scales and that it can take substantial time for the expected value to settle at its time limit where only fixed differences are considered. We further show that in any attempt to estimate the dN/dS ratio from empirical data the effect of the intrinsic fluctuations of a ratio of stochastic variables, can even under neutrality yield extreme values of dN/dS at short time scales or in regions of low mutation rate. Taken together, our results have significant implications for the interpretation of dN/dS estimates, the McDonald–Kreitman test and other related statistics, in particular for closely related lineages.
BackgroundObtaining a draft genome sequence of the zebra finch (Taeniopygia guttata), the second bird genome to be sequenced, provides the necessary resource for whole-genome comparative analysis of gene sequence evolution in a non-mammalian vertebrate lineage. To analyze basic molecular evolutionary processes during avian evolution, and to contrast these with the situation in mammals, we aligned the protein-coding sequences of 8,384 1:1 orthologs of chicken, zebra finch, a lizard and three mammalian species.ResultsWe found clear differences in the substitution rate at fourfold degenerate sites, being lowest in the ancestral bird lineage, intermediate in the chicken lineage and highest in the zebra finch lineage, possibly reflecting differences in generation time. We identified positively selected and/or rapidly evolving genes in avian lineages and found an over-representation of several functional classes, including anion transporter activity, calcium ion binding, cell adhesion and microtubule cytoskeleton.ConclusionsFocusing specifically on genes of neurological interest and genes differentially expressed in the unique vocal control nuclei of the songbird brain, we find a number of positively selected genes, including synaptic receptors. We found no evidence that selection for beneficial alleles is more efficient in regions of high recombination; in fact, there was a weak yet significant negative correlation between ω and recombination rate, which is in the direction predicted by the Hill-Robertson effect if slightly deleterious mutations contribute to protein evolution. These findings set the stage for studies of functional genetics of avian genes.
Recombination rate is heterogeneous across the genome of various species and so are genetic diversity and differentiation as a consequence of linked selection. However, we still lack a clear picture of the underlying mechanisms for regulating recombination.Here we estimated fine-scale population recombination rate based on the patterns of linkage disequilibrium across the genomes of multiple populations of two closely related flycatcher species (Ficedula albicollis and F. hypoleuca). This revealed an overall conservation of the recombination landscape between these species at the scale of 200 kb, but we also identified differences in the local rate of recombination despite their recent divergence (<1 million years). Genetic diversity and differentiation were associated with recombination rate in a lineage-specific manner, indicating differences in the extent of linked selection between species. We detected 400-3,085 recombination hotspots per population. Location of hotspots was conserved between species, but the intensity of hotspot activity varied between species. Recombination hotspots were primarily associated with CpG islands (CGIs), regardless of whether CGIs were at promoter regions or away from genes. Recombination hotspots were also associated with specific transposable elements (TEs), but this association appears indirect due to shared preferences of the transposition machinery and the recombination machinery for accessible open chromatin regions. Our results suggest that CGIs are a major determinant of the localization of recombination hotspots, and we propose that both the distribution of TEs and fine-scale variation in recombination rate may be associated with the evolution of the epigenetic landscape.
Theoretical work suggests that sexual conflict should promote the maintenance of genetic diversity by the opposing directions of selection on males and females. If such conflict is pervasive, it could potentially lead to genomic heterogeneity in levels of genetic diversity an idea that so far has not been empirically tested on a genomewide scale. We used large-scale population genomic and transcriptomic data from the collared flycatcher (Ficedula albicollis) to analyse how sexual conflict, for which we use sex-biased gene expression as a proxy, relates to genetic variability. Here, we demonstrate that the extent of sex-biased gene expression of both male-biased and female-biased genes is significantly correlated with levels of nucleotide diversity in gene sequences and that this correlation extends to diversity levels also in intergenic DNA and introns. We find signatures of balancing selection in sex-biased genes but also note that relaxed purifying selection could potentially explain part of the observed patterns. The finding of significant genetic differentiation between males and females for male-biased (and gonad-specific) genes indicates ongoing sexual conflict and sex-specific viability selection, potentially driven by sexual selection. Our results thus indicate that sexual antagonism could potentially be considered as one viable explanation to the long-standing question in evolutionary biology of how genomes can remain so genetically variable in face of strong natural and sexual selection.
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
Recombination is an engine of genetic diversity and therefore constitutes a key process in evolutionary biology and genetics. While the outcome of crossover recombination can readily be detected as shuffled alleles by following the inheritance of markers in pedigreed families, the more precise location of both crossover and non-crossover recombination events has been difficult to pinpoint. As a consequence, we lack a detailed portrait of the recombination landscape for most organisms and knowledge on how this landscape impacts on sequence evolution at a local scale. To localize recombination events with high resolution in an avian system, we performed whole-genome re-sequencing at high coverage of a complete three-generation collared flycatcher pedigree. We identified 325 crossovers at a median resolution of 1.4 kb, with 86% of the events localized to <10 kb intervals. Observed crossover rates were in excellent agreement with data from linkage mapping, were 52% higher in male (3.56 cM/Mb) than in female meiosis (2.28 cM/Mb), and increased towards chromosome ends in male but not female meiosis. Crossover events were non-randomly distributed in the genome with several distinct hot-spots and a concentration to genic regions, with the highest density in promoters and CpG islands. We further identified 267 non-crossovers, whose location was significantly associated with crossover locations. We detected a significant transmission bias (0.18) in favour of ‘strong’ (G, C) over ‘weak’ (A, T) alleles at non-crossover events, providing direct evidence for the process of GC-biased gene conversion in an avian system. The approach taken in this study should be applicable to any species and would thereby help to provide a more comprehensive portray of the recombination landscape across organism groups.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.