The feasibility to sequence entire genomes of virtually any organism provides unprecedented insights into the evolutionary history of populations and species. Nevertheless, many population genomic inferences -including the quantification and dating of admixture, introgression and demographic events, and the inference of selective sweeps -are still limited by the lack of high-quality haplotype information. In this respect, the newest generation of sequencing technology now promises significant progress. To establish the feasibility of haplotype-resolved genome resequencing at population scale, we investigated properties of linked-read sequencing data of songbirds of the genus Oenanthe across a range of sequencing depths. Our results based on the comparison of downsampled (25x, 20x, 15x, 10x, 7x, and 5x) with high-coverage data (46-68x) of seven bird genomes suggest that phasing contiguities and accuracies adequate for most population genomic analyses can be reached already with moderate sequencing effort. At 15x coverage, phased haplotypes span about 90% of the genome assembly, with 50 and 90 percent of the phased sequence located in phase blocks longer than 1.25-4.6 Mb (N50) and 0.27-0.72 Mb (N90), respectively. Phasing accuracy reaches beyond 99% starting from 15x coverage. Higher coverages yielded higher contiguities (up to about 7 Mb/1Mb (N50/N90) at 25x coverage), but only marginally improved phasing accuracy. Finally, phasing contiguity improved with input DNA molecule length; thus, higher-quality DNA may help keeping sequencing costs at bay. In conclusion, even for organisms with gigabase-sized genomes like birds, linked-read sequencing at moderate depth opens an affordable avenue towards haplotype-resolved genome resequencing data at population scale.
The feasibility to sequence entire genomes of virtually any organism provides unprecedented insights into the evolutionary history of populations and species. Nevertheless, many population genomic inferences – including the quantification and dating of admixture, introgression and demographic events, and inference of selective sweeps – are still limited by the lack of high‐quality haplotype information. The newest generation of sequencing technology now promises significant progress. To establish the feasibility of haplotype‐resolved genome resequencing at population scale, we investigated properties of linked‐read sequencing data of songbirds of the genus Oenanthe across a range of sequencing depths. Our results based on the comparison of downsampled (25×, 20×, 15×, 10×, 7×, and 5×) with high‐coverage data (46–68×) of seven bird genomes mapped to a reference suggest that phasing contiguities and accuracies adequate for most population genomic analyses can be reached already with moderate sequencing effort. At 15× coverage, phased haplotypes span about 90% of the genome assembly, with 50% and 90% of phased sequences located in phase blocks longer than 1.25–4.6 Mb (N50) and 0.27–0.72 Mb (N90). Phasing accuracy reaches beyond 99% starting from 15× coverage. Higher coverages yielded higher contiguities (up to about 7 Mb/1 Mb [N50/N90] at 25× coverage), but only marginally improved phasing accuracy. Phase block contiguity improved with input DNA molecule length; thus, higher‐quality DNA may help keeping sequencing costs at bay. In conclusion, even for organisms with gigabase‐sized genomes like birds, linked‐read sequencing at moderate depth opens an affordable avenue towards haplotype‐resolved genome resequencing at population scale.
Insights into the processes underpinning convergent evolution advance our understanding of the contributions of ancestral, introgressed, and novel genetic variation to phenotypic evolution. Phylogenomic analyses characterizing genome-wide gene tree heterogeneity can provide first clues about the extent of ILS and of introgression and thereby into the potential of these processes or (in their absence) the need to invoke novel mutations to underpin convergent evolution. Here, we were interested in understanding the processes involved in convergent evolution in open-habitat chats (wheatears of the genus Oenanthe and their relatives). To this end, based on whole-genome resequencing data from 50 taxa of 44 species, we established the species tree, characterized gene tree heterogeneity, and investigated the footprints of ILS and introgression within the latter. The species tree corroborates the pattern of abundant convergent evolution, especially in wheatears. The high levels of gene tree heterogeneity in wheatears are explained by ILS alone only for 30% of internal branches. For multiple branches with high gene tree heterogeneity, D-statistics and phylogenetic networks identified footprints of introgression. Finally, long branches without extensive ILS between clades sporting similar phenotypes provide suggestive evidence for a role of novel mutations in the evolution of these phenotypes. Together, our results suggest that convergent evolution in open-habitat chats involved diverse processes and highlight that phenotypic diversification is often complex and best depicted as a network of interacting lineages.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.