Large genomic insertions and deletions are a potent source of functional variation, but are challenging to resolve with short-read sequencing, limiting knowledge of the role of such structural variants (SVs) in human evolution. Here, we used a graph-based method to genotype long-read-discovered SVs in short-read data from diverse human genomes. We then applied an admixture-aware method to identify 220 SVs exhibiting extreme patterns of frequency differentiation—a signature of local adaptation. The top two variants traced to the immunoglobulin heavy chain locus, tagging a haplotype that swept to near fixation in certain Southeast Asian populations, but is rare in other global populations. Further investigation revealed evidence that the haplotype traces to gene flow from Neanderthals, corroborating the role of immune-related genes as prominent targets of adaptive introgression. Our study demonstrates how recent technical advances can help resolve signatures of key evolutionary events that remained obscured within technically challenging regions of the genome.
In mammalian cells, maternal and paternal alleles usually have similar transcriptional activity. Epigenetic mechanisms such as X-chromosome inactivation (XCI) and imprinting were historically viewed as rare exceptions to this rule. Discovery of autosomal monoallelic expression (MAE) a decade ago revealed an additional allele-specific mode regulating thousands of mammalian genes. Despite MAE prevalence, its mechanistic basis remains unknown. Using an RNA sequencing-based screen for reactivation of silenced alleles, we identified DNA methylation as key mechanism of MAE mitotic maintenance. In contrast with the all-or-nothing allelic choice in XCI, allele-specific expression in MAE loci is tunable, with exact allelic imbalance dependent on the extent of DNA methylation. In a subset of MAE genes, allelic imbalance was insensitive to DNA demethylation, implicating additional mechanisms in MAE maintenance in these loci. Our findings identify a key mechanism of MAE maintenance and provide basis for understanding the biological role of MAE.
Large genomic insertions, deletions, and inversions are a potent source of functional and fitness-altering variation, but are challenging to resolve with short-read DNA sequencing alone. While recent long-read sequencing technologies have greatly expanded the catalog of structural variants (SVs), their costs have so far precluded their application at population scales. Given these limitations, the role of SVs in human adaptation remains poorly characterized. Here, we used a graph-based approach to genotype 107,866 long-read-discovered SVs in short-read sequencing data from diverse human populations. We then applied an admixture-aware method to scan these SVs for patterns of population-specific frequency differentiation—a signature of local adaptation. We identified 220 SVs exhibiting extreme frequency differentiation, including several SVs that were among the lead variants at their corresponding loci. The top two signatures traced to separate insertion and deletion polymorphisms at the immunoglobulin heavy chain locus, together tagging a 325 Kbp haplotype that swept to high frequency and was subsequently fragmented by recombination. Alleles defining this haplotype are nearly fixed (60-95%) in certain Southeast Asian populations, but are rare or absent from other global populations composing the 1000 Genomes Project. Further investigation revealed that the haplotype closely matches with sequences observed in two of three high-coverage Neanderthal genomes, providing strong evidence of a Neanderthal-introgressed origin. This extraordinary episode of positive selection, which we infer to have occurred between 1700 and 8400 years ago, corroborates the role of immune-related genes as prominent targets of adaptive archaic introgression. Our study demonstrates how combining recent advances in genome sequencing, genotyping algorithms, and population genetic methods can reveal signatures of key evolutionary events that remained hidden within poorly resolved regions of the genome.
Thousands of mammalian genes show epigenetically controlled unequal transcription of the parental alleles. Genes subject to autosomal monoallelic expression (MAE) display mitotically stable allelic choice, leading to persistent transcriptional differences between clonal cell lineages. Mechanism of MAE mitotic maintenance is unknown. Using a new screening-by-sequencing strategy, we uncovered a key role for DNA methylation in MAE maintenance. Subset of MAE loci were insensitive to DNA demethylation, suggesting mechanistic heterogeneity of MAE. Genome-wide analyses indicate that MAE is part of a more general mode of gene regulation and reveal a previously unappreciated interplay of genetic and epigenetic control of allele-specific transcription. While cis-acting regulation defines a common underlying state for all cells, DNA methylation plays the role of an allele-specific rheostat and determines multiple regulatory states distinguishing between developmentally equivalent clonal cell lineages. Our findings imply that allele-specific analyses of clonal cell populations can unmask longterm transcriptional responses to drug-driven perturbations.
Recently published single-cell sequencing data from individual human sperm (n = 41,189; 969-3,377 cells from each of 25 donors) offer an opportunity to investigate questions of inheritance with improved statistical power, but require new methods tailored to these extremely low-coverage data (∼0.01 x per cell). To this end, we developed a method, named rhapsodi, that leverages sparse gamete genotype data to phase the diploid genomes of the donor individuals, impute missing gamete genotypes, and discover meiotic recombination breakpoints, benchmarking its performance across a wide range of study designs. Mendel's Law of Segregation states that the offspring of a diploid, heterozygous parent will inherit either allele with equal probability. While the vast majority of loci adhere to this rule, research in model and non-model organisms has uncovered numerous exceptions whereby 'selfish' alleles are disproportionately transmitted to the next generation. Evidence of such 'transmission distortion' (TD) in humans remains equivocal in part because scans of human pedigrees have been under-powered to detect small effects. After applying rhapsodi to the sperm sequencing data, we therefore scanned the gametes for evidence of TD. Our results exhibited close concordance with binomial expectations under balanced transmission. Together, our work demonstrates that rhapsodi can facilitate novel uses of inferred genotype data and meiotic recombination events, while offering a powerful quantitative framework for testing for TD in other cohorts and study systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.