Summary Structural variants (SVs) are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight SV classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype-blocks in 26 human populations. Analyzing this set, we identify numerous gene-intersecting SVs exhibiting population stratification and describe naturally occurring homozygous gene knockouts suggesting the dispensability of a variety of human genes. We demonstrate that SVs are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of SV complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex SVs with multiple breakpoints likely formed through individual mutational events. Our catalog will enhance future studies into SV demography, functional impact and disease association.
We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.
Tyrosine kinase domain mutations are a common cause of acquired clinical resistance to tyrosine kinase inhibitors (TKIs) used to treat cancer, including the FLT3 inhibitor quizartinib. Mutation of kinase “gatekeeper” residues, which control access to an allosteric pocket adjacent to the ATP-binding site, have been frequently implicated in TKI resistance. The molecular underpinnings of gatekeeper mutation-mediated resistance are incompletely understood. We report the first co-crystal structure of FLT3 with the TKI quizartinib, which demonstrates that quizartinib binding relies on essential edge-to-face aromatic interactions with the gatekeeper F691 residue, and F830 within the highly conserved DFG motif in the activation loop. This reliance makes quizartinib critically vulnerable to gatekeeper and activation loop substitutions while minimizing the impact of mutations elsewhere. Moreover, we identify PLX3397, a novel FLT3 inhibitor that retains activity against the F691L mutant due to a binding mode that depends less vitally on specific interactions with the gatekeeper position.
Higher-order chromatin structure arises from the combinatorial physical interactions of many genomic loci. To investigate this aspect of genome architecture we developed Pore-C, which couples chromatin conformation capture with Oxford Nanopore Technologies (ONT) long reads to directly sequence multi-way chromatin contacts without amplification. In GM12878, we demonstrate that the pairwise interaction patterns implicit in Pore-C multi-way contacts are consistent with gold standard Hi-C pairwise contact maps at the compartment, TAD, and loop scales. In addition, Pore-C also detects higher-order chromatin structure at 18.5-fold higher efficiency and greater fidelity than SPRITE, a previously published higher-order chromatin profiling technology. We demonstrate Pore-C's ability to detect and visualize multi-locus hubs associated with histone locus bodies and active / inactive nuclear compartments in GM12878. In the breast cancer cell line HCC1954, Pore-C contacts enable the reconstruction of complex and aneuploid rearranged alleles spanning multiple megabases and chromosomes. Finally, we apply Pore-C to generate a chromosome scale de novo assembly of the HG002 genome. Our results establish Pore-C as the most simple and scalable assay for the genome-wide assessment of combinatorial chromatin interactions, with additional applications for cancer rearrangement reconstruction and de novo genome assembly. Chromatin structure | Structural variation | Long read sequencing | De novo genome assembly | cancer genomicsCorrespondence: mski@mskilab.org
Background Intrachromosomal triplications (TRP) can contribute to disease etiology via gene dosage effects, gene disruption, position effects, or fusion gene formation. Recently, post-zygotic de novo triplications adjacent to copy-number neutral genomic intervals with runs of homozygosity (ROH) have been shown to result in uniparental isodisomy (UPD). The genomic structure of these complex genomic rearrangements (CGRs) shows a consistent pattern of an inverted triplication flanked by duplications (DUP-TRP/INV-DUP) formed by an iterative DNA replisome template-switching mechanism during replicative repair of a single-ended, double-stranded DNA (seDNA), the ROH results from an interhomolog or nonsister chromatid template switch. It has been postulated that these CGRs may lead to genetic abnormalities in carriers due to dosage-sensitive genes mapping within the copy-number variant regions, homozygosity for alleles at a locus causing an autosomal recessive (AR) disease trait within the ROH region, or imprinting-associated diseases. Methods Here, we report a family wherein the affected subject carries a de novo 2.2-Mb TRP followed by 42.2 Mb of ROH and manifests clinical features overlapping with those observed in association with chromosome 14 maternal UPD (UPD(14)mat). UPD(14)mat can cause clinical phenotypic features enabling a diagnosis of Temple syndrome. This CGR was then molecularly characterized by high-density custom aCGH, genome-wide single-nucleotide polymorphism (SNP) and methylation arrays, exome sequencing (ES), and the Oxford Nanopore long-read sequencing technology. Results We confirmed the postulated DUP-TRP/INV-DUP structure by multiple orthogonal genomic technologies in the proband. The methylation status of known differentially methylated regions (DMRs) on chromosome 14 revealed that the subject shows the typical methylation pattern of UPD(14)mat. Consistent with these molecular findings, the clinical features overlap with those observed in Temple syndrome, including speech delay. Conclusions These data provide experimental evidence that, in humans, triplication can lead to segmental UPD and imprinting disease. Importantly, genotype/phenotype analyses further reveal how a post-zygotically generated complex structural variant, resulting from a replication-based mutational mechanism, contributes to expanding the clinical phenotype of known genetic syndromes. Mechanistically, such events can distort transmission genetics resulting in homozygosity at a locus for which only one parent is a carrier as well as cause imprinting diseases. Electronic supplementary material The online version of this article (10.1186/s13073-019-0633-y) contains supplementary material, which is available to authorized users.
Multifocal lung parenchymal cavities containing multiple aspergillomas are not well-recognized features of chronic pulmonary aspergillosis (CPA). We identified five patients with multiple cavities containing fungal balls from our accumulated cohort of ∼300 patients with CPA. Corticosteroid exposure and radiological and serological characteristics were assessed. The patients, aged 19-55 years, developed 3-11 cavities (or nodules in one case) with thin walls, usually within the lung parenchyma, with very limited pleural involvement. Four had asthma (severe in three) and one had cystic fibrosis; three had allergic bronchopulmonary aspergillosis. All patients had received corticosteroids orally or by inhalation. Four patients had elevated Aspergillus IgG antibodies; one had elevated Aspergillus-specific IgE. Three patients developed azole resistance on antifungal therapy, after benefit, one of whom underwent a successful bilateral lung transplant, later complicated by a fatal mycotic cerebral aneurysm. Multiple aspergillomas is a new distinct manifestation of CPA. The lack of inflammatory response and the distribution of the cavities in the lungs are remarkable. Aspergillus nodules could evolve into cavities with aspergillomas. The management and development of azole resistance in these patients is problematic.
Mutations at positions 70 and/or 91 in the core protein of genotype-1b, hepatitis C virus (HCV) are associated with hepatocellular carcinoma (HCC) risk in Asian patients. To evaluate this in a US population, the relationship between the percentage of 70 and/or 91 mutant HCV quasispecies in baseline serum samples of chronic HCV patients from the HALT-C trial and the incidence of HCC was determined by deep sequencing. Quasispecies percentage cut-points, ≥42% of non-arginine at 70 (non-R70) or ≥98.5% of non-leucine at 91 (non-L91) had optimal sensitivity at discerning higher or lower HCC risk. In baseline samples, 88.5% of chronic HCV patients who later developed HCC and 68.8% of matched HCC-free control patients had ≥42% non-R70 quasispecies (P = 0.06). Furthermore, 30.8% of patients who developed HCC and 54.7% of matched HCC-free patients had quasispecies with ≥98.5% non-L91 (P = 0.06). By Kaplan-Meier analysis, HCC incidence was higher, but not statistically significant, among patients with quasispecies ≥42% non-R70 (P = 0.08), while HCC incidence was significantly reduced among patients with quasispecies ≥98.5% non-L91 (P = 0.01). In a Cox regression model, non-R70 ≥42% was associated with increased HCC risk. This study of US patients indicates the potential utility of HCV quasispecies analysis as a non-invasive biomarker of HCC risk.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.