BackgroundMany neurodegenerative diseases are caused by nucleotide repeat expansions, but most expansions, like the C9orf72 ‘GGGGCC’ (G4C2) repeat that causes approximately 5–7% of all amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) cases, are too long to sequence using short-read sequencing technologies. It is unclear whether long-read sequencing technologies can traverse these long, challenging repeat expansions. Here, we demonstrate that two long-read sequencing technologies, Pacific Biosciences’ (PacBio) and Oxford Nanopore Technologies’ (ONT), can sequence through disease-causing repeats cloned into plasmids, including the FTD/ALS-causing G4C2 repeat expansion. We also report the first long-read sequencing data characterizing the C9orf72 G4C2 repeat expansion at the nucleotide level in two symptomatic expansion carriers using PacBio whole-genome sequencing and a no-amplification (No-Amp) targeted approach based on CRISPR/Cas9.ResultsBoth the PacBio and ONT platforms successfully sequenced through the repeat expansions in plasmids. Throughput on the MinION was a challenge for whole-genome sequencing; we were unable to attain reads covering the human C9orf72 repeat expansion using 15 flow cells. We obtained 8× coverage across the C9orf72 locus using the PacBio Sequel, accurately reporting the unexpanded allele at eight repeats, and reading through the entire expansion with 1324 repeats (7941 nucleotides). Using the No-Amp targeted approach, we attained > 800× coverage and were able to identify the unexpanded allele, closely estimate expansion size, and assess nucleotide content in a single experiment. We estimate the individual’s repeat region was > 99% G4C2 content, though we cannot rule out small interruptions.ConclusionsOur findings indicate that long-read sequencing is well suited to characterizing known repeat expansions, and for discovering new disease-causing, disease-modifying, or risk-modifying repeat expansions that have gone undetected with conventional short-read sequencing. The PacBio No-Amp targeted approach may have future potential in clinical and genetic counseling environments. Larger and deeper long-read sequencing studies in C9orf72 expansion carriers will be important to determine heterogeneity and whether the repeats are interrupted by non-G4C2 content, potentially mitigating or modifying disease course or age of onset, as interruptions are known to do in other repeat-expansion disorders. These results have broad implications across all diseases where the genetic etiology remains unclear.Electronic supplementary materialThe online version of this article (10.1186/s13024-018-0274-4) contains supplementary material, which is available to authorized users.
Background: A repeat expansion in the C9orf72-SMCR8 complex subunit (C9orf72) is the most common genetic cause of two debilitating neurodegenerative diseases: amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Currently, much remains unknown about which variables may modify these diseases. We sought to investigate associations between C9orf72 promoter methylation, RNA expression levels, and repeat length, their potential effects on disease features, as well as changes over time and within families.Methods: All samples were obtained through the ALS Center at Mayo Clinic Florida. Our primary cohort included 75 unrelated patients with an expanded C9orf72 repeat, 33 patients who did not possess this expansion, and 20 control subjects without neurodegenerative diseases. Additionally, 67 members from 17 independent C9orf72 families were selected of whom 33 harbored this expansion. Longitudinally collected samples were available for 35 C9orf72 expansion carriers. To increase our understanding of C9orf72-related diseases, we performed quantitative methylation-sensitive restriction enzyme-based assays, digital molecular barcoding, quantitative real-time PCR, and Southern blotting.
Introduction:The Advancing Research and Treatment for Frontotemporal Lobar Degeneration (ARTFL) and Longitudinal Evaluation of Familial Frontotemporal Dementia Subjects (LEFFTDS) consortia are two closely connected studies, involving multiple North American centers that evaluate both sporadic and familial frontotemporal dementia (FTD) participants and study longitudinal changes. Methods:We screened the major dementia-associated genes in 302 sporadic and 390 familial (symptomatic or at-risk) participants enrolled in these studies.Results: Among the sporadic patients, 16 (5.3%) carried chromosome 9 open reading frame 72 (C9orf72), microtubule-associated protein tau (MAPT), and progranulin (GRN) pathogenic variants, whereas in the familial series we identified 207 carriers from 146 families. Of interest, one patient was found to carry a homozygous C9orf72 expansion, while another carried both a C9orf72 expansion and a GRN pathogenic variant. We also identified likely pathogenic variants in the TAR DNA binding protein (TARDBP), presenilin 1 (PSEN1), and valosin containing protein (VCP) genes, and a subset of variants of unknown significance in other rare FTD genes.Discussion: Our study reports the genetic characterization of a large FTD series and supports an unbiased sequencing screen, irrespective of clinical presentation or family history.
The majority of the clinico-pathological variability observed in patients harboring a repeat expansion in the C9orf72-SMCR8 complex subunit (C9orf72) remains unexplained. This expansion, which represents the most common genetic cause of frontotemporal lobar degeneration (FTLD) and motor neuron disease (MND), results in a loss of C9orf72 expression and the generation of RNA foci and dipeptide repeat (DPR) proteins. The C9orf72 protein itself plays a role in vesicular transport, serving as a guanine nucleotide exchange factor that regulates GTPases. To further elucidate the mechanisms underlying C9orf72-related diseases and to identify potential disease modifiers, we performed an extensive RNA sequencing study. We included individuals for whom frontal cortex tissue was available: FTLD and FTLD/MND patients with (n = 34) or without (n = 44) an expanded C9orf72 repeat as well as control subjects (n = 24). In total, 6706 genes were differentially expressed between these groups (false discovery rate [FDR] < 0.05). The top gene was C9orf72 (FDR = 1.41E-14), which was roughly two-fold lower in C9orf72 expansion carriers than in (disease) controls. Co-expression analysis revealed groups of correlated genes (modules) that were enriched for processes such as protein folding, RNA splicing, synaptic signaling, metabolism, and Golgi vesicle transport. Within our cohort of C9orf72 expansion carriers, machine learning uncovered interesting candidates associated with clinico-pathological features, including age at onset (vascular endothelial growth factor A [VEGFA]), C9orf72 expansion size (cyclin dependent kinase like 1 [CDKL1]), DPR protein levels (eukaryotic elongation factor 2 kinase [EEF2K]), and survival after onset (small G protein signaling modulator 3 [SGSM3]). Given the fact that we detected a module involved in vesicular transport in addition to a GTPase activator (SGSM3) as a potential modifier, our findings seem to suggest that the presence of a C9orf72 repeat expansion might hamper vesicular transport and that genes affecting this process may modify the phenotype of C9orf72-linked diseases.
Since the discovery of the C9orf72 repeat expansion as the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis, it has increasingly been associated with a wider spectrum of phenotypes, including other types of dementia, movement disorders, psychiatric symptoms and slowly progressive FTD. Prompt recognition of patients with C9orf72-associated diseases is essential in light of upcoming clinical trials. The striking clinical heterogeneity associated with C9orf72 repeat expansions remains largely unexplained. In contrast to other repeat expansion disorders, evidence for an effect of repeat length on phenotype is inconclusive. Patients with C9orf72-associated diseases typically have very long repeat expansions, containing hundreds to thousands of GGGGCC-repeats, but smaller expansions might also have clinical significance. The exact threshold at which repeat expansions lead to neurodegeneration is unknown, and discordant cut-offs between laboratories pose a challenge for genetic counselling. Accurate and large-scale measurement of repeat expansions has been severely hindered by technical difficulties in sizing long expansions and by variable repeat lengths across and within tissues. Novel long-read sequencing approaches have produced promising results and open up avenues to further investigate this enthralling repeat expansion, elucidating whether its length, purity, and methylation pattern might modulate clinical features of C9orf72-related diseases.
To examine the length of a hexanucleotide expansion in C9orf72, which represents the most frequent genetic cause of frontotemporal lobar degeneration and motor neuron disease, we employed a targeted amplification-free long-read sequencing technology: No-Amp sequencing. In our cross-sectional study, we assessed cerebellar tissue from 28 well-characterized C9orf72 expansion carriers. We obtained 3507 on-target circular consensus sequencing reads, of which 814 bridged the C9orf72 repeat expansion (23%). Importantly, we observed a significant correlation between expansion sizes obtained using No-Amp sequencing and Southern blotting (P = 5.0 × 10−4). Interestingly, we also detected a significant survival advantage for individuals with smaller expansions (P = 0.004). Additionally, we uncovered that smaller expansions were significantly associated with higher levels of C9orf72 transcripts containing intron 1b (P = 0.003), poly(GP) proteins (P = 1.3 × 10− 5), and poly(GA) proteins (P = 0.005). Thorough examination of the composition of the expansion revealed that its GC content was extremely high (median: 100%) and that it was mainly composed of GGGGCC repeats (median: 96%), suggesting that expanded C9orf72 repeats are quite pure. Taken together, our findings demonstrate that No-Amp sequencing is a powerful tool that enables the discovery of relevant clinicopathological associations, highlighting the important role played by the cerebellar size of the expanded repeat in C9orf72-linked diseases.
Under homeostatic conditions, esophageal epithelium displays a proliferation/differentiation gradient that is generated as proliferative basal cells give rise to suprabasal cells then terminally differentiated superficial cells. This proliferation/differentiation gradient is perturbed in esophageal pathologies both benign and malignant. Esophageal cancer is among the deadliest forms of human malignancy with 5-year survival rates of <20%. Esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC) are the two most common subtypes of esophageal cancer. Gastroesophageal reflux disease (GERD) is a primary risk factor for EAC. Although GERD and the food allergy-mediated condition eosinophilic esophagitis (EoE) are both associated with chronic esophageal inflammation and epithelial remodeling, including basal cell hyperplasia, epidemiological evidence suggests that EoE patients do not develop esophageal malignancy. Here, we perform single cell RNA-sequencing in murine models of EoE and ESCC to delineate the effects that these two conditions have specifically upon the cellular landscape of esophageal epithelium. In mice with EoE or ESCC, we find expansion of cell populations as compared to normal esophageal epithelium. In mice with EoE, we detect expansion of 4 suprabasal populations coupled with depletion of 4 basal cell populations. By contrast, mice with ESCC display expansion of 4 basal populations as well as depletion of 3 superficial populations. We further evaluated modules of co-expressed genes in EoE- and ESCC-enriched epithelial cell clusters. Senescence, glucocorticoid receptor signaling, and granulocyte-macrophage colony-stimulating factor pathways were associated with EoE-enriched clusters while pathways associated with cell proliferation and metabolism were identified in ESCC-enriched clusters. Finally, by pairing murine models of EoE and ESCC, we demonstrate that exposure to EoE inflammation limits esophageal carcinogenesis. Our findings provide the first functional investigation of the relationship between EoE and esophageal cancer and suggest that esophageal epithelial remodeling events occurring in response to EoE inflammation may limit act to esophageal carcinogenesis which may have future implications for leveraging allergic inflammation-associated alterations in epithelial biology to prevent and/or treat esophageal cancer.
Background: Many neurodegenerative diseases are caused by nucleotide repeat expansions, but most expansions, like the C9orf72 ‘GGGGCC’ (G4C2) repeat that causes approximately 5-7% of all amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) cases, are too long to sequence using short-read sequencing technologies. It is unclear whether long-read sequencing technologies can traverse these long, challenging repeat expansions. Here, we demonstrate that two long-read sequencing technologies, Pacific Biosciences’ (PacBio) and Oxford Nanopore Technologies’ (ONT), can sequence through disease-causing repeats cloned into plasmids, including the FTD/ALS-causing G4C2 repeat expansion. We also report the first long-read sequencing data characterizing the C9orf72 G4C2 repeat expansion at the nucleotide level in two symptomatic expansion carriers using PacBio whole-genome sequencing and a no-amplification (No-Amp) targeted approach based on CRISPR/Cas9.Results: Both the PacBio and ONT platforms successfully sequenced through the repeat expansions in plasmids. Throughput on the MinlON was a challenge for whole-genome sequencing; we were unable to attain reads covering the human C9orf72 repeat expansion using 15 flow cells. We obtained 8x coverage across the C9orf72 locus using the PacBio Sequel, accurately reporting the unexpanded allele at eight repeats, and reading through the entire expansion with 1324 repeats (7941 nucleotides). Using the No-Amp targeted approach, we attained >800x coverage and were able to identify the unexpanded allele, closely estimate expansion size, and assess nucleotide content in a single experiment. We estimate the individual’s repeat region was >99% G4C2 content, though we cannot rule out small interruptions.Conclusions: Our findings indicate that long-read sequencing is well suited to characterizing known repeat expansions, and for discovering new disease-causing, disease-modifying, or risk-modifying repeat expansions that have gone undetected with conventional short-read sequencing. The PacBio No-Amp targeted approach may have future potential in clinical and genetic counseling environments. Larger and deeper long-read sequencing studies in C9orf72 expansion carriers will be important to determine heterogeneity and whether the repeats are interrupted by non-G4C2 content, potentially mitigating or modifying disease course or age of onset, as interruptions are known to do in other repeat-expansion disorders. These results have broad implications across all diseases where the genetic etiology remains unclear.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.